ãã®ã³ã³ãã³ãã¯ãããã§ããã?
Serverless Retrieval Augmented Generation (RAG) on AWS
çæ AI ãé²åãç¶ããäžã§ãå€éšã®ææ°æ å ±ã倧èŠæš¡èšèªã¢ãã« (LLM) ã«çµ±åããããšã¯ã倧ããªé²æ©ããããããŸãããã®èšäºã§ã¯ãçã®ãµãŒããŒã¬ã¹æ€çŽ¢æ¡åŒµçæ (RAG) ãœãªã¥ãŒã·ã§ã³ãæ§ç¯ããããæ£ç¢ºã§ã³ã³ããã¹ãçã«é¢é£ããå¿çãçæããã¢ããªã±ãŒã·ã§ã³ã®äœæã容æã«ããŸããç§ãã¡ã®ç®æšã¯ãã客æ§ãã³ã¹ãã«ç®ãå ãããŠã䜿çšããŠããªãã³ã³ãã¥ãŒãã£ã³ã°æéãæ¯æããªããŠãããããã«ãã€ã€ãGenAI ã掻çšããã¢ããªã±ãŒã·ã§ã³ãå¯èœãªéãè¿ éã«äœæã§ããããã«ãµããŒãããããšã§ãã
ãµãŒããŒã¬ã¹ RAG: æŠèŠ
ãµãŒããŒã¬ã¹ RAG ã¯ãåºç€ã¢ãã«ã®é«åºŠãªèšèªåŠçæ©èœãšããµãŒããŒã¬ã¹ã¢ãŒããã¯ãã£ã®ä¿ææ§ããã³ã³ã¹ãå¹çãå ŒãåããŠããŸãããã®çµ±åã«ãããããŒã¿ããŒã¹ãã€ã³ã¿ãŒããããã«ã¹ã¿ã ãã¬ããžããŒã¹ãªã©ã®å€éšãœãŒã¹ããã®æ å ±ã®åçãªååŸãå¯èœãšãªããæ£ç¢ºã§ã³ã³ããã¹ãçã«ãªããã§ããã ãã§ãªããææ°ã®æ å ±ãå«ãã³ã³ãã³ããçæã§ããŸãã
Amazon Bedrock ã¯ããµãŒããŒã¬ã¹ RAG ã¢ããªã±ãŒã·ã§ã³ã®ãããã€ãç°¡çŽ åããåºç¯ãªã€ã³ãã©ã¹ãã©ã¯ãã£ç®¡çãå¿ èŠãšããããšãªããGenAI ãããžã§ã¯ããäœæã管çãã¹ã±ãŒã«ããããã®ããŒã«ãããããããŒã«æäŸããŸããããã«å ããŠãããããããŒã¯ Lambda ã S3 ãªã©ã® AWS ãµãŒãã¹ãšãLanceDB ãªã©ã®é©æ°çãªãªãŒãã³ãœãŒã¹ã®ãã¯ãã«ããŒã¿ããŒã¹ã掻çšããŠãå¿çæ§ãšã³ã¹ãå¹çã®é«ã AI é§ååã®ãœãªã¥ãŒã·ã§ã³ãæ§ç¯ã§ããŸãã
ããã¥ã¡ã³ãã®åã蟌ã¿
ãµãŒããŒã¬ã¹ RAG ãœãªã¥ãŒã·ã§ã³ãæ¡çšããããã®åãçµã¿ã«ã¯ãããã€ãã®éèŠãªã¹ããããå«ãŸããŠãããåã¹ãããã¯åºç€ã¢ãã«ãšå€éšã®ç¥èã®ã·ãŒã ã¬ã¹ãªçµ±åãå®çŸããããã«èª¿æŽãããŠããŸãã

ãã®ããã»ã¹ã¯ããµãŒããŒã¬ã¹ã¢ãŒããã¯ãã£ãžã®ããã¥ã¡ã³ãã®åã蟌ã¿ããå§ãŸããã€ãã³ãé§ååã®ã¡ã«ããºã ãããã¹ãã³ã³ãã³ãã®æœåºãšåŠçãããªã¬ãŒããŠãåã蟌ã¿ãçæããŸããAmazon Titan ãªã©ã®ã¢ãã«ã䜿çšããŠäœæããããããã®åã蟌ã¿ã¯ãæ©æ¢°ã容æã«ç解ããŠåŠçã§ããæ°å€ãã¯ãã«ã«ã³ã³ãã³ããå€æããŸãã
Amazon S3 ãå©çšãããµãŒããŒã¬ã¹ãã¯ãã«ããŒã¿ããŒã¹ã§ãã LanceDB ã«ãããã®ãã¯ãã«ãä¿åãããšãå¹ççãªæ€çŽ¢ãšç®¡çã容æã«ãªããLLM ã®å¿çã匷åããããã«é¢é£ããæ å ±ã®ã¿ã䜿çšãããããã«ãªããŸãããã®ã¢ãããŒãã§ã¯ãçæãããã³ã³ãã³ãã®ç²ŸåºŠãšé¢é£æ§ãé«ãŸãã ãã§ãªããåŸéå¶æéã¢ãã«ã掻çšããããšã§éçšã³ã¹ããå€§å¹ ã«åæžãããŸãã
ãã¡ãã®ã³ãŒããã芧ãã ããã
åã蟌ã¿ãšã¯äœã§ãã?
èªç¶èšèªåŠç (NLP) ã®é åã§ã¯ãåã蟌ã¿ã¯ãæ©æ¢°ãç解ããŠåŠçã§ããæ°å€åœ¢åŒã«ããã¹ãæ å ±ãå€æã§ããããã«ãã極ããŠéèŠãªæŠå¿µã§ããããã¯æå³é¢ä¿ã幟äœåŠçé¢ä¿ã«å€æããæ¹æ³ã§ãããã³ã³ãã¥ãŒã¿ã¯ããã人éã®èšèªãããã¯ããã«è¯ãç解ã§ããŸããåºæ¬çã«ã¯ãåã蟌ã¿ãéããŠãããã¥ã¡ã³ãã®ã³ã³ãã³ããé«æ¬¡å 空éã®ãã¯ãã«ã«å€æããŸããããã«ããããã®ç©ºéå ã®å¹ŸäœåŠçè·é¢ã¯æå³è«çãªæå³ãæã¡ãŸãããã®ç©ºéã§ã¯ãç°ãªãæŠå¿µãè¡šããã¯ãã«ã¯äºãã«é ãé¢ããé¡äŒŒããæŠå¿µã¯ã°ã«ãŒãåãããŸãã
ããã¯ã倧éã®ããã¹ãã³ãŒãã¹ã§ãã¬ãŒãã³ã°ããããã¥ãŒã©ã«ãããã¯ãŒã¯ãæ¡çšããããŸããŸãªã³ã³ããã¹ãã§åèªã®ã°ã«ãŒããäžç·ã«åºçŸããå¯èœæ§ãèšç®ãã Amazon Titan Embedding ãªã©ã®ã¢ãã«ãéããŠå®çŸãããŸãã
幞ããªããšã«ããã®ã·ã¹ãã ãæåããæ§ç¯ããå¿ èŠã¯ãããŸãããBedrock ã¯ãåã蟌ã¿ã¢ãã«ãä»ã®åºç€ã¢ãã«ãžã®ã¢ã¯ã»ã¹ãæäŸããŸãã
ãã¬ããžããŒã¹ãåã蟌ã¿ãŸããã次ã¯ã©ãããã°ããã§ãã?
ãã¬ããžããŒã¹ãã©ããã«ä¿ç®¡ããŠããå¿ èŠããããŸããæ£ç¢ºã«ã¯ãã¯ãã«ããŒã¿ããŒã¹ã«ä¿ç®¡ããŸãããããŠããã¯ãã«ããŒã¿ããŒã¹ã¯çã®ãµãŒããŒã¬ã¹ã®éæ³ãèµ·ããå Žæã§ãã
LanceDB ã¯ãæ°žç¶ã¹ãã¬ãŒãžã䜿çšãããã¯ãã«æ€çŽ¢çšã«èšèšããããªãŒãã³ãœãŒã¹ã®ãã¯ãã«ããŒã¿ããŒã¹ã§ãæ€çŽ¢ããã£ã«ã¿ãªã³ã°ãåã蟌ã¿ã®ç®¡çãç°¡çŽ åããŸããç§ãã¡ãç¹ã«æçã ãšæããã®ã¯ãLanceDB ã S3 ã«çŽæ¥æ¥ç¶ã§ããæ©èœã§ãããããã«ãããã¢ã€ãã«ç¶æ ã®ã³ã³ãã¥ãŒãã£ã³ã°ãäžèŠã«ãªããŸããLambda é¢æ°ã®å®è¡äžã«ã®ã¿ããŒã¿ããŒã¹ã䜿çšããŸããç§ãã¡ã®è² è·ãã¹ãã§ã¯ãLanceDBãBedrockãLambda ã«å€§ããªè² è·ããããããšãªããæ倧 500 MB ã®ãµã€ãºã®ããã¥ã¡ã³ããåã蟌ããããšãããããŸããã
ãã®ã·ã¹ãã ã®æ¢ç¥ã®å¶é㯠Lambda ã®ã³ãŒã«ãã¹ã¿ãŒãã§ããã倧éšåã®æéãå ããããã»ã¹ã¯å®éã«ã¯ Lambda ã®å€éšã§è¡ãããåã蟌ã¿ã®èšç®ã§ããããšã枬å®ã§æãããšãªããŸãããåœç€Ÿã®ãŠãŒã¶ãŒããŒã¹ãã³ãŒã«ãã¹ã¿ãŒãã®åœ±é¿ãåããã®ã¯äºäŸã® 10% ã«éããªãããšã枬å®ã«ãã£ãŠããããŸãããããã軜æžããããã«ãMVP ã®æ¬¡ã®ãã§ãŒãºã§ããããžã§ããäœæããããšãæ€èšã§ããã»ããBatch ã ECS Fargate ãªã©ã®ä»ã®ãµãŒããŒã¬ã¹ AWS ãµãŒãã¹ãå©çšããã¹ãããæéã®æ©æµãåããªãããããã«ã³ã¹ããåæžããããšãèããããŸãã
ã¯ãšãªã®å®è¡

ãŠãŒã¶ãŒã¯ãLambda URL ãä»ããŠå ¥åãæšè«é¢æ°ã«è»¢éã§ããŸããããã¯ãBedrock ãä»ã㊠Titan Embedding ã¢ãã«ã«å ¥åããããã®ã¢ãã«ã¯ãã¯ãã«ãèšç®ããŸãããã®åŸããã®ãã¯ãã«ã䜿çšããŠããã¯ãã«ããŒã¿ããŒã¹å ã®ããã€ãã®é¡äŒŒã®ããã¥ã¡ã³ããååŸããããããæçµããã³ããã«è¿œå ããŸãããŠãŒã¶ãŒãéžæãã LLM ã«æçµããã³ãããéä¿¡ããŸããLLM ãã¹ããªãŒãã³ã°ããµããŒãããŠããå Žåãå¿çã¯ãªã¢ã«ã¿ã€ã ã§ãŠãŒã¶ãŒã«ã¹ããªãŒãã³ã°ãããŸããããã§ããé·æéå®è¡ãããã¢ã€ãã«èšç®ã¯ãããŸããããŸãããŠãŒã¶ãŒå ¥åã®ãµã€ãºã¯éåžžãåã蟌ãããã¥ã¡ã³ããããå°ãããããåã蟌ã¿ã®èšç®ã«ãããæéã®ççž®ãæåŸ ã§ããŸãã
ãã®æšè«ã·ã¹ãã ã®æ¢ç¥ã®å¶éã¯ãæ°ãã Lambda é¢æ°å ã§ãã¯ãã«ããŒã¿ããŒã¹ãã³ãŒã«ãã¹ã¿ãŒãããããšã§ããLanceDB 㯠S3 ã«ä¿åãããããŒã¿ããŒã¹ãåç §ãããããæ°ãã Lambda å®è¡ç°å¢ãäœæãããéã«ããã¯ãã«æ€çŽ¢ãå®è¡ã§ããããã«ããŒã¿ããŒã¹ã§ããŒãããå¿ èŠããããŸããããã¯ã¹ã±ãŒã«ã¢ããäžããŸãã¯ãã°ãã誰ã質åãããªãã£ãå Žåã«ã®ã¿çºçããŸããããã¯ãå®å šãªãµãŒããŒã¬ã¹ã¢ãŒããã¯ãã£ã«ããã³ã¹ãåæžãšã®ãã¬ãŒããªãã¯ããªãå°ãããšããããšãæå³ããŸãã
ãã¡ãã®ã³ãŒããã芧ãã ããã
ãµãŒããŒã¬ã¹ RAG ã®çµæžæ§ã®ç解
ãµãŒããŒã¬ã¹ RAG ãæ¡çšããã«ã¯ãã³ã¹ããžã®åœ±é¿ãç解ããããšãéèŠã§ããAmazon Bedrock ã®æéã¢ãã«ã¯ãããŒã¯ã³ã®äœ¿çšéãšãµãŒããŒã¬ã¹ãªãœãŒã¹ã®æ¶è²»éã«åºã¥ããŠãããããããããŒã¯ã³ã¹ããæ£ç¢ºã«èŠç©ããããšãã§ããŸããããã¥ã¡ã³ããåã蟌ã¿ã®ããã«åŠçããå Žåã§ããå¿çãåŸãããã«ã¢ãã«ã«å¯ŸããŠã¯ãšãªãå®è¡ããå Žåã§ããåŸéå¶æéã«ãããã³ã¹ãã¯äœ¿çšéã«çŽæ¥é¢é£ä»ããããããããæ¯æãããã ãã®ã¯äœ¿çšããåã®æéã®ã¿ã§ãã

åã蟌ã¿ã®çµæžæ§

ããã¥ã¡ã³ãåŠçã«ãµãŒããŒã¬ã¹ã¢ãŒããã¯ãã£ã䜿çšããçµæžæ§ã«ã€ããŠãããå°ã詳ããèŠãŠã¿ãŸããããèšç®ã¯ããã€ãã®ä»®å®ã«åºã¥ããŠããŸããåŠçæéã¯ããŒã¿ 1 MB ããã 1 åãšå€§ãŸãã«èŠç©ããããŠããããã®ãµã€ãºã®ããã¥ã¡ã³ãã«ã¯éåžž 30,000 匱ã®ããŒã¯ã³ãå«ãŸããŸãããããã®æ°åã¯ããŒã¹ã©ã€ã³ãæäŸããŸãããçŸå®ã§ã¯æ¡ä»¶ã¯ãã奜ãŸããå Žåãå€ããå€ãã®ããã¥ã¡ã³ãã¯å€§å¹ ã«ããè¿ éã«åŠçãããŸãã
1 åã® 1 MB ã®ããã¥ã¡ã³ããåŠçããã®ã«ãããè²»çšã¯ãããããã§ãã»ãšãã©ã®å Žå㯠0.5 USC æªæºã§ããããããã®ãµã€ãºã 1 MB ã®ããã¥ã¡ã³ãã 1,000 åãŸã§ã¹ã±ãŒã«ã¢ããããŠããåèšã³ã¹ã㯠4 USD æªæºãšé©ãã»ã©äœãæããããŸãããã®äŸã¯ãããã¥ã¡ã³ãåŠçã«ããããµãŒããŒã¬ã¹ã¢ãŒããã¯ãã£ã®è²»çšå¯Ÿå¹æãå®èšŒããã ãã§ãªããAmazon Bedrock ãªã©ã®ãã©ãããã©ãŒã ã§äœ¿çšãããããŒã¯ã³ããŒã¹ã®æéã¢ãã«ã®å¹çæ§ãæããã«ããŠããŸãããŸãããã㯠1 åéãã®ããã»ã¹ã§ããããã¥ã¡ã³ããåŠçãããšãåé€ãããŸã§ãã¯ãã«ããŒã¿ããŒã¹ã«ä¿åãããŸãã
ã¯ãšãªå®è¡ã®çµæžæ§

èšå®ã®ã€ã³ã¿ã©ã¯ãã£ããªéšåã«è©±é¡ãåãæ¿ããŠãå®éã« AI ã«ããã€ãã®è³ªåããå§ãããšäœãèµ·ãããã«ã€ããŠè©±ããŸããããããã€ãã®ä»®å®ã次ã«ç€ºããŸããAWS Lambda ããŠãŒã¶ãŒã«åçãè¿ãããã³ãããåã蟌ãã®ã«çŽ 20 ç§ããããšèããŠããŸãããŸããå質åãšãã®åçã¯ããããçŽ 1,000 ããŒã¯ã³ã§ãããšæ³å®ããŠããŸããæšè«ã³ã¹ããšæ¯èŒãããšãS3 ã«å¯Ÿãããªã¯ãšã¹ãã«é¢é£ããæéã¯ç¡èŠã§ããŸãã
ä»®å®ã¯ãããããã«ããŠã次ã«ã³ã¹ãã«ã€ããŠè©³ããèŠãŠãããŸããããAnthropic ã Claude V2 ã¢ãã«ã«å¯Ÿã㊠1 件ã®ã¯ãšãªãå®è¡ãããšãçŽ 3 USC ã®ã³ã¹ããããããŸããClaude Instant ã®ãããªããå°ã軜éã®ãã®ãéžæãããšãã³ã¹ãã¯ã¯ãšãªãããããã 1 USC ã«ãŸã§åçã«æžããŸããClaude V2 ã䜿çšããŠã¯ãšãªã 1,000 件ãŸã§å¢ãããšãåèšã³ã¹ãã¯çŽ 33 USD ã«ãªããŸããããã¯ã質åã LLM ã«éä¿¡ããããŒã¿ããŒã¹ããé¡äŒŒã®ããã¥ã¡ã³ãããã«ããŠã¯ãšãªããšã³ãªããããã³ã³ããã¹ãããã¥ã¡ã³ãã«çµã³ä»ããã«ã¹ã¿ãã€ãºãããåçãåŸããŸã§ã®ããã»ã¹å šäœãã«ããŒããŸãã
ãã®èšå®å šäœã®æ¥µããŠéèŠãªç¹ã¯ããµãŒããŒã¬ã¹ã®æ§è³ªã®ãããã§ããªã¯ãšã¹ãããšã«åäœããããã«èšèšãããŠãããšããããšã§ããã€ãŸãããæ¯æãããã ãã®ã¯äœ¿çšããåã®æéã®ã¿ãšãªããŸãã
ãµãŒããŒã¬ã¹ RAG ã«ããå°å¹³ã®æ¡åŒµ
å°æ¥ã«ç®ãåãããšããµãŒããŒã¬ã¹ RAG ã®æœåšçãªçšéã¯çŸåšã®ãŠãŒã¹ã±ãŒã¹ãã¯ããã«è¶ ããŠåºããã§ããããé«ãé¢é£æ§ãå®çŸããããã®ã¢ãã«ã®åã©ã³ã¯ä»ãã匷åãããã»ãã³ãã£ãã¯æ€çŽ¢ã®ããã®ã¢ããã¿ãŒã®åã蟌ã¿ããã«ãã¢ãŒãã«æ å ±çµ±åã®æ€èšãªã©ã®è¿œå æŠç¥ãçµã¿èŸŒãããšã§ãããããããŒã¯ GenAI ã¢ããªã±ãŒã·ã§ã³ãããã«æŽç·Žããæ¡åŒµã§ããŸãã
Amazon Bedrock ã®ãµãŒããŒã¬ã¹ RAG ã®ãµããŒãã¯ãçæ AI ã®åéã«ãããã€ãããŒã·ã§ã³ãžã®æ°ããªéãéããŸããAWS ã¯ãåå ¥éå£ã軜æžããã¹ã±ãŒã©ãã«ã§ã³ã¹ãå¹çã®é«ããã©ãããã©ãŒã ãæäŸããããšã«ãããããããããŒã AI é§ååã¢ããªã±ãŒã·ã§ã³ã®å¯èœæ§ãæ倧éã«æ¢çŽ¢ã§ããããã«ããŠããŸãããµãŒããŒã¬ã¹ RAG ã®æ©èœã®æ¢çŽ¢ãšæ¡åŒµãç¶ããäžã§ãããã€ã³ããªãžã§ã³ãã§å¿çæ§ãé«ããé¢é£æ§ã®é«ã AI ãœãªã¥ãŒã·ã§ã³ãçã¿åºãããšãã§ããå¯èœæ§ã¯ç¡éã«ãããŸãããã®ãžã£ãŒããŒã«åå ããŠãAmazon Bedrock ã§ã®ãµãŒããŒã¬ã¹ RAG ãã©ã®ããã« AI ãããžã§ã¯ããçŸå®ã«å€æã§ããã®ããã芧ãã ããã
ãªãœãŒã¹

Giuseppe Battista
Giuseppe Battista ã¯ãAmazon Web Services ã® Senior Solutions Architect ã§ããè±åœãšã¢ã€ã«ã©ã³ãã®åæ段éã®ã¹ã¿ãŒãã¢ããã®ãœãªã¥ãŒã·ã§ã³ã¢ãŒããã¯ãã£ãææ®ããŠããŸããGiuseppe 㯠twitch.tv/aws 㧠Twitch Show ã®ãLet's Build a Startupããäž»å¬ããŠãããUnicorn's Den ã¢ã¯ã»ã©ã¬ãŒã¿ãŒã®è²¬ä»»è ã§ããããŸãã

Kevin Shaffer-Morrison
Kevin Shaffer-Morrison ã¯ãAmazon Web Services ã® Senior Solutions Architect ã§ããKevin ã¯ãäœçŸãã®ã¹ã¿ãŒãã¢ãããè¿ éã«è»éã«ä¹ã£ãŠã¯ã©ãŠãã«ç§»è¡ã§ãããããµããŒãããŠããŸãããKevin ã¯ãã³ãŒããµã³ãã«ãš Twitch ã©ã€ãã¹ããªãŒã ãå©çšããŠãåµæ¥è ã®æåæã®æ®µéããµããŒãããããšã«éç¹çã«åãçµãã§ããŸãã
ãã®ã³ã³ãã³ãã¯ãããã§ããã?