Amazon Web Services

In this AWS re:Invent 2023 talk, the speaker discusses Retrieval Augmented Generation (RAG) and its implementation using Redis Enterprise as a vector database with Amazon Bedrock. The presentation covers challenges in building RAG systems, data strategy, and the benefits of using Redis for vector search and semantic caching. The speaker explains how RAG can address issues like cost, quality, performance, and security in large language model applications. Key topics include vector databases, semantic caching, and the integration of Redis with Amazon Bedrock for efficient RAG implementations. The talk provides insights into optimizing LLM-based systems for improved performance and reduced costs.

product-information
skills-and-how-to
generative-ai
ai-ml
databases
Show 2 more

Up Next

VideoThumbnail
15:58

Revolutionizing Business Intelligence: Generative AI Features in Amazon QuickSight

Nov 22, 2024
VideoThumbnail
1:01:07

Accelerate ML Model Delivery: Implementing End-to-End MLOps Solutions with Amazon SageMaker

Nov 22, 2024
VideoThumbnail
40:23

Set Up and Use Apache Iceberg Tables on Your Data Lake - AWS Virtual Workshop

Nov 22, 2024
VideoThumbnail
2:53:33

Streamlining Patch Management: AWS Systems Manager's Comprehensive Solution for Multi-Account and Multi-Region Patching Operations

Nov 22, 2024
VideoThumbnail
6:45

Grindr's Next-Gen Chat System: Leveraging AWS for Massive Scale and Security

Nov 22, 2024