Amazon Web Services

This video from AWS re:Invent 2023 explores best practices for querying vector data in PostgreSQL for generative AI applications. Jonathan Katz, a Postgres core team member, dives deep into vector search and retrieval techniques using the pgvector extension. He covers key concepts like retrieval-augmented generation, embedding models, and approximate nearest neighbor searches. The talk focuses on optimizing performance and relevancy when working with large vector datasets in Postgres, comparing indexing methods like HNSW and IVFFlat. Katz also discusses strategies for filtering, storage considerations, and new features in Amazon Aurora that can accelerate vector queries. This comprehensive overview provides valuable insights for developers looking to implement efficient vector search capabilities in their PostgreSQL databases for AI/ML workloads.

product-information
skills-and-how-to
generative-ai
ai-ml
databases
Show 2 more

Up Next

VideoThumbnail
15:58

Revolutionizing Business Intelligence: Generative AI Features in Amazon QuickSight

Nov 22, 2024
VideoThumbnail
1:01:07

Accelerate ML Model Delivery: Implementing End-to-End MLOps Solutions with Amazon SageMaker

Nov 22, 2024
VideoThumbnail
6:45

Grindr's Next-Gen Chat System: Leveraging AWS for Massive Scale and Security

Nov 22, 2024
VideoThumbnail
40:23

Set Up and Use Apache Iceberg Tables on Your Data Lake - AWS Virtual Workshop

Nov 22, 2024
VideoThumbnail
2:53:33

Streamlining Patch Management: AWS Systems Manager's Comprehensive Solution for Multi-Account and Multi-Region Patching Operations

Nov 22, 2024