AWS Storage Blog

Author: Venkata Sistla

Venkata is a Senior Specialist Solutions Architect at AWS, with over 12 years of experience in cloud architecture. He specializes in designing and implementing enterprise-scale AI/ML platforms across various industry sectors. He focuses on architecting highly scalable infrastructures that accelerate machine learning initiatives and deliver measurable business outcomes.

Building self-managed RAG applications with Amazon EKS and Amazon S3 Vectors

Retrieval-Augmented Generation (RAG) is a technique that optimizes large language model (LLM) outputs by referencing authoritative knowledge bases outside of the model’s training data before generating responses. This addresses common limitations of traditional LLMs, such as outdated knowledge, hallucinated facts, and misinterpreted terminology. Organizations can implement RAG to enhance their generative AI applications with current, […]

Architecting scalable checkpoint storage for large-scale ML training on AWS

The exponential growth in size and complexity of foundation models (FMs) has created unprecedented infrastructure demands across compute, networking, and storage resources. Storage systems, in particular, face intense requirements for throughput, latency, and capacity. In machine learning (ML) model training, these storage demands are particularly evident in checkpointing—a critical reliability mechanism that periodically saves and […]

AWS Storage Blog

Author: Venkata Sistla

Building self-managed RAG applications with Amazon EKS and Amazon S3 Vectors

Architecting scalable checkpoint storage for large-scale ML training on AWS

Learn

Resources

Developers

Help