AWS Gen AI Loft | Best Practices for Deploying AI Models on Amazon EKS

Events
AWS Gen AI Loft | Best Practices for Deploying AI Models on Amazon EKS

Contêineres

AWS GenAI Loft | Bangalore

IA generativa

Dia:

Hora:

Tipo:

PRESENCIAL

Palestrantes:

Nayanendu Mishra | Sr. Startup SA, AWS, Abhishek Nanda | Containers Specialist SA, AWS

Idioma:

English

Endereço:

2Moons, 5th Floor, 1MG Mall, Trinity Circle, MG Road, Opposite Taj MG Road, Bengaluru, Karnataka 560008, IN

Nível(is):

200 – Intermediário, 300 - Avançado

Detalhes do evento

Dia

Hora

Tipo

PRESENCIAL

Local

2Moons, 5th Floor, 1MG Mall, Trinity Circle, MG Road, Opposite Taj MG Road, Bengaluru, Karnataka 560008, IN

Palestrantes

Generative AI on Amazon Elastic Kubernetes Service (EKS)

The generative AI/ML space is rapidly evolving, with many innovations coming from the open-source community, which often includes native Kubernetes integration. Amazon EKS offers unparalleled customization, allowing organizations to control their infrastructure down to the instance level and configure environments to meet specific requirements, including support for all Amazon EC2 instance types and specialized GPUs. Amazon EKS provides versatile deployment options across on-premises, edge, and cloud environments, along with seamless scalability and continuous cost optimization through tools like Karpenter and GPU sharing mechanisms.

What You’ll Gain:

In this session, we will explore the reasons why customers are choosing to run generative AI on Amazon EKS, along with their common use cases. We will then delve into the technical details of how generative AI operates on Amazon EKS. Finally, we will demonstrate these concepts in a live demo and discuss best practices for deploying and performing inference with large language models (LLMs) on Amazon EKS.

Agenda

AI on Amazon EKS Introduction
- Benefits of using Amazon EKS for AI
Infrastructure Options for AI/ML Workloads on Amazon EKS
- Compute Resources
- Storage
- Amazon EKS - optimized accelerated AMI
Inference and Training on Amazon EKS
- Building Blocks
- Essential Frameworks for Deploying Generative AI on Amazon EKS
- Efficiently Scaling LLMs using vLLM, Ray, and AWS Neuron
- Observability for Generative AI workloads on Amazon EKS
- Deep Learning Containers
Customer Use Cases
- GPU sharing using Time Slicing and Multi-Instance GPU (MIG)
- Optimize Model loading and container startup time on Amazon EKS
- Running RAG workloads on Amazon EKS
- Building Model Context Protocol on Amazon EKS