Deploying DeepSeek V3 and DeepSeek-R1 on Amazon SageMaker

Events
Deploying DeepSeek V3 and DeepSeek-R1 on Amazon SageMaker

AWS Startups

Generative AI

Machine Learning

SageMaker

Technical

Day:

Time:

Type:

IN PERSON

Speakers:

Supreeth S Angadi | GenAI/ML Startups Solution Architect, AWS, Pradipta Dash | Senior Startups Solutions Architect, AWS, Sourabh Jain | Sr. GenAI Startups Account Manager, AWS

Language:

English

Address:

Bagmane Constellation Business Park Block-7, Bagmane Constellation Service Rd, Ferns City, Doddanekkundi, Bengaluru, Karnataka 560048, IN

Capacity:

Full - This event is at capacity

Level(s):

300 - Advanced, 400 - Expert

FULL

Event details

Day

Time

Type

IN PERSON

Place

Bagmane Constellation Business Park Block-7, Bagmane Constellation Service Rd, Ferns City, Doddanekkundi, Bengaluru, Karnataka 560048, IN

Capacity

Full - This event is at capacity

Speakers

Are you a startup founder or machine learning (ML) engineer looking to effectively deploy and manage AI models while optimizing costs?

Join us for an intensive hands-on workshop exploring Amazon SageMaker Studio's unified ML development environment and learn production-ready strategies for model deployment.

DeepSeek is a cutting-edge family of large language models that has gained significant attention in the AI community for its impressive performance, cost-effectiveness, and open-source nature. DeepSeek offers a range of models including the powerful DeepSeek-V3, the reasoning-focused DeepSeek-R1, and various distilled versions. These models stand out for their innovative architecture, utilizing techniques like Mixture-of-Experts and Multi-Head Latent Attention to achieve high performance with lower computational requirements.

In this hands-on workshop, you'll learn about Amazon SageMaker Studio's comprehensive toolkit to self-host large language models from DeepSeek while maintaining cost efficiency.

Who is this for? This workshop is ideal for:

Startup founders and technical leaders creating AI solutions
ML Engineers and Data Scientists
DevOps professionals managing GenAI/ML infrastructure
Technical decision-makers evaluating GenAI/ML platforms
Developers interested in self-hosting open-source LLMs
Engineers looking to optimize their GenAI/ML infrastructure costs

During this hands-on workshop, you'll learn how to leverage Amazon SageMaker Studio's unified environment to streamline your ML workflows and implement cost-effective model deployment strategies.

Key highlights:

Master Amazon SageMaker Studio's unified interface and development environment
Hands-on implementation of self-hosting DeepSeek and similar models
Deploy cost-optimization strategies including scale-to-zero capabilities
Enhance inference performance using Fast Model Loader and container caching
Best practices for managing GenAI/ML development lifecycle
Real-world examples of production GenAI/ML infrastructure optimization
Interactive troubleshooting and optimization sessions

This workshop is specifically designed for startup teams who want to productionze GenAI/ML infrastructure while maintaining cost efficiency. You'll gain hands-on experience with Amazon SageMaker's advanced features and learn practical strategies for managing GenAI/ML workloads.

Prerequisites: