Amazon Web Services

This video from AWS re:Invent 2023 explores training and tuning state-of-the-art machine learning models on Amazon SageMaker. Gal Oshri, Emily Webber, and Thomas Kollar discuss the challenges of training large-scale ML models and how SageMaker addresses them. They cover SageMaker features such as distributed training, cluster repair, and the new smart sifting capability. Emily Webber walks through fine-tuning and pre-training large language models, including a demo of training Llama 7B on SageMaker. Thomas Kollar shares how Toyota Research Institute uses SageMaker for ML use cases that include autonomous driving and robotics. The presenters highlight SageMaker's ability to scale efficiently from small experiments to large-scale training jobs, which makes it practical for companies to create and customize their own foundation models.

product-information
skills-and-how-to
generative-ai
ai-ml
sagemaker