AWS Machine Learning Blog

Tag: GPUs

Achieve high performance at scale for model serving using Amazon SageMaker multi-model endpoints with GPU

Amazon SageMaker multi-model endpoints (MMEs) provide a scalable and cost-effective way to deploy a large number of machine learning (ML) models. It gives you the ability to deploy multiple ML models in a single serving container behind a single endpoint. From there, SageMaker manages loading and unloading the models and scaling resources on your behalf […]

AWS and NVIDIA Expand Deep Learning Partnership at GTC 2017

This year at NVIDIA’s GPU Technology Conference, AWS and NVIDIA partnered on multiple initiatives. The first is an exciting new Volta-based GPU instance that we think will completely change the face of the AI developer world through a 3x speedup on LSTM training. Second, we are announcing plans to train 100,000+ developers through the Deep […]