AWS Machine Learning Blog
Tag: GPUs
Achieve high performance at scale for model serving using Amazon SageMaker multi-model endpoints with GPU
Amazon SageMaker multi-model endpoints (MMEs) provide a scalable and cost-effective way to deploy a large number of machine learning (ML) models. It gives you the ability to deploy multiple ML models in a single serving container behind a single endpoint. From there, SageMaker manages loading and unloading the models and scaling resources on your behalf […]
AWS and NVIDIA Expand Deep Learning Partnership at GTC 2017
This year at NVIDIA’s GPU Technology Conference, AWS and NVIDIA partnered on multiple initiatives. The first is an exciting new Volta-based GPU instance that we think will completely change the face of the AI developer world through a 3x speedup on LSTM training. Second, we are announcing plans to train 100,000+ developers through the Deep […]