AWS Marketplace

Durga Sury and Victor Jaramillo

Author: Durga Sury and Victor Jaramillo

right sizing sagemaker endpoints

Rightsizing Amazon SageMaker endpoints

As AWS consultants, Victor and I often get asked about recommendations on the right instance configuration to use for real-time inference. Finding the correct instance size to host your trained machine learning (ML) models might be a challenging task. However, choosing the right instance and auto scaling configuration can help reduce model serving costs without […]