Compute›
Amazon EC2›
Capacity Blocks

Amazon EC2 Capacity Blocks for ML

Reserve GPU instances in Amazon EC2 UltraClusters to run your ML workloads

Get started with EC2 Capacity Blocks

Contact Sales

Why EC2 Capacity Blocks for ML?

With Amazon Elastic Compute Cloud (Amazon EC2) Capacity Blocks for ML, easily reserve Amazon EC2 P5 instances, powered by the latest NVIDIA H100 Tensor Core GPUs, and Amazon EC2 P4d instances, powered by NVIDIA A100 Tensor Core GPUs, for a future start date. EC2 Capacity Blocks are colocated in Amazon EC2 UltraClusters designed for high-performance machine learning (ML) workloads. You can reserve GPU instances for up to 28 days in cluster sizes of one to 64 instances (512 GPUs), giving you the flexibility to run a broad range of ML workloads. EC2 Capacity Blocks can be reserved up to eight weeks in advance.

Benefits

Plan with confidence

Plan your ML development with confidence by ensuring future available capacity for GPU instances.

Low-latency, high-throughput network connectivity

Get low-latency, high-throughput network connectivity through colocation in Amazon EC2 UltraClusters for distributed training.

High performance

Gain predictable access to GPU instances with the highest performance in Amazon EC2 for machine learning.

Use cases

Train or fine-tune ML models using GPU instances

Get uninterrupted access to the GPU instances that you reserve to complete ML model training and fine-tuning.

Get GPU instances for the amount of time you need to run your experiments

Run experiments and build prototypes that require GPU instances for short durations.

Plan for future surges in demand for ML applications

Meet your growth needs by reserving the right amount of capacity to serve your customers.

Get started

Learn how to use EC2 Capacity Blocks with Amazon EC2 Auto Scaling

EC2 Capacity Blocks prices are dynamic and change based on available supply and demand