Posted On: Dec 7, 2018
Today, we are announcing that Amazon EC2 P3dn.24xlarge GPU instances, the latest addition to the P3 instance family are generally available. With up to 4x the network bandwidth of P3.16xlarge instances, P3dn.24xlarge instances are optimized for distributed machine learning and HPC applications. These new instances provide up to 100 Gbps of networking throughput, 96 custom Intel®Xeon® Scalable (Skylake) vCPUs, 8 NVIDIA® V100 Tensor Core GPUs with 32 GB of memory each, 300 GB/s NVLINK GPU interconnect, and 1.8 TB of local NVMe-based SSD storage.
The faster networking, new processors with additional vCPUs, doubling of GPU memory, and fast local instance storage enable developers to not only optimize performance on a single instance but also significantly lower the time to train their ML models or run more HPC simulations by scaling out their jobs across several instances (e.g., 16, 32 or 64 instances).
AWS is the first cloud provider to deliver 100 Gbps of networking throughput which helps remove data transfer bottlenecks and optimizes the utilization of GPUs to deliver maximum instance performance. In addition, the doubling of GPU memory from 16 GB to 32 GB per GPU provides the flexibility to train more advanced and larger machine learning models as well as process larger batches of data such as 4k images for image classification and object detection systems.
Below is a comparison of how P3dn.24xlarge instances compare to existing P3 instances.
|Instance Size||GPUs - Tesla V100||GPU Peer to Peer||GPU Memory (GB)||vCPUs||Memory (GB)||Network Bandwidth||EBS Bandwidth||Local Instance Storage|
|p3.2xlarge||1||N/A||16||8 (Broadwell)||61||Up to 10 Gbps||1.5 Gbps||N/A|
||NVLink||64||32 (Broadwell)||244||10 Gbps||7 Gbps||N/A|
|p3.16xlarge||8||NVLink||128||64 (Broadwell)||488||25 Gbps||14 Gbps||N/A|
|p3dn.24xlarge||8||NVLink||256||96 (Skylake)||768||100 Gbps||14 Gbps||2 x 900 GB NVMe SSD|
P3dn.24xlarge instances are available in the US East (N. Virginia) and US West (Oregon) AWS regions as On-Demand, Reserved, or Spot Instances. To learn more about P3 instances and how they are being used by customers such as Airbnb, Salesforce, and Western Digital, visit the P3 page.