Posted On: Aug 15, 2024

The Amazon Elastic Compute Cloud (Amazon EC2) P5 instances, powered by NVIDIA H100 Tensor Core GPUs, are now available in AWS Secret cloud. Amazon EC2 P5 instances help you accelerate your time to solution by up to 4x compared to previous-generation GPU-based EC2 instances, and reduce cost to train ML models by up to 40%.

You can use P5 instances for training and deploying increasingly complex large language models (LLMs) and diffusion models powering the most demanding generative AI applications. This includes question answering, code generation, video and image generation, speech recognition, and more. You can also use P5 instances to deploy demanding HPC applications at scale in pharmaceutical discovery, seismic analysis, weather forecasting, and financial modeling.

P5 instances are powered by the latest NVIDIA H100 Tensor Core GPUs and provide 2x higher CPU performance, 2x higher system memory, and 4x higher local storage as compared to previous-generation GPU-based instances. They provide market-leading scale-out capabilities for distributed training and tightly coupled HPC workloads with up to 3,200 Gbps of networking using second-generation Elastic Fabric Adapter (EFA) technology. 

The content in this post is for informational purposes only. For more information on the Amazon EC2 P5 instances in the AWS Secret cloud, please contact us.