AWS and NVIDIA

AWS›
AWS and NVIDIA

AWS and NVIDIA

GPU power from the cloud to the edge

Apply for a free trial with NVIDIA and AWS

What's new

See how AWS and NVIDIA are helping organizations transform AI goals into measurable business outcomes across industries.

AWS AI infrastructure with NVIDIA Blackwell: Two powerful compute solutions for the next frontier of AI

Read the blog

New Amazon EC2 P6e-GB200 UltraServers accelerated by NVIDIA Grace Blackwell GPUs for the highest AI performance

Read the blog

Why AWS and NVIDIA?

AWS and NVIDIA have collaborated since 2010 to continually deliver large-scale, cost-effective, and flexible GPU-accelerated solutions for customers. Spanning from the cloud to the edge, these innovations extend across infrastructure, software, and services to offer a full-stack solution that accelerates time to solution when building and deploying AI into production. With GPU-accelerated solutions available in multiple AWS Regions, customers can access the compute power that the need to achieve low latency, high performance, and high reliability.

Generative AI and machine learning

GPU instances and software for the most complex AI/ML models

Organizations of all sizes are using generative AI for chatbots, document analysis, code generation, video and image generation, speech recognition, drug discovery, and synthetic data generation to fast-track innovation, improve customer service, and gain a competitive advantage. To realize the full value of these solutions, organizations need to customize AI and machine learning (ML) models using their own proprietary data but building models from scratch is expensive and time-consuming. Amazon EC2 instances, powered by NVIDIA GPUs, accelerate training and inference for increasingly complex LLMs and compute-intensive generative AI applications. NVIDIA NIM and NeMo microservices, part of NVIDIA AI Enterprise in the AWS Marketplace, enables organizations to unlock the potential of generative AI and LLMs at scale.

Learn more about NVIDIA AI Enterprise

High performance computing

Solve large computational problems and gain new insights

High performance computing (HPC) allows scientists and engineers to solve complex, compute-intensive problems, quickly. HPC applications often require a combination of network performance, fast storage, large amounts of memory, and compute capabilities. AWS enables customers to increase the speed of research and reduce time-to-results by running GPU-powered HPC in the cloud and scaling to larger numbers of parallel tasks than would be practical in most on-premises environments. Amazon EC2 instances, powered by NVIDIA GPUs, are an ideal platform to run engineering simulations, computational finance, seismic analysis, molecular modeling, genomics, rendering, and other high performance compute workloads.

Learn more about Amazon EC2 P5 instances, powered by NVIDIA H100 Tensor Core GPUs

Internet of Things

Seamlessly extend AWS to edge devices so they can act locally

IoT devices with machine learning face a number of challenges. Limited computational resources at the edge can restrict the complexity and size of ML models while balancing the need for more sophisticated algorithms. Ensuring real-time processing, low latency and network security is paramount as edge devices are often more vulnerable to tampering and malicious attacks. AWS IoT Greengrass seamlessly extends AWS to edge devices such as NVIDIA Jetson so they can act locally on the data they generate, while still using the cloud for management, analytics, and durable storage.

Learn about how to integrate NVIDIA DeepStream on Jetson Modules with AWS IoT Core and AWS IoT Greengrass

Industrial Metaverse

Optimize operations by easily creating simulations of real-world systems

Many industries are benefiting from the simulation of real-world objects which can be accurate and spatially-aware immersive representations of physical entities. The industrial metaverse, covering digital twins and other simulations, helps researchers and engineers better collaborate and test their products, such as virtual prototyping or remote monitoring in factories. NVIDIA Omniverse is a computing platform that enables individuals and teams to develop Universal Scene Description (OpenUSD) based 3D workflows and applications.

Learn more about NVIDIA Omniverse on AWS

Virtual workstations

Adapt your workforce and access creative talent across the globe

As remote working grows and demand for HPC increases, so too does the need for virtual access to powerful workstations as industries adopt a more decentralized approach. NVIDIA's GPU technology ensures that graphic-intensive tasks such as 3D modeling, video editing, and AI development can be seamlessly executed in the cloud, providing users with the performance and visual fidelity they would traditionally expect from on-premises workstations. Running on Amazon EC2 instances, powered by NVIDIA GPUs, virtual workstations using NVIDIA RTX technology enhance flexibility and scalability enabling a more agile work environment for geographically dispersed teams.

Learn more about virtual workstations on AWS

Industry

AI is powering change in every industry across the globe. From speech recognition and recommender systems to medical imaging and improved supply chain management, AI is giving enterprises the compute power, tools, and algorithms their teams need to do their life’s work.

Healthcare and life sciences

HPC and AI are transforming medicine today. From enhanced medical imaging, accelerated genomics analysis, and new drug discovery and development, the HCLS industry can offer more personalized treatments, next-generation clinics, and enhanced quality of care.

Example customer: Paige

Financial services

In a fast-changing financial landscape, Banking institutions can use AI to harness vast amounts of data to increase encryption, boost security, detect fraud, and offer a more personalized service to their customers.

Example customer: Nerdwallet

Telecommunications

Telcos are looking to maximize data analytics, AI, and automation to improve customer service and increase loyalty. Create AI-enabled solutions, build software-defined infrastructure for 5G, and bring connected intelligence to smart devices at the edge.

Example customer: NTT Docomo

Public sector

AI-powered applications enhance disaster relief and climate resilience efforts while also speeding up time-consuming administrative tasks such as drafting, editing, and summarizing documents, updating databases, recording expenditures for auditing and compliance, dealing with customer enquiries.

Media and entertainment

New technologies are transforming the media and entertainment industry. AI-accelerated production pipelines deliver higher quality content faster, data analytics provide deeper insights, distribution and monetization are optimized, and software-defined infrastructure is enhancing live entertainment.

Example customer: Hive VFX

Project Ceiba & DGX Cloud

Project Ceiba is a collaboration between AWS and NVIDIA to build world's largest supercomputer in the cloud for NVIDIA's AI R&D, hosted exclusively on AWS.

Learn more about Project Ceiba

AWS and NVIDIA services

Instances

Amazon EC2 P5 instances

Amazon Elastic Compute Cloud (Amazon EC2) P5 instances, powered by NVIDIA H100 Tensor Core GPUs, deliver the highest performance in Amazon EC2 for deep learning (DL) and high performance computing (HPC) applications.

Learn more

Instances

Amazon EC2 P4d instances

Amazon EC2 P4d instances deliver the highest performance for ML training and HPC applications in the cloud. P4d instances are powered by the latest NVIDIA A100 Tensor Core GPUs and deliver industry-leading high throughput and low latency networking.

Learn more

Instances

Amazon EC2 P3 instances

Amazon EC2 P3 instances feature up to 8 NVIDIA V100 Tensor Core GPUs and up to 100 Gbps of networking throughput for ML and HPC applications. P3 instances have been proven to reduce ML training times from days to minutes, as well as increase the number of simulations completed for HPC by 3-4x.

Learn more

IoT

AWS IoT Greengrass

AWS IoT Greengrass seamlessly extends AWS to edge devices, such as NVIDIA Jetson systems, so they can act locally on the data they generate, while still using the cloud for management, analytics, and durable storage. AWS IoT Greengrass lets connected devices operate even with intermittent connectivity to the cloud.

Learn more

Instances

Amazon EC2 G4 instances

Amazon EC2 G4 instances feature NVIDIA T4 Tensor Core GPUs, providing access to one GPU or multiple GPUs, with different amounts of vCPU and memory. G4 instances provide the industry’s most cost-effective and versatile GPU instance for deploying ML models in production and graphics-intensive applications.

Learn more