AWS and NVIDIA

GPU power from the cloud to the edge

Why AWS and NVIDIA?

AWS and NVIDIA have collaborated since 2010 to continually deliver large-scale, cost-effective, and flexible GPU-accelerated solutions for customers. Spanning from the cloud to the edge, these innovations extend across infrastructure, software, and services to offer a full-stack solution that accelerates time to solution when building and deploying AI into production. With GPU-accelerated solutions available in multiple AWS Regions, customers can access the compute power that they need to achieve low latency, high performance, and high reliability.

Solutions

Generative AI and machine learning

GPU instances and software for the most complex AI/ML models

Organizations of all sizes are using generative AI for chatbots, document analysis, code generation, video and image generation, speech recognition, drug discovery, and synthetic data generation to fast-track innovation, improve customer service, and gain a competitive advantage. To realize the full value of these solutions, organizations need to customize AI and machine learning (ML) models using their own proprietary data, but building models from scratch is expensive and time-consuming. Amazon EC2 instances, powered by NVIDIA GPUs, accelerate training and inference for increasingly complex large language models (LLMs) and compute-intensive generative AI applications. NVIDIA NIM and NeMo microservices, part of NVIDIA AI Enterprise in the AWS Marketplace, enable organizations to unlock the potential of generative AI and LLMs at scale.

Learn more about NVIDIA AI Enterprise

High performance computing

Solve large computational problems and gain new insights

High performance computing (HPC) allows scientists and engineers to solve complex, compute-intensive problems quickly. HPC applications often require a combination of network performance, fast storage, large amounts of memory, and compute capabilities. AWS enables customers to increase the speed of research and reduce time-to-results by running GPU-powered HPC in the cloud and scaling to larger numbers of parallel tasks than would be practical in most on-premises environments. Amazon EC2 instances, powered by NVIDIA GPUs, are an ideal platform to run engineering simulations, computational finance, seismic analysis, molecular modeling, genomics, rendering, and other high performance compute workloads.

Learn more about Amazon EC2 P5 instances, powered by NVIDIA H100 Tensor Core GPUs

Internet of Things

Seamlessly extend AWS to edge devices so they can act locally

IoT devices with machine learning face a number of challenges. Limited computational resources at the edge can restrict the complexity and size of ML models while balancing the need for more sophisticated algorithms. Ensuring real-time processing, low latency, and network security is paramount, as edge devices are often more vulnerable to tampering and malicious attacks. AWS IoT Greengrass seamlessly extends AWS to edge devices such as NVIDIA Jetson so they can act locally on the data they generate, while still using the cloud for management, analytics, and durable storage.

Learn about how to integrate NVIDIA DeepStream on Jetson Modules with AWS IoT Core and AWS IoT Greengrass

Industrial Metaverse

Optimize operations by easily creating simulations of real-world systems

Many industries are benefiting from simulations of real-world objects, which can be accurate, spatially aware, immersive representations of physical entities. The industrial metaverse, covering digital twins and other simulations, helps researchers and engineers better collaborate and test their products through uses such as virtual prototyping or remote monitoring in factories. NVIDIA Omniverse is a computing platform that enables individuals and teams to develop Universal Scene Description (OpenUSD)-based 3D workflows and applications.

Learn more about NVIDIA Omniverse on AWS

Virtual workstations

Adapt your workforce and access creative talent across the globe

As remote working grows and demand for HPC increases, so too does the need for virtual access to powerful workstations as industries adopt a more decentralized approach. NVIDIA's GPU technology ensures that graphics-intensive tasks such as 3D modeling, video editing, and AI development can be seamlessly executed in the cloud, providing users with the performance and visual fidelity they would traditionally expect from on-premises workstations. Running on Amazon EC2 instances powered by NVIDIA GPUs, virtual workstations using NVIDIA RTX technology enhance flexibility and scalability, enabling a more agile work environment for geographically dispersed teams.

Learn more about virtual workstations on AWS

AWS and NVIDIA services