Amazon EC2 G4 Instances

The industry’s most cost-effective GPU instances for machine learning inference and graphics-intensive applications

Why Amazon EC2 G4 Instances?

Amazon EC2 G4 instances are the industry’s most cost-effective and versatile GPU instances for deploying machine learning models such as image classification, object detection, and speech recognition, and for graphics-intensive applications such as remote graphics workstations, game streaming, and graphics rendering. G4 instances are available with a choice of NVIDIA GPUs (G4dn) or AMD GPUs (G4ad).

G4dn instances feature NVIDIA T4 GPUs and custom Intel Cascade Lake CPUs, and are optimized for machine learning inference and small scale training. These instances also bring high performance to graphics-intensive applications including remote workstations, game streaming, and graphics rendering. These instances are also ideal for customers who prefer to use NVIDIA software such as RTX Virtual Workstation and libraries such as CUDA, CuDNN, and NVENC.

G4ad instances feature the latest AMD Radeon Pro V520 GPUs and 2nd generation AMD EPYC processors. These instances provide the best price performance in the cloud for graphics applications including remote workstations, game streaming, and graphics rendering. Compared to comparable instances they offer up to 45% better price performance for graphics-intensive applications.

New Amazon EC2 G4ad Instances

Amazon EC2 G4dn Instances

G4dn instances, powered by NVIDIA T4 GPUs, are the lowest cost GPU-based instances in the cloud for machine learning inference and small scale training. They also provide high performance and are a cost-effective solution for graphics applications that are optimized for NVIDIA GPUs using NVIDIA libraries such as CUDA, CuDNN, and NVENC. They provide up to 8 NVIDIA T4 GPUs, 96 vCPUs, 100 Gbps networking, and 1.8 TB local NVMe-based SSD storage and are also available as bare metal instances.

G4dn Benefits

G4dn instances are equipped with NVIDIA T4 GPUs which deliver up to 40X better low-latency throughput than CPUs, so more requests can be served in real time. Also, G4dn instances are optimized to be cost-effective for machine learning inference, which can represent up to 90% of overall operational costs for machine learning initiatives.

G4dn instances are also useful for small-scale/entry-level machine learning training jobs for those businesses or institutions that are less sensitive to time-to-train. G4dn instances deliver up to 65 TFLOPs of FP16 performance and are a compelling solution for small-scale training jobs.

G4dn instances have up to 1.8X better graphics performance and up to 2X video transcoding capability over the previous generation G3 instances. Customers can configure virtual workstations with access to NVIDIA RTX Workstations at no additional cost.

G4dn Features

NVIDIA T4 GPUs accelerate diverse cloud workloads, including deep learning training and inference and graphics. Based on the new NVIDIA Turing architecture, T4 GPUs feature multi-precision Turing Tensor Cores and new RT Cores. Turing Tensor Core technology with multi-precision computing for ML powers breakthrough performance from FP32 to FP16 to INT8, as well as INT4 precisions. It delivers up to 9.3X higher performance than CPUs on training and up to 36X on inference.

G4dn instances offer up to 100 Gbps of networking for applications requiring high throughput. G4dn instances also support Elastic Fabric adapter (EFA) that enables customers to run applications requiring high levels of inter-node communications at scale. These instances offer up to 1.8 TB of NVMe-based SSD storage for applications that require fast access to locally stored data.

G4dn instances offer NVIDIA RTX and Gaming drivers to customers at no additional cost. RTX drivers can be used to provide high quality virtual workstations for a wide range of visually intensive workflows. The Gaming driver provides unparalleled graphics and compute support for game development.

Amazon EC2 G4ad Instances

G4ad instances, powered by AMD Radeon Pro V520 GPUs, provide the best price performance for graphics intensive applications in the cloud. These instances offer up to 45% better price performance compared to G4dn instances, which were already the lowest cost instances in the cloud, for graphics applications such as remote graphics workstations, game streaming, and rendering that leverage industry-standard APIs such as OpenGL, DirectX, and Vulkan. They provide up to 4 AMD Radeon Pro V520 GPUs, 64 vCPUs, 25 Gbps networking, and 2.4 TB local NVMe-based SSD storage.

G4ad Benefits

G4ad instances are the lowest cost instances in the cloud for graphics intensive applications. They provide up to 45% better price performance, including up to 40% better graphics performance, compared to comparable instances for graphics applications such as remote graphics workstations, game streaming, and rendering that leverage industry standard APIs such as OpenGL, DirectX, and Vulkan.

G4ad instances allow customers to configure virtual workstations with high-performance simulation, rendering, and design capabilities in minutes, allowing customers to scale quickly. Customers can use AMD Radeon Pro Software for Enterprise and high-performance remote display protocol, NICE DCV, with G4ad instances at no additional cost to manage their virtual workstation environments with support for up to two 4k monitors per GPU.

The AMD professional graphics solution includes an extensive Independent Software Vendor (ISV) application testing and certification process called the Day Zero Certification Program. This helps ensure that developers can leverage the latest AMD Radeon Pro Software for Enterprise features combined with the reliability of certified software on the day of the driver release.

G4ad Features

AMD Radeon Pro V520 GPUs provide high performance acceleration for graphics such as virtual workstations, computer generated imagery (CGI), game streaming, and digital content creation (DCC). These GPUs are built on AMD’s RDNA architecture that is hyper efficient, with low latency and high CPU to GPU bandwidth needed to enable high quality workstation and gaming experiences. With an improved graphics pipeline, RDNA architecture is designed to render your games faster with higher performance per clock.

G4ad instances offer up to 2.4 TB of local NVMe storage for fast data access, enabling customers to efficiently create photo-realistic and high-resolution 3D content for movies, games, and AR/VR experiences.

G4ad instances provide professional grade graphics drivers at no additional cost. These drivers can be used to provide the best virtual workstation experience for a wide range of visually intensive workflows and unparalleled graphics and compute support for game development.