AWS HPC Blog

Tag: AWS ParallelCluster

Deploying generative AI applications with NVIDIA NIMs on Amazon EKS

Learn how to deploy AI models at scale on AWS using NVIDIA NIM and Amazon EKS. This step-by-step guide shows you how to create a GPU cluster for inference. Don’t miss part 1 of this 2-part blog series!

Strategies for distributing executable binaries across grids in financial services

You can boost the performance of your compute grids by strategically distributing your binaries. To save you some work, our experts evaluated a range of strategies for fast and efficient compute grid operations.

Large scale training with NVIDIA NeMo Megatron on AWS ParallelCluster using P5 instances

Launching distributed GPT training? See how AWS ParallelCluster sets up a fast shared filesystem, SSH keys, host files, and more between nodes. Our guide has the details for creating a Slurm-managed cluster to train NeMo Megatron at scale.