AWS HPC Blog

Category: High Performance Computing

Deploying generative AI applications with NVIDIA NIMs on Amazon EKS

Deploying generative AI applications with NVIDIA NIMs on Amazon EKS

Learn how to deploy AI models at scale with @AWS using NVIDIA’s NIM and Amazon EKS! This step-by-step guide shows you how to create a GPU cluster for inference. Don’t miss part 1 of this 2-part blog series!

How vertical scaling and GPUs can accelerate mixed media modelling for marketing analytics

How vertical scaling and GPUs can accelerate mixed media modelling for marketing analytics

In marketing analytics, mixed media modeling (MMM) is a machine learning technique that combines information from various sources, like TV ads, online ads and social media to measure the impact of marketing and advertising campaigns. By using these techniques, businesses can make smarter decisions about where to invest their money for advertising, helping them get […]

Job queue snapshots: see what’s at the head of your queues in AWS Batch

Job queue snapshots: see what’s at the head of your queues in AWS Batch

AWS Batch just grew a neat new feature: Job queue snapshots. This gives you the visibility you need for managing throughput in a dynamic environment – with competing priorities – and across multiple queues and workloads. Today we give you the inside scoop on how this feature works – especially when you’re using fair share scheduling.