Containers

Category: Healthcare

Deploy production generative AI at the edge using Amazon EKS Hybrid Nodes with NVIDIA DGX

This post demonstrates a real-world example of integrating EKS Hybrid Nodes with NVIDIA DGX Spark, a compact and energy-efficient GPU platform optimized for edge AI deployment. We walk you through deploying a large language model (LLM) for low-latency generative AI inference on premises and setting up node monitoring and GPU observability, with centralized management through Amazon EKS.

Migration considerations – Cloud Foundry to Amazon ECS with AWS Fargate

Customers often turn to Amazon Web Services (AWS) to scale their applications, reduce latency, and deploy new features quickly with a pay-as-you-go approach. Consider customers running hundreds of applications on Cloud Foundry Platform as a Service (PaaS). To achieve cost efficiency, customers can migrate to AWS container services, […]