Containers

Deploy production generative AI at the edge using Amazon EKS Hybrid Nodes with NVIDIA DGX

This post demonstrates a real-world example of integrating EKS Hybrid Nodes with NVIDIA DGX Spark, a compact and energy-efficient GPU platform optimized for edge AI deployment. In this post we walk you through deploying a large language model (LLM) for low-latency generative AI inference on-premises, setting up node monitoring and GPU observability with centralized management through Amazon EKS.

Automated deployments with GitHub Actions for Amazon ECS Express Mode

In this post, we will walk you through building an automated deployment pipeline using GitHub Actions. You will create a workflow that triggers on code changes, builds Docker images, pushes them to Amazon ECR, and deploys to Amazon ECS Express Mode using IAM roles for secure authentication. By the end, you will have a continuous integration and continuous delivery (CI/CD) workflow that automatically deploys your application when you push code.

Announcing the end-of-support for the AWS Copilot CLI

We are announcing that AWS Copilot CLI will reach end of support on June 12, 2026. Copilot simplified building, releasing, and operating production-ready containerized applications on Amazon Elastic Container Service (ECS) or AWS App Runner by providing a command-line interface (CLI) tool. While AWS Copilot CLI will continue to be available as an open-source project on GitHub, it will no longer receive new features or security updates from AWS.

Migrate Amazon EC2 to ECS Express Mode using Kiro CLI and MCP servers

In this post, we’ll walk through a practical migration scenario where a Node.js web application running on EC2 instances is migrated into a highly scalable, containerized service on ECS Express Mode. You’ll learn how to configure and use the AWS and ECS MCP Servers with Kiro CLI to automate critical migration tasks, from Dockerfile creation and image optimization to ECS service configuration and production deployment.

Beyond metrics: Extracting actionable insights from Amazon EKS with Amazon Q Business

In this post, we demonstrate a solution that uses Amazon Data Firehose to aggregate logs from the Amazon EKS control plane and data plane, and send them to Amazon Simple Storage Service (Amazon S3). Finally, we use Amazon Q Business and its Amazon S3 connector to synchronize the logs, index the log data in Amazon S3, and enable a chat experience powered by the generative AI capabilities of Amazon Q Business.