Containers

Category: Technical How-to

Deploying Model Context Protocol (MCP) servers on Amazon ECS

In this post, we will walk you through a three-tier MCP application deployed entirely on Amazon ECS, using Service Connect for service-to-service communication and Express Mode for automated load balancing, to show how to take an MCP-based workload from concept to production.

Building intelligent knowledge graphs for Amazon EKS operations using AWS DevOps Agent

In this post, we demonstrate how AWS DevOps Agent works—from alert generation to identifying the affected EKS cluster, building knowledge graphs, and troubleshooting application or infrastructure issues, ultimately reducing MTTI and MTTR for your Kubernetes operations.

Building PCI DSS-Compliant Architectures on Amazon EKS

In this post, we explore key considerations, best practices, and architectural decisions hosting applications on EKS in shared tenancy environments while maintaining PCI DSS compliance. Please note this information is for reference purposes only and does not constitute legal or compliance advice—customers remain responsible for making their own independent assessment, and AWS products or services are provided ‘as is’ without warranties, representations, or conditions of any kind.

Deploy production generative AI at the edge using Amazon EKS Hybrid Nodes with NVIDIA DGX

This post demonstrates a real-world example of integrating EKS Hybrid Nodes with NVIDIA DGX Spark, a compact and energy-efficient GPU platform optimized for edge AI deployment. In this post we walk you through deploying a large language model (LLM) for low-latency generative AI inference on-premises, setting up node monitoring and GPU observability with centralized management through Amazon EKS.

Automated deployments with GitHub Actions for Amazon ECS Express Mode

In this post, we will walk you through building an automated deployment pipeline using GitHub Actions. You will create a workflow that triggers on code changes, builds Docker images, pushes them to Amazon ECR, and deploys to Amazon ECS Express Mode using IAM roles for secure authentication. By the end, you will have a continuous integration and continuous delivery (CI/CD) workflow that automatically deploys your application when you push code.

Beyond metrics: Extracting actionable insights from Amazon EKS with Amazon Q Business

In this post, we demonstrate a solution that uses Amazon Data Firehose to aggregate logs from the Amazon EKS control plane and data plane, and send them to Amazon Simple Storage Service (Amazon S3). Finally, we use Amazon Q Business and its Amazon S3 connector to synchronize the logs, index the log data in Amazon S3, and enable a chat experience powered by the generative AI capabilities of Amazon Q Business.

Monitor Amazon ECS Events with Amazon EventBridge Filtering

In this post, we demonstrate how to capture specific Amazon ECS events using EventBridge rules for enhanced monitoring and troubleshooting of your containerized applications. We show you how to customize EventBridge filtering patterns to capture the specific Amazon ECS events that matter for your troubleshooting and monitoring needs.

Streamline your containerized CI/CD with GitLab Runners and Amazon EKS Auto Mode

In this post we demonstrate how using GitLab Runners on EKS Auto Mode, combined with Amazon Elastic Compute Cloud (Amazon EC2) Spot Instances, can deliver enterprise-scale CI/CD capabilities while achieving up to 90% cost reduction when compared to traditional deployment models. This approach not only optimizes operational expenses, but also provides resilient, scalable pipeline execution.

Amazon EKS introduces Provisioned Control Plane

Amazon EKS introduces Provisioned Control Plane, a new capability that allows you to pre-allocate control plane capacity for predictable, high-performance Kubernetes operations at scale. In this post, we explore how this enhanced option complements the Standard Control Plane by offering multiple scaling tiers (XL, 2XL, 4XL) with well-defined performance characteristics for API request concurrency, pod scheduling rates, and cluster database size—enabling you to handle demanding workloads like ultra-scale AI training, high-performance computing, and mission-critical applications with confidence.