Containers
Category: Technical How-to
How Infinitium reduced fraud detection time by 95% with Amazon ECS and AWS Fargate on AWS Graviton
This post was created in collaboration with the Infinitium Engineering Team. Infinitium (a Euronet Company) is a leading digital payments company in Southeast Asia, specializing in secure online payment solutions and risk management services. With a strong presence across the Asia Pacific region, Infinitium offers cutting-edge technologies such as 3D Secure (3DS) authentication, fraud detection […]
Scaling a Large Language Model with NVIDIA NIM on Amazon EKS with Karpenter
Many organizations are building artificial intelligence (AI) applications using Large Language Models (LLMs) to deliver new experiences to their customers, from content creation to customer service and data analysis. However, the substantial size and intensive computational requirements of these models can make it challenging to configure, deploy, and scale them effectively on graphics processing units (GPUs). […]
How to optimize log management for Amazon EKS with Amazon FSx for NetApp ONTAP
Cloud native technologies empower organizations to build and run scalable applications in modern, dynamic environments. Among cloud native patterns, containers and Kubernetes are mainstream across multiple businesses. According to the Cloud Native Computing Foundation Annual Survey of 2022, 44% of respondents are already using containers for nearly all applications and business segments, and another 35% say containers are […]
Using GitOps for Stateful Workload Management with vSphere CSI driver on on-premises Kubernetes
Kubernetes has become the de facto standard for container orchestration, providing powerful capabilities for deploying and managing stateless workloads. However, users running stateful applications on Kubernetes face unique challenges, especially in VMware environments. A key issue is that the virtual disks used by stateful apps can’t be attached to pods as easily as ephemeral storage. The […]
Accelerating development feedback loops with AWS CDK hotswap deployments for Amazon ECS
Culture Amp is an employee experience platform that provides the insights into employee engagement, performance, and development that organizations need to build a category-defining culture. Culture Amp’s mission is to create a better world of work. An efficient developer feedback loop is a critical part of an effective development process – […]
Patterns for TargetGroupBinding with AWS Load Balancer Controller
Although provisioning load balancers directly from clusters has been the Kubernetes native method for exposing services, in some cases this creates a provisioning process that doesn’t align with the architecture of the applications. Therefore, there’s a need to have another mechanism for it. For those use cases, which we describe in this post, TargetGroupBinding provides the functionality […]
Proactive scaling of Amazon ECS services using Amazon ECS Service Connect Metrics
This post discusses Amazon Elastic Container Service (Amazon ECS) Service Connect, a capability that provides a secure and scalable way to connect different Amazon ECS service deployments. This enables seamless communication between microservices while reducing operational overhead. It provides features such as service discovery, load balancing, and network traffic metrics. This post primarily […]
Host the Whisper Model with Streaming Mode on Amazon EKS and Ray Serve
OpenAI Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. It has demonstrated strong ASR performance across various languages, including the ability to transcribe speech in multiple languages and translate them into English. The Whisper model is open-sourced under the Apache 2.0 license, making it accessible for developers to build useful […]
Deploying Karpenter Nodes with Multus on Amazon EKS
Container-based telco workloads use Multus CNI primarily for traffic or network segmentation. Amazon Elastic Kubernetes Service (Amazon EKS) supports Multus CNI, enabling users to attach multiple network interfaces and apply advanced network configuration and segmentation to Kubernetes-based applications running on AWS. One of the many benefits of running applications on AWS is resource elasticity (scaling out and scaling […]
Ensuring fair bandwidth allocation for Amazon EKS Workloads
Independent Software Vendor (ISV) users often host their end-user solutions on a multi-tenant architecture to reduce cost and operational management. However, this approach can expose Kubernetes clusters to resource exhaustion or network starvation issues that impact neighboring workloads. By default, Kubernetes provides capabilities to enforce resource availability such as CPU and memory to prevent […]