Data on EKS | Containers

Scaling StarRocks on Amazon EKS with KEDA and Karpenter for enterprise OLAP workloads

Financial analytics at enterprise scale is unforgiving. Queries must return in seconds, not minutes. Thousands of finance professionals need concurrent access during monthly close cycles. And when data volumes grow from hundreds of gigabytes to terabytes, spanning billions of records, the infrastructure underneath must scale without forcing engineers to choose between performance and cost. This […]

Introducing AI on EKS: powering scalable AI workloads with Amazon EKS

This blog post was jointly authored by Vara Bonthu, Principal OSS Specialist Solutions Architect and Omri Shiv, Senior Open Source ML Engineer Introduction We’re excited to announce the launch of AI on EKS: a new open source initiative from Amazon Web Services (AWS) designed to help customers deploy, scale, and optimize artificial intelligence/machine learning (AI/ML) […]

Optimizing data lakes with Amazon S3 Tables and Apache Spark on Amazon EKS

This blog post was authored by Aritra Gupta (Senior Product Manager – S3), Vara Bonthu (Principal, Open Source Software Specialist Solutions Architect), Ratnopam Chakrabarti (Senior Solutions Architect – Containers & Open Source Software), and Manabu McCloskey (Senior Open Source Engineer). Introduction Managing business data has become increasingly challenging as companies collect more information than ever […]

AWS at KubeCon + CloudNativeCon Europe 2025

Amazon Web Services (AWS) is headed to London for KubeCon + CloudNativeCon Europe 2025. Join us at Booth S300 in Excel London from April 1-4 to discover our latest innovations and learn how our customers, partners, and community members are transforming their businesses with Kubernetes. Experience hands-on demonstrations of AWS solutions that simplify Kubernetes operations, […]

Scaling a Large Language Model with NVIDIA NIM on Amazon EKS with Karpenter

Many organizations are building artificial intelligence (AI) applications using Large Language Models (LLMs) to deliver new experiences to their customers, from content creation to customer service and data analysis. However, the substantial size and intensive computational requirements of these models may have challenges in configuring, deploying, and scaling them effectively on graphic processing units (GPUs). […]

Inside Pinterest’s Custom Spark Job logging and monitoring on Amazon EKS: Using AWS for Fluent Bit, Amazon S3, and ADOT

In Part 1, we explored Moka’s high-level design and logging infrastructure, showcasing how AWS for Fluent Bit, Amazon S3, and a robust logging framework make sure of operational visibility and facilitate issue resolution. For more details, read part 1 here. Introduction As we transition to the second part of our series, our focus shifts to […]

Quora: 3x Faster Machine Learning, 25% Lower Costs with Nvidia Triton on Amazon EKS

Quora achieved 3x lower latency and 25% lower Costs by modernizing model serving with Nvidia Triton on Amazon EKS

Introduction Quora is a leading Q&A platform with a mission to share and grow the world’s knowledge, serving hundreds of millions of users worldwide every month. Quora uses machine learning (ML) to generate a custom feed of questions, answers, and content recommendations based on each user’s activity, interests, and preferences. ML drives targeted advertising on […]

Deploying and scaling Apache Kafka on Amazon EKS

Introduction Apache Kafka, a distributed streaming platform, has become a popular choice for building real-time data pipelines, streaming applications, and event-driven architectures. It is horizontally scalable, fault-tolerant, and performant. However, managing and scaling Kafka clusters can be challenging and often time-consuming. This is where Kubernetes, an open-source platform for automating deployment, scaling, and management of […]

Building multi-tenant JupyterHub Platforms on Amazon EKS

Introduction In recent years, there’s been a remarkable surge in the adoption of Kubernetes for data analytics and machine learning (ML) workloads in the tech industry. This increase is underpinned by a growing recognition that Kubernetes offers a reliable and scalable infrastructure to handle these demanding computational workloads. Furthermore, a recent wave of Generative AI […]

Lacework’s batch workloads on Amazon EKS: Lessons learned

This post was co-written with Derek Brown, Infrastructure Engineer, Lacework Introduction Lacework is a security platform that automatically monitors and detects misconfigurations and security vulnerabilities across our customers’ cloud environments. Lacework takes a data-driven approach to alerting and detection, consuming trillions of events from a huge list of sources including eBPF, Kubernetes Audit Logs, cloud […]

Containers

Tag: Data on EKS