AWS Open Source Blog

AWS Open Source logo.

How AWS and Grafana Labs are scaling Cortex for the cloud

This post was co-authored by Jérôme Decq, Richard Anton, and Tom Wilkie. When we decided to offer a monitoring solution purpose-built for containers users, supporting Prometheus use-case patterns quickly became necessary. However, using Prometheus at cloud scale is difficult. We studied different architectures such as Prometheus plus a dedicated time series database, Thanos, and Cortex. […]

Read More

Deploy fast.ai-trained PyTorch model in TorchServe and host in Amazon SageMaker inference endpoint

Over the past few years, fast.ai has become one of the most cutting-edge, open source, deep learning frameworks and the go-to choice for many machine learning use cases based on PyTorch. It has not only democratized deep learning and made it approachable to general audiences, but fast.ai has also become a role model on how […]

Read More

Building a reliable metrics pipeline with the OpenTelemetry Collector for AWS Managed Service for Prometheus

In this blog post, AWS intern engineers Aman Brar and Jason Liu talk about their experience working with the OpenTelemetry Collector and Prometheus Remote Write Exporter. They share their experiences in tackling challenges they faced and how they applied lessons learned to ensure the reliability of the AWS Distro for the OpenTelemetry Collector as the […]

Read More

AWS Distro for OpenTelemetry adds Prometheus and Lambda support and other cool features

Today’s release of the AWS Distro for OpenTelemetry (ADOT) now brings support for Prometheus and AWS Lambda and adds AWS X-Ray support in Go and Python. The release also adds an OpenTelemetry Protocol (OTLP) HTTP exporter, an AWS EMF exporter, and an X-Ray exporter. Prometheus support: Prometheus support includes an out-of-process remote write exporter for […]

Read More
AWS Open Source logo.

How AWS and Grafana Labs are collaborating to improve Grafana for all

Torkel Ödegaard was fed up. As a developer and architect, he says he struggled to get people on his team to instrument their applications and services and build dashboards to make analyzing and understanding the company’s application data over time easier. In our recent interview, Torkel says his former employer had a forward-looking microservices architecture, […]

Read More
AWS Open Source logo.

How AWS Partners can help you get started with EKS-D

In case you missed it, last week during the re:Invent keynote, Andy Jassy announced Amazon EKS Anywhere, a new deployment option for Amazon Elastic Kubernetes Service (Amazon EKS) that enables you to easily create and operate Kubernetes clusters on-premises thanks to the launch of Amazon EKS Distro (EKS-D). EKS-D is a Kubernetes distribution based on […]

Read More

Using Kedro pipelines to train Amazon SageMaker models

Machine learning (ML) and artificial intelligence (AI) adoption is growing at nearly 25 percent per year in a variety of businesses, which results in data scientists and engineers building more analytical models per person with similar levels of resources as last year. To keep up with such high demand, builders need to remove manual and […]

Read More
AWS Open Source logo.

Migrating Cortex CI/CD workflows to GitHub Actions

In this blog post, intern engineers Azfaar Qureshi and Shovnik Bhattacharya talk about their experience working with Cortex, a popular open source observability project. They share the challenges they faced and how they applied lessons learned to improve the development experience for other contributors in the Cortex Project. The rise of open source has completely […]

Read More

Launching the AWS Distro for OpenTelemetry developer site with Gatsby and GraphQL

In this post, AWS intern Wilbert Guo shares his experience in building the AWS Distro for OpenTelemetry developer site using Gatsby and GraphQL. The developer site aims to provide a place where customers can find out more information about the project, as well as get involved and download the distribution. OpenTelemetry is a popular open […]

Read More

Managing AWS ParallelCluster SSH users with AWS OpsWorks

In a previous article, we highlighted the potential for deploying a local LDAP server to provide a mechanism for managing a multi-user AWS ParallelCluster deployment with low administrator overhead. If we want our cluster users to access or manage other AWS resources, it’s preferable to control their access via AWS Identity and Access Management (IAM). […]

Read More