AWS Open Source Blog

Tag: observability

AWS Lambda metrics support for Amazon Managed Service for Prometheus now available in AWS Distro for OpenTelemetry

In this blog post, intern engineers Karen Xu and Kelvin Lo describe how they added metric support to the OpenTelemetry and AWS Distro for OpenTelemetry Lambda layers, and built and tested the metric pipeline to generate, collect, and export application metrics from AWS Lambda to Amazon Managed Service for Prometheus (AMP). The demand for observability […]

Read More

Building a Prometheus Remote Write Exporter for the OpenTelemetry Python SDK

In this post, AWS intern engineers Azfaar Qureshi and Shovnik Bhattacharya talk about their experience building the OpenTelemetry Prometheus Remote Write Exporter for Python. They describe how they tackled the challenges of building this tool, which sends metrics to Prometheus protocol-based service endpoints. As software deployments become increasingly complex, […]

Read More

Set up cross-region metrics collection for Amazon Managed Service for Prometheus workspaces

Amazon Managed Service for Prometheus is a Prometheus-compatible monitoring service for container infrastructure and application metrics that makes it easy for customers to securely monitor container environments at scale. In a previous getting started blog post, we showed how to set up an Amazon Managed Service for Prometheus workspace and ingest metrics from an Amazon […]

Read More

Configuring Grafana Cloud Agent for Amazon Managed Service for Prometheus

This post was written by Robert Fratto, Imaya Kumar Jagannathan, and Alolita Sharma. The Grafana Cloud Agent is a lightweight alternative to running a full Prometheus server. It keeps the necessary parts for discovering and scraping Prometheus exporters and sending metrics to the backend, which in this case is the Amazon Managed Service for Prometheus […]

Read More

How Netflix uses Deep Java Library (DJL) for distributed deep learning inference in real-time

This post was written by Stanislav Kirdey, Lan Qing, Lai Wei, and Lu Huang. Netflix is one of the world’s largest entertainment services, supporting more than 190 million subscribers in different countries who watch on more than a thousand types of devices every day. One of the ways Netflix is able to sustain a […]

Read More

Migrating X-Ray tracing to AWS Distro for OpenTelemetry

In the context of containerized microservices, we face the challenge of telling where along the request path things happen and efficiently drilling into signals. As a developer, you don’t want to fly blind, and one popular way to provide these insights is distributed tracing. In this post, we walk through migrating a […]

Read More