AWS Open Source Blog

Introducing OpenSearch

Today, we are introducing the OpenSearch project, a community-driven, open source fork of Elasticsearch and Kibana. We are making a long-term investment in OpenSearch to ensure users continue to have a secure, high-quality, fully open source search and analytics suite with a rich roadmap of new and innovative functionality. This project includes OpenSearch (derived from […]

Build, train, and deploy Amazon Lookout for Vision models using the Python SDK

Amazon Lookout for Vision is a new machine learning (ML) service that spots defects and anomalies in visual representations using computer vision (CV). It was made available in Preview at AWS re:Invent 2020 and became generally available in February 2021. This service lets manufacturing companies increase quality and reduce operational costs by quickly identifying differences […]

Run Selenium tests at scale using AWS Fargate

This article demonstrates an approach for running Selenium tests at scale and at low cost by using AWS Fargate Spot, without having to manage and orchestrate the containers. Selenium framework: Integration tests, as defined by Martin Fowler, “determine if independently developed units of software work correctly when they are connected to each other.” […]

Tracing AWS Lambda functions in AWS X-Ray with OpenTelemetry

AWS Distro for OpenTelemetry is a secure, Amazon Web Services (AWS)-supported, production-ready distribution of the Cloud Native Computing Foundation (CNCF) OpenTelemetry project that provides open source APIs, libraries, and agents to collect distributed traces and metrics for application monitoring. OpenTelemetry is a community effort to simplify observability instrumentation for all. As a committed, active member of […]

Using strong typing practices to declare a large number of resources with AWS CDK

AWS Cloud Development Kit (AWS CDK) is an open source software development framework that is used to declare Infrastructure as Code (IaC). It allows users to declare infrastructure in a general-purpose programming language and is an abstraction built on top of AWS CloudFormation. Resources declared in AWS CDK compile down to CloudFormation stacks that can […]

AWS Distro for OpenTelemetry adds StatsD and Java support

AWS Distro for OpenTelemetry (ADOT) 0.8.0 is now available with StatsD support in the Collector and stable Java 1.0 support with an auto-instrumentation agent for observing your Java applications. StatsD Receiver: The StatsD receiver is part of the OpenTelemetry Collector and collects StatsD metrics for exporting to your choice of monitoring service. This StatsD receiver […]

Metrics collection from Amazon ECS using Amazon Managed Service for Prometheus

Prometheus is an open source monitoring solution that has emerged as a very popular tool for collecting metrics from microservices running in a variety of environments including Kubernetes. In tandem with Grafana, a widely deployed data visualization tool, Prometheus enables customers to query and visualize operational metrics collected from their workloads. Customers deploying their Prometheus […]

How and why AWS contributes to Jupyter

Artificial intelligence (AI) and machine learning (ML) have exploded in popularity as enterprises have sought to make better use of their data. At the heart of these efforts is Project Jupyter, a popular open source project widely used in data science, machine learning, and scientific computing. Although Jupyter is beloved for helping data scientists do […]

Setting up cross-account ingestion into Amazon Managed Service for Prometheus

April 21, 2021: This article has been updated to reflect changes introduced by SigV4 support on the Prometheus server. The recently launched Amazon Managed Service for Prometheus (AMP) provides a highly available and secure environment to ingest, query, and store Prometheus metrics. We can query the metrics from the AMP environment using Amazon Managed Grafana, […]

Move your Apache Airflow connections and variables to AWS Secrets Manager

Data scientists and engineers have made Apache Airflow a leading open source tool to create data pipelines due to its active open source community, familiar Python development as directed acyclic graph (DAG) workflows, and extensive library of prebuilt integrations. However, managing the connections and variables that these pipelines depend on can be a challenge, especially […]