AWS Open Source Blog

Category: Application Integration

phonlamaiphoto - stock.adobe.com cloud computing technology

Deploying to Amazon Managed Workflows for Apache Airflow with CI/CD tools

Apache Airflow‘s active open source community, familiar Python development as directed acyclic graph (DAG) workflows, and extensive library of pre-built integrations have helped it become a leading tool for data scientists and engineers for creating data pipelines. Amazon Managed Workflows for Apache Airflow (Amazon MWAA) is a fully managed service that makes running open source […]

Open Sourcing Event Ruler

Open Sourcing Event Ruler

Amazon is excited to open source a key technology that powers EventBridge as Event Ruler under the Apache 2.0 license. Event Ruler is a Java library that allows you to build applications that can match any number of rules against events at several hundred thousand events per second.

Using Apollo Server on AWS Lambda with Amazon EventBridge for real-time, event-driven streaming

GraphQL is an application-level query language that helps clients and servers communicate by establishing a common protocol for queries. It represents an alternative to the REST style: unlike REST, GraphQL gives the client, not the server, the power to define what kind of data will be included in the response to its query. GraphQL allows […]

solution architecture described in blog post

Amazon MWAA with AWS CodeArtifact for Python dependencies

This post was written by Dzenan Softic and Sam Dengler. Many organizations rely on Apache Airflow, an open source project, to orchestrate their data pipelines. In 2020, Amazon Web Services (AWS) released Amazon Managed Workflows for Apache Airflow (Amazon MWAA), which lets engineers focus on business solutions rather than on running and maintaining infrastructure for […]

Implementing a hub and spoke dashboard for multi-account data science projects

Modern data science environments often involve many independent projects, each spanning multiple accounts. In order to maintain a global overview of the activities within the projects, a mechanism to collect data from the different accounts into a central one is crucial. In this post, we show how to leverage existing services—Amazon DynamoDB, AWS Lambda, Amazon […]

Automating a DAG deployment with Amazon Managed Workflows for Apache Airflow

Many developers and data engineers use Managed Workflows for Apache Airflow (Amazon MWAA), a managed service for open source Apache Airflow, to programmatically author, schedule, and monitor workflows. With Amazon MWAA, you can focus on business logic and build workflows without worrying about the management responsibilities such as setup, patching, upgrades, security, scaling, and so […]

Tierney - stock.adobe.com Cloud computing with abstract high speed technology POV motion blur

Migrating from self-managed Apache Airflow to Amazon Managed Workflows for Apache Airflow (MWAA)

This post was written by Tomas Christ, Solution Architect at eprimo GmbH. eprimo GmbH is a wholly owned subsidiary of E.ON SE, situated near Frankfurt, Germany. It represents the largest purely green-energy supplier in Germany with some 1.7 million customers. Currently, eprimo has a staff of approximately 160 people. We have been using Amazon Web […]

Chepko Danil – stock.adobe.com

Move your Apache Airflow connections and variables to AWS Secrets Manager

Data scientists and engineers have made Apache Airflow a leading open source tool to create data pipelines due to its active open source community, familiar Python development as directed acyclic graph (DAG) workflows, and extensive library of prebuilt integrations. However, managing the connections and variables that these pipelines depend on can be a challenge, especially […]

peshkov – stock.adobe.com AdobeStock_303323028_news_1000x500

Amazon Managed Workflows for Apache Airflow unaffected by Airflow 1.10.12 vulnerability

Amazon Managed Workflows for Apache Airflow (MWAA) is not affected by the recently announced vulnerability in Apache Airflow 1.10.12. The default airflow.cfg file uses a temporary key that is the same for all installations. In Airflow 1.10.12 and earlier, there was no restriction in using that temporary key on the Airflow web server, meaning that […]

Improving the serverless developer experience with sls-dev-tools

This post was contributed by AWS Serverless Hero Ben Ellerby. One of the most exciting things about serverless is the acceleration it can provide to teams. No longer do you have to invest time in the undifferentiated, heavy lifting of managing the infrastructure of our applications. Instead you can spend time writing code that creates […]