AWS Big Data Blog
Optimize cost and performance for Amazon MWAA
Amazon Managed Workflows for Apache Airflow (Amazon MWAA) is a managed service for Apache Airflow that allows you to orchestrate data pipelines and workflows at scale. With Amazon MWAA, you can design Directed Acyclic Graphs (DAGs) that describe your workflows without managing the operational burden of scaling the infrastructure. In this post, we provide guidance […]
Build efficient ETL pipelines with AWS Step Functions distributed map and redrive feature
AWS Step Functions is a fully managed visual workflow service that enables you to build complex data processing pipelines involving a diverse set of extract, transform, and load (ETL) technologies such as AWS Glue, Amazon EMR, and Amazon Redshift. You can visually build the workflow by wiring individual data pipeline tasks and configuring payloads, retries, […]
Simplify semi-structured nested JSON data analysis with AWS Glue DataBrew and Amazon QuickSight
As the industry grows with more data volume, big data analytics is becoming a common requirement in data analytics and machine learning (ML) use cases. Data comes from many different sources in structured, semi-structured, and unstructured formats. For semi-structured data, one of the most common lightweight file formats is JSON. However, due to the complex […]
Enable Amazon Quick Sight federation with Google Workspace
October 2025: This post was reviewed for accuracy. Amazon Quick Sight is now an AWS IAM Identity Center enabled application. This capability allows administrators who subscribe to Quick Sight to use IAM Identity Center to enable their users to log in with Google Workspace and other external identity providers. For more information, see Simplify business intelligence […]



