Analyze petabyte-scale data where it lives with ease and flexibility
Get a streamlined, near-instant startup of SQL or Apache Spark analytics workloads with a serverless experience.
Build interactive, advanced analytics applications using data on premises, in your data lake, or in cloud stores.
Gain flexibility with support for choice of language, open-data formats, open-source frameworks, and BI and machine learning (ML) tool integration.
How it works
Amazon Athena is a serverless, interactive analytics service built on open-source frameworks, supporting open-table and file formats. Athena provides a simplified, flexible way to analyze petabytes of data where it lives. Analyze data or build applications from an Amazon Simple Storage Service (S3) data lake and 25-plus data sources, including on-premises data sources or other cloud systems using SQL or Python. Athena is built on open-source Trino and Presto engines and Apache Spark frameworks, with no provisioning or configuration effort required.
Run federated queries
Submit a single SQL query to analyze data in relational, nonrelational, object, and custom data sources running on premises or in the cloud.
Prepare data for ML models
Use ML models in SQL queries or Python to simplify complex tasks, such as anomaly detection, customer cohort analysis, and sales predictions.
Build distributed big data reconciliation engines
Deploy a reconciliation tool with an engine built for the cloud to validate vast amounts of data effectively at scale.
Analyze Google Analytics data
Extract Google Analytics data using Amazon AppFlow, store it in Amazon S3, and then query it.
How to get started
Access a data-querying tutorial
Learn how to start querying data with Athena.
Get your Athena questions answered
Read more about how to use Athena.
Check out what’s new with Athena
Explore the latest features and what’s next for the service.