Amazon Athena

Start querying data instantly. Get results in seconds. Pay only for the queries you run.

Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.

Athena is easy to use. Simply point to your data in Amazon S3, define the schema, and start querying using standard SQL. Most results are delivered within seconds. With Athena, there’s no need for complex ETL jobs to prepare your data for analysis. This makes it easy for anyone with SQL skills to quickly analyze large-scale datasets.

Athena integrates out of the box with the AWS Glue Data Catalog. This lets you create a unified metadata repository across various services, crawl data sources to discover schemas, populate your Catalog with new and modified table and partition definitions, and maintain schema versioning.

Request support for your proof-of-concept or evaluation »

Benefits

Start Querying Instantly

Serverless, no ETL
Athena is serverless. You can quickly query your data without having to set up and manage any servers or data warehouses. Just point to your data in Amazon S3, define the schema, and start querying using the built-in query editor. Amazon Athena allows you to tap into all your data in S3 without the need to set up complex processes to extract, transform, and load the data (ETL).
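The "point to your data, define the schema, start querying" flow can be sketched in Python as follows. This is a minimal illustration, not an official recipe: the helper names `build_create_table_ddl` and `run_query`, the database, table, and bucket names are all hypothetical, and the `client` argument is assumed to be a boto3 Athena client (`boto3.client("athena")`), whose `start_query_execution` and `get_query_execution` calls are used here.

```python
import time


def build_create_table_ddl(database, table, columns, s3_location, stored_as="PARQUET"):
    """Return a CREATE EXTERNAL TABLE statement for data that already sits in S3.

    columns is a list of (name, sql_type) pairs; stored_as names the file
    format (for example PARQUET or ORC). All names are illustrative.
    """
    column_sql = ",\n  ".join(f"{name} {sql_type}" for name, sql_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {database}.{table} (\n"
        f"  {column_sql}\n"
        f")\n"
        f"STORED AS {stored_as}\n"
        f"LOCATION '{s3_location}'"
    )


def run_query(client, sql, database, output_location, poll_seconds=1.0):
    """Submit sql through an Athena client and poll until it reaches a final state.

    Returns (query_execution_id, final_state). Results are written by Athena
    to output_location, an S3 path chosen by the caller.
    """
    started = client.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]
    while True:
        status = client.get_query_execution(QueryExecutionId=query_id)
        state = status["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            return query_id, state
        time.sleep(poll_seconds)  # still QUEUED or RUNNING


# Hypothetical usage (requires boto3 and AWS credentials):
#   import boto3
#   athena = boto3.client("athena")
#   ddl = build_create_table_ddl(
#       "logs_db", "events",
#       [("user_id", "string"), ("ts", "timestamp")],
#       "s3://example-bucket/events/",
#   )
#   run_query(athena, ddl, "logs_db", "s3://example-bucket/results/")
#   run_query(athena, "SELECT count(*) FROM events", "logs_db",
#             "s3://example-bucket/results/")
```

The table definition only records where the files live and how to read them, so no data is copied or loaded before the first query runs.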

Pay Per Query

Only pay for data scanned
With Amazon Athena, you pay only for the queries that you run. You are charged $5 per terabyte scanned by your queries. You can save from 30% to 90% on your per-query costs and get better performance by compressing, partitioning, and converting your data into columnar formats. Athena queries data directly in Amazon S3. There are no additional storage charges beyond S3.
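The pricing arithmetic above can be sketched in a few lines of Python. This is a rough estimator under stated assumptions: it uses the $5 per terabyte figure quoted here, treats a terabyte as 1024^4 bytes (an assumption), and ignores any rounding or per-query minimums that may apply to real bills. The function names are hypothetical.

```python
PRICE_PER_TB_USD = 5.0    # price per terabyte scanned, as quoted above
BYTES_PER_TB = 1024 ** 4  # assumption: binary terabyte


def query_cost_usd(bytes_scanned, price_per_tb=PRICE_PER_TB_USD):
    """Estimate the cost of a query from the number of bytes it scans."""
    return bytes_scanned / BYTES_PER_TB * price_per_tb


def cost_after_reduction(bytes_scanned, reduction):
    """Estimate cost when compression, partitioning, or a columnar format
    avoids the given fraction of scanned bytes (0.9 means 90% fewer bytes)."""
    if not 0.0 <= reduction <= 1.0:
        raise ValueError("reduction must be between 0 and 1")
    return query_cost_usd(bytes_scanned * (1.0 - reduction))
```

Under these assumptions, a query that scans 2 TB costs about $10, and the same query costs about $7 with a 30% reduction in scanned data or about $1 with a 90% reduction.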

Open, Powerful, Standard

Built on Presto, runs standard SQL
Amazon Athena uses Presto with ANSI SQL support and works with a variety of standard data formats, including CSV, JSON, ORC, Avro, and Parquet. Athena is ideal for quick, ad-hoc querying, but it can also handle complex analysis, including large joins, window functions, and arrays. Amazon Athena is highly available and executes queries using compute resources across multiple facilities and multiple devices in each facility. Amazon Athena uses Amazon S3 as its underlying data store, making your data highly available and durable.

Fast, Really Fast

Interactive performance even for large datasets
With Amazon Athena, you don't have to worry about having enough compute resources to get fast, interactive query performance. Amazon Athena automatically executes queries in parallel, so most results come back within seconds.

New features in preview now

Query Data Anywhere

Run federated queries against relational databases, data warehouses, object stores, and non-relational data stores. Federated SQL queries allow you to query the data in-place from wherever it resides. You can use familiar SQL to JOIN data across multiple data sources for quick analysis, and store results in Amazon S3 for subsequent use. Athena federated query also introduces a new Query Federation SDK that allows you to write your own data source connectors to query custom data stores.

Create your own User-Defined Functions (UDFs)

Write custom scalar functions and invoke them in your SQL queries. You can write your UDFs using the Athena Query Federation SDK. UDFs can be used in both SELECT and FILTER clauses of a SQL query. You can invoke multiple UDFs in the same query. While Athena provides built-in functions, UDFs enable you to perform custom processing such as compressing and decompressing data, redacting sensitive data, or applying customized decryption.

Machine Learning. In your SQL Queries.

Invoke machine learning models for inference directly from your SQL queries. Customers can use more than a dozen built-in machine learning algorithms provided by Amazon SageMaker, train their own models, or find and subscribe to model packages from the AWS Marketplace and deploy them on Amazon SageMaker Hosting Services. There is no additional setup required. The ability to use machine learning models in SQL queries makes complex tasks such as anomaly detection, customer cohort analysis, and sales predictions as simple as invoking a function in a SQL query.

Customers


Movable Ink uses Amazon Athena to query seven years’ worth of historical data and get results in moments, with the flexibility to explore data for deeper insights.

Read the case study >>

Atlassian built a self-service data lake using Amazon Athena and other AWS Analytics services.

Watch the video >>

OLX reduced costs and improved time to market by deploying Athena across their organization.

Watch the video >>

 

Get started with AWS

Step 1 - Sign up for an AWS account


Instantly get access to the AWS Free Tier.

Learn with 10-minute Tutorials

Explore and learn with our Getting Started documentation

Start building with AWS

Get started with Amazon Athena
Check out the product features

Learn more about the key features of Amazon Athena.

Learn more 
Sign up for a free account

Instantly get access to the AWS Free Tier. 

Sign up 
Start building on the console

Get started building with Amazon Athena on the AWS Management Console.

Sign in