Data Lakes and Analytics on AWS

Fastest way to get answers from all your data to all your users
Meet the needs of any workload
Amazon S3 provides durability and industry leading security, making it well suited for data lake storage. Amazon EC2 provides over 200 instance types to make choosing the right compute for your workloads simple.
Decrease time to answers
Deep integration between all the layers of the AWS analytics stack gives builders the tools to quickly analyze data using any approach.
Leverage a diverse portfolio
The breadth and depth of analytics services on AWS makes it easy for you to spin up the right resources to run whatever analysis is most appropriate for your specific need.
Power your machine learning
More machine learning happens on AWS than anywhere else, with over 10,000 customers using Amazon’s ML services. As you move to more advanced ML approaches, your data will be in the right place and format to take full advantage of the ML stack.

AWS Analytics services

Category
Use cases
AWS service
Analytics
Interactive analytics

Amazon Athena

Query data in S3 using SQL.

Big data processing

Amazon EMR

Hosted Hadoop framework.

Data warehousing

Amazon Redshift

Fast, simple, cost-effective data warehousing.

Real-time analytics

Amazon Kinesis

Analyze real-time video and data streams.

Operational analytics

Amazon Elasticsearch Service

Run and scale Elasticsearch clusters.

Dashboards and visualizations

Amazon QuickSight

Fast business analytics service.

Data movement
Real-time data movement

Amazon Kinesis Video Streams

Capture, process, and store video streams for analytics and machine learning.

Amazon Kinesis Data Firehose

Prepare and load real-time data streams into data stores and analytics tools.

Amazon Kinesis Data Streams

Collect streaming data, at scale, for real-time analytics.

Amazon Kinesis Data Analytics

Get actionable insights from streaming data in real-time.

Data lake
Object storage

Amazon S3

Object storage built to store and retrieve any amount of data from anywhere.

AWS Lake Formation

Build a secure data lake in days.

Backup and archive

AWS Lake Formation

Build a secure data lake in days.

Data catalog

AWS Glue

Prepare and load data.

AWS Lake Formation

Build a secure data lake in days.

Predictive analytics and machine learning
Frameworks and interfaces

AWS Deep Learning AMIs

Deep learning on Amazon EC2.

Platform services

Amazon SageMaker

Build, train, and deploy machine learning models at scale.

AWS Analytics services

Category Use cases AWS service
Analytics Interactive analytics Amazon Athena
Big data processing Amazon EMR
Data warehousing Amazon Redshift
Real-time analytics Amazon Kinesis
Operational analytics Amazon Elasticsearch Service
Dashboards and visualizations Amazon QuickSight
Data movement Real-time data movement Amazon Kinesis Data Firehose | Amazon Kinesis Video Streams | Amazon Kinesis Data Streams | Amazon Kinesis Data Analytics
Data lake Object storage Amazon S3 | AWS Lake Formation
Backup and archive AWS Lake Formation
Data catalog
AWS Glue | AWS Lake Formation
Predictive Analytics and Machine Learning Frameworks and interfaces AWS Deep Learning AMIs
Platform services Amazon SageMaker

Use cases

Page-Illo_Data-warehousing
Data warehousing

Run SQL and complex, analytic queries against structured and unstructured data, without the need for unnecessary data movement.

Try Amazon Redshift »
Page-Illo_Big-data-processing
Big data processing

Quickly and easily process vast amounts data for data engineering, data science development, and collaboration.
 

Try Amazon EMR »
Page-Illo_Real-time-analytics
Real time analytics

Collect, process and analyze streaming data as it arrives in your data lake, and respond in real-time.
 

Try Amazon Kinesis »
Page-Illo_Data-visualization
Operational analytics

Search, explore, filter, aggregate, and visualize your data in near real-time for application monitoring, log analytics, and clickstream analytics.

Try Amazon Elasticsearch Service »

Customers

JD-Power_Logo_@1x

"We built a 120TB data lake in Amazon S3, with 1500 different schemes and use AWS analytics services like Glue, Redshift, and Athena extensively. We couldn’t get these insights from a bunch of siloed databases and warehouses - we needed an S3 scale data lake."

- Bernardo Rodriguez
Chief Digital Officer, J.D. Power

View all customers »
netflix
Chick-fil-A_Logo
3M Company_Logo
280x100_Georgia-Pacific_Logo
Pinterest_Customer-Reference_Logo
TMobile_Logo_@1x
gt-customer_landing_page_graphics166x_epic
Adobe_Customer-Reference_Logo
Pfizer

Additional resources

AWS Data Lab

AWS Data Lab is a four-day intensive engagement between a team of customer builders and AWS technical resources to create tangible deliverables that accelerate data and analytics modernization initiatives.

Learn more »

Newsletter

Want to stay in the loop on educational content, upcoming events, and other innovations from AWS Analytics?

Subscribe to the AWS Analytics Newsletter »