AWS Cloud

Curated resources to help you learn how to build big data solutions on AWS

Use these comprehensive step-by-step guides to build a production-ready big data solution.

Select a simple tutorial or a guided lab to explore AWS.

Simple “Hello, World!” tutorials to help you get hands-on within your AWS account.


to Amazon S3 using the AWS CLI

Use the chart below to map out which AWS products can be used to build out your big data framework.

AWS Service What it does Documentation Links
Amazon SageMaker Amazon SageMaker is a fully managed machine learning service. With Amazon SageMaker, data scientists and developers can quickly and easily build and train machine learning models, and then directly deploy them into a production-ready hosted environment
Amazon S3 Provides object storage to make data accessible from any Internet location 
Amazon DynamoDB A managed NoSQL database that offers extremely fast performance, seamless scalability and reliability 
Amazon EMR A managed Hadoop service that allows you to run the latest versions of popular big data frameworks such as Apache Spark, Presto, Hbase, Hive, and more, on fully customizable clusters 
Amazon Route 53 A highly available and scalable cloud Domain Name System (DNS) web service 
Amazon Elasticsearch Service A popular open-source search and analytics engine for big data use cases such as log and click stream analysis
Amazon Kinesis Firehose A fully-managed service for delivering real-time streaming data to destinations such as Amazon S3, Amazon Redshift, or Amazon ES
Amazon Kinesis Streams A way to collect and process large streams of data records in real time from which you can create data-processing applications 
Amazon Kinesis Analytics A way to process streaming data in real time with standard SQL without having to learn new programming languages or processing frameworks 
Amazon Redshift A fast, fully managed, petabyte-scale data warehouse that makes it simple and cost-effective to analyze all your data using your existing business intelligence tools 
Amazon Machine Learning A managed service for building machine learning models and generating predictions 
Amazon EC2 Provides the virtual application servers, known as instances, to host websites or web applications 

Browse through our collection of videos, whitepapers, or SDKs to deepen your knowledge and experience with AWS.

Review the videos and webinars to learn more about building websites on AWS.

Getting Started with Big Data on AWS
Deep Dive: Big Data Analytics and Business Intelligence

Review the whitepapers on this topic for best practices and common use cases.

Title Summary Download
Big Data Analytics Options on AWS Provides an overview of the big data analytics options available in the AWS cloud by providing an overview of ideal usage patterns, cost models, performance, durability, availability, scalability, and anti patterns.
Download PDF
Lambda Architecture for Batch and Real - Time Processing on AWS Spark Streaming and Spark SQL
Learn which artifacts to use and how to configure infrastructure details, such as compute instances, bootstrap actions, storage, security, and networking.
Download PDF

Simplify using AWS services for your big data solution with an API tailored to your programming language or platform.