Extracting insights and actionable information from data requires a broad array of technology that can work with data efficiently, scalably, and cost-effectively. AWS offers a comprehensive set of services to handle every step of the analytics process chain including data warehousing, business intelligence, batch processing, stream processing, machine learning, and data workflow orchestration. These services are powerful, flexible, and yet simple to use, enabling organizations to put their raw data to work quickly and easily.
|Amazon Athena||Serverless Query Service||Easily analyze data in Amazon S3, using standard SQL. Pay only for the queries you run.|
||Provides a managed Hadoop framework to process vast amounts of data quickly and cost-effectively. Run open source frameworks such as Apache Spark, HBase, Presto, and Flink.|
|Amazon Elasticsearch Service||Elasticsearch
||Makes it easy to deploy, operate, and scale Elasticsearch on AWS.
|Amazon Kinesis||Streaming Data||Easiest way to work with streaming data on AWS.
||Very fast, easy-to-use, cloud-powered business analytics for 1/10th the cost of traditional BI solutions.
||Data Warehouse||Fast, fully managed, petabyte-scale data warehouse that makes it simple and cost-effective to analyze all of your data using your existing business intelligence tools.|
|AWS Glue||ETL||Prepare and load data to data stores.|
|AWS Data Pipeline
||Data Workflow Orchestration||Helps you reliably process and move data between different AWS compute and storage services, as well as on-premise data sources, at specified intervals.|
Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run. Athena is easy to use. Simply point to your data in Amazon S3, define the schema, and start querying using standard SQL. Most results are delivered within seconds. With Athena, there’s no need for complex ETL jobs to prepare your data for analysis. This makes it easy for anyone with SQL skills to quickly analyze large-scale datasets.
For more information visit the Amazon Athena product page.
Amazon EMR makes it easy to quickly and cost-effectively process vast amounts of data. Amazon EMR simplifies big data processing, providing a managed Hadoop framework that makes it easy, fast, and cost-effective for you to distribute and process vast amounts of your data across dynamically scalable Amazon EC2 instances. You can also run other popular distributed frameworks such as Apache Spark, Presto, and HBase in Amazon EMR, and interact with data in other AWS data stores such as Amazon S3 and Amazon DynamoDB.
For more information visit the Amazon EMR product page.
Amazon Elasticsearch Service is a managed service that makes it easy to deploy, operate, and scale Elasticsearch in the AWS Cloud. Elasticsearch is a popular open-source search and analytics engine for use cases such as log analytics, real-time application monitoring, and click stream analytics.
For more information visit the Amazon Elasticsearch Service product page.
Amazon Kinesis is a platform for streaming data on AWS, offering powerful services to make it easy to load and analyze streaming data, and also providing the ability for you to build custom streaming data applications for specialized needs. Web applications, mobile devices, wearables, industrial sensors, and many software applications and services can generate staggering amounts of streaming data – sometimes TBs per hour – that need to be collected, stored, and processed continuously. Amazon Kinesis services enable you to do that simply and at a low cost.
For more information visit the Amazon Kinesis product page.
Amazon QuickSight is a very fast, cloud-powered business analytics service that makes it easy for all employees to build visualizations, perform ad-hoc analysis, and quickly get business insights from their data. Amazon QuickSight uses a new, Super-fast, Parallel, In-memory Calculation Engine (“SPICE”) to perform advanced calculations and render visualizations rapidly. Amazon QuickSight integrates automatically with AWS data services, enables organizations to scale to hundreds of thousands of users, and delivers fast and responsive query performance to them via SPICE’s query engine. At one-tenth the cost of traditional solutions, Amazon QuickSight enables you to deliver rich BI functionality to everyone in your organization.
For more information visit the Amazon QuickSight product page.
Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse that makes it simple and cost-effective to analyze all your data using your existing business intelligence tools. Start small for $0.25 per hour with no commitments and scale to petabytes for $1,000 per terabyte per year, less than a tenth the cost of traditional solutions.
For more information visit the Amazon Redshift product page.
AWS Glue is a fully managed ETL service that makes it easy to understand your data sources, prepare the data for analytics, and load it reliably to data stores. AWS Glue simplifies and automates the difficult and time consuming data discovery, conversion, mapping, and job scheduling tasks.
For more information visit the AWS Glue product page.
AWS Data Pipeline helps you reliably process and move data between different AWS compute and storage services, as well as on-premise data sources, at specified intervals. With AWS Data Pipeline, you can regularly access your data where it’s stored, transform and process it at scale, and efficiently transfer the results to AWS services such as Amazon S3, Amazon RDS, Amazon DynamoDB, and Amazon EMR.
For more information visit the Amazon Data Pipeline product page.