Posted On: Sep 25, 2017

This Quick Start configures a production-ready Qubole Data Service (QDS) environment built on a data lake foundation in the Amazon Web Services (AWS) Cloud. You can use this Qubole environment to process and analyze your own datasets, and extend it for your specific use cases. The Quick Start can also deploy an optional environment with prepopulated data, notebooks, and queries for analyzing structured and semi-structured data, so that you can gain key business insights into product sales performance.

QDS is a cloud-native, autonomous data platform for analyzing and processing big data. The platform manages itself: using a combination of heuristics and machine learning, it continuously analyzes and learns from its own usage, and provides insights and recommendations to optimize reliability, performance, and costs. Qubole works in concert with AWS services such as Amazon Simple Storage Service (Amazon S3), Amazon Elastic Compute Cloud (Amazon EC2), and Amazon Redshift.

This Quick Start deploys QDS on a data lake foundation built with AWS services, to enable users to take advantage of additional AWS big data services such as Amazon Kinesis. (The underlying data lake foundation is also available as a separate Quick Start.) 

The deployment and configuration tasks are automated by AWS CloudFormation templates that you can customize during launch. You can also use the templates as a starting point for your own implementation, by downloading them from the GitHub repository. The Quick Start includes a guide with step-by-step deployment and configuration instructions. 
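As a rough illustration of what customizing the templates at launch can look like when scripted, the following minimal Python sketch assembles a request in the shape that the AWS CloudFormation CreateStack API accepts. The stack name, template URL, and parameter names (KeyPairName, AvailabilityZones) are hypothetical placeholders, not the Quick Start's actual values; take the real ones from the deployment guide and the templates in the GitHub repository. Whether the templates require the CAPABILITY_IAM acknowledgment is also an assumption.

```python
import json


def to_cfn_parameters(overrides):
    """Convert a plain dict into the list-of-dicts shape that the
    CloudFormation CreateStack API expects for its Parameters field."""
    return [
        {"ParameterKey": key, "ParameterValue": str(value)}
        for key, value in sorted(overrides.items())
    ]


def build_create_stack_request(stack_name, template_url, overrides):
    """Assemble keyword arguments for a CreateStack call.

    Nothing is sent to AWS here; the function only builds the request.
    """
    return {
        "StackName": stack_name,
        "TemplateURL": template_url,
        "Parameters": to_cfn_parameters(overrides),
        # Assumption: the template creates IAM resources and therefore
        # needs this acknowledgment. Remove it if that is not the case.
        "Capabilities": ["CAPABILITY_IAM"],
    }


if __name__ == "__main__":
    request = build_create_stack_request(
        "qubole-quickstart-demo",                      # hypothetical stack name
        "https://example.com/placeholder-template.yaml",  # placeholder URL
        {
            "KeyPairName": "my-key-pair",              # hypothetical parameter
            "AvailabilityZones": "us-west-2a,us-west-2b",  # hypothetical parameter
        },
    )
    print(json.dumps(request, indent=2))
```

If boto3 is installed and AWS credentials are configured, the resulting dictionary could be passed along as `boto3.client("cloudformation").create_stack(**request)`. Keeping request construction separate from the API call makes it possible to review the parameter overrides before anything is launched.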

To get started, use the following resources:

About Quick Starts
Quick Starts are automated reference deployments for key workloads on the AWS Cloud. Each Quick Start launches, configures, and runs the AWS compute, network, storage, and other services required to deploy a specific workload on AWS, using AWS best practices for security and availability. This is the latest in a set of AWS customer-ready solutions, which are ready-to-deploy reference architectures and best practices that address specific use cases or business processes.