Deploy on AWS into a new VPC

View guide — HTML | PDF

Quick Start architecture for Cloudera EDH on the AWS Cloud

Cloudera EDH cluster in a private subnet on AWS
(for an alternate architecture, see the deployment guide)

For details and step-by-step instructions, view the Quick Start deployment guide. For additional Quick Starts, view the complete catalog.

200x100_Cloudera_Logo

This Quick Start helps you build a multi-node Cloudera Enterprise Data Hub (EDH) cluster on the AWS Cloud by integrating Cloudera Director with AWS services such as Amazon EC2 and Amazon VPC.

EDH enables you to store your data with the flexibility to run a variety of enterprise workloads--including batch processing, interactive SQL, enterprise search, and advanced analytics--while utilizing robust security, governance, data protection, and management.

You can choose to deploy Cloudera EDH into a new VPC or your existing VPC. The Quick Start includes AWS CloudFormation templates that automate each option.

  • What you'll build

    • A virtual private cloud (VPC) configured with four subnets, two public and two private.*
    • A NAT gateway configured in the public subnet to allow outbound Internet access for the instances deployed in the private subnet. The gateway is configured with an Elastic IP address.*
    • A Linux server instance deployed in the public subnet for downloading Cloudera Director and various configuration files and scripts.
    • An AWS Identity and Access Management (IAM) instance role with fine-grained permissions for access to AWS services necessary for the deployment process.
    • Security groups for each instance or function to restrict access to only necessary protocols and ports.
    • A placement group to provide a logical grouping of instances and enable applications to participate in a low-latency, 10 Gbps network (optional).
    • A fully customizable EDH cluster including worker nodes, edge nodes, and management nodes that you define based on your compute and storage requirements.
    • Your choice to create a new VPC or deploy into your existing VPC on AWS. The template that deploys the Quick Start into an existing VPC skips the components marked by asterisks above.


    For details, see the Quick Start deployment guide.

  • Deployment details

    Build your Cloudera EDH environment on AWS in a few simple steps:

    1. Prepare your AWS account.
    2. Launch the Quick Start, using one of these options:
      - Launch in a new VPC, if you want to build a new AWS infrastructure (view template)
      -or-
      - Launch in an existing VPC, if you already have your VPC, subnets, and NAT gateway set up (view template)
      The deployment takes about 30 minutes.
    3. Configure the EDH cluster and EDH services. For example, you can choose private or public subnets, EC2 instance types, the number of nodes, and other parameters.
    4. Deploy the EDH cluster using the Cloudera Director web UI or client.


    After deployment, you can follow the instructions in the Quick Start deployment guide to access and manage the EDH cluster.

  • Cost and licenses

    You are responsible for the cost of the AWS services used while running this Quick Start reference deployment. There is no additional cost for using the Quick Start.

    The AWS CloudFormation template for this Quick Start includes configuration parameters that you can customize. Some of these settings, such as instance type, will affect the cost of deployment. See the pricing pages for each AWS service you will be using for cost estimates.

    The Quick Start deployment activates a 60-day trial of Cloudera Enterprise.