reference deployment

IBM Cloud Pak for Data on AWS

An end-to-end data and AI platform with data management, governance, and analytics capabilities

This Quick Start automatically deploys a multi-master, production instance of IBM Cloud Pak for Data on a Red Hat OpenShift Container Platform 3.11 cluster on the Amazon Web Services (AWS) Cloud. The cluster is created in a new or existing virtual private cloud (VPC) on Red Hat Enterprise Linux (RHEL) 7.7 instances, using the Red Hat OpenShift on AWS Quick Start. See the Red Hat OpenShift Container Platform on the AWS Cloud deployment guide for details about the underlying OpenShift deployment architecture. 

Cloud Pak for Data is an analytics platform that helps prepare data for artificial intelligence (AI). It enables data engineers, data stewards, data scientists, and business analysts to collaborate using an integrated multiple-cloud platform to meet data and analytics challenges. 

IBM Cloud Pak for Data can use AWS services and features, including VPCs, Availability Zones, security groups, Amazon Elastic File System (Amazon EFS) storage, and Elastic Load Balancing to build a reliable and scalable cloud platform.

This Quick Start is for enterprise users who want to connect, catalog, govern, transform, and analyze data, regardless of where the data is.

ibm logo

This Quick Start was developed by IBM, in collaboration with AWS. IBM is an
APN Partner.

  •  What you'll build
  •  How to deploy
  •  Cost and licenses
  •  What you'll build
  • The Quick Start sets up the following:

    • A highly available architecture that spans one or three Availability Zones.*
    • A VPC configured with public and private subnets according to AWS best practices, to provide you with your own virtual network on AWS.*
    • In the public subnets:
      • Managed network address translation (NAT) gateways to allow outbound internet access for resources in the private subnets.*
      • A Linux Ansible config server Amazon Elastic Compute Cloud (Amazon EC2) instance that also serves as a bastion host to allow inbound Secure Shell (SSH) access to EC2 instances in private subnets.
    • In the private subnets:
      • OpenShift Container Platform (OCP) nodes in an Auto Scaling group that, in combination, contain Cloud Pak for Data Collect, Organize, and Analyze services.
      • OCP master instances in an Auto Scaling group.
      • OCP etcd instances in an Auto Scaling group.
    • A Classic Load Balancer spanning the public subnets for accessing Cloud Pak for Data from a web browser.
    • A Classic Load Balancer spanning the public subnets for accessing the OCP master instances.
    • A Network Load Balancer spanning the private subnets for routing internal OpenShift application programming interface (API) traffic to the OCP master instances.
    • An Amazon EFS file store, which is deployed for use by applications that need shared storage. A Kubernetes dynamic storage provisioner is configured to use Amazon EFS storage as part of the IBM Cloud Pak for Data deployment.
    • An Amazon Simple Storage Service (Amazon S3) bucket to store deployment log files.
    • Optionally, Amazon Route 53 as your public Domain Name System (DNS) for resolving domain names of the IBM Cloud Pak for Data management console and applications deployed on the cluster.
  •  How to deploy
  • To deploy IBM Cloud Pak for Data on AWS, follow the instructions in the deployment guide. The deployment process includes these steps:

    1. Sign up for a Red Hat subscription, and obtain a license. See step 1 of the “Deployment Steps” in the Red Hat OpenShift Container Platform on the AWS Cloud deployment guide. To procure a 60-day evaluation license for OpenShift, follow the instructions at Evaluate Red Hat OpenShift Container Platform.

      When you deploy the Quick Start, you receive a trial license of Cloud Pak for Data for up to 60 days. After 60 days, you must either purchase or uninstall. For more information about trial licensing terms, see the IBM Cloud Pak for Data software trial license agreement.
    2. If you don't already have an AWS account, sign up at https://aws.amazon.com, and sign in to your account.
    3. Launch the Quick Start, choosing from the following options:
    4. (Optional) Edit the application security group.
    5. Configure Domain Name System (DNS).
    6. Test the deployment by using the IBM Cloud Pak for Data web client.
    7. (Optional) Provide Ansible config server SSH access.
  •  Cost and licenses
  • You are responsible for the cost of the AWS services used while running this Quick Start reference deployment. There is no additional cost for using the Quick Start.

    The AWS CloudFormation template for this Quick Start includes configuration parameters that you can customize. Some of these parameters, such as instance type, will affect the cost of deployment. For cost estimates, see the pricing pages for each AWS service you will be using. Prices are subject to change.

    Tip     After you deploy the Quick Start, we recommend that you enable the AWS Cost and Usage Report to track costs associated with the Quick Start. This report delivers billing metrics to an Amazon Simple Storage Service (Amazon S3) bucket in your account. It provides cost estimates based on usage throughout each month, and finalizes the data at the end of the month. For more information about the report, see the AWS documentation.

    This Quick Start requires a Red Hat subscription. For detailed instructions on how to sign up, see step 1 of “Deployment Steps” in the Red Hat OpenShift Container Platform on AWS Quick Start deployment guide. To procure a 60-day evaluation license for OpenShift, follow the instructions at Evaluate Red Hat OpenShift Container Platform.

    When you deploy the Quick Start, you receive a trial license of Cloud Pak for Data for up to 60 days. After 60 days, you must either purchase or uninstall. You can view the number of trial days remaining in the Cloud Pak for Data menu. For more information about trial licensing terms, see the IBM Cloud Pak for Data software trial license agreement.