reference deployment

IBM Cloud Pak for Data on AWS

An end-to-end data and AI platform with data management, governance, and analytics capabilities

This Quick Start automatically deploys a multi-master, production instance of IBM Cloud Pak for Data into a new virtual private cloud (VPC) on the Amazon Web Services (AWS) Cloud. A standard deployment takes about 3 hours, and a high availability (HA) deployment takes about 4 hours.  

IBM Cloud Pak for Data is an end-to-end data and artificial intelligence (AI) platform that enables organizations to collect, organize, and analyze enterprise data. This open information architecture (IA) for AI is an integrated collection of microservices built on cloud-native architecture, which you can deploy on premises or on the cloud.  

IBM Cloud Pak for Data can use AWS services and features, including VPCs, Availability Zones, security groups, Amazon Elastic File System (Amazon EFS) storage, and Elastic Load Balancing to build a reliable and scalable cloud platform.

This Quick Start is for enterprise users who want to connect, catalog, govern, transform, and analyze data regardless of where the data is.  

ibm logo

This Quick Start was developed by IBM, in collaboration with AWS. IBM is an
APN Partner.

  •  What you'll build
  •  How to deploy
  •  Cost and licenses
  •  What you'll build
  • The Quick Start sets up the following:

    • A highly available architecture that spans three Availability Zones.
    • A VPC configured with public and private subnets according to AWS best practices, to provide you with your own virtual network on AWS.
    • In the public subnets:
      • Managed NAT gateways to allow outbound internet access for resources in the private subnets.
      • Two Network Load Balancers to provide Elastic Load Balancing (ELB). There is one load balancer (master ELB) for the master nodes and another (proxy ELB) for the proxy nodes in the private subnets. The load balancers are placed in the public subnets in each of the chosen Availability Zones. Access to cluster administration is limited to the public ports via master node ELB. Access to applications is limited to HTTP/HTTPS via proxy node ELB.
    • A boot node in one public subnet for command-line cluster administration tasks.
    • In the private subnets, the following types of IBM Cloud Pak cluster nodes run in each of the three Availability Zones:
      • One master node
      • One management node
      • One Vulnerability Advisor (VA) node
      • One proxy node
      • One worker node  
    • An Amazon Elastic File System (Amazon EFS) file store, deployed for use by applications that need shared storage. A Kubernetes dynamic storage provisioner is configured to use the EFS storage as part of the IBM Cloud Pak for Data deployment.
    • Optionally, Amazon Route 53 as your public Domain Name System (DNS) for resolving domain names of the IBM Cloud Pak for Data management console and applications deployed on the cluster.
    • The IBM Cloud Pak for Data microservices are preconfigured on each worker node.
  •  How to deploy
  • To deploy IBM Cloud Pak for Data on AWS, follow the instructions in the deployment guide. The deployment process includes these steps:

    1. If you don't already have an AWS account, sign up at https://aws.amazon.com.
    2. Obtain a license. When you deploy the Quick Start, you receive a trial license of IBM Cloud Pak for Data for up to 90 days. After 90 days, you must either purchase or uninstall. Contact your IBM sales representative for pricing or to use your existing IBM Cloud Pak for Data entitlements. For more information about licensing terms, see the IBM Cloud Pak for Data software license agreement.
    3. Launch the Quick Start. A standard deployment takes about 3 hours, and a high availability (HA) deployment takes about 4 hours.
    4. (Optional) Edit the AWS security group.
    5. Configure the Domain Name System (DNS) or hosts file.
    6. Test the deployment by using the IBM Cloud Pak for Data web client.
    7. (Optional) Edit the application security group.
    8. Provide boot node SSH access.
  •  Cost and licenses
  • You are responsible for the cost of the AWS services used while running this Quick Start reference deployment. There is no additional cost for using the Quick Start.

    The AWS CloudFormation template for this Quick Start includes configuration parameters that you can customize. Some of these parameters, such as instance type, will affect the cost of deployment. For cost estimates, see the pricing pages for each AWS service you will be using. Prices are subject to change.

    Tip     After you deploy the Quick Start, we recommend that you enable the AWS Cost and Usage Report to track costs associated with the Quick Start. This report delivers billing metrics to an Amazon Simple Storage Service (Amazon S3) bucket in your account. It provides cost estimates based on usage throughout each month, and finalizes the data at the end of the month. For more information about the report, see the AWS documentation.

    When you deploy the Quick Start, you receive a trial license of IBM Cloud Pak for Data for up to 90 days. After 90 days, you must either purchase or uninstall. To get pricing information, or to use your existing IBM Cloud Pak for Data entitlements, contact your IBM sales representative.