reference deployment

Informatica Big Data Management on AWS

Big data integration and transformation on the AWS Cloud

This Quick Start deploys Informatica Big Data Management automatically into an AWS Cloud configuration of your choice.

Big Data Management enables you to integrate, govern, and secure big data assets in Apache Hadoop. The automated deployment includes Informatica Domain, Model Repository Service, and Data Integration Service.

The Quick Start includes AWS CloudFormation templates and a guide that provides step-by-step instructions to help you get the most out of your deployment.



This Quick Start was developed by Informatica in collaboration with AWS. Informatica is an
APN Partner.

  •  What you'll build
  •  How to deploy
  •  Cost and licenses
  •  What you'll build
  • Use this Quick Start to set up the following Big Data Management environment on AWS:

    • A virtual private cloud (VPC) configured with public and private subnets across two Availability Zones. This provides the network infrastructure for your Big Data Management deployment.*
    • An internet gateway to provide access to the internet.*
    • An Informatica server and Data Integration Service.Informatica domain and repository databases hosted on Amazon RDS using Microsoft SQL Server. The domain database manages the service-oriented architecture (SOA) namespace, and the repository database holds all the metadata about objects.
    • Amazon EMR cluster for the Hadoop Distributed File System (HDFS) and Hive.

    To access Informatica Services on the AWS Cloud, you can install the Informatica client on a Microsoft Windows machine.

    * The template that deploys the Quick Start into an existing VPC skips the tasks marked by asterisks and prompts you for your existing VPC configuration.

  •  How to deploy
  • Build your Big Data Management cluster in a few simple steps:

    1. If you don't already have an AWS account, sign up at Upload your Informatica Big Data Management license file to an Amazon S3 bucket.
    2. Launch the Quick Start. The deployment takes about two hours. You can choose from two options:
    3. Monitor the creation of the stack.
    4. To get started with Big Data Management on AWS, use the links in the Outputs tab to download and install Informatica Developer.

    To customize your deployment, you can choose different instance types for your resources and change the number of Amazon EMR nodes.

  •  Cost and licenses
  • You are responsible for the cost of the AWS services used while running this Quick Start reference deployment. There is no additional cost for using the Quick Start.

    The AWS CloudFormation template for this Quick Start includes configuration parameters that you can customize. Some of these settings, such as instance type, will affect the cost of deployment. See the pricing pages for each AWS service you will be using for cost estimates.

    This Quick Start requires a license for Informatica Big Data Management. To sign up for a demo license, please contact Informatica.