Sign in
Your Saved List Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

Pentaho Data Integration and Analytics [PayGo]

Pentaho Data Integration and Analytics [PayGo]

By: Hitachi Vantara LLC Latest Version:

Product Overview

"Pentaho Data Integration and Analytics (PDI&A) manages data at scale for rapid business innovation, ease of use, self-service automation and orchestration of your data workflows and analyze the data via Analytics reporting and dashboards and provide insights for IT and Business LOB users

Data Integration and Business Analytics in a single platform
Native connectivity and bulk-loading to most common data sources, including Amazon Redshift and Snowflake
Data services to virtualize transformations without staging, making data sets immediately available to reports and applications
Automatic metadata injection and publishing of metadata models to drive faster analytic results
Process streaming data in real time.
Native Containerization to support multi-cloud deployments

Code-free data transformation design that empowers 15x faster productivity versus hand-coding and executes in-cluster for high performance
Template-based approach to rapidly onboard data sources into Hadoop via metadata injection feature set
Ability to seamlessly switch between execution engines, such as Spark and the Pentaho native engine, to fit data volume and transformation complexity
Support for advanced analytics models from R, Python, Scala and Weka to operationalize predictive intelligence while reducing data prep time
Robust Dataflow Orchestration of pipeline
Support both structured and unstructured data

50% Faster Implementation Time - seamlessly integrate a multitude of diverse on-premises, cloud, and edge data sources with speed and efficiency using modern data integration.
3x Improve Pipeline Quality - No-code functionality versus hand-coding data pipelines using an easy drag and drop interface for on-premises and cloud Hadoop data lake integration.
50% Faster Data Delivery - Produce stellar reports with intuitive processes and seamless integration that enhances the user experience, reduce data management costs, and create business value

Onboard, prepare, filter data pipelines at scale combining data from both on-premises and cloud-based data sources
Cloud Data migration across hybrid cloud environments
Modernize and process large data sets in a data lake and data warehouse
Intelligent cloud automation for rapid value creation
Real time Analytics for business users"


Operating System

Linux/Unix, Ubuntu Ubuntu Server 20.04 LTS

Delivery Methods

  • Amazon Machine Image

Pricing Information

Usage Information

Support Information

Customer Reviews