Pentaho Data Integration and Analytics [BYOL]

By: Hitachi Vantara LLC
Latest Version: 9.5

Product Overview

The Pentaho Enterprise Edition platform manages enormous volumes of data with increasing variety and velocity, delivering data from any type of data source by automating your data pipeline across AWS, SaaS, or hybrid cloud environments.


  • Data Integration and Analytics in a single platform
  • Native connectivity and bulk-loading to most common data sources, including Amazon Redshift and Snowflake
  • Data services to virtualize transformations without staging, making data sets immediately available to reports and applications
  • Automatic metadata injection and publishing of metadata models to drive faster analytic results
  • Real-time processing of streaming data
  • Native Containerization to support multi-cloud deployments

  • Code-free data transformation design that delivers 15x higher productivity than hand-coding and executes in-cluster for high performance
  • Template-based approach to rapidly onboard data sources into Hadoop via the metadata injection feature set
  • Ability to seamlessly switch between execution engines, such as Spark and the Pentaho native engine, to fit data volume and transformation complexity
  • Support for advanced analytics models from R, Python, Scala and Weka to operationalize predictive intelligence while reducing data prep time
  • Robust dataflow orchestration of pipelines
  • Support for both structured and unstructured data

  • 50% Less Implementation Time - Onboard hundreds of varied on-premises, cloud, and edge data sources quickly and efficiently with modern data integration.
  • 3x Improved Pipeline Quality - Build data pipelines with no-code functionality instead of hand-coding, using an easy drag-and-drop interface for on-premises and cloud Hadoop data lake integration.
  • 50% Faster Data Delivery - Produce stellar reports with intuitive processes and seamless integration that enhance the user experience, reduce data management costs, and create business value.

  • Onboard, prepare, and filter data pipelines at scale, combining data from both on-premises and cloud-based data sources
  • Cloud data migration across hybrid cloud environments
  • Modernize and process large data sets in a data lake and data warehouse
  • Intelligent cloud automation for rapid value creation
  • Real-time analytics for business users



Operating System

Linux/Unix, Ubuntu Server 20.04 LTS

Delivery Methods

  • Amazon Machine Image
