Sign in
Categories
Your Saved List Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

Pentaho Data Integration

Hitachi Vantara LLC | 9.5.2.0.273

Linux/Unix, Ubuntu Ubuntu Server 20.04 LTS - 64-bit Amazon Machine Image (AMI)

Reviews from AWS Marketplace

0 AWS reviews
  • 5 star
    0
  • 4 star
    0
  • 3 star
    0
  • 2 star
    0
  • 1 star
    0

External reviews

15 reviews
from G2

External reviews are not included in the AWS star rating for the product.


    Karthick V.

Totally worth it!!

  • March 31, 2022
  • Review provided by G2

What do you like best about the product?
Best price in market, Hitachi sponsored and high quality in data integration.
What do you dislike about the product?
Limitation in features, connector is
having portability issue and less user friendly.
What problems is the product solving and how is that benefiting you?
We used PDI for data integration for designed reports. So far, had the best experience.


    Information Technology and Services

ETL for Dashboards

  • October 08, 2020
  • Review verified by G2

What do you like best about the product?
Pentaho Data Integration (aka Kettle) is a tool included in the Pentaho suite that we use in our Smart Cities projects to obtain data from various data sources. It has a large number of tools already built for Input, Ouput, Transform ... that allow developers to save a lot of time. Its use is easy even for inexperienced users.
What do you dislike about the product?
If we want to have support with the Pentaho suite we should not use its Community version (free), but in some Smart Cities specifications of our clients they require a free and open source tool with associated support.
What problems is the product solving and how is that benefiting you?
PDI allows us to obtain data from various data sources such as databases, excel files, csv, big data / hadoop type databases and use preconfigured tools so that obtaining this data is simple and parameterizable. Other languages such as python require the writing of complete modules, with PDI the implementation and debugging are integrated through Plug & Play tools.
Recommendations to others considering the product:
The Pentaho suite has a Community version that is free and free software, so our recommendation is to download it and test it to verify that this tool meets your requirements. For our part, we recommend it as we use it practically whenever we need to extract data from a data source quickly and easily.


    Information Technology and Services

ETL with graphical interface

  • June 10, 2020
  • Review provided by G2

What do you like best about the product?
Pentaho data integration is one of the most powerful tools for building ETL processes that we use within our Smart Cities projects. It is a tool with a graphical interface that allows you to debug quickly and easily and has a multitude of preconfigured modules. Furthermore, it combines very well with the Hitachi Pentaho CDE tool for the generation of Dashboards.
What do you dislike about the product?
When you want to do a very simple development maybe you can choose to use Python source code directly. There are other powerful alternatives like Talend Studio.
What problems is the product solving and how is that benefiting you?
Pentaho Data Integration allows us to collect data from different data sources such as both relational and non-relational databases such as Big Data (HDFS), it allows us to bring information from Excel files ... and almost from any source of information we need. Also, their debugging tools save us a lot of time.
Recommendations to others considering the product:
Pentaho has a suite called Community that is free and available to everyone. In addition, it has many examples and information. We recommend trying it out before deciding if we need to purchase the paid version. It is a great tool and we recommend it.


    Paco T.

PDI, best data cleaning tool

  • April 21, 2020
  • Review provided by G2

What do you like best about the product?
Pentaho comes in two editions, enterprise and community, I had experience with the community edition and here are all the advatages I see:

1. Its under apache2.0 license so while you read and work under the agreements, you can have this powerful tool for free
2. Has a very friendly user interface, so anybody, even without strong programming skill could make some transformations in just minutes
3. It has a wide variety of data inputs formats, allowing you to read from simple csv's or excels files to databases, json's and even s3 storage
4. It has a lot of tools for transformating your data without coding
5. If the functions that PDI has integrated aren't enough for you, you can add some scripting steps
What do you dislike about the product?
I see a strong oportunity on improving their documentation, sometimes its kinda hard finding examples for all the functionalities that PDI offers
What problems is the product solving and how is that benefiting you?
I mainly use pantaho for transforming data on the ETL cycle, so I do cleansing of different sources and storage it in a DWH


    zahit B.

Open Source ETL Tools

  • November 18, 2019
  • Review verified by G2

What do you like best about the product?
Pentaho Data Integration (PDI) is a free and open source tool for all users.
Pentaho Data Integration (PDI) is a very high performance product compared to the paid ETL tools. The product is quite simple to use. The components on the left side of the product have all the components that the user needs. (For example; excel connection, row value, etc.) In my experience, the Logging screen is not descriptive. Sometimes you cannot identify the source of the error. Other than that, I am very satisfied with the PDI tool
What do you dislike about the product?
Since there are no detailed explanations of the errors on the logging screen, sometimes we cannot find the cause of the error. Also in the user community microsoft, oracle is not as strong.
What problems is the product solving and how is that benefiting you?
We needed to import the data from the json file into the tables in the database. With the Pentaho Data Integration tool, we have transferred the json files to the database. We designed daily job with Windows Task Scheduler.


    Senando B.

Great Business Intelligence Tool

  • September 17, 2019
  • Review verified by G2

What do you like best about the product?
The most like about Pentaho report data integration is it can handle large, millions of data files with no hussle, You can extract data from different databases with such a small amount of time. From data extraction you can use the report to build more power analytic chart and business intelligence that my colleagues helps a lot specially the sales and production to overcome the problems we can face on the future.
What do you dislike about the product?
Its not a dislike but i observe that when i run pentaho verions 3 the bootup is fast while on version five on above it takes a 5 to 7 minutes. I dont know if this is involve with the specs of the computer or the pentaho version itself has lot of features to load.
What problems is the product solving and how is that benefiting you?
The problem that i solve using pentaho data integration is extract million of different data from our databases and turn it on the reports and analytic charts that help my sales and production team on analyze problem on our product sales.
Recommendations to others considering the product:
When you need a powerful business intelligence tool pentaho data integration is perfect for you, It's extraction capabilities is so great, no hustle and easy to use,


    Information Technology and Services

Pentaho review

  • April 12, 2019
  • Review provided by G2

What do you like best about the product?
Pentaho is a BI tool which can provide Data Integration, reporting , statistics dashboards data mining and Extract Transform Load, ETL tools.
What do you dislike about the product?
It turns out to be a bit slow as compared to its competitors. Other than that i do not dislike anything about the tool
What problems is the product solving and how is that benefiting you?
We have used it in our project to integrate and reconciliation of huge data and also to report the stats


    Shishir B.

Used pentaho for ETL job and to run dataflow in Clarabridge

  • April 12, 2019
  • Review provided by G2

What do you like best about the product?
It is easy to use with graphical options for various things such as transformation.
What do you dislike about the product?
There should be better documents for first-time users. When users use the tabs for the first time, it can ve confusing to them. Beside that better community support is required for the application so that enterprise can report bugs easily in the software.
What problems is the product solving and how is that benefiting you?
Collecting and analyzing customer experience.
Recommendations to others considering the product:
Better documents is needed.


    Telecommunications

ETL transformation done right

  • April 11, 2019
  • Review provided by G2

What do you like best about the product?
Ability to inject custom scripts as part of the transformation process
What do you dislike about the product?
UI/UX is not so friendly. Guess there is improvement on that front.
What problems is the product solving and how is that benefiting you?
I used it for data migration of millions of customer records for a big organization and I don't know if there was a better way to do it without Pentaho.


    Internet

Excellent ETL UI for the non-programmer

  • April 04, 2019
  • Review verified by G2

What do you like best about the product?
PDI (previously known as Kettle) is an excellent data cleansing and transformation tool for the non-programmer. It has an excellent UI for users to build data flows without knowing how to code!
What do you dislike about the product?
You are limited to the modules and steps that the tool offers. There are excellent modules that is already offered but I heard there are some that you will not find here.
What problems is the product solving and how is that benefiting you?
We use PDI for data cleansing and data prepping, especially when data is across multiple environments.
Recommendations to others considering the product:
Check out the community version.