
IBM StreamSets

IBM Software

Reviews from AWS customer

2 AWS reviews

External reviews

115 reviews

External reviews are not included in the AWS star rating for the product.


    Abhigyan S.

Overall good experience, I like the ease of using it.

  • May 09, 2025
  • Review provided by G2

What do you like best about the product?
I like IBM StreamSets' ease of use and its customer support team.
What do you dislike about the product?
Almost everything is good. The number of interactive features could be improved.
What problems is the product solving and how is that benefiting you?
Currently using it for Data Extraction.


    Research

Streamlining Data Pipelines with ease

  • April 30, 2025
  • Review provided by G2

What do you like best about the product?
I really like how user-friendly IBM StreamSets is, especially the drag-and-drop interface for designing data pipelines. It makes the process much easier without needing to write complex code. The platform supports both real-time and batch processing, and it has a wide range of connectors, which helped me integrate different data sources without much hassle. I also appreciated the built-in monitoring tools that helped me keep an eye on data flows and troubleshoot issues quickly.
What do you dislike about the product?
One downside I experienced was performance lag when handling large volumes of data; it wasn’t always as fast as I needed. The error logs were sometimes difficult to interpret, especially for more complex issues. Also, while basic tasks were easy to manage, getting into advanced configurations took more time than I expected, and the documentation didn’t always provide clear guidance. Support response times could also be slow when I needed urgent help.
What problems is the product solving and how is that benefiting you?
IBM StreamSets is solving the challenge of building and managing complex ETL workflows in a fast-changing data environment. It helps me extract data from various sources, transform it on the fly, and load it into target systems, all while handling schema changes and data drift automatically. This has been a huge benefit for me because I no longer have to manually adjust pipelines when source formats change. It also supports real-time stream analytics, so I can process and analyze data as it flows in, which improves decision-making speed and keeps my data infrastructure responsive and up to date.


    Riya K.

Good Product

  • April 30, 2025
  • Review provided by G2

What do you like best about the product?
I like the ease of use of this tool. Customer support is OK.
What do you dislike about the product?
The number of features could be improved.
What problems is the product solving and how is that benefiting you?
As an ETL tool, it is easy to use and implement.


    Verified User in Information Technology and Services

Powerful and Flexible ETL solution with IBM StreamSets

  • April 29, 2025
  • Review provided by G2

What do you like best about the product?
I like IBM StreamSets for its easy-to-use visual interface, real-time data handling, and strong integration with various cloud and on-premise systems.
What do you dislike about the product?
While IBM StreamSets is powerful, it can sometimes be complex to troubleshoot issues in large pipelines, and performance tuning may require additional effort for very high-volume data loads.
What problems is the product solving and how is that benefiting you?
IBM StreamSets solves the challenge of building, managing, and scaling complex data pipelines by providing real-time data integration and smart handling of data changes. It benefits me by simplifying pipeline development, reducing maintenance efforts, and enabling faster, more reliable data delivery across systems.


    Vasstav K.

Streaming data pipelines through a GUI is great

  • April 28, 2025
  • Review provided by G2

What do you like best about the product?
I like how easy it makes AI use cases, where you can run a continuous training process.
What do you dislike about the product?
I don't feel that there are any. I would have to use it more to know.
What problems is the product solving and how is that benefiting you?
Training AI models.


    Information Technology and Services

Efficient Data Pipeline Tool with Some Limitations

  • April 26, 2025
  • Review provided by G2

What do you like best about the product?
The best thing is how simple it is to use. You don’t need to write much code, and the drag-and-drop interface makes things fast. It connects to lots of sources, which is helpful. Also, the monitoring tools are good and help when things go wrong.
What do you dislike about the product?
It can get slow when dealing with large amounts of data or when you add many steps. The docs are sometimes confusing or incomplete. Support sometimes takes time to respond, and the price is a bit much for smaller teams.
What problems is the product solving and how is that benefiting you?
We needed a way to move and process data between different systems without building everything from scratch. StreamSets made it easier to connect data sources and automate the flows. It saves us a lot of time and helps catch issues early with built-in alerts, so we don’t have to monitor everything manually all the time. It also makes it easier to scale when data volume grows.


    SrinivasanSankar

Enables effective batch loading with visual interface and enterprise support

  • April 02, 2025
  • Review from a verified AWS customer

What is our primary use case?

We are using StreamSets for batch loading.

What is most valuable?

StreamSets is GUI-based and takes care of load balancing. It allows a hybrid installation approach, rather than being completely cloud-based or on-premises. Additionally, StreamSets provides good enterprise support with a quick turnaround.

What needs improvement?

One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infrastructure. I had to switch to a new EC2 box, even though the processor was not fully utilized. It would be beneficial if StreamSets addressed any potential memory leak issues to prevent unnecessary upgrades. Additionally, it would be a great enhancement if StreamSets could produce a lineage graph to visualize how the data has passed through the system.

For how long have I used the solution?

I started using StreamSets in 2022, so it's been almost four years now.

What do I think about the stability of the solution?

From one to ten, I would rate the stability of the product at eight point five.

What do I think about the scalability of the solution?

For scalability, I would also rate it at eight point five.

How are customer service and support?

IBM technical support sometimes transfers tickets between different teams due to shift changes, which can be frustrating. The transition can make resolution slow, as I have to explain the issue multiple times. Overall, I would rate the technical support as eight out of ten.

How would you rate customer service and support?

Positive

How was the initial setup?

The initial setup of StreamSets isn't simple, but it's not too complex either. It’s a standard setup and is fine.

Which other solutions did I evaluate?

StreamSets is the leader in the market. There are many products, and the choice depends on needed features and use cases, but I view StreamSets as the leader due to its capabilities.

What other advice do I have?

If asked, I definitely recommend StreamSets to other users. My overall rating for the solution is nine.

Which deployment model are you using for this solution?

Hybrid Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?


    Pharmaceuticals

Good integration tool

  • April 13, 2024
  • Review provided by G2

What do you like best about the product?
StreamSets is a good, lightweight integration tool that is easy to integrate. It's fast and reliable. It has a decent library of connectors, which are easy to use. I have been using StreamSets for a year now and recently switched to Databricks. Customer support turnaround is decent. Ease of implementation is not that good, as the learning curve is steep without a good resource to study from.
What do you dislike about the product?
Lack of documentation and community support.
What problems is the product solving and how is that benefiting you?
StreamSets has a good number of pre-built connectors, which accelerates data ingestion.


    Dhaanish S.

StreamSets: Review

  • April 11, 2024
  • Review provided by G2

What do you like best about the product?
I love using StreamSets because it helps move data and apply the necessary transformations using processors.
What do you dislike about the product?
As of now, I haven't faced any issues with it.
What problems is the product solving and how is that benefiting you?
StreamSets helps me transform data and move it from one place to another with ease.


    Ved Prakash Yadav

Useful for data transformation and helps with column encryption

  • April 10, 2024
  • Review provided by PeerSpot

What is our primary use case?

StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data is loaded into Amazon Redshift or other data warehousing solutions.

What is most valuable?

The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customize it to do what you need. Many other tools have started to use features similar to those introduced by StreamSets, like automated workflows that are easy to set up.

What needs improvement?

We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which was painful. Also, pipeline failures were common, and data drifting wasn't addressed, which made things worse. Licensing was another issue we encountered.

For how long have I used the solution?

I have been working with the product for five years. 

What do I think about the scalability of the solution?

The tool's flexibility and performance are good. It allows for task dependency management so others won't be affected if one task fails. It can handle large volumes of data and supports features like change data capture for tracking changes.

Around six months ago, many people in my company were using StreamSets. In the US team, about 42 people across different projects were using it. Similarly, in 2021, there were around 43 users. About 16-18 people in Mumbai used it in my previous company.

How are customer service and support?

The tool's support is good. 

How was the initial setup?

Installing StreamSets can take time because it has two engines: Data Collector and Transformer. Data Collector is easier to install, but Transformer is more complicated and requires more steps, like setting up tasks and configurations.

It is best to ensure the environment is ready, including that it works well with other servers. The process can be both easy and difficult, but if you follow the documentation, it should be manageable.

What was our ROI?

Whether the tool is worth the money depends on the situation. If you don't want to spend a lot on competing products like Databricks or Glue, then StreamSets might be a better option. It's particularly valuable if you prefer not to invest heavily in training your team on new technologies. If your ETL developers or data engineers are comfortable with StreamSets, it can be worth the money.

What's my experience with pricing, setup cost, and licensing?

The licensing is expensive, and there are other costs involved too. I know from using the software that you have to buy new features whenever there are new updates, which I don't really like. But initially, it was very good.

What other advice do I have?

We use various tools and alerting systems to notify us of pipeline errors or failures. StreamSets supports data governance and compliance by allowing us to encrypt incoming data based on specified rules. We can easily encrypt columns by providing the column name and hash key. 

If you're considering using StreamSets for the first time, I would advise first understanding why you want to use it and how it will benefit you. If you're dealing with change tracking or handling large amounts of data, it could be cost-effective compared to services like Amazon. It's easy to schedule and manage tasks with the tool, and you can enhance your skills as an ETL developer. You can easily migrate traditional pipelines built on platforms like Informatica or Talend to StreamSets. I rate the overall solution an eight out of ten.