Sign in
Categories
Your Saved List Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

StreamSets Platform

StreamSets | 1

Reviews from AWS Marketplace

0 AWS reviews
  • 5 star
    0
  • 4 star
    0
  • 3 star
    0
  • 2 star
    0
  • 1 star
    0

External reviews

98 reviews
from G2

External reviews are not included in the AWS star rating for the product.


    Zorik Z.

Very powerful tool with a lots of endpoints and high performance

  • October 21, 2020
  • Review provided by G2

What do you like best about the product?
Easy integration with different endpoints
What do you dislike about the product?
It will be better to have more tutorials and documentation.
What problems is the product solving and how is that benefiting you?
I was working on StreamSets integration with Greenplum using GRPC
Recommendations to others considering the product:
powerful tool with many endpoints and great support


    Sai Paramahamsa P.

Streamsets is an amazing tool for data movement across environments seemlessly

  • October 21, 2020
  • Review provided by G2

What do you like best about the product?
Developer friendly user interface and resonable speed of data transfer across various databases
What do you dislike about the product?
Debugging is a pain in streamsets because of the exhaustive java logs
What problems is the product solving and how is that benefiting you?
We use streamsets for both batch and near real time data ingestion and manipulation into our enterprise data warehouse
Recommendations to others considering the product:
The support team of Streamsets is very committed and will always help tweak the software for any specific but highly used components that are missing in the current version. So take advantage and get the best version of this tool for your enterprise needs.


    Marco M.

Friendly and powerful ETL framework, still evolving

  • October 21, 2020
  • Review verified by G2

What do you like best about the product?
Intuitive, very useful plugins, easy to deploy/maintain. It's really awesome how can you build pipelines for microservices, streaming and batch purposes in a single environment.
Very straightforward to be installed and ready to use, even for production (at least for the essential parts, then obviously should be reviewed with a security team)
What do you dislike about the product?
The bugs you may find are solved with workarounds, you have to wait a bit for a stable solution. Some missing plugins (but they will be added soon, if not already).
Many updates during the year, if you don't have a proper set up to move from test/pre-production to production, you may have some issue to face every time
What problems is the product solving and how is that benefiting you?
Managing the entire preprocessing during ingestion. It's very handy, easy to add new pipelines for new data sources or maintain the already present ones
Recommendations to others considering the product:
Create always a box that can be easily updated with the latest release: a lot of issues might be solved in every minor. Moreover, it can be easily updated using the logic of a git repo
Try to use always the Streamsets logic as much as possible and avoid to have big groovy/Jython block, you will benefit from it
Prepare a CI if possible to keep Stremasets always up-to-date
Share any debug or solution you may have found in the community, a lot of people may look for it


    Rizwan S.

I'll recommend it to my friends because there no single line of code

  • October 12, 2020
  • Review provided by G2

What do you like best about the product?
Easy to use and understand, just drag and drop. We can graphically monitor the flow.
What do you dislike about the product?
Please increase the origins and destinations
What problems is the product solving and how is that benefiting you?
Integrating cloud data with HDFS


    Denis Y.

Easy to setup fast data flow with a lot of features and flexibility.

  • October 07, 2020
  • Review verified by G2

What do you like best about the product?
Convenient access to Hadoop FS, access to web API and parsing a JSON response. You can easily combine a lot of different technologies in one flow (Hadoop, python, Java, web API).
I have started to use it while learning Big Data for learning purposes but now we use it in our company on an everyday basis.
What do you dislike about the product?
Everything beyond expectations. Only one limitation - how to convince people to use it more and it is not easy to find enough professionals on the market.
What problems is the product solving and how is that benefiting you?
Recalculating monthly accounting reports in different currencies


    Information Technology and Services

Solution Architect

  • October 01, 2020
  • Review provided by G2

What do you like best about the product?
As a Solution Architect, I found StreamSets tool very useful in understanding dataflows in our company
What do you dislike about the product?
To be honest I'd like more training to be offered by the company
What problems is the product solving and how is that benefiting you?
StreamSets solution was very helpful in getting an enormous amount of data from IoT devices into the central database.


    Pharmaceuticals

Review

  • September 30, 2020
  • Review provided by G2

What do you like best about the product?
The ease of development and it doesn’t have any windows OS level footprint.
What do you dislike about the product?
Metadata injection building transformation via template lot of python code a developer have to write down. Most of the ETL developers core and experienced one fear from python they know very well shell scripts due to Informatica and then most of the developers looks out. Try to build something from ui which inturn should generate a standard python code
What problems is the product solving and how is that benefiting you?
Lot of AI / ML . Like building transformations for the main build called for knowledge graph


    Chandrashekhar J.

Good Streaming application with flexibility of use and set up

  • September 30, 2020
  • Review provided by G2

What do you like best about the product?
Easy setup , flexible product . Can work with variety of databases.
What do you dislike about the product?
Connection issues , at times data is captured during failure
What problems is the product solving and how is that benefiting you?
Using the toll as ETL job on regular basis
Recommendations to others considering the product:
Streamsets is highly customizable streaming application . However before selecting the application user needs to analyze data transfer requirement and mode of transfer (Batch or streaming). The data conversion nodes are good , but needs to be configured in proper way. Heavy processing or file conversion may slow down the process. Need to have enough memory to support requirement


    Ayushman D.

Great product with a potential to become the best for Big Data

  • September 29, 2020
  • Review verified by G2

What do you like best about the product?
The sheer number of connections (origins, destinations, processors, etc) and additional things like webhook/notifications are among the best features offered by streamsets data collector.
What do you dislike about the product?
Too many frequent errors in some cases (like S3 destination) which are not internally retried seamlessly and ends up failing the pipeline. Due to this, a lot of manual intervention is required on a day to day basis
What problems is the product solving and how is that benefiting you?
Big data ETL is our primary problem to solve
Benefits - reasonable fast transfer, vast number of sources destination, prompt support etc


    Insurance

Review

  • November 07, 2019
  • Review verified by G2

What do you like best about the product?
Good user interface, lots of capabilities, good documentation/help
What do you dislike about the product?
This is the only data collection tool I have used, so I don't have anything to compare this with. No complaints
What problems is the product solving and how is that benefiting you?
Pulling data from multiple different types of data sources that we were previously unable to connect to