Sign in
Categories
Your Saved List Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

StreamSets Data Collector

StreamSets | 3.22.3

Linux/Unix, Amazon Linux Amazon Linux 2 - 64-bit Amazon Machine Image (AMI)

Reviews from AWS Marketplace

5 AWS reviews

External reviews

98 reviews
from G2

External reviews are not included in the AWS star rating for the product.


    Investment Banking

Best Work flow for Tracking and processing application with Automation skill

  • March 21, 2021
  • Review provided by G2

What do you like best about the product?
Integrating different components and create pipelines and Preview the pipeline to see whether the pipeline works or not.
What do you dislike about the product?
No Specific dislike as off now, Everything was best
What problems is the product solving and how is that benefiting you?
All real-time streaming helps me to track with data flow.


    Telecommunications

Data Migration cross RDBMS and NO-SQL become very easy.

  • March 20, 2021
  • Review provided by G2

What do you like best about the product?
I found it very flexible and GUI-based configuration makes it very user-friendly.
What do you dislike about the product?
So good so far, didn't find anything wrong about streamsets as of now.
What problems is the product solving and how is that benefiting you?
Data Migration from RDBMS to RDBMS and RDBMS to NO-SQL.
By using StreamSets I am able to migrate data without any downtime and without any help from DBA. in the traditional way we were doing import and export for RDBMS to RDBMS which is not now needed. from RDBMS to NO-SQL I was using custom scripts to export data in CSV from Oracle and import it in Cassandra but now I have created a pipeline and all work is sorted now.


    Information Technology and Services

StreamSets

  • March 20, 2021
  • Review provided by G2

What do you like best about the product?
Its friendly environment and user interface
What do you dislike about the product?
StreamSets should add more features and should reduce some latency .
What problems is the product solving and how is that benefiting you?
We are collecting data by using streamsets
Recommendations to others considering the product:
You can use it


    Banking

Streamsets review

  • March 19, 2021
  • Review provided by G2

What do you like best about the product?
Debugging,ease of use.Streamsts was a useful tool for ETL processes.The difference from other tools it has is that it has lot of transformations.
What do you dislike about the product?
Lots of transformations,real time processing.
What problems is the product solving and how is that benefiting you?
Banking problems.Benifits are debugging standards can check at each stage the data passed.


    Sai Charan K.

New attractive tool, but few under the hood improvements needs to be done.

  • March 19, 2021
  • Review provided by G2

What do you like best about the product?
GUI is the best and much simpler. It is self explanatory for any range of experience guy to understand.
You need not write complex programming for any kind of implementation. It is as simple as dragging and configuring something you want to implement.
Literally you can connect to any kind of system as a source and any kind of system as a destination.
Scheduling was much more easier when it comes to streamsets, unlike other systems and tools we had, a wide variety of scheduling options here.
Wish there was an option to increase the rate of ingestion.
Having streamsets transfomers is an additional advantage while we are developing the applications.
It is very easy to save and export the jobs or the pipelines. Not just this, it also very easy to share the pipelines/jobs.
Last but not the least, we have topologies where you can view the status of all the pipelines which you have developed and monitor. This can used like a collective system where all the status of the project's jobs can be viewed.
What do you dislike about the product?
Debugging an issue will take a lot of time. Logs were not that clear while we were debugging.
You can only select one single source for a pipeline. There are few applications where you need to apply the same logic for multiple sources. For this use case you need to create multiple pipelines and add coordination between them.
What problems is the product solving and how is that benefiting you?
Problems is to trace out the issue while debugging and the benefits is its simplicity to use.


    Information Technology and Services

User friendly interface

  • March 18, 2021
  • Review provided by G2

What do you like best about the product?
Very easy to use and understand at very first time itself
What do you dislike about the product?
Nothing much but very few minimal things like code suggestions when using scripting languages like groovy,jython and javascript
What problems is the product solving and how is that benefiting you?
Mostly I worked on data movement from different sources to different destinations involving many transformations. Worked on both batch processing and live streaming modes. Worked on triggering events for notifications based on certain conditions.
Recommendations to others considering the product:
I suggest it as one of the best ETL/ELT tool for data ingestion


    Srigiri K.

Experience in using Streamsets in Data Ingestion PipeLine

  • October 31, 2020
  • Review verified by G2

What do you like best about the product?
Ease of usage including easy to install and configure. Nice GUI interface which is web based for development and admin work. Connectors availability for different systems.
What do you dislike about the product?
Missing auto performance and scalability option. No drifting support. Incase of any issues related to performance and scalability, it is next to impossible to understand what caused the issues and what will be the fix. Also, not useful for ETL operations which makes us to depend upon other tools in E2E integration needs.
What problems is the product solving and how is that benefiting you?
We have used Streamset in our Data Ingestion pipeline for extracting the data sets from various heterogenous source systems like SalesForce, Oracle databases etc. Is very good in extracting the data sets for CRM systems like Salesforce etc but was not able to use it as end to end integration tool as it lacks certain functionalities.
Recommendations to others considering the product:
There few things which StreamSets still lacks and need those to have one stop solution for Data Ingestion.


    Investment Banking

Lead Data Engineer

  • October 29, 2020
  • Review verified by G2

What do you like best about the product?
The development speed for a Spark Application.
What do you dislike about the product?
The control hub must be available as part of trail version, with minimal feature
What problems is the product solving and how is that benefiting you?
Convert Spark coding into drag and dropable UI
Recommendations to others considering the product:
If you want to exploit the full power of Apache Spark and maintain it easily then Streamsets in the best way to do it.


    Banking

Easy to use and very nice interface

  • October 27, 2020
  • Review provided by G2

What do you like best about the product?
The tool had a lot of options to integrate with different protocols, language and origin. We used this tool to integrate it with Kafka/Aws, send emails and develop different types of data feed. The user interface was quite nice and easy to use. Be it a simple task or a complex task, we were always able to find a processor or executor to achieve our goal.
What do you dislike about the product?
Since the tool was new, there was a limited support on the internet. Ask streamsets page is helpful but I expected a developed ecosystem. Sometimes we faced issue with using known libraries like moment.js. It's a pain to maintain these libraries in your server. We had to use different language to implement certain module because Javascript library for that task was not supported. So our pipelines looked like a bunch of lot of processors each having a different language/framework.
What problems is the product solving and how is that benefiting you?
We were trying to develop data feed for different downstreams originated from wide variety of sources. I really liked how Streamsets control hub had the option to schedule your pipelines. The streamsets control hub had internal version control which was an additional benefit.


    Harry Kim B.

It was powerful but lots of jobs failure

  • October 27, 2020
  • Review provided by G2

What do you like best about the product?
This tool can connect from the ftp or mft server to our MSSQ
What do you dislike about the product?
The jobs designed to our project are usually failing which led our team a lot of monitoring works and manual processing of data.
What problems is the product solving and how is that benefiting you?
It's about a scheduled extracting and storing of data from one server to another. This is very beneficial to our live dashboards which need a real tome update for our clients.
Recommendations to others considering the product:
Maybe, if we can add more real-time support that can cater all time-zones and making the tool more user-friendly.