Sign in
Categories
Your Saved List Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

StreamSets Data Collector

StreamSets | 3.22.3

Linux/Unix, Amazon Linux Amazon Linux 2 - 64-bit Amazon Machine Image (AMI)

Reviews from AWS Marketplace

5 AWS reviews

External reviews

98 reviews
from G2

External reviews are not included in the AWS star rating for the product.


    Chetan R.

New Game Changer

  • March 09, 2024
  • Review provided by G2

What do you like best about the product?
Its interface and easy essentials components
What do you dislike about the product?
Why it didn't come early in market place for use
What problems is the product solving and how is that benefiting you?
The easy load of huge data with optimazaation and efficiency


    Financial Services

Utilized the technology to design and deploy efficient data pipelines for ingestion of the data.

  • March 08, 2024
  • Review provided by G2

What do you like best about the product?
The tool has userfriendly interface, which has simplified the process of designing data pipelines. The number of connectors has made it easy for me to integrate various data sources. Also an additional thing is that it can handle both stream and batch data. In the organisation, the client has migrated to StreamSets and it is used almost 80% of the time. The team is quite accessible in case of any defects raised and is very co-orperative.
What do you dislike about the product?
Down sides of using SteamSets might be the cost, for some scenarios the solution might be costlier than the competition, For Large scale data there have been reports of performance issues.
What problems is the product solving and how is that benefiting you?
For example, the client gets continous stream data from the source, media and other platforms, The extraction part of the streaming data in near real time has very much helped the client to make use of the data refreshes being more frequent and getting quicker insights with high efficiency.


    Neeraj G.

Best UI for Pipelines

  • February 22, 2024
  • Review verified by G2

What do you like best about the product?
I love the StreamSets UI and its interface. The components in StreamSets are very useful and very easy to use. You can esily implement a pipeline using the desired origin from the lits of various origins. You can use it on daily basis for your pipeline review. The customer support from the StreamSets side is very appreciated.
What do you dislike about the product?
There is nothing to say bad about it. Just sometimes the preview field lacks in previeing the high intensity data.
What problems is the product solving and how is that benefiting you?
Streamsets is very useful for the big pipelines and the ease of use of the origins, processors and the executors are well managed. It helps me to reduce my time to building complex pipelines very easily.


    Rohit S.

My feedback on StreamSets

  • November 26, 2023
  • Review verified by G2

What do you like best about the product?
The GI makes it easy to design and implement data pipelines. It has an active community to which I could rely for support. Well suited for real-time data streaming scenarios making it suitable for frequent use.Offers a diverse set of features such as data drift handling, and monitoring tools.
What do you dislike about the product?
It can be resource-extensive, therefore careful consideration of infrastructure is required for handling high volumes of data. Issues with highly customized or specific integration requirements.
What problems is the product solving and how is that benefiting you?
It helps in data drift handling and real-time data streaming.


    NIKHIL G.

StreamSets for building data pipelines

  • November 25, 2023
  • Review provided by G2

What do you like best about the product?
StremSets User interface is very useful friendly. It provides drag and drop facility which simplifies building, designing and managing the data pipelines.
It also handles a very large amount of data since it is scalable.
We can use it for building both batch pipelines and streaming pipelines.
It also provides security features such as encryption, access controls, data masking etc.
The team also provides a very good customer support.
Since it has drag and drop functionality we can easily implement and integrate it with different sources and destination.
What do you dislike about the product?
As of now I don't have any dislikes about StreamSets since it is improving a lot.
What problems is the product solving and how is that benefiting you?
Buliding the data integrations involves complex coding but with StreamSets we can drap and drop the data movements. It provides a very good intuitive visual interface. It also provides many kind of error handling mechanisms which is must during the data integrations. It also provides the monitoring capabilities for the data pipelines. Helps a lot in faster decision making capacity.


    Information Technology and Services

An easy and guided approach for creating data pipelines

  • November 25, 2023
  • Review provided by G2

What do you like best about the product?
Overall quite satisfactory.
It has multiple options to connect with hadoop,teradata quite easily and help us in cloud migration journey. Data migration, job scheduling and handling live data streaming are some key points that makes it stand par with other options. Easy to implement as this is used in many teams as a tool or medium for migrating our on premises data to cloud , say azure or databricks, snowflake.
What do you dislike about the product?
Overall very nicely built but few caveats are debugging of errors during job runs make it hard sometimes.
What problems is the product solving and how is that benefiting you?
Helped teams to deliver promptly on their migration projects and take some load off


    Pranesh G.

Streamsets tool as end-to-end data integration platform

  • November 24, 2023
  • Review provided by G2

What do you like best about the product?
This is used by our organization for below uses
>> it builds batches and streaming pipelines in Hours.
>> It protects sensitive data.
>> Ease of Use
What do you dislike about the product?
Below are the failures which I faced in our organziation
>> less numbers of connectors are available in cloud version.
>> Debugging becomes difficult when we worked on a large dataset and sometimes fails without generating any error.
What problems is the product solving and how is that benefiting you?
Below are the benifits of using StreamSets
>> This Maps and Monitor the performances at Runtime.
>> Also This protects sensitive data when it comes.
>> This provides simple methods for scheduling tasks.


    Pranshu G.

StreamSets: Adaptable data Intergration tool

  • November 24, 2023
  • Review provided by G2

What do you like best about the product?
The fact that it allows you to orchestrate your data pipelines with minimal coding and with not-so-deep understading of DE/ETL concepts is super helpful. Also, Streamsets data collector is an open source tool to be used for data ingestion and integration usecases.
What do you dislike about the product?
Few things I could say where they came short.
1. Their pricing structure is rather ambigous and could be hard to decode while choosing the right version for your usecase
2.For some data sources, streamsets may not be have connectors as compared to other tools in the market.
3. With the GUI based tool, and being on pricey end, there is always long term maintenance ovehead which makes more problems that it solves.
4. They offer customer support via third party zendesk and their documentation resources aren't comprehensive enough in my experience
What problems is the product solving and how is that benefiting you?
The business was looking for a ETL tool that can cater wide variety of data ingestin sources and destinations along with the moderate transformations capability on the fly.
We implemented streamsets to test out one of our pilot program focused on integrating diverse data sources (electricity users ) from vendors across world with the goal to find the behavioural patterns grouped by region/country/culture etc..
The pilot problem was a success. In future, team may want to scale it up to the point where the insights could be leveraged for growth of business


    Financial Services

Streamline ETL Workflows

  • November 23, 2023
  • Review provided by G2

What do you like best about the product?
Streamsets has been a game changer for me in the data integration workflow. Its intuitive interface has made wrangling and managing ETL workflows and handling transformations a breeze. It's easy to implement and integrate and has a great customer service when required.
What do you dislike about the product?
Although it's a great tool, there's still some learning curve required and has some gaps in documentation around some features
What problems is the product solving and how is that benefiting you?
StreamSets removes the challenges associated with integrating data from multiple sources which ensures fault tolerant pipelines across the system. It has helped in dealing with data transformation challenges and helped us improve the data efficiency and consistency across the project


    Ahmad Raza K.

StreamSets tool for data integration and pipeline

  • November 23, 2023
  • Review verified by G2

What do you like best about the product?
I've had a very positive overall experience with streamsets. Streamsets are a tool that our organization uses to migrate its on-premises data to the cloud, it is a potent tool that can assist us in our cloud migration journey by connecting to numerous tools, such as Hadoop and Teradata, with ease.
What do you dislike about the product?
So far i have observed that debugging becomes difficult when utilizing a large dataset because the pipeline fails without generating a specific error. It cannot establish parallel connections with Teradata to speed up pipeline execution.
What problems is the product solving and how is that benefiting you?
Its ability to establish connections with different data sources.Simple method for scheduling tasks, so it doesn't need additional tools like AutoSys. If you wish to establish dependencies between indivisual streamset pipelines, you can use Data Collector pipelines.