StreamSets Data Collector
StreamSets | 3.22.3Linux/Unix, Amazon Linux Amazon Linux 2 - 64-bit Amazon Machine Image (AMI)
External reviews
External reviews are not included in the AWS star rating for the product.
Powerful and user-friendly data integration platform
Furthermore, StreamSets has helped us improve data quality and governance. Its monitoring and validation features allow us to track data quality metrics, identify anomalies, and ensure compliance with data privacy regulations and industry standards. By ensuring high-quality data, we can make more accurate and reliable business decisions.
Moreover, as our data volumes grow, StreamSets scales effortlessly, handling large-scale data processing without compromising on performance. This scalability has allowed us to handle increasing data demands and grow our business without worrying about data integration bottlenecks.
- Leave a Comment |
- Mark review as helpful
Best Data Pipeline Building Platform
Streamsets is a great product for dataops.
Very Powerful and Easy Data Engineering platform. Capable to handle multiple platform and huge data.
They have a very easy and user-friendly user interface. It takes only a few days for new developers to start and deploy their first pipelines.
StreamSets provides easy and powerful stages(kind of connectors) to integrate StreamSets with different platforms such as Kafka, SalesForce, Oracle DB, Rest API, HTTPS connection, Data lakes and many more.
StreamSets uses regex expression for data transformation related operation which is really easy.
Monitoring StreamSets pipelines are very easy, you can register your Data collector to control hub using provisioning agents. After registering you can deploy pipelines to SCH and create jobs. All of this can be done using their Python SDK which can easily be integrated with ADO release pipelines.
After creating/deploying pipelines users can use SCH subscription to create alerts if pipelines/jobs changes their status.
For individual alerts pipeline have built-in capability to do so.
After their version 4.0.1 , sdc are merged with their data ops platform. This allows individual developers to have the feel of a Control Hub. It also remove platform dependancy.
They have very excellent security. Pipeline can be integrated with Azure Keyvaults which eliminates the needs of sharing credentials with Developers. Same goes for parametrs and runtime parameter. Developers can easily replace any value in pipeline with ADO library variables.
If you are an Organization they provide very extensive support, work instantly on any bug if found by an organization. They also have customer success team which will do anything to make sure your organisation's experience with StreamSets is seamless.
StreamSets allowed us to share real time data between platfoms which also removed dependancy from heavier ETL tools like SSIS, Abinitio.
Since it is easier which allows our talent developement team enable our developers to use StreamSets.
Excellent and Useful Engine for Everything data
I have been using streamsets for a while now and I can say this is a very powerful design and execution engine. Makes it easy of me to create pipelines, seamless transition from s3 specifically to my Kafka and all. This is very good and will highly recommend
streamsets review
best datastreaming app in aws marketplace, and im using it every time, and my experience is very good so it is highly recommended by me
StreamSets
It is one of best service, it is a lightweight, powerful design and execution engine that streams data in real time. Data Collector provides a web-based user interface (UI) to configure pipelines, preview data, monitor pipelines, and review snapshots of data.
Makes Life Easy
Data Migration cross RDBMS and NO-SQL become very easy.
By using StreamSets I am able to migrate data without any downtime and without any help from DBA. in the traditional way we were doing import and export for RDBMS to RDBMS which is not now needed. from RDBMS to NO-SQL I was using custom scripts to export data in CSV from Oracle and import it in Cassandra but now I have created a pipeline and all work is sorted now.