StreamSets Data Collector
StreamSets | 3.22.3Linux/Unix, Amazon Linux Amazon Linux 2 - 64-bit Amazon Machine Image (AMI)
External reviews
External reviews are not included in the AWS star rating for the product.
StreamSets is easy to use and maintain, has transparent appearance.
- Leave a Comment |
- Mark review as helpful
it was good, user friendly, the help desk is very good
Very Powerful and Easy Data Engineering platform. Capable to handle multiple platform and huge data.
They have a very easy and user-friendly user interface. It takes only a few days for new developers to start and deploy their first pipelines.
StreamSets provides easy and powerful stages(kind of connectors) to integrate StreamSets with different platforms such as Kafka, SalesForce, Oracle DB, Rest API, HTTPS connection, Data lakes and many more.
StreamSets uses regex expression for data transformation related operation which is really easy.
Monitoring StreamSets pipelines are very easy, you can register your Data collector to control hub using provisioning agents. After registering you can deploy pipelines to SCH and create jobs. All of this can be done using their Python SDK which can easily be integrated with ADO release pipelines.
After creating/deploying pipelines users can use SCH subscription to create alerts if pipelines/jobs changes their status.
For individual alerts pipeline have built-in capability to do so.
After their version 4.0.1 , sdc are merged with their data ops platform. This allows individual developers to have the feel of a Control Hub. It also remove platform dependancy.
They have very excellent security. Pipeline can be integrated with Azure Keyvaults which eliminates the needs of sharing credentials with Developers. Same goes for parametrs and runtime parameter. Developers can easily replace any value in pipeline with ADO library variables.
If you are an Organization they provide very extensive support, work instantly on any bug if found by an organization. They also have customer success team which will do anything to make sure your organisation's experience with StreamSets is seamless.
StreamSets allowed us to share real time data between platfoms which also removed dependancy from heavier ETL tools like SSIS, Abinitio.
Since it is easier which allows our talent developement team enable our developers to use StreamSets.
Streamsets : A Powerful Data Engineering + DataOps Tool
Scheduling Data Pipelines were never that easy.
Fetching application Secrets from Key-Vault for enhanced Security.
Better and Detailed logging/error information.
Fragment drill-down feature while monitoring data flow in a running Job.
Excellent and Useful Engine for Everything data
I have been using streamsets for a while now and I can say this is a very powerful design and execution engine. Makes it easy of me to create pipelines, seamless transition from s3 specifically to my Kafka and all. This is very good and will highly recommend
streamsets review
best datastreaming app in aws marketplace, and im using it every time, and my experience is very good so it is highly recommended by me
Solutions Architect
StreamSets Data Collector makes it easy to deploy execution engines from Oracle, Salesforce, JDBC, Hive, and more to Snowflake, Databricks, ADLS, and other core cloud platforms. Data Collector simplifies the design experience for Apache Kafka and runs on-premises or any cloud, wherever your data lives.
StreamSets
It is one of best service, it is a lightweight, powerful design and execution engine that streams data in real time. Data Collector provides a web-based user interface (UI) to configure pipelines, preview data, monitor pipelines, and review snapshots of data.
Makes Life Easy
Very good data operation platform, Hassle-free filtration of data and numerous options for the same
Decent data processing speed, lightweight data collector to configure pipeline, processing the data,preview the data, monitor the pipelines.
Friendly user interface for deleting or adding the connection ,stop,start the pipeline
Editing the single component should be more independent
Anomaly detection based on the traffic pattern.
Storing of raw data increases cost,using streamset we filtered out unnecessary data and used only required data for analysis.