Sign in
Migration Mapping Assistant Your Saved List Partners Sell in AWS Marketplace Amazon Web Services Home Help

StreamSets Transformer (Small)

StreamSets Transformer (Small)

By: Streamsets, Inc. Latest Version: 3.14.0

Product Overview

StreamSets Transformer is an execution engine within the StreamSets DataOps platform that runs data processing pipelines on Apache Spark. Using a simple-to-use, drag and drop user interface, users can create pipelines for modern ETL, stream processing and machine learning operations.

It allows everyone, not only the savvy Spark developer, to fully utilize the power of Apache Spark without having to code in Scala or PySpark. Transformer pipelines are instrumented to provide unparalleled visibility into the execution of Spark applications with built-in previews and easy trouble-shooting. Transformer is designed to run on all the major Spark distributions to ensure you have the flexibility to run on your platform of choice.

When you start a job with a Transformer pipeline, Transformer submits the pipeline as a Spark application to the cluster. Spark handles all of the pipeline processing, including performing complex transformations on the data for ETL, machine learning, and complex computing.

StreamSets Transformer (Small) allows a maximum of two spark executors per pipeline.



Operating System

Linux/Unix, Amazon Linux Amazon Linux 2

Delivery Methods

  • Amazon Machine Image

Pricing Information

Usage Information

Support Information

Customer Reviews