Posted On: Mar 31, 2023

AWS Glue Studio now offers 10 new visual transforms: Concatenate, Split string, Array to columns, Add current timestamp, Pivot rows to columns, Unpivot columns to rows, Lookup, Explode, Derived column, and Autobalance processing. AWS Glue Studio offers a visual extract-transform-and-load (ETL) interface to author, run, and monitor AWS Glue ETL jobs. With these new components, ETL developers can quickly build more sophisticated data pipelines without having to write code.

These 10 new transformations address a variety of data processing needs: Merge columns together with Concatenate or separate a string using a delimiter with Split string. Add current timestamp lets you record when your data was processed. Pivot rows to columns and Unpivot columns to rows help restructure data for aggregation. Lookup is a simple way to add columns from another table into the dataset. Derived column lets you define a new column based on other columns using mathematical or SQL expressions. Explode unpacks nested data structures and, finally, Autobalance processing is an advanced feature that improves job performance when the data is not well distributed among workers. All of these new transformations are available in both visual and code-based jobs.

These new components are available in the same AWS Regions as AWS Glue.

To learn more, visit the AWS Glue documentation.