Posted On: Aug 16, 2023

AWS Glue Studio now offers 5 new visual transforms: Record matching, Remove null rows, Extract string fragments from a regular expression, Parse JSON column, and Extract JSON path. AWS Glue Studio provides a visual extract, transform, and load (ETL) interface to author, run, and monitor AWS Glue ETL jobs. With these new transforms, ETL developers can quickly build more sophisticated data pipelines without having to write code.

Each of these 5 new transforms addresses a distinct data processing need. Use Remove null rows to reduce the number of rows and keep only the ones that contain data. Use Extract string fragments from a regular expression to pull the relevant parts out of a string. Use Record matching to apply your AWS Glue ML models and identify potential duplicates in your dataset. Use Parse JSON column to convert a JSON string into a struct or array, or use Extract JSON path to extract just the JSON path you need. The new transforms are available in both visual and code-based jobs.
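
As a rough illustration of what four of these transforms do conceptually, here is a minimal sketch in plain Python. It is not the AWS Glue API and not the code that AWS Glue Studio generates. All function names are hypothetical, rows are modeled as dictionaries, and treating a "null row" as a row whose every column is null is an assumption. Record matching is omitted because it depends on a trained ML model.

```python
import json
import re


def remove_null_rows(rows):
    """Keep rows that have at least one non-null value (assumed semantics)."""
    return [r for r in rows if any(v is not None for v in r.values())]


def extract_regex_fragment(text, pattern):
    """Return the first substring matching the pattern, or None if no match."""
    match = re.search(pattern, text)
    return match.group(0) if match else None


def parse_json_column(rows, column):
    """Replace a JSON string column with the parsed dict or list."""
    parsed = []
    for row in rows:
        new_row = dict(row)  # copy so the input rows are not mutated
        if new_row.get(column) is not None:
            new_row[column] = json.loads(new_row[column])
        parsed.append(new_row)
    return parsed


def extract_json_path(value, path):
    """Walk a list of keys or indexes into parsed JSON; None if absent."""
    current = value
    for key in path:
        try:
            current = current[key]
        except (KeyError, IndexError, TypeError):
            return None
    return current


# Illustrative data only.
rows = [
    {"id": 1, "payload": '{"user": {"email": "a@example.com"}}'},
    {"id": None, "payload": None},
]
kept = remove_null_rows(rows)
parsed = parse_json_column(kept, "payload")
email = extract_json_path(parsed[0]["payload"], ["user", "email"])
domain = extract_regex_fragment(email, r"[^@]+$")
```

In AWS Glue Studio itself, these steps are configured as nodes in the visual editor rather than written by hand.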

This new feature is now available in all AWS commercial regions where AWS Glue is available. See the AWS Regional Services List for the most up-to-date availability information.

To learn more, visit the AWS Glue documentation.