Posted On: Nov 18, 2021

AWS Glue DataBrew customers can now create datasets by writing Structured Query Language (SQL) statements that retrieve data from Amazon Redshift and Snowflake over Java Database Connectivity (JDBC) connections. A purpose-built query lets you select only the data you want and limit the data returned from large tables before you clean, normalize, and transform it with DataBrew. For a list of supported input formats, see the AWS Glue DataBrew input formats list.
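As a rough illustration, a SQL-based dataset might be defined programmatically as in the sketch below. It assumes the boto3 `databrew` client and that `create_dataset` accepts a `QueryString` inside `DatabaseInputDefinition`. The dataset name, Glue connection name, bucket, and query are hypothetical placeholders, and whether a temporary S3 directory is needed depends on your connection type.

```python
from typing import Any, Dict, Optional


def build_sql_dataset_request(
    name: str,
    glue_connection_name: str,
    query: str,
    temp_bucket: Optional[str] = None,
    temp_key: Optional[str] = None,
) -> Dict[str, Any]:
    """Build keyword arguments for a DataBrew create_dataset call.

    The field names follow the boto3 DataBrew API as understood here;
    all values passed in are placeholders supplied by the caller.
    """
    query = query.strip()
    if not query:
        raise ValueError("query must be a non-empty SQL statement")

    db_input: Dict[str, Any] = {
        "GlueConnectionName": glue_connection_name,  # existing JDBC connection in AWS Glue
        "QueryString": query,                        # SQL that selects and limits the data
    }
    if temp_bucket:
        # Assumed: an S3 location for intermediate query results.
        temp_dir: Dict[str, str] = {"Bucket": temp_bucket}
        if temp_key:
            temp_dir["Key"] = temp_key
        db_input["TempDirectory"] = temp_dir

    return {"Name": name, "Input": {"DatabaseInputDefinition": db_input}}


def create_sql_dataset(request: Dict[str, Any]) -> Dict[str, Any]:
    """Send the request to DataBrew. Requires boto3 and AWS credentials."""
    import boto3  # imported lazily so the builder above works without boto3

    client = boto3.client("databrew")
    return client.create_dataset(**request)


# Hypothetical example values, for illustration only.
example_request = build_sql_dataset_request(
    name="recent-orders",
    glue_connection_name="my-redshift-connection",
    query="SELECT order_id, total FROM orders WHERE order_date >= '2021-01-01' LIMIT 10000",
    temp_bucket="my-databrew-temp-bucket",
    temp_key="query-results/",
)
```

Calling `create_sql_dataset(example_request)` would submit the definition. Keeping the request builder separate from the API call makes it possible to inspect or validate the query before anything is sent to AWS.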

AWS Glue DataBrew is a visual data preparation tool that makes it easy to clean and normalize data using over 250 pre-built transformations, without writing any code. You can automate tasks such as filtering anomalies, converting data to standard formats, and correcting invalid values.

To get started with DataBrew, visit the AWS Management Console or install the DataBrew plugin in your notebook environment. To learn more, view this getting started video and refer to the DataBrew documentation.