Posted On: Jan 28, 2021
AWS Glue DataBrew now supports the following delimiter options for its datasets, giving you the flexibility to bring a variety of .csv and .tsv files into DataBrew for data preparation. The supported delimiters are:
- comma (,)
- colon (:)
- semicolon (;)
- pipe (|)
- tab (\t)
- caret (^)
- space ( )
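As a rough illustration of how one of these delimiters might be selected programmatically, the sketch below builds the keyword arguments for a DataBrew dataset-creation request. It assumes the request shape used by the boto3 `databrew` client's `create_dataset` call (`Format`, `FormatOptions.Csv.Delimiter`, `Input.S3InputDefinition`); the dataset name, bucket, and key are hypothetical placeholders, and `build_dataset_request` is a helper invented for this example, not part of any AWS SDK. The block itself makes no AWS calls.

```python
# Delimiters listed in this announcement, keyed by a readable name.
SUPPORTED_DELIMITERS = {
    "comma": ",",
    "colon": ":",
    "semicolon": ";",
    "pipe": "|",
    "tab": "\t",
    "caret": "^",
    "space": " ",
}


def build_dataset_request(name, bucket, key, delimiter_name, header_row=True):
    """Return keyword arguments for a dataset-creation request.

    Hypothetical helper: the dictionary layout is assumed to match what
    the boto3 DataBrew client's create_dataset call accepts.
    """
    if delimiter_name not in SUPPORTED_DELIMITERS:
        raise ValueError(f"unsupported delimiter: {delimiter_name!r}")
    return {
        "Name": name,
        # Delimited text files are assumed to use the CSV format,
        # with the actual separator given in FormatOptions.
        "Format": "CSV",
        "FormatOptions": {
            "Csv": {
                "Delimiter": SUPPORTED_DELIMITERS[delimiter_name],
                "HeaderRow": header_row,
            }
        },
        "Input": {"S3InputDefinition": {"Bucket": bucket, "Key": key}},
    }


request = build_dataset_request(
    name="orders-pipe-delimited",  # hypothetical dataset name
    bucket="example-bucket",       # hypothetical S3 bucket
    key="raw/orders.txt",          # hypothetical object key
    delimiter_name="pipe",
)

# With boto3 installed and AWS credentials configured, the request
# could then be sent along these lines (not executed here):
#   import boto3
#   boto3.client("databrew").create_dataset(**request)
```

Validating the delimiter name before building the request keeps a typo from reaching the service; the same dictionary could also be serialized to JSON for use with other tooling.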
AWS Glue DataBrew offers customers over 250 pre-built transformations to automate data preparation tasks (for example, filtering anomalies, standardizing formats, and correcting invalid values) that would otherwise require days or weeks of writing hand-coded transformations. Once the data is prepared, customers can immediately start using it with AWS and third-party analytics and machine learning services to query the data and train machine learning models.
To learn more, watch the getting started video or use a sample dataset to explore DataBrew.
To get started, visit the AWS Management Console or install the DataBrew plugin in your notebook environment, and refer to the DataBrew documentation.