Posted On: Jan 28, 2021

AWS Glue DataBrew now supports the following delimiter options for its datasets, giving you the flexibility to bring a variety of .csv and .tsv files into DataBrew for data preparation. The supported delimiters are:

  • comma (,)
  • colon (:)
  • semicolon (;)
  • pipe (|)
  • tab (\t)
  • caret (^)
  • space ( )
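
As a rough illustration of how one of these delimiters might be chosen programmatically, the sketch below builds the request parameters for a DataBrew dataset backed by a pipe-delimited file in Amazon S3. The helper `build_dataset_request`, the dataset name, the bucket, and the object key are all hypothetical, and the parameter layout (`FormatOptions` → `Csv` → `Delimiter`) reflects our understanding of the boto3 DataBrew client's `create_dataset` call; treat it as an assumption and check it against the current API reference before relying on it.

```python
# Delimiters listed in this announcement, keyed by a readable name.
SUPPORTED_DELIMITERS = {
    "comma": ",",
    "colon": ":",
    "semicolon": ";",
    "pipe": "|",
    "tab": "\t",
    "caret": "^",
    "space": " ",
}


def build_dataset_request(name, bucket, key, delimiter_name):
    """Build keyword arguments for a DataBrew create_dataset call.

    Hypothetical helper: the dictionary layout is assumed to match the
    boto3 DataBrew client and should be verified against the API docs.
    """
    if delimiter_name not in SUPPORTED_DELIMITERS:
        raise ValueError(f"unsupported delimiter: {delimiter_name!r}")
    return {
        "Name": name,
        "Format": "CSV",
        "FormatOptions": {
            "Csv": {"Delimiter": SUPPORTED_DELIMITERS[delimiter_name]}
        },
        "Input": {"S3InputDefinition": {"Bucket": bucket, "Key": key}},
    }


request = build_dataset_request(
    name="sales-pipe-delimited",  # hypothetical dataset name
    bucket="example-bucket",      # hypothetical S3 bucket
    key="raw/sales.psv",          # hypothetical object key
    delimiter_name="pipe",
)

# With AWS credentials configured, the request could be sent like this:
#   import boto3
#   boto3.client("databrew").create_dataset(**request)
```

Keeping the delimiter lookup separate from the API call means the parameters can be inspected or validated without AWS credentials; the same delimiter can also be selected in the DataBrew console when creating a dataset.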

AWS Glue DataBrew offers customers over 250 pre-built transformations to automate data preparation tasks (e.g., filtering anomalies, standardizing formats, and correcting invalid values) that would otherwise require days or weeks of writing hand-coded transformations. Once the data is prepared, customers can immediately use it with AWS and third-party analytics and machine learning services to query the data and train machine learning models.

To learn more, view this getting started video or use a sample dataset to explore DataBrew.

To get started, visit the AWS Management Console or install the DataBrew plugin in your Notebook environment and refer to the DataBrew documentation.