AWS Glue Streaming ETL jobs support reading records in the Apache Avro format

Posted on: Oct 15, 2020

Streaming extract, transform, and load (ETL) jobs in AWS Glue can now read data encoded in the Apache Avro format. Previously, streaming ETL jobs could read data in the JSON, CSV, Parquet, and XML formats. With the addition of Avro, streaming ETL jobs now support all the same formats as batch AWS Glue jobs.

AWS Glue streaming ETL jobs continuously consume data from streaming sources, clean and transform the data in flight, and make it available for analysis in seconds. Apache Avro is a popular format for streaming data because it supports schema evolution and offers fast serialization and deserialization.
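To make the schema evolution point concrete, here is a minimal sketch of the idea behind Avro's record resolution: a consumer using a newer "reader" schema can still read records produced with an older "writer" schema, because fields missing from the record are filled from the reader schema's defaults and fields the reader no longer declares are ignored. This is a simplified illustration in plain Python, not an Avro implementation and not AWS Glue code. The schemas, the `Click` record, and the `resolve_record` helper are hypothetical names chosen for this example, and aliases and type promotion are left out.

```python
import json

# Hypothetical schemas for illustration only; they do not come from AWS Glue.
# The writer schema is what the producer used when the record was written.
WRITER_SCHEMA = json.loads("""
{"type": "record", "name": "Click", "fields": [
  {"name": "user_id", "type": "string"},
  {"name": "url", "type": "string"}
]}
""")

# The reader schema is what the consumer expects today: "url" was dropped
# and "referrer" was added with a default value.
READER_SCHEMA = json.loads("""
{"type": "record", "name": "Click", "fields": [
  {"name": "user_id", "type": "string"},
  {"name": "referrer", "type": "string", "default": "unknown"}
]}
""")


def resolve_record(record, writer_schema, reader_schema):
    """Project a decoded record onto the reader schema (simplified)."""
    writer_fields = {field["name"] for field in writer_schema["fields"]}
    resolved = {}
    for field in reader_schema["fields"]:
        name = field["name"]
        if name in writer_fields:
            # Field exists in both schemas: keep the written value.
            resolved[name] = record[name]
        elif "default" in field:
            # Field is new in the reader schema: fall back to its default.
            resolved[name] = field["default"]
        else:
            # A new field without a default cannot be resolved.
            raise ValueError(f"no value or default for field {name!r}")
    # Writer fields absent from the reader schema are silently dropped.
    return resolved


if __name__ == "__main__":
    old_record = {"user_id": "u-1", "url": "https://example.com"}
    print(resolve_record(old_record, WRITER_SCHEMA, READER_SCHEMA))
```

In practice an Avro library performs this resolution while decoding the binary data, so a streaming consumer does not need to implement it by hand.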

Support for reading the Apache Avro format in streaming ETL jobs is available in all AWS Regions where AWS Glue is available.

To learn more, read about adding streaming ETL jobs in our documentation.