Source code for the AWS Glue Data Catalog client for Apache Hive Metastore is now available for download

Posted on: Feb 4, 2019

You can now download the source code for the AWS Glue Data Catalog client for Apache Hive Metastore. This client runs on any Amazon EMR cluster with Apache Hive 2.x that uses the AWS Glue Data Catalog as an external Hive Metastore.

The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as a replacement for an external Hive Metastore. To learn more, visit our documentation.

This code serves as a reference implementation for building a Hive Metastore compatible client that connects to the AWS Glue Data Catalog. With this release, customers and partners can build custom clients that enable them to use AWS Glue Data Catalog with other Hive-Metastore compatible platforms such as other Hadoop and Apache Spark distributions. To learn more, please visit our GitHub repository.