Use Apache Hive Metastore as a metadata catalog with Amazon Athena (Preview)

Posted on: Nov 26, 2019

Today, Amazon Athena has released a new feature that allows you to connect Athena to your Apache Hive Metastore. 

Customers use a Hive Metastore as a common metadata catalog for their big data environments. Such customers run Apache Spark, Presto, and Apache Hive on Amazon EC2 and Amazon EMR clusters with a self-hosted Hive Metastore as a common catalog. AWS also offers the AWS Glue Data Catalog - a fully managed catalog and drop-in replacement for the Hive Metastore. With the current release, you can now connect multiple Hive Metastores in addition to the Glue Data Catalog with Athena. 

To connect to a self-hosted Hive Metastore, you need an Athena Hive Metastore connector. We have built a reference implementation of this connector and are making it available for you to use. The connector runs as an AWS Lambda function in your account. Detailed steps to add the Hive Metastore connector are available in our documentation.  

This feature is available in preview in the US-East-1 (N. Virginia) region. Begin your preview now by following these steps.  To learn more about this feature, please visit our documentation.