Amazon EMR integration with AWS Lake Formation is now generally available

Posted on: Oct 9, 2020

Amazon EMR now allows you to leverage AWS Lake Formation for defining and enforcing fine-grained access control policies for Apache Spark applications. Previously, this feature was in beta.  

By integrating Amazon EMR with AWS Lake Formation, you can enhance data access control on multi-tenant EMR clusters by managing Amazon S3 data access at the level of databases, tables, and columns. This feature also enables SAML-based single sign-on (SSO) to EMR Notebooks and Apache Zeppelin, and simplifies the authentication for organizations using Active Directory Federation Services (ADFS). With this integration, you have a single place to manage data access for Amazon EMR along with other AWS analytics services including Amazon Redshift Spectrum, Amazon Glue, and Amazon Athena.  

The integration is available at no additional cost with the EMR 5.31 release in the following AWS Regions: US East (N. Virginia, Ohio), US West (Oregon) US West (N. California), Europe (Stockholm, Ireland, Paris, Frankfurt, and London), Asia Pacific (Tokyo, Mumbai, Seoul, Sydney, and Singapore), South America (São Paulo), and Canada (Central).  

To get started, see Integrating Amazon EMR with AWS Lake Formation and Control data access and permissions with AWS Lake Formation and Amazon EMR.