The Amazon EMR-DynamoDB Connector for Apache Hive and Apache Spark is now open-source

Posted on: Sep 23, 2016

Amazon Web Services has open-sourced the emr-dynamodb-connector, which enables Apache Hive and Apache Spark on Amazon EMR to access data in Amazon DynamoDB. You can process data directly in Amazon DynamoDB using these applications, or join tables in Amazon DynamoDB with external tables in Amazon S3, Amazon RDS, or other data stores that can be accessed by Amazon EMR. The connector is still included for use on each node in your Amazon EMR cluster. To learn more or contribute to the project, visit the emr-dynamodb-connector GitHub page.