AWS Database Blog

Category: Amazon EMR

Run complex queries on massive amounts of data stored on your Amazon DocumentDB clusters using Apache Spark running on Amazon EMR

In this post, we demonstrate how to set up Amazon EMR to run complex queries on massive amounts of data stored in your Amazon DocumentDB (with MongoDB compatibility) clusters using Apache Spark. Amazon DocumentDB (with MongoDB compatibility) is a fully managed native JSON document database that makes it easy and cost-effective to operate critical document […]

Backfilling an Amazon DynamoDB Time to Live (TTL) attribute with Amazon EMR

If you have complex data types such as maps and lists in your Amazon DynamoDB data, refer to Part 2 of this series. Bulk updates to a database can be disruptive and potentially cause downtime, performance impacts to your business processes, or overprovisioning of compute and storage resources. When performing bulk updates, you want to […]