AWS Partner Network (APN) Blog

Tag: EMR Clusters

Tamr-AWS-Partners

How Tamr Optimized Amazon EMR Workloads to Unify 200 Billion Records 5x Faster than On-Premises

Global business leaders recognize the value of advanced and augmented big data analytics over various internal and external data sources. However, technical leaders also face challenges capturing insights from data silos without unified master data. Learn how migrating Tamr’s data mastering solutions from on-premises to AWS allowed a customer to process billions of records five times faster with fully managed Amazon EMR clusters.

How to Create a Continually Refreshed Amazon S3 Data Lake in Just One Day

Data management architectures have evolved drastically from the traditional data warehousing model, to today’s more flexible systems that use pay-as-you-go cloud computing models for big data workloads. Learn how AWS services like Amazon EMR can be used with Bryte Systems to deploy an Amazon S3 data lake in one day. We’ll also detail how AWS and the BryteFlow solution can automate modern data architecture to significantly accelerate delivery and business insights at scale.

Scheduling Provisioning and Termination of Amazon EMR Clusters with AWS Service Catalog Connector for ServiceNow

Scheduling when to provision Amazon EMR clusters allows data scientists to run analytics workloads at their own schedule. Scheduling when to terminate clusters allows them to run analytics workloads only when they need them. Here, we demonstrates how to automatically schedule provisioning and termination of EMR clusters by utilizing AWS Service Catalog Connector for ServiceNow. The capability discussed in this post can also be extended to other AWS services.