AWS Partner Network (APN) Blog

Category: Amazon EMR

Flipboard Teams with Mactores to Modernize a High Volume HBase Data Platform to Fully-Managed Amazon EMR

Take an in-depth view of the cloud migration and data platform modernization process for Flipboard, which engaged Mactores Cognition for a thorough assessment of the self-managed platform and help migrating existing data workloads to a fully managed Amazon EMR serverless big data platform. The process streamlined Flipboard’s distributed database capabilities, allowing the social media platform to support user spikes at scale, maximize throughput performance, and prepare to expand the user base exponentially.

Best Practices from Provectus for Migrating and Optimizing Amazon EMR Workloads

Provectus, an AWS Premier Tier Services Partner with the Data and Analytics Competency, helps clients resolve issues with their legacy, on-premises data platforms by implementing best practices for the migration and optimization of Amazon EMR workloads. This post examines the challenges organizations face along the path to a successful migration, and explores best practices for re-architecting and migrating on-premises data platforms to AWS.

Big Data Analytics with Amazon EMR and Esri’s ArcGIS GeoAnalytics Engine

Data is growing in all aspects of our world; every vertical and technical domain is being pushed to the limit by growing data, and geospatial is no exception. Learn about Esri’s ArcGIS GeoAnalytics Engine on Amazon EMR and how its geospatial capabilities can complement your current analytics workflows. GeoAnalytics seamlessly integrates with Amazon EMR and can be deployed with an Esri-provided bootstrap script at EMR cluster creation.

Accelerate Hadoop-to-Amazon EMR Migration Using Virtusa’s Migration Factory

Although the global Hadoop-as-a-service market is growing at a CAGR of ~39%, many small, mid-size, and large organizations are left out because of the inherent pains of migration. This post explores how Virtusa complements Amazon EMR Migration with approaches and utilities, organized into a migration framework of automated modules, that streamline and manage large-scale Hadoop-to-Amazon EMR migrations.

Fully Managed Data Governance with Amazon EMR Integration with Apache Ranger and Privacera

Privacera is an AWS Partner that provides security and privacy tools for enterprises to secure and govern user access to databases and datastores in the cloud. PrivaceraCloud reduces the burden of self-managing Apache Ranger by providing Ranger as a hosted service. It provides centralized management of data access, authorization policies, and auditing. Learn how Amazon EMR can integrate with PrivaceraCloud to provide a fully-managed data governance solution.

How Tamr Optimized Amazon EMR Workloads to Unify 200 Billion Records 5x Faster than On-Premises

Global business leaders recognize the value of advanced and augmented big data analytics over various internal and external data sources. However, technical leaders also face challenges capturing insights from data silos without unified master data. Learn how migrating Tamr’s data mastering solutions from on-premises to AWS allowed a customer to process billions of records five times faster with fully managed Amazon EMR clusters.

Migrating Netezza Workloads to AWS Using Amazon EMR and Amazon Redshift

Data warehouse modernization has been a key aspect of many customers’ broader cloud transformation stories. Legacy data warehouse systems, however, present many challenges when dealing with today’s enterprise data needs. Learn how AWS and Infosys collaborated to transform a legacy Netezza platform on AWS for a large retail customer. With Infosys tools, processes, and industry knowledge, the collaboration between AWS and Infosys enables customers to transform their analytics platforms.

Bigstream Provides Big Data Acceleration with Apache Spark and Amazon EMR

Apache Spark and its parallel processing framework, along with the ease of scaling up in public clouds, have pushed out the limits for data analytics. Learn how Bigstream addresses growing Spark needs, with software that optimizes existing CPU infrastructure and can also seamlessly incorporate advanced programmable hardware. With the same number of servers, Bigstream can accelerate Spark clusters 3x with software alone and 10x when introducing FPGAs.

Bursting Your On-Premises Data Lake Analytics and AI Workloads on AWS

Developing and maintaining an on-premises data lake is a complex undertaking. To maximize the value of data and use it as the basis for critical decisions, the data platform must be flexible and cost-effective. Learn how to build a hybrid data lake with Alluxio to leverage analytics and AI on AWS alongside a multi-petabyte on-premises data lake. Alluxio’s solution is called “zero-copy” hybrid cloud, indicating a cloud migration approach without first copying data to Amazon S3.

How nClouds Helps Accelerate Data Delivery with Apache Hudi on Amazon EMR

Apache Hudi on Amazon EMR is an ideal solution for large-scale and near real-time applications that require incremental data pipelines and processing. This post provides a step-by-step method to perform a proof of concept for Apache Hudi on Amazon EMR. Learn how nClouds used a non-customer-facing PoC solution to set up a new data and analytics platform using Apache Hudi on Amazon EMR and other managed services, including Amazon QuickSight for data visualization.