AWS Big Data Blog

Philippe Wanner

Author: Philippe Wanner

Philippe Wanner is a Senior Specialist Solutions Architect at AWS. His role is to spread the migration and modernization best practices for large organizations. His current focus is in a multidisciplinary area around distributed systems, serverless architecture and business transformation.

Modernize Apache Spark workflows using Spark Connect on Amazon EMR on Amazon EC2

In this post, we demonstrate how to implement Apache Spark Connect on Amazon EMR on Amazon Elastic Compute Cloud (Amazon EC2) to build decoupled data processing applications. We show how to set up and configure Spark Connect securely, so you can develop and test Spark applications locally while executing them on remote Amazon EMR clusters.