Author: Preshen Goobiah

Simplifying data processing at Capitec with Amazon Redshift integration for Apache Spark

This post is co-written with Preshen Goobiah and Johan Olivier from Capitec. Apache Spark is a widely-used open source distributed processing system renowned for handling large-scale data workloads. It finds frequent application among Spark developers working with Amazon EMR, Amazon SageMaker, AWS Glue and custom Spark applications. Amazon Redshift offers seamless integration with Apache Spark, […]

AWS Big Data Blog

Author: Preshen Goobiah

Simplifying data processing at Capitec with Amazon Redshift integration for Apache Spark

Learn

Resources

Developers

Help