AWS Big Data Blog

Tag: RStudio

Running sparklyr – RStudio’s R Interface to Spark on Amazon EMR

This post was last updated July 7th, 2021 (original version by Tom Zeng). The sparklyr package by RStudio has made processing big data in R much easier. Sparklyr is an R interface to Spark; it lets you use Spark as the backend for dplyr, one of the most popular data manipulation packages. Sparklyr also […]