AWS Big Data Blog
AWS Big Data Meetup March 31 in San Francisco: Intro to SparkR and breakout discussions
Join and RSVP! Guest Speaker: Cory Dolphin from Twitter Learn about how Answers, Fabric’s realtime analytics product, which processes billions of events in realtime, using Twitter’s new stream processing engine, Heron. Cory will explain some of the challenges the team faced while scaling Storm, and how Heron has helped them fly faster. Specifically, Cory will describe how Heron’s […]
AWS Big Data Meetup March 22 in Seattle: Intro to SparkR and breakout discussions
Join and RSVP! AWS Speaker Christopher Crosbie, Healthcare and Life Sciences Partner Solutions Architect for Amazon Web Services For a long time, R users have sliced and diced their computational problems into smaller pieces to be able to run it in smaller chunks. But what if you want to compute on a huge dataframe with […]
Join us at the AWS Big Data Meetup on February 24th in Palo Alto
Join and RSVP! Guest Speaker: Cory Dolphin from Twitter Learn about how Answers, Fabric’s realtime analytics product, which processes billions of events in realtime, using Twitter’s new stream processing engine, Heron. Cory will explain some of the challenges the team faced while scaling Storm, and how Heron has helped them fly faster. Specifically, Cory will describe how Heron’s […]
Join us at the AWS Big Data Meetup on January 13th in San Francisco
The AWS Big Data Meetup brings Big Data developers and enthusiasts together to discuss Big Data solutions with each other and AWS team members. At the event you will hear speakers from AWS and the wider community who are pushing the boundaries of Big Data. We are committed to maintaining a technical focus, and invite […]
Getting Started with Amazon EMR Bootstrap Actions
Steve McPherson is a Senior Manager for Amazon Elastic MapReduce Note: This post was updated 2/8/16. The Presto bootstrap action documented in the original post has been deprecated because EMR now offers a Presto-Sandbox as a full-fledged EMR application. For details, see the EMR sandbox. Amazon Elastic MapReduce (EMR) is a fully managed Hadoop-as-a-service platform […]