Articles & Tutorials

Articles & Tutorials>Elastic MapReduce
Showing 26-31 of 31 results.
Sort by:
Analyze your Amazon CloudFront Logs using Amazon Elastic MapReduce.
Last Modified: Jun 1, 2009 18:02 PM GMT
The Amazon Elastic MapReduce service allows users to create massively distributed data processing tasks built on Map and Reduce functions. Amazon Elastic Compute Cloud allows users to run any software on a scale out compute platform. EC2 can, for example be used for large scale data analysis by running an analytic database managementsystem. Often data analysis tasks start with a processing phase whereunstructured or semi-structured data needs to be processed or transformed before loading into a relational database. In this example we show how to use EMR to process and load a data set from S3 into the Vertica Analytic Database running on EC2.
Last Modified: May 30, 2009 14:08 PM GMT
Data Wrangling blogger and AWS developer Peter Skomoroch gives us an introduction to Amazon Elastic MapReduce. Peter Skomoroch is a consultant at Data Wrangling in Arlington, VA where he mines large datasets to solve problems in search, finance, and recommendation systems.
Last Modified: Apr 8, 2009 1:05 AM GMT
ItemSimilarity is a simple Hadoop streaming Python application that attempts to find similar items for each item in the input dataset. This example application finds similar artists using the Audioscrobbler user playlist dataset and Amazon Elastic MapReduce.
Last Modified: Apr 2, 2009 21:49 PM GMT
This example shows how to use Hadoop Streaming to count the number oftimes that words occur within a text collection.
Last Modified: Apr 2, 2009 20:53 PM GMT
CloudBurst provides highly-sensitive short read mapping with MapReduce.
Last Modified: Apr 2, 2009 20:53 PM GMT
Results per page:
©2013, Amazon Web Services, Inc. or its affiliates. All rights reserved.