Articles & Tutorials

This tutorial shows you how to develop a simple log-parsing application using Pig and Amazon Elastic MapReduce. The tutorial walks you through using Pig interactively (via SSH) on a subset of your data, which enables you to prototype your script quickly. The tutorial then takes you through uploading the script to Amazon S3 and running it on a larger set of input data.
Last Modified: Mar 20, 2014 15:30 PM GMT
Analyze your Apache logs using Pig and Amazon Elastic MapReduce.
Last Modified: Mar 20, 2014 15:27 PM GMT
This article shows how to use EMR to efficiently export DynamoDB tables to S3, import S3 data into DynamoDB, and perform sophisticated queries across tables stored in both DynamoDB and other storage services such as S3.
Last Modified: Sep 26, 2013 0:23 AM GMT
This article and code sample are targeted at system architects, system administrators, and security professionals who want to integrate Shibboleth with Amazon Web Services (AWS). It describes a federation proxy approach to giving users single sign-on (SSO) access to on-premises resources and the AWS Management Console.
Last Modified: Sep 26, 2013 0:22 AM GMT

An internet advertising company operates a data warehouse using Hive and Amazon Elastic MapReduce. This company runs machines in Amazon EC2 that serve advertising impressions and redirect clicks to the advertised sites. The machines running in Amazon EC2 store each impression and click in log files pushed to Amazon S3.

Last Modified: Feb 15, 2012 2:55 AM GMT
This document provides a quick guide on how to use Elastic MapReduce to develop, debug, and run job flows that have multiple steps.
Last Modified: Jul 9, 2010 19:35 PM GMT
Data Wrangling blogger and AWS developer Peter Skomoroch gives us an introduction to Amazon Elastic MapReduce. Peter Skomoroch is a consultant at Data Wrangling in Arlington, VA, where he mines large datasets to solve problems in search, finance, and recommendation systems.
Last Modified: Apr 8, 2009 1:05 AM GMT
ItemSimilarity is a simple Hadoop streaming Python application that attempts to find similar items for each item in the input dataset. This example application finds similar artists using the Audioscrobbler user playlist dataset and Amazon Elastic MapReduce.
Last Modified: Apr 2, 2009 21:49 PM GMT
This example shows how to use Hadoop Streaming to count the number of times that words occur within a text collection.
Last Modified: Apr 2, 2009 20:53 PM GMT
These examples illustrate using SimpleDB in a step-by-step approach. They should serve as a quick-start introduction and tutorial. The goal is for each program to be small and easy to understand. Includes a JSP and HTML page to put sample data into a YUI Data Table.
Last Modified: Mar 18, 2009 15:52 PM GMT
©2014, Amazon Web Services, Inc. or its affiliates. All rights reserved.