AWS News Blog

Tag: Amazon Elastic MapReduce

New AWS Quick Start – Cloudera Enterprise Data Hub

The new Quick Start Reference Deployment Guide for Cloudera Enterprise Data Hub does exactly what the title suggests! The comprehensive (20 page) guide includes the architectural considerations and configuration steps that will help you to launch the new Cloudera Director and an associated Cloudera Enterprise Data Hub (EDH) in a matter of minutes. As the […]

Consistent View for Elastic MapReduce’s File System

Many AWS developers are using Amazon EMR (a managed Hadoop service) to quickly and cost-effectively build applications that process vast amounts of data. The EMR File System (EMRFS) allows AWS customers to use Amazon Simple Storage Service (Amazon S3) as a durable and cost-effective data store that is independent of the memory and compute resources […]

Elastic MapReduce Now Supports Hive 13

I am pleased to announce that Elastic MapReduce now supports version 13 of Hive. Hive is a great tool for building and querying large data sets. It supports the ETL (Extract/Transform/Load) process with some powerful tools, and give you access to files stored on your EMR cluster in HDFS or in Amazon Simple Storage Service […]

Using Elastic MapReduce as a Generic Hadoop Cluster Manager

My colleague Steve McPherson sent along a nice guest post to get you thinking about ways to use Elastic MapReduce in non-traditional ways! — Jeff; Amazon Elastic MapReduce (EMR) is a fully managed Hadoop-as-a-service platform that removes the operational overhead of setting up, configuring and managing the end-to-end lifecycle of Hadoop clusters. Many of our […]

AWS CloudTrail Update – Seven New Services & Support From CloudCheckr

AWS CloudTrail records the API calls made in your AWS account and publishes the resulting log files to an Amazon S3 bucket in JSON format, with optional notification to an Amazon SNS topic each time a file is published. Our customers use the log files generated CloudTrail in many different ways. Popular use cases include […]

New Instance Types for Amazon Elastic MapReduce

Thousands of AWS customers use Amazon Elastic MapReduce to process and store vast amounts of data. Because Elastic MapReduce is built around the Hadoop framework, it is easy to use hundreds or thousands of Amazon EC2 instances in parallel. Hot on the heels of the price reductions that we made last week, we are also […]

Eight Years (And Counting) of Cloud Computing

We launched Amazon S3 on March 14, 2006 with a press release and a simple blog post. We knew that the developer community was interested in and hungry for powerful, scalable, and useful web services and we were eager to see how they would respond. S3 and the Amazon Values Almost every company has a […]