AWS News Blog

Category: Amazon Elastic MapReduce

Using Elastic MapReduce as a Generic Hadoop Cluster Manager

My colleague Steve McPherson sent along a nice guest post to get you thinking about ways to use Elastic MapReduce in non-traditional ways! — Jeff; Amazon Elastic MapReduce (EMR) is a fully managed Hadoop-as-a-service platform that removes the operational overhead of setting up, configuring and managing the end-to-end lifecycle of Hadoop clusters. Many of our […]

Read More

AWS CloudTrail Update – Seven New Services & Support From CloudCheckr

AWS CloudTrail records the API calls made in your AWS account and publishes the resulting log files to an Amazon S3 bucket in JSON format, with optional notification to an Amazon SNS topic each time a file is published. Our customers use the log files generated CloudTrail in many different ways. Popular use cases include […]

Read More

AWS Price Reduction #42 – EC2, S3, RDS, ElastiCache, and Elastic MapReduce

It is always fun to write about price reductions. I enjoy knowing that our customers will find AWS to be an even better value over time as we work on their behalf to make AWS more and more cost-effective over time. If you’ve been reading this blog for an extended period of time you know […]

Read More

Cross-Region Export and Import of DynamoDB Tables

Two of the most frequent feature requests for Amazon DynamoDB involve backup/restore and cross-Region data transfer. Today we are addressing both of these requests with the introduction of a pair of scalable tools (export and import) that you can use to move data between a DynamoDB table and an Amazon S3 bucket. The export and […]

Read More

Analyze Large Data Sets on Elastic MapReduce Clusters With Impala

Impala is an open source query tool for Hadoop. You can use familiar SQL-like statements to activate Impala’s distributed in-memory query engine, allowing you to quickly and efficiently process large amounts of data. In many cases, Impala is significantly faster than Hive, allowing you to interact with your data in real-time. Impala can process data […]

Read More

Tag Your Elastic MapReduce Clusters

Amazon Elastic MapReduce gives you the power to process vast amounts of data using Hadoop, an open source parallel processing framework. Behind the scenes, each Elastic MapReduce cluster runs on an array of Amazon EC2 instances. These clusters can grow to hundreds or even thousands of instances, and you can even run several clusters at […]

Read More