AWS News Blog

Analyze Large Data Sets on Elastic MapReduce Clusters With Impala

Impala is an open source query tool for Hadoop. You can use familiar SQL-like statements to activate Impala’s distributed in-memory query engine, allowing you to quickly and efficiently process large amounts of data. In many cases, Impala is significantly faster than Hive, allowing you to interact with your data in real-time. Impala can process data […]

C3 Instance Update

As you know, we launched our new compute optimized instance family (C3) a few weeks ago, and wow, are we seeing unprecedented demand across all sizes and all Regions!  As one of our product managers just told me, these instances are simply “fast in every dimension.” They have a high performance CPU, matched with SSD-based […]

Now Available – Global Secondary Indexes for Amazon DynamoDB

As I promised a few weeks ago, Amazon DynamoDB now supports Global Secondary Indexes. You can now create indexes and perform lookups using attributes other than the item’s primary key. With this change, DynamoDB goes beyond the functionality traditionally provided by a key/value store, while retaining the scalability and performance benefits that have made it […]

Background Task Handling for AWS Elastic Beanstalk

My colleague Abhishek Singh sent along a guest post to introduce a really important new feature for AWS Elastic Beanstalk. — Jeff; You can now launch Worker Tier environments in Elastic Beanstalk. These environments are optimized to process application background tasks at any scale. Worker tiers complement the existing web tiers and are ideal for time […]

AWS Week in Review – December 2, 2013

Let’s take a quick look at what happened in AWS-land last week: Monday, December 2 This week AWS Marketplace added new products including and OpenPhoto, Tendenci, Observium, PaaSLane, CloudWeaver, Chariot Data Broker and Moovweb. Tuesday, December 3 I noticed that the EC2 Public IP Address Ranges were updated. Wednesday, December 4 The US East (Northern […]

Tag Your Elastic MapReduce Clusters

Amazon Elastic MapReduce gives you the power to process vast amounts of data using Hadoop, an open source parallel processing framework. Behind the scenes, each Elastic MapReduce cluster runs on an array of Amazon EC2 instances. These clusters can grow to hundreds or even thousands of instances, and you can even run several clusters at […]