Posted On: Oct 1, 2009

Amazon Elastic MapReduce now supports Apache Hive, an open source data warehouse and analytics package operated by a SQL-based language. Apache Hive allows users to structure, summarize, and query data sources stored in Amazon S3 without requiring in-depth knowledge of the map/reduce development paradigm. Hive goes beyond standard SQL, adding first-class support for map/reduce functions and complex extensible user-defined data types like Json and Thrift. This capability allows processing of complex and even unstructured data sources such as text documents and log files. Apache Hive is a powerful addition to Amazon Elastic MapReduce that makes the service even more compelling for large data set processing and analytics. 

In addition, we are pleased to announce that Amazon Elastic MapReduce is now supported by Karmasphere Studio for Hadoop, a NetBeans-based integrated development environment (IDE) that makes it easy to develop, debug, and deploy job flows from your desktop directly to Amazon Elastic MapReduce. See Karmasphere Studio for Hadoop for more details on this IDE.

Modified 12/22/2021 – In an effort to ensure a great experience, expired links in this post have been updated or removed from the original post.