Karmasphere provides a graphical, high productivity solution for working with large structured and unstructured data sets on Amazon Elastic MapReduce. By combining the scalability and flexibility of Amazon Elastic MapReduce with the ease-of-use and graphical interface of Karmasphere desktop tools, you can quickly and cost-effectively build powerful Apache Hadoop-based applications to generate insights from your data. Launch new or access existing Amazon Elastic MapReduce job flows directly from the Karmasphere Analyst or Karmasphere Studio desktop tools, all with hourly pricing and no upfront fees or long-term commitments.
You can run Amazon Elastic MapReduce with Karmasphere Analytics under two different licensing models – “License Included” and “Bring-Your-Own-License (BYOL)”. In the "License Included" service model, you do not need separately purchased Karmasphere licenses; the Karmasphere software has been licensed by AWS. Simply launch your Elastic MapReduce job flows directly from Karmasphere's desktop tools or launch them through from the Java SDK or Ruby CLI with Karmasphere Analytics enabled. If you already own Karmasphere licenses, you can use the "BYOL" model to launch Amazon Elastic MapReduce job flows with Karmasphere Analytics. The “BYOL” model is designed for customers who prefer to use existing Karmasphere licenses or purchase new licenses directly from Karmasphere.
Karmasphere Analyst
Karmasphere Analyst is a visual, desktop workspace for data professionals and analysts to explore and interact with Big Data on Amazon Elastic MapReduce. It provides visual tools to use SQL, or other familiar languages, to make ad-hoc queries and interact with the results. The workspace provides access to structured and unstructured data located on Amazon S3, Amazon Elastic MapReduce job flows, or local file systems, supports intuitive analytics via graphical wizards and SQL, and allows users to publish results to files, databases, and other applications such as Microsoft Excel or Tableau. With Karmasphere Analyst you can:
Features
Access Elastic MapReduce Job Flows - Create new Elastic MapReduce job flows through an easy-to-use wizard or choose from a list of existing job flows that you launched with Karmasphere Analytics enabled.
Assemble Unstructured and Structured Data - With drag and drop file system access to Amazon S3, automatic discovery of common file formats and compression types, and wizards to aid with data discovery and assembly, Karmasphere Analyst makes it straightforward to analyze your data regardless of type, volume, or origin.
Analyze Rapidly with Familiar Capabilities – Karmasphere Analyst provides wizards, auto-complete, syntax highlighting, and visual query plans to simplify prototyping, debugging and deploying Apache Hadoop applications onto Amazon Elastic MapReduce.
Act on Results - You can export results to a file or database, integrate with single-click access to Microsoft Excel, Tableau, or other BI tools, and save and reuse the code that generated the result.
Karmasphere Studio is a plug-in for the Eclipse IDE that provides a familiar graphical environment for managing the complete lifecycle for developing Hadoop applications on Amazon Elastic MapReduce, including prototyping, developing, testing, debugging, optimizing, and deploying those applications. By simplifying the development of MapReduce jobs on Amazon Elastic MapReduce, Karmasphere Studio increases the productivity of developers, saving time and effort. Its intuitive, visual interface enables the full spectrum of developers - from those just starting with Big Data to those highly-experienced with Java, Cascading and Streaming - to take advantage of Amazon Elastic MapReduce. With Karmasphere Studio you can:
Features
Access Amazon Elastic MapReduce job flows - Create new Amazon Elastic MapReduce job flows through an easy-to-use wizard or choose from a list of existing job flows that you launched with Karmasphere Analytics enabled.
Prototype and Develop - Prototype using a graphical workflow and easy-to-use wizards to enable rapid development and debugging of applications before deploying them onto Amazon Elastic MapReduce.
Test and Debug - One-click deployment of applications, graphical monitoring and profiling tools, and drag-and-drop HDFS and AMAZON S3 navigation make it easier to test and debug Amazon Elastic MapReduce applications.
Optimize and Package - Profile job behavior, diagnose on-cluster problems, and get recommendations to help optimize performance. Guided packaging of Hadoop jobs enable them to be packaged appropriately, exported from the IDE, and deployed onto Amazon Elastic MapReduce job flows.
Your cost will depend on the number and type of Amazon EC2 Instances in your job flow and the amount of time it is running. Elastic MapReduce pricing is in addition to pricing for EC2 and S3.
Pricing for Amazon EC2 and Amazon Elastic MapReduce
You are charged from the time the job flow begins processing until it is terminated. Partial hours are rounded up.
Save Money with Reserved and Spot Instances
The Amazon EC2 prices above are for On-demand Instances. On-Demand Instances are the most expensive but give you the most flexibility. EC2 also offers Reserved Instances and Spot Instances.
Reserved Instances give you the option to make a low, one-time payment for each instance you want to reserve and in turn receive a significant discount on the hourly charge for that instance. There are three Reserved Instance types (Light, Medium, and Heavy Utilization Reserved Instances) that enable you to balance the amount you pay upfront with your effective hourly price.
Spot Instances enable you to bid for unused Amazon EC2 capacity. Instances are charged the Spot Price, which is set by Amazon EC2 and fluctuates periodically depending on the supply of and demand for Spot Instance capacity. To use Spot Instances, you specify the maximum price you are willing to pay per instance hour. If your maximum price bid exceeds the current Spot Price, your request is fulfilled and your instances will run until either you choose to terminate them or the Spot Price increases above your maximum price (whichever is sooner).
"Amazon Elastic MapReduce with Spot Instances has made it easy to prototype and surprisingly cost-effective to scale, decreasing our data processing costs by over 50%." - VP of Engineering at Fliptop
To view more information and current prices for Reserved Instances and Spot Instances, see the Amazon EC2 pricing page.
Other Pricing Details
Amazon S3 is billed separately. (Many customers store their input and output data in S3; others store all of the data locally on HDFS.) Currently it costs $668 per month to store 10 TB of data in S3 with reduced redundancy. The more data you store, the lower the monthly price per GB.
Amazon SimpleDB is also billed separately. (Only applies if you enable debugging for your job flow)
Karmasphere, a Big Data Intelligence company, unlocks the power of Apache Hadoop for developers and analysts in the enterprise. Karmasphere offers Big Data Analytics solutions to explore unstructured data produced by web, mobile, sensor, and social media together with traditional structured data. Its product suite builds on the Karmasphere Big Data Analytics Engine™ offering the use of standard programming and query languages and a graphical user interface to maximize productivity for greater discovery, insight and action.