With Amazon EMR you can launch a persistent cluster that stays up indefinitely or a temporary cluster that terminates after the analysis is complete. Amazon EMR supports a variety of Amazon EC2 instance types (standard, high CPU, high memory, high I/O, etc.) and Amazon EC2 pricing options (On-Demand, Reserved, and Spot). When you launch an Amazon EMR cluster (also called a "job flow"), you choose how many and what type of Amazon EC2 Instances to provision. The Amazon EMR price is in addition to the Amazon EC2 price.
You are charged from the time the cluster begins processing until it is terminated. Partial hours are rounded up.
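As a rough illustration of the billing rule above, the following Python sketch rounds a cluster's runtime up to whole hours and adds the Amazon EMR price on top of the Amazon EC2 price for each instance. The function name and the hourly rates are hypothetical placeholders chosen for the example, not published prices.

```python
import math


def estimate_cluster_cost(runtime_minutes, instance_count, ec2_hourly, emr_hourly):
    """Estimate the cost of one cluster run.

    Partial hours are rounded up, and the EMR price is charged
    in addition to the EC2 price for every instance.
    """
    billed_hours = math.ceil(runtime_minutes / 60)
    hourly_per_instance = ec2_hourly + emr_hourly
    return billed_hours * instance_count * hourly_per_instance


# Hypothetical rates: 10 instances running for 2 hours 10 minutes
# are billed for 3 full hours each.
cost = estimate_cluster_cost(
    runtime_minutes=130, instance_count=10, ec2_hourly=0.25, emr_hourly=0.05
)
print(f"{cost:.2f}")
```

With these made-up rates, a 130-minute run is billed the same as a 180-minute run, which is why short jobs that finish just past an hour boundary can cost noticeably more per unit of work.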
Save Money with Reserved and Spot Instances
The Amazon EC2 prices above are for On-Demand Instances. On-Demand Instances are the most expensive but give you the most flexibility. EC2 also offers Reserved Instances and Spot Instances.
Reserved Instances give you the option to make a low, one-time payment for each instance you want to reserve and in turn receive a significant discount on the hourly charge for that instance. There are three Reserved Instance types (Light, Medium, and Heavy Utilization Reserved Instances) that enable you to balance the amount you pay upfront with your effective hourly price.
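The trade-off between the upfront payment and the effective hourly price can be sketched in a few lines of Python. This is a simplified model, assuming the discounted hourly charge applies only to hours actually used; the function names and all dollar amounts are hypothetical and do not correspond to any published Reserved Instance price.

```python
def effective_hourly_price(upfront_payment, reserved_hourly, hours_used):
    """Spread the one-time payment over the hours used and add the hourly charge."""
    if hours_used <= 0:
        raise ValueError("hours_used must be positive")
    return upfront_payment / hours_used + reserved_hourly


def break_even_hours(upfront_payment, reserved_hourly, on_demand_hourly):
    """Hours of use after which reserving becomes cheaper than On-Demand."""
    saving_per_hour = on_demand_hourly - reserved_hourly
    if saving_per_hour <= 0:
        raise ValueError("reserved hourly charge must be below the On-Demand price")
    return upfront_payment / saving_per_hour


# Hypothetical numbers: $100 upfront, $0.05/hour reserved, $0.10/hour On-Demand.
print(break_even_hours(100.0, 0.05, 0.10))
print(effective_hourly_price(100.0, 0.05, hours_used=4000))
```

Under this model, the more hours an instance runs, the closer the effective price gets to the discounted hourly charge, which is the reasoning behind offering several utilization levels.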
Spot Instances enable you to bid for unused Amazon EC2 capacity. Instances are charged the Spot Price, which is set by Amazon EC2 and fluctuates periodically depending on the supply of and demand for Spot Instance capacity. To use Spot Instances, you specify the maximum price you are willing to pay per instance hour. If your maximum price exceeds the current Spot Price, your request is fulfilled, and your instances run until you terminate them or the Spot Price rises above your maximum price, whichever happens first.
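The Spot rules described above can be illustrated with a small simulation. This is only a sketch under simplifying assumptions that are not part of the pricing page: the Spot Price is sampled once per hour, each hour is charged at that hour's Spot Price, and you never terminate the instance yourself. The function name and the price series are hypothetical.

```python
def simulate_spot_run(max_price, hourly_spot_prices):
    """Return (hours_run, total_charge) for a single Spot request."""
    hours_run = 0
    total_charge = 0.0
    for hour, spot_price in enumerate(hourly_spot_prices):
        # The request is fulfilled only if the maximum price exceeds
        # the Spot Price at the time of the request.
        if hour == 0 and max_price <= spot_price:
            break
        # A running instance stops once the Spot Price rises above the maximum.
        if spot_price > max_price:
            break
        hours_run += 1
        total_charge += spot_price  # charged the Spot Price, not the maximum price
    return hours_run, total_charge


# Hypothetical price series: the instance is interrupted in the third hour.
hours, charge = simulate_spot_run(max_price=0.05, hourly_spot_prices=[0.03, 0.04, 0.06, 0.03])
print(hours, round(charge, 2))
```

Note that the instance does not restart in the fourth hour of this example, because the sketch models a single request rather than a persistent one.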
"Amazon Elastic MapReduce with Spot Instances has made it easy to prototype and surprisingly cost-effective to scale, decreasing our data processing costs by over 50%." - VP of Engineering at Fliptop
To view more information and current prices for Reserved Instances and Spot Instances, see the Amazon EC2 pricing page.
Supported Instance Types
Standard Amazon EC2 Instances
Instances of this family are well suited for most applications.
Small Instance (Default): 1.7 GB of memory, 1 EC2 Compute Unit* (1 virtual core with 1 EC2 Compute Unit), 160 GB of instance storage, 32-bit platform
Medium Instance: 3.75 GB of memory, 2 EC2 Compute Units* (1 virtual core with 2 EC2 Compute Units), 410 GB of instance storage, 64-bit platform
Large Instance: 7.5 GB of memory, 4 EC2 Compute Units* (2 virtual cores with 2 EC2 Compute Units each), 850 GB of instance storage, 64-bit platform
Extra Large Instance: 15 GB of memory, 8 EC2 Compute Units* (4 virtual cores with 2 EC2 Compute Units each), 1690 GB of instance storage, 64-bit platform
High Memory Amazon EC2 Instances
Instances of this family offer large memory sizes for high throughput applications, including database and memory caching applications.
High-Memory Extra Large Instance: 17.1 GB of memory, 6.5 EC2 Compute Units* (2 virtual cores with 3.25 EC2 Compute Units each), 420 GB of instance storage, 64-bit platform
High-Memory Double Extra Large Instance: 34.2 GB of memory, 13 EC2 Compute Units* (4 virtual cores with 3.25 EC2 Compute Units each), 850 GB of instance storage, 64-bit platform
High-Memory Quadruple Extra Large Instance: 68.4 GB of memory, 26 EC2 Compute Units* (8 virtual cores with 3.25 EC2 Compute Units each), 1690 GB of instance storage, 64-bit platform
High CPU Amazon EC2 Instances
Instances of this family have proportionally more CPU resources than memory (RAM) and are well suited for compute-intensive applications.
High-CPU Medium Instance: 1.7 GB of memory, 5 EC2 Compute Units* (2 virtual cores with 2.5 EC2 Compute Units each), 350 GB of instance storage, 32-bit platform
High-CPU Extra Large Instance: 7 GB of memory, 20 EC2 Compute Units* (8 virtual cores with 2.5 EC2 Compute Units each), 1690 GB of instance storage, 64-bit platform
High Performance Computing Amazon EC2 Instances
Instances of this family combine large memory sizes and high CPU resources with 10 Gbps networking. They are well suited for high-performance, I/O-intensive applications, such as mapping genomes for scientific research, simulating aerospace and automotive designs for engineering activities, and mining data for business intelligence.
Cluster Compute Quadruple Extra Large: 23 GB of memory, 33.5 EC2 Compute Units*, 1690 GB of instance storage, 64-bit platform, 10 Gigabit Ethernet
Cluster Compute Eight Extra Large: 60.5 GB of memory, 88 EC2 Compute Units*, 3370 GB of instance storage, 64-bit platform, 10 Gigabit Ethernet
Cluster GPU Quadruple Extra Large: 22 GB of memory, 33.5 EC2 Compute Units*, 2 x NVIDIA Tesla “Fermi” M2050 GPUs, 1690 GB of instance storage, 64-bit platform, 10 Gigabit Ethernet
High I/O Amazon EC2 Instances
High I/O instances are ideal for high performance database applications such as HBase.
High I/O Four Extra Large: 60.5 GiB of memory, 35 EC2 Compute Units*, 2 SSD-based volumes each with 1024 GB of instance storage, 10 Gigabit Ethernet
High Storage Amazon EC2 Instances
High Storage instances are ideal for applications that require sequential access to very large data sets.
High Storage Eight Extra Large: 117 GB of memory, 35 EC2 Compute Units*, 48 TB of storage across 24 disks, 10 Gigabit Ethernet
*EC2 Compute Unit (ECU) – One EC2 Compute Unit (ECU) provides the equivalent CPU capacity of a 1.0-1.2 GHz 2007 Opteron or 2007 Xeon processor.
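For the instance types whose per-core figures are listed above, the total ECU value is the number of virtual cores multiplied by the ECUs per core. The short Python check below re-derives each total from the figures quoted on this page; the dictionary layout and names are illustrative only, and instance types listed without a per-core breakdown are left out.

```python
# name: (virtual cores, ECUs per core, total ECUs as listed on this page)
LISTED_INSTANCES = {
    "Small": (1, 1.0, 1.0),
    "Medium": (1, 2.0, 2.0),
    "Large": (2, 2.0, 4.0),
    "Extra Large": (4, 2.0, 8.0),
    "High-Memory Extra Large": (2, 3.25, 6.5),
    "High-Memory Double Extra Large": (4, 3.25, 13.0),
    "High-Memory Quadruple Extra Large": (8, 3.25, 26.0),
    "High-CPU Medium": (2, 2.5, 5.0),
    "High-CPU Extra Large": (8, 2.5, 20.0),
}


def total_ecu(virtual_cores, ecu_per_core):
    """Total compute capacity of an instance in EC2 Compute Units."""
    return virtual_cores * ecu_per_core


# Collect any instance whose listed total disagrees with cores x ECUs per core.
mismatches = [
    name
    for name, (cores, per_core, listed_total) in LISTED_INSTANCES.items()
    if total_ecu(cores, per_core) != listed_total
]
print(mismatches)
```

An empty list means every listed total agrees with its per-core breakdown.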
Other Pricing Details
Amazon S3 is billed separately. (Many customers store their input and output data in S3; others store all of the data locally on HDFS.) Currently it costs $668 per month to store 10 TB of data in S3 with reduced redundancy. The more data you store, the lower the monthly price per GB.
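To put the storage figure above in per-GB terms, the following Python sketch divides the quoted monthly cost by the amount stored. The page does not say whether 10 TB means 10,240 GB or 10,000 GB, so both readings are shown as assumptions; because the price per GB falls as volume grows, the result is a blended average rather than a single published rate.

```python
MONTHLY_COST_USD = 668.0  # figure quoted above for 10 TB, reduced redundancy
STORED_TB = 10


def effective_price_per_gb(monthly_cost, terabytes, gb_per_tb):
    """Average monthly storage price per GB across all pricing tiers."""
    return monthly_cost / (terabytes * gb_per_tb)


# Assumption 1: binary terabytes (1 TB = 1024 GB).
binary_rate = effective_price_per_gb(MONTHLY_COST_USD, STORED_TB, 1024)
# Assumption 2: decimal terabytes (1 TB = 1000 GB).
decimal_rate = effective_price_per_gb(MONTHLY_COST_USD, STORED_TB, 1000)

print(f"{binary_rate:.4f} USD per GB-month (1 TB = 1024 GB)")
print(f"{decimal_rate:.4f} USD per GB-month (1 TB = 1000 GB)")
```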
Amazon SimpleDB is also billed separately. (This applies only if you enable debugging for your cluster.)
AWS GovCloud (US) is an AWS Region designed to allow U.S. government agencies and contractors to move more sensitive workloads into the cloud by addressing their specific regulatory and compliance requirements. For pricing and more information on the AWS GovCloud (US) Region, please visit the AWS GovCloud (US) Pricing Page.
There is an additional charge for the MapR M5 and M7 distributions. See the MapR detail page for more information and current prices. There is no additional charge for MapR M3.