High Performance Computing (HPC) allows scientists and engineers to solve complex science, engineering, and business problems using applications that require high bandwidth, low latency networking, and very high compute capabilities. AWS allows you to increase the speed of research and to reduce costs by providing Cluster Compute or Cluster GPU servers on-demand. You have access to a full-bisection, high bandwidth network for tightly-coupled, IO-intensive workloads, which enables you to scale out across thousands of cores for throughput-oriented applications.
High Performance Computing (HPC) on Amazon Elastic Compute Cloud (EC2) is enabled by the Cluster Compute-optimized and GPU instance types (virtual machines). You can use them just like other EC2 instances, but they also have been specifically engineered to provide high performance networking and you can scale to tens of thousands of instances on-demand.
C3 instances provide you with the highest performance processors and the lowest price/compute performance compared to all other Amazon EC2 instances. I2 instances are engineered for extremely I/O intensive workloads. The largest I2 instance type can deliver over 365K random reads per second and over 315K random writes per second. We recommend using C3 instances for compute-intensive applications and the I2 instances for transactional systems.
GPU instances are ideally suited for 3D graphics applications, including modeling and simulation; and compute workloads, including computational chemistry, financial analysis, engineering design, and general purpose GPU computing.
AWS GPU instances provide access to NVIDIA GPUs each with up to 1,536 CUDA cores and 4GB of video memory. With the latest driver releases, these GPUs provide support for OpenGL, DirectX, CUDA, OpenCL, and the GRID SDK.
Cluster instances can be launched within a Placement Group. All instances launched within a Placement Group have low latency, full bisection, 10 Gbps bandwidth between instances. Like many other Amazon EC2 resources, Placement Groups are dynamic and are elastically scalable as needed. You can also connect multiple Placement Groups to create very large clusters for massively parallel processing.
AWS currently supports enhanced networking capabilities using SR-IOV (Single Root I/O Virtualization) for the C3 and I2 instance types. SR-IOV is a method of device virtualization that provides higher I/O performance and lower CPU utilization compared to traditional implementations. For supported Amazon EC2 instances, this feature provides higher packet per second (PPS) performance, lower inter-instance latencies, and very low network jitter.
Data has gravity; as datasets grow larger it becomes easier to move the compute closer to the data to reduce latency and increase throughput. With AWS big data storage and database services, such Amazon S3, Amazon Redshift, Amazon DynamoDB, and Amazon RDS, you have the perfect place to host your data for your HPC cluster. Furthermore, with Amazon Elastic Block Store (EBS) you can create large scale parallel filesystems to meet the high volume, performance, and throughput requirements of your HPC workload.
You can save time and money by leveraging Spot Instances for your HPC workloads. Spot Instances is a pricing model that enables you to bid on unused Amazon EC2 capacity at whatever price you choose. When your bid exceeds the Spot price, you gain access to the available Spot Instances and run as long as the bid exceeds the Spot Price. Historically, the Spot price has been 50% to 93% lower than the on-demand price.
AWS Marketplace is an online store that provides an easy way for developers and IT professionals to discover and use software to run in the AWS Cloud. You can find a selection of HPC software ready to run in your cluster, such as the Univa Grid Engine resource management system or the Intel Lustre HPC file system, with just a few clicks directly from the AWS Marketplace.