AWS HPC Blog
Tag: AWS ParallelCluster
Elastic visualization queues with NICE DCV in AWS ParallelCluster
In this blog post we’ll show you how to create an elastic pool of visualization nodes, by combining AWS ParallelCluster with NICE DCV in a novel way.
Checkpointing HPC applications using the Spot Instance two-minute notification from Amazon EC2
In this post we show you how to create an HPC cluster and capture the two-minute warning notifications from Amazon EC2 Spot to execute a checkpoint, reactively.
Install optimized software with Spack configs for AWS ParallelCluster
Today, we’re announcing the availability of Spack configs for AWS ParallelCluster. You can use these configurations to install optimized HPC applications quickly and easily on your AWS-powered HPC clusters.
Deploying Open OnDemand with AWS ParallelCluster
In this post, we describe an integration of Open OnDemand with AWS ParallelCluster so admins can provide web-based access to HPC resources beyond what they have at their site, by using the AWS cloud to add new capabilities and extend capacity.
Multiple Availability Zones now supported in AWS ParallelCluster 3.4
In AWS ParallelCluster 3.4, you can now build HPC clusters that span multiple Amazon EC2 Availability Zones. In this post, we describe how the new feature works, how to use it, and some implications for cluster design that it raises.
AWS ParallelCluster 3.3.0 now supports On-Demand Capacity Reservations
With #AWS #ParallelCluster 3.3, you can now easily take advantage of #EC2 On-Demand Capacity Reservations to help ensure your jobs have the capacity they need when they need it. This post describes the new feature and how you can benefit from it.
Second generation EFA: improving HPC and ML application performance in the cloud
Since launch, EFA has seen continuous improvements in performance. In this post, we talk about our 2nd generation of EFA, which takes another step in improving Machine Learning and High Performance Computing in the Cloud.
Launch self-supervised training jobs in the cloud with AWS ParallelCluster
In this post we describe the process to launch large, self-supervised training jobs using AWS ParallelCluster and Facebook’s Vision Self-Supervised Learning (VISSL) library.
Support for Instance Allocation Flexibility in AWS ParallelCluster 3.3
AWS ParallelCluster 3.3.0 now lets you define a list of Amazon EC2 instance types for resourcing a compute queue. This gives you more flexibility to optimize the cost and total time to solution of your HPC jobs, especially when capacity is limited or you’re using Spot Instances.
Easing your migration from SGE to Slurm in AWS ParallelCluster 3
This post will help you understand the tools available to ease the stress of migrating your cluster (and your users) from SGE to Slurm, which is necessary since the HPC community is no longer supporting SGE’s open-source codebase.