Posted On: May 22, 2023

AWS ParallelCluster 3.6 is now generally available. Key new features include support for automatic health checks for GPU instances and support for Red Hat Enterprise Linux (RHEL8). Other important features in this release include:

  1. Ability to customize Slurm settings not managed by ParallelCluster
  2. A programmatic interface to manage ParallelCluster using AWS CloudFormation
  3. Support for up to 50 queues and a total of 50 compute resources per cluster
  4. Tag-based cost monitoring in ParallelCluster UI
  5. Support for custom resource tags for queues, head node, and ParallelCluster-managed storage
  6. Extended Amazon CloudWatch metrics for disk usage, idle instances, and errors
  7. Improved head node resiliency with configurable log rotation

For more details on the release, review the AWS ParallelCluster 3.6 release notes.

AWS ParallelCluster is a fully-supported and maintained open-source cluster management tool that enables R&D customers and their IT administrators to operate high-performance computing (HPC) clusters on AWS. ParallelCluster is designed to automatically and securely provision cloud resources into elastically-scaling HPC clusters capable of running scientific, engineering, and machine-learning (ML/AI) workloads at scale on AWS.

ParallelCluster is available at no additional charge in the AWS Regions listed here, and you pay only for the AWS resources needed to run your applications. To learn more about launching HPC clusters on AWS, visit the AWS ParallelCluster User Guide. To start using ParallelCluster, see the installation instructions for ParallelCluster UI and CLI.