AWS HPC Blog

Brendan Bouffler

Author: Brendan Bouffler

Brendan Bouffler is the head of the Developer Relations in HPC Engineering at AWS. He’s been responsible for designing and building hundreds of HPC systems in all kind of environments, and joined AWS when it became clear to him that cloud would become the exceptional tool the global research & engineering community needed to bring on the discoveries that would change the world for us all. He holds a degree in Physics and an interest in testing several of its laws as they apply to bicycles. This has frequently resulted in hospitalization.

October was busy for HPC in the cloud

It’s been a busy month in the world of HPC on AWS: we’ve seen new data sets, refinements to cluster operations, and deeper thinking about how workloads map to infrastructure. For our customers driving R&D with HPC, those changes matter (and yes, the nerd in me is quietly excited). In today’s post, we’ll tell you […]

What’s the difference between AWS ParallelCluster and AWS Parallel Computing Service?

It’s been a year since we announced AWS Parallel Computing Service (PCS). In a way this is the third generation of Slurm-based HPC orchestrators that we’ve brought to you. We’ve learned much from helping customers deploy serious production workloads on AWS ParallelCluster, which itself grew from the foundations layed by CfnCluster – the open-source project […]

Announcing expanded support for Custom Slurm Settings in AWS Parallel Computing Service.png

Announcing expanded support for Custom Slurm Settings in AWS Parallel Computing Service

by Brendan Bouffler and Charunethran Panchalam Govindarajan on Permalink Share

Today we’re excited to announce expanded support for custom Slurm settings in AWS Parallel Computing Service (PCS). With this launch, PCS now enables you to configure over 65 Slurm parameters. And for the first time, you can also apply custom settings to queue resources, giving you partition-specific control over scheduling behavior. This release responds directly […]

Coming soon: dedicated HPC instances and hybrid functionality

This year, we’ve launched a lot of new capabilities for HPC customers, making AWS the best place for the length and breadth of their workflows. EFA went mainstream and is now available in sixteen instance families for fast fabric capabilities for scaling MPI and NCCL codes. We’ve written deep-dive studies to explore and explain the optimizations that will drive your workloads faster in the cloud than elsewhere. We released a major new version of AWS ParallelCluster with its own API for controlling the cluster lifecycle. AWS Batch became deeply integrated into AWS Step Functions and now supports fair-share scheduling, with multiple levers to control the experience. Today we’re signaling the arrival of a new HPC-dedicated instance family – the Hpc6a – and an enhanced EnginFrame that will bring the best of the cloud and on-premises together in a single interface.