AWS HPC Blog

Author: Brendan Bouffler

Brendan Bouffler is the head of the Developer Relations in HPC Engineering at AWS. He’s been responsible for designing and building hundreds of HPC systems in all kind of environments, and joined AWS when it became clear to him that cloud would become the exceptional tool the global research & engineering community needed to bring on the discoveries that would change the world for us all. He holds a degree in Physics and an interest in testing several of its laws as they apply to bicycles. This has frequently resulted in hospitalization.

October was busy for HPC in the cloud

It’s been a busy month in the world of HPC on AWS: we’ve seen new data sets, refinements to cluster operations, and deeper thinking about how workloads map to infrastructure. For our customers driving R&D with HPC, those changes matter (and yes, the nerd in me is quietly excited). In today’s post, we’ll tell you […]

What’s the difference between AWS ParallelCluster and AWS Parallel Computing Service?

It’s been a year since we announced AWS Parallel Computing Service (PCS). In a way this is the third generation of Slurm-based HPC orchestrators that we’ve brought to you. We’ve learned much from helping customers deploy serious production workloads on AWS ParallelCluster, which itself grew from the foundations layed by CfnCluster – the open-source project […]

Announcing expanded support for Custom Slurm Settings in AWS Parallel Computing Service

Today we’re excited to announce expanded support for custom Slurm settings in AWS Parallel Computing Service (PCS). With this launch, PCS now enables you to configure over 65 Slurm parameters. And for the first time, you can also apply custom settings to queue resources, giving you partition-specific control over scheduling behavior. This release responds directly […]

Three recipes you don’t want to miss for AWS Parallel Computing Service

AWS Parallel Computing Service now supports AWS CloudFormation, enabling you to deploy and scale HPC workloads as code. Check out our open-source HPC Recipes Library for quick cluster deployments.

Call for participation: HPC tutorial series from the HPCIC

Interested in getting hands-on experience with cutting-edge HPC tools? Check out this blog post on an upcoming virtual training series from @LLNL and @AWSCloud. Learn emerging technologies from the experts this August.

Announcing: Seqera Containers for the bioinformatics community

Genomics community: rejoice! Seqera and AWS have teamed up to announce Seqera Containers, an open-source, no cost, reliable way to generate containers.

Announcing the High Performance Software Foundation (HPSF)

We’re excited to share how we’re involved in launching the High Performance Software Foundation to increase access to and adoption of HPC. By bringing together key players to collaborate, we can lower barriers and accelerate development of portable HPC software stacks.

New: Research and Engineering Studio on AWS

Today we’re announcing Research and Engineering Studio on AWS, a self-service portal to help scientists and engineers access and manage virtual desktops to see their data and run their interactive applications in the cloud.

Call for participation: RADIUSS Tutorial Series 2023

Lawrence Livermore National Laboratory (LLNL) and AWS are again joining forces to provide a training opportunity for emerging HPC tools and application. In this post you’ll find out the details of those tutorials, and find out how to participate.

Coming soon: dedicated HPC instances and hybrid functionality

This year, we’ve launched a lot of new capabilities for HPC customers, making AWS the best place for the length and breadth of their workflows. EFA went mainstream and is now available in sixteen instance families for fast fabric capabilities for scaling MPI and NCCL codes. We’ve written deep-dive studies to explore and explain the optimizations that will drive your workloads faster in the cloud than elsewhere. We released a major new version of AWS ParallelCluster with its own API for controlling the cluster lifecycle. AWS Batch became deeply integrated into AWS Step Functions and now supports fair-share scheduling, with multiple levers to control the experience. Today we’re signaling the arrival of a new HPC-dedicated instance family – the Hpc6a – and an enhanced EnginFrame that will bring the best of the cloud and on-premises together in a single interface.

← Older posts