AWS HPC Blog

Tag: EFA

Cross-account HPC cluster monitoring using Amazon EventBridge

Managing extensive HPC workflows? This post details how to monitor resource consumption without compromising security. Check it out for a customizable reference architecture that sends only relevant data to your monitoring account.

Performance gains with AWS Graviton4 – a DevitoPRO case study

This post was contributed by Gerard Gorman from Devito, and Cyril Lagrange, Gilles Tourpe, and Theo Wu from AWS. The AWS Graviton4 processor represents a significant leap forward, with 96 Neoverse V2 cores and an enhanced memory subsystem. The 12 DDR5-5600 channels provide up to 75% more memory bandwidth than Graviton3, which is beneficial for […]

Recent improvement to Open MPI AllReduce and the impact to application performance

Our team engineered some Open MPI optimizations for EFA to enhance the performance of HPC codes running in the cloud. By improving MPI_Allreduce, we improved scaling, matching commercial MPIs. Tests show gains for apps including Code Saturne and OpenFOAM on both Arm64 and x86 instances. Check out how these tweaks can speed up your HPC workloads in the cloud.

Near-real-time energy production forecasts with NVIDIA Earth-2 and AWS Batch

Using AWS Batch and NVIDIA Earth-2, we built a scalable workflow that explores millions of scenarios at a fraction of the cost of traditional methods. This innovative approach not only provides rapid energy calculations, but also shows the potential of AI-driven meteorology.

Securing HPC on AWS – isolated clusters

In this post, we’ll share two ways customers can operate HPC workloads using AWS ParallelCluster while completely isolated from the Internet. ParallelCluster supports many different network configurations to cover a range of uses. When referring to isolation, we mean situations where your HPC cluster is completely self-contained inside AWS, or where you have a private […]