AWS HPC Blog
Tag: HPC
Scaling your LLM inference workloads: multi-node deployment with TensorRT-LLM and Triton on Amazon EKS
LLMs are scaling exponentially. Learn how advanced technologies like Triton, TRT-LLM and EKS enable seamless deployment of models like the 405B parameter Llama 3.1. Let’s go large.
Advancing research in the cloud: AWS announces expanded training resources
AWS is investing in researcher training with new learning plans for HPC, quantum, stats, AI/ML & generative AI. Check out the details!
On-demand visual login nodes – using RES with AWS Parallel Computing Service
Running large-scale modeling and simulations just got easier. Check out our new post on integrating AWS Parallel Computing Service with Research and Engineering Studio for individualized access to HPC resources.
Building a secure and compliant HPC environment on AWS following NIST SP 800-223
Check out our latest blog post to learn how AWS enables building secure, compliant high performance computing (HPC) environments aligned with NIST SP 800-223 guidelines. We walk through the key components, security considerations, and steps for deploying a zone-based HPC architecture on AWS.
Improve engineering productivity using AWS Engineering License Management
This post was contributed by Eran Brown, Principal Engagement Manager, Prototyping Team, Vedanth Srinivasan, Head of Solutions, Engineering & Design, Edmund Chute, Specialist SA, Solution Builder, Priyanka Mahankali, Senior specialist SA, Emerging Domains For engineering companies, the cost of Computer Aided Design and Engineering (CAD/CAE) tools can as high as 20% of product development cost. […]
Optimizing compute-intensive tasks on AWS
Optimizing workloads for performance and cost-effectiveness is crucial for businesses of all sizes – and especially helpful for workloads in the cloud, where there are a lot of levers you can pull to tune how things run. AWS offers a vast array of instance types in Amazon Elastic Compute Cloud (Amazon EC2) – each with […]
Cross-account HPC cluster monitoring using Amazon EventBridge
Managing extensive HPC workflows? This post details how to monitor resource consumption without compromising security. Check it out for a customizable reference architecture that sends only relevant data to your monitoring account.
Migration options for NICE EnginFrame Views customers
EnginFrame Views users: check out this post on migration options to maintain secure remote access to your HPC environment. As AWS sunsets NICE EnginFrame, alternatives built on Amazon DCV can provide a seamless transition.
Simulating complex systems with LLM-driven agents: leveraging AWS ParallelCluster for scalable AI experiments
How might AI change the rules of the energy game? A new post explores using large language models to power smarter, more adaptive agents in an energy supply chain simulation. Learn how LLMs could enable more nuanced decision-making behaviors.
Automotive component design at Nifco using generative AI and diffusion models
Combining generative AI with AWS services, Nifco USA is exploring new frontiers in structural design. See how they’re using diffusion models, SageMaker, and Batch to create game-changing lightweight auto parts.