PyTorch on AWS | AWS Compute Blog

Category: PyTorch on AWS

Accelerate CPU-based AI inference workloads using Intel AMX on Amazon EC2

This post shows you how to accelerate AI inference workloads by up to 76% on 8th generation Amazon Elastic Compute Cloud (Amazon EC2) instances using Intel Advanced Matrix Extensions (AMX), an accelerator that uses specialized hardware and instructions to perform matrix operations directly on processor cores. You’ll learn when CPU-based inference is cost-effective, how to enable AMX with minimal code changes, and which configurations deliver the best performance for your models.
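Before changing any inference code, it can help to confirm that the instance actually exposes AMX to the operating system. The following is a minimal sketch, not taken from the post: it assumes a Linux host, where the kernel lists the feature flags `amx_tile`, `amx_bf16`, and `amx_int8` in `/proc/cpuinfo`. The helper names `amx_flags_in` and `host_amx_flags` are hypothetical and chosen only for illustration.

```python
from pathlib import Path

# Linux kernel feature-flag names for Intel AMX as they appear in /proc/cpuinfo.
AMX_FLAGS = ("amx_tile", "amx_bf16", "amx_int8")


def amx_flags_in(cpuinfo_text: str) -> set:
    """Return the AMX flags found on the first 'flags' line of cpuinfo text."""
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "flags":
            return set(value.split()) & set(AMX_FLAGS)
    return set()


def host_amx_flags(path: str = "/proc/cpuinfo") -> set:
    """Read the host's cpuinfo; return an empty set if the file is unavailable."""
    try:
        return amx_flags_in(Path(path).read_text())
    except OSError:
        return set()


if __name__ == "__main__":
    found = host_amx_flags()
    print("AMX flags:", sorted(found) if found else "none detected")
```

If no flags are reported, the instance type or the kernel may not support AMX, and any framework-level settings for low-precision matrix kernels would likely have no effect.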
