AWS Neuron

SDK to optimize machine learning inference on AWS Inferentia chips

AWS Neuron is a software development kit (SDK) for running machine learning inference using AWS Inferentia chips. It consists of a compiler, run-time, and profiling tools that enable developers to run high-performance and low latency inference using AWS Inferentia-based Amazon EC2 Inf1 instances.

Developers will find AWS Neuron easy to integrate into their existing and future machine learning workflows because it is natively integrated with popular frameworks including TensorFlow, PyTorch and MXNet. Neuron is pre-installed in AWS Deep Learning AMIs and can also be installed in your custom environment without a framework. In addition, Neuron will be pre-installed in AWS Deep Learning Containers and Amazon SageMaker, the easiest way to be successful with machine learning.


How it works


Getting Started

Tutorials / how-to guides / application notes, and documentation are available on GitHub
For further assistance, a developer's forum is available via the AWS console or at: