AWS Neuron is a software development kit (SDK) for running machine learning inference using AWS Inferentia chips. It consists of a compiler, run-time, and profiling tools that enable developers to run high-performance and low latency inference using AWS Inferentia-based Amazon EC2 Inf1 instances.
Developers will find AWS Neuron easy to integrate into their existing and future machine learning workflows because it is natively integrated with popular frameworks including TensorFlow, PyTorch and MXNet. Neuron is pre-installed in AWS Deep Learning AMIs and can also be installed in your custom environment without a framework. In addition, Neuron will be pre-installed in AWS Deep Learning Containers and Amazon SageMaker, the easiest way to be successful with machine learning.