Amazon SageMaker Autopilot

Automatically create machine learning models with full visibility

Amazon SageMaker Autopilot automatically builds, trains, and tunes the best machine learning models based on your data, while allowing you to maintain full control and visibility.

Building machine learning (ML) models requires you to manually prepare features, test multiple algorithms, and optimize hundreds of model parameters in order to find the best model for your data. However, this approach requires deep ML expertise. If you don’t have that expertise, you could use an automated approach (AutoML), but AutoML approaches typically provide very little visibility into the impact of your features for model predictions. As a result, you may have less trust in it because you can’t recreate it and you can’t learn how it makes predictions.

Amazon SageMaker Autopilot eliminates the heavy lifting of building ML models, and helps you automatically build, train, and tune the best ML model based on your data. With SageMaker Autopilot, you simply provide a tabular dataset and select the target column to predict, which can be a number (such as a house price, called regression), or a category (such as spam/not spam, called classification). SageMaker Autopilot will automatically explore different solutions to find the best model. You then can directly deploy the model to production with just one click, or iterate on the recommended solutions with Amazon SageMaker Studio to further improve the model quality.

How it works

How Amazon SageMaker Autopilot works

Key features

Automatic data pre-processing and feature engineering

You can use Amazon SageMaker Autopilot even when you have missing data. SageMaker Autopilot automatically fills in the missing data, provides statistical insights about columns in your dataset, and automatically extracts information from non-numeric columns, such as date and time information from timestamps.

Automatic ML model selection

Amazon SageMaker Autopilot automatically infers the type of predictions that best suit your data, such as binary classification, multi-class classification, or regression. SageMaker Autopilot then explores high-performing algorithms such as gradient boosting decision tree, feedforward deep neural networks, and logistic regression, and trains and optimizes hundreds of models based on these algorithms to find the model that best fits your data.

Model leaderboard

Amazon SageMaker Autopilot allows you to review all the ML models that are automatically generated for your data. You can view the list of models, ranked by metrics such as accuracy, precision, recall, and area under the curve (AUC), review model details such as the impact of features on predictions, and deploy the model that is best suited to your use case.

Automatic notebook creation

You can automatically generate a Amazon SageMaker Studio Notebook for any model Amazon SageMaker Autopilot creates and dive into the details of how it was created, refine it as desired, and recreate it from the notebook at any point in the future.

Feature importance

Amazon SageMaker Autopilot provides an explainability report, generated by Amazon SageMaker Clarify, that makes it easier for you to understand and explain how models created with SageMaker Autopilot make predictions. You can also identify how each attribute in your training data contributes to the predicted result as a percentage. The higher the percentage, the more strongly that feature impacts your model’s predictions.

Easy integration with your applications

You can use the Amazon SageMaker Autopilot application programming interface (API) to easily create models and make inferences right from your applications, such as your data analytics and data warehousing tools.

Use Cases

Price predictions

Price prediction models are used heavily in financial services, real estate, and energy and utilities to predict the price of stocks, real estate, and natural resources. Amazon SageMaker Autopilot can predict future prices to help you make sound investment decisions based on your historical data such as demand, seasonal trends, and price of other commodities.

Churn prediction

Customer churn is the loss of customers or clients, and every company looks for ways to eliminate it. Models automatically generated by Amazon SageMaker Autopilot help you understand churn patterns. Churn prediction models work by first learning patterns in your existing data and identifying patterns in new datasets so you can get a prediction about customers mostly likely to churn.

Risk assessment

Risk assessment requires identifying and analyzing potential events that may negatively impact individuals, assets, and your company. Models automatically generated by Amazon SageMaker Autopilot predict risks as new events unfold. Risk assessment models are trained using your existing datasets so you can get optimized predictions for your business.


Skullcandy Inc.
"Sisense’s new ML service powered by Amazon SageMaker Autopilot was exactly what we needed to keep ahead of the curve in customer service during this COVID-19 pandemic. Skullcandy was able to gain deep insights into our customers’ needs, improve our issue resolution, and increase customer satisfaction scores."

Mark Hopkins, Chief Information Officer, Skullcandy Inc.

“Previously, we would simply pick two restaurants that looked similar, but now we have a true understanding of the relationships between our menu items, customers, and locations. Amazon SageMaker Autopilot, which powers Domo’s new ML capability, has been a force multiplier for our marketing and purchasing teams to try new ideas and improve our customers’ experience.”

Sean Thompson, IT Director, Freddy’s

"The primary goal in demographic mapping is optimizing across both accuracy and scale. While this is generally difficult, we were able to use Amazon SageMaker Autopilot with our comprehensive training data and sophisticated features to produce better models that improved our prediction accuracy by 137%."

Anindya Datta, CEO, Mobilewalla

"At RetentionX, we provide one-click business insights to e-commerce companies. To serve our customers, it is important that they can get started quickly and make timely business decisions, however building accurate machine learning models can be costly and take months of trial-and-error. In addition, model accuracy is also highly dependent on the breadth and depth of training data and unique feature set available for each of our customers. With the help of Amazon SageMaker Autopilot, our customers can automatically generate the best ML models based on unique datasets. Thanks to SageMaker Autopilot, we can provide personalized insights to tens of millions of shoppers tapping into the power of AutoML."

Alexander Jost, CEO, RetentionX

Resources for Amazon SageMaker Autopilot

Build high-quality machine learning models easily and quickly with Amazon SageMaker Autopilot (31:39)

Amazon SageMaker Autopilot