Posted On: Mar 22, 2023

Amazon SageMaker Data Wrangler now supports OAuth-based authentication with identity providers including Okta, Microsoft Azure AD, and Ping Federate to access data in Snowflake for machine learning (ML). Data Wrangler reduces the time it takes to aggregate and prepare data for ML from weeks to minutes using a visual interface in Amazon SageMaker Studio.

This launch supports customers who want to use a single identity provider to manage their users, groups, and access control across all applications, including Snowflake. Once your administrators configure Snowflake OAuth access for Data Wrangler, you can log in with your organization's identity provider when connecting from Data Wrangler to Snowflake to bring in data for ML. You can join data from other popular data sources such as Amazon S3, Amazon Athena, Amazon Redshift, Amazon EMR, and over 40 SaaS applications supported by Data Wrangler to create the right data set for ML. You can quickly understand data quality, clean the data, and create features with 300+ built-in analyses and data transformations using Data Wrangler’s visual interface. You can also train and deploy models with SageMaker Autopilot, and operationalize the data preparation process in a feature engineering or training pipeline through the integration with SageMaker Pipelines, all from Data Wrangler.

Data Wrangler supports Okta, Microsoft Azure AD, and Ping Federate for Snowflake connections in all AWS Regions currently supported by Data Wrangler, at no additional charge. To learn more, see this blog post and the AWS technical documentation.