AWS Glue now supports connecting Amazon SageMaker notebooks to development endpoints

Posted on: Oct 5, 2018

You can now create an Amazon SageMaker notebook from the AWS Glue Console and connect it to an AWS Glue development endpoint. With this integration, you can now use Amazon SageMaker’s fully managed notebooks instead of provisioning and managing your own notebook servers, making it easier and faster to start developing your AWS Glue ETL scripts. An AWS Glue development endpoint is a serverless Apache Spark environment that you can use to develop, debug, and test your AWS Glue ETL scripts in an interactive manner. To learn more, please visit our documentation.

Additionally, you can use the Amazon SageMaker Spark library on AWS Glue development endpoints. This library is an open-source Apache Spark library for Amazon SageMaker. It enables you to interleave Apache Spark stages and stages that interact with Amazon SageMaker in your Apache Spark ML Pipelines, allowing you to train models using Apache Spark DataFrames in Amazon SageMaker with Amazon-provided ML algorithms like K-Means clustering or XGBoost.

This feature is available in the AWS Regions US East (N. Virginia), US East (Ohio), US West (Oregon), EU (Frankfurt), EU (Ireland), Asia Pacific (Seoul), Asia Pacific (Sydney), and Asia Pacific (Tokyo). For AWS Glue availability, please visit the AWS Region table.