What does this AWS Solution do?

Machine Learning for Telecommunication deploys a scalable, customizable machine learning (ML) architecture that provides a framework for an end-to-end ML process including ad-hoc data exploration, data processing and feature engineering, and model training and evaluation.

The solution also includes a synthetic telecom IP Data Record (IPDR) dataset to demonstrate how to use ML algorithms to test and train models for predictive analysis in telecommunication. You can use the included Jupyter notebooks as a starting point to develop your own custom ML models, or you can customize the included notebooks for your own use case.

AWS Solution overview

The Machine Learning for Telecommunication solution helps you implement a framework for an end-to-end ML process on the AWS Cloud using Jupyter Notebook, an open source web application for creating and sharing live code, equations, visualizations and narrative text. The diagram below presents the architecture you can build in minutes using the solution's implementation guide and accompanying AWS CloudFormation template.

Machine Learning for Telecommunication solution architecture

An Amazon Simple Storage Service (Amazon S3) bucket holds a synthetic IP Data Record (IPDR) dataset, an AWS Glue job converts the dataset to Parquet format, and an Amazon SageMaker instance hosts the ML Jupyter notebooks.

The solution ingests data from the Amazon S3 bucket into the Amazon SageMaker instance and runs the Jupyter notebooks on the dataset.
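As a rough illustration of this ingestion step, the sketch below builds an Amazon S3 Select request for a Parquet object and fetches the matching rows with boto3. It is not taken from the solution's notebooks: the bucket name, object key, and column names are hypothetical placeholders, and the helper functions build_select_request and fetch_records are introduced here only for illustration.

```python
import json


def build_select_request(bucket, key, columns, limit=None):
    """Build keyword arguments for boto3's S3 select_object_content call.

    The bucket, key, and column names are supplied by the caller; the values
    used in the usage note are placeholders, not names from the solution.
    """
    expression = "SELECT {} FROM S3Object s".format(
        ", ".join("s." + column for column in columns)
    )
    if limit is not None:
        expression += " LIMIT {}".format(int(limit))
    return {
        "Bucket": bucket,
        "Key": key,
        "ExpressionType": "SQL",
        "Expression": expression,
        # Input is assumed to be a Parquet object; output is newline-delimited JSON.
        "InputSerialization": {"Parquet": {}},
        "OutputSerialization": {"JSON": {"RecordDelimiter": "\n"}},
    }


def fetch_records(request):
    """Run the request against S3 and return the rows as a list of dicts."""
    import boto3  # imported lazily so the request builder works without AWS access

    response = boto3.client("s3").select_object_content(**request)
    # The response payload is an event stream; only "Records" events carry data.
    payload = b"".join(
        event["Records"]["Payload"]
        for event in response["Payload"]
        if "Records" in event
    )
    return [json.loads(line) for line in payload.decode("utf-8").splitlines() if line]
```

From a notebook with suitable IAM permissions, a call might look like fetch_records(build_select_request("example-bucket", "ipdr/part-0000.parquet", ["bytes_up", "bytes_down"], limit=100)), with the placeholder names replaced by those of your own deployment.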

The notebooks preprocess the data, extract features, and divide the data into training and testing sets. Amazon S3 Select reads the Parquet-compressed data produced by the AWS Glue job. ML algorithms process the training dataset to develop a model that identifies existing anomalies and predicts future ones.
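The split-then-train flow described above can be sketched in a few lines of standard-library Python. This is a minimal illustration under stated assumptions, not the solution's method: the z-score threshold stands in for whatever ML algorithms the notebooks actually use, and the function names and sample values are hypothetical.

```python
import random
import statistics


def train_test_split(records, test_fraction=0.2, seed=0):
    """Shuffle the records reproducibly and return (train, test) lists."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def fit_threshold_model(train_values):
    """'Train' a stand-in model: the mean and population std of one feature."""
    return statistics.mean(train_values), statistics.pstdev(train_values)


def is_anomaly(value, model, z_cutoff=3.0):
    """Flag a value whose z-score against the training data exceeds the cutoff."""
    mean, std = model
    if std == 0:
        # All training values were identical; any deviation counts as anomalous.
        return value != mean
    return abs(value - mean) / std > z_cutoff


# Hypothetical feature values, e.g. bytes transferred per session.
train, test = train_test_split([98, 100, 102, 100, 99, 101, 100, 97, 103, 100])
model = fit_threshold_model(train)
flags = [is_anomaly(value, model) for value in test]
```

Fixing the seed keeps the split reproducible between notebook runs, which makes model comparisons easier to interpret.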

Machine Learning for Telecommunication

Version 1.1.0
Last updated: 06/2019
Author: AWS

Estimated deployment time: 5 min

Features

Machine Learning for Telecommunication reference implementation

Use the Machine Learning for Telecommunication solution out of the box, or as a reference implementation for building your own machine learning solution.

Synthetic dataset for training

This solution includes synthetic demo IP Data Record (IPDR) datasets in Abstract Syntax Notation One (ASN.1) format and call detail record (CDR) format.