Guidance for Monitoring High-Cardinality Telecom Metrics on AWS

This Guidance demonstrates how telecom operators can use Amazon CloudWatch to monitor key performance indicators (KPIs) for hardware devices, endpoints, services, and software whose metrics often have many unique values, that is, high cardinality. CloudWatch anomaly detection applies statistical and machine learning algorithms to CloudWatch metrics, calculates normal baselines, and surfaces anomalies with minimal user intervention.

Architecture Diagram

Guidance Architecture Diagram for Monitoring High-Cardinality Telecom Metrics on AWS

Step 1a
Every minute, an Amazon EventBridge rule invokes an AWS Lambda function that generates simulated telecommunications telemetry in the Amazon CloudWatch embedded metric format.
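As an illustration of Step 1a, the following is a minimal sketch of a Lambda handler that prints simulated telemetry as embedded metric format log lines. It is not the Guidance's sample code. The namespace Telecom/Simulated, the metric names CallDropRate and LatencyMs, the CellId property, and the helper build_emf_record are all assumptions chosen for illustration. The sketch keeps the high-cardinality CellId as a plain log property rather than a dimension, which is one possible design choice and not something the Guidance prescribes.

```python
import json
import random
import time


def build_emf_record(namespace, dimensions, metrics, properties=None, timestamp_ms=None):
    """Return one embedded-metric-format log line as a JSON string.

    dimensions: dict of dimension name -> value
    metrics:    dict of metric name -> (value, unit)
    properties: extra searchable fields that are not listed as metrics or dimensions
    """
    if timestamp_ms is None:
        timestamp_ms = int(time.time() * 1000)
    record = {
        "_aws": {
            "Timestamp": timestamp_ms,
            "CloudWatchMetrics": [
                {
                    "Namespace": namespace,
                    # One dimension set containing every dimension key.
                    "Dimensions": [list(dimensions)],
                    "Metrics": [
                        {"Name": name, "Unit": unit}
                        for name, (_value, unit) in metrics.items()
                    ],
                }
            ],
        }
    }
    # Dimension values, metric values, and properties are top-level members.
    record.update(properties or {})
    record.update(dimensions)
    record.update({name: value for name, (value, _unit) in metrics.items()})
    return json.dumps(record)


def lambda_handler(event, context):
    """Emit one simulated sample per hypothetical cell site to stdout."""
    cell_ids = ("cell-0001", "cell-0002")  # hypothetical device identifiers
    for cell_id in cell_ids:
        print(build_emf_record(
            namespace="Telecom/Simulated",  # assumed namespace
            dimensions={"Country": "USA", "State": "Washington", "City": "Seattle"},
            metrics={
                "CallDropRate": (round(random.uniform(0.0, 2.0), 3), "Percent"),
                "LatencyMs": (round(random.uniform(10.0, 80.0), 1), "Milliseconds"),
            },
            properties={"CellId": cell_id},
        ))
    return {"emitted": len(cell_ids)}
```

In Lambda, lines written to standard output are delivered to CloudWatch Logs, so printing the JSON line is enough for the sketch; outside Lambda, the same string could be sent through the CloudWatch agent or the PutLogEvents API.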

Step 1b
Real telecommunications telemetry is sent to CloudWatch in the CloudWatch embedded metric format through the CloudWatch agent or the PutLogEvents API. For a customer example, refer to "How BT uses Amazon CloudWatch to monitor millions of devices."

Step 2
CloudWatch recognizes the embedded metric format in incoming log events, then extracts and generates aggregated metrics asynchronously.

Step 3
CloudWatch alarms with anomaly detection are configured for each KPI. Composite alarms are also created for each geographic boundary, combining the individual KPI alarms.
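To make Step 3 concrete, here is a hedged sketch that builds the request parameters for an anomaly detection alarm and for a composite alarm that fires when any child alarm is in ALARM. The functions only return dictionaries; in a real deployment they could be passed as keyword arguments to the boto3 CloudWatch client's put_metric_alarm and put_composite_alarm calls. The alarm names, band width of 2, period, and the OR combination are assumptions, not values taken from the Guidance.

```python
def anomaly_alarm_params(alarm_name, namespace, metric_name, dimensions,
                         band_width=2, period=60, evaluation_periods=3):
    """Build kwargs for an alarm that fires when the metric leaves its expected band."""
    return {
        "AlarmName": alarm_name,
        "ComparisonOperator": "LessThanLowerOrGreaterThanUpperThreshold",
        "EvaluationPeriods": evaluation_periods,
        # The threshold is the anomaly detection band, not a static number.
        "ThresholdMetricId": "band",
        "Metrics": [
            {
                "Id": "m1",
                "ReturnData": True,
                "MetricStat": {
                    "Metric": {
                        "Namespace": namespace,
                        "MetricName": metric_name,
                        "Dimensions": [
                            {"Name": k, "Value": v} for k, v in dimensions.items()
                        ],
                    },
                    "Period": period,
                    "Stat": "Average",
                },
            },
            {
                "Id": "band",
                "Expression": f"ANOMALY_DETECTION_BAND(m1, {band_width})",
                "Label": "Expected range",
                "ReturnData": True,
            },
        ],
    }


def composite_alarm_params(alarm_name, child_alarm_names):
    """Build kwargs for a composite alarm that is in ALARM if any child is."""
    if not child_alarm_names:
        raise ValueError("a composite alarm needs at least one child alarm")
    rule = " OR ".join(f'ALARM("{name}")' for name in child_alarm_names)
    return {"AlarmName": alarm_name, "AlarmRule": rule}


# Hypothetical usage for one city and two KPIs.
kpi_alarms = [
    anomaly_alarm_params(f"Seattle-{kpi}-anomaly", "Telecom/Simulated", kpi,
                         {"City": "Seattle"})
    for kpi in ("CallDropRate", "LatencyMs")
]
city_alarm = composite_alarm_params(
    "Seattle-composite", [a["AlarmName"] for a in kpi_alarms]
)
```

A composite alarm for a larger boundary, such as a state, could in turn list the city-level composite alarms as its children.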

Step 4
One CloudWatch dashboard is created for each geographic level. For example, the USA, Washington State, and Seattle would each have their own dashboard, which shows the corresponding alarms, metrics, and logs for that level.
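The per-level dashboards in Step 4 could be generated from a template. The sketch below builds a dashboard body with one alarm status widget and one metric widget per KPI; the resulting JSON string is the kind of value that the PutDashboard API accepts as its dashboard body. The layout, the default Region us-west-2, the example ARN, and the helper names are illustrative assumptions.

```python
import json
import re


def dashboard_name(level_name):
    """Replace characters outside letters, digits, '-' and '_' so the name is valid."""
    return re.sub(r"[^A-Za-z0-9_-]+", "-", level_name).strip("-")


def dashboard_body(level_name, alarm_arns, namespace, metric_names, dimension,
                   region="us-west-2"):
    """Return a dashboard body (JSON string) for one geographic level."""
    dim_name, dim_value = dimension
    widgets = [
        {
            "type": "alarm",
            "x": 0, "y": 0, "width": 24, "height": 3,
            "properties": {"title": f"{level_name} alarms", "alarms": list(alarm_arns)},
        }
    ]
    # Lay out metric widgets two per row beneath the alarm widget.
    for i, metric in enumerate(metric_names):
        widgets.append({
            "type": "metric",
            "x": (i % 2) * 12, "y": 3 + (i // 2) * 6, "width": 12, "height": 6,
            "properties": {
                "title": metric,
                "region": region,
                "stat": "Average",
                "period": 60,
                "metrics": [[namespace, metric, dim_name, dim_value]],
            },
        })
    return json.dumps({"widgets": widgets})


# Hypothetical usage: one dashboard definition per geographic level.
levels = {"USA": ("Country", "USA"),
          "Washington State": ("State", "Washington"),
          "Seattle": ("City", "Seattle")}
dashboards = {
    dashboard_name(level): dashboard_body(
        level,
        alarm_arns=["arn:aws:cloudwatch:us-west-2:111122223333:alarm:example"],
        namespace="Telecom/Simulated",
        metric_names=["CallDropRate", "LatencyMs"],
        dimension=dim,
    )
    for level, dim in levels.items()
}
```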

Step 5
A geographic map is generated and displayed on the CloudWatch dashboards using custom widgets, which color-code the map according to alarm status. A custom widget is a CloudWatch dashboard widget that can call any Lambda function with custom parameters. It then displays the HTML or JSON that the function returns.
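The following sketch shows the general shape of a Lambda function behind a custom widget as described in Step 5. It returns HTML in which each region is a tile colored by alarm state. It is deliberately simplified: it draws colored tiles rather than a real map, and it reads alarm states from a hypothetical alarmStates widget parameter instead of querying CloudWatch. The color values and the handling of a describe request that returns short documentation are assumptions based on how custom widget functions are commonly written.

```python
import html

# Assumed color scheme for the three CloudWatch alarm states.
STATE_COLORS = {
    "OK": "#2e7d32",
    "ALARM": "#c62828",
    "INSUFFICIENT_DATA": "#9e9e9e",
}

DOCS = (
    "### Alarm tiles\n"
    "Renders one colored tile per region.\n\n"
    "Parameter `alarmStates`: mapping of region name to alarm state."
)


def render_tiles(alarm_states):
    """Return an HTML fragment with one colored tile per region, sorted by name."""
    tiles = []
    for region, state in sorted(alarm_states.items()):
        # Unknown states fall back to the INSUFFICIENT_DATA color.
        color = STATE_COLORS.get(state, STATE_COLORS["INSUFFICIENT_DATA"])
        tiles.append(
            f'<div style="display:inline-block;margin:4px;padding:8px;'
            f'color:#fff;background:{color}">'
            f"{html.escape(region)}: {html.escape(state)}</div>"
        )
    return "".join(tiles)


def lambda_handler(event, context):
    """Custom widget entry point: return documentation or the rendered HTML."""
    if event.get("describe"):
        return DOCS
    return render_tiles(event.get("alarmStates", {}))
```

A production version would more likely look up the current alarm states itself and emit an SVG map, but the contract is the same: the dashboard calls the function with the widget's parameters and displays what it returns.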

Well-Architected Pillars

The AWS Well-Architected Framework helps you understand the pros and cons of the decisions you make when building systems in the cloud. The six pillars of the Framework allow you to learn architectural best practices for designing and operating reliable, secure, efficient, cost-effective, and sustainable systems. Using the AWS Well-Architected Tool, available at no charge in the AWS Management Console, you can review your workloads against these best practices by answering a set of questions for each pillar.

The architecture diagram above is an example of a Solution created with Well-Architected best practices in mind. To be fully Well-Architected, you should follow as many Well-Architected best practices as possible.

Operational Excellence

CloudWatch collects and displays real-time logs, metrics, and event data in automated dashboards to streamline your infrastructure and application maintenance. Lambda extends CloudWatch dashboards with custom widgets.

Read the Operational Excellence whitepaper

Security

CloudWatch logs are compliant with Payment Card Industry (PCI) and Federal Risk and Authorization Management Program (FedRAMP). CloudWatch is also integrated with AWS Identity and Access Management (IAM) so you can control which users and resources can access your data, and set permissions for how your data is accessed. You can also define data protection policies to discover and protect sensitive data logged by systems and applications. Lastly, data in this Guidance is encrypted at rest and in transit.

Read the Security whitepaper

Reliability

To keep your applications running smoothly, use CloudWatch to gain system-wide visibility into resource utilization, application performance, and operational health. CloudWatch is a fundamental service that equips you to collect logs, gather metrics, and send notifications when thresholds are crossed or significant events occur.

Read the Reliability whitepaper

Performance Efficiency

CloudWatch is a monitoring and observability service that provides you with both data and actionable insights. It helps you monitor your workloads so you can better respond to changes in performance of your systems and processes.

Read the Performance Efficiency whitepaper

Cost Optimization

The CloudWatch embedded metric format allows you to asynchronously generate custom metrics in the form of logs that are written to CloudWatch Logs. CloudWatch then automatically creates aggregated metrics from those logs, which provides a cost-effective way to monitor resources. Additionally, because metrics are generated from the embedded metric format, you avoid the cost of calling the PutMetricData API.

Read the Cost Optimization whitepaper

Sustainability

CloudWatch streamlines data storage and removes the need to call the PutMetricData API to generate metrics, so no additional compute is required for that purpose. Because logs are compressed, the stored volume is estimated to be 15% of the log volume ingested. To further minimize your required resources, you can set a retention policy on individual log groups.

Read the Sustainability whitepaper

Implementation Resources

The sample code is a starting point. It is industry validated, prescriptive but not definitive, and a peek under the hood to help you begin.

Open sample code on GitHub
