AWS Cloud Operations Blog
Category: Management Tools
Build Cloud Operations Skills Using The New Getting Started with AWS Systems Manager Training
Are you looking for a solution that would help you simplify your operational tasks? Do you want to automate regular operational activities? Would you like to simplify patching your running instances? Those topics and more are covered in our Getting Started with AWS Systems Manager training, available today. AWS Systems Manager centralizes operational data from multiple […]
Visualizing Amazon CloudWatch Costs – Part 2 – Where does the data come from?
In part 1 of this series we explored an Amazon CloudWatch dashboard which provides a real-time view of some of the typical main contributors to CloudWatch costs. In this second post, we’ll look at how the CloudWatch dashboard widgets were created so that you can learn how to create something similar, or modify the widgets […]
Visualizing Amazon CloudWatch Costs – Part 1
Amazon CloudWatch monitors your AWS resources and the applications you run on AWS in real time. You can use CloudWatch to collect metrics, logs, and traces, set up alarms, create synthetic checks, and more. The information you collect lets you observe, validate, and alert on areas of interest to you. In this two-part post, we’ll explore a […]
Avoid patching failures due to low disk space with AWS Systems Manager Automation and CloudWatch alarms
Every organization has to keep its fleet up to date with patches while ensuring that the business and its workloads are not affected by patching. One of the challenges for operations teams is to patch at scale without affecting production software. The most common reasons workload patching fails are insufficient disk space, a spike in […]
Improving Mergers & Acquisitions IT Integration with AWS Application Discovery Service
The purpose of this post is to provide high-level guidance for Mergers & Acquisitions (M&A) stakeholders on how to incorporate AWS Application Discovery Service as part of integration planning and integration data discovery. This post is part of a series of technical content on how M&A integration teams can utilize Amazon Web Services (AWS) to […]
Enable cross-account queries on AWS CloudTrail Lake using delegated administration from AWS Organizations
We are excited to announce a new CloudTrail feature, which lets the management account of an organization configure up to 3 delegated administrators to manage the organization’s trails and Lake event data stores. A delegated administrator has permission to manage resources on behalf of the organization. Delegated administrator support enables flexibility for customers by allowing […]
The Importance of Key Performance Indicators (KPIs) for Large-Scale Cloud Migrations
Key performance indicators (KPIs) are quantifiable measurements that help you understand how well you’re performing in specific areas. For example, from an incident management perspective, you may measure the mean time to recovery to understand how long it takes to recover following an incident. Large-scale enterprise migration programs (such as vacating a data center or […]
AWS Cloud Operations Kiosks at AWS re:Invent 2022
The Expo on Day 3 of AWS on Wednesday, December 1, 2021 at the Venetian Resort in Las Vegas, Nevada. For most organizations, the question isn’t “if we move to the cloud” anymore; it’s “what do we move first?” and “how soon can we be operating in the cloud?” Wherever you are in your digital […]
Automate AIOps for your microservices in AWS using Amazon DevOps Guru and AWS Systems Manager Incident Manager
Artificial intelligence operations (AIOps) is the process of using machine learning techniques to solve operational problems. The goal of AIOps is to reduce human intervention in IT operations processes. By using advanced machine learning techniques, you can reduce operational incidents and increase service quality, and AIOps can help you predict incidents before they happen. Amazon […]
How to develop an Observability strategy – Part 2
Your observability strategy starts with your business. “Observability” describes how well you can understand what’s happening in a system. Developing an observability strategy isn’t a one-time effort. It’s a continuous improvement effort that occurs throughout the lifecycle of your workloads. It enables your teams to determine whether or not the workloads they design and run […]