AWS Cloud Operations Blog

Category: *Post Types

Getting Started with CloudWatch agent and collectd

Observability helps you understand the health, usage, performance, and customer experience for your workloads. Observability can support many use cases, from detecting incidents and supporting incident resolution, to understanding the impact of new features on your users and workflow. Establishing the right solution depends on being able to gather the right data for your situation. […]

Evaluate custom configurations using AWS Config Custom Policy rules and the open source sample repository

Does your organization have custom configuration requirements for your resources? Do you find it challenging to compare actual resource configuration settings against your configuration requirements? Today, you can leverage a new public repository of sample AWS Config custom rules using AWS CloudFormation Guard to help you address these challenges. AWS Config allows you to evaluate actual […]

Monitoring version compliance of Amazon Elastic Kubernetes Service by using AWS Config

Monitoring version compliance of Amazon Elastic Kubernetes Service by using AWS Config

Amazon Elastic Kubernetes Services (Amazon EKS) provides a managed Kubernetes service, simplifying cluster operations by offloading undifferentiated heavy lifting to AWS. With the Kubernetes release cycle of a new release every 4 months, customers have difficulty in keeping their EKS clusters up-to-date, especially across multiple AWS accounts. Additionally, keeping track of EKS version will aid your […]

Import existing AWS Control Tower accounts to Account Factory for Terraform

AWS Control Tower Account Factory for Terraform (AFT) allows customers to provision and customize their account in AWS Control Tower using Terraform. AFT can also import existing AWS Control Tower managed accounts into AFT management, allowing you to manage the global and account-specific customization at scale using Terraform. We hear from customers that they want […]

Gain actionable business insights with monitoring of Amazon MSK with Amazon Managed Service for Prometheus and Amazon Managed Grafana

Gain actionable business insights with monitoring of Amazon MSK with Amazon Managed Service for Prometheus and Amazon Managed Grafana

Introduction Monitoring is a critical aspect of maintaining the health and performance of any distributed system. In the case of Apache Kafka-based applications, configuring robust monitoring on kafka clusters becomes more crucial due to the real-time nature of data processing. This blog is intended for individuals or organizations utilizing Apache Kafka-based applications, specifically those facing […]

How to perform a Well-Architected Framework Review- Part 3

In previous blog posts, we discussed the first two phases for running a Well-Architected Framework Review, or WAFR. The first phase is to Prepare and the second phase in to conduct the Review. In this blog post, we dive deep into the third phase: Improve. Figure-1 WAFR Phases What is the Improve phase? At this […]

How to perform a Well-Architected Framework Review- Part 2

There are three phases to conduct a successful Well-Architected Framework Review or WAFR: Prepare, Review and Improve. In part 1 of this blog series, we discussed the preparation phase. In this part, we will dive deep into the best practices of the second phase, the actual review. Figure-1 WAFR Phases Assuming you follow the recommendations […]

How to perform a Well-Architected Framework Review- Part 1

Is my workload well-architected? Is my team following cloud best practices? How do other customers implement solution X? What is the best way to configure service Y? These are examples of questions I usually get from my customers who want to validate if their architecture is aligned with AWS best practices. The answers to these […]

Manage continuous compliance by using AWS Config Configuration Recorder resource type

AWS Config recently added support for configuration recorder as a resource type. The AWS::Config::ConfigurationRecorder resource is a configuration item (CI) for configuration recorder that tracks changes to the state of AWS Config configuration recorder (configuration recorder). You can use this CI to check if the state of the configuration recorder has changed (drifted), from its […]

Optimizing alarm lifecycle with Amazon CloudWatch Metrics Insights alarms

Optimizing alarm lifecycle with Amazon CloudWatch Metrics Insights alarms

Do you have entire fleets of dynamically changing resources that you are struggling to easily monitor and set alarm on? Do you have a ton of dangling alarms that you are paying for and that is cluttering your view? Are you looking for a simplified way to create alarms that automatically adjusts to resources that […]