AWS Cloud Operations Blog
Introducing Amazon EKS Observability Accelerator
Some of the details in this blog post are now outdated. For the latest information on the AWS Observability Accelerator please see Announcing AWS Observability Accelerator to configure comprehensive observability for Amazon EKS. Also explore the GitHub repository where you can find more details on how to get started. Observability is critical for any application […]
Proactive autoscaling of Kubernetes workloads with KEDA and Amazon CloudWatch
Container Orchestration platforms, such as Amazon Elastic Kubernetes Service (Amazon EKS), have simplified the process of building, securing, operating, and maintaining container-based applications. Therefore, they have helped organizations focus on building applications. Customers have started adopting event-driven deployment, allowing Kubernetes deployments to scale automatically in response to metrics from various sources dynamically. By implementing event-driven […]
Monitor Istio on EKS using Amazon Managed Prometheus and Amazon Managed Grafana
Service Meshes are an integral part of the Kubernetes environment that enables secure, reliable, and observable communication. Istio is an open-source service mesh that provides advanced network features without requiring any changes to the application code. These capabilities include service-to-service authentication, monitoring, and more. Istio generates detailed telemetry for all service communications within a mesh. This telemetry […]
Introducing vended metrics for Amazon Managed Service for Prometheus
Today, I’m happy to announce that Amazon Managed Service for Prometheus now vends usage metrics to Amazon CloudWatch. These metrics can be used to help you gain better visibility into your Amazon Managed Service for Prometheus workspace. Let’s dive in to see how you could use these new Prometheus usage metrics in CloudWatch. I‘ve set […]
Managing your application metadata using AWS Service Catalog App Registry
Customers need a way to track all of their AWS application resources in one place, and associate metadata like cost center, business unit with those resources centrally. AWS Service Catalog AppRegistry removes the need for complex tag management and allows for customers to aggregate application metadata such as cost center and business units across multiple […]
Integrating existing AWS CloudTrail configurations when launching AWS Control Tower
[Update] AWS Control Tower 3.0 now allows you to either select organization-level CloudTrail trails or opt out of Control Tower-managed trails, eliminating the need for the workaround mentioned in this blog. For additional details, refer to the Release notes. The customers that we work with often use multiple AWS accounts to meet their business needs. […]
DevOps automation for backup compliance in AWS using AWS Backup Audit Manager
Backup compliance in AWS includes defining and enforcing backup policies to encrypt your backups, protect them from manual deletion, prevent changes to your backup lifecycle settings, and audit and report on backup activity from a centralized console. AWS Backup Audit Manager, a feature within the AWS Backup service, provides built-in compliance controls for these areas. […]
How Expedia Group built Database as a Service (DBaaS) offering using AWS Service Catalog
Enabling agile application development teams to self-serve and quickly provision the resources that they need while adhering to the organization’s governance and controls can be challenging. In this post, we’ll explore Expedia Group’s Cerebro platform, a Database as a Service (DBaaS) offering built on AWS technologies. By using this platform, Expedia Group is able to […]
Managing cross-Region reports for AWS Marketplace and AWS Service Catalog resources
Organizations have many business reasons to track resource usage across their AWS environments. For example, management and administrative teams want to track operation expenditure, license governance, and asset tracking for their AWS Marketplace solutions across Regions currently in use. A centralized reporting dashboard allows the teams to access this information quickly and efficiently. This post […]
What is observability and Why does it matter? – Part 1
Before defining observability, consider the following example: You run an e-commerce site, and you’re interested in understanding the customer experience of the site, as well as how that translates into sales. You have identified that long page-loading times lead to poor customer experience, which in turn leads customers to abandon their carts and buy competing […]