AWS Cloud Operations Blog
Category: Intermediate (200)
Introducing vended logs for Amazon Managed Service for Prometheus
Customers are using Amazon Managed Service for Prometheus to monitor and alert on their container metrics. Amazon Managed Service for Prometheus ships with Alert Manager, the open source alert routing component in Prometheus. Alert manager routes alerts to Amazon Simple Notification Service (Amazon SNS). However, there are some common reasons why alert manager may fail […]
Move over IT, here comes innovation
Companies invest in research and development (R&D) to improve their cost structure, speed to market, and quality of offerings. R&D spend is highest in industries where research efforts are linked to product roadmaps, as is the case in the healthcare, pharmaceutical, and technology industries. Those typically allocate 10% to 15% of their revenues to innovation. […]
Developing enterprise applications on-demand with AWS Mainframe Modernization and Micro Focus
AWS Mainframe Modernization service delivers features and value across the entire mainframe application migration, modernization, execution, and operation lifecycle. The service enables two patterns – Re-platforming and Automated Refactoring. These are popular mainframe modernization patterns because they provide business and technical benefits in the short term with a predictable timeline and cost. The Micro Focus […]
Deliver Java JMX statistics to Amazon CloudWatch using the CloudWatch Agent and CollectD
A common problem customers face is alerting when their Java-based workloads experience performance issues, such as heap constraints. In this post, I’ll illustrate how relevant metrics from the Java Virtual Machine (JVM) can be collected and sent to Amazon CloudWatch, where customers can define alerts that fire when workloads are in jeopardy. Overview Let’s consider […]
Accelerate Modernization using AWS Migration Hub Refactor Spaces and AWS Proton
Refactoring legacy applications and infrastructure can be daunting. From navigating legacy codebase, identifying domains to decompose, where to start, what patterns to adopt, teams can quickly find themselves paralyzed even before they start. AWS Migration Hub Refactor Spaces is the new starting point for incremental app refactor that makes it easy to manage the refactoring […]
Use AWS Systems Manager Automation to create input parameters that populate AWS resources as a dropdown list
As a Solution Architect at AWS, my customers regularly ask how to automate everyday operations within their cloud environment. Their use cases include a variety of operational needs, such as provisioning new resources within an AWS account, and patching/updating managed Amazon Elastic Compute Cloud (Amazon EC2) instances. They are also focused on cost management with […]
How to use Resilience Hub’s Fault Injection Experiments to test application’s resilience
In this post, you’ll learn how to utilize AWS Fault Injection Simulator (AWS FIS) and AWS Resilience Hub to refactor a simple serverless application. Resilience Hub lets you define, validate, and track the resiliency of your AWS application. Resilience Hub integrates with AWS FIS, a chaos engineering service, to provide fault-injection simulations of real-world failures. These […]
Viewing Amazon CloudWatch metrics with Amazon Managed Service for Prometheus and Amazon Managed Grafana
Monitoring AWS services comprising of a customer workload with Amazon CloudWatch is important for resiliency of a workload. Customers can bring their CloudWatch data alongside their existing Prometheus data sources to improve their ability to join or query across for a holistic view of their systems. The Amazon Managed Service for Prometheus is a serverless […]
Governance Patterns to Manage Private Workloads through Cloud Operations Services
Introduction For enterprises, one of the larger obstacles when adopting and migrating to the cloud is how to establish a well-thought-out cloud governance model to meet internal or regulatory compliance requirements. One common inhibitor in the field is that enterprises seek to come up with a one-size-fits-all approach to cloud governance for all workloads. We […]
Validating and Improving the RTO and RPO Using AWS Resilience Hub
“Everything fails, all the time”, a famous quote from Werner Vogels, VP and CTO of Amazon.com. When you design and build an application, a typical goal is to have it working, the next is to keep it running, no matter what disruptions may occur. It is crucial to achieve resiliency, but you need to consider […]