AWS DevOps & Developer Productivity Blog

Feature Flag Orchestration with AWS DevOps Agent and LaunchDarkly

Introduction Organizations that use feature flags alongside incident response tooling often connect the two manually. When an outage occurs, engineers must identify which flags are relevant, decide whether to disable them, and coordinate the change across teams. This manual process adds latency at the moment it matters most. You can use AWS DevOps Agent and […]

Supercharge your cloud operations with the Kiro power for AWS DevOps Agent

When an alarm fires at 2 AM, the first thing most engineers do is grep logs, check recent deployments, and trace code paths. However, the context they need — metrics, traces, topology, configurations — lives in a separate browser tabs and applications. What if your IDE could bring that cloud intelligence directly to your code, […]

Accelerate Incident Resolution with PagerDuty and AWS DevOps Agent

When something breaks in production, you find out fast. Understanding why it broke, before the damage spreads, is the hard part. That is where Site Reliability Engineering (SRE) teams lose the most time. Think about the last time you got paged at 2 a.m. The alert said something broke, not why. You open four or […]

Production-Ready Autonomous Incident Resolution with AWS DevOps Agent (now GA) and Datadog MCP Server

This post was co-written with Bharadwaj Tanikella (AI/ML Product Engineering Leader) and Mohammad Jama (Product Marketing Manager) from Datadog. In December 2025, we showed how AWS DevOps Agent and Datadog MCP Server could work together to autonomously correlate monitoring data with the infrastructure deployed and configured on AWS to resolve incidents in minutes instead of […]

Diagnose EKS Node Issues Faster with AWS DevOps Agent and Custom MCP

AWS DevOps Agent can investigate a growing range of production incidents autonomously. It diagnoses CrashLoopBackOff failures, traces ConfigMap deletions through audit logs, and correlates Amazon CloudWatch metrics with cluster events — all without human intervention. But AWS DevOps Agent has a visibility boundary. When the data it needs lives outside its native integrations — on […]

Debug deployment failures faster with the Deployments tab in AWS Elastic Beanstalk

Introduction When a deployment fails, finding the root cause often means piecing together information from multiple sources. You wait for the deployment to finish, request a log bundle, download it, and then search through files like eb-engine.log and cfn-init.log to find the error. If you’re not familiar with Elastic Beanstalk’s log file structure, you might […]

How AWS DevOps Agent uses multi-agent reasoning to find root causes

Confirmation bias is one of the most common reasons incident investigations take longer than they should. An on-call engineer gets alerted, forms a theory based on initial triage and experience, finds one piece of supporting evidence, and stops looking. The actual root cause — buried in a different service, a different signal, a different time […]

Automate root cause analysis across Datadog and Elasticsearch with AWS DevOps Agent

Modern distributed systems route business transactions through dozens of microservices, message queues, and event streams. When a message fails to process or processing exceeds SLA thresholds, troubleshooting requires correlating logs from tools like Elasticsearch, metrics from Datadog, and infrastructure change events in AWS CloudTrail. Correlating these signals manually across heterogeneous backends, each with different query […]

Announcing AWS CDK Mixins: Composable Abstractions for AWS Resources

We are excited to announce CDK Mixins, a feature of the AWS Cloud Development Kit (CDK) that fundamentally changes how you compose and reuse infrastructure abstractions. In this post, you will learn how to use CDK Mixins to apply sophisticated features to any construct – whether L1, L2, or custom – without being locked into […]

Building Self-Extending CLI Tools with Strands Agent

Learn how to build CLI tools that extend themselves through natural language using Amazon Bedrock, the Strands Agents SDK, and Model Context Protocol. This post walks through a meta-tooling pattern where generated CLIs can create, refine, and version new commands at runtime—turning days of manual development into minutes of conversational iteration, all without touching source code.