AWS Compute Blog
Category: Auto Scaling
Faster scaling with Amazon EC2 Auto Scaling Target Tracking
This post is written by Shahad Choudhury, Senior Cloud Support Engineer and Tiago Souza, Solutions Architect Introduction One of the key benefits of the AWS cloud is elasticity. It enables our users to provision and pay only for resources they need. To fully use the elasticity benefits, users needed a mechanism that is automated and […]
Using zonal shift with Amazon EC2 Auto Scaling
This post is written by Michael Haken, Senior Principal Solutions Architect, AWS Today, we’re announcing support for zonal shift in Amazon EC2 Auto Scaling. Zonal shift allows you to rapidly recover from application impairments in a single Availability Zone (AZ) impacting your Auto Scaling Group (ASG) resources. In this post, we describe how performing an […]
Selecting cost effective capacity reservations for your business-critical workloads on Amazon EC2
This blog post is written by Sarath Krishnan, Senior Solutions Architect and Navdeep Singh, Senior Customer Solutions Manager. Amazon CTO Werner Vogels famously said, “everything fails all the time.” Designing your systems for failure is important for ensuring availability, scalability, fault tolerance and business continuity. Resilient systems scale with your business demand changes, prevent data […]
Reserving EC2 Capacity across Availability Zones by utilizing On Demand Capacity Reservations (ODCRs)
This post is written by Johan Hedlund, Senior Solutions Architect, Enterprise PUMA. Many customers have successfully migrated business critical legacy workloads to AWS, utilizing services such as Amazon Elastic Compute Cloud (Amazon EC2), Auto Scaling Groups (ASGs), as well as the use of Multiple Availability Zones (AZs), Regions for Business Continuity, and High Availability. These […]
Best practices to optimize your Amazon EC2 Spot Instances usage
This blog post is written by Pranaya Anshu, EC2 PMM, and Sid Ambatipudi, EC2 Compute GTM Specialist. Amazon EC2 Spot Instances are a powerful tool that thousands of customers use to optimize their compute costs. The National Football League (NFL) is an example of customer using Spot Instances, leveraging 4000 EC2 Spot Instances across more […]
Building diversified and cost-optimized EC2 server groups in Spinnaker
This blog post is written by Sandeep Palavalasa, Sr. Specialist Containers SA, and Prathibha Datta-Kumar, Software Development Engineer Spinnaker is an open source continuous delivery platform created by Netflix for releasing software changes rapidly and reliably. It enables teams to automate deployments into pipelines that are run whenever a new version is released with proven […]
How to create custom health checks for your Amazon EC2 Auto Scaling Fleet
This blog post is written by Gaurav Verma, Cloud Infrastructure Architect, Professional Services AWS. Amazon EC2 Auto Scaling helps you maintain application availability and lets you automatically add or remove Amazon Elastic Compute Cloud (Amazon EC2) instances according to the conditions that you define. You can use dynamic and predictive scaling to scale-out and scale-in […]
Scaling an ASG using target tracking with a dynamic SQS target
This blog post is written by Wassim Benhallam, Sr Cloud Application Architect AWS WWCO ProServe, and Rajesh Kesaraju, Sr. Specialist Solution Architect, EC2 Flexible Compute. Scaling an Amazon EC2 Auto Scaling group based on Amazon Simple Queue Service (Amazon SQS) is a commonly used design pattern in decoupled applications. For example, an EC2 Auto Scaling […]
Adopt Recommendations and Monitor Predictive Scaling for Optimal Compute Capacity
This post is written by Ankur Sethi, Sr. Product Manager, EC2, and Kinnar Sen, Sr. Specialist Solution Architect, AWS Compute. Amazon EC2 Auto Scaling helps customers optimize their Amazon EC2 capacity by dynamically responding to varying demand. Based on customer feedback, we enhanced the scaling experience with the launch of predictive scaling policies. Predictive scaling […]
Enabling load-balancing of non-HTTP(s) traffic on AWS Wavelength
This blog post is written by Jack Chen, Telco Solutions Architect, and Robert Belson, Developer Advocate. AWS Wavelength embeds AWS compute and storage services within 5G networks, providing mobile edge computing infrastructure for developing, deploying, and scaling ultra-low-latency applications. AWS recently introduced support for Application Load Balancer (ALB) in AWS Wavelength zones. Although ALB addresses […]