AWS Compute Blog

Solution architecture

Choosing between storage mechanisms for ML inferencing with AWS Lambda

This post is written by Veda Raman, SA Serverless, Casey Gerena, Sr Lab Engineer, Dan Fox, Principal Serverless SA. For real-time machine learning inferencing, customers often have several machine learning models trained for specific use-cases. For each inference request, the model must be chosen dynamically based on the input parameters. This blog post walks through the architecture […]

Each U30 accelerator card comes with two XCU30 Zynq UltraScale+ SoCs

Deep Dive on Amazon EC2 VT1 Instances

This post is written by: Amr Ragab, Senior Solutions Architect; Bryan Samis, Principal Elemental SSA; Leif Reinert, Senior Product Manager. We at AWS are excited to announce that new Amazon Elastic Compute Cloud (Amazon EC2) VT1 instances are now generally available in the US-East (N. Virginia), US-West (Oregon), Europe (Ireland), and Asia Pacific (Tokyo) […]

AWS architecture diagram for self-hosted runner.

Building ARM64 applications on AWS Graviton2 using the AWS CDK and Self-Hosted Runners for GitHub Actions

This post is written by Frank Dallezotte, Sr. Technical Account Manager, and Maxwell Moon, Sr. Solutions Architect. AWS Graviton2 processors are custom built by AWS using 64-bit Arm Neoverse cores to deliver great price performance for workloads running in Amazon Elastic Compute Cloud (Amazon EC2). These instances are powered by 64 physical core AWS […]

Step Functions workflow

Build workflows for Amazon Forecast with AWS Step Functions

This post shows how to create a Step Functions workflow for Forecast using AWS SDK service integrations, which let you call the API actions of over 200 AWS services directly from a workflow. It shows two patterns for handling asynchronous tasks. The first pattern queries the describe-* API repeatedly and the second pattern uses the “Retry” option. These integrations simplify workflow development because in many cases they can replace Lambda functions.
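As a rough illustration of the first pattern only (repeatedly querying a describe-* API), the sketch below builds an Amazon States Language definition as a Python dictionary. It is not taken from the post itself: the helper name `build_polling_definition`, the state names, the input field `DatasetImportJobArn`, and the choice of the `describeDatasetImportJob` action are all assumptions made for the example.

```python
import json


def build_polling_definition(wait_seconds: int = 60) -> dict:
    """Hypothetical helper: ASL definition that polls a Forecast describe-* API.

    State names and the input field name are illustrative assumptions.
    """
    return {
        "StartAt": "DescribeImportJob",
        "States": {
            "DescribeImportJob": {
                "Type": "Task",
                # AWS SDK service integration ARN format:
                # arn:aws:states:::aws-sdk:<service>:<apiAction>
                "Resource": "arn:aws:states:::aws-sdk:forecast:describeDatasetImportJob",
                "Parameters": {"DatasetImportJobArn.$": "$.DatasetImportJobArn"},
                # Keep the original input and store the API response beside it.
                "ResultPath": "$.Describe",
                "Next": "CheckStatus",
            },
            "CheckStatus": {
                "Type": "Choice",
                "Choices": [
                    {"Variable": "$.Describe.Status", "StringEquals": "ACTIVE", "Next": "Done"},
                    {"Variable": "$.Describe.Status", "StringEquals": "CREATE_FAILED", "Next": "Failed"},
                ],
                # Any other status means the job is still running: wait, then poll again.
                "Default": "WaitBeforePolling",
            },
            "WaitBeforePolling": {
                "Type": "Wait",
                "Seconds": wait_seconds,
                "Next": "DescribeImportJob",
            },
            "Done": {"Type": "Succeed"},
            "Failed": {"Type": "Fail", "Error": "ImportJobFailed"},
        },
    }


if __name__ == "__main__":
    # Print the definition as JSON, ready to paste into a state machine.
    print(json.dumps(build_polling_definition(wait_seconds=30), indent=2))
```

The Choice state loops back through a Wait state until the job reports a terminal status, so no Lambda function is needed just to check progress. The second pattern (the “Retry” option) is not sketched here.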

Accelerating serverless development with AWS SAM Accelerate

Building a serverless application changes the way developers think about testing their code. Previously, developers would emulate the complete infrastructure locally and only commit code ready for testing. However, with serverless, local emulation can be more complex. In this post, I show you how to bypass most local emulation by testing serverless applications in the […]

The architecture of the solution. It shows an EC2 instance of the G4 family deployed in a public subnet. The EC2 instance communicates with S3. Also shown is how a security group controls access from users to the EC2 instance.

Use Amazon EC2 for cost-efficient cloud gaming with pay-as-you-go pricing

July 2025: This post was reviewed for accuracy. Cloud gaming enables access to high-performance gaming without upfront hardware investment, using pay-as-you-go pricing instead. Cloud gaming platforms such as Amazon Luna are an entryway, but users are limited to the games available on the service. Furthermore, many users also prefer to own their games, or they […]

Performance graph

Monitoring and tuning federated GraphQL performance on AWS Lambda

There are multiple factors to consider when tuning a federated GraphQL system. You must be aware of trade-offs when deciding on factors like the runtime environment of Lambda functions. An extensive testing strategy can help you scale systems and narrow down issues quickly. Well-defined testing can also keep pipelines clean of false-positive blockages.