Posted On: Apr 7, 2023

AWS Lambda functions can now progressively stream response payloads back to the client, including payloads larger than 6MB, helping you improve performance for web and mobile applications. AWS Lambda is a serverless compute service that lets you run code without provisioning or managing infrastructure.

Before today, Lambda-based applications using the traditional request-response invocation model had to fully generate and buffer the response before returning it to the client, which could delay the time to first byte. With response streaming, functions can send partial responses back to the client as they become ready, significantly improving the time to first byte, which web and mobile applications are especially sensitive to.

Response streaming currently supports Node.js 14.x and newer runtimes. You can also stream responses with custom runtimes. You can stream responses via the Lambda API, AWS SDK, and Lambda function URLs, including as an Amazon CloudFront origin. Response streaming is available in the following AWS Regions: US East (Ohio), US East (N. Virginia), US West (N. California), US West (Oregon), Canada (Central), Europe (Ireland), Europe (London), Europe (Paris), Europe (Frankfurt), Europe (Stockholm), Europe (Milan), Middle East (Bahrain), Africa (Cape Town), Asia Pacific (Hong Kong), Asia Pacific (Tokyo), Asia Pacific (Osaka), Asia Pacific (Seoul), Asia Pacific (Singapore), Asia Pacific (Sydney), Asia Pacific (Jakarta), Asia Pacific (Mumbai), South America (São Paulo).

To learn more and get started, please see the launch blog post. To learn more about pricing, visit the AWS Lambda Pricing page.