AWS Storage Blog
Category: Compute
Bridge legacy and modern applications with Amazon S3 Access Points for Amazon FSx
Organizations rely on file storage accessed from traditional, file-based, applications while simultaneously wanting to build modern, cloud-native applications and services that access the same underlying data. Consequently, many cloud-native apps are built to work with Amazon S3. Amazon Web Services (AWS) recently introduced a new capability, S3 Access Points for Amazon FSx which solves challenges […]
Automatically decompress files in Amazon S3 using AWS Step Functions
Every day, AWS customers process millions of compressed files in Amazon S3, from small ZIP archives to multi-gigabyte datasets. While decompressing a single file is straightforward, processing thousands of files efficiently requires complex orchestration, error handling, and infrastructure management. Consider this scenario: Your organization receives over 10,000 compressed files daily from partners, ranging from 5 […]
Enhancing co-located Kubernetes Pod data access with Amazon EBS Node-Local volumes
Modern containerized applications running on Kubernetes enable organizations to deploy read-heavy workloads—such as machine learning inference, data analytics, and High Performance Computing (HPC)—at unprecedented scale. However, when multiple pods on the same node need to access identical datasets, a performance challenge emerges. Each pod typically fetches files from external storage independently over the network. This […]
Automated extraction of compressed files on Amazon S3 using AWS Batch and Amazon ECS
Organizations frequently upload compressed TAR files to Amazon S3 for efficient data transfer, but downstream applications often need extracted files for processing. Although AWS Glue excels at processing splittable files across worker nodes, TAR files need single-node processing, traditionally forcing teams to manually provision servers, monitor extraction jobs, and manage resource cleanup. This post demonstrates […]
Getting started with self-managed Oracle in AWS using Amazon FSx for OpenZFS
Organizations of all sizes run their enterprise applications and databases in the cloud. These organizations may choose to run self-managed databases on Amazon Elastic Compute Cloud (Amazon EC2) rather than using the fully-managed Amazon Relational Database Service (Amazon RDS) due to internal policies, Amazon RDS service maximums, and other reasons. When running self-managed databases in the cloud, there […]
Optimize WordPress performance on Amazon EKS with Amazon FSx for OpenZFS
As users progress in their cloud journey, they increasingly need robust storage options that integrate natively with containers to help them increase operational efficiency, improve performance, and reduce costs. Our users are finding that using Amazon Elastic Kubernetes Service (Amazon EKS) meets this demand by using the Container Storage Interface (CSI) driver. In this post, […]
PingCAP increased TiDB Cloud stability using Amazon EBS detailed performance statistics
PingCAP is a global company focused on developing distributed, high-performance, and auto scaling relational databases. Its flagship product, TiDB, is a popular open source databases, and TiDB Cloud, a fully managed Database-as-a-Service (DBaaS) based on TiDB, delivers horizontal scalability, strong consistency, and high availability for high-performance applications. Many PingCAP customers run highly latency-sensitive workloads such […]
How to use Amazon S3 Multi-Region Access Points to streamline and reduce the cost of writing across AWS Regions
Large global organizations often struggle to efficiently manage data copies across different geographic regions when using distributed object storage services. Although several approaches exist for cross-region data writing, common solutions such as data replication or streaming can be costly and introduce latency issues. Many customers have core services deployed globally across multiple Amazon Web Services […]
Accelerating Amazon S3 Batch Operations at scale with on-demand manifest generation
Modern enterprises routinely manage billions of objects across their cloud storage environments, needing efficient bulk operations for disaster recovery, compliance management, data transfer, and cost optimization. Performing these operations manually or through custom scripts becomes impractical at scale, often creating operational bottlenecks when time-sensitive actions are necessary. Organizations frequently need to identify and process specific […]
Improve Kubernetes pod scheduling accuracy using Amazon EBS
In the cloud-native landscape, Amazon Elastic Block Store (Amazon EBS) volumes serve as the backbone for persistent storage in containerized applications. As organizations scale their Kubernetes workloads on Amazon Elastic Kubernetes Service (Amazon EKS), they increasingly rely on EBS volumes to provide high-performance, durable storage for stateful applications such as databases, message queues, and data […]





