AWS Storage Blog

Automated cost-effective archiving and on-demand data restoration

Organizations across industries need automated, cost-effective archiving and on-demand data restoration solutions to manage explosive data growth driven by digital transformation, regulatory compliance, and operational insights. This data, often stored as unstructured files, is frequently retained for extended periods to meet internal, legal, or analytical requirements. As storage volumes grow into the petabyte range, businesses face a […]

Amazon FSx for OpenZFS

Getting started with self-managed Oracle in AWS using Amazon FSx for OpenZFS

Organizations of all sizes run their enterprise applications and databases in the cloud. These organizations may choose to run self-managed databases on Amazon Elastic Compute Cloud (Amazon EC2) rather than using the fully managed Amazon Relational Database Service (Amazon RDS) because of internal policies, Amazon RDS service maximums, or other reasons. When running self-managed databases in the cloud, there […]

Amazon FSx for NetApp ONTAP

Cost-optimized file storage with Amazon FSx for NetApp ONTAP and Komprise

As enterprises pursue digital transformation and smart operations, they’re challenged by the limitations of traditional file systems. Machine-generated data from connected systems and automation has pushed legacy storage solutions beyond their capabilities. In manufacturing, healthcare, logistics, financial services, and other industries, organizations need reliable access to data across globally distributed locations. These organizations face rising […]

Encrypt AWS Backup logically air-gapped vaults with customer-managed keys

Organizations in regulated industries often mandate control over encryption keys when storing data in the cloud to meet compliance requirements. Although AWS Backup logically air-gapped vault provides secure, isolated backup storage, these customers have needed the ability to use their own AWS Key Management Service (AWS KMS) customer-managed keys (CMKs) to provide greater control of […]

Build intelligent ETL pipelines using AWS Model Context Protocol and Amazon Q

Data scientists and engineers spend hours writing complex data pipelines to extract, transform, and load (ETL) data from various sources into their data lakes, integrating that data and creating unified data models to build business insights. The process involves understanding the source and target systems, discovering schemas, mapping source and target, writing and testing ETL […]

Test and build application resilience using Amazon EBS latency injection

As businesses strive to build highly available applications, they must prevent disruptions that can lead to downtime and revenue loss. Robust monitoring systems help identify failures proactively, but chaos engineering has emerged as a systematic approach to building resilient systems by uncovering potential issues before they become outages. Chaos engineering is especially critical for storage […]

Optimizing recommendations and analytics using Amazon DynamoDB and Amazon S3

Today, consumers navigate thousands of products on e-commerce sites, hundreds of shows on streaming platforms, and countless options in digital marketplaces. This choice overload creates decision fatigue, yet consumers continue to demand more variety and make more purchases online. As a result, personalization has become essential: consumers reward brands that deliver relevant, tailored online experiences. However, […]

Cross-account Amazon S3 bulk transfers with enhanced AWS KMS support

Cross-account Amazon S3 bulk transfers with enhanced AWS Key Management Service (AWS KMS) support become increasingly critical as organizations grow and accumulate vast amounts of digital assets across their enterprise. Managing millions or even billions of files presents unique challenges, especially when these files need to be moved securely between different AWS accounts. Operations such […]

Derive intelligent storage insights using S3 Metadata and Model Context Protocol (MCP)

Organizations face mounting challenges in managing and operationalizing their ever-growing data assets for machine learning and analytics workflows. When dealing with billions or even trillions of objects, teams struggle to know what data they have and to locate specific datasets efficiently. Without proper data discovery and metadata management, teams spend valuable time searching for relevant […]

Accelerating Amazon S3 Batch Operations at scale with on-demand manifest generation

Modern enterprises routinely manage billions of objects across their cloud storage environments, needing efficient bulk operations for disaster recovery, compliance management, data transfer, and cost optimization. Performing these operations manually or through custom scripts becomes impractical at scale, often creating operational bottlenecks when time-sensitive actions are necessary. Organizations frequently need to identify and process specific […]