AWS Cloud Storage | AWS Storage Blog

Build intelligent ETL pipelines using AWS Model Context Protocol and Amazon Q

Data scientists and engineers spend hours writing complex pipelines that extract, transform, and load (ETL) data from various sources into their data lakes, both to integrate data and to create unified data models that yield business insights. The process involves understanding the source and target systems, discovering schemas, mapping source to target, writing and testing ETL […]

Test and build application resilience using Amazon EBS latency injection

As businesses strive to build highly available applications, they must prevent disruptions that can lead to downtime and revenue loss. Robust monitoring systems help identify failures proactively, but chaos engineering has emerged as a systematic approach to building resilient systems by uncovering potential issues before they become outages. Chaos engineering is especially critical for storage […]

Optimizing recommendations and analytics using Amazon DynamoDB and Amazon S3

Today, consumers navigate thousands of products on e-commerce sites, hundreds of shows on streaming platforms, and countless options in digital marketplaces. This choice overload creates decision fatigue, yet consumers continue to demand more variety and make more purchases online. As a result, personalization has become essential—consumers reward brands that deliver relevant, tailored online experiences. However, […]

Cross-account Amazon S3 bulk transfers with enhanced AWS KMS support

Cross-account Amazon S3 bulk transfers with enhanced AWS Key Management Service (AWS KMS) support become increasingly critical as organizations grow and accumulate vast amounts of digital assets across their enterprise. Managing millions or even billions of files presents unique challenges, especially when these files need to be moved securely between different AWS accounts. Operations such […]

Derive intelligent storage insights using S3 Metadata and Model Context Protocol (MCP)

Organizations face mounting challenges in managing and operationalizing their ever-growing data assets for machine learning and analytics workflows. When dealing with billions or even trillions of objects, teams struggle to determine what data they have and to locate specific datasets efficiently. Without proper data discovery and metadata management, teams spend valuable time searching for relevant […]

Accelerating Amazon S3 Batch Operations at scale with on-demand manifest generation

Modern enterprises routinely manage billions of objects across their cloud storage environments, needing efficient bulk operations for disaster recovery, compliance management, data transfer, and cost optimization. Performing these operations manually or through custom scripts becomes impractical at scale, often creating operational bottlenecks when time-sensitive actions are necessary. Organizations frequently need to identify and process specific […]

Improve Kubernetes pod scheduling accuracy using Amazon EBS

In the cloud-native landscape, Amazon Elastic Block Store (Amazon EBS) volumes serve as the backbone for persistent storage in containerized applications. As organizations scale their Kubernetes workloads on Amazon Elastic Kubernetes Service (Amazon EKS), they increasingly rely on EBS volumes to provide high-performance, durable storage for stateful applications such as databases, message queues, and data […]

Building self-managed RAG applications with Amazon EKS and Amazon S3 Vectors

Retrieval-Augmented Generation (RAG) is a technique that optimizes large language model (LLM) outputs by referencing authoritative knowledge bases outside of the model’s training data before generating responses. This addresses common limitations of traditional LLMs, such as outdated knowledge, hallucinated facts, and misinterpreted terminology. Organizations can implement RAG to enhance their generative AI applications with current, […]

Enhance your SMB file transfer security with Kerberos and AWS DataSync

Businesses replicate data from on-premises file shares to the cloud to power analytics processes, enable migrations, or free up archival storage. When authenticating to Windows Server Message Block (SMB) file shares, the NTLM protocol has been ubiquitous for decades. However, Microsoft announced its deprecation in 2024. Whether your business operates in a highly regulated industry […]

Simplify cross-account storage management with Amazon EFS and Amazon EKS

Organizations are increasingly adopting a multi-account Amazon Web Services (AWS) strategy to achieve enhanced security, governance, and operational efficiency at scale. Implementing separate accounts for production and non-production environments enables enterprises to group workloads based on business purpose, apply distinct security postures by environments, restrict access to sensitive data, and streamline cost management. You can […]
