AWS Storage Blog
Tag: Amazon Simple Storage Service (Amazon S3)
Building self-managed RAG applications with Amazon EKS and Amazon S3 Vectors
Retrieval-Augmented Generation (RAG) is a technique that optimizes large language model (LLM) outputs by referencing authoritative knowledge bases outside of the model’s training data before generating responses. This addresses common limitations of traditional LLMs, such as outdated knowledge, hallucinated facts, and misinterpreted terminology. Organizations can implement RAG to enhance their generative AI applications with current, […]
How Zeta Global scales multi-tenant data ingestion with Amazon S3 Tables
Zeta Global is a data-driven marketing technology company that uses consumer insights to empower brands in customer acquisition, growth, and retention. At the core of its operations is the Zeta Marketing Platform, an advanced system that applies sophisticated AI and machine learning (ML) capabilities on proprietary data from over 245 million U.S. consumer profiles. This […]
Enforcing organization-wide Amazon S3 bucket-tagging policies
In today’s complex cloud environments, maintaining consistent resource tagging is a critical challenge faced by organizations of all sizes. Proper resource tagging is essential for cost allocation, security compliance, operational management, and maintaining governance at scale. However, enforcing tagging standards across distributed teams and numerous resources can be difficult, especially when dealing with rapid deployment […]
Efficiently verify Amazon S3 data at scale with compute checksum operation
Organizations across industries must regularly verify the integrity of their stored datasets to protect valuable information, satisfy compliance requirements, and preserve trust. Media and entertainment customers validate assets to make sure that content remains intact, financial institutions run integrity checks to meet regulatory obligations, and research institutions confirm the reproducibility of scientific results. These verifications […]
Resilience by design: Building an effective ransomware recovery strategy
Ransomware events have become a board room priority for modern organizations. The data shows a clear trend: ransomware events have more than doubled since the pandemic began, with the financial services sector experiencing particularly high targeting rates. At AWS, our cross-field collaboration with global financial services customers, regulators, governing bodies and industry partners has resulted […]
Simplify log rotation with Amazon S3 Express One Zone
Log rotation is a standard operational practice for maintaining system health and performance while managing storage costs effectively. This practice involves systematically archiving log files to prevent them from consuming excessive storage. When a log file reaches a certain size or age, it’s rotated—meaning the current file is archived with a new name and a […]
Copy objects between any Amazon S3 storage classes using S3 Batch Operations
When storing data, choosing the storage class that is best suited for your particular needs allows you to optimize your storage costs, performance, and object availability. However, over time, the access patterns for your objects can change, which means you may need to migrate your objects to a different storage class to continue optimize for […]
Implementing conversational AI for S3 Tables using Model Context Protocol (MCP)
In today’s data-driven world, the ability to interact with your data through natural language is becoming increasingly valuable. By combining the power of conversational AI with Amazon S3 Tables, organizations can democratize data access and enable individuals across technical skill levels to query, analyze, and gain insights from their data using simple conversations. Model Context […]
Cost-efficient backup archiving with Veeam Direct to Amazon S3 Glacier storage classes
If you work with data storage and data protection, then you’re aware of the “3-2-1 rule.” System administrators consider the 3-2-1 rule a best practice for backup and disaster recovery (DR), and it is recommended by US-CERT. The 3-2-1 rule states that you should have three copies of your data (your production data and two […]
Cloud-powered tick data: revolutionizing financial data storage with Amazon S3 and LSEG
Data has become the lifeblood of modern financial markets, driving everything from investment decisions to regulatory compliance. Nowhere is this more evident than in electronic trading, where the ability to efficiently store, process, and analyze historical market data can make the difference between success and failure. Market participants are witnessing an unprecedented surge in tick […]