Amazon Simple Storage Service (Amazon S3)

How Tavily reduced AI search caching costs by 95% with Amazon S3 Express One Zone

Tavily is an AI infrastructure company building the web access layer for agents and large language models (LLMs). The company provides developer-friendly APIs that enable real-time, structured retrieval from the web. Its mission is to make information instantly accessible for intelligent systems, and it is trusted by thousands of leading research, commercial AI teams, and enterprises […]

20 years of Amazon S3: A storage professional’s journey to AWS Hero

I’ve been working with data storage technologies for more than 20 years. Over that time, storage technologies have matured to keep up with exponential data growth. Solid state storage replaced spinning disk for the most critical workloads. Drive capacities grew from tens of gigabytes to tens of terabytes. On March 14, 2006, in the middle […]

Optimize data management on S3 Tables with Intelligent-Tiering

Organizations are rapidly adopting Apache Iceberg for their data lakes because it supports petabyte-scale growth and performance with the flexibility to evolve schemas and partitions without costly rewrites. Its architecture enables modern data lake management via features like time travel and incremental processing. However, managing Iceberg datasets efficiently can become a challenge over time as […]

Automatically decompress files in Amazon S3 using AWS Step Functions

Every day, AWS customers process millions of compressed files in Amazon S3, from small ZIP archives to multi-gigabyte datasets. While decompressing a single file is straightforward, processing thousands of files efficiently requires complex orchestration, error handling, and infrastructure management. Consider this scenario: Your organization receives over 10,000 compressed files daily from partners, ranging from 5 […]

Optimize agent tool selection using Amazon S3 Vectors and Amazon Bedrock Knowledge Bases

State-of-the-art AI agents rely on external tools to perform actions on their behalf. A tool is a function with a clear description, defined inputs, and outputs that extend the capabilities of a large language model (LLM). As toolkits expand, selecting the right tool for each task requires effective mechanisms, among which semantic search enables agents […]

Real-time fleet tracking using AWS IoT Core, Amazon S3 Tables, and Amazon Quick Sight

The fast pace of the logistics industry necessitates real-time vehicle fleet tracking for maintaining a competitive edge and meeting customer expectations. Traditional methods often provide delayed or incomplete information, leading to inefficiencies and missed opportunities. The demand for accurate, up-to-the-minute data on vehicle locations, driver behavior, and route performance has never been higher. Companies need […]

Automated extraction of compressed files on Amazon S3 using AWS Batch and Amazon ECS

Organizations frequently upload compressed TAR files to Amazon S3 for efficient data transfer, but downstream applications often need extracted files for processing. Although AWS Glue excels at processing splittable files across worker nodes, TAR files need single-node processing, traditionally forcing teams to manually provision servers, monitor extraction jobs, and manage resource cleanup. This post demonstrates […]

Architecting high performance AI-driven data applications with Spice AI and AWS

As enterprises scale their adoption of generative AI, one of the biggest technical challenges is connecting AI applications to the right data and making that data fast, accessible, and secure. AI agents are transforming industries through applications like customer support automation, personalized e-commerce recommendations, and research assistance in financial services and healthcare. These applications require […]

Building an open warehouse architecture: Supabase’s integration with Amazon S3 Tables

As applications scale, developers face a persistent challenge: analytical queries that slow down transactional databases, force them to copy data across multiple proprietary tools, and create disconnected data silos. For the 5 million developers building on Supabase, an open source Postgres development platform, this tension between operational and analytical workloads has become increasingly critical. The […]

Lower your Amazon S3 backup costs with AWS Backup S3 tiering

Organizations face a critical challenge in data protection: how to retain ever-increasing volumes of backup data for extended periods while maintaining cost efficiency. Regulatory mandates, internal governance policies, and comprehensive disaster recovery strategies often necessitate preserving backups for months or even years. At the same time, the threat landscape continues to evolve, with sophisticated ransomware […]

AWS Storage Blog
