AWS Storage Blog

Tag: Amazon Simple Storage Service (Amazon S3)

Amazon S3 Archive Storage Classes

Simplify querying your archive data in Amazon S3 with Amazon Athena

Today, customers increasingly choose to store data for longer because they recognize its future value potential. Storing data longer, coupled with exponential data growth, has led to customers placing a greater emphasis on storage cost optimization and using cost-effective storage classes. However, a modern data archiving strategy not only calls for optimizing storage costs, but […]

Amazon FSx for Lustre

Use Amazon FSx for Lustre to share Amazon S3 data across accounts

As enterprises evolve their cloud governance practices, multiple teams working in separate accounts may need to share data. One team may oversee an enterprise data lake in one account, while a data science team develops a high-performance computing (HPC) use case in another account. Customers want to take advantage of low-cost object storage and be […]

How PingCAP transformed TiDB into a serverless DBaaS using Amazon S3 and Amazon EBS

PingCAP, an AWS Partner Network (APN) Partner, is the company behind TiDB, an advanced open-source, distributed SQL database for building modern applications. TiDB is widely used and trusted by technologists around the world. In July 2023, PingCAP released TiDB Serverless, a fully managed, autonomous DBaaS offering of TiDB. However, based on TiDB’s existing architecture, PingCAP […]

Amazon S3 featured image - new

Manage event ordering and duplicate events with Amazon S3 Event Notifications

Customers use data events to build applications to trigger and communicate between decoupled services. An event is a change in state of, or an update to, data. For certain applications, such as batch order processing or content management, customers may need to implement application logic to handle duplicate and out-of-order events. For these use cases, […]

Automate Amazon S3 File Gateway on Amazon EC2 with Terraform by HashiCorp

Infrastructure as Code (IaC) involves managing IT infrastructure through code and automation tools to reduce manual management prone to errors, slow scaling, and overhead. For organizations implementing a hybrid cloud infrastructure, automation can ensure uniformity, scalability, and cost reduction while getting cloud resources provisioned efficiently. Automated provisioning and configuration enable organizations to adapt, innovate, and […]

Amazon S3 featured image - new

Getting visibility into storage usage in multi-tenant Amazon S3 buckets

SaaS providers with multi-tenant environments use cloud solutions to dynamically scale their workloads as customer demand increases. As their cloud footprint grows, having visibility into each end-customer’s storage consumption becomes important to distribute resources accordingly. An organization can use storage usage data per customer (tenant) to adjust its pricing model or better plan its budget. […]

Amazon S3 featured image - new

Consolidate and query Amazon S3 Inventory reports for Region-wide object-level visibility

Organizations around the world store billions of objects and files representing terabytes to petabytes of data. Data is often owned by different teams, departments, or business units, spanning multiple locations. As the amount of datastores, locations, and owners grow, you need a way to cost-effectively maintain visibility on important characteristics of your data, including based […]

Amazon S3 Archive Storage Classes

Identify cold objects for archiving to Amazon S3 Glacier storage classes

Many organizations move cold data to archive storage in the cloud to optimize storage costs for data they want to preserve over a number of years. Archiving data at a very low cost also gives organizations the ability to quickly restore that data and put it to work for their business, such as for historical […]

Amazon S3 featured image - new

How Continental uses Mountpoint for Amazon S3 in autonomous driving development – accelerating simulation performance by 20%

Continental and AWS have been collaborating to create the Continental Automotive Edge (CAEdge) framework – a modular hardware and software environment that connects the vehicle to the cloud and features virtual workbenches. The platform includes a virtual workbench offering numerous options to develop, supply, and maintain software-intensive system functions. It supports a wide range of automotive […]

Amazon S3 featured image - new

Optimizing performance of Apache Spark workloads on Amazon S3

This blog covers performance metrics, optimizations, and configuration tuning specific to OSS Spark running on Amazon EKS. For customers using or considering Amazon EMR on EKS, refer to the service documentation to get started and this blog post for the latest performance benchmark. Performance is top of mind for customers running streaming, extract transform load […]