AWS Storage Blog

Category: Storage

Amazon S3 Glacier Storage Classes

Modernizing NASCAR’s multi-PB media archive at speed with AWS Storage

The National Association for Stock Car Auto Racing (NASCAR) is the sanctioning body for the No. 1 form of motor sports in the United States, and owns 15 of the nation’s major motorsports entertainment facilities. About 15 years ago NASCAR began to collect all the video, audio, and image assets from over the last 70+ […]

Amazon S3 Replication featured image

Considering four different replication options for data in Amazon S3

UPDATE (2/10/2022): Amazon S3 Batch Replication launched on 2/8/2022, allowing you to replicate existing S3 objects and synchronize your S3 buckets. See the S3 User Guide for additional details. UPDATE (5/1/2023): Updated the comparison table to reflect new Amazon S3 features and capabilities. As your business grows and accumulates more data over time, you may […]

Amazon S3 Glacier Storage Classes

Best practices for archiving large datasets with AWS

As companies grow, they often find themselves managing an ever-increasing amount of data. Customers often need to retain backups for business continuity or disaster recovery, as well as records for compliance and audits. In addition, some customers may need to retain backups to create a centralized repository of information that is heterogeneous in nature, with […]

AWS Snow Family Featured Image

Customization and field maintenance with AWS Snowball Edge – sideloading AMIs

When the AWS Snowball Edge launched in 2016, if you wanted to run Amazon Elastic Compute Cloud (Amazon EC2) instances on the device, you were required to specify an Amazon Machine Image (AMI) to be installed on it at time of ordering. You were then able to launch Amazon EC2 instances on the device based […]

Amazon S3

Monitor Amazon S3 activity using S3 server access logs and Pandas in Python

Monitoring and controlling access to data is often essential for security, cost optimization, and compliance. For these reasons, customers want to know what data of theirs is being accessed, when it is being accessed, and who is accessing it. With more data to monitor, large amounts of data can make it more challenging to granularly […]

Amazon S3 Glacier Storage Classes

Collecting, archiving, and retrieving surveillance footage with AWS

Video feeds and still images from judiciary locations are considered critical forms of evidence in the court of law. These locations can be police stations and government offices or even civil locations of importance like banks and hospitals. As governments, particularly in smart cities rely upon video surveillance, it is critical to design a cost […]

Amazon S3

Point-in-time restore for Amazon S3 buckets

Enterprises store increasing quantities of object data for use cases like data lakes, document management systems, and media libraries. Performing point-in-time restores for large datasets can be challenging, as existing approaches with full-restore from backup are time consuming and expensive. Alternatively, restoring individual objects to previous versions is prone to errors and delays the restore […]

AWS Storage Gateway in 2021 – Year in Review

It has become customary for us to share with our customers the new AWS Storage Gateway enhancements every year. As we did in 2020 and 2019, through this blog I’m reviewing all the new Storage Gateway launches from 2021. I’ll cover how the new enhancements make it easier for customers to access AWS Storage from […]

Mount Amazon EFS file systems cross-account from Amazon EKS

Many customers use multiple AWS accounts managed by AWS Organizations to create security and cost boundaries around business units, projects, or applications. AWS Organizations helps you centrally manage and govern your environment as you grow and scale your AWS resources. In some cases, an application in one AWS account must access data in another. As […]

Optimizing SAS Grid on AWS with Amazon FSx for Lustre

Many customers run complex analytics and high performance SAS-based applications on premises using the SAS Grid platform to perform large-scale analytics. Customers with a strategy to move to open-source or cloud-native solutions will often consider refactoring applications to Python or R to lower their total cost of ownership, however refactoring these applications as part of […]