AWS Storage Blog

Category: Amazon S3 Glacier Deep Archive

Amazon S3 Glacier Storage Classes

Automatically compress and archive satellite imagery for Amazon S3

Satellite imagery often comes as large, high-resolution files, and organizations that work with this data typically have high storage costs. Additionally, large imagery files can take time and resources when downloaded for use with machine learning (ML), data analytics tools, or manual analyst review. Using standard compression techniques lets us achieve reductions in file size […]

Restoring archived objects at scale from the Amazon S3 Glacier storage classes

Every organization around the world has archival data. There is a data archiving need not only for companies that have been around for a while, but also for digital native businesses. Workloads such as medical records, news media content, and manufacturing datasets, often store petabytes – or billions of objects indefinitely. The vast majority of […]

Amazon S3 Glacier Storage Classes

How Indus OS cost-effectively transitioned billions of small objects between Amazon S3 storage classes

Indus OS is a mobile platform that enables content and application discovery for users, application developers, and original equipment manufacturers (OEMs). Powered by artificial intelligence, Indus App Bazaar curates locally relevant apps and content for users based on their demographics, lingual preferences, and behaviour, and offers an intuitive user interface with content-led discovery. Indus App […]

S3 Intelligent-Tiering

Automatically archive and restore data with Amazon S3 Intelligent-Tiering

Customers of all sizes, in all industries, are using data lakes to transform data from a cost that must be managed, to a business asset. From time to time, data scientists and business analysts need to restore subsets of historical datasets for longitudinal studies, machine learning retraining, and more. However, users commonly write queries that don’t […]

Amazon S3 Glacier Storage Classes

Restore data from Amazon S3 Glacier storage classes starting with partial object keys

When managing data storage, it is important to optimize for cost by storing data in the most cost-effective manner based on how often data is used or accessed. For many enterprises, this means using some form of cold storage or archiving for data that is less frequently accessed or used while keeping more frequently used […]

AWS DataSync Featured Image 2020

How to move and store your genomics sequencing data with AWS DataSync

Genomics data is expanding at a rate exceeding Moore’s law according to the National Human Genome Research Institute. As more sequencing data is produced and researchers move from genotyping to whole genome sequencing, the amount of data produced is outpacing on-premises capacity. Organizations need cloud solutions that help manage data movement, storage, and analysis. The […]

Amazon S3 Glacier Storage Classes

Compressing and archiving logs to the Amazon S3 Glacier storage classes

In distributed architectures, there is often a need to preserve application logs, and for AWS customers preservation is often done via an Amazon S3 bucket. The logs may contain information on runtime transactions, error/failure states, or application metrics and statistics. These logs are later used in business intelligence to provide useful insights and generate dashboards, […]

Amazon S3 Glacier Storage Classes

Best practices for archiving large datasets with AWS

As companies grow, they often find themselves managing an ever-increasing amount of data. Customers often need to retain backups for business continuity or disaster recovery, as well as records for compliance and audits. In addition, some customers may need to retain backups to create a centralized repository of information that is heterogeneous in nature, with […]

Amazon S3 Glacier Storage Classes

Collecting, archiving, and retrieving surveillance footage with AWS

Video feeds and still images from judiciary locations are considered critical forms of evidence in the court of law. These locations can be police stations and government offices or even civil locations of importance like banks and hospitals. As governments, particularly in smart cities rely upon video surveillance, it is critical to design a cost […]

How Pinterest uses Amazon S3 Glacier Deep Archive to manage storage for its visual discovery engine

Pinterest is the visual discovery engine with a mission to bring everyone the inspiration to create a life they love. It’s one of the biggest datasets of ideas ever assembled online, with over 300 billion Pins with ideas around home, food, style, beauty, travel, and more. More than 440 million people around the world use […]