AWS Storage Blog

Category: Compute

AWS DataSync Featured Image 2020

Automate data synchronization between AWS Outposts racks and Amazon S3 with AWS DataSync

Many organizations generate large quantities of data locally, including digital imagery, sensor data, and more. Customers require local compute and storage to ingest and enable real-time predications based on their data, and often preprocess this data locally before transferring to the cloud to unlock additional business value such analysis, reporting, and archiving. Automating transfers to […]

Amazon S3 Batch Operations featured image

Automate object processing in Amazon S3 directory buckets with S3 Batch Operations and AWS Lambda

Data, the lifeblood of any modern organization, is rarely static. For high-performance applications and workloads, enterprises need the ability to run operations on massive amounts of data, including modifying the data as is necessary for each use case, to further accelerate processing. This could include modifying uploaded images with a watermark, changing the bitrate of […]

Amazon S3 featured image 2023

Managing duplicate objects in Amazon S3

When managing a large volume of data in a storage system, it is common for data duplication to happen. Data duplication in data management refers to the presence of multiple copies of the same data within your system, leading to additional storage usage as well as extra overhead when handling multiple copies of the same […]

AWS Backup 2021 blog image

Automate the delivery of AWS Backup Audit Manager reports via email

Business continuity and disaster recovery plans include having a backup strategy for application workloads, whether on-premises or in Cloud. Furthermore, organizations need efficient methods to actively monitor their data protection posture and detect any failure for remediation while meeting the required recovery time objective (RTO) and recovery point objective (RPO) for the business. One efficient […]

Figure 1: Multi-region FSx for NetApp ONTAP with SnapMirror replication for SQL Server DR

Automating retrievals from the Amazon S3 Glacier storage classes

Faced with increasing amounts of data and a tightening economic climate, enterprises are looking to save money on their storage costs by moving rarely needed data to archival storage options. The least costly options require your internal systems to support receiving data back in hours or days, often called asynchronous retrievals. With this time delay, […]

Amazon S3 Object Lambda

Automatically modify data you are querying with Amazon Athena using Amazon S3 Object Lambda

Enterprises may want to customize their data sets for different requesting applications. For example, if you run an e-commerce website, you may want to mask Personally Identifiable Information (PII) when querying your data for analytics. Although you can create and store multiple customized copies of your data, that can increase your storage cost. You can […]

Improve compute utilization with more Amazon EBS volume attachments on 7th generation Amazon EC2 instances

For many stateful containerized applications, such as those using Kubernetes orchestration, each stateful pod (the smallest deployable container object) may require dedicated persistent storage. A block storage solution is a good fit due to its high performance, low latency, and persistence attributes. If a compute instance has more compute resources to spare, you can only […]

Simple and comprehensive data protection with Amazon Data Lifecycle Manager

Enterprises often use distinct accounts to group workloads and associated resources used across multiple teams and projects. This helps organizations align ownership, decision making, and costs so that they can be easily managed across internal teams. However, each team in an account may have different requirements and processes when it comes to backing up their […]

How PingCAP transformed TiDB into a serverless DBaaS using Amazon S3 and Amazon EBS

PingCAP, an AWS Partner Network (APN) Partner, is the company behind TiDB, an advanced open-source, distributed SQL database for building modern applications. TiDB is widely used and trusted by technologists around the world. In July 2023, PingCAP released TiDB Serverless, a fully managed, autonomous DBaaS offering of TiDB. However, based on TiDB’s existing architecture, PingCAP […]

Automating application-consistent Amazon EBS Snapshots for MySQL and PostgreSQL

MySQL and PostgreSQL are popular relational database management systems that many organizations use to power web applications, dynamic websites, and embedded systems. For customers self-hosting MySQL and PostgreSQL with AWS, they can use their choice of tools to manage the operating system, database software, patches, data replication, backup, and restoration. As customers back up their […]