AWS Storage Blog

Category: Amazon DynamoDB

How London Stock Exchange Group migrated 30 PB of market data using AWS DataSync

London Stock Exchange Group (LSEG) has 30 PB of Tick History-PCAP data, which is ultra-high-quality global market data that is based on raw exchange data, timestamped to the nanosecond. An additional 60 TB is generated every day. LSEG sought to migrate their data from Wasabi cloud storage and was looking for a new solution to […]

Using Amazon Cognito as an identity provider with AWS Transfer Family and Amazon S3

In highly regulated industries, securely exchanging files business-to-business is a crucial business practice. When building out a Managed File Transfer (MFT) environment, it is common to consider using a third-party identity solution for authenticating users. This approach offers simplicity for businesses that already use an identity service, allowing them to maintain identities for a variety […]

Manage event ordering and duplicate events with Amazon S3 Event Notifications

Customers use data events to build applications to trigger and communicate between decoupled services. An event is a change in state of, or an update to, data. For certain applications, such as batch order processing or content management, customers may need to implement application logic to handle duplicate and out-of-order events. For these use cases, […]

How Canva saves over $3 million annually in Amazon S3 costs

Canva is an online design tool that empowers users worldwide to design, edit, and publish anything they can dream up. Canva runs most of its production workloads on AWS, using several core services, including Amazon S3, Amazon ECS, Amazon RDS, and Amazon DynamoDB. Running on AWS has helped Canva move fast and keep up with […]

Creating an ETL pipeline trigger for existing AWS DataSync tasks

Organizations look for ways to leverage the compute power of the cloud to analyze their data and produce reports to help drive business decisions. They want to load their data sets into extract-transform-load (ETL) pipelines for data processing. Once the data is processed, business decision makers at these organizations rely on accurate report generation to […]

Retrieving previous versions of S3 bucket policies

When dealing with applications or storage, it is often helpful to log previous policy configurations and have the ability to retrieve them. This can help you manage and troubleshoot configuration changes and comply with security regulations. Amazon Simple Storage Service (Amazon S3) bucket policies are resource-based policies that you can use to grant access permissions […]

On-demand archival and retrieval of documents from Amazon WorkDocs to Amazon S3

Cloud storage of documents has seen rapid growth over the years as more and more customers and businesses move away from traditional physical storage. As the size and number of documents continue to grow, customers want to manage their documents and retain them using long term, durable, cost-effective document archives. Businesses such as medical research […]

Troubleshooting automated pre- and post-scripts for AWS Backup

Customers can use event-driven architectures with decoupled tasks to automate and orchestrate custom scripts for backup jobs. With event-driven architectures, troubleshooting is key to understanding failures at the component levels in order to resolve issues that arise and keep the entire automated workflow running smoothly. In the first post in this two-part blog series, we […]

Using AWS Storage Gateway to modernize next-generation sequencing workflows

Exact Sciences operates laboratories across the world that produce data critical to performing analysis and diagnostics to classify cancer modalities, treatments, and therapeutics. The laboratories generate large data sets from on-premises genomic sequencing devices that must be sent to the cloud for processing. Once in the cloud, we process the data to […]

Isima.io optimizes price performance for OLAP workloads using Amazon EBS

Isima.io, a unified analytics startup founded in 2016, aims to accelerate analytics outcomes for organizations. Isima.io does this by combining multiple data management disciplines – including Enterprise Service Bus (ESB), Extract-Transform-Load (ETL), Enterprise-Data-Warehouse (EDW), and Business Intelligence (BI) – into one hyper-converged system. IT teams can only win by building differentiated, agile data apps. The […]