AWS Storage Blog
Category: AWS DataSync
Derive insights from AWS DataSync task reports using AWS Glue, Amazon Athena, and Amazon QuickSight
Update (10/30/2024): On October 30, 2024, AWS DataSync launched Enhanced mode tasks, prompting updates to this blog. Updates include a new step in the “Step 2: Populate Glue catalog with task reports data using a Glue crawler” section and detailed information on the new capabilities in “Updated steps for working with task reports of new […]
Migration to Amazon FSx for Windows File Server: cost optimization strategies and automation
Businesses today rely heavily on their file server infrastructure to store, manage, and share mission-critical data. However, maintaining and scaling these environments can be a complex and resource-intensive task. As customer needs and data volumes continue to grow, organizations often face the need to modernize their file server infrastructure. When migrating to Amazon FSx for […]
Direct Supply bolsters availability migrating to Amazon FSx for Windows File Server
Direct Supply, the leading provider of products and services to the Long-Term Care industry, migrated the bulk of our IT systems to AWS in early 2019. In the run-up to our cut over, we had five Server Message Block (SMB) file systems that needed to live alongside the applications they support. This meant that roughly […]
Migrate large HPC datasets from the edge to the cloud then synchronize continuously
Organizations running high-performance computing (HPC) workloads on premises often want to move data to the cloud to leverage scalability, performance, cost optimization, and other benefits of the cloud. For edge locations with limited or no available network bandwidth, online migrations can take a long time or be impossible. In locations where there is limited bandwidth […]
How London Stock Exchange Group migrated 30 PB of market data using AWS DataSync
London Stock Exchange Group (LSEG) has 30 PB of Tick History-PCAP data, which is ultra-high-quality global market data that is based on raw exchange data, timestamped to the nanosecond. An additional 60 TB is generated every day. LSEG sought to migrate their data from Wasabi cloud storage, LSEG was looking for a new solution to […]
Transferring data in Amazon S3 between AWS GovCloud (US) Regions and commercial AWS Regions using AWS DataSync
AWS users who need to comply with the most stringent US government security and compliance requirements operate their workloads in AWS GovCloud (US), which is architected as a separate partition providing network and identity isolation. A common use case for AWS GovCloud (US) users is to operate in both AWS GovCloud (US) Regions and commercial AWS Regions, such as […]
Transfer customer managed SSE-KMS encrypted objects across AWS accounts and Regions using AWS DataSync
Update (7/12/2024): Post updated to clarify that you must use a fully qualified Amazon Resource Name (ARN) when specifying your customer managed KMS keys. Some organizations have requirements to manage their own data encryption keys, both in general and during data transfer processes. In addition, when considering data transfer solutions (not just for encrypted data), […]
Replicate objects Using AWS DataSync with Amazon S3 compatible storage on Snowball Edge
Users have successfully leveraged Amazon Web Services (AWS) Snow Family to transfer petabytes of data between on premises and AWS Regions since its launch in 2015 with the AWS Snowball device. Increasingly, users are not just migrating data with the AWS Snow Family but now are using AWS Snowball Edge Compute Optimized devices to host […]
Migrating Oracle Cloud Infrastructure Object Storage to Amazon S3 using AWS DataSync
Users face challenges in their digital transformation journey involving the migration of data across various platforms, on-premises file systems, and other cloud services. When using other cloud providers, scenarios arise where the seamless transfer of data becomes essential. Whether executing a one-time data transfer or integrating it into a scheduled workflow, minimizing business downtime is […]
Implementing AWS DataSync with hundreds of millions of objects
Moving large volumes of data across your hybrid cloud environments can seem like a daunting task, especially when dealing with a litany of requirements that arise when working within the technical limits of network, storage, compute, and operating system layers, both on-premises and in the cloud. Users face additional challenges when balancing their Recovery Time […]