AWS DataSync

Easily transfer data to and from AWS up to 10x faster

AWS DataSync makes it simple and fast to move large amounts of data online between on-premises storage and Amazon S3 or Amazon Elastic File System (Amazon EFS). Manual tasks related to data transfers can slow down migrations and burden IT operations. DataSync eliminates or automatically handles many of these tasks, including scripting copy jobs, scheduling and monitoring transfers, validating data, and optimizing network utilization. The DataSync software agent connects to your Network File System (NFS) and Server Message Block (SMB) storage, so you don’t have to modify your applications. DataSync can transfer hundreds of terabytes and millions of files at speeds up to 10 times faster than open-source tools, over the internet or AWS Direct Connect links. You can use DataSync to migrate active data sets or archives to AWS, transfer data to the cloud for timely analysis and processing, or replicate data to AWS for business continuity. Getting started with DataSync is easy: deploy the DataSync agent, connect it to your file system, select your AWS storage resources, and start moving data between them. You pay only for the data you move.

AWS DataSync - Automate & accelerate online data transfer (1:31)

Benefits

Simplify and automate transfers

AWS DataSync makes it easy for you to move data over the network between on-premises storage and AWS. DataSync automates both the management of data transfer processes and the infrastructure required for high-performance, secure data transfer. The service also includes automatic encryption and data. All of this minimizes the in-house development and management otherwise needed for fast, reliable, and secure transfers.

Move data 10x faster

Transfer data rapidly over the network into AWS, up to 10 times faster than is common with open-source tooling. DataSync uses a purpose-built network protocol and a parallel, multi-threaded architecture to accelerate your transfers. This speeds up migrations, recurring data processing workflows for analytics and machine learning, and data protection processes.



Reduce operational costs

You can move data cost-effectively with DataSync’s flat, per-gigabyte pricing. You’ll also save on script development and management costs, and avoid the need for costly commercial transfer tools.

How it works

How DataSync works

Use cases

Data migration

If you are closing data centers or retiring storage arrays, you can use DataSync to move active data sets or archives rapidly over the network into Amazon S3 or Amazon EFS. DataSync does both full initial copies, and incremental transfers of changing data. It also includes encryption and integrity checking to help make sure your data arrives securely, intact, and ready to use. You can use DataSync to copy active, changing data alongside Snowball Edge for the migration of static data to Amazon S3.

Data processing for hybrid workloads

If you have on-premises systems generating or using data that needs to move into or out of AWS for processing, you can use DataSync to accelerate and schedule the transfers. It can help speed up critical hybrid cloud workflows in industries that need to move active files into AWS quickly, including video production in media and entertainment, seismic research in oil and gas, machine learning in life science, and big data analytics in finance.

Data protection

If you have large Network Attached Storage (NAS) systems, you likely have a lot of files to protect—either with replication or backup to a second hardware stack. With DataSync, you can replicate files into all Amazon S3 storage classes, and select the most cost-effective storage class for your needs. Or, you can send the data to Amazon EFS for a standby file system. 

Celegene
“At Celgene, our research teams are focused intently on the discovery and development of treatments for cancer and other severe conditions. AWS is an integral part of our innovation process, and for our IT teams that means using as many AWS services as we can, to eliminate the operational and cost burdens of running infrastructure and tooling that distract us from supporting drug discovery. Our labs generate petabytes of data – irreplaceable intellectual property – and we use AWS DataSync to get the data into Amazon S3 and Amazon EFS easily, quickly and cost-effectively. Without the data in AWS, there’s no way we could innovate as fast. AWS DataSync works with my existing storage systems, and efficiently uses as much bandwidth as we can give it to get our data safely into AWS.”

Lance Smith, Director of Research Computing - Celgene

News

Date
  • Date
Learn about DataSync features
Check out the product features

Learn what makes AWS DataSync fast, secure and easy to use as part of your AWS architecture.

Learn more 
Sign up for a free AWS account
Sign up for a free account

Instantly get access to the AWS Free Tier. 

Sign up 
Start building with DataSync in the console
Start building in the console

Get started building with AWS DataSync in the AWS Console.

Sign in