AWS DataSync is an online data transfer service that simplifies, automates, and accelerates copying large amounts of data between on-premises systems and AWS Storage services, as well as between AWS Storage services. DataSync can copy data between Network File System (NFS) shares, or Server Message Block (SMB) shares, self-managed object storage, AWS Snowcone, Amazon Simple Storage Service (Amazon S3) buckets, Amazon Elastic File System (Amazon EFS) file systems, and Amazon FSx for Windows File Server file systems.

Purpose-built network protocol

The service employs an AWS-designed transfer protocol—decoupled from the storage protocol—to accelerate data movement. The protocol performs optimizations on how, when, and what data is sent over the network. Network optimizations performed by DataSync include incremental transfers, in-line compression, and sparse file detection, as well as in-line data validation and encryption.

Connections between the local DataSync agent and the in-cloud service components are multi-threaded, maximizing performance over your Wide Area Network (WAN). A single DataSync task is capable fully utlizing 10 Gbps over anetwork link between your on-premises environment and AWS.

Automatic infrastructure management

DataSync removes many of the infrastructure and management challenges that you face when either writing, optimizing and managing your own copy scripts, or deploying and tuning heavyweight commercial transfer tools. DataSync comes with built-in monitoring and retry mechanisms, and granular control over the portion of network bandwidth used to transfer your data.

Data encryption and validation

All of your data is encrypted in transit with Transport Layer Security (TLS). DataSync supports using default encryption for S3 buckets, Amazon EFS file system encryption of data at rest, and Amazon FSx for Windows File Server encryption at rest and in transit.

DataSync ensures that your data arrives intact. For each transfer, the service performs integrity checks both in transit and at rest. These checks ensure that the data written to your destination matches the data read from your source, validating consistency.

Data transfer scheduling

DataSync comes with a built-in scheduling mechanism enabling you to periodically execute a data transfer task to detect and copy changes from your source storage system to the destination. You can schedule your tasks using the AWS DataSync Console or AWS Command Line Interface (CLI), without needing to write scripts to manage repeated transfers. Task scheduling automatically runs tasks on the schedule you configure, with hourly, daily, or weekly options provided directly in the Console.

File system integration and metadata preservation

The DataSync agent connects to your existing storage systems using the industry-standard NFS and SMB protocols, or to your self-managed object storage, using the Amazon S3 API. The agent transfers data rapidly and writes it into your designated Amazon S3 bucket, Amazon EFS file system, or Amazon FSx for Windows File Server file system.
 
When copying data between NFS shares and Amazon EFS, or between SMB shares and Amazon FSx for Windows File Server, file permissions and metadata are preserved, enabling an easy transition to the target file system. Similarly, when copying objects between self-managed object storage and Amazon S3, object metadata and tags are preserved.
 
When copying data to Amazon S3, DataSync automatically converts each file to be a single S3 object in a 1:1 relationship, and preserves POSIX metadata from NFS shares as Amazon S3 object metadata. When you copy objects that contain file system metadata back to file formats, the original file metadata that DataSync copied to S3 is restored. 

Integration with AWS infrastructure and management services

DataSync works natively with AWS security, monitoring, and audit services to make data movement simpler, and to provide consistent management experience for your IT, storage, and DevOps teams. In addition to integrations with Amazon S3, Amazon EFS, and Amazon FSx for Windows File Server, DataSync supports VPC endpoints (powered by AWS PrivateLink) in order to move files directly into your Amazon VPC. Like other AWS services, you use AWS Identity and Access Management (IAM) to securely manage access for DataSync. Similarly, the service accesses your Amazon S3 bucket using an IAM role you configure.

Monitoring and auditing with Amazon CloudWatch and AWS CloudTrail

With Amazon CloudWatch, you can monitor the status of any DataSync transfers currently in progress and check the history of previous data transfers. With CloudWatch Metrics you can see the number of files and amount of data that has been copied. You can consult CloudWatch Logs for information about individual files that are transferred at a given time, as well as the results of the integrity verification performed by DataSync. This simplifies monitoring, reporting, and troubleshooting, and enables you to provide timely updates to stakeholders. In addition, CloudWatch Events can be triggered as your transfer tasks complete, enabling automation of dependent workflows. For audit purposes, you can consult AWS CloudTrail, which logs all actions performed by DataSync.

Pay-as-you-go pricing

You pay only for data copied by the service, at a flat, per-gigabyte rate—no software licenses, contracts, maintenance fees, development cycles, or required hardware. This provides a lower total cost of ownership compared to manually building, operating, and optimizing your own high-performance scripted transfers. It also offers lower total cost than buying and running commercial transfer tools.

Learn more about DataSync pricing
Learn more about DataSync pricing

DataSync Pricing is simple, and based on how much you data you transfer.

Learn more 
Create a free account
Sign up for a free account

Instantly get access to the AWS Free Tier. 
 

Sign up 
Start building in the console
Start building in the Console

Get started building with DataSync in the AWS Console.

Sign in