AWS DataSync is an online data transfer service that simplifies, automates, and accelerates copying large amounts of data between on-premises systems and AWS Storage services, as well as between AWS Storage services. DataSync can copy data between Network File System (NFS) shares, or Server Message Block (SMB) shares, self-managed object storage, AWS Snowcone, Amazon Simple Storage Service (Amazon S3) buckets, Amazon Elastic File System (Amazon EFS) file systems, and Amazon FSx for Windows File Server file systems.
Purpose-built network protocol
The service employs an AWS-designed transfer protocol—decoupled from the storage protocol—to accelerate data movement. The protocol performs optimizations on how, when, and what data is sent over the network. Network optimizations performed by DataSync include incremental transfers, in-line compression, and sparse file detection, as well as in-line data validation and encryption.
Connections between the local DataSync agent and the in-cloud service components are multi-threaded, maximizing performance over your Wide Area Network (WAN). A single DataSync task is capable fully utlizing 10 Gbps over anetwork link between your on-premises environment and AWS.
Automatic infrastructure management
DataSync removes many of the infrastructure and management challenges that you face when either writing, optimizing and managing your own copy scripts, or deploying and tuning heavyweight commercial transfer tools. DataSync comes with built-in monitoring and retry mechanisms, and granular control over the portion of network bandwidth used to transfer your data.
Data encryption and validation
All of your data is encrypted in transit with Transport Layer Security (TLS). DataSync supports using default encryption for S3 buckets, Amazon EFS file system encryption of data at rest, and Amazon FSx for Windows File Server encryption at rest and in transit.
DataSync ensures that your data arrives intact. For each transfer, the service performs integrity checks both in transit and at rest. These checks ensure that the data written to your destination matches the data read from your source, validating consistency.
Data transfer scheduling
DataSync comes with a built-in scheduling mechanism enabling you to periodically execute a data transfer task to detect and copy changes from your source storage system to the destination. You can schedule your tasks using the AWS DataSync Console or AWS Command Line Interface (CLI), without needing to write scripts to manage repeated transfers. Task scheduling automatically runs tasks on the schedule you configure, with hourly, daily, or weekly options provided directly in the Console.
File system integration and metadata preservation
Integration with AWS infrastructure and management services
DataSync works natively with AWS security, monitoring, and audit services to make data movement simpler, and to provide consistent management experience for your IT, storage, and DevOps teams. In addition to integrations with Amazon S3, Amazon EFS, and Amazon FSx for Windows File Server, DataSync supports VPC endpoints (powered by AWS PrivateLink) in order to move files directly into your Amazon VPC. Like other AWS services, you use AWS Identity and Access Management (IAM) to securely manage access for DataSync. Similarly, the service accesses your Amazon S3 bucket using an IAM role you configure.
Monitoring and auditing with Amazon CloudWatch and AWS CloudTrail
With Amazon CloudWatch, you can monitor the status of any DataSync transfers currently in progress and check the history of previous data transfers. With CloudWatch Metrics you can see the number of files and amount of data that has been copied. You can consult CloudWatch Logs for information about individual files that are transferred at a given time, as well as the results of the integrity verification performed by DataSync. This simplifies monitoring, reporting, and troubleshooting, and enables you to provide timely updates to stakeholders. In addition, CloudWatch Events can be triggered as your transfer tasks complete, enabling automation of dependent workflows. For audit purposes, you can consult AWS CloudTrail, which logs all actions performed by DataSync.
You pay only for data copied by the service, at a flat, per-gigabyte rate—no software licenses, contracts, maintenance fees, development cycles, or required hardware. This provides a lower total cost of ownership compared to manually building, operating, and optimizing your own high-performance scripted transfers. It also offers lower total cost than buying and running commercial transfer tools.