Arcesium Uses Amazon EBS and Amazon FSR for Faster Database Refreshes
Arcesium, a global financial technology and professional services firm, recognized that speedier database refreshes would benefit its clients. The company ultimately turned to Amazon Web Services (AWS) to employ Amazon Elastic Block Store (Amazon EBS), a simple-to-use and high-performance block storage service, and Amazon EBS Fast Snapshot Restore (Amazon FSR). As a result, the company achieved rapid scaling, high performance, ample storage, and better resource management capabilities.
Using application-consistent Amazon EBS snapshots as database backups would be really helpful for any AWS customers running databases on top of Amazon EC2 instances."
Associate Director and Head of the Database Engineering Group, Arcesium
Getting More Reliable Service Using Amazon EBS and Amazon EC2
In 2014, Arcesium was in the process of spinning off from the D. E. Shaw group, a global investment and technology development firm. At that time, Arcesium considered whether to opt for an on-premises data center or a cloud provider for its computing needs. It ultimately made the decision to use AWS, largely because of the length of time AWS had been in the market and its available suite of dynamic services. “We serve the world’s most sophisticated financial institutions with very complex workflows, and we needed high-performing infrastructure that could scale and meet our data processing needs. We ran benchmarks with various providers,” says Manikantan Madhunapantula, associate director of Arcesium and head of the company’s database engineering group. “We compared on-premises and AWS instance sizes and throughput—input/output operations per second specifically. AWS had a significant advantage over other providers—especially for running our database workloads.”
One of the first challenges Arcesium faced was finding a way to refresh its multiterabyte nonproduction database servers on AWS without using expensive filer software, degrading performance, or incurring a lot of downtime. “Without the capability to refresh with the latest production data, clients would have to invest a lot of time and energy to create data to successfully test their daily, weekly, and monthly data-dependent workflows,” says Manikantan. “Whatever solution we embarked on had to provide a consistent, point-in-time view of multiple databases across the multiple volumes in our environment.”
As Arcesium started its cloud journey, it planned to run database workloads on Amazon Elastic Compute Cloud (Amazon EC2) instances, using Amazon EBS volumes to satisfy its performance requirements. Because Arcesium runs multiterabyte databases, it needed the most powerful instances available from Amazon EC2, a web service that provides secure, resizable compute capacity in the cloud.
Although Amazon EC2 and Amazon EBS offer high performance and ample storage, Arcesium recognized the need to develop an innovative solution to handle an increasing number of refreshes. Arcesium initially made copy-on-write backups for a consistent point-in-time view of databases, then mounted the shadow copy to an Amazon EC2 instance and copied all data over to another disk drive to accomplish refreshes. While this worked fine, Arcesium also wanted to provide the capability to manage multiple environment refreshes over the same weekend to enable clients to run numerous transformative projects simultaneously. That’s when Arcesium reached out to AWS and was introduced to what would become Amazon FSR, as it enables fast restores.
Achieving Seamless Backup and Speedy Database Refreshes
Arcesium wanted to achieve application-consistent snapshots for its databases, so it explored Amazon EBS snapshots using AWS Systems Manager—which provides visibility and control of a customer’s infrastructure on AWS—to discover a way to make a copy-only backup without affecting the backup chain. Arcesium was also able to further streamline the process by reducing the snapshots of all Amazon EBS volumes attached to the instance to just those that were needed. Using Amazon FSR, the company facilitated refreshes of up to 30 multiterabyte environments within the weekend time window while successfully reducing refresh costs by 58 percent and refresh time by 59 percent.
This innovative use of the technologies means that the company can now refresh multiple Amazon EBS databases over the weekend without the addition of filer-based software or other costly solutions. Refreshing client environments used to take the entire weekend, but now Arcesium only needs a few hours. This increase in both speed and performance has resulted in increased satisfaction for Arcesium’s clients, who can now successfully test their data-dependent workloads quickly and efficiently. “The capability to refresh multiple client environments provides clients the capacity to run multiple large transformation projects at the same time—for example, testing new system integration in one staging environment while testing an upgrade of an accounting system in another,” says Manikantan.
In the technology and professional services industry, it’s vital to have a plan in case of a disaster. For extensive databases, full backups take a lot of time, but Arcesium discovered that it could use application-consistent Amazon EBS snapshots and Amazon FSR for database backups as well. Using Amazon EBS snapshots, Arcesium can pause the input/output while taking the backups, and it no longer needs to stop the database. When the company uses Amazon FSR while restoring from the Amazon EBS snapshots, the target incurs a downtime of only 10–15 minutes, versus 15–20 hours for a traditional backup/restore for large databases. If the group wants to go back to a certain point in time, it can use the snapshot it has on hand—and, depending on the size, this can actually be done within 4 hours. “Using application-consistent Amazon EBS snapshots as database backups would be really helpful for any AWS customers running databases on top of Amazon EC2 instances,” says Manikantan.
Looking for New Ways to Grow on AWS
By using Amazon EBS and Amazon FSR, Arcesium can provide faster database refresh times for more clients without compromising performance or incurring downtime. Arcesium is looking to further explore what it can do on AWS. “We’re looking to create a strategy in which we can quickly move hundreds of terabytes on AWS by using complete copy clones,” says Manikantan. That type of speed and data storage would mean even faster and more efficient services for Arcesium’s global clientele.
Arcesium is a global financial technology and professional services firm. It offers a new, innovative way for alternative asset managers to scale their businesses while maintaining control of critical noninvestment activities.
Benefits of AWS
- Reduced database restore time from 15–20 hours to 10–15 minutes
- Supported up to 30 environments in a single weekend
- Reduced refresh costs by 58%
- Reduced client environment refresh time by 59%
AWS Services Used
Amazon Elastic Block Store
Amazon Elastic Block Store (EBS) is an easy to use, high-performance, block-storage service designed for use with Amazon Elastic Compute Cloud (EC2) for both throughput and transaction intensive workloads at any scale.
Amazon Elastic Compute Cloud (Amazon EC2) is a web service that provides secure, resizable compute capacity in the cloud. It is designed to make web-scale cloud computing easier for developers.
Amazon EBS Fast Snapshot Restore
Amazon EBS Fast Snapshot Restore enables you to create a volume from a snapshot that is fully initialized at creation. This eliminates the latency of I/O operations on a block when it is accessed for the first time.
AWS Systems Manager
AWS Systems Manager provides a unified user interface so you can view operational data from multiple AWS services and allows you to automate operational tasks across your AWS resources.