Customer Stories / Software & Internet / United States

2023
Autodesk Logo

Autodesk Builds Reliable, Cost-Effective Big Data Analytics Using Amazon EMR

Learn how 3D design, engineering, and entertainment software company Autodesk migrated to Amazon EMR and improved performance and reliability while saving costs.

Achieved zero service downtime

after migrating to Amazon EMR

99.8% availability

for open-source workloads

30% reduction

in costs

Implemented autoscaling

to support growth trajectory

25% reduction

in hours spent troubleshooting

Overview

For Autodesk, data is a driving force behind business decisions and growth strategies. As such, the reliability of its big data framework is crucial for success. When faced with lengthy outages and efficiency issues with its third-party big data solution, Autodesk sought a more reliable replacement that could scale to support its continued growth.

Turning to Amazon Web Services (AWS), Autodesk successfully migrated its complex data environment to Amazon EMR, an industry-leading cloud big data solution for petabyte-scale data processing, interactive analytics, and machine learning (ML) using open-source frameworks. Now, it can run high-performing big data workloads at scale with fast performance and without lengthy outages or downtimes.

Businesswoman working at laptop in open plan office

Opportunity | Using Amazon EMR to Improve Big Data Processing at Autodesk

Founded in 1982, Autodesk produces 3D design, engineering, and entertainment software used by more than 100 million customers worldwide. Its AutoCAD software helps architects, engineers, and construction professionals create precise digital drawings. As a data-driven company, Autodesk uses analytics and insights to bring better solutions and features to its customers. However, its third-party big data solution couldn’t keep up with its growth.

“Our previous solution had a lot of reliability issues,” says Nick Wilson, senior engineering manager at Autodesk. “Data is a fundamental unit of Autodesk’s business. We needed a new provider that could deliver the reliability that we required and could scale with our growth trajectory.”

After evaluating different solutions for reliability and performance, Autodesk chose to migrate to Amazon EMR. “Autodesk is an AWS shop, and we have a great history of using AWS services,” says Wilson. “AWS also offered many more service offerings for the cost that we were paying. We could identify a path to migrate every workload from our third-party provider to Amazon EMR and other AWS services.”

kr_quotemark

By migrating to Amazon EMR, we reduced costs by 30 percent. And we achieved this cost reduction while significantly improving our security posture.”

Anitha Matta
Engineering Manager of Platform Infrastructure, Autodesk

Solution | Migrating Petabytes of Data While Strengthening Security and Reliability with the AWS Migration Acceleration Program

Migrating Autodesk’s big data workloads to Amazon EMR would not be a simple task. This effort would involve migrating workloads from 30 different teams, each of which used different coding languages and ran different types of queries. Additionally, all Autodesk’s business-critical workloads ran on the third-party solution, and the company needed to avoid unnecessary interruptions during the process. To facilitate the complex migration, Autodesk participated in the AWS Migration Acceleration Program (AWS MAP), which helps accelerate cloud migration and modernization journeys with an outcome-driven methodology. Not only did AWS MAP support Autodesk in the technical aspects of the migration, but the program provided credits to offset some of the costs.

“Whenever you migrate to a new service, compute is not going to be optimized until you get some experience with it,” says Wilson. “With the AWS MAP pricing, we minimized the migration’s impact on our bottom line and made a seamless transition.”

Autodesk runs Apache Spark, an open-source unified analytics engine, on Amazon EMR to serve as its primary compute engine. Spark clusters support all the data processing and extract, load, transform pipelines. To handle its data load, Autodesk attaches volumes powered by Amazon Elastic Block Store (Amazon EBS)—a scalable, high-performance block-storage service—to Spark instances. These services scale seamlessly to process Autodesk’s data, which amounts to 1 PB and grows every month. To power dashboards and serve analytics, Autodesk runs Presto, an open-source distributed SQL query engine, on Amazon EMR.

“Presto on Amazon EMR is a fundamental part of the developer flow,” says Wilson. “It helps our team interact with and learn from data and provides general business insights to help develop new products.”

By adopting Amazon EMR, Autodesk has vastly improved the reliability of its big data service. For workloads that rely on Apache Spark and Presto, the company achieved 99.8 percent availability and has not experienced any significant outages or service downtime. Autodesk has also reduced the hours spent troubleshooting by 25 percent.

The development team greatly appreciates this consistent reliability and can now focus its resources on other areas of the business. “I used to not want to look at company communications on Sunday. I would see that business-critical processes were broken because our third-party solution was broken,” says Wilson. “Now, our pipelines are running stably, clusters are running as expected, and we are meeting our service-level agreements. We have a much more stable system running on Amazon EMR today than we had before the migration.”

Architecture Diagram

Figure 1. Job submission and migration

Figure 2. Autodesk’s AWS architecture for big data processing

Outcome | Improving Cost, Performance, and Reliability for Big Data Processing on AWS

By migrating its big data environment to Amazon EMR, Autodesk has improved the service’s reliability, performance, and cost efficiency. “By migrating to Amazon EMR, we reduced costs by 30 percent,” says Anitha Matta, engineering manager of platform infrastructure at Autodesk. “And we achieved this cost reduction while significantly improving our security posture.”

After beginning the migration, Autodesk achieved cost parity within 6 months. Because its clusters are clearly isolated on Amazon EMR, it can attribute costs to specific workloads in near real time. Using this information, its teams can identify the return on investment for their projects and make strategic decisions to improve cost efficiency. Autodesk can also process and analyze larger sets of data at almost the same price as its third-party solution, all while improving performance and reliability.

The company’s next step is to upgrade its Apache Spark clusters and adopt a containerized architecture using Amazon Elastic Kubernetes Service (Amazon EKS), a managed service to run Kubernetes in the AWS Cloud and on-premises data centers. Autodesk is also using ML services like Amazon SageMaker Studio, a fully integrated development environment, to incorporate advanced capabilities across its organization.

“Since 2013, a big part of Autodesk’s journey has been moving to the cloud. Through this migration, we wanted to facilitate access to timely, integrated, and governed data to help drive our growth,” says Wilson. “Using AWS, we have achieved these goals. By making data centrally accessible on AWS, it can be used by teams across Autodesk, including product, business, marketing, sales, and finance. AWS services are powering critical functions within the company.”

About Autodesk

Autodesk provides software products and services across several industries to empower innovators with design. Its AutoCAD software helps architects, engineers, and construction professionals create precise drawings digitally.

AWS Services Used

Amazon EMR

Amazon EMR is the industry-leading cloud big data solution for petabyte-scale data processing, interactive analytics, and machine learning using open-source frameworks such as Apache Spark, Apache Hive, and Presto.

Learn more »

Amazon EBS

Amazon Elastic Block Store (Amazon EBS) is an easy-to-use, scalable, high-performance block-storage service designed for Amazon Elastic Compute Cloud (Amazon EC2).

Learn more »

AWS Migration Acceleration Program (MAP)

The AWS Migration Acceleration Program (MAP) is a comprehensive and proven cloud migration program based upon AWS’s experience migrating thousands of enterprise customers to the cloud.

Learn more »

More Autodesk Stories

no items found 

1

Get Started

Organizations of all sizes across all industries are transforming their businesses and delivering on their missions every day using AWS. Contact our experts and start your own AWS journey today.