Customer Stories / Software & Internet / North America

2024
Articul8 logo

Articul8 AI Improves Productivity by 35% and Helps Customers Reduce AI Deployment Time by 4x Using Amazon SageMaker HyperPod

Learn how Articul8 AI built its enterprise generative AI solution using Amazon SageMaker HyperPod.

4x

faster deployment time

5x

lower total cost of ownership

35%

improvement in productivity

Overview

Autonomous generative artificial intelligence (AI) enterprise software company, Articul8 AI, empowers enterprises to solve complex, transformational business and technical challenges. It provides a system of generative AI and other models combined with proprietary algorithms to make predictions, provide insights, and automate functions, all within the customer’s own security perimeter.

Developing, integrating, deploying, and scaling enterprise-grade generative AI solutions can be expensive and time consuming. To address this issue, Articul8 AI offers a ready-to-use generative AI product that leverages Amazon Web Services (AWS) and in-house technology to provide customers across various industries a quick and cost-effective way to transform their private data into valuable insights, with measurable business outcomes. This solution enables customers to harness the power of generative AI in a secure setting, reducing deployment time and total cost of ownership (TCO) compared to alternative options.

Opportunity | Using Amazon SageMaker HyperPod to Accelerate Model Training and Fine-Tuning for Articul8 AI

Launched in January 2024, Articul8 AI offers a full-stack, vertically optimized generative AI solution, purpose-built to meet the four core needs of its customers: scalability, speed of deployment, security, and sustainable costs. The company offers generative AI technology that includes capabilities such as multi-modal understanding, dynamic reasoning and decisioning, autonomous actioning, explainability, auditability, and domain-specific/enterprise-specific model development.

Articul8 AI’s unique, self-contained software model helps customers rapidly build, deploy, and manage enterprise-grade generative AI applications while keeping their data within their own virtual private clouds.

To build its generative AI platform, the Articul8 AI team needed to quickly and efficiently deploy large scale infrastructure resources to manage workloads for internal research and development as well as production deployment.. Articul8 AI also faced the challenge of managing GPU clusters without a dedicated infrastructure team. The company needed a solution that would increase compute cluster resiliency along with high availability and automated management while also standardizing its technology stack so that it could deploy seamlessly in its customers’ infrastructure. Articul8 AI chose to build its solution on AWS because of its reliability, scalability and the proactive support provided by the AWS service teams.

One of the most computational- and time-intensive jobs for Articul8 AI was pretraining and fine-tuning the various large language models including domain-specific/enterprise-specific models that form the backbone of Articul8 AI’s generative AI platform. At Articul8 AI’s scale, any kind of performance degradation was unacceptable because it could negatively impact the cost and time it took to train the models. So, the company sought a high-performance solution that could accelerate its model training time. The company experimented with several solutions but ultimately adopted Amazon SageMaker HyperPod, which typically reduces the time to train foundation models by up to 40 percent with a purpose-built infrastructure for distributed training at scale.

Articul8 AI worked alongside AWS to collaboratively problem-solve and go from concept to deployment, which helped the company quickly move forward with their training workloads. “When we first started, we wanted to see how this solution would grow and adapt to our needs, and we couldn’t be happier with the support we received from AWS,” says Felipe Viana, head of applied research at Articul8 AI. “Getting the support we needed in a timely manner helped us gain confidence in the Amazon SageMaker HyperPod service and sped up our AI research work.”

kr_quotemark

Almost all our development happens on AWS, and there are so many more opportunities for us to expand our use of AWS services.”

Arun Subramaniyan
Founder and CEO, Articul8 AI

Solution | Empowering Four Times Faster Customer Deployments

Articul8 AI spent over 18 months building its autonomous, vertically integrated generative AI platform in stealth mode before publicly launching in January 2024 with two main product offerings with full enterprise support. The first is the Express Bundle, which is a preconfigured product for specific use cases, such as knowledge discovery, that do not require training or fine-tuning capabilities for customers. The second option is the customizable Premium Bundle that enables continued pretraining or fine-tuning of domain-specific/enterprise-specific models along with a host of other features and AI packs required for production-grade enterprise deployments. The solution stack for both these options is built on AWS and uses Amazon SageMaker HyperPod for model training and fine-tuning.

Using Amazon SageMaker HyperPod made it possible for Articul8 AI to enhance performance and get its solution to market quickly. Using Amazon SageMaker HyperPod removed the need for the company to manually manage its high performance compute clusters. As a result, Articul8 AI got its cluster usage up to nearly 100 percent. In addition, the company saw its productivity improve by up to 35 percent. Amazon SageMaker HyperPod also monitors cluster health and automatically replaces faulty nodes. This reduced Articul8 AI’s time to train and made the deployment frictionless for researchers, improving overall productivity. “The time we’ll save by not managing the clusters ourselves will really add up,” says Arun Subramaniyan, founder and CEO at Articul8 AI. “We’re investing in technologies such as Amazon SageMaker HyperPod up front to help our teams operate efficiently and to scale up our operations rapidly.”

Articul8 AI also integrated Amazon SageMaker HyperPod with other AWS services, including Amazon Managed Grafana, which offers scalable and secure data visualization for operational metrics, logs, and traces. The company further used Amazon SageMaker HyperPod lifecycle scripts to customize their cluster environment and install required libraries and packages including open-source exporters. With these additions, Articul8 AI achieved near real-time observability of its GPU resources in a single-pane-of-glass dashboard.

The prebuilt, out-of-box solution helps Articul8 AI’s customers accelerate their generative AI journeys with access to a collection of AI models as well as a multitude of other features, all deployed and operated autonomously within the customer’s secure environment. Articul8 AI’s customers experience up to four times faster deployment times compared with other generative AI solutions, cutting a potentially months-long process down to mere weeks for certain use cases. Articul8 AI’s optimizations on AWS and differentiated business model also help customers to reduce their total cost of ownership by up to five times when deploying their generative AI applications in production at scale.

Outcome | Adding Value for Articul8 AI’s End Customers

Early adopters in the financial services, cybersecurity, semiconductor, government, aerospace, and telecommunications industries are already benefiting from the time and cost savings provided by Articul8 AI’s solution. Articul8 AI is dedicated to helping its customers maximize their business potential by helping them harness the power of generative AI–based technologies. Moving forward, Articul8 AI will continue to scale its use of Amazon SageMaker HyperPod to run pretraining and fine-tuning jobs faster and more efficiently.

“Amazon SageMaker HyperPod is only the start of Articul8 AI’s use of AWS services,” says Subramaniyan. “Almost all our development happens on AWS, and there are so many more opportunities for us to expand our use of AWS services. We’re looking forward to shaping the future of generative AI alongside AWS and our customers.”

“Amazon SageMaker HyperPod has helped us tremendously in managing and operating our computational resources more efficiently with minimum downtime. We were early adopters of the Slurm-based HyperPod service and have benefitted from its ease-of-use and resiliency features, resulting in up to 35% productivity improvement and rapid scale up of our generative AI deployments”, says Subramaniyan. “As a Kubernetes house, we are now thrilled to welcome the launch of Amazon EKS support for SageMaker HyperPod. This is a game changer for us as it integrates seamlessly with our existing training pipelines and makes it even easier for us to manage and operate our large-scale Kubernetes clusters. In addition, this also helps our end customers as we are now able to package and productize this capability into our generative AI platform, enabling our customers to run their own training and finetuning workloads in a more streamlined manner. Amazon SageMaker HyperPod is only the start of Articul8 AI’s use of AWS services. Almost all our development happens on AWS, and there are so many more opportunities for us to expand our use of AWS services. We’re looking forward to shaping the future of generative AI alongside AWS and our customers.”

About Articul8

Articul8 AI, an enterprise software company, provides an autonomous, full-stack, vertically optimized generative AI software platform that empowers companies to build, deploy, and manage generative AI applications within their own security perimeter.

AWS Services Used

Amazon SageMaker HyperPod

AmazonSageMaker HyperPod removes the undifferentiated heavy lifting involved in building and optimizing machine learning (ML) infrastructure for training foundation models (FMs), reducing training time by up to 40%.

Learn more »

Amazon Managed Grafana

Amazon Managed Grafana is a fully managed service for Grafana, a popular open-source analytics platform that enables you to query, visualize, and alert on your metrics, logs, and traces.

Learn more »

More Software & Internet Customer Stories

no items found 

1

Get Started

Organizations of all sizes across all industries are transforming their businesses and delivering on their missions every day using AWS. Contact our experts and start your own AWS journey today.