
2023

Creating the Viral Dream App Using Amazon ECS and AWS Fargate with WOMBO

Learn how AI startup WOMBO delivers accessible AI to millions using Amazon ECS, AWS Fargate, and Amazon EC2 G5 Instances.

170 million downloads (to date) with automatic scaling

5.2 million monthly active users supported while going viral

2 billion unique images generated to date

Deploy and launch new apps in just months

Reduced expenses while improving performance

Overview

WOMBO, a Toronto-based artificial intelligence (AI) startup, needed its AI lip-synching and image-generation apps to scale quickly enough to meet a viral surge of users without compromising speed or user experience. To meet that need, WOMBO chose to migrate its workloads to Amazon Web Services (AWS) using Amazon Elastic Container Service (Amazon ECS), a fully managed container orchestration service. By adopting Amazon ECS, WOMBO and its lean team of engineers successfully managed viral scaling, optimized costs, and delivered its apps to a huge audience.


Opportunity | Migrating to Amazon ECS to Meet Viral Demand

WOMBO is all about bringing apps to everyone, everywhere. “We’re the fun and accessible face of AI,” says Vivek Bhakta, WOMBO’s cofounder and head of infrastructure. The company’s mission is to make AI accessible to all so that everyone from kids to grandparents can delight in the magic of AI. WOMBO offers free apps that run on nearly any device. “You don’t need the best smartphone to run our app because we do all the processing in the cloud,” says Bhakta.

WOMBO’s biggest challenge to achieving its goals was scalability. Creating and serving AI-generated content can require 12,000 graphics processing units (GPUs) per day. “Managing all those GPUs manually would be impossible without an orchestration solution,” says Bhakta.

At first, the company was running GPUs through a different cloud provider, but as it closed in on its first app’s launch date, WOMBO realized that it needed more scalability. “We found it was very hard for us to optimize our workloads for that cloud,” says Bhakta. With an app designed to work for everyone, WOMBO needed to think ahead and plan for massive scaling. To scale its apps quickly and affordably, WOMBO migrated to AWS using Amazon ECS. “We were able to quickly adopt Amazon ECS without having to worry about configuring infrastructure,” says Bhakta. “It allows us to deploy a cluster within 15 minutes.”
 

“Our infrastructure has been largely on autopilot. Using Amazon ECS definitely saves a lot of time for us.”

Vivek Bhakta
Cofounder and Head of Infrastructure, WOMBO

Solution | Scaling to Millions of Users with a Lean Team

WOMBO’s first app went massively viral in 2021, hitting 25 million downloads 3 weeks after its launch. The debut app’s success motivated the WOMBO team to develop a second app called Dream, an AI art generator that helps users create beautiful images by combining unique prompts with custom art styles. The team was able to quickly build and deploy the app using fully managed AWS solutions. “Most of our time was not spent on infrastructure but on the actual coding,” says Bhakta. “Generally, we would need a lot of time to figure out scaling and infrastructure, but in this case, it took less than 1 week.” In May 2022, Dream also went viral, supporting 5.2 million monthly active users.

With a modestly sized engineering team, WOMBO handles a massive customer base. The team launched its first app with only three engineers. At peak capacity, WOMBO operates with a dedicated team of seven engineers and four additional professionals focusing on the backend architecture. WOMBO keeps the team this lean by using AWS tools strategically to run applications and manage data efficiently. It runs its application workloads on serverless containers by using Amazon ECS with AWS Fargate, a serverless, pay-as-you-go compute engine, so its engineers can focus on building applications and deploying APIs quickly without managing servers. “On AWS Fargate, there’s nothing for us to manage other than deciding how many nodes we want running,” says Bhakta. As a result, the WOMBO team can direct its resources and talents toward app development, innovation, and enhancing the overall user experience.

To maintain smooth operations during viral surges, WOMBO uses AWS Lambda, a compute service that lets users run code without provisioning or managing servers. WOMBO uses a database to store all the images that Dream generates—almost two billion images since 2022. Then it uses AWS Lambda to check the database, query for tasks that don’t need to live in the database any longer, and export those tasks to a storage service. “We’re not deleting them forever,” says Bhakta. “They’re just getting saved elsewhere. And we don’t have to pay for unused compute time. It’s extremely efficient.”
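The archival flow described above can be sketched roughly as follows. This is a minimal illustration, not WOMBO's implementation: the `Task` record, the in-memory `InMemoryDatabase` and `InMemoryArchive` stand-ins, the 30-day `RETENTION` window, and the `make_handler` helper are all hypothetical names and values chosen for the example. Only the `handler(event, context)` shape follows the usual AWS Lambda Python convention.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta, timezone

# Assumed retention window; the story does not state a real value.
RETENTION = timedelta(days=30)


@dataclass
class Task:
    task_id: str
    finished_at: datetime
    image_ref: str


@dataclass
class InMemoryDatabase:
    """Hypothetical stand-in for the database that stores generation tasks."""
    tasks: dict = field(default_factory=dict)

    def find_finished_before(self, cutoff):
        # Returns a new list, so callers may delete while iterating over it.
        return [t for t in self.tasks.values() if t.finished_at < cutoff]

    def delete(self, task_id):
        del self.tasks[task_id]


@dataclass
class InMemoryArchive:
    """Hypothetical stand-in for the storage service that receives old tasks."""
    objects: dict = field(default_factory=dict)

    def put(self, key, task):
        self.objects[key] = task


def archive_stale_tasks(db, archive, now):
    """Move tasks older than RETENTION from the database to the archive."""
    cutoff = now - RETENTION
    moved = 0
    for task in db.find_finished_before(cutoff):
        # Write the archive copy first so nothing is lost if the delete fails.
        archive.put(f"tasks/{task.task_id}", task)
        db.delete(task.task_id)
        moved += 1
    return moved


def make_handler(db, archive, clock=lambda: datetime.now(timezone.utc)):
    """Build a Lambda-style entry point bound to the given stores."""
    def handler(event, context):
        return {"archived": archive_stale_tasks(db, archive, clock())}
    return handler
```

In a real deployment the two stand-ins would be replaced by clients for the actual database and storage service, and the function would typically run on a schedule, so compute is billed only while a sweep is running.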

By taking advantage of the efficiency of AWS services, the WOMBO team has lowered its bills. “When we moved over to AWS, we could select the right instance size for our GPUs, which helped us optimize for cost and performance,” says Bhakta. “Being able to access compute and scale up that compute on demand is extremely valuable. If we tried to launch this product without AWS, it would feel next to impossible.”

WOMBO improved price performance further when AWS released Amazon Elastic Compute Cloud (Amazon EC2) G5 Instances, the latest generation of NVIDIA GPU-based instances, which can be used for a wide range of graphics-intensive and machine learning use cases. “We realized that we’d get more than three times the output using Amazon EC2 G5 Instances,” says Bhakta. “Now, we’ve shifted our whole workload to those instances, and it has changed the way that we operate because we have better margins.”

WOMBO’s trust in the reliability of Amazon ECS has led to streamlined operations. “We don’t have to spend the time researching all the other solutions because we can trust that with Amazon ECS, we’ll be able to handle whatever we put on it and it’s going to be easy,” says Bhakta. “Our infrastructure has been largely on autopilot. Using Amazon ECS definitely saves a lot of time for us.”

Outcome | Preparing to Create the Next Viral Sensation

On AWS, WOMBO gained the flexibility to scale up as needed to meet unprecedented demand while freeing up its team to develop the next viral AI app. By using services such as Amazon ECS, AWS Fargate, and AWS Lambda, WOMBO can manage millions of users and billions of pieces of AI-generated content.

“It was easy to get started, and it was easy to scale up,” says Bhakta. “It’s been fantastic for us.” Now, WOMBO’s apps have over 170 million downloads, and the numbers are still rising. With 2 years of experience as of 2023, WOMBO continues to harness the power of AWS to unleash creativity and joy through the power of AI.

 

About WOMBO

WOMBO is a Toronto-based AI startup revolutionizing the way that people interact with AI. With viral lip-synching and image-generation apps, WOMBO aims to make AI accessible to all, delivering joy, creativity, and magic to millions worldwide.

AWS Services Used

Amazon ECS

Amazon Elastic Container Service (Amazon ECS) is a fully managed container orchestration service that simplifies your deployment, management, and scaling of containerized applications. 


AWS Fargate

AWS Fargate is a serverless, pay-as-you-go compute engine that lets you focus on building applications without managing servers. AWS Fargate is compatible with both Amazon Elastic Container Service (Amazon ECS) and Amazon Elastic Kubernetes Service (Amazon EKS).


AWS Lambda

AWS Lambda is a serverless, event-driven compute service that lets you run code for virtually any type of application or backend service without provisioning or managing servers. 


Amazon EC2 G5 Instances

Amazon EC2 G5 Instances are the latest generation of NVIDIA GPU-based instances that can be used for a wide range of graphics-intensive and machine learning use cases.


