Customer Stories / Advertising & Marketing
2022
HUMAN Security Accelerates ML Training and Time to Market Using Amazon SageMaker
Cybersecurity company HUMAN Security has tripled the number of machine learning (ML) models that it has deployed to production and improved the quality of its digital solutions by using Amazon SageMaker. HUMAN Security offers solutions that use ML to detect fraud, and the company wanted to accelerate its time to market by automating the training and deployment of its ML models.
Weeks to hours
Reduction of time it takes to train new ML models
3x
Number of ML models deployed to production
5x
Amount of data ingested compared with previous system
15 trillion
Online interactions validated every week
Overview
HUMAN Security wanted to iterate its ML models more quickly and accelerate its time to market so that it could improve the performance of MediaGuard, its leading solution that helps protect media companies and advertisers from ad fraud. However, when the company first released MediaGuard, its engineering teams trained and deployed all its ML models manually. This manual process consumed a significant portion of HUMAN Security’s resources, and in some cases, it took the company weeks to deploy an ML model to production.
To train its ML models more efficiently, HUMAN Security wanted to automate its manual training process. Because of its history of working on Amazon Web Services (AWS), HUMAN Security adopted Amazon SageMaker, which provides companies the ability to build, train, and deploy ML models for virtually any use case with fully managed infrastructure, tools, and workflows. By combining automation with scalability, the company has tripled the number of ML models that it has trained while ingesting five times the amount of data compared to its previous process. Now, HUMAN Security can train and deploy ML models within a few hours, accelerating its time to market and improving the quality of its product offering.
Opportunity | Disrupting the Economics of Cybercrime
HUMAN Security uses a modern defense strategy, which includes disruptions, network effect, and internet visibility, to disrupt the economics of cybercrime. It helps businesses across all sectors increase the security of their digital presence by offering a wide range of cybersecurity solutions that help companies protect their digital assets from fraud and online bots imitating humans. For digital advertisers, the company created MediaGuard, an ad tech solution that uses ML in the Human Defense Platform to predict the validity of online advertising impressions in near real time across all digital channels and formats.
Because online bots are becoming increasingly sophisticated, HUMAN Security maintains strict latency and accuracy requirements for MediaGuard, and its team of engineers continually iterates new ML models to enhance its performance. However, when HUMAN Security rolled out this solution, the process for training its ML models was entirely manual and involved running a number of scripts and copying and pasting different configurations. In many cases, it took HUMAN Security weeks to deploy new ML models. “We wanted to save human time,” says Austin Leirvik, staff data scientist at HUMAN Security. “We were looking to set up a complete data pipeline that would do the data preparation, data extraction, model training, and off-line model evaluation all at the push of a button.”
Since its founding in 2012, HUMAN Security has relied on AWS for cloud solutions, and in 2020, it engaged the AWS team to mature its ML capabilities. “We collaborated every 2 weeks,” says Leirvik. “We received lots of feedback on how we could automate our model training, and we saw SageMaker as a tool that we could use to solve the problems that we were facing.”
By using Amazon SageMaker, we’ve substantially reduced the amount of time that is needed to train ML models.”
Austin Leirvik
Staff Data Scientist, HUMAN Security
Solution | Automating the Training Process for ML
While HUMAN Security engaged the AWS team, it also participated in multiple training opportunities through AWS, including AWS Partner Network Immersion Days, which are customer workshops delivered by AWS Partners. These training opportunities helped HUMAN Security upskill its staff and gain a deeper understanding of the ML model lifecycle. HUMAN Security also adopted Snowflake Data Cloud, a solution for data warehousing, data lakes, data engineering, data science, data application development, and data sharing from Snowflake, an AWS Partner. The company uses this solution to process and store its data tables at scale. “For a typical model training run, we’re working with around 50 million data points,” says Leirvik. “Because we can do our off-line evaluation on a bigger dataset, we have a much bigger picture of the long tail, which is really nice.”
The company also began using AWS Glue, a simple, scalable, and serverless data integration service. HUMAN Security uses AWS Glue for its extraction jobs and to prepare its data for querying. After the data has been prepared, HUMAN Security uses SageMaker to build, train, and deploy new ML models. “By using Amazon SageMaker, we’ve substantially reduced the amount of time that is needed to train ML models,” says Leirvik. “We have complete traceability and reproducibility across all our models.” Previously, training a new ML model could take HUMAN Security several weeks. Now, the company can build, train, and deploy a new ML model within hours.
Moreover, HUMAN Security runs its workloads using Amazon Elastic Compute Cloud (Amazon EC2) M5 Instances, which offer balanced compute, memory, and networking resources for general-purpose workloads. Since switching to this Amazon EC2 instance type, the company has increased its cost savings by 15 percent, and it can quickly scale demand. This scalability helps HUMAN Security power its ML models to validate the humanity of 15 trillion online interactions every week. “We’ve been very happy with the scalability and reliability of Amazon EC2 M5 Instances,” says Leirvik. “We’ve been able to increase the amount of data that we’re working with by five times.”
To achieve full automation, HUMAN Security set up step functions across all its AWS solutions by defining a set of configuration files using Amazon States Language and adding those files to its repository with the rest of its ML codebase. Anytime a change is made to its codebase, the company automatically redeploys those step functions, which has reduced the complexity of its workflows. This automation has helped the company accelerate its time to market and enhance its business agility. With additional time savings, HUMAN Security has refocused its efforts on releasing new predictive features for MediaGuard. “Using AWS, we’ve tripled our number of deployments compared with our previous process,” says Leirvik. “Now we can react more quickly when we see a performance problem emerging.”
Outcome | Applying Its Learning to Other ML Models
HUMAN Security plans to apply its learnings from this project to other ML models that it has in production. It will also continue using AWS services for a wide variety of use cases across the company. “Working alongside the AWS team has been a very positive experience,” says Leirvik. “The AWS team helped us look at the problem that we were facing in a new way and kept us on pace so we would succeed.”
About HUMAN Security
HUMAN Security helps businesses protect their digital assets from fraud by offering cybersecurity solutions that use ML to validate the authenticity of online interactions. Since 2012, the company has been an Independent Software Vendor on AWS.
AWS Services Used
Amazon SageMaker
Amazon SageMaker is built on Amazon’s two decades of experience developing real-world ML applications, including product recommendations, personalization, intelligent shopping, robotics, and voice-assisted devices.
AWS Glue
AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.
Learn more »
Amazon EC2
Amazon Elastic Compute Cloud (Amazon EC2) offers the broadest and deepest compute platform, with over 500 instances and choice of the latest processor, storage, networking, operating system, and purchase model to help you best match the needs of your workload.
Learn more »
Amazon EC2 M5 Instances
M5 instances offer a balance of compute, memory, and networking resources for a broad range of workloads.
Learn more »
Get Started
Organizations of all sizes across all industries are transforming their businesses and delivering on their missions every day using AWS. Contact our experts and start your own AWS journey today.