Overview
Wallarm AI Hypervisor traces every step.
Wallarm AI Hypervisor follows every AI request. From user prompt through the AI gateway, the LLM, and any downstream tool call, flagging sensitive data and blocking misuse in real time.
Your engineers are using AI. You know that. What you probably don't know is how many places it's running, who's triggering each call, what data is leaving for which provider, and how you would prove any of it to an auditor. Wallarm AI Hypervisor closes that gap and brings the same governance you already have for the rest of your stack to the LLM, agent, and MCP traffic flowing through your Kubernetes environment.
KNOW WHAT AI IS RUNNING
Stop discovering model providers by reading commit messages and auditing cloud bills. AI Hypervisor gives you a live inventory of every LLM provider, agent, MCP server, and tool your applications are actually using, updated continuously from real traffic. Anthropic, OpenAI, AWS Bedrock, Azure OpenAI, Gemini, Cohere, Mistral, and Together are recognized out of the box. Shadow AI surfaces next to the AI you officially approved.
KNOW WHO TRIGGERED IT
When a model returns something wrong, leaks something it shouldn't, or runs up a surprise bill, you need to know which end-user caused it. AI Hypervisor attributes every model call, agent step, and tool invocation back to the originating end-user across every internal service hop. Replay the exact prompt that produced a bad answer. Attribute LLM costs to individual users and teams. Reconstruct an AI-related incident timeline in minutes instead of days.
STOP THE BAD ONES IN REAL TIME
A misbehaving session shouldn't wait for the next deploy. Block a session, revoke a user, or stop a pattern instantly. No pod restart. No deploy cycle. No disruption to the workloads sharing the cluster. Enforcement is opt-in per namespace so platform teams can roll out one tenant at a time, while security gets a working off-switch for the rest of the estate.
PROVE IT TO AUDITORS
Whether the next conversation is EU AI Act review, SOC 2, an internal audit, or a customer security questionnaire, the evidence is already built. A current inventory of providers and tools, PII flow records traced from origin to model, an AI supply chain inventory with CVE enrichment, and full session replay. Compliance comes out of the same console your on-call team uses, not from spreadsheets assembled the week before.
ONE INSTALL, EVERY RUNTIME
AI Hypervisor covers Python, Node, Go, Java, and Ruby workloads in the same view, without application code changes, without an SDK to adopt, and without a per-language tool to operate. It supplements your existing APM, log pipeline, and SIEM rather than replacing them, ships connectors to the SOC tools you already run, and lives inside your own Kubernetes clusters.
WHO IT'S FOR
Leaders who own AI risk and need the same visibility, attribution, and audit posture for AI traffic that they have for everything else. Platform and AI infrastructure leaders running mixed- language workloads across many teams who want one consistent answer to "what is calling what, and is it safe?" Compliance and audit owners who need to prove what data left for which model provider, on whose behalf, on demand.
Highlights
- Stop discovering model providers from commit messages and cloud bills. AI Hypervisor gives you a live inventory of every LLM provider, agent, MCP server, and tool your applications are actually using, updated continuously from real traffic. Anthropic, OpenAI, AWS Bedrock, Azure OpenAI, Gemini, Cohere, Mistral, and Together are recognized out of the box. Shadow AI surfaces next to the AI you officially approved. The side projects and vendor trials show up too.
- When a model returns something wrong, leaks something it shouldn't, or runs up a surprise bill, you need to know which end-user caused it. AI Hypervisor attributes every model call, agent step, and tool invocation back to the originating end-user across every service hop. Replay the exact prompt that produced a bad answer. Attribute LLM costs by user and by team. Coverage spans Python, Node, Go, Java, and Ruby in a single view.
- A misbehaving session shouldn't wait for the next deploy. Block a user, kill a session, or stop a traffic pattern instantly. No pod restart. No change management ticket. No disruption to the workloads sharing the cluster. Enforcement is opt-in per namespace, so platform teams can roll out one tenant at a time while security gets a working off-switch for the rest of the estate. Built for production where downtime isn't an option.
Details
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Financing for AWS Marketplace purchases
Pricing
Dimension | Description | Cost/12 months |
|---|---|---|
Starter Tier - AIH | Licensed for up to 500 vCPUs per month of monitored Kubernetes, with a capacity attribute of up to 2B requests per month (RPM). Actual vCPU and RPM use is measured at the 95th percentile of the maximum used values over a rolling 30 day window. Step-up to Enterprise Tier AIH is required if vCPU or RPM counts exceed the capacity for two consecutive months. AIH is a private-offer only listing. | $75,000.00 |
Enterprise Tier - AIH | Licensed for up to 2,000 vCPUs per month of monitored Kubernetes, with a capacity attribute of up to 6B requests per month (RPM). Actual vCPU and RPM use is measured at the 95th percentile of the maximum used values over a rolling 30 day window. Step-up to Enterprise+ Tier AIH is required if vCPU or RPM counts exceed the capacity for two consecutive months. AIH is a private-offer only listing. | $250,000.00 |
Enterprise+ Tier - AIH | Licensed for up to 5,000 vCPUs per month of monitored Kubernetes, with a capacity attribute of up to 15B requests per month (RPM). Actual vCPU and RPM use is measured at the 95th percentile of the maximum used values over a rolling 30 day window. Step-up to a Strategic Tier AIH is required if vCPU or RPM counts exceed the capacity for two consecutive months. AIH is a private-offer only listing. | $500,000.00 |
Strategic Tier - AIH | Licenses above 5,000 vCPUs per month of monitored Kubernetes, and a capacity attribute over 15B requests per month (RPM) is a Strategic Tier requiring a private offer. Please contact Wallarm or a Wallarm partner. | $1,000,000.00 |
Vendor refund policy
AI Hypervisor is sold through annual private offers on AWS Marketplace. Contract terms, including renewal, step-up, and step-down mechanics, are defined in the executed private offer. Step-up tier movement is required when Licensed Capacity or RPM exceeds the contracted cap for two consecutive 30-day measurement periods. Step-down to a lower tier is permitted under the same measurements, never below the minimum commitment floor. No refunds are issued for capacity below the contracted tier.
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Software as a Service (SaaS)
SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.
Resources
Vendor resources
Support
Vendor support
24/7 Global Support
For questions about Wallarm product offerings or capabilities (pre-purchase), please contact our specialist team at the following email: productquestions@wallarm.com
Free API Security Certification: https://www.wallarm.com/api-security-certification
Documentation:
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.