Overview
Kubetorch is a modern interface for running heterogeneous ML workloads on Kubernetes. While Kubernetes is the standard compute foundation, development and deployment on it remain challenging. Notebook-style or VM-based workflows slow productionization, limit distributed scaling, and make inference difficult. Developing directly on Kubernetes requires complex YAML and long iteration cycles, even for small changes.
With Kubetorch, you write programs in regular Python and can develop and debug interactively while running at scale on Kubernetes. Kubetorch's magic caching and deployment system enables nearly instantaneous relaunch of your programs. Meanwhile, your code executes consistently across any environment, whether a teammate's laptop, CI, an orchestrator, or a production application.
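To make this concrete, a Kubetorch program might look like the sketch below. The specific names used here (`kt.Compute`, `kt.Image`, `kt.fn`, `.to`) are assumptions for illustration rather than a verbatim copy of the Kubetorch API; see the Kubetorch documentation for the current interface.

```python
# Illustrative sketch only: kt.Compute / kt.Image / kt.fn are assumed
# names, not necessarily the exact Kubetorch API surface.
import kubetorch as kt

def train(epochs: int = 3) -> float:
    # Ordinary Python; the same function runs locally or on the cluster.
    loss = 0.0
    for epoch in range(epochs):
        loss = 1.0 / (epoch + 1)  # stand-in for a real training step
    return loss

if __name__ == "__main__":
    # Describe the Kubernetes compute the function should run on.
    gpu = kt.Compute(gpus=1, image=kt.Image(image_id="pytorch/pytorch"))

    # Deploy the function to that compute; code edits redeploy quickly
    # thanks to caching, rather than requiring a full rebuild.
    remote_train = kt.fn(train).to(gpu)

    # Call it like a normal Python function; execution happens on Kubernetes.
    print(remote_train(epochs=5))
```

Because the entrypoint is plain Python, the same script can be run from a laptop during development, from CI, or from an orchestrator in production without modification.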
Kubetorch combines a Python library with a Kubernetes operator deployed in your cloud account to provide a simple, flexible, and powerful way to build, deploy, and manage AI/ML applications. It can be adopted incrementally within an existing ML stack or used as a full replacement for training, batch processing, inference, hyperparameter optimization, and pipelining tools such as Kubeflow, SageMaker, or Vertex.
Highlights
- Engineers can iterate instantly at scale, re-executing code changes in seconds on full-scale Kubernetes compute.
- Zero research-to-production delay thanks to identical, reproducible execution across local and production environments.
- Fault tolerance is built in, with pipelines automatically handling hardware faults, preemptions, and OOMs while providing transparent logging and detailed telemetry.
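For contrast with the built-in fault tolerance described above, the plain-Python sketch below shows the kind of retry logic teams typically hand-roll without platform-level handling of hardware faults, preemptions, and OOMs. It is illustrative only and is not part of the Kubetorch API; the exception types are stand-ins for real infrastructure failures.

```python
import time

def run_with_retries(fn, max_attempts: int = 3, backoff_s: float = 30.0):
    # Hand-rolled fault tolerance: catch infrastructure-style failures
    # and re-launch with backoff, up to a bounded number of attempts.
    for attempt in range(1, max_attempts + 1):
        try:
            return fn()
        except (MemoryError, OSError) as err:  # stand-ins for OOMs / node loss
            if attempt == max_attempts:
                raise
            print(f"attempt {attempt} failed ({err!r}); retrying...")
            time.sleep(backoff_s)
```

A platform that restarts failed work automatically removes the need for wrappers like this in every pipeline, while still surfacing logs and telemetry for each attempt.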
Support
Contact support@run.house for all support needs.