Overview
Trieve Vector Inference is an in-VPC solution for lightning-fast vector inference, unlocking performance and productivity by eliminating cloud latency and rate limits.
SaaS offerings for text embeddings have 2 major issues: 1) High latency due to batch processing 2) Heavy rate limits. This fuzzies the end-user experience and makes ingestion of large datasets impossible.
Host your embeddings locally for maximum speed, control, and scalability using any private, custom, or open-source model. TVI is great for large-scale deployments demanding high throughput.
Benchmarks can be found at https://docs.trieve.ai/vector-inference/introduction .
Highlights
- Blazing-Fast Inference: Achieve ultra-low latency for seamless embedding generation, eliminating bottlenecks in your pipelines.
- Unmetered Performance: Process massive datasets without rate limits, ensuring consistent performance and a frictionless user experience.
- Support Any Embedding Model: Use your preferred private, custom, or open-source models for maximum flexibility.
Details
Features and programs
Financing for AWS Marketplace purchases
Pricing
- $500.001/month
Vendor refund policy
Trieve backs all products with a 45 day integration guarantee. Simply provide proof that TVI was unable to run on your clusters and we will refund you fees paid to Trieve.
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Trieve Vector Inference via helm
- Amazon EKS
Helm chart
Helm charts are Kubernetes YAML manifests combined into a single package that can be installed on Kubernetes clusters. The containerized application is deployed on a cluster by running a single Helm install command to install the seller-provided Helm chart.
Version release notes
Trieve Vector Inference version 1
Additional details
Usage instructions
Support
Vendor support
Trieve General Support is available 24/7 with a maximum 6 hour response times. SLAs are available as custom add-ons.
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.