Overview
Jina Embeddings v5 Omni Small is the multimodal member of the latest generation of Jina AI's open-weight embedding family. Built on a Qwen3-0.6B text backbone with a SigLIP2 vision tower and a Whisper-large-v3 audio tower, its 1.56B parameters map text, images, video, audio, and PDF documents into a single shared embedding space, so you can index any modality and retrieve any other from one vector index. This enables cross-modal search, multimodal RAG, visual document retrieval over PDFs and scans, video moment retrieval, and audio semantic search without separate per-modality pipelines.
Text embeddings are identical to jina-embeddings-v5-text-small, so multimodal content drops into an existing v5-text index with no reindexing. The model handles a 32,768-token text context, supports 30+ languages, and matches the retrieval quality of omni models more than four times its size. Matryoshka Representation Learning lets you truncate embeddings from 1024 down to 32 dimensions without retraining, trading storage cost for marginal recall loss. Four task-specific LoRA adapters (retrieval, text-matching, clustering, and classification) tune the same base model for different downstream workloads at request time.
Highlights
- One shared embedding space for every modality: index text, images, video, audio, and PDFs together and query across them, so a text query can retrieve a video frame, a scanned page, or an audio clip from a single vector index.
- Drop-in compatible with jina-embeddings-v5-text-small: text embeddings are identical, so you can add multimodal content to an existing v5-text index with no reindexing. At 1.56B parameters it matches the quality of omni models more than four times its size.
- Matryoshka dimensions from 32 to 1024 with four task-specific LoRA adapters: truncate embeddings to fit your storage and latency budget, and switch between retrieval, text-matching, clustering, and classification per request from a single deployed model.
Details
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Trust Center
Financing for AWS Marketplace purchases
Pricing
Dimension | Description | Cost/host/hour |
|---|---|---|
ml.g5.xlarge Inference (Batch) Recommended | Model inference on the ml.g5.xlarge instance type, batch mode | $2.50 |
ml.g5.xlarge Inference (Real-Time) Recommended | Model inference on the ml.g5.xlarge instance type, real-time mode | $2.50 |
ml.g4dn.xlarge Inference (Batch) | Model inference on the ml.g4dn.xlarge instance type, batch mode | $2.50 |
ml.g4dn.2xlarge Inference (Batch) | Model inference on the ml.g4dn.2xlarge instance type, batch mode | $2.50 |
ml.g4dn.4xlarge Inference (Batch) | Model inference on the ml.g4dn.4xlarge instance type, batch mode | $2.50 |
ml.g4dn.8xlarge Inference (Batch) | Model inference on the ml.g4dn.8xlarge instance type, batch mode | $2.50 |
ml.g4dn.12xlarge Inference (Batch) | Model inference on the ml.g4dn.12xlarge instance type, batch mode | $2.50 |
ml.g4dn.16xlarge Inference (Batch) | Model inference on the ml.g4dn.16xlarge instance type, batch mode | $2.50 |
ml.g6.xlarge Inference (Batch) | Model inference on the ml.g6.xlarge instance type, batch mode | $2.50 |
ml.g5.2xlarge Inference (Real-Time) | Model inference on the ml.g5.2xlarge instance type, real-time mode | $2.50 |
Vendor refund policy
For support, please visit https://jina.ai/contact-sales or https://www.elastic.co/support .
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Amazon SageMaker model
An Amazon SageMaker model package is a pre-trained machine learning model ready to use without additional training. Use the model package to create a model on Amazon SageMaker for real-time inference or batch processing. Amazon SageMaker is a fully managed platform for building, training, and deploying machine learning models at scale.
Version release notes
Initial release
Additional details
Inputs
- Summary
The model accepts JSON inputs. Texts must be passed in the following format.
- Input MIME type
- text/csv
Support
Vendor support
For support, please visit https://jina.ai/contact-sales or
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Similar products




