Overview
Jina Embeddings v5 Omni Nano is the compact multimodal member of the latest generation of Jina AI's open-weight embedding family. Built on a EuroBERT-210m text backbone with a SigLIP2 Base vision tower and a Whisper-large-v3 audio tower, its 1.04B parameters map text, images, video, audio, and PDF documents into a single shared embedding space, so you can index any modality and retrieve any other from one vector index. This enables cross-modal search, multimodal RAG, visual document retrieval over PDFs and scans, video moment retrieval, and audio semantic search without separate per-modality pipelines. Text embeddings are identical to jina-embeddings-v5-text-nano, so multimodal content drops into an existing v5-text-nano index with no reindexing. The model handles an 8,192-token text context, supports multilingual text across dozens of languages, and delivers competitive document retrieval at just over a billion parameters, outperforming similarly sized open-weight omni models on the multimodal Pareto frontier. Matryoshka Representation Learning lets you truncate embeddings from 768 down to 32 dimensions without retraining, trading storage cost for marginal recall loss. Four task-specific LoRA adapters (retrieval, text-matching, clustering, and classification) tune the same base model for different downstream workloads at request time.
Highlights
- One shared embedding space for every modality: index text, images, video, audio, and PDFs together and query across them, so a text query can retrieve a video frame, a scanned page, or an audio clip from a single vector index.
- Drop-in compatible with jina-embeddings-v5-text-nano: text embeddings are identical, so you can add multimodal content to an existing v5-text-nano index with no reindexing. At just 1.04B parameters it runs lighter and cheaper than larger omni models while staying competitive on cross-modal retrieval.
- Matryoshka dimensions from 32 to 768 with four task-specific LoRA adapters: truncate embeddings to fit your storage and latency budget, and switch between retrieval, text-matching, clustering, and classification per request from a single deployed model.
Details
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Trust Center
Financing for AWS Marketplace purchases
Pricing
Dimension | Description | Cost/host/hour |
|---|---|---|
ml.g5.xlarge Inference (Batch) Recommended | Model inference on the ml.g5.xlarge instance type, batch mode | $2.50 |
ml.g5.xlarge Inference (Real-Time) Recommended | Model inference on the ml.g5.xlarge instance type, real-time mode | $2.50 |
ml.g4dn.xlarge Inference (Batch) | Model inference on the ml.g4dn.xlarge instance type, batch mode | $2.50 |
ml.g4dn.2xlarge Inference (Batch) | Model inference on the ml.g4dn.2xlarge instance type, batch mode | $2.50 |
ml.g4dn.4xlarge Inference (Batch) | Model inference on the ml.g4dn.4xlarge instance type, batch mode | $2.50 |
ml.g4dn.8xlarge Inference (Batch) | Model inference on the ml.g4dn.8xlarge instance type, batch mode | $2.50 |
ml.g4dn.12xlarge Inference (Batch) | Model inference on the ml.g4dn.12xlarge instance type, batch mode | $2.50 |
ml.g4dn.16xlarge Inference (Batch) | Model inference on the ml.g4dn.16xlarge instance type, batch mode | $2.50 |
ml.g6.xlarge Inference (Batch) | Model inference on the ml.g6.xlarge instance type, batch mode | $2.50 |
ml.g5.2xlarge Inference (Real-Time) | Model inference on the ml.g5.2xlarge instance type, real-time mode | $2.50 |
Vendor refund policy
For support, please visit https://jina.ai/contact-sales or https://www.elastic.co/support .
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Amazon SageMaker model
An Amazon SageMaker model package is a pre-trained machine learning model ready to use without additional training. Use the model package to create a model on Amazon SageMaker for real-time inference or batch processing. Amazon SageMaker is a fully managed platform for building, training, and deploying machine learning models at scale.
Version release notes
Initial release
Additional details
Inputs
- Summary
The model accepts JSON inputs. Texts must be passed in the following format.
- Input MIME type
- text/csv
Support
Vendor support
For support, please visit https://jina.ai/contact-sales or
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Similar products




