
Overview
Jina Embeddings v2 Small model is optimized for speed of inference and memory efficiency - For higher accuracy, use the Base model.
jina-embeddings-v2-small-en is an open-source English embedding model supporting 8192 sequence length. This state-of-the-art AI embedding model enables many applications, such as document clustering, classification, content personalization, vector search, or retrieval augmented generation.
Highlights
- Use-cases: Vector search, retrieval augmented generation, long document clustering, sentiment analysis. Extended context length: This model uniquely support an 8K context length, enabling them to process and understand larger chunks of data in a single pass, resulting in richer embeddings and more accurate predictions. Model size: 32.7M parameters. High performance over tasks across the board: Our model ranks amongst the top performing ones on HuggingFace’s MTEB leaderboard for embedding models - especially considering its small size and extended context length.
- The backbone of this model was pretrained on the C4 dataset. This model is further trained on Jina AI's collection of more than 400 millions of sentence pairs and hard negatives. These pairs were obtained from various domains and were carefully selected through a thorough cleaning process.
Details
Unlock automation with AI agent solutions

Features and programs
Financing for AWS Marketplace purchases
Pricing
Free trial
Dimension | Description | Cost/host/hour |
|---|---|---|
ml.g4dn.xlarge Inference (Batch) Recommended | Model inference on the ml.g4dn.xlarge instance type, batch mode | $1.50 |
ml.g5.xlarge Inference (Real-Time) Recommended | Model inference on the ml.g5.xlarge instance type, real-time mode | $2.50 |
ml.p2.xlarge Inference (Batch) | Model inference on the ml.p2.xlarge instance type, batch mode | $2.30 |
ml.p3.8xlarge Inference (Batch) | Model inference on the ml.p3.8xlarge instance type, batch mode | $25.00 |
ml.g4dn.4xlarge Inference (Batch) | Model inference on the ml.g4dn.4xlarge instance type, batch mode | $4.00 |
ml.p3.2xlarge Inference (Batch) | Model inference on the ml.p3.2xlarge instance type, batch mode | $7.00 |
ml.g4dn.16xlarge Inference (Batch) | Model inference on the ml.g4dn.16xlarge instance type, batch mode | $14.50 |
ml.p2.8xlarge Inference (Batch) | Model inference on the ml.p2.8xlarge instance type, batch mode | $18.00 |
ml.g4dn.8xlarge Inference (Batch) | Model inference on the ml.g4dn.8xlarge instance type, batch mode | $7.60 |
ml.g4dn.12xlarge Inference (Batch) | Model inference on the ml.g4dn.12xlarge instance type, batch mode | $11.25 |
Vendor refund policy
Refunds to be processed under the conditions specified in EULA.
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Amazon SageMaker model
An Amazon SageMaker model package is a pre-trained machine learning model ready to use without additional training. Use the model package to create a model on Amazon SageMaker for real-time inference or batch processing. Amazon SageMaker is a fully managed platform for building, training, and deploying machine learning models at scale.
Version release notes
Monetization and tokens counting
Additional details
Inputs
- Summary
The model accepts JSON inputs. Texts must be passed in the following format.
{ "data": [ {"text": "How is the weather today?"}, {"text": "What is the weather like today?"}, {"text": "What's the color of an orange?"} ] }
- Input MIME type
- text/csv
Input data descriptions
The following table describes supported input data fields for real-time inference and batch transform.
Field name | Description | Constraints | Required |
|---|---|---|---|
data | An array of strings for the model to embed.
| Type: FreeText | Yes |
text | An array of strings for the model to embed.
| Type: FreeText | Yes |
Resources
Vendor resources
Support
Vendor support
We offer support through Jina AI enterprise support team.
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Similar products



