Create a new account

About

Amazon Sagemaker

Amazon SageMaker is a fully-managed platform that enables developers and data scientists to quickly and easily build, train, and deploy machine learning models at any scale. With Amazon SageMaker, all the barriers and complexity that typically slow down developers who want to use machine learning are removed. The service includes models that can be used together or independently to build, train, and deploy your machine learning models.

Jina Embeddings v2 Base - en

By:

Jina AI

Latest Version:

3.2

Text embedding model (base) for input of size up to 8192 tokens

Continue to Subscribe

Product Overview

Jina Embeddings v2 Base model is optimized for highly accurate embeddings - For speed of inference and memory efficiency use the Small model. jina-embeddings-v2-base-en is an open-source English embedding model supporting 8192 sequence length. This state-of-the-art AI embedding model enables many applications, such as document clustering, classification, content personalization, vector search, or retrieval augmented generation.

Key Data

Version

3.2

Usage Instructions

Release Notes

Jina AI

Highlights

Use-cases: Vector search, retrieval augmented generation, long document clustering, sentiment analysis.

Extended context length: This model uniquely support an 8K context length, enabling them to process and understand larger chunks of data in a single pass, resulting in richer embeddings and more accurate predictions.

Model size: 137M parameters.

High performance over tasks across the board: Our model ranks amongst the top performing ones on HuggingFace’s MTEB leaderboard for embedding models - especially considering its small size and extended context length.
The backbone of this model was pretrained on the C4 dataset. This model is further trained on Jina AI's collection of more than 400 millions of sentence pairs and hard negatives. These pairs were obtained from various domains and were carefully selected through a thorough cleaning process.

Not quite sure what you’re looking for? AWS Marketplace can help you find the right solution for your use case. Contact us

Pricing Information

Use this tool to estimate the software and infrastructure costs based your configuration choices. Your usage and costs might be different from this estimate. They will be reflected on your monthly AWS billing reports.

Estimating your costs

Choose your region and launch option to see the pricing details. Then, modify the estimated price by choosing different instance types.

Version

Region

Software Pricing

Model Realtime Inference$2.50/hr

running on ml.g5.xlarge

Model Batch Transform$1.50/hr

running on ml.g4dn.xlarge

Infrastructure Pricing
With Amazon SageMaker, you pay only for what you use. Training and inference is billed by the second, with no minimum fees and no upfront commitments. Pricing within Amazon SageMaker is broken down by on-demand ML instances, ML storage, and fees for data processing in notebooks and inference instances.
Learn more about SageMaker pricing

SageMaker Realtime Inference$1.408/host/hr

running on ml.g5.xlarge

SageMaker Batch Transform$0.736/host/hr

running on ml.g4dn.xlarge

Model Realtime Inference

For model deployment as Real-time endpoint in Amazon SageMaker, the software is priced based on hourly pricing that can vary by instance type. Additional infrastructure cost, taxes or fees may apply.

	InstanceType	Realtime Inference/hr
	ml.p2.xlarge	$2.30
	ml.g4dn.4xlarge	$4.00
	ml.g4dn.16xlarge	$14.50
	ml.p2.16xlarge	$35.00
	ml.p3.16xlarge	$48.25
	ml.g5.xlarge Vendor Recommended	$2.50
	ml.g5.8xlarge	$18.25
	ml.g5.12xlarge	$27.50
	ml.g4dn.2xlarge	$2.20
	ml.g5.4xlarge	$9.30
	ml.g5.16xlarge	$35.00
	ml.p3.8xlarge	$25.00
	ml.p3.2xlarge	$7.00
	ml.p2.8xlarge	$18.00
	ml.g4dn.8xlarge	$7.60
	ml.g4dn.12xlarge	$11.25
	ml.g5.2xlarge	$4.75
	ml.g4dn.xlarge	$1.50
	ml.g5.48xlarge	$90.00
	ml.g5.24xlarge	$52.00

Model Batch Transform

For model deployment as Batch transform job in Amazon SageMaker, the software is priced based on hourly pricing that can vary by instance type. Additional infrastructure cost, taxes or fees may apply.

	InstanceType	Batch Transform/hr
	ml.p2.xlarge	$2.30
	ml.p3.8xlarge	$25.00
	ml.g4dn.4xlarge	$4.00
	ml.p3.2xlarge	$7.00
	ml.g4dn.16xlarge	$14.50
	ml.p2.8xlarge	$18.00
	ml.g4dn.8xlarge	$7.60
	ml.g4dn.12xlarge	$11.25
	ml.p2.16xlarge	$35.00
	ml.p3.16xlarge	$48.25
	ml.g4dn.xlarge Vendor Recommended	$1.50
	ml.g4dn.2xlarge	$2.20

Usage Information

Model input and output details

Input

Summary

The model accepts JSON inputs. Texts must be passed in the following format.

{ "data": [ {"text": "How is the weather today?"}, {"text": "What is the weather like today?"}, {"text": "What's the color of an orange?"} ] }

Input MIME type

text/csv

Sample input data

view data

Expand all input descriptions

Input data description

Field names

data, text

Description

An array of strings for the model to embed.

Required

Yes

Data type

FreeText

Output

Summary

A JSON object with an array of IDs and Embeddings

Output MIME type

text/csv

Sample output data

view data

Expand all output descriptions

Output data description

Field names

data, id

Description

ID of the text field that was embedded

Always returned

Yes

Data type

FreeText

Field names

data, embedding

Description

Embeddings array for the text of length 768

Always returned

Yes

Data type

Continuous (Float)

Range

Sample notebook

Sample notebook link

Git repository link

Additional Resources

Technical Report

HuggingFace Model Card

Jina AI Embeddings Homepage

End User License Agreement

By subscribing to this product you agree to terms and conditions outlined in the product End user License Agreement (EULA)

Support Information

Jina Embeddings v2 Base - en

We provide support for this model package through our enterprise support channel.

https://jina.ai/contact-sales

AWS Infrastructure

AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

Learn More

Refund Policy

Refunds to be processed under the conditions specified in EULA.

Customer Reviews

There are currently no reviews for this product.

View all

Write a review

Share your thoughts about this product.

Write a customer review

Select your cookie preferences

Customize cookie preferences

Essential

Performance

Functional

Advertising

Unable to save cookie preferences

Amazon Sagemaker

Jina Embeddings v2 Base - en

Product Overview

Key Data

Highlights

Pricing Information

Estimating your costs

Version

Region

Software Pricing

Model Realtime Inference

Model Batch Transform

Annual contract

Model Realtime Inference

Usage Information

Model input and output details

Input

Output

Sample notebook

Additional Resources

End User License Agreement

Support Information

Jina Embeddings v2 Base - en

AWS Infrastructure

Refund Policy

Customer Reviews

Write a review