
Overview
Llama-3-Varco-Offsetbias-8B is NCSOFT's generative judge model that performs pairwise preference evaluation task. It is trained to be robust on various evaluation biases commonly found in evaluation models, such as length bias, empty reference bias, and familiar knowledge bias. The methodology for training the model is introduced in paper "OffsetBias: Leveraging Debiased Data for Tuning Evaluators". Specifically, Llama-3-Varco-Offsetbias-8B is built with Meta Llama-3-8B-Instruct. It is fine-tuned on evaluation datasets for judging various capabilities including instruction following, safety, and dialogue. The model is trained to perform pairwise preference evaluation, where Instruction, Output (a), Output (b) are given, and a better output to the instruction needs to be found.
Highlights
- Llama-3-Varco-Offsetbias-8B is capable of judging which response is preferable for a given instruction, in terms of response quality, instruction following and safety.
- While LLMs deployed for evaluation tasks are vulnerable to evaluation biases such as length bias, our model is robust against the biases.
- The model is highly competitive in RewardBench leaderboard, given its size and model type.
Unlock automation with AI agent solutions

Features and programs
Financing for AWS Marketplace purchases
Pricing
Vendor refund policy
This product is offered for free. If there are any questions, please contact us for further clarifications.
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Amazon SageMaker model
An Amazon SageMaker model package is a pre-trained machine learning model ready to use without additional training. Use the model package to create a model on Amazon SageMaker for real-time inference or batch processing. Amazon SageMaker is a fully managed platform for building, training, and deploying machine learning models at scale.
Version release notes
Bedrock inferface update
Additional details
Inputs
- Summary
Model accepts JSON requests. You can check examples and fields descriptions. For the details, please check sample notebook file.
- Input MIME type
- application/json
Input data descriptions
The following table describes supported input data fields for real-time inference and batch transform.
Field name | Description | Constraints | Required |
|---|---|---|---|
instruction | instruction that expects a response | Type: FreeText | Yes |
output_a | Candidate response for instruction | Type: FreeText | Yes |
output_b | Candidate response for instruction | Type: FreeText | Yes |
Resources
Vendor resources
Support
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.