Update model card
README.md
CHANGED
````diff
@@ -2,7 +2,6 @@
 license: llama3.1
 language:
 - en
-pipeline_tag: text-classification
 tags:
 - reward-model
 - RLHF
@@ -31,12 +30,6 @@ Binary-Think-RM addresses limitations of conventional reward models by incorpora
 To evaluate the model, please use the following prompt template:
 
 ```python
-from transformers import AutoTokenizer, AutoModelForCausalLM
-
-model_name = "ilgee/Binary-Think-RM-8B"
-tokenizer = AutoTokenizer.from_pretrained(model_name)
-model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto")
-
 system_msg = (
     "You are an impartial judge, tasked with evaluating the quality of the two AI assistants' responses to the context displayed below. "
     "Your evaluation should be based on the following six criteria:\n\n"
````