Update README.md

Models follow a standardized naming structure: `BERTJudge-<Candidate_Format>-<Instruction_Format>`.
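As an illustration of how the two format fields slot into a model id, here is a minimal sketch; `Free` and `QCR` are the values used by the checkpoint recommended below, and other field values are not enumerated here:

```python
# Construct a model id following the naming scheme
# BERTJudge-<Candidate_Format>-<Instruction_Format>.
# "Free" and "QCR" are the values used by the recommended checkpoint;
# treat them as example field values.
candidate_format = "Free"
instruction_format = "QCR"
model_id = f"BERTJudge-{candidate_format}-{instruction_format}"
print(model_id)  # BERTJudge-Free-QCR
```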
(Previous revision: the README demonstrated direct inference with `transformers` (`AutoTokenizer`, `AutoModelForSequenceClassification`) and `torch`, formatting each input as `f"<|question|>{question}<|candidate|>{candidate}<|reference|>{reference}"` before tokenization.)

## Intended Use

These models are sequence classifiers that output a sigmoid score indicating answer correctness. For inference, we recommend using the [BERT-as-a-Judge](https://github.com/artefactory/BERT-as-a-Judge) package. In general settings, we further recommend **BERTJudge-Free-QCR**, as it provides the strongest and most robust evaluation performance.

### Installation

```zsh
git clone https://github.com/artefactory/BERT-as-a-Judge.git
cd BERT-as-a-Judge
pip install -e .
```

### Usage

Example:

```python
from bert_judge.judges import BERTJudge

# 1) Initialize the judge
judge = BERTJudge(
    model_path="hgissbkh/BERTJudge-Free-QCR",
    trust_remote_code=True,
    dtype="bfloat16",
)

# 2) Define one question, one reference, and several candidate answers
question = "What is the capital of France?"
reference = "Paris"
candidates = [
    "Paris.",
    "The capital of France is Paris.",
    "I'm hesitating between Paris and London. I would say Paris.",
    "London.",
    "The capital of France is London.",
    "I'm hesitating between Paris and London. I would say London.",
]

# 3) Predict scores (one score per candidate)
scores = judge.predict(
    questions=[question] * len(candidates),
    references=[reference] * len(candidates),
    candidates=candidates,
    batch_size=1,
)

print(scores)
```
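Since the judge returns one sigmoid score in [0, 1] per candidate, one simple way to consume them is to threshold at 0.5 for binary verdicts and average the verdicts into an accuracy figure. A minimal sketch with made-up scores (illustrative values, not actual model output):

```python
# Illustrative scores, one per candidate above (NOT real model output).
scores = [0.98, 0.95, 0.80, 0.04, 0.02, 0.10]

# Threshold the sigmoid scores at 0.5 to obtain binary correctness verdicts.
verdicts = [s >= 0.5 for s in scores]

# Fraction of candidates judged correct.
accuracy = sum(verdicts) / len(verdicts)
print(verdicts)   # [True, True, True, False, False, False]
print(accuracy)   # 0.5
```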

## Citation