hgissbkh commited on
Commit
ff69d25
·
verified ·
1 Parent(s): 68d9058

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -0
README.md CHANGED
@@ -1,3 +1,13 @@
 
 
 
 
 
 
 
 
 
 
1
  # BERT-as-a-Judge: A Robust Alternative for LLM Evaluation
2
 
3
  BERT-as-a-Judge is a family of encoder-based models designed for efficient, reference-based evaluation of LLM outputs. By moving beyond rigid lexical matching (like Exact Match or ROUGE), these models assess **semantic correctness**, allowing for variations in phrasing and formatting while maintaining a fraction of the computational cost of LLM-as-a-Judge approaches.
 
1
+ ---
2
+ datasets:
3
+ - hgissbkh/BERTJudge-Dataset-Train
4
+ language:
5
+ - en
6
+ metrics:
7
+ - accuracy
8
+ base_model:
9
+ - EuroBERT/EuroBERT-210m
10
+ ---
11
  # BERT-as-a-Judge: A Robust Alternative for LLM Evaluation
12
 
13
  BERT-as-a-Judge is a family of encoder-based models designed for efficient, reference-based evaluation of LLM outputs. By moving beyond rigid lexical matching (like Exact Match or ROUGE), these models assess **semantic correctness**, allowing for variations in phrasing and formatting while maintaining a fraction of the computational cost of LLM-as-a-Judge approaches.