Fix Highlights accuracy to match results table (macro 86.3, micro 84.4)

#2
by dipta007 - opened
Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -33,10 +33,10 @@ tags:
 
 ## Highlights
 
-- **84.5% micro-average balanced accuracy** across 9 in-domain claim-verification benchmarks (sample-weighted)
-- **84.6% macro-average balanced accuracy** across the same 9 benchmarks
+- **84.4% micro-average balanced accuracy** across 9 in-domain claim-verification benchmarks (sample-weighted)
+- **86.3% macro-average balanced accuracy** across the same 9 benchmarks
 - Out-of-domain: **60.2% balanced accuracy on Coverbench**, **77.0% on LLM-AggreFact**
-- Strong on long-form evidence: 87% on Ex-FEVER, 92% on FEVEROUS, 76% on HoVer
+- Strong on long-form evidence: 88% on Ex-FEVER, 93% on FEVEROUS, 76% on HoVer
 - Reasoning is **fully transparent**: the model emits its sub-claim checklist, every question it asked, every quote from evidence, and a final label
 
 ## Model Overview
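
For reviewers checking the Highlights numbers against the results table, the two averages can be recomputed with a few lines of Python. This is a minimal sketch, assuming "micro-average" means each benchmark's balanced accuracy weighted by its sample count (as the "(sample-weighted)" note suggests) and "macro-average" means the unweighted mean over benchmarks. The function names and the three-benchmark numbers below are hypothetical illustration values, not figures from the README.

```python
from typing import Sequence


def macro_average(scores: Sequence[float]) -> float:
    """Unweighted mean: every benchmark counts equally."""
    return sum(scores) / len(scores)


def micro_average(scores: Sequence[float], counts: Sequence[int]) -> float:
    """Sample-weighted mean: larger benchmarks count more (assumed definition)."""
    if len(scores) != len(counts):
        raise ValueError("scores and counts must have the same length")
    total = sum(counts)
    return sum(s * n for s, n in zip(scores, counts)) / total


# Hypothetical per-benchmark balanced accuracies (%) and sample counts.
example_scores = [90.0, 80.0, 70.0]
example_counts = [100, 300, 600]

macro = macro_average(example_scores)
micro = micro_average(example_scores, example_counts)
print(f"macro={macro:.1f} micro={micro:.1f}")
```

Under these assumptions the two averages diverge whenever benchmark sizes differ, which is why the micro and macro figures in Highlights need not match each other.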