MedGUIDE committed on
Commit b918f76 · verified · 1 Parent(s): f4b3bd3

Create README.md

Files changed (1)
  1. README.md +11 -0
README.md ADDED
@@ -0,0 +1,11 @@
+ # ClinicalRewardModel-Qwen2_5-7B
+
+ This is a 5-head reward model fine-tuned on clinical decision-making data using the [Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) backbone. It is trained to evaluate clinical vignette–based multiple-choice questions across five expert-defined clinical criteria:
+
+ - Clinical Plausibility
+ - Clinical Utility
+ - Quality of Decision Path
+ - Alignment to Decision Path
+ - Correctness of the Suggested Answer
+
+ Each head independently scores one dimension on a 1–5 Likert scale. This model supports fine-grained quality filtering for guideline-based QA datasets such as [MedGUIDE-MCQA-8K](https://huggingface.co/datasets/MedGUIDE/MedGUIDE-MCQA-8K).
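
The README does not say how the five scores are exposed or which cutoff is used for filtering, so the following is only a minimal sketch of what per-criterion quality filtering could look like once the scores have been obtained. The dictionary keys, the `passes_filter` and `filter_items` helpers, and the default threshold of 4 are all hypothetical names and values chosen for illustration; only the five criteria and the 1–5 range come from the README.

```python
from typing import Dict, List

# The five criteria listed in the README. The snake_case keys are
# hypothetical; the model's actual output names are not documented here.
CRITERIA = [
    "clinical_plausibility",
    "clinical_utility",
    "quality_of_decision_path",
    "alignment_to_decision_path",
    "correctness_of_suggested_answer",
]


def passes_filter(scores: Dict[str, float], min_score: float = 4.0) -> bool:
    """Return True if every criterion score is at least min_score.

    min_score=4.0 is an assumed threshold, not one stated in the README.
    """
    for name in CRITERIA:
        value = scores[name]
        # Scores are expected on the 1-5 Likert scale described in the README.
        if not 1.0 <= value <= 5.0:
            raise ValueError(f"{name} score {value} is outside the 1-5 range")
        if value < min_score:
            return False
    return True


def filter_items(items: List[dict], min_score: float = 4.0) -> List[dict]:
    """Keep only items whose 'scores' dict passes on all five criteria."""
    return [item for item in items if passes_filter(item["scores"], min_score)]
```

Requiring every head to clear the threshold is one possible policy; averaging the five scores or weighting correctness more heavily would be equally plausible choices under different assumptions.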