Create README.md
README.md
ADDED
@@ -0,0 +1,11 @@
# ClinicalRewardModel-Qwen2_5-7B

This is a 5-head reward model fine-tuned on clinical decision-making data using the [Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) backbone. It is trained to evaluate clinical vignette–based multiple-choice questions across five expert-defined clinical criteria:

- Clinical Plausibility
- Clinical Utility
- Quality of Decision Path
- Alignment to Decision Path
- Correctness of the Suggested Answer

Each head independently scores one dimension on a 1–5 Likert scale. This model supports fine-grained quality filtering for guideline-based QA datasets such as [MedGUIDE-MCQA-8K](https://huggingface.co/datasets/MedGUIDE/MedGUIDE-MCQA-8K).
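
As a rough illustration of the quality filtering described above, the sketch below keeps only items whose five per-criterion scores all meet a threshold. It does not load or call the model: the key names in `CRITERIA`, the `scores` field, the helper names, and the default threshold of 4 are all hypothetical choices made for this example, not the model's documented output format.

```python
from typing import Dict, List

# Criterion names follow the five criteria listed in this README; the
# snake_case keys are a hypothetical convention for this sketch only.
CRITERIA = [
    "clinical_plausibility",
    "clinical_utility",
    "quality_of_decision_path",
    "alignment_to_decision_path",
    "correctness_of_suggested_answer",
]


def passes_filter(scores: Dict[str, float], min_score: float = 4.0) -> bool:
    """Return True if every criterion is scored at or above min_score."""
    return all(scores[name] >= min_score for name in CRITERIA)


def filter_items(items: List[dict], min_score: float = 4.0) -> List[dict]:
    """Keep only items whose five head scores all meet the threshold.

    Each item is assumed to carry a "scores" dict with one 1-5 score
    per criterion, produced beforehand by the reward model.
    """
    return [item for item in items if passes_filter(item["scores"], min_score)]
```

A stricter or looser filter only needs a different `min_score`; a per-criterion threshold would be a natural extension if some criteria matter more than others for a given dataset.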