MedGUIDE/ClinicalRewardModel-Qwen2_5-7B


MedGUIDE committed on May 15, 2025

Commit b918f76 (verified) · 1 Parent(s): f4b3bd3

Create README.md

Files changed (1)

README.md +11 -0

README.md ADDED

	@@ -0,0 +1,11 @@

+# ClinicalRewardModel-Qwen2_5-7B
+
+This is a 5-head reward model fine-tuned on clinical decision-making data using the [Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) backbone. It is trained to evaluate clinical vignette–based multiple-choice questions across five expert-defined clinical criteria:
+
+- Clinical Plausibility
+- Clinical Utility
+- Quality of Decision Path
+- Alignment to Decision Path
+- Correctness of the Suggested Answer
+
+Each head independently scores one dimension on a 1–5 Likert scale. This model supports fine-grained quality filtering for guideline-based QA datasets such as [MedGUIDE-MCQA-8K](https://huggingface.co/datasets/MedGUIDE/MedGUIDE-MCQA-8K).
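
Note (not part of the commit): the README describes per-criterion scoring and quality filtering but does not show how the five scores are consumed. The following is a minimal sketch of one possible filtering rule, assuming the five head outputs have already been collected into a dictionary per sample. The key names in `CRITERIA`, the `min_score` threshold of 4.0, and the helper functions are all hypothetical illustrations; none of them come from the model card or from the model's actual interface.

```python
from typing import Dict, List

# Hypothetical keys for the five criteria listed in the README.
# The real model may name or order its heads differently.
CRITERIA = (
    "clinical_plausibility",
    "clinical_utility",
    "decision_path_quality",
    "decision_path_alignment",
    "answer_correctness",
)


def passes_filter(scores: Dict[str, float], min_score: float = 4.0) -> bool:
    """Return True if every criterion scores at least `min_score`.

    `scores` maps each criterion name to a value on the 1-5 Likert scale.
    The "all criteria must pass" rule and the default threshold are
    assumptions made for this sketch.
    """
    missing = [name for name in CRITERIA if name not in scores]
    if missing:
        raise KeyError(f"missing criteria: {missing}")
    for name in CRITERIA:
        if not 1.0 <= scores[name] <= 5.0:
            raise ValueError(f"{name} is outside the 1-5 range: {scores[name]}")
    return all(scores[name] >= min_score for name in CRITERIA)


def filter_samples(samples: List[dict], min_score: float = 4.0) -> List[dict]:
    """Keep only samples whose `scores` entry passes the filter."""
    return [s for s in samples if passes_filter(s["scores"], min_score)]


# Made-up scores, used only to exercise the functions above.
samples = [
    {"id": "q1", "scores": dict.fromkeys(CRITERIA, 5.0)},
    {"id": "q2", "scores": {**dict.fromkeys(CRITERIA, 5.0), "clinical_utility": 2.0}},
]
kept = filter_samples(samples)
```

Requiring every criterion to clear the threshold is the strictest choice. A pipeline could instead threshold a mean score or weight the criteria differently, and the README does not say which rule was used for MedGUIDE-MCQA-8K.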