Update README.md
Browse files
README.md
CHANGED
|
@@ -39,7 +39,7 @@ curriculum-informed reinforcement learning framework to enhance logical correctn
|
|
| 39 |
## Model Details
|
| 40 |
|
| 41 |
CURE-MED-14B is part of the CURE-MED family of models, designed to address the challenges of multilingual medical reasoning in large language models (LLMs).
|
| 42 |
-
Built on the Qwen2.5-14B
|
| 43 |
and Group Relative Policy Optimization (GRPO) to improve performance on open-ended medical queries across 13 languages, including underrepresented ones such as Amharic, Yoruba, and Swahili.
|
| 44 |
The model is trained and evaluated using CUREMED-BENCH, a high-quality multilingual open-ended medical reasoning benchmark with single verifiable answers.
|
| 45 |
|
|
|
|
| 39 |
## Model Details
|
| 40 |
|
| 41 |
CURE-MED-14B is part of the CURE-MED family of models, designed to address the challenges of multilingual medical reasoning in large language models (LLMs).
|
| 42 |
+
Built on the Qwen/Qwen2.5-14B-Instruct model, it incorporates a curriculum-informed reinforcement learning approach that integrates code-switching-aware supervised fine-tuning (SFT)
|
| 43 |
and Group Relative Policy Optimization (GRPO) to improve performance on open-ended medical queries across 13 languages, including underrepresented ones such as Amharic, Yoruba, and Swahili.
|
| 44 |
The model is trained and evaluated using CUREMED-BENCH, a high-quality multilingual open-ended medical reasoning benchmark with single verifiable answers.
|
| 45 |
|