Update README.md
Browse files
README.md
CHANGED
|
@@ -1,6 +1,6 @@
|
|
| 1 |
---
|
| 2 |
language: en
|
| 3 |
-
license:
|
| 4 |
tags:
|
| 5 |
- dora
|
| 6 |
- peft
|
|
@@ -12,6 +12,10 @@ tags:
|
|
| 12 |
- healthcare
|
| 13 |
base_model:
|
| 14 |
- Qwen/Qwen3-32B
|
|
|
|
|
|
|
|
|
|
|
|
|
| 15 |
---
|
| 16 |
|
| 17 |
# Gazal-R1-32B-sft-merged-preview
|
|
@@ -82,6 +86,17 @@ response = tokenizer.decode(outputs[0][inputs.input_ids.shape[1]:], skip_special
|
|
| 82 |
print(response)
|
| 83 |
```
|
| 84 |
|
| 85 |
-
##
|
| 86 |
|
| 87 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
language: en
|
| 3 |
+
license: apache-2.0
|
| 4 |
tags:
|
| 5 |
- dora
|
| 6 |
- peft
|
|
|
|
| 12 |
- healthcare
|
| 13 |
base_model:
|
| 14 |
- Qwen/Qwen3-32B
|
| 15 |
+
datasets:
|
| 16 |
+
- TachyHealth/structured_medical
|
| 17 |
+
pipeline_tag: text-generation
|
| 18 |
+
library_name: transformers
|
| 19 |
---
|
| 20 |
|
| 21 |
# Gazal-R1-32B-sft-merged-preview
|
|
|
|
| 86 |
print(response)
|
| 87 |
```
|
| 88 |
|
| 89 |
+
## Performance Results
|
| 90 |
|
| 91 |
+
Gazal-R1 achieves exceptional performance across standard medical benchmarks:
|
| 92 |
+
|
| 93 |
+
| Model | Size | MMLU Pro (Medical) | MedMCQA | MedQA | PubMedQA |
|
| 94 |
+
|-------|------|-------------------|---------|-------|----------|
|
| 95 |
+
| [**Gazal-R1 (Final)**](https://huggingface.co/TachyHealth/Gazal-R1-32B-GRPO-preview) | **32B** | **81.6** | **71.9** | **87.1** | **79.6** |
|
| 96 |
+
| Gazal-R1 (SFT-only) | 32B | 79.3 | 72.3 | 86.9 | 77.6 |
|
| 97 |
+
| Llama 3.1 405B Instruct | 405B | 70.2 | 75.8 | 81.9 | 74.6 |
|
| 98 |
+
| Qwen 2.5 72B Instruct | 72B | 72.1 | 66.2 | 72.7 | 71.7 |
|
| 99 |
+
| Med42-Llama3.1-70B | 70B | 66.1 | 72.4 | 80.4 | 77.6 |
|
| 100 |
+
| Llama 3.1 70B Instruct | 70B | 74.5 | 72.5 | 78.4 | 78.5 |
|
| 101 |
+
| QwQ 32B | 32B | 70.1 | 65.6 | 72.3 | 73.7 |
|
| 102 |
+
| Qwen 3 32B | 32B | 78.4 | 71.6 | 84.4 | 76.7 |
|