Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -1,6 +1,6 @@
 ---
 language: en
-license: agpl-3.0
 tags:
 - dora
 - peft
@@ -12,6 +12,10 @@ tags:
 - healthcare
 base_model:
 - Qwen/Qwen3-32B
 ---
 # Gazal-R1-32B-sft-merged-preview
@@ -82,6 +86,17 @@ response = tokenizer.decode(outputs[0][inputs.input_ids.shape[1]:], skip_special
 print(response)
 ```
-## Benchmarks
-TBA

 ---
 language: en
+license: apache-2.0
 tags:
 - dora
 - peft
 - healthcare
 base_model:
 - Qwen/Qwen3-32B
+datasets:
+- TachyHealth/structured_medical
+pipeline_tag: text-generation
+library_name: transformers
 ---
 # Gazal-R1-32B-sft-merged-preview
 print(response)
 ```
+## Performance Results
+Gazal-R1 achieves exceptional performance across standard medical benchmarks:
+| Model | Size | MMLU Pro (Medical) | MedMCQA | MedQA | PubMedQA |
+|-------|------|-------------------|---------|-------|----------|
+| [**Gazal-R1 (Final)**](https://huggingface.co/TachyHealth/Gazal-R1-32B-GRPO-preview) | **32B** | **81.6** | **71.9** | **87.1** | **79.6** |
+| Gazal-R1 (SFT-only) | 32B | 79.3 | 72.3 | 86.9 | 77.6 |
+| Llama 3.1 405B Instruct | 405B | 70.2 | 75.8 | 81.9 | 74.6 |
+| Qwen 2.5 72B Instruct | 72B | 72.1 | 66.2 | 72.7 | 71.7 |
+| Med42-Llama3.1-70B | 70B | 66.1 | 72.4 | 80.4 | 77.6 |
+| Llama 3.1 70B Instruct | 70B | 74.5 | 72.5 | 78.4 | 78.5 |
+| QwQ 32B | 32B | 70.1 | 65.6 | 72.3 | 73.7 |
+| Qwen 3 32B | 32B | 78.4 | 71.6 | 84.4 | 76.7 |