Update README.md
Browse files
README.md
CHANGED
|
@@ -19,8 +19,7 @@ base_model:
|
|
| 19 |
**Version:** v1.0 \
|
| 20 |
**Release Date:** October 23, 2025 \
|
| 21 |
**Base Model:** [Qwen/Qwen3-4B-Instruct-2507](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507) \
|
| 22 |
-
**Library:** 🤗 *Transformers*
|
| 23 |
-
|
| 24 |
|
| 25 |
|
| 26 |
**Purpose:**
|
|
@@ -63,6 +62,10 @@ Results here are preliminary and reflect internal benchmarking on the same task
|
|
| 63 |
|
| 64 |
---
|
| 65 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 66 |
### **License**
|
| 67 |
|
| 68 |
**MIT License** — free for research and non-commercial use with attribution.
|
|
|
|
| 19 |
**Version:** v1.0 \
|
| 20 |
**Release Date:** October 23, 2025 \
|
| 21 |
**Base Model:** [Qwen/Qwen3-4B-Instruct-2507](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507) \
|
| 22 |
+
**Library:** 🤗 *Transformers*
|
|
|
|
| 23 |
|
| 24 |
|
| 25 |
**Purpose:**
|
|
|
|
| 62 |
|
| 63 |
---
|
| 64 |
|
| 65 |
+
### Method
|
| 66 |
+
- GRPO
|
| 67 |
+
- Evol Merging
|
| 68 |
+
|
| 69 |
### **License**
|
| 70 |
|
| 71 |
**MIT License** — free for research and non-commercial use with attribution.
|