Update README.md
Browse files
README.md
CHANGED
|
@@ -18,5 +18,7 @@ datasets:
|
|
| 18 |
# Qwen3-MATH-R1-4B
|
| 19 |
## Model Description
|
| 20 |
This is a fine-tuned version of [Qwen/Qwen3-4B-Thinking-2507](https://huggingface.co/Qwen/Qwen3-4B-Thinking-2507) on parts of the [nvidia/OpenMathReasoning](https://huggingface.co/datasets/nvidia/OpenMathReasoning) dataset which was used to win the [AIMO](https://www.kaggle.com/competitions/ai-mathematical-olympiad-progress-prize-2/leaderboard) (AI Mathematical Olympiad) challenge!
|
| 21 |
-
|
|
|
|
|
|
|
| 22 |
- **Finetuned from model :** Qwen/Qwen3-4B-Thinking-2507
|
|
|
|
| 18 |
# Qwen3-MATH-R1-4B
|
| 19 |
## Model Description
|
| 20 |
This is a fine-tuned version of [Qwen/Qwen3-4B-Thinking-2507](https://huggingface.co/Qwen/Qwen3-4B-Thinking-2507) on parts of the [nvidia/OpenMathReasoning](https://huggingface.co/datasets/nvidia/OpenMathReasoning) dataset which was used to win the [AIMO](https://www.kaggle.com/competitions/ai-mathematical-olympiad-progress-prize-2/leaderboard) (AI Mathematical Olympiad) challenge!
|
| 21 |
+
-**recommended settings for instruct inference:** temperature = 0.7, top_p = 0.8, top_k = 20
|
| 22 |
+
-**For reasoning chat based inference :** temperature = 0.6, top_p = 0.95, top_k = 20
|
| 23 |
+
- **License :** apache-2.0
|
| 24 |
- **Finetuned from model :** Qwen/Qwen3-4B-Thinking-2507
|