Update README.md
Browse files
README.md
CHANGED
|
@@ -9,7 +9,7 @@ base_model:
|
|
| 9 |
|
| 10 |
## Overview
|
| 11 |
|
| 12 |
-
This model is a fine-tuned version of the base model Qwen/Qwen2.5-Coder-14B-Instruct. It was trained on a subset of problems from the GAIR/LIMO dataset, specifically focusing on 611 problems over
|
| 13 |
|
| 14 |
After testing more I found that the model does not always include reasoning, I will update with more epochs.
|
| 15 |
|
|
@@ -19,7 +19,7 @@ Warning! The model often goes into an endless chain of reasoning.
|
|
| 19 |
|
| 20 |
- **Base Model**: Qwen/Qwen2.5-Coder-14B-Instruct
|
| 21 |
- **Dataset**: GAIR/LIMO (subset of 611 problems)
|
| 22 |
-
- **Epochs**:
|
| 23 |
- **Training Limitations**: The training was constrained by the computational resources available on my machine, which means I haven't yet conducted a thorough evaluation of the model's performance improvements.
|
| 24 |
|
| 25 |
## Key Observations
|
|
|
|
| 9 |
|
| 10 |
## Overview
|
| 11 |
|
| 12 |
+
This model is a fine-tuned version of the base model Qwen/Qwen2.5-Coder-14B-Instruct. It was trained on a subset of problems from the GAIR/LIMO dataset, specifically focusing on 611 problems over 11 training epochs.
|
| 13 |
|
| 14 |
After testing more I found that the model does not always include reasoning, I will update with more epochs.
|
| 15 |
|
|
|
|
| 19 |
|
| 20 |
- **Base Model**: Qwen/Qwen2.5-Coder-14B-Instruct
|
| 21 |
- **Dataset**: GAIR/LIMO (subset of 611 problems)
|
| 22 |
+
- **Epochs**: 11
|
| 23 |
- **Training Limitations**: The training was constrained by the computational resources available on my machine, which means I haven't yet conducted a thorough evaluation of the model's performance improvements.
|
| 24 |
|
| 25 |
## Key Observations
|