# ToT-Reasoner-Qwen3-1.7B
## Model Description
Fine-tuned `ziadrone/oneplusaries1` using Supervised Fine-Tuning (SFT) on `open-r1/Mixture-of-Thoughts` (math split). Optimized for mathematical reasoning.
## Training Data
- **Source**: `open-r1/Mixture-of-Thoughts` (math split, up to 50 samples).
- **Format**: Prompts with `......` structure.
## Fine-Tuning Process
- **Method**: SFT with learning rate=1e-5, 3 epochs, batch size=1.
- **Setup**: Google Colab Pro with T4 GPU.
## Usage