real-jiakai/SmolLM3-3B-MathReason

Text Generation

real-jiakai committed on Jan 10

Commit f40855a · verified · 1 Parent(s): 2958745

Update README.md

Files changed (1)

README.md +4 -0

@@ -22,8 +22,11 @@ A math-focused fine-tuned version of SmolLM3-3B, optimized for step-by-step math
 ## Highlights
 📚 **Math-First**: Trained on ~7K high-quality math and reasoning samples
 🧠 **Chain-of-Thought**: Supports `/think` mode for detailed reasoning
 ⚡ **Lightweight**: 3B parameters, runs on consumer GPUs
 ## Training Details
 | Parameter | Value |
@@ -37,6 +40,7 @@ A math-focused fine-tuned version of SmolLM3-3B, optimized for step-by-step math
 | Epochs | 2 |
 | Learning Rate | 2e-4 (cosine) |
 | Effective Batch Size | 16 |
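
The rows above imply a rough training budget that can be worked out by hand. The following is a minimal sketch, not the author's training script: it assumes the "~7K" samples are exactly 7000, that the cosine schedule has no warmup and decays to zero, and that the final partial batch of each epoch still counts as one optimizer step. The helper names `total_optimizer_steps` and `cosine_lr` are hypothetical. How the effective batch size of 16 is split between per-device batch size, gradient accumulation, and device count is not stated in the README, so the sketch uses only the product.

```python
import math

# Values taken from the Training Details table.
EPOCHS = 2
PEAK_LR = 2e-4
EFFECTIVE_BATCH_SIZE = 16

# Assumption: "~7K" samples is rounded to exactly 7000 for illustration.
NUM_SAMPLES = 7000


def total_optimizer_steps(num_samples: int, epochs: int, effective_batch: int) -> int:
    """Optimizer updates over the run, counting a partial final batch per epoch."""
    return math.ceil(num_samples / effective_batch) * epochs


def cosine_lr(step: int, total_steps: int, peak_lr: float, min_lr: float = 0.0) -> float:
    """Cosine decay from peak_lr at step 0 to min_lr at total_steps (no warmup assumed)."""
    progress = min(max(step / total_steps, 0.0), 1.0)
    return min_lr + 0.5 * (peak_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


steps = total_optimizer_steps(NUM_SAMPLES, EPOCHS, EFFECTIVE_BATCH_SIZE)
for step in (0, steps // 2, steps):
    print(f"step {step:4d}: lr = {cosine_lr(step, steps, PEAK_LR):.2e}")
```

Under these assumptions the run is on the order of a few hundred optimizer steps per epoch, with the learning rate at half its peak at the midpoint of training. A real run with warmup or a nonzero floor would shift these values.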
 ## Usage
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
