Update README.md
README.md
CHANGED
@@ -25,7 +25,7 @@ The model takes **building parameters** (length, width, height, and window-to-wa
 
 ## 📚 Training Details
 - **Stage 1 (Supervised Finetuning):** 1500 samples of building descriptions → IDF mappings.
-- **Stage 2 (Reasoning Finetuning):**
+- **Stage 2 (Reasoning Finetuning):** 2500 reasoning-focused samples to improve logical consistency in geometry-to-IDF translation.
 - **Base Model:** `unsloth/phi-4`
 - **Frameworks Used:** [Unsloth](https://github.com/unslothai/unsloth) + Hugging Face TRL
 - **Training Speed:** ~2× faster compared to standard Hugging Face pipelines.