Update README.md
Browse files
README.md
CHANGED
|
@@ -191,20 +191,6 @@ messages = [
|
|
| 191 |
| 📦 **Dataset** | 923 train / 102 eval samples |
|
| 192 |
| ⏱️ **Duration** | 11.9 minutes |
|
| 193 |
|
| 194 |
-
### Hyperparameters
|
| 195 |
-
|
| 196 |
-
| Parameter | Value |
|
| 197 |
-
|---|---|
|
| 198 |
-
| LoRA Rank / Alpha | 16 / 32 |
|
| 199 |
-
| LoRA Dropout | 0.10 |
|
| 200 |
-
| Target Modules | q, k, v, o, gate, up, down proj |
|
| 201 |
-
| Learning Rate | 5e-6 (cosine scheduler) |
|
| 202 |
-
| Epochs | 3 |
|
| 203 |
-
| Effective Batch Size | 4 (2 × 2 accum) |
|
| 204 |
-
| Max Sequence Length | 4096 |
|
| 205 |
-
| NEFTune Alpha | 5.0 |
|
| 206 |
-
| Warmup Ratio | 0.05 |
|
| 207 |
-
|
| 208 |
### 📉 Training Metrics
|
| 209 |
|
| 210 |
| Metric | Value |
|
|
|
|
| 191 |
| 📦 **Dataset** | 923 train / 102 eval samples |
|
| 192 |
| ⏱️ **Duration** | 11.9 minutes |
|
| 193 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 194 |
### 📉 Training Metrics
|
| 195 |
|
| 196 |
| Metric | Value |
|