Update README.md
Browse files
README.md
CHANGED
|
@@ -201,20 +201,6 @@ messages = [
|
|
| 201 |
| 📦 **Dataset** | 923 train / 102 eval samples |
|
| 202 |
| ⏱️ **Duration** | 40.0 minutes |
|
| 203 |
|
| 204 |
-
### Hyperparameters
|
| 205 |
-
|
| 206 |
-
| Parameter | Value |
|
| 207 |
-
|---|---|
|
| 208 |
-
| LoRA Rank / Alpha | 16 / 32 |
|
| 209 |
-
| LoRA Dropout | 0.10 |
|
| 210 |
-
| Target Modules | q, k, v, o, gate, up, down proj |
|
| 211 |
-
| Learning Rate | 5e-6 (cosine scheduler) |
|
| 212 |
-
| Epochs | 3 |
|
| 213 |
-
| Effective Batch Size | 4 (1 × 4 accum) |
|
| 214 |
-
| Max Sequence Length | 4096 |
|
| 215 |
-
| NEFTune Alpha | 5.0 |
|
| 216 |
-
| Warmup Ratio | 0.05 |
|
| 217 |
-
|
| 218 |
### 📉 Training Metrics
|
| 219 |
|
| 220 |
| Metric | Value |
|
|
|
|
| 201 |
| 📦 **Dataset** | 923 train / 102 eval samples |
|
| 202 |
| ⏱️ **Duration** | 40.0 minutes |
|
| 203 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 204 |
### 📉 Training Metrics
|
| 205 |
|
| 206 |
| Metric | Value |
|