MwSpace committed on
Commit 040a719 · verified · 1 Parent(s): b09fd4c

Update README.md

Files changed (1)
  1. README.md +0 -13
README.md CHANGED
@@ -204,19 +204,6 @@ messages = [
  | 📦 **Dataset** | 923 train / 102 eval samples |
  | ⏱️ **Duration** | 13.2 minutes |
 
- ### Hyperparameters
-
- | Parameter | Value |
- |---|---|
- | LoRA Rank / Alpha | 16 / 32 |
- | LoRA Dropout | 0.10 |
- | Target Modules | q, k, v, o, gate, up, down proj |
- | Learning Rate | 5e-6 (cosine scheduler) |
- | Epochs | 3 |
- | Effective Batch Size | 4 (2 × 2 accum) |
- | Max Sequence Length | 4096 |
- | NEFTune Alpha | 5.0 |
- | Warmup Ratio | 0.05 |
 
  ### 📉 Training Metrics
 