MwSpace commited on
Commit
e77c63e
·
verified ·
1 Parent(s): a0be32e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -14
README.md CHANGED
@@ -201,20 +201,6 @@ messages = [
201
  | 📦 **Dataset** | 923 train / 102 eval samples |
202
  | ⏱️ **Duration** | 40.0 minutes |
203
 
204
- ### Hyperparameters
205
-
206
- | Parameter | Value |
207
- |---|---|
208
- | LoRA Rank / Alpha | 16 / 32 |
209
- | LoRA Dropout | 0.10 |
210
- | Target Modules | q, k, v, o, gate, up, down proj |
211
- | Learning Rate | 5e-6 (cosine scheduler) |
212
- | Epochs | 3 |
213
- | Effective Batch Size | 4 (1 × 4 accum) |
214
- | Max Sequence Length | 4096 |
215
- | NEFTune Alpha | 5.0 |
216
- | Warmup Ratio | 0.05 |
217
-
218
  ### 📉 Training Metrics
219
 
220
  | Metric | Value |
 
201
  | 📦 **Dataset** | 923 train / 102 eval samples |
202
  | ⏱️ **Duration** | 40.0 minutes |
203
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
204
  ### 📉 Training Metrics
205
 
206
  | Metric | Value |