lsmille
/

lora_evo_ta_all_layers_7

Generated from Trainer

Model card Files Files and versions

lsmille commited on May 28, 2024

Commit

a99884c

·

verified ·

1 Parent(s): bbab25b

Update README.md

Files changed (1) hide show

README.md +18 -2

README.md CHANGED Viewed

@@ -20,7 +20,23 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
 ## Intended uses & limitations
@@ -28,7 +44,7 @@ More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure

 ## Model description
+lora_alpha = 32
+lora_dropout = 0.05
+lora_r = 16
+epochs = 3
+learning rate = 3e-5 <--------- (10x smaller)
+warmup_steps=0.5
+gradient_accumulation_steps = 8
+train_batch = 1
+eval_batch = 1
 ## Intended uses & limitations
 ## Training and evaluation data
+in files
 ## Training procedure