Update README.md

README.md CHANGED
---
library_name: peft
language:
- pt
---

## Training procedure

The following `bitsandbytes` quantization config was used during training:

### Framework versions

- PEFT 0.4.0
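This commit does not show the actual quantization values, nor does it name the base checkpoint. As a loading sketch only, assuming a 4-bit NF4 setup (suggested by the repository name, not recorded here) and a Mistral-7B-Instruct base:

```
# Illustrative loading sketch; every value marked "assumption" below is
# not taken from this README.
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                     # assumption: repo name suggests 4-bit
    bnb_4bit_quant_type="nf4",             # assumption: common default
    bnb_4bit_compute_dtype=torch.float16,  # assumption: matches fp16 training below
)

base_id = "mistralai/Mistral-7B-Instruct-v0.1"  # assumption: base model not named in this diff
base_model = AutoModelForCausalLM.from_pretrained(base_id, quantization_config=bnb_config)
tokenizer = AutoTokenizer.from_pretrained(base_id)

# Attach the LoRA adapter stored in this repository.
model = PeftModel.from_pretrained(base_model, "Weni/WeniGPT-Mistral-7B-instructBase-4bit")
```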
This model was trained with the following parameters:

```
max_seq_length = 2048

training_arguments_mistral = {
    'num_train_epochs': 10,
    'per_device_train_batch_size': 2,
    'gradient_accumulation_steps': 2,
    'gradient_checkpointing': True,
    'optim': 'adamw_torch',
    'lr_scheduler_type': 'constant_with_warmup',
    'logging_steps': 10,
    'evaluation_strategy': 'epoch',
    'save_strategy': 'epoch',
    'load_best_model_at_end': True,
    'learning_rate': 4e-4,
    'save_total_limit': 3,
    'fp16': True,
    'tf32': True,
    'max_steps': 8000,
    'max_grad_norm': 0.3,
    'warmup_ratio': 0.03,
    'disable_tqdm': False,
    'weight_decay': 0.001,
    'hub_model_id': 'Weni/WeniGPT-Mistral-7B-instructBase-4bit',
    'push_to_hub': True,
    'hub_strategy': 'every_save',
    'hub_token': token,  # `token` is a Hugging Face access token defined in the training script
    'hub_private_repo': True,
}
```
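The training script itself is not part of this commit. Below is a minimal sketch of how these values could be wired together, assuming TRL's `SFTTrainer` and reusing `base_model` and `tokenizer` from the loading sketch above; the `output_dir`, LoRA hyperparameters, and placeholder dataset are assumptions, and `token` must already be in scope for the dict above to evaluate.

```
# Hypothetical wiring, not the recorded training script.
from datasets import Dataset
from peft import LoraConfig
from transformers import TrainingArguments
from trl import SFTTrainer

# Placeholder data; the real instruction dataset is not described in this commit.
train_dataset = Dataset.from_dict({"text": ["instrução de exemplo e resposta de exemplo"]})
eval_dataset = Dataset.from_dict({"text": ["exemplo de avaliação"]})

lora_config = LoraConfig(task_type="CAUSAL_LM")  # assumption: LoRA values not shown

args = TrainingArguments(
    output_dir="wenigpt-mistral-7b-4bit",  # assumption: not shown in the README
    **training_arguments_mistral,          # the dict from the block above
)

trainer = SFTTrainer(
    model=base_model,               # quantized base model from the loading sketch
    args=args,
    train_dataset=train_dataset,
    eval_dataset=eval_dataset,
    peft_config=lora_config,
    dataset_text_field="text",
    max_seq_length=max_seq_length,  # 2048, from the block above
    tokenizer=tokenizer,
)
trainer.train()
```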