EdBerg/Baha_2A

Text Generation

EdBerg committed on Oct 1, 2024

Commit f5ec921 · verified · 1 parent: ca9c4b4

End of training

Files changed (1)

README.md +3 -3

@@ -1,5 +1,5 @@
 ---
-base_model: meta-llama/Llama-3.2-1B-Instruct
+base_model: meta-llama/Llama-3.2-3B-Instruct
 library_name: peft
 license: llama3.2
 tags:
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 # Baha_2A
-This model is a fine-tuned version of [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct) on an unknown dataset.
+This model is a fine-tuned version of [meta-llama/Llama-3.2-3B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct) on an unknown dataset.
 ## Model description
@@ -44,7 +44,7 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.03
-- training_steps: 180
+- training_steps: 500
 - mixed_precision_training: Native AMP
 ### Training results
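
Changing `training_steps` from 180 to 500 while leaving `lr_scheduler_warmup_ratio` at 0.03 also changes how many steps the warmup lasts. The sketch below illustrates that effect. It is not the training code behind this commit: the function names `warmup_steps` and `lr_multiplier` are hypothetical, and it assumes the warmup length is the ratio times the total steps rounded up, followed by a standard half-cosine decay to zero. The base learning rate is not visible in this diff, so only the multiplier applied to it is computed.

```python
import math


def warmup_steps(total_steps: int, warmup_ratio: float) -> int:
    # Assumption: warmup length = ratio * total steps, rounded up.
    return math.ceil(total_steps * warmup_ratio)


def lr_multiplier(step: int, total_steps: int, warmup: int) -> float:
    # Linear ramp from 0 to 1 during warmup.
    if step < warmup:
        return step / max(1, warmup)
    # Half-cosine decay from 1 to 0 over the remaining steps.
    progress = (step - warmup) / max(1, total_steps - warmup)
    return max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))


if __name__ == "__main__":
    # Compare the old (180) and new (500) values of training_steps.
    for total in (180, 500):
        w = warmup_steps(total, 0.03)
        samples = [round(lr_multiplier(s, total, w), 3) for s in (0, w, total // 2, total)]
        print(f"training_steps={total} warmup_steps={w} multipliers={samples}")
```

Under these assumptions the warmup grows from 6 steps to 15, and the cosine decay is stretched over the longer run.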