decryptellix/Llama-3.1-8B-CP-Test

Udith-Sandaruwan committed on May 5, 2025

Commit 605bc91 · verified · 1 Parent(s): 8d694cd

Model save

Files changed (1)

README.md +3 -6

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3.1-8B](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.7161
+- Loss: 2.5411
 ## Model description
@@ -43,17 +43,14 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 24
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: cosine
-- training_steps: 40
+- training_steps: 10
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 3.2668        | 0.0003 | 10   | 2.8281          |
-| 2.0234        | 0.0006 | 20   | 2.0007          |
-| 2.3921        | 0.0010 | 30   | 1.8118          |
-| 1.8744        | 0.0013 | 40   | 1.7161          |
+| 3.1171        | 0.0003 | 10   | 2.5411          |
 ### Framework versions
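
The diff keeps `lr_scheduler_type: cosine` while changing `training_steps` from 40 to 10, so the same step index falls at a different point on the decay curve in the two runs. The following is a minimal sketch of a half-cosine decay multiplier, not the Trainer's actual code: the function name `cosine_multiplier` is hypothetical, and `warmup_steps=0` is an assumption because no warmup setting is visible in these hunks.

```python
import math


def cosine_multiplier(step: int, total_steps: int, warmup_steps: int = 0) -> float:
    """Return the fraction of the base learning rate applied at `step`.

    Hypothetical helper: linear warmup (assumed 0 steps here) followed by
    a half-cosine decay from 1.0 down to 0.0 at `total_steps`.
    """
    if step < warmup_steps:
        return step / max(1, warmup_steps)
    # Progress through the decay phase, from 0.0 to 1.0.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))


if __name__ == "__main__":
    # Compare the multiplier at step 10 for the old (40) and new (10) step budgets.
    for total in (40, 10):
        print(f"training_steps={total}: multiplier at step 10 = "
              f"{cosine_multiplier(10, total):.4f}")
```

Under these assumptions the multiplier at step 10 is about 0.85 with a 40-step budget and 0.0 with a 10-step budget. The sketch only illustrates the schedule shape; it does not by itself account for the difference between the two step-10 validation losses in the table.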