End of training
README.md CHANGED

@@ -5,6 +5,8 @@ base_model: NousResearch/Meta-Llama-3-8B-Instruct
 tags:
 - axolotl
 - generated_from_trainer
+datasets:
+- deepakkarkala/sft_sitcom_chandlerbing_jsonl
 model-index:
 - name: llama31-8b-sft-sitcom-lora
   results: []
@@ -73,7 +75,9 @@ weight_decay: 0.0
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/deepakkarkala-personal/finetuning_llama31_8b_sitcom/runs/sft_trial_3)
 # llama31-8b-sft-sitcom-lora
 
-This model is a fine-tuned version of [NousResearch/Meta-Llama-3-8B-Instruct](https://huggingface.co/NousResearch/Meta-Llama-3-8B-Instruct) on
+This model is a fine-tuned version of [NousResearch/Meta-Llama-3-8B-Instruct](https://huggingface.co/NousResearch/Meta-Llama-3-8B-Instruct) on the deepakkarkala/sft_sitcom_chandlerbing_jsonl dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.8431
 
 ## Model description
 
@@ -103,6 +107,17 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 10
 - training_steps: 200
 
+### Training results
+
+| Training Loss | Epoch  | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 2.9323        | 0.0050 | 1    | 2.8320          |
+| 2.0701        | 0.2506 | 50   | 1.9194          |
+| 1.9102        | 0.5013 | 100  | 1.8692          |
+| 1.9795        | 0.7519 | 150  | 1.8487          |
+| 1.8136        | 1.0    | 200  | 1.8431          |
+
+
 ### Framework versions
 
 - PEFT 0.15.2
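The validation losses added in this commit are easier to interpret as perplexities. Assuming the reported loss is mean token-level cross-entropy (the usual Trainer convention), perplexity is simply `exp(loss)`, so the final eval loss of 1.8431 corresponds to a perplexity of roughly 6.3. A quick sketch over the table above:

```python
import math

# Validation losses from the "Training results" table in this commit.
eval_losses = {1: 2.8320, 50: 1.9194, 100: 1.8692, 150: 1.8487, 200: 1.8431}

# Perplexity = exp(mean cross-entropy loss), assuming the trainer
# reports mean token-level cross-entropy.
perplexities = {step: math.exp(loss) for step, loss in eval_losses.items()}

for step, ppl in sorted(perplexities.items()):
    print(f"step {step:4d}: perplexity {ppl:.2f}")
```

This is only a readability aid: it shows eval perplexity falling from about 17 after the first step to about 6.3 at step 200, consistent with the loss curve flattening over the single training epoch.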