besimray
/

test

Generated from Trainer

8-bit precision

Model card Files Files and versions

besimray commited on Oct 23, 2024

Commit

322ebcd

·

verified ·

1 Parent(s): 9c66697

End of training

Files changed (2) hide show

README.md +16 -10
adapter_model.bin +2 -2

README.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 library_name: peft
 license: llama3.2
-base_model: unsloth/Llama-3.2-1B-Instruct
 tags:
 - axolotl
 - generated_from_trainer
@@ -19,13 +19,19 @@ should probably proofread and complete it, then remove this comment. -->
 axolotl version: `0.4.1`
 ```yaml
 adapter: lora
-base_model: unsloth/Llama-3.2-1B-Instruct
 bf16: auto
 chat_template: llama3
 dataset_prepared_path: null
 datasets:
 - path: mhenrichsen/alpaca_2k_test
-  type: alpaca
 debug: null
 deepspeed: null
 early_stopping_patience: null
@@ -77,7 +83,7 @@ wandb_entity: besimray24-rayon
 wandb_mode: online
 wandb_project: Public_TuningSN
 wandb_run: miner_id_24
-wandb_runid: 383a850e-bb15-45a2-8f4b-fc96eb001a74
 warmup_steps: 10
 weight_decay: 0.0
 xformers_attention: null
@@ -88,9 +94,9 @@ xformers_attention: null
 # test
-This model is a fine-tuned version of [unsloth/Llama-3.2-1B-Instruct](https://huggingface.co/unsloth/Llama-3.2-1B-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.2167
 ## Model description
@@ -124,10 +130,10 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 1.3218        | 0.0042 | 1    | 1.2625          |
-| 1.3071        | 0.0126 | 3    | 1.2579          |
-| 1.4942        | 0.0253 | 6    | 1.2140          |
-| 1.277         | 0.0379 | 9    | 1.2167          |
 ### Framework versions

 ---
 library_name: peft
 license: llama3.2
+base_model: unsloth/Llama-3.2-3B-Instruct
 tags:
 - axolotl
 - generated_from_trainer
 axolotl version: `0.4.1`
 ```yaml
 adapter: lora
+base_model: unsloth/Llama-3.2-3B-Instruct
 bf16: auto
 chat_template: llama3
 dataset_prepared_path: null
 datasets:
 - path: mhenrichsen/alpaca_2k_test
+  type:
+    field_input: input
+    field_instruction: instruction
+    field_output: output
+    field_system: text
+    system_format: '{system}'
+    system_prompt: you are helpful
 debug: null
 deepspeed: null
 early_stopping_patience: null
 wandb_mode: online
 wandb_project: Public_TuningSN
 wandb_run: miner_id_24
+wandb_runid: 123e4567-e89b-12d3-a456-426614174000
 warmup_steps: 10
 weight_decay: 0.0
 xformers_attention: null
 # test
+This model is a fine-tuned version of [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0439
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 0.1517        | 0.0042 | 1    | 0.2442          |
+| 0.1181        | 0.0126 | 3    | 0.2362          |
+| 0.3502        | 0.0253 | 6    | 0.1496          |
+| 0.0495        | 0.0379 | 9    | 0.0439          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:61ba026082313c7f15d589fe716210021e2d1334718de7cb4e272b7552bcf546
-size 45169354

 version https://git-lfs.github.com/spec/v1
+oid sha256:43bf6fd3cc6bb3b2f61eb935b2c760530f315b059c95c5db87d7f3f2698f317b
+size 97396522