End of training

Files changed (6)

README.md CHANGED

@@ -1,7 +1,8 @@
 ---
-base_model: meta-llama/Llama-2-7b-hf
 tags:
 - generated_from_trainer
 model-index:
 - name: llama_test
   results: []
@@ -12,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
 # llama_test
-This model is a fine-tuned version of [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.0042
 ## Model description
@@ -33,25 +34,25 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 1e-05
-- train_batch_size: 16
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
-- lr_scheduler_type: linear
 - training_steps: 1000
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 2.9914        | 0.1   | 500  | 3.0161          |
-| 2.9937        | 0.2   | 1000 | 3.0042          |
 ### Framework versions
-- Transformers 4.34.1
-- Pytorch 1.12.1+cu113
-- Datasets 2.14.5
-- Tokenizers 0.14.1

 ---
+library_name: peft
 tags:
 - generated_from_trainer
+base_model: meta-llama/Llama-2-7b-hf
 model-index:
 - name: llama_test
   results: []
 # llama_test
+This model is a fine-tuned version of [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.0538
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 4
+- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: cosine
 - training_steps: 1000
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 3.0546        | 0.05  | 1000 | 3.0538          |
 ### Framework versions
+- PEFT 0.8.2
+- Transformers 4.38.1
+- Pytorch 2.2.0+cu118
+- Datasets 2.17.1
+- Tokenizers 0.15.2
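
Reviewer note: the Epoch column in the old and new results tables can be cross-checked against the batch sizes. The sketch below is a rough estimate only. It assumes a single device and no gradient accumulation (neither is stated in this diff), and the logged epoch values are rounded, so the result is approximate. The helper name `implied_dataset_size` is hypothetical.

```python
def implied_dataset_size(step: int, batch_size: int, epoch: float) -> float:
    """Estimate the training-set size as samples seen divided by the epoch fraction.

    Assumes one device and gradient_accumulation_steps == 1 (not confirmed by the diff).
    """
    samples_seen = step * batch_size
    return samples_seen / epoch


# Old run: step 1000, train_batch_size 16, epoch 0.2
old_run = implied_dataset_size(step=1000, batch_size=16, epoch=0.2)
# New run: step 1000, train_batch_size 4, epoch 0.05
new_run = implied_dataset_size(step=1000, batch_size=4, epoch=0.05)

print(round(old_run), round(new_run))
```

Under those assumptions both runs point to roughly the same training-set size, which suggests the data did not change and that the new run simply saw a quarter as many samples in its 1000 steps.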

adapter_config.json ADDED

+{
+  "alpha_pattern": {},
+  "auto_mapping": null,
+  "base_model_name_or_path": "meta-llama/Llama-2-7b-hf",
+  "bias": "none",
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 32,
+  "lora_dropout": 0.1,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "r": 32,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "q_proj",
+    "v_proj"
+  ],
+  "task_type": "CAUSAL_LM",
+  "use_rslora": false
+}
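
Reviewer note: the config above is enough to estimate the LoRA scaling factor and the number of trainable adapter weights. This is a minimal sketch, not PEFT's own code. It assumes the published Llama-2-7b shape (32 decoder layers, hidden size 4096, square `q_proj` and `v_proj` matrices), which does not appear in this diff, and it uses the standard `lora_alpha / r` scaling that applies when `use_rslora` is false. The function names are hypothetical.

```python
import json

# Only the fields needed for the estimate, copied from adapter_config.json.
ADAPTER_CONFIG = json.loads("""
{
  "lora_alpha": 32,
  "r": 32,
  "target_modules": ["q_proj", "v_proj"],
  "use_rslora": false
}
""")

# Assumptions about the base model (not stated in the diff).
NUM_LAYERS = 32
HIDDEN_SIZE = 4096


def lora_scaling(cfg: dict) -> float:
    """Standard LoRA scaling, lora_alpha / r (valid when use_rslora is false)."""
    return cfg["lora_alpha"] / cfg["r"]


def lora_param_count(cfg: dict) -> int:
    """Count weights in the low-rank A (r x in) and B (out x r) matrices."""
    r = cfg["r"]
    per_module = r * HIDDEN_SIZE + HIDDEN_SIZE * r
    return per_module * len(cfg["target_modules"]) * NUM_LAYERS


print(lora_scaling(ADAPTER_CONFIG))
print(lora_param_count(ADAPTER_CONFIG))
```

With `lora_alpha` equal to `r`, the adapter update is applied at unit scale.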

adapter_model.safetensors ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:3286bf77bc7155cee4c5c18fca766a27cb2105d12145d8dda1107802e5bbdbba
+size 33571752
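
Reviewer note: the LFS size above can be sanity-checked against the adapter shape. The sketch assumes 16,777,216 adapter weights (rank 32 on `q_proj` and `v_proj` across 32 layers of hidden size 4096, a base-model shape not stated in this diff) stored at 2 bytes each. The 16-bit storage is an inference from the file size, not something the commit confirms.

```python
LFS_SIZE_BYTES = 33571752  # from the LFS pointer for adapter_model.safetensors

# Assumed adapter size: (32*4096 + 4096*32) weights per module,
# 2 target modules per layer, 32 layers.
LORA_PARAMS = (32 * 4096 + 4096 * 32) * 2 * 32

# Assumption: weights saved in a 16-bit dtype (fp16 or bf16).
BYTES_PER_PARAM = 2

tensor_bytes = LORA_PARAMS * BYTES_PER_PARAM
overhead_bytes = LFS_SIZE_BYTES - tensor_bytes

print(tensor_bytes, overhead_bytes)
```

The few kilobytes left over are plausibly the safetensors header describing the individual tensors. At 4 bytes per weight the tensors alone would exceed the file size, so 32-bit storage is ruled out under these assumptions.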

runs/Feb26_19-48-12_gpu-3/events.out.tfevents.1708976895.gpu-3.343680.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:666f554cf16ba97d778cba0ec2327ca83faed920d968f56c2546eb63eed85dd8
+size 13659

runs/Feb26_19-48-12_gpu-3/events.out.tfevents.1708977350.gpu-3.343680.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:aa1e3d5163193f1bb73e17b0d1eddac8275fbde84e91f1f0480d88c147f85ec2
+size 359

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1c162a7e009b92741066720b77bec62310129e7f87b7b0cfa743ce7c569fc658
-size 4015

 version https://git-lfs.github.com/spec/v1
+oid sha256:2d142d93be53d026558969b0bb732d7e1fe3ac978f55e4493044cc9fa5a4b3b9
+size 4920
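
Reviewer note: the binary files in this commit appear only as Git LFS pointers, which are small text files of `key value` lines. The sketch below parses the old and new pointers for `training_args.bin` and compares them. The helper name `parse_lfs_pointer` is hypothetical and handles only the three keys shown in this diff.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its version, hash, and size fields."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


old_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:1c162a7e009b92741066720b77bec62310129e7f87b7b0cfa743ce7c569fc658
size 4015
""")

new_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:2d142d93be53d026558969b0bb732d7e1fe3ac978f55e4493044cc9fa5a4b3b9
size 4920
""")

size_delta = new_pointer["size"] - old_pointer["size"]
print(old_pointer["digest"] != new_pointer["digest"], size_delta)
```

A changed digest confirms the serialized training arguments differ between the two commits. The pointer alone does not reveal which arguments changed.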