End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -114,7 +114,7 @@ xformers_attention: null
 This model is a fine-tuned version of [HuggingFaceM4/tiny-random-LlamaForCausalLM](https://huggingface.co/HuggingFaceM4/tiny-random-LlamaForCausalLM) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 10.0460
 ## Model description
@@ -148,12 +148,12 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| No log        | 0.0017 | 1    | 10.3634         |
-| 10.1375       | 0.0846 | 50   | 10.1435         |
-| 10.0115       | 0.1693 | 100  | 10.0550         |
-| 10.0111       | 0.2539 | 150  | 10.0513         |
-| 10.0157       | 0.3386 | 200  | 10.0494         |
-| 10.0101       | 0.4232 | 250  | 10.0460         |
 ### Framework versions

 This model is a fine-tuned version of [HuggingFaceM4/tiny-random-LlamaForCausalLM](https://huggingface.co/HuggingFaceM4/tiny-random-LlamaForCausalLM) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 10.0458
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| No log        | 0.0017 | 1    | 10.3633         |
+| 10.1286       | 0.0846 | 50   | 10.1285         |
+| 10.0127       | 0.1693 | 100  | 10.0565         |
+| 10.0111       | 0.2539 | 150  | 10.0516         |
+| 10.0149       | 0.3386 | 200  | 10.0488         |
+| 10.0091       | 0.4232 | 250  | 10.0458         |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -20,13 +20,13 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "o_proj",
     "gate_proj",
-    "v_proj",
-    "q_proj",
-    "down_proj",
     "k_proj",
-    "up_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "down_proj",
+    "q_proj",
     "o_proj",
     "gate_proj",
+    "up_proj",
     "k_proj",
+    "v_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9f4cc1c911cdb6b75e6ef79b90816dff388e9b440f1518da129c0e75eae58174
 size 104322

 version https://git-lfs.github.com/spec/v1
+oid sha256:205782a99f62519600cd2b68e7422316b22672b1d147d80e76630f1382c8466e
 size 104322

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2277498183f175e157bfe2a8cb6c913c6815b956b72ce057d1a61e34eaaf0e38
 size 97728

 version https://git-lfs.github.com/spec/v1
+oid sha256:e4e4fe181e83e6c7c00bc6ec075e87fb77fe72f07a40d0f4551a2e673e9d70c7
 size 97728

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4780c040b65b637eb80c0e56444fb3f2f6c0dba70fe84c9344982277dc43bda0
 size 6776

 version https://git-lfs.github.com/spec/v1
+oid sha256:2a1ec810967320e3c24d8c7b6696308cae904fc205727ef08a59093620d7caa2
 size 6776