Model save

Files changed (5) hide show

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-Chat-v1.0](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: nan
 ## Model description
@@ -50,8 +50,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 0.4   | 2    | nan             |
-| No log        | 0.8   | 4    | nan             |
 ### Framework versions

 This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-Chat-v1.0](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5165
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 0.4   | 2    | 0.6340          |
+| No log        | 0.8   | 4    | 0.5165          |
 ### Framework versions

final/adapter_config.json CHANGED Viewed

@@ -24,8 +24,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
     "k_proj",
     "v_proj",
     "o_proj"
   ],

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "k_proj",
+    "q_proj",
     "v_proj",
     "o_proj"
   ],

final/adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:18fa74c76360bf6f2f85ddad45632b9d6d43663653f830389241672b4616f51a
 size 9034304

 version https://git-lfs.github.com/spec/v1
+oid sha256:c9c74dc405a2b8717ff50933f769b0dc5c03cb8b5f4b5b6d5b25e948f5a06aba
 size 9034304

final/tokenizer.json CHANGED Viewed

@@ -1,19 +1,7 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 1024,
-    "strategy": "LongestFirst",
-    "stride": 0
-  },
-  "padding": {
-    "strategy": "BatchLongest",
-    "direction": "Right",
-    "pad_to_multiple_of": null,
-    "pad_id": 2,
-    "pad_type_id": 0,
-    "pad_token": "</s>"
-  },
   "added_tokens": [
     {
       "id": 0,

 {
   "version": "1.0",
+  "truncation": null,
+  "padding": null,
   "added_tokens": [
     {
       "id": 0,

final/training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:67555874961dd2c97f3dddfe093aa697fa6bb50ffb490e3f81442c7944cd00ea
 size 5713

 version https://git-lfs.github.com/spec/v1
+oid sha256:b8d6ab70993f77f33eb2e4092599016b68a809c5f24fb7121c4556fe62b5f34f
 size 5713