End of training

README.md CHANGED
@@ -19,8 +19,6 @@ should probably proofread and complete it, then remove this comment. -->
 # zephyr-7b-sft-lora

 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the generator dataset.
-It achieves the following results on the evaluation set:
-- Loss: 1.1563

 ## Model description

@@ -47,14 +45,13 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
-- num_epochs:
+- num_epochs: 1

 ### Training results

 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
 | No log        | 1.0   | 1    | 1.1585          |
-| No log        | 2.0   | 3    | 1.1563          |


 ### Framework versions
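For context, a minimal sketch of how the hyperparameters listed in this hunk would appear in a `transformers` `TrainingArguments` object. Only the total train batch size of 128, the Adam betas/epsilon, the cosine scheduler, and num_epochs: 1 come from the card; the learning rate, per-device batch size, and gradient-accumulation split are illustrative assumptions.

```python
# Sketch only: values marked "assumption" are not shown in this diff hunk.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="zephyr-7b-sft-lora",
    num_train_epochs=1,              # num_epochs: 1 (from the card)
    per_device_train_batch_size=4,   # assumption: 4 x 4 accum x 8 GPUs = 128 total
    gradient_accumulation_steps=4,   # assumption
    learning_rate=2e-5,              # assumption, not listed in this hunk
    lr_scheduler_type="cosine",      # from the card
    adam_beta1=0.9,                  # optimizer: Adam with betas=(0.9, 0.999)
    adam_beta2=0.999,
    adam_epsilon=1e-08,              # epsilon=1e-08 (from the card)
    logging_steps=5,                 # assumption
)
```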
adapter_config.json CHANGED
@@ -20,8 +20,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
     "q_proj",
+    "v_proj",
     "k_proj",
     "o_proj"
   ],
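Reordering entries in `target_modules` is cosmetic: PEFT matches module names as a set, so the same projections receive LoRA adapters either way. A minimal sketch of the corresponding PEFT config follows; `r`, `lora_alpha`, and `lora_dropout` are illustrative assumptions not visible in this hunk.

```python
# Sketch of the LoraConfig this adapter_config.json corresponds to.
from peft import LoraConfig

lora_config = LoraConfig(
    r=64,              # assumption: rank not shown in this hunk
    lora_alpha=16,     # assumption
    lora_dropout=0.1,  # assumption
    target_modules=["q_proj", "v_proj", "k_proj", "o_proj"],  # from adapter_config.json
    task_type="CAUSAL_LM",
)
```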
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:a0f5b6b3e0c94a7c6e292367b53856155333ae85f4f5f825c5d5f9d3bc817b7b
 size 109086672
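This Git LFS pointer tracks the LoRA adapter weights themselves. A minimal sketch of loading them on top of the base model with PEFT; the repo id `<user>/zephyr-7b-sft-lora` is a placeholder assumption for wherever this repository is hosted.

```python
# Sketch only: substitute the actual Hub id of this adapter repository.
import torch
from peft import AutoPeftModelForCausalLM
from transformers import AutoTokenizer

model = AutoPeftModelForCausalLM.from_pretrained(
    "<user>/zephyr-7b-sft-lora",   # placeholder for this adapter repo
    torch_dtype=torch.bfloat16,
)
tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")
```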
runs/Apr15_09-56-17_296d921ba72a/events.out.tfevents.1713175037.296d921ba72a.192.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:715814ef9feb1dd3312e47bd8d9acfb76ae0f8564d0063b81b404962f84a6857
+size 5673
special_tokens_map.json CHANGED
@@ -13,13 +13,7 @@
     "rstrip": false,
     "single_word": false
   },
-  "pad_token": {
-    "content": "</s>",
-    "lstrip": false,
-    "normalized": false,
-    "rstrip": false,
-    "single_word": false
-  },
+  "pad_token": "</s>",
   "unk_token": {
     "content": "<unk>",
     "lstrip": false,
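The padding token is now stored as the plain string "</s>" (Mistral's EOS token) rather than a full added-token object. A minimal sketch of the tokenizer-side equivalent; loading the base tokenizer here is an assumption for illustration.

```python
# Sketch: map the pad token to the EOS token, matching special_tokens_map.json above.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")
tokenizer.pad_token = tokenizer.eos_token  # "</s>" serves as the padding token
print(tokenizer.pad_token, tokenizer.pad_token_id)
```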
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:c9582b26190fbdf4d63d1a324e42002ae8946fe46f9360450c6c956c788c1625
 size 4984