Viral04/CUAD_Finetuned

Files changed (5) hide show

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.9593
 ## Model description
@@ -47,16 +47,17 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_steps: 0.03
 - training_steps: 100
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 2.0288        | 0.08  | 20   | 2.0121          |
-| 1.9859        | 0.15  | 40   | 1.9932          |
-| 1.972         | 0.23  | 60   | 1.9786          |
-| 1.9645        | 0.3   | 80   | 1.9665          |
-| 1.9412        | 0.38  | 100  | 1.9593          |
 ### Framework versions

 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.3202
 ## Model description
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_steps: 0.03
 - training_steps: 100
+- mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.4597        | 0.0   | 20   | 1.3668          |
+| 1.4026        | 0.0   | 40   | 1.3473          |
+| 1.4525        | 0.01  | 60   | 1.3343          |
+| 1.4547        | 0.01  | 80   | 1.3256          |
+| 1.3827        | 0.01  | 100  | 1.3202          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -19,8 +19,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "v_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "v_proj",
+    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:95e9c124e121eaddca720d28beb16bb93e5eddf604e438974518a186a6ec0bff
 size 109069176

 version https://git-lfs.github.com/spec/v1
+oid sha256:2d43695c5203ee288accbe923024837c40f7bc415d9c9d4cf8af56b443321eee
 size 109069176

runs/Mar18_18-30-59_96cb8199a2b8/events.out.tfevents.1710786878.96cb8199a2b8.7106.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:d266280f6f102b1a66fdb1d5f3f7a09b64ab53483689eed7a69701c9cd3e77fe
+size 8797

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3d27596c137feb8f6dd02e2e57d273d2c071c80a5ae1178485acdc361c7163a5
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:8d31ea7a93f15082ed5c9492a7e9b967b2ae13c52d5242a45d4e54d216bb1897
 size 4920