Viral04/CUAD_Finetuned

Files changed (7) hide show

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3202
 ## Model description
@@ -46,18 +46,23 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_steps: 0.03
-- training_steps: 100
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.4597        | 0.0   | 20   | 1.3668          |
-| 1.4026        | 0.0   | 40   | 1.3473          |
-| 1.4525        | 0.01  | 60   | 1.3343          |
-| 1.4547        | 0.01  | 80   | 1.3256          |
-| 1.3827        | 0.01  | 100  | 1.3202          |
 ### Framework versions

 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.3268
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_steps: 0.03
+- training_steps: 200
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.4515        | 0.0   | 20   | 1.3677          |
+| 1.4001        | 0.0   | 40   | 1.3462          |
+| 1.4109        | 0.01  | 60   | 1.3316          |
+| 1.3631        | 0.01  | 80   | 1.3256          |
+| 1.3463        | 0.01  | 100  | 1.3231          |
+| 1.3541        | 0.01  | 120  | 1.3173          |
+| 1.3072        | 0.01  | 140  | 1.3156          |
+| 1.2798        | 0.02  | 160  | 1.3114          |
+| 1.3276        | 0.02  | 180  | 1.3143          |
+| 1.2681        | 0.02  | 200  | 1.3268          |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2d43695c5203ee288accbe923024837c40f7bc415d9c9d4cf8af56b443321eee
 size 109069176

 version https://git-lfs.github.com/spec/v1
+oid sha256:3081a2fe3429a6b5217341739c900aa5d45b766132cb3150be514eb5833ba0c8
 size 109069176

runs/Mar19_16-12-38_637e4b325ea6/events.out.tfevents.1710864847.637e4b325ea6.3354.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:77dd94b7c1e96345241828ae7cd094fe8ef4407b1ba7a25ffeb36a914ccbae55
+size 5253

runs/Mar19_16-15-58_637e4b325ea6/events.out.tfevents.1710865008.637e4b325ea6.3354.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:1b8a94749339353b6f909b2bd6fc8c1f2cd3bf83b7351a504132f8dddeefa9c7
+size 5046

runs/Mar19_16-17-47_637e4b325ea6/events.out.tfevents.1710865070.637e4b325ea6.3354.2 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:4756c947be4beef4da6927b9cbec3c29182ee413feffdaa9420bc0a550dba113
+size 6409

runs/Mar19_16-56-50_637e4b325ea6/events.out.tfevents.1710867412.637e4b325ea6.16722.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:2764ae878b65e7ee441e5ee1e728572d5846d0fa11d2026169683aeedc8ea669
+size 12255

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8d31ea7a93f15082ed5c9492a7e9b967b2ae13c52d5242a45d4e54d216bb1897
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:961f4e5b319c3c0166e801eccd40a08a6cb966c618409a2eee4ee81ff64bf94e
 size 4920