Thalesian
/

train_4

@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model was trained from scratch on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2986
 ## Model description
@@ -50,24 +50,35 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step   | Validation Loss |
 |:-------------:|:-----:|:------:|:---------------:|
-| 0.0862        | 1.0   | 12556  | 0.2640          |
-| 0.0846        | 2.0   | 25112  | 0.5038          |
-| 0.0851        | 3.0   | 37668  | 0.2431          |
-| 0.0848        | 4.0   | 50224  | 0.3051          |
-| 0.0837        | 5.0   | 62780  | 0.2666          |
-| 0.0821        | 6.0   | 75336  | 0.2576          |
-| 0.0821        | 7.0   | 87892  | 0.3419          |
-| 0.0813        | 8.0   | 100448 | 0.2919          |
-| 0.079         | 9.0   | 113004 | 0.3469          |
-| 0.0782        | 10.0  | 125560 | 0.2686          |
-| 0.0785        | 11.0  | 138116 | 0.2726          |
-| 0.0782        | 12.0  | 150672 | 0.2957          |
-| 0.0791        | 13.0  | 163228 | 0.2599          |
-| 0.0758        | 14.0  | 175784 | 0.2517          |
-| 0.0762        | 15.0  | 188340 | 0.2636          |
-| 0.0755        | 16.0  | 200896 | 0.3412          |
-| 0.0739        | 17.0  | 213452 | 0.2568          |
-| 0.0735        | 18.0  | 226008 | 0.2986          |
 ### Framework versions

 This model was trained from scratch on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3656
 ## Model description
 | Training Loss | Epoch | Step   | Validation Loss |
 |:-------------:|:-----:|:------:|:---------------:|
+| 0.1096        | 1.0   | 10537  | 0.3729          |
+| 0.1064        | 2.0   | 21074  | 0.3767          |
+| 0.106         | 3.0   | 31611  | 0.4043          |
+| 0.1058        | 4.0   | 42148  | 0.4770          |
+| 0.1046        | 5.0   | 52685  | 0.3123          |
+| 0.1048        | 6.0   | 63222  | 0.5321          |
+| 0.102         | 7.0   | 73759  | 0.3984          |
+| 0.1028        | 8.0   | 84296  | 0.4548          |
+| 0.1014        | 9.0   | 94833  | 0.3696          |
+| 0.101         | 10.0  | 105370 | 0.3732          |
+| 0.0991        | 11.0  | 115907 | 0.4069          |
+| 0.0981        | 12.0  | 126444 | 0.5239          |
+| 0.0976        | 13.0  | 136981 | 0.6685          |
+| 0.0972        | 14.0  | 147518 | 0.2434          |
+| 0.0976        | 15.0  | 158055 | 0.3415          |
+| 0.0962        | 16.0  | 168592 | 0.5666          |
+| 0.0932        | 17.0  | 179129 | 0.3276          |
+| 0.0943        | 18.0  | 189666 | 0.4639          |
+| 0.0942        | 19.0  | 200203 | 0.3452          |
+| 0.0937        | 20.0  | 210740 | 0.3022          |
+| 0.0946        | 21.0  | 221277 | 0.3548          |
+| 0.0911        | 22.0  | 231814 | 0.5837          |
+| 0.0911        | 23.0  | 242351 | 0.3594          |
+| 0.0924        | 24.0  | 252888 | 0.2844          |
+| 0.0903        | 25.0  | 263425 | 0.3243          |
+| 0.0913        | 26.0  | 273962 | 0.4006          |
+| 0.0886        | 27.0  | 284499 | 0.3936          |
+| 0.0896        | 28.0  | 295036 | 0.2499          |
+| 0.0895        | 29.0  | 305573 | 0.3656          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bb4f0c86f3288e211790f84a5b778a724f18892f7c3036b3b1dd9fa82bb0f61a
 size 243045416

 version https://git-lfs.github.com/spec/v1
+oid sha256:f419c27c0aecfab9d7f7792037dfcb5e8df30aea540f0db9be929cdd89247734
 size 243045416

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f5e9cb9e291d4e237e792ce385ec4f81008e658a668eb9d127d2ee3eec073938
 size 5560

 version https://git-lfs.github.com/spec/v1
+oid sha256:4fce7160bea2cdcb47c0e6438aa177f032ed758e5cec2442989e6c94760e7d32
 size 5560