model_15M_small_ds_masking_0.5_predicted_hparamas

Browse files

Files changed (3) hide show

README.md +33 -33
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -16,8 +16,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5508
-- Accuracy: 0.8118
 ## Model description
@@ -50,37 +50,37 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch   | Step  | Validation Loss | Accuracy |
 |:-------------:|:-------:|:-----:|:---------------:|:--------:|
-| No log        | 0       | 0     | 4.4528          | 0.0016   |
-| 0.8934        | 0.4302  | 1953  | 0.8072          | 0.7287   |
-| 0.7687        | 0.8604  | 3906  | 0.7341          | 0.7523   |
-| 0.7224        | 1.2905  | 5859  | 0.6983          | 0.7638   |
-| 0.6967        | 1.7207  | 7812  | 0.6746          | 0.7709   |
-| 0.6766        | 2.1509  | 9765  | 0.6597          | 0.7762   |
-| 0.6601        | 2.5811  | 11718 | 0.6466          | 0.7806   |
-| 0.6487        | 3.0112  | 13671 | 0.6323          | 0.7849   |
-| 0.6372        | 3.4414  | 15624 | 0.6253          | 0.7872   |
-| 0.6308        | 3.8716  | 17577 | 0.6167          | 0.7897   |
-| 0.6233        | 4.3018  | 19530 | 0.6119          | 0.7917   |
-| 0.6211        | 4.7319  | 21483 | 0.6069          | 0.7932   |
-| 0.6127        | 5.1621  | 23436 | 0.6013          | 0.7951   |
-| 0.6064        | 5.5923  | 25389 | 0.5973          | 0.7966   |
-| 0.6024        | 6.0225  | 27342 | 0.5919          | 0.7983   |
-| 0.5987        | 6.4526  | 29295 | 0.5873          | 0.8000   |
-| 0.5935        | 6.8828  | 31248 | 0.5854          | 0.8001   |
-| 0.5914        | 7.3130  | 33201 | 0.5815          | 0.8014   |
-| 0.5883        | 7.7432  | 35154 | 0.5760          | 0.8034   |
-| 0.5841        | 8.1733  | 37107 | 0.5732          | 0.8040   |
-| 0.5805        | 8.6035  | 39060 | 0.5724          | 0.8043   |
-| 0.5768        | 9.0337  | 41013 | 0.5682          | 0.8057   |
-| 0.5747        | 9.4639  | 42966 | 0.5637          | 0.8073   |
-| 0.5734        | 9.8941  | 44919 | 0.5617          | 0.8079   |
-| 0.5677        | 10.3242 | 46872 | 0.5632          | 0.8076   |
-| 0.5692        | 10.7544 | 48825 | 0.5596          | 0.8089   |
-| 0.5642        | 11.1846 | 50778 | 0.5556          | 0.8103   |
-| 0.5633        | 11.6148 | 52731 | 0.5555          | 0.8105   |
-| 0.5629        | 12.0449 | 54684 | 0.5512          | 0.8118   |
-| 0.5591        | 12.4751 | 56637 | 0.5548          | 0.8105   |
-| 0.5579        | 12.9053 | 58590 | 0.5534          | 0.8107   |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3649
+- Accuracy: 0.8733
 ## Model description
 | Training Loss | Epoch   | Step  | Validation Loss | Accuracy |
 |:-------------:|:-------:|:-----:|:---------------:|:--------:|
+| No log        | 0       | 0     | 4.4509          | 0.0072   |
+| 0.7148        | 0.4302  | 1953  | 0.6216          | 0.7885   |
+| 0.5807        | 0.8604  | 3906  | 0.5453          | 0.8131   |
+| 0.5322        | 1.2905  | 5859  | 0.5084          | 0.8251   |
+| 0.5068        | 1.7207  | 7812  | 0.4851          | 0.8328   |
+| 0.4855        | 2.1509  | 9765  | 0.4710          | 0.8376   |
+| 0.47          | 2.5811  | 11718 | 0.4552          | 0.8433   |
+| 0.4589        | 3.0112  | 13671 | 0.4455          | 0.8459   |
+| 0.4475        | 3.4414  | 15624 | 0.4383          | 0.8484   |
+| 0.4401        | 3.8716  | 17577 | 0.4276          | 0.8515   |
+| 0.4329        | 4.3018  | 19530 | 0.4218          | 0.8537   |
+| 0.4299        | 4.7319  | 21483 | 0.4167          | 0.8557   |
+| 0.423         | 5.1621  | 23436 | 0.4108          | 0.8568   |
+| 0.4175        | 5.5923  | 25389 | 0.4081          | 0.8584   |
+| 0.4122        | 6.0225  | 27342 | 0.4022          | 0.8607   |
+| 0.4095        | 6.4526  | 29295 | 0.3966          | 0.8625   |
+| 0.4034        | 6.8828  | 31248 | 0.3962          | 0.8623   |
+| 0.4012        | 7.3130  | 33201 | 0.3926          | 0.8636   |
+| 0.3984        | 7.7432  | 35154 | 0.3893          | 0.8650   |
+| 0.395         | 8.1733  | 37107 | 0.3849          | 0.8663   |
+| 0.3904        | 8.6035  | 39060 | 0.3827          | 0.8669   |
+| 0.3873        | 9.0337  | 41013 | 0.3797          | 0.8680   |
+| 0.3865        | 9.4639  | 42966 | 0.3775          | 0.8691   |
+| 0.3839        | 9.8941  | 44919 | 0.3742          | 0.8696   |
+| 0.3781        | 10.3242 | 46872 | 0.3757          | 0.8693   |
+| 0.3806        | 10.7544 | 48825 | 0.3698          | 0.8713   |
+| 0.376         | 11.1846 | 50778 | 0.3703          | 0.8712   |
+| 0.3748        | 11.6148 | 52731 | 0.3672          | 0.8720   |
+| 0.3743        | 12.0449 | 54684 | 0.3643          | 0.8733   |
+| 0.3715        | 12.4751 | 56637 | 0.3662          | 0.8727   |
+| 0.3695        | 12.9053 | 58590 | 0.3666          | 0.8724   |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:53d06bfc07b799ccf6d515f9bb3d1429212435d8f326a842d1bcad044e708e6f
 size 60925776

 version https://git-lfs.github.com/spec/v1
+oid sha256:fa8f0a3fb8aa1fb113188ec55e7fbe69b46e7eab0cb61755051222e9762bfa4c
 size 60925776

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:153bec54bbdca873dd67b71a240815555aa34c9b671116aa7d49e1072080bc03
 size 5905

 version https://git-lfs.github.com/spec/v1
+oid sha256:cc81299c0a6ea26b4af7dc8a6953ad6af75dfe1fa70e7d8cfa5a016c89291664
 size 5905