model_15M_small_ds_masking_0.4_predicted_hparamas

Browse files

Files changed (3) hide show

README.md +35 -9
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -16,8 +16,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3599
-- Accuracy: 0.8744
 ## Model description
@@ -48,13 +48,39 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss | Accuracy |
-|:-------------:|:------:|:----:|:---------------:|:--------:|
-| No log        | 0      | 0    | 4.6089          | 0.0017   |
-| 0.4423        | 0.4302 | 1953 | 0.3749          | 0.8701   |
-| 0.3952        | 0.8604 | 3906 | 0.3602          | 0.8745   |
-| 0.4069        | 1.2905 | 5859 | 0.3640          | 0.8738   |
-| 0.4012        | 1.7207 | 7812 | 0.3674          | 0.8711   |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2445
+- Accuracy: 0.9135
 ## Model description
 ### Training results
+| Training Loss | Epoch   | Step  | Validation Loss | Accuracy |
+|:-------------:|:-------:|:-----:|:---------------:|:--------:|
+| No log        | 0       | 0     | 4.4175          | 0.0125   |
+| 0.5685        | 0.4302  | 1953  | 0.4742          | 0.8365   |
+| 0.4343        | 0.8604  | 3906  | 0.3988          | 0.8613   |
+| 0.3885        | 1.2905  | 5859  | 0.3679          | 0.8716   |
+| 0.3663        | 1.7207  | 7812  | 0.3454          | 0.8790   |
+| 0.3472        | 2.1509  | 9765  | 0.3319          | 0.8840   |
+| 0.3335        | 2.5811  | 11718 | 0.3186          | 0.8881   |
+| 0.3242        | 3.0112  | 13671 | 0.3085          | 0.8917   |
+| 0.3129        | 3.4414  | 15624 | 0.3026          | 0.8937   |
+| 0.3071        | 3.8716  | 17577 | 0.2958          | 0.8957   |
+| 0.3002        | 4.3018  | 19530 | 0.2911          | 0.8976   |
+| 0.2981        | 4.7319  | 21483 | 0.2861          | 0.8992   |
+| 0.2915        | 5.1621  | 23436 | 0.2819          | 0.9006   |
+| 0.2882        | 5.5923  | 25389 | 0.2782          | 0.9018   |
+| 0.2834        | 6.0225  | 27342 | 0.2731          | 0.9036   |
+| 0.2811        | 6.4526  | 29295 | 0.2711          | 0.9043   |
+| 0.2756        | 6.8828  | 31248 | 0.2679          | 0.9053   |
+| 0.2742        | 7.3130  | 33201 | 0.2663          | 0.9060   |
+| 0.2723        | 7.7432  | 35154 | 0.2618          | 0.9075   |
+| 0.2679        | 8.1733  | 37107 | 0.2587          | 0.9086   |
+| 0.2649        | 8.6035  | 39060 | 0.2585          | 0.9089   |
+| 0.2629        | 9.0337  | 41013 | 0.2564          | 0.9092   |
+| 0.2619        | 9.4639  | 42966 | 0.2536          | 0.9104   |
+| 0.2593        | 9.8941  | 44919 | 0.2514          | 0.9108   |
+| 0.2548        | 10.3242 | 46872 | 0.2509          | 0.9111   |
+| 0.2567        | 10.7544 | 48825 | 0.2484          | 0.9121   |
+| 0.2533        | 11.1846 | 50778 | 0.2487          | 0.9121   |
+| 0.2513        | 11.6148 | 52731 | 0.2470          | 0.9122   |
+| 0.2509        | 12.0449 | 54684 | 0.2421          | 0.9141   |
+| 0.2494        | 12.4751 | 56637 | 0.2456          | 0.9132   |
+| 0.2472        | 12.9053 | 58590 | 0.2428          | 0.9139   |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:de3dea857f6e6cad9d8ee484718c303aaa6109a7bc51f33aa4e590c361b1b73e
 size 60925776

 version https://git-lfs.github.com/spec/v1
+oid sha256:04daea2582129fcfb619b038a5f6975b9e0fdcb7a28c8d4b8eae29576e95afe2
 size 60925776

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3a2b46f80daa977eea7bab37ba38e09ff666b2e9052ba40aad0a4a0380a48b6a
 size 5905

 version https://git-lfs.github.com/spec/v1
+oid sha256:2d08c7c87f1b42e56aeb6f0b627ccead925f157fb3c711e08171861f2405ce60
 size 5905