dataset=155, epochs=50, batch_size=1, early_stopping=eval_loss

Browse files

Files changed (3) hide show

README.md +21 -21
model.safetensors +1 -1
runs/Apr29_03-15-19_a88bf9f1af5f/events.out.tfevents.1714360521.a88bf9f1af5f.13227.0 +2 -2

README.md CHANGED Viewed

@@ -20,17 +20,17 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2288
-- Perf P: 0.8211
-- Perf R: 0.9398
-- Inst P: 0.9444
-- Inst R: 0.8095
-- Comp P: 0.7717
-- Comp R: 0.7474
-- Precision: 0.8182
-- Recall: 0.8270
-- F1: 0.8226
-- Accuracy: 0.9412
 ## Model description
@@ -50,8 +50,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 2
-- eval_batch_size: 2
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -61,14 +61,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Perf P | Perf R | Inst P | Inst R | Comp P | Comp R | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:------:|:------:|:------:|:---------:|:------:|:------:|:--------:|
-| 1.1177        | 1.0   | 68   | 0.5451          | 0.5033 | 0.9277 | 0.7377 | 0.7143 | 0.4333 | 0.2737 | 0.5762    | 0.4973 | 0.5338 | 0.8295   |
-| 0.4079        | 2.0   | 136  | 0.3157          | 0.8021 | 0.9277 | 0.7742 | 0.7619 | 0.7701 | 0.7053 | 0.7542    | 0.7297 | 0.7418 | 0.9097   |
-| 0.2188        | 3.0   | 204  | 0.2725          | 0.7143 | 0.9036 | 0.7    | 0.7778 | 0.74   | 0.7789 | 0.7391    | 0.7351 | 0.7371 | 0.9164   |
-| 0.1484        | 4.0   | 272  | 0.2467          | 0.79   | 0.9518 | 0.7681 | 0.8413 | 0.7692 | 0.7368 | 0.7838    | 0.8036 | 0.7936 | 0.9290   |
-| 0.0914        | 5.0   | 340  | 0.2059          | 0.8488 | 0.8795 | 0.8571 | 0.8571 | 0.8370 | 0.8105 | 0.8312    | 0.8252 | 0.8282 | 0.9374   |
-| 0.0656        | 6.0   | 408  | 0.2090          | 0.8247 | 0.9639 | 0.8194 | 0.9365 | 0.8636 | 0.8    | 0.8406    | 0.8360 | 0.8383 | 0.9406   |
-| 0.0541        | 7.0   | 476  | 0.2066          | 0.7692 | 0.9639 | 0.8254 | 0.8254 | 0.8506 | 0.7789 | 0.8259    | 0.8288 | 0.8273 | 0.9432   |
-| 0.0345        | 8.0   | 544  | 0.2288          | 0.8211 | 0.9398 | 0.9444 | 0.8095 | 0.7717 | 0.7474 | 0.8182    | 0.8270 | 0.8226 | 0.9412   |
 ### Framework versions

 This model is a fine-tuned version of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3270
+- Perf P: 0.9
+- Perf R: 0.9529
+- Inst P: 0.8657
+- Inst R: 0.8657
+- Comp P: 0.9341
+- Comp R: 0.9341
+- Precision: 0.8256
+- Recall: 0.8311
+- F1: 0.8283
+- Accuracy: 0.9288
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 1
+- eval_batch_size: 1
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step | Validation Loss | Perf P | Perf R | Inst P | Inst R | Comp P | Comp R | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:------:|:------:|:------:|:---------:|:------:|:------:|:--------:|
+| 0.9205        | 1.0   | 135  | 0.4005          | 0.8148 | 0.7765 | 0.6923 | 0.8060 | 0.8101 | 0.7033 | 0.7042    | 0.6488 | 0.6754 | 0.8767   |
+| 0.2812        | 2.0   | 270  | 0.2675          | 0.8462 | 0.9059 | 0.8485 | 0.8358 | 0.8646 | 0.9121 | 0.7841    | 0.8077 | 0.7957 | 0.9235   |
+| 0.1573        | 3.0   | 405  | 0.2843          | 0.8778 | 0.9294 | 0.9048 | 0.8507 | 0.9014 | 0.7033 | 0.7713    | 0.7559 | 0.7635 | 0.9106   |
+| 0.1013        | 4.0   | 540  | 0.2547          | 0.8316 | 0.9294 | 0.8026 | 0.9104 | 0.8630 | 0.6923 | 0.7465    | 0.7926 | 0.7689 | 0.9235   |
+| 0.0688        | 5.0   | 675  | 0.2390          | 0.8333 | 0.9412 | 0.8611 | 0.9254 | 0.8690 | 0.8022 | 0.7977    | 0.8043 | 0.8010 | 0.9321   |
+| 0.0499        | 6.0   | 810  | 0.2709          | 0.8571 | 0.9176 | 0.8939 | 0.8806 | 0.8438 | 0.8901 | 0.7932    | 0.8211 | 0.8069 | 0.9327   |
+| 0.0387        | 7.0   | 945  | 0.3308          | 0.8941 | 0.8941 | 0.7532 | 0.8657 | 0.9178 | 0.7363 | 0.7638    | 0.7625 | 0.7632 | 0.9168   |
+| 0.0254        | 8.0   | 1080 | 0.3270          | 0.9    | 0.9529 | 0.8657 | 0.8657 | 0.9341 | 0.9341 | 0.8256    | 0.8311 | 0.8283 | 0.9288   |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9813f51fc68314130233e7ec71713b252ee4de60b5390ad2fa126382cb89722e
 size 709139348

 version https://git-lfs.github.com/spec/v1
+oid sha256:ccc77749ef20bf6d901627c6204498fd6edf836487456cd4b4103ca82cfc1c71
 size 709139348

runs/Apr29_03-15-19_a88bf9f1af5f/events.out.tfevents.1714360521.a88bf9f1af5f.13227.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:426af555c6ac42e1d94c8fd5d85ca402ac000e50afec03313036fb0bebcf8486
-size 11525

 version https://git-lfs.github.com/spec/v1
+oid sha256:01245bc89abb86a814ace9210588ca97142c7a7ec1bc8dd7c94b034691609e1a
+size 13845