jamie613 commited on
Commit
57a4be3
·
verified ·
1 Parent(s): 737322b

dataset=155, epochs=50, batch_size=1, early_stopping=eval_loss

Browse files
README.md CHANGED
@@ -20,17 +20,17 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) on an unknown dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.2288
24
- - Perf P: 0.8211
25
- - Perf R: 0.9398
26
- - Inst P: 0.9444
27
- - Inst R: 0.8095
28
- - Comp P: 0.7717
29
- - Comp R: 0.7474
30
- - Precision: 0.8182
31
- - Recall: 0.8270
32
- - F1: 0.8226
33
- - Accuracy: 0.9412
34
 
35
  ## Model description
36
 
@@ -50,8 +50,8 @@ More information needed
50
 
51
  The following hyperparameters were used during training:
52
  - learning_rate: 2e-05
53
- - train_batch_size: 2
54
- - eval_batch_size: 2
55
  - seed: 42
56
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
57
  - lr_scheduler_type: linear
@@ -61,14 +61,14 @@ The following hyperparameters were used during training:
61
 
62
  | Training Loss | Epoch | Step | Validation Loss | Perf P | Perf R | Inst P | Inst R | Comp P | Comp R | Precision | Recall | F1 | Accuracy |
63
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:------:|:------:|:------:|:---------:|:------:|:------:|:--------:|
64
- | 1.1177 | 1.0 | 68 | 0.5451 | 0.5033 | 0.9277 | 0.7377 | 0.7143 | 0.4333 | 0.2737 | 0.5762 | 0.4973 | 0.5338 | 0.8295 |
65
- | 0.4079 | 2.0 | 136 | 0.3157 | 0.8021 | 0.9277 | 0.7742 | 0.7619 | 0.7701 | 0.7053 | 0.7542 | 0.7297 | 0.7418 | 0.9097 |
66
- | 0.2188 | 3.0 | 204 | 0.2725 | 0.7143 | 0.9036 | 0.7 | 0.7778 | 0.74 | 0.7789 | 0.7391 | 0.7351 | 0.7371 | 0.9164 |
67
- | 0.1484 | 4.0 | 272 | 0.2467 | 0.79 | 0.9518 | 0.7681 | 0.8413 | 0.7692 | 0.7368 | 0.7838 | 0.8036 | 0.7936 | 0.9290 |
68
- | 0.0914 | 5.0 | 340 | 0.2059 | 0.8488 | 0.8795 | 0.8571 | 0.8571 | 0.8370 | 0.8105 | 0.8312 | 0.8252 | 0.8282 | 0.9374 |
69
- | 0.0656 | 6.0 | 408 | 0.2090 | 0.8247 | 0.9639 | 0.8194 | 0.9365 | 0.8636 | 0.8 | 0.8406 | 0.8360 | 0.8383 | 0.9406 |
70
- | 0.0541 | 7.0 | 476 | 0.2066 | 0.7692 | 0.9639 | 0.8254 | 0.8254 | 0.8506 | 0.7789 | 0.8259 | 0.8288 | 0.8273 | 0.9432 |
71
- | 0.0345 | 8.0 | 544 | 0.2288 | 0.8211 | 0.9398 | 0.9444 | 0.8095 | 0.7717 | 0.7474 | 0.8182 | 0.8270 | 0.8226 | 0.9412 |
72
 
73
 
74
  ### Framework versions
 
20
 
21
  This model is a fine-tuned version of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) on an unknown dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.3270
24
+ - Perf P: 0.9
25
+ - Perf R: 0.9529
26
+ - Inst P: 0.8657
27
+ - Inst R: 0.8657
28
+ - Comp P: 0.9341
29
+ - Comp R: 0.9341
30
+ - Precision: 0.8256
31
+ - Recall: 0.8311
32
+ - F1: 0.8283
33
+ - Accuracy: 0.9288
34
 
35
  ## Model description
36
 
 
50
 
51
  The following hyperparameters were used during training:
52
  - learning_rate: 2e-05
53
+ - train_batch_size: 1
54
+ - eval_batch_size: 1
55
  - seed: 42
56
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
57
  - lr_scheduler_type: linear
 
61
 
62
  | Training Loss | Epoch | Step | Validation Loss | Perf P | Perf R | Inst P | Inst R | Comp P | Comp R | Precision | Recall | F1 | Accuracy |
63
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:------:|:------:|:------:|:---------:|:------:|:------:|:--------:|
64
+ | 0.9205 | 1.0 | 135 | 0.4005 | 0.8148 | 0.7765 | 0.6923 | 0.8060 | 0.8101 | 0.7033 | 0.7042 | 0.6488 | 0.6754 | 0.8767 |
65
+ | 0.2812 | 2.0 | 270 | 0.2675 | 0.8462 | 0.9059 | 0.8485 | 0.8358 | 0.8646 | 0.9121 | 0.7841 | 0.8077 | 0.7957 | 0.9235 |
66
+ | 0.1573 | 3.0 | 405 | 0.2843 | 0.8778 | 0.9294 | 0.9048 | 0.8507 | 0.9014 | 0.7033 | 0.7713 | 0.7559 | 0.7635 | 0.9106 |
67
+ | 0.1013 | 4.0 | 540 | 0.2547 | 0.8316 | 0.9294 | 0.8026 | 0.9104 | 0.8630 | 0.6923 | 0.7465 | 0.7926 | 0.7689 | 0.9235 |
68
+ | 0.0688 | 5.0 | 675 | 0.2390 | 0.8333 | 0.9412 | 0.8611 | 0.9254 | 0.8690 | 0.8022 | 0.7977 | 0.8043 | 0.8010 | 0.9321 |
69
+ | 0.0499 | 6.0 | 810 | 0.2709 | 0.8571 | 0.9176 | 0.8939 | 0.8806 | 0.8438 | 0.8901 | 0.7932 | 0.8211 | 0.8069 | 0.9327 |
70
+ | 0.0387 | 7.0 | 945 | 0.3308 | 0.8941 | 0.8941 | 0.7532 | 0.8657 | 0.9178 | 0.7363 | 0.7638 | 0.7625 | 0.7632 | 0.9168 |
71
+ | 0.0254 | 8.0 | 1080 | 0.3270 | 0.9 | 0.9529 | 0.8657 | 0.8657 | 0.9341 | 0.9341 | 0.8256 | 0.8311 | 0.8283 | 0.9288 |
72
 
73
 
74
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9813f51fc68314130233e7ec71713b252ee4de60b5390ad2fa126382cb89722e
3
  size 709139348
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ccc77749ef20bf6d901627c6204498fd6edf836487456cd4b4103ca82cfc1c71
3
  size 709139348
runs/Apr29_03-15-19_a88bf9f1af5f/events.out.tfevents.1714360521.a88bf9f1af5f.13227.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:426af555c6ac42e1d94c8fd5d85ca402ac000e50afec03313036fb0bebcf8486
3
- size 11525
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:01245bc89abb86a814ace9210588ca97142c7a7ec1bc8dd7c94b034691609e1a
3
+ size 13845