Jsevisal commited on
Commit
ff68340
·
1 Parent(s): f22205d

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +83 -0
README.md ADDED
@@ -0,0 +1,83 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - generated_from_trainer
4
+ metrics:
5
+ - precision
6
+ - recall
7
+ - f1
8
+ - accuracy
9
+ model-index:
10
+ - name: balanced-augmented-mlroberta-gest-pred-seqeval-partialmatch
11
+ results: []
12
+ ---
13
+
14
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
+ should probably proofread and complete it, then remove this comment. -->
16
+
17
+ # balanced-augmented-mlroberta-gest-pred-seqeval-partialmatch
18
+
19
+ This model is a fine-tuned version of [xlm-roberta-large-finetuned-conll03-english](https://huggingface.co/xlm-roberta-large-finetuned-conll03-english) on the None dataset.
20
+ It achieves the following results on the evaluation set:
21
+ - Loss: 1.1691
22
+ - Precision: 0.8311
23
+ - Recall: 0.8196
24
+ - F1: 0.8166
25
+ - Accuracy: 0.8015
26
+
27
+ ## Model description
28
+
29
+ More information needed
30
+
31
+ ## Intended uses & limitations
32
+
33
+ More information needed
34
+
35
+ ## Training and evaluation data
36
+
37
+ More information needed
38
+
39
+ ## Training procedure
40
+
41
+ ### Training hyperparameters
42
+
43
+ The following hyperparameters were used during training:
44
+ - learning_rate: 2e-05
45
+ - train_batch_size: 16
46
+ - eval_batch_size: 16
47
+ - seed: 42
48
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
+ - lr_scheduler_type: linear
50
+ - num_epochs: 20
51
+
52
+ ### Training results
53
+
54
+ | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
55
+ |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
56
+ | 3.1888 | 1.0 | 32 | 2.4935 | 0.2783 | 0.1689 | 0.1470 | 0.3131 |
57
+ | 2.25 | 2.0 | 64 | 1.6646 | 0.6169 | 0.5430 | 0.5416 | 0.5888 |
58
+ | 1.4955 | 3.0 | 96 | 1.2759 | 0.7516 | 0.6600 | 0.6688 | 0.6586 |
59
+ | 0.9512 | 4.0 | 128 | 1.0307 | 0.8052 | 0.7394 | 0.7513 | 0.7147 |
60
+ | 0.6053 | 5.0 | 160 | 0.9993 | 0.7975 | 0.7757 | 0.7724 | 0.7398 |
61
+ | 0.4064 | 6.0 | 192 | 0.9347 | 0.8335 | 0.7939 | 0.7988 | 0.7732 |
62
+ | 0.2802 | 7.0 | 224 | 0.9249 | 0.8285 | 0.7970 | 0.8013 | 0.7818 |
63
+ | 0.2062 | 8.0 | 256 | 0.9051 | 0.8395 | 0.8114 | 0.8189 | 0.7987 |
64
+ | 0.1372 | 9.0 | 288 | 0.9771 | 0.8447 | 0.7922 | 0.8079 | 0.7910 |
65
+ | 0.1 | 10.0 | 320 | 1.0232 | 0.8246 | 0.8086 | 0.8042 | 0.7974 |
66
+ | 0.0815 | 11.0 | 352 | 1.0103 | 0.8391 | 0.8173 | 0.8209 | 0.8024 |
67
+ | 0.0586 | 12.0 | 384 | 1.0424 | 0.8366 | 0.7980 | 0.8085 | 0.7932 |
68
+ | 0.0534 | 13.0 | 416 | 1.1246 | 0.8318 | 0.8070 | 0.8126 | 0.7969 |
69
+ | 0.0412 | 14.0 | 448 | 1.0816 | 0.8338 | 0.8186 | 0.8167 | 0.8028 |
70
+ | 0.0346 | 15.0 | 480 | 1.1178 | 0.8277 | 0.8222 | 0.8182 | 0.8037 |
71
+ | 0.0312 | 16.0 | 512 | 1.1570 | 0.8387 | 0.8237 | 0.8219 | 0.8037 |
72
+ | 0.0268 | 17.0 | 544 | 1.1548 | 0.8375 | 0.8279 | 0.8240 | 0.8028 |
73
+ | 0.0221 | 18.0 | 576 | 1.1514 | 0.8316 | 0.8149 | 0.8169 | 0.8005 |
74
+ | 0.0215 | 19.0 | 608 | 1.1698 | 0.8351 | 0.8221 | 0.8204 | 0.8037 |
75
+ | 0.0213 | 20.0 | 640 | 1.1691 | 0.8311 | 0.8196 | 0.8166 | 0.8015 |
76
+
77
+
78
+ ### Framework versions
79
+
80
+ - Transformers 4.27.3
81
+ - Pytorch 1.13.1+cu116
82
+ - Datasets 2.10.1
83
+ - Tokenizers 0.13.2