sofia-todeschini commited on
Commit
65492f8
·
1 Parent(s): 5112c50

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +77 -0
README.md ADDED
@@ -0,0 +1,77 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ tags:
4
+ - generated_from_trainer
5
+ metrics:
6
+ - accuracy
7
+ model-index:
8
+ - name: PubMedBERT-Large-LitCovid-v1.3.1
9
+ results: []
10
+ ---
11
+
12
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
+ should probably proofread and complete it, then remove this comment. -->
14
+
15
+ # PubMedBERT-Large-LitCovid-v1.3.1
16
+
17
+ This model is a fine-tuned version of [microsoft/BiomedNLP-PubMedBERT-large-uncased-abstract](https://huggingface.co/microsoft/BiomedNLP-PubMedBERT-large-uncased-abstract) on an unknown dataset.
18
+ It achieves the following results on the evaluation set:
19
+ - Loss: 1.2334
20
+ - Hamming loss: 0.0114
21
+ - F1 micro: 0.8967
22
+ - F1 macro: 0.5317
23
+ - F1 weighted: 0.9010
24
+ - F1 samples: 0.9021
25
+ - Precision micro: 0.8733
26
+ - Precision macro: 0.4705
27
+ - Precision weighted: 0.8829
28
+ - Precision samples: 0.9034
29
+ - Recall micro: 0.9214
30
+ - Recall macro: 0.6606
31
+ - Recall weighted: 0.9214
32
+ - Recall samples: 0.9340
33
+ - Roc Auc: 0.9569
34
+ - Accuracy: 0.7445
35
+
36
+ ## Model description
37
+
38
+ More information needed
39
+
40
+ ## Intended uses & limitations
41
+
42
+ More information needed
43
+
44
+ ## Training and evaluation data
45
+
46
+ More information needed
47
+
48
+ ## Training procedure
49
+
50
+ ### Training hyperparameters
51
+
52
+ The following hyperparameters were used during training:
53
+ - learning_rate: 2e-05
54
+ - train_batch_size: 4
55
+ - eval_batch_size: 4
56
+ - seed: 42
57
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
58
+ - lr_scheduler_type: linear
59
+ - num_epochs: 5
60
+
61
+ ### Training results
62
+
63
+ | Training Loss | Epoch | Step | Validation Loss | Hamming loss | F1 micro | F1 macro | F1 weighted | F1 samples | Precision micro | Precision macro | Precision weighted | Precision samples | Recall micro | Recall macro | Recall weighted | Recall samples | Roc Auc | Accuracy |
64
+ |:-------------:|:-----:|:-----:|:---------------:|:------------:|:--------:|:--------:|:-----------:|:----------:|:---------------:|:---------------:|:------------------:|:-----------------:|:------------:|:------------:|:---------------:|:--------------:|:-------:|:--------:|
65
+ | 2.4299 | 1.0 | 9086 | 0.8402 | 0.0183 | 0.8440 | 0.3442 | 0.8744 | 0.8786 | 0.7770 | 0.2832 | 0.8384 | 0.8690 | 0.9236 | 0.6237 | 0.9236 | 0.9363 | 0.9543 | 0.6930 |
66
+ | 1.388 | 2.0 | 18172 | 0.7233 | 0.0161 | 0.8613 | 0.3836 | 0.8886 | 0.8913 | 0.8016 | 0.3164 | 0.8565 | 0.8827 | 0.9306 | 0.6955 | 0.9306 | 0.9426 | 0.9588 | 0.7163 |
67
+ | 0.9723 | 3.0 | 27258 | 0.8506 | 0.0132 | 0.8829 | 0.4584 | 0.8970 | 0.8963 | 0.8434 | 0.3870 | 0.8735 | 0.8924 | 0.9264 | 0.6941 | 0.9264 | 0.9380 | 0.9583 | 0.7313 |
68
+ | 0.7713 | 4.0 | 36344 | 1.0207 | 0.0119 | 0.8923 | 0.5060 | 0.8994 | 0.8990 | 0.8635 | 0.4387 | 0.8795 | 0.8976 | 0.9230 | 0.6758 | 0.9230 | 0.9349 | 0.9574 | 0.7389 |
69
+ | 0.3859 | 5.0 | 45430 | 1.2334 | 0.0114 | 0.8967 | 0.5317 | 0.9010 | 0.9021 | 0.8733 | 0.4705 | 0.8829 | 0.9034 | 0.9214 | 0.6606 | 0.9214 | 0.9340 | 0.9569 | 0.7445 |
70
+
71
+
72
+ ### Framework versions
73
+
74
+ - Transformers 4.28.0
75
+ - Pytorch 2.0.0
76
+ - Datasets 2.1.0
77
+ - Tokenizers 0.13.3