Update README.md
README.md
---
Text Multi-Label Sequence Classification model used to detect whether passages contain a misfortunate event, a cause of misfortune, and/or an action to mollify or prevent some misfortune.
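In a multi-label setup, each class is scored independently (typically with a sigmoid), so a passage can be flagged with any subset of the classes rather than exactly one. A minimal stdlib sketch of that decoding step; the label names and the 0.5 threshold are illustrative assumptions, not taken from this model's config:

```python
import math

LABELS = ["EVENT", "CAUSE", "ACTION"]  # assumed label set, for illustration only

def decode_labels(logits, threshold=0.5):
    """Map raw per-label logits to the subset of predicted labels.

    Unlike single-label softmax classification, each label is scored
    independently, so zero, one, or several labels may fire.
    """
    probs = [1 / (1 + math.exp(-z)) for z in logits]
    return [label for label, p in zip(LABELS, probs) if p >= threshold]

print(decode_labels([2.1, -0.3, 0.7]))  # ['EVENT', 'ACTION']
```

Only the second logit falls below the threshold here (sigmoid(-0.3) ≈ 0.43), so the passage is tagged with two of the three labels at once.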

8293 passages were used for training and split into 5 folds (~6634 in the train set and ~1659 in the validation set for each fold).
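The approximate fold sizes follow from dividing 8293 passages as evenly as possible into 5 folds. A minimal index-based sketch of such a split (ignoring shuffling and stratification, which the actual training pipeline may well use):

```python
def five_fold_indices(n_passages, n_folds=5):
    """Yield (train_indices, val_indices) for each fold of an even split."""
    fold_size, remainder = divmod(n_passages, n_folds)
    start = 0
    for fold in range(n_folds):
        # The first `remainder` folds take one extra passage each.
        size = fold_size + (1 if fold < remainder else 0)
        val = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_passages))
        yield train, val
        start += size

for train, val in five_fold_indices(8293):
    print(len(train), len(val))  # ~6634 train / ~1659 validation per fold
```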

<br><b>Parameters</b>:
<br>Transformer: distilbert-base-uncased
<br>Tokenizer: distilbert-base-uncased
<br>Learning rate: 2e-05
<br>Weight decay: .01
<br>Dropout: .1
<br>Batch size: 8
<br>Epochs: 15
<br>Metric for best model: F1 micro
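Micro-averaged F1 (the model-selection metric above) pools true positives, false positives, and false negatives across all labels and all passages before computing precision and recall, so frequent labels weigh more heavily than rare ones. A minimal stdlib sketch with illustrative indicator matrices:

```python
def f1_micro(y_true, y_pred):
    """Micro-averaged F1 over multi-label 0/1 indicator matrices.

    Each row is one passage; each column is one label.
    """
    tp = fp = fn = 0
    for true_row, pred_row in zip(y_true, y_pred):
        for t, p in zip(true_row, pred_row):
            if p == 1 and t == 1:
                tp += 1
            elif p == 1 and t == 0:
                fp += 1
            elif p == 0 and t == 1:
                fn += 1
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

y_true = [[1, 0, 1], [0, 1, 0], [1, 1, 0]]  # toy gold labels
y_pred = [[1, 0, 0], [0, 1, 1], [1, 0, 0]]  # toy predictions
print(round(f1_micro(y_true, y_pred), 3))  # 0.667
```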

<br><br>Using epoch 13, the current F1 micro score on 2074 passages held out from training is .637. Individual class F1 scores are shown below. Note that, at this time, some labels have been excluded as they are not relevant for the final use of the model.
<ul>
<li>EVENT: -