End of training
Browse files- README.md +56 -56
- model.safetensors +1 -1
README.md
CHANGED
|
@@ -16,8 +16,8 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 16 |
|
| 17 |
This model is a fine-tuned version of [answerdotai/ModernBERT-large](https://huggingface.co/answerdotai/ModernBERT-large) on an unknown dataset.
|
| 18 |
It achieves the following results on the evaluation set:
|
| 19 |
-
- Loss: 0.
|
| 20 |
-
- Classification Report: {'0': {'precision': 0.
|
| 21 |
|
| 22 |
## Model description
|
| 23 |
|
|
@@ -36,14 +36,14 @@ More information needed
|
|
| 36 |
### Training hyperparameters
|
| 37 |
|
| 38 |
The following hyperparameters were used during training:
|
| 39 |
-
- learning_rate:
|
| 40 |
- train_batch_size: 256
|
| 41 |
- eval_batch_size: 256
|
| 42 |
- seed: 42
|
| 43 |
- distributed_type: multi-GPU
|
| 44 |
-
- num_devices:
|
| 45 |
-
- total_train_batch_size:
|
| 46 |
-
- total_eval_batch_size:
|
| 47 |
- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
|
| 48 |
- lr_scheduler_type: linear
|
| 49 |
- num_epochs: 50
|
|
@@ -52,56 +52,56 @@ The following hyperparameters were used during training:
|
|
| 52 |
|
| 53 |
| Training Loss | Epoch | Step | Validation Loss | Classification Report |
|
| 54 |
|:-------------:|:-----:|:----:|:---------------:|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------:|
|
| 55 |
-
| No log | 1.0 |
|
| 56 |
-
| No log | 2.0 |
|
| 57 |
-
| No log | 3.0 |
|
| 58 |
-
| No log | 4.0 |
|
| 59 |
-
| No log | 5.0 |
|
| 60 |
-
| No log | 6.0 |
|
| 61 |
-
| No log | 7.0 |
|
| 62 |
-
| No log | 8.0 |
|
| 63 |
-
| No log | 9.0 |
|
| 64 |
-
| No log | 10.0 |
|
| 65 |
-
| No log | 11.0 |
|
| 66 |
-
| No log | 12.0 |
|
| 67 |
-
| No log | 13.0 |
|
| 68 |
-
| No log | 14.0 |
|
| 69 |
-
| No log | 15.0 |
|
| 70 |
-
| No log | 16.0 |
|
| 71 |
-
| No log | 17.0 |
|
| 72 |
-
| No log | 18.0 |
|
| 73 |
-
| No log | 19.0 |
|
| 74 |
-
| No log | 20.0 |
|
| 75 |
-
| No log | 21.0 |
|
| 76 |
-
| No log | 22.0 |
|
| 77 |
-
| No log | 23.0 |
|
| 78 |
-
| No log | 24.0 |
|
| 79 |
-
| No log | 25.0 |
|
| 80 |
-
| No log | 26.0 |
|
| 81 |
-
| No log | 27.0 |
|
| 82 |
-
| No log | 28.0 |
|
| 83 |
-
| No log | 29.0 |
|
| 84 |
-
| No log | 30.0 |
|
| 85 |
-
| No log | 31.0 |
|
| 86 |
-
| No log | 32.0 |
|
| 87 |
-
| No log | 33.0 |
|
| 88 |
-
| No log | 34.0 |
|
| 89 |
-
| No log | 35.0 |
|
| 90 |
-
| No log | 36.0 |
|
| 91 |
-
| No log | 37.0 |
|
| 92 |
-
| No log | 38.0 |
|
| 93 |
-
| No log | 39.0 |
|
| 94 |
-
| No log | 40.0 |
|
| 95 |
-
| No log | 41.0 |
|
| 96 |
-
| No log | 42.0 |
|
| 97 |
-
| No log | 43.0 |
|
| 98 |
-
| No log | 44.0 |
|
| 99 |
-
| No log | 45.0 |
|
| 100 |
-
| No log | 46.0 |
|
| 101 |
-
| No log | 47.0 |
|
| 102 |
-
| No log | 48.0 |
|
| 103 |
-
| No log | 49.0 |
|
| 104 |
-
| No log | 50.0 |
|
| 105 |
|
| 106 |
|
| 107 |
### Framework versions
|
|
|
|
| 16 |
|
| 17 |
This model is a fine-tuned version of [answerdotai/ModernBERT-large](https://huggingface.co/answerdotai/ModernBERT-large) on an unknown dataset.
|
| 18 |
It achieves the following results on the evaluation set:
|
| 19 |
+
- Loss: 0.9073
|
| 20 |
+
- Classification Report: {'0': {'precision': 0.9506866416978776, 'recall': 0.9566582914572864, 'f1-score': 0.9536631183469004, 'support': 1592.0}, '1': {'precision': 0.724, 'recall': 0.6961538461538461, 'f1-score': 0.7098039215686275, 'support': 260.0}, 'accuracy': 0.9200863930885529, 'macro avg': {'precision': 0.8373433208489388, 'recall': 0.8264060688055663, 'f1-score': 0.831733519957764, 'support': 1852.0}, 'weighted avg': {'precision': 0.91886238314418, 'recall': 0.9200863930885529, 'f1-score': 0.9194280259266244, 'support': 1852.0}}
|
| 21 |
|
| 22 |
## Model description
|
| 23 |
|
|
|
|
| 36 |
### Training hyperparameters
|
| 37 |
|
| 38 |
The following hyperparameters were used during training:
|
| 39 |
+
- learning_rate: 5e-06
|
| 40 |
- train_batch_size: 256
|
| 41 |
- eval_batch_size: 256
|
| 42 |
- seed: 42
|
| 43 |
- distributed_type: multi-GPU
|
| 44 |
+
- num_devices: 3
|
| 45 |
+
- total_train_batch_size: 768
|
| 46 |
+
- total_eval_batch_size: 768
|
| 47 |
- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
|
| 48 |
- lr_scheduler_type: linear
|
| 49 |
- num_epochs: 50
|
|
|
|
| 52 |
|
| 53 |
| Training Loss | Epoch | Step | Validation Loss | Classification Report |
|
| 54 |
|:-------------:|:-----:|:----:|:---------------:|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------:|
|
| 55 |
+
| No log | 1.0 | 9 | 0.4858 | {'0': {'precision': 0.9859307359307359, 'recall': 0.5722361809045227, 'f1-score': 0.7241653418124007, 'support': 1592.0}, '1': {'precision': 0.2661637931034483, 'recall': 0.95, 'f1-score': 0.4158249158249158, 'support': 260.0}, 'accuracy': 0.6252699784017278, 'macro avg': {'precision': 0.626047264517092, 'recall': 0.7611180904522613, 'f1-score': 0.5699951288186582, 'support': 1852.0}, 'weighted avg': {'precision': 0.8848835409333844, 'recall': 0.6252699784017278, 'f1-score': 0.6808778090063822, 'support': 1852.0}} |
|
| 56 |
+
| No log | 2.0 | 18 | 0.4306 | {'0': {'precision': 0.9797421731123389, 'recall': 0.6683417085427136, 'f1-score': 0.79462285287528, 'support': 1592.0}, '1': {'precision': 0.31070496083550914, 'recall': 0.9153846153846154, 'f1-score': 0.46393762183235865, 'support': 260.0}, 'accuracy': 0.703023758099352, 'macro avg': {'precision': 0.645223566973924, 'recall': 0.7918631619636645, 'f1-score': 0.6292802373538193, 'support': 1852.0}, 'weighted avg': {'precision': 0.8858168625335183, 'recall': 0.703023758099352, 'f1-score': 0.7481983603962522, 'support': 1852.0}} |
|
| 57 |
+
| No log | 3.0 | 27 | 0.3970 | {'0': {'precision': 0.9707016191210486, 'recall': 0.7908291457286433, 'f1-score': 0.8715818622360678, 'support': 1592.0}, '1': {'precision': 0.4, 'recall': 0.8538461538461538, 'f1-score': 0.5447852760736196, 'support': 260.0}, 'accuracy': 0.7996760259179265, 'macro avg': {'precision': 0.6853508095605243, 'recall': 0.8223376497873985, 'f1-score': 0.7081835691548437, 'support': 1852.0}, 'weighted avg': {'precision': 0.8905815214042707, 'recall': 0.7996760259179265, 'f1-score': 0.825703291824493, 'support': 1852.0}} |
|
| 58 |
+
| No log | 4.0 | 36 | 0.4288 | {'0': {'precision': 0.9431891542930924, 'recall': 0.917713567839196, 'f1-score': 0.9302769818529131, 'support': 1592.0}, '1': {'precision': 0.5676567656765676, 'recall': 0.6615384615384615, 'f1-score': 0.61101243339254, 'support': 260.0}, 'accuracy': 0.8817494600431965, 'macro avg': {'precision': 0.75542295998483, 'recall': 0.7896260146888288, 'f1-score': 0.7706447076227265, 'support': 1852.0}, 'weighted avg': {'precision': 0.8904686245737099, 'recall': 0.8817494600431965, 'f1-score': 0.8854558249416297, 'support': 1852.0}} |
|
| 59 |
+
| No log | 5.0 | 45 | 0.3387 | {'0': {'precision': 0.9824293353705118, 'recall': 0.8077889447236181, 'f1-score': 0.8865908307480179, 'support': 1592.0}, '1': {'precision': 0.43646408839779005, 'recall': 0.9115384615384615, 'f1-score': 0.5902864259028643, 'support': 260.0}, 'accuracy': 0.822354211663067, 'macro avg': {'precision': 0.709446711884151, 'recall': 0.8596637031310398, 'f1-score': 0.7384386283254412, 'support': 1852.0}, 'weighted avg': {'precision': 0.9057819464866524, 'recall': 0.822354211663067, 'f1-score': 0.8449930201326077, 'support': 1852.0}} |
|
| 60 |
+
| No log | 6.0 | 54 | 0.3188 | {'0': {'precision': 0.9801178203240059, 'recall': 0.8360552763819096, 'f1-score': 0.9023728813559322, 'support': 1592.0}, '1': {'precision': 0.4716599190283401, 'recall': 0.8961538461538462, 'f1-score': 0.6180371352785146, 'support': 260.0}, 'accuracy': 0.8444924406047516, 'macro avg': {'precision': 0.7258888696761729, 'recall': 0.8661045612678779, 'f1-score': 0.7602050083172234, 'support': 1852.0}, 'weighted avg': {'precision': 0.9087360415243984, 'recall': 0.8444924406047516, 'f1-score': 0.8624553360102903, 'support': 1852.0}} |
|
| 61 |
+
| No log | 7.0 | 63 | 0.3201 | {'0': {'precision': 0.9748778785764132, 'recall': 0.8775125628140703, 'f1-score': 0.9236363636363636, 'support': 1592.0}, '1': {'precision': 0.5346062052505967, 'recall': 0.8615384615384616, 'f1-score': 0.6597938144329897, 'support': 260.0}, 'accuracy': 0.8752699784017278, 'macro avg': {'precision': 0.7547420419135049, 'recall': 0.869525512176266, 'f1-score': 0.7917150890346767, 'support': 1852.0}, 'weighted avg': {'precision': 0.9130686803773245, 'recall': 0.8752699784017278, 'f1-score': 0.8865958329706631, 'support': 1852.0}} |
|
| 62 |
+
| No log | 8.0 | 72 | 0.5406 | {'0': {'precision': 0.9192692987625221, 'recall': 0.9798994974874372, 'f1-score': 0.9486166007905138, 'support': 1592.0}, '1': {'precision': 0.7935483870967742, 'recall': 0.47307692307692306, 'f1-score': 0.5927710843373494, 'support': 260.0}, 'accuracy': 0.9087473002159827, 'macro avg': {'precision': 0.8564088429296481, 'recall': 0.7264882102821801, 'f1-score': 0.7706938425639316, 'support': 1852.0}, 'weighted avg': {'precision': 0.9016194947489721, 'recall': 0.9087473002159827, 'f1-score': 0.8986598868176074, 'support': 1852.0}} |
|
| 63 |
+
| No log | 9.0 | 81 | 0.3013 | {'0': {'precision': 0.9760056457304164, 'recall': 0.8687185929648241, 'f1-score': 0.9192422731804586, 'support': 1592.0}, '1': {'precision': 0.5195402298850574, 'recall': 0.8692307692307693, 'f1-score': 0.6503597122302158, 'support': 260.0}, 'accuracy': 0.8687904967602592, 'macro avg': {'precision': 0.7477729378077369, 'recall': 0.8689746810977967, 'f1-score': 0.7848009927053372, 'support': 1852.0}, 'weighted avg': {'precision': 0.9119230279551499, 'recall': 0.8687904967602592, 'f1-score': 0.8814941814703814, 'support': 1852.0}} |
|
| 64 |
+
| No log | 10.0 | 90 | 0.5003 | {'0': {'precision': 0.9357964869775893, 'recall': 0.9704773869346733, 'f1-score': 0.9528214616096207, 'support': 1592.0}, '1': {'precision': 0.7661691542288557, 'recall': 0.5923076923076923, 'f1-score': 0.6681127982646421, 'support': 260.0}, 'accuracy': 0.9173866090712743, 'macro avg': {'precision': 0.8509828206032225, 'recall': 0.7813925396211828, 'f1-score': 0.8104671299371313, 'support': 1852.0}, 'weighted avg': {'precision': 0.9119827145614603, 'recall': 0.9173866090712743, 'f1-score': 0.9128515628678849, 'support': 1852.0}} |
|
| 65 |
+
| No log | 11.0 | 99 | 0.3786 | {'0': {'precision': 0.9515418502202643, 'recall': 0.949748743718593, 'f1-score': 0.9506444514303678, 'support': 1592.0}, '1': {'precision': 0.6958174904942965, 'recall': 0.7038461538461539, 'f1-score': 0.6998087954110899, 'support': 260.0}, 'accuracy': 0.9152267818574514, 'macro avg': {'precision': 0.8236796703572804, 'recall': 0.8267974487823735, 'f1-score': 0.8252266234207288, 'support': 1852.0}, 'weighted avg': {'precision': 0.9156410221809815, 'recall': 0.9152267818574514, 'f1-score': 0.9154299424859768, 'support': 1852.0}} |
|
| 66 |
+
| No log | 12.0 | 108 | 0.3033 | {'0': {'precision': 0.9724896836313618, 'recall': 0.8881909547738693, 'f1-score': 0.9284307288246881, 'support': 1592.0}, '1': {'precision': 0.5527638190954773, 'recall': 0.8461538461538461, 'f1-score': 0.668693009118541, 'support': 260.0}, 'accuracy': 0.8822894168466523, 'macro avg': {'precision': 0.7626267513634195, 'recall': 0.8671724004638577, 'f1-score': 0.7985618689716145, 'support': 1852.0}, 'weighted avg': {'precision': 0.9135648862343154, 'recall': 0.8822894168466523, 'f1-score': 0.8919664701186415, 'support': 1852.0}} |
|
| 67 |
+
| No log | 13.0 | 117 | 0.2893 | {'0': {'precision': 0.9823399558498896, 'recall': 0.8385678391959799, 'f1-score': 0.904778041341918, 'support': 1592.0}, '1': {'precision': 0.4787018255578093, 'recall': 0.9076923076923077, 'f1-score': 0.6268260292164675, 'support': 260.0}, 'accuracy': 0.8482721382289417, 'macro avg': {'precision': 0.7305208907038495, 'recall': 0.8731300734441438, 'f1-score': 0.7658020352791928, 'support': 1852.0}, 'weighted avg': {'precision': 0.9116348187678481, 'recall': 0.8482721382289417, 'f1-score': 0.8657567005467682, 'support': 1852.0}} |
|
| 68 |
+
| No log | 14.0 | 126 | 0.2962 | {'0': {'precision': 0.9817117776152158, 'recall': 0.842964824120603, 'f1-score': 0.9070631970260223, 'support': 1592.0}, '1': {'precision': 0.4845360824742268, 'recall': 0.9038461538461539, 'f1-score': 0.6308724832214765, 'support': 260.0}, 'accuracy': 0.8515118790496761, 'macro avg': {'precision': 0.7331239300447213, 'recall': 0.8734054889833784, 'f1-score': 0.7689678401237494, 'support': 1852.0}, 'weighted avg': {'precision': 0.9119138938481224, 'recall': 0.8515118790496761, 'f1-score': 0.8682891227338074, 'support': 1852.0}} |
|
| 69 |
+
| No log | 15.0 | 135 | 0.2999 | {'0': {'precision': 0.9724517906336089, 'recall': 0.8869346733668342, 'f1-score': 0.9277266754270697, 'support': 1592.0}, '1': {'precision': 0.55, 'recall': 0.8461538461538461, 'f1-score': 0.6666666666666666, 'support': 260.0}, 'accuracy': 0.8812095032397408, 'macro avg': {'precision': 0.7612258953168045, 'recall': 0.8665442597603401, 'f1-score': 0.7971966710468681, 'support': 1852.0}, 'weighted avg': {'precision': 0.9131443038275947, 'recall': 0.8812095032397408, 'f1-score': 0.8910767821885681, 'support': 1852.0}} |
|
| 70 |
+
| No log | 16.0 | 144 | 0.3150 | {'0': {'precision': 0.9717241379310345, 'recall': 0.8850502512562815, 'f1-score': 0.9263642340565418, 'support': 1592.0}, '1': {'precision': 0.5447761194029851, 'recall': 0.8423076923076923, 'f1-score': 0.6616314199395771, 'support': 260.0}, 'accuracy': 0.8790496760259179, 'macro avg': {'precision': 0.7582501286670098, 'recall': 0.8636789717819868, 'f1-score': 0.7939978269980594, 'support': 1852.0}, 'weighted avg': {'precision': 0.9117854312262327, 'recall': 0.8790496760259179, 'f1-score': 0.8891987201956287, 'support': 1852.0}} |
|
| 71 |
+
| No log | 17.0 | 153 | 0.3029 | {'0': {'precision': 0.9771265189421015, 'recall': 0.8586683417085427, 'f1-score': 0.9140755600133734, 'support': 1592.0}, '1': {'precision': 0.5033112582781457, 'recall': 0.8769230769230769, 'f1-score': 0.6395511921458625, 'support': 260.0}, 'accuracy': 0.8612311015118791, 'macro avg': {'precision': 0.7402188886101235, 'recall': 0.8677957093158097, 'f1-score': 0.776813376079618, 'support': 1852.0}, 'weighted avg': {'precision': 0.9106081778121724, 'recall': 0.8612311015118791, 'f1-score': 0.8755354219758179, 'support': 1852.0}} |
|
| 72 |
+
| No log | 18.0 | 162 | 0.4483 | {'0': {'precision': 0.9490049751243781, 'recall': 0.9585427135678392, 'f1-score': 0.95375, 'support': 1592.0}, '1': {'precision': 0.7295081967213115, 'recall': 0.6846153846153846, 'f1-score': 0.7063492063492064, 'support': 260.0}, 'accuracy': 0.9200863930885529, 'macro avg': {'precision': 0.8392565859228448, 'recall': 0.821579049091612, 'f1-score': 0.8300496031746032, 'support': 1852.0}, 'weighted avg': {'precision': 0.9181900926271873, 'recall': 0.9200863930885529, 'f1-score': 0.9190177071548562, 'support': 1852.0}} |
|
| 73 |
+
| No log | 19.0 | 171 | 0.3250 | {'0': {'precision': 0.9701492537313433, 'recall': 0.8982412060301508, 'f1-score': 0.9328114807566862, 'support': 1592.0}, '1': {'precision': 0.5714285714285714, 'recall': 0.8307692307692308, 'f1-score': 0.677115987460815, 'support': 260.0}, 'accuracy': 0.8887688984881209, 'macro avg': {'precision': 0.7707889125799574, 'recall': 0.8645052183996909, 'f1-score': 0.8049637341087506, 'support': 1852.0}, 'weighted avg': {'precision': 0.9141733480084919, 'recall': 0.8887688984881209, 'f1-score': 0.8969147052399872, 'support': 1852.0}} |
|
| 74 |
+
| No log | 20.0 | 180 | 0.3295 | {'0': {'precision': 0.9803063457330415, 'recall': 0.8442211055276382, 'f1-score': 0.9071886601417483, 'support': 1592.0}, '1': {'precision': 0.48440748440748443, 'recall': 0.8961538461538462, 'f1-score': 0.6288798920377868, 'support': 260.0}, 'accuracy': 0.8515118790496761, 'macro avg': {'precision': 0.732356915070263, 'recall': 0.8701874758407422, 'f1-score': 0.7680342760897676, 'support': 1852.0}, 'weighted avg': {'precision': 0.9106877150933845, 'recall': 0.8515118790496761, 'f1-score': 0.8681172348139783, 'support': 1852.0}} |
|
| 75 |
+
| No log | 21.0 | 189 | 0.4537 | {'0': {'precision': 0.9506558400999375, 'recall': 0.9560301507537688, 'f1-score': 0.9533354212339492, 'support': 1592.0}, '1': {'precision': 0.7211155378486056, 'recall': 0.6961538461538461, 'f1-score': 0.7084148727984344, 'support': 260.0}, 'accuracy': 0.9195464362850972, 'macro avg': {'precision': 0.8358856889742716, 'recall': 0.8260919984538075, 'f1-score': 0.8308751470161918, 'support': 1852.0}, 'weighted avg': {'precision': 0.9184309596542862, 'recall': 0.9195464362850972, 'f1-score': 0.9189513269611447, 'support': 1852.0}} |
|
| 76 |
+
| No log | 22.0 | 198 | 0.5067 | {'0': {'precision': 0.9474335188620903, 'recall': 0.9623115577889447, 'f1-score': 0.954814583982549, 'support': 1592.0}, '1': {'precision': 0.7446808510638298, 'recall': 0.6730769230769231, 'f1-score': 0.7070707070707071, 'support': 260.0}, 'accuracy': 0.92170626349892, 'macro avg': {'precision': 0.84605718496296, 'recall': 0.8176942404329339, 'f1-score': 0.830942645526628, 'support': 1852.0}, 'weighted avg': {'precision': 0.9189693214390083, 'recall': 0.92170626349892, 'f1-score': 0.920034126100757, 'support': 1852.0}} |
|
| 77 |
+
| No log | 23.0 | 207 | 0.5322 | {'0': {'precision': 0.9415347137637028, 'recall': 0.9711055276381909, 'f1-score': 0.956091527520099, 'support': 1592.0}, '1': {'precision': 0.780952380952381, 'recall': 0.6307692307692307, 'f1-score': 0.6978723404255319, 'support': 260.0}, 'accuracy': 0.9233261339092873, 'macro avg': {'precision': 0.8612435473580419, 'recall': 0.8009373792037109, 'f1-score': 0.8269819339728155, 'support': 1852.0}, 'weighted avg': {'precision': 0.918990757753474, 'recall': 0.9233261339092873, 'f1-score': 0.9198404537379242, 'support': 1852.0}} |
|
| 78 |
+
| No log | 24.0 | 216 | 0.3741 | {'0': {'precision': 0.9659239842726082, 'recall': 0.9258793969849246, 'f1-score': 0.9454778704297627, 'support': 1592.0}, '1': {'precision': 0.6380368098159509, 'recall': 0.8, 'f1-score': 0.7098976109215017, 'support': 260.0}, 'accuracy': 0.908207343412527, 'macro avg': {'precision': 0.8019803970442796, 'recall': 0.8629396984924623, 'f1-score': 0.8276877406756322, 'support': 1852.0}, 'weighted avg': {'precision': 0.9198923075130342, 'recall': 0.908207343412527, 'f1-score': 0.9124050478206116, 'support': 1852.0}} |
|
| 79 |
+
| No log | 25.0 | 225 | 0.3313 | {'0': {'precision': 0.9733424470266575, 'recall': 0.8944723618090452, 'f1-score': 0.9322422258592471, 'support': 1592.0}, '1': {'precision': 0.5681233933161953, 'recall': 0.85, 'f1-score': 0.6810477657935285, 'support': 260.0}, 'accuracy': 0.8882289416846653, 'macro avg': {'precision': 0.7707329201714264, 'recall': 0.8722361809045226, 'f1-score': 0.8066449958263878, 'support': 1852.0}, 'weighted avg': {'precision': 0.9164542429420354, 'recall': 0.8882289416846653, 'f1-score': 0.8969773448565004, 'support': 1852.0}} |
|
| 80 |
+
| No log | 26.0 | 234 | 0.3321 | {'0': {'precision': 0.9748953974895398, 'recall': 0.878140703517588, 'f1-score': 0.9239920687376074, 'support': 1592.0}, '1': {'precision': 0.5358851674641149, 'recall': 0.8615384615384616, 'f1-score': 0.6607669616519174, 'support': 260.0}, 'accuracy': 0.8758099352051836, 'macro avg': {'precision': 0.7553902824768273, 'recall': 0.8698395825280247, 'f1-score': 0.7923795151947624, 'support': 1852.0}, 'weighted avg': {'precision': 0.9132632917624284, 'recall': 0.8758099352051836, 'f1-score': 0.8870382200106747, 'support': 1852.0}} |
|
| 81 |
+
| No log | 27.0 | 243 | 0.5558 | {'0': {'precision': 0.9436274509803921, 'recall': 0.9673366834170855, 'f1-score': 0.9553349875930521, 'support': 1592.0}, '1': {'precision': 0.7636363636363637, 'recall': 0.6461538461538462, 'f1-score': 0.7, 'support': 260.0}, 'accuracy': 0.9222462203023758, 'macro avg': {'precision': 0.8536319073083779, 'recall': 0.8067452647854658, 'f1-score': 0.827667493796526, 'support': 1852.0}, 'weighted avg': {'precision': 0.9183587238154637, 'recall': 0.9222462203023758, 'f1-score': 0.9194888230281528, 'support': 1852.0}} |
|
| 82 |
+
| No log | 28.0 | 252 | 0.3644 | {'0': {'precision': 0.9720708446866485, 'recall': 0.8963567839195979, 'f1-score': 0.9326797385620915, 'support': 1592.0}, '1': {'precision': 0.5703125, 'recall': 0.8423076923076923, 'f1-score': 0.6801242236024845, 'support': 260.0}, 'accuracy': 0.8887688984881209, 'macro avg': {'precision': 0.7711916723433243, 'recall': 0.8693322381136451, 'f1-score': 0.806401981082288, 'support': 1852.0}, 'weighted avg': {'precision': 0.9156684852813954, 'recall': 0.8887688984881209, 'f1-score': 0.8972237807383886, 'support': 1852.0}} |
|
| 83 |
+
| No log | 29.0 | 261 | 0.5193 | {'0': {'precision': 0.9546884833228445, 'recall': 0.9528894472361809, 'f1-score': 0.9537881169443572, 'support': 1592.0}, '1': {'precision': 0.714828897338403, 'recall': 0.7230769230769231, 'f1-score': 0.7189292543021033, 'support': 260.0}, 'accuracy': 0.9206263498920086, 'macro avg': {'precision': 0.8347586903306238, 'recall': 0.837983185156552, 'f1-score': 0.8363586856232302, 'support': 1852.0}, 'weighted avg': {'precision': 0.9210148913379876, 'recall': 0.9206263498920086, 'f1-score': 0.920816570353112, 'support': 1852.0}} |
|
| 84 |
+
| No log | 30.0 | 270 | 0.4313 | {'0': {'precision': 0.9656084656084656, 'recall': 0.9170854271356784, 'f1-score': 0.9407216494845361, 'support': 1592.0}, '1': {'precision': 0.611764705882353, 'recall': 0.8, 'f1-score': 0.6933333333333334, 'support': 260.0}, 'accuracy': 0.9006479481641468, 'macro avg': {'precision': 0.7886865857454093, 'recall': 0.8585427135678392, 'f1-score': 0.8170274914089347, 'support': 1852.0}, 'weighted avg': {'precision': 0.9159327757981042, 'recall': 0.9006479481641468, 'f1-score': 0.9059911083401987, 'support': 1852.0}} |
|
| 85 |
+
| No log | 31.0 | 279 | 0.5175 | {'0': {'precision': 0.9576923076923077, 'recall': 0.9384422110552764, 'f1-score': 0.9479695431472082, 'support': 1592.0}, '1': {'precision': 0.6643835616438356, 'recall': 0.7461538461538462, 'f1-score': 0.7028985507246377, 'support': 260.0}, 'accuracy': 0.9114470842332614, 'macro avg': {'precision': 0.8110379346680716, 'recall': 0.8422980286045613, 'f1-score': 0.8254340469359229, 'support': 1852.0}, 'weighted avg': {'precision': 0.9165150539274034, 'recall': 0.9114470842332614, 'f1-score': 0.9135643282282727, 'support': 1852.0}} |
|
| 86 |
+
| No log | 32.0 | 288 | 0.5112 | {'0': {'precision': 0.9575835475578406, 'recall': 0.9359296482412061, 'f1-score': 0.9466327827191868, 'support': 1592.0}, '1': {'precision': 0.6554054054054054, 'recall': 0.7461538461538462, 'f1-score': 0.697841726618705, 'support': 260.0}, 'accuracy': 0.9092872570194385, 'macro avg': {'precision': 0.806494476481623, 'recall': 0.8410417471975261, 'f1-score': 0.8222372546689459, 'support': 1852.0}, 'weighted avg': {'precision': 0.9151611301930278, 'recall': 0.9092872570194385, 'f1-score': 0.9117053126402856, 'support': 1852.0}} |
|
| 87 |
+
| No log | 33.0 | 297 | 0.7361 | {'0': {'precision': 0.9418604651162791, 'recall': 0.9667085427135679, 'f1-score': 0.9541227526348419, 'support': 1592.0}, '1': {'precision': 0.7568807339449541, 'recall': 0.6346153846153846, 'f1-score': 0.6903765690376569, 'support': 260.0}, 'accuracy': 0.9200863930885529, 'macro avg': {'precision': 0.8493705995306167, 'recall': 0.8006619636644763, 'f1-score': 0.8222496608362494, 'support': 1852.0}, 'weighted avg': {'precision': 0.9158913883859635, 'recall': 0.9200863930885529, 'f1-score': 0.9170957506179584, 'support': 1852.0}} |
|
| 88 |
+
| No log | 34.0 | 306 | 0.5370 | {'0': {'precision': 0.9603638726445743, 'recall': 0.928391959798995, 'f1-score': 0.9441073139572022, 'support': 1592.0}, '1': {'precision': 0.6357827476038339, 'recall': 0.7653846153846153, 'f1-score': 0.6945898778359512, 'support': 260.0}, 'accuracy': 0.9055075593952484, 'macro avg': {'precision': 0.7980733101242041, 'recall': 0.8468882875918051, 'f1-score': 0.8193485958965767, 'support': 1852.0}, 'weighted avg': {'precision': 0.9147963280924186, 'recall': 0.9055075593952484, 'f1-score': 0.9090778682814326, 'support': 1852.0}} |
|
| 89 |
+
| No log | 35.0 | 315 | 0.6283 | {'0': {'precision': 0.9570164348925411, 'recall': 0.9510050251256281, 'f1-score': 0.9540012602394455, 'support': 1592.0}, '1': {'precision': 0.7111111111111111, 'recall': 0.7384615384615385, 'f1-score': 0.7245283018867924, 'support': 260.0}, 'accuracy': 0.9211663066954644, 'macro avg': {'precision': 0.8340637730018261, 'recall': 0.8447332817935833, 'f1-score': 0.839264781063119, 'support': 1852.0}, 'weighted avg': {'precision': 0.922494089221282, 'recall': 0.9211663066954644, 'f1-score': 0.9217858341208225, 'support': 1852.0}} |
|
| 90 |
+
| No log | 36.0 | 324 | 0.8542 | {'0': {'precision': 0.940925700365408, 'recall': 0.9704773869346733, 'f1-score': 0.9554730983302412, 'support': 1592.0}, '1': {'precision': 0.7761904761904762, 'recall': 0.6269230769230769, 'f1-score': 0.6936170212765957, 'support': 260.0}, 'accuracy': 0.9222462203023758, 'macro avg': {'precision': 0.8585580882779421, 'recall': 0.7987002319288752, 'f1-score': 0.8245450598034185, 'support': 1852.0}, 'weighted avg': {'precision': 0.9177987250492728, 'recall': 0.9222462203023758, 'f1-score': 0.9187114460440923, 'support': 1852.0}} |
|
| 91 |
+
| No log | 37.0 | 333 | 0.5817 | {'0': {'precision': 0.961139896373057, 'recall': 0.9321608040201005, 'f1-score': 0.9464285714285714, 'support': 1592.0}, '1': {'precision': 0.6493506493506493, 'recall': 0.7692307692307693, 'f1-score': 0.704225352112676, 'support': 260.0}, 'accuracy': 0.9092872570194385, 'macro avg': {'precision': 0.8052452728618531, 'recall': 0.8506957866254349, 'f1-score': 0.8253269617706237, 'support': 1852.0}, 'weighted avg': {'precision': 0.9173681878277945, 'recall': 0.9092872570194385, 'f1-score': 0.9124259596455623, 'support': 1852.0}} |
|
| 92 |
+
| No log | 38.0 | 342 | 0.7546 | {'0': {'precision': 0.9490683229813665, 'recall': 0.9597989949748744, 'f1-score': 0.9544034978138664, 'support': 1592.0}, '1': {'precision': 0.7355371900826446, 'recall': 0.6846153846153846, 'f1-score': 0.7091633466135459, 'support': 260.0}, 'accuracy': 0.9211663066954644, 'macro avg': {'precision': 0.8423027565320056, 'recall': 0.8222071897951295, 'f1-score': 0.8317834222137062, 'support': 1852.0}, 'weighted avg': {'precision': 0.9190909501122155, 'recall': 0.9211663066954644, 'f1-score': 0.9199745349023744, 'support': 1852.0}} |
|
| 93 |
+
| No log | 39.0 | 351 | 0.7266 | {'0': {'precision': 0.9545741324921135, 'recall': 0.9503768844221105, 'f1-score': 0.952470884482216, 'support': 1592.0}, '1': {'precision': 0.704119850187266, 'recall': 0.7230769230769231, 'f1-score': 0.713472485768501, 'support': 260.0}, 'accuracy': 0.9184665226781857, 'macro avg': {'precision': 0.8293469913396898, 'recall': 0.8367269037495169, 'f1-score': 0.8329716851253585, 'support': 1852.0}, 'weighted avg': {'precision': 0.9194131641339816, 'recall': 0.9184665226781857, 'f1-score': 0.9189181935180875, 'support': 1852.0}} |
|
| 94 |
+
| No log | 40.0 | 360 | 0.8533 | {'0': {'precision': 0.9451632778804683, 'recall': 0.9635678391959799, 'f1-score': 0.9542768273716952, 'support': 1592.0}, '1': {'precision': 0.7467248908296943, 'recall': 0.6576923076923077, 'f1-score': 0.6993865030674846, 'support': 260.0}, 'accuracy': 0.9206263498920086, 'macro avg': {'precision': 0.8459440843550813, 'recall': 0.8106300734441438, 'f1-score': 0.82683166521959, 'support': 1852.0}, 'weighted avg': {'precision': 0.9173047570202085, 'recall': 0.9206263498920086, 'f1-score': 0.9184930885384907, 'support': 1852.0}} |
|
| 95 |
+
| No log | 41.0 | 369 | 0.7207 | {'0': {'precision': 0.9553571428571429, 'recall': 0.9409547738693468, 'f1-score': 0.9481012658227848, 'support': 1592.0}, '1': {'precision': 0.6690140845070423, 'recall': 0.7307692307692307, 'f1-score': 0.6985294117647058, 'support': 260.0}, 'accuracy': 0.9114470842332614, 'macro avg': {'precision': 0.8121856136820926, 'recall': 0.8358620023192888, 'f1-score': 0.8233153387937453, 'support': 1852.0}, 'weighted avg': {'precision': 0.9151577934127444, 'recall': 0.9114470842332614, 'f1-score': 0.9130641804798578, 'support': 1852.0}} |
|
| 96 |
+
| No log | 42.0 | 378 | 0.8507 | {'0': {'precision': 0.9478260869565217, 'recall': 0.9585427135678392, 'f1-score': 0.9531542785758901, 'support': 1592.0}, '1': {'precision': 0.7272727272727273, 'recall': 0.676923076923077, 'f1-score': 0.701195219123506, 'support': 260.0}, 'accuracy': 0.9190064794816415, 'macro avg': {'precision': 0.8375494071146246, 'recall': 0.8177328952454581, 'f1-score': 0.8271747488496981, 'support': 1852.0}, 'weighted avg': {'precision': 0.9168628723140884, 'recall': 0.9190064794816415, 'f1-score': 0.917782056406549, 'support': 1852.0}} |
|
| 97 |
+
| No log | 43.0 | 387 | 0.8223 | {'0': {'precision': 0.9534591194968554, 'recall': 0.9522613065326633, 'f1-score': 0.9528598365807668, 'support': 1592.0}, '1': {'precision': 0.7099236641221374, 'recall': 0.7153846153846154, 'f1-score': 0.7126436781609196, 'support': 260.0}, 'accuracy': 0.9190064794816415, 'macro avg': {'precision': 0.8316913918094964, 'recall': 0.8338229609586394, 'f1-score': 0.8327517573708432, 'support': 1852.0}, 'weighted avg': {'precision': 0.919269476733666, 'recall': 0.9190064794816415, 'f1-score': 0.9191361858306802, 'support': 1852.0}} |
|
| 98 |
+
| No log | 44.0 | 396 | 0.8322 | {'0': {'precision': 0.9528301886792453, 'recall': 0.9516331658291457, 'f1-score': 0.9522313010685104, 'support': 1592.0}, '1': {'precision': 0.7061068702290076, 'recall': 0.7115384615384616, 'f1-score': 0.7088122605363985, 'support': 260.0}, 'accuracy': 0.91792656587473, 'macro avg': {'precision': 0.8294685294541264, 'recall': 0.8315858136838037, 'f1-score': 0.8305217808024544, 'support': 1852.0}, 'weighted avg': {'precision': 0.9181930057434667, 'recall': 0.91792656587473, 'f1-score': 0.9180580016417561, 'support': 1852.0}} |
|
| 99 |
+
| No log | 45.0 | 405 | 0.8810 | {'0': {'precision': 0.9495327102803738, 'recall': 0.957286432160804, 'f1-score': 0.9533938066937754, 'support': 1592.0}, '1': {'precision': 0.7246963562753036, 'recall': 0.6884615384615385, 'f1-score': 0.7061143984220908, 'support': 260.0}, 'accuracy': 0.9195464362850972, 'macro avg': {'precision': 0.8371145332778387, 'recall': 0.8228739853111713, 'f1-score': 0.8297541025579331, 'support': 1852.0}, 'weighted avg': {'precision': 0.9179682113379773, 'recall': 0.9195464362850972, 'f1-score': 0.918678554992567, 'support': 1852.0}} |
|
| 100 |
+
| No log | 46.0 | 414 | 0.9026 | {'0': {'precision': 0.9483509645301804, 'recall': 0.957286432160804, 'f1-score': 0.9527977492966552, 'support': 1592.0}, '1': {'precision': 0.7224489795918367, 'recall': 0.6807692307692308, 'f1-score': 0.700990099009901, 'support': 260.0}, 'accuracy': 0.9184665226781857, 'macro avg': {'precision': 0.8353999720610086, 'recall': 0.8190278314650175, 'f1-score': 0.826893924153278, 'support': 1852.0}, 'weighted avg': {'precision': 0.9166368629729617, 'recall': 0.9184665226781857, 'f1-score': 0.9174467832736768, 'support': 1852.0}} |
|
| 101 |
+
| No log | 47.0 | 423 | 0.9279 | {'0': {'precision': 0.9461633663366337, 'recall': 0.960427135678392, 'f1-score': 0.9532418952618454, 'support': 1592.0}, '1': {'precision': 0.7330508474576272, 'recall': 0.6653846153846154, 'f1-score': 0.6975806451612904, 'support': 260.0}, 'accuracy': 0.9190064794816415, 'macro avg': {'precision': 0.8396071068971305, 'recall': 0.8129058755315037, 'f1-score': 0.8254112702115679, 'support': 1852.0}, 'weighted avg': {'precision': 0.9162447621743541, 'recall': 0.9190064794816415, 'f1-score': 0.91734992710518, 'support': 1852.0}} |
|
| 102 |
+
| No log | 48.0 | 432 | 0.8759 | {'0': {'precision': 0.9522613065326633, 'recall': 0.9522613065326633, 'f1-score': 0.9522613065326633, 'support': 1592.0}, '1': {'precision': 0.7076923076923077, 'recall': 0.7076923076923077, 'f1-score': 0.7076923076923077, 'support': 260.0}, 'accuracy': 0.91792656587473, 'macro avg': {'precision': 0.8299768071124856, 'recall': 0.8299768071124856, 'f1-score': 0.8299768071124856, 'support': 1852.0}, 'weighted avg': {'precision': 0.91792656587473, 'recall': 0.91792656587473, 'f1-score': 0.91792656587473, 'support': 1852.0}} |
|
| 103 |
+
| No log | 49.0 | 441 | 0.9424 | {'0': {'precision': 0.9461633663366337, 'recall': 0.960427135678392, 'f1-score': 0.9532418952618454, 'support': 1592.0}, '1': {'precision': 0.7330508474576272, 'recall': 0.6653846153846154, 'f1-score': 0.6975806451612904, 'support': 260.0}, 'accuracy': 0.9190064794816415, 'macro avg': {'precision': 0.8396071068971305, 'recall': 0.8129058755315037, 'f1-score': 0.8254112702115679, 'support': 1852.0}, 'weighted avg': {'precision': 0.9162447621743541, 'recall': 0.9190064794816415, 'f1-score': 0.91734992710518, 'support': 1852.0}} |
|
| 104 |
+
| No log | 50.0 | 450 | 0.9073 | {'0': {'precision': 0.9506866416978776, 'recall': 0.9566582914572864, 'f1-score': 0.9536631183469004, 'support': 1592.0}, '1': {'precision': 0.724, 'recall': 0.6961538461538461, 'f1-score': 0.7098039215686275, 'support': 260.0}, 'accuracy': 0.9200863930885529, 'macro avg': {'precision': 0.8373433208489388, 'recall': 0.8264060688055663, 'f1-score': 0.831733519957764, 'support': 1852.0}, 'weighted avg': {'precision': 0.91886238314418, 'recall': 0.9200863930885529, 'f1-score': 0.9194280259266244, 'support': 1852.0}} |
|
| 105 |
|
| 106 |
|
| 107 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 1583351632
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a5ab6086c2892defaedd9f491d70efdc9b040668f6386ad3ff462499da2c978b
|
| 3 |
size 1583351632
|