File size: 32,504 Bytes
23a792e
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
4b27eb4
 
23a792e
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
e548952
be49f9f
 
4b27eb4
aaa64e1
 
be49f9f
 
23a792e
 
be49f9f
23a792e
 
 
4b27eb4
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
23a792e
 
 
 
aaa64e1
23a792e
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
---
library_name: transformers
license: apache-2.0
base_model: answerdotai/ModernBERT-large
tags:
- generated_from_trainer
model-index:
- name: binary_paragraph
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# binary_paragraph

This model is a fine-tuned version of [answerdotai/ModernBERT-large](https://huggingface.co/answerdotai/ModernBERT-large) on an unknown dataset.
It achieves the following results on the evaluation set:
- Loss: 0.4963
- Classification Report: {'0': {'precision': 0.9531153608883405, 'recall': 0.9704773869346733, 'f1-score': 0.96171802054155, 'support': 1592.0}, '1': {'precision': 0.7965367965367965, 'recall': 0.7076923076923077, 'f1-score': 0.7494908350305499, 'support': 260.0}, 'accuracy': 0.933585313174946, 'macro avg': {'precision': 0.8748260787125686, 'recall': 0.8390848473134905, 'f1-score': 0.85560442778605, 'support': 1852.0}, 'weighted avg': {'precision': 0.9311334890031345, 'recall': 0.933585313174946, 'f1-score': 0.9319237072408696, 'support': 1852.0}}

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 5e-06
- train_batch_size: 16
- eval_batch_size: 16
- seed: 42
- distributed_type: multi-GPU
- num_devices: 4
- total_train_batch_size: 64
- total_eval_batch_size: 64
- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
- lr_scheduler_type: linear
- num_epochs: 50

### Training results

| Training Loss | Epoch | Step | Validation Loss | Classification Report                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
|:-------------:|:-----:|:----:|:---------------:|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------:|
| No log        | 1.0   | 98   | 0.2339          | {'0': {'precision': 0.9068287037037037, 'recall': 0.9842964824120602, 'f1-score': 0.9439759036144578, 'support': 1592.0}, '1': {'precision': 0.7983870967741935, 'recall': 0.38076923076923075, 'f1-score': 0.515625, 'support': 260.0}, 'accuracy': 0.8995680345572354, 'macro avg': {'precision': 0.8526079002389486, 'recall': 0.6825328565906454, 'f1-score': 0.729800451807229, 'support': 1852.0}, 'weighted avg': {'precision': 0.891604720009496, 'recall': 0.8995680345572354, 'f1-score': 0.8838402475994692, 'support': 1852.0}}            |
| No log        | 2.0   | 196  | 0.1974          | {'0': {'precision': 0.9299703264094955, 'recall': 0.9842964824120602, 'f1-score': 0.9563625267012511, 'support': 1592.0}, '1': {'precision': 0.8502994011976048, 'recall': 0.5461538461538461, 'f1-score': 0.6651053864168618, 'support': 260.0}, 'accuracy': 0.9227861771058316, 'macro avg': {'precision': 0.8901348638035502, 'recall': 0.7652251642829532, 'f1-score': 0.8107339565590564, 'support': 1852.0}, 'weighted avg': {'precision': 0.9187854233019946, 'recall': 0.9227861771058316, 'f1-score': 0.9154732953438315, 'support': 1852.0}} |
| No log        | 3.0   | 294  | 0.1808          | {'0': {'precision': 0.9491945477075588, 'recall': 0.9623115577889447, 'f1-score': 0.9557080474111042, 'support': 1592.0}, '1': {'precision': 0.7478991596638656, 'recall': 0.6846153846153846, 'f1-score': 0.714859437751004, 'support': 260.0}, 'accuracy': 0.9233261339092873, 'macro avg': {'precision': 0.8485468536857121, 'recall': 0.8234634712021647, 'f1-score': 0.8352837425810541, 'support': 1852.0}, 'weighted avg': {'precision': 0.9209349359951613, 'recall': 0.9233261339092873, 'f1-score': 0.9218956076100101, 'support': 1852.0}}  |
| No log        | 4.0   | 392  | 0.1874          | {'0': {'precision': 0.9276470588235294, 'recall': 0.9905778894472361, 'f1-score': 0.9580801944106926, 'support': 1592.0}, '1': {'precision': 0.9013157894736842, 'recall': 0.5269230769230769, 'f1-score': 0.6650485436893204, 'support': 260.0}, 'accuracy': 0.9254859611231101, 'macro avg': {'precision': 0.9144814241486068, 'recall': 0.7587504831851566, 'f1-score': 0.8115643690500065, 'support': 1852.0}, 'weighted avg': {'precision': 0.9239504443359702, 'recall': 0.9254859611231101, 'f1-score': 0.9169418417176273, 'support': 1852.0}} |
| No log        | 5.0   | 490  | 0.2778          | {'0': {'precision': 0.98125, 'recall': 0.8875628140703518, 'f1-score': 0.9320580474934037, 'support': 1592.0}, '1': {'precision': 0.5655339805825242, 'recall': 0.8961538461538462, 'f1-score': 0.6934523809523809, 'support': 260.0}, 'accuracy': 0.8887688984881209, 'macro avg': {'precision': 0.773391990291262, 'recall': 0.891858330112099, 'f1-score': 0.8127552142228923, 'support': 1852.0}, 'weighted avg': {'precision': 0.9228881398226005, 'recall': 0.8887688984881209, 'f1-score': 0.8985604917155063, 'support': 1852.0}}              |
| 0.2088        | 6.0   | 588  | 0.1884          | {'0': {'precision': 0.964308476736775, 'recall': 0.9503768844221105, 'f1-score': 0.957291996203733, 'support': 1592.0}, '1': {'precision': 0.7208480565371025, 'recall': 0.7846153846153846, 'f1-score': 0.7513812154696132, 'support': 260.0}, 'accuracy': 0.9271058315334774, 'macro avg': {'precision': 0.8425782666369388, 'recall': 0.8674961345187475, 'f1-score': 0.854336605836673, 'support': 1852.0}, 'weighted avg': {'precision': 0.9301293680694344, 'recall': 0.9271058315334774, 'f1-score': 0.9283844351935433, 'support': 1852.0}}    |
| 0.2088        | 7.0   | 686  | 0.3764          | {'0': {'precision': 0.9817287420941673, 'recall': 0.8775125628140703, 'f1-score': 0.9266998341625208, 'support': 1592.0}, '1': {'precision': 0.5454545454545454, 'recall': 0.9, 'f1-score': 0.6792452830188679, 'support': 260.0}, 'accuracy': 0.8806695464362851, 'macro avg': {'precision': 0.7635916437743564, 'recall': 0.8887562814070351, 'f1-score': 0.8029725585906944, 'support': 1852.0}, 'weighted avg': {'precision': 0.9204807447257538, 'recall': 0.8806695464362851, 'f1-score': 0.8919599943691354, 'support': 1852.0}}                |
| 0.2088        | 8.0   | 784  | 0.3698          | {'0': {'precision': 0.9257950530035336, 'recall': 0.9874371859296482, 'f1-score': 0.9556231003039514, 'support': 1592.0}, '1': {'precision': 0.8701298701298701, 'recall': 0.5153846153846153, 'f1-score': 0.6473429951690821, 'support': 260.0}, 'accuracy': 0.9211663066954644, 'macro avg': {'precision': 0.8979624615667019, 'recall': 0.7514109006571318, 'f1-score': 0.8014830477365167, 'support': 1852.0}, 'weighted avg': {'precision': 0.9179802865093909, 'recall': 0.9211663066954644, 'f1-score': 0.9123440358681706, 'support': 1852.0}} |
| 0.2088        | 9.0   | 882  | 0.3585          | {'0': {'precision': 0.9406474820143885, 'recall': 0.9855527638190955, 'f1-score': 0.9625766871165644, 'support': 1592.0}, '1': {'precision': 0.875, 'recall': 0.6192307692307693, 'f1-score': 0.7252252252252253, 'support': 260.0}, 'accuracy': 0.9341252699784017, 'macro avg': {'precision': 0.9078237410071943, 'recall': 0.8023917665249324, 'f1-score': 0.8439009561708948, 'support': 1852.0}, 'weighted avg': {'precision': 0.9314313128331029, 'recall': 0.9341252699784017, 'f1-score': 0.9292552075853829, 'support': 1852.0}}              |
| 0.2088        | 10.0  | 980  | 0.3604          | {'0': {'precision': 0.945995145631068, 'recall': 0.9792713567839196, 'f1-score': 0.9623456790123457, 'support': 1592.0}, '1': {'precision': 0.8382352941176471, 'recall': 0.6576923076923077, 'f1-score': 0.7370689655172413, 'support': 260.0}, 'accuracy': 0.9341252699784017, 'macro avg': {'precision': 0.8921152198743576, 'recall': 0.8184818322381137, 'f1-score': 0.8497073222647935, 'support': 1852.0}, 'weighted avg': {'precision': 0.930866872740415, 'recall': 0.9341252699784017, 'f1-score': 0.9307193585432706, 'support': 1852.0}}   |
| 0.0436        | 11.0  | 1078 | 0.4033          | {'0': {'precision': 0.9582814445828145, 'recall': 0.9667085427135679, 'f1-score': 0.9624765478424016, 'support': 1592.0}, '1': {'precision': 0.7845528455284553, 'recall': 0.7423076923076923, 'f1-score': 0.7628458498023716, 'support': 260.0}, 'accuracy': 0.9352051835853131, 'macro avg': {'precision': 0.871417145055635, 'recall': 0.8545081175106302, 'f1-score': 0.8626611988223866, 'support': 1852.0}, 'weighted avg': {'precision': 0.933891900439114, 'recall': 0.9352051835853131, 'f1-score': 0.9344506399102158, 'support': 1852.0}}   |
| 0.0436        | 12.0  | 1176 | 0.3949          | {'0': {'precision': 0.9597989949748744, 'recall': 0.9597989949748744, 'f1-score': 0.9597989949748744, 'support': 1592.0}, '1': {'precision': 0.7538461538461538, 'recall': 0.7538461538461538, 'f1-score': 0.7538461538461538, 'support': 260.0}, 'accuracy': 0.9308855291576674, 'macro avg': {'precision': 0.8568225744105141, 'recall': 0.8568225744105141, 'f1-score': 0.8568225744105141, 'support': 1852.0}, 'weighted avg': {'precision': 0.9308855291576674, 'recall': 0.9308855291576674, 'f1-score': 0.9308855291576674, 'support': 1852.0}} |
| 0.0436        | 13.0  | 1274 | 0.4250          | {'0': {'precision': 0.9497856705450092, 'recall': 0.9742462311557789, 'f1-score': 0.9618604651162791, 'support': 1592.0}, '1': {'precision': 0.8127853881278538, 'recall': 0.6846153846153846, 'f1-score': 0.7432150313152401, 'support': 260.0}, 'accuracy': 0.933585313174946, 'macro avg': {'precision': 0.8812855293364315, 'recall': 0.8294308078855818, 'f1-score': 0.8525377482157597, 'support': 1852.0}, 'weighted avg': {'precision': 0.9305523695577196, 'recall': 0.933585313174946, 'f1-score': 0.9311651018396754, 'support': 1852.0}}   |
| 0.0436        | 14.0  | 1372 | 0.4133          | {'0': {'precision': 0.9513846153846154, 'recall': 0.9711055276381909, 'f1-score': 0.9611439229095431, 'support': 1592.0}, '1': {'precision': 0.7973568281938326, 'recall': 0.6961538461538461, 'f1-score': 0.7433264887063655, 'support': 260.0}, 'accuracy': 0.9325053995680346, 'macro avg': {'precision': 0.874370721789224, 'recall': 0.8336296868960185, 'f1-score': 0.8522352058079543, 'support': 1852.0}, 'weighted avg': {'precision': 0.9297608439647431, 'recall': 0.9325053995680346, 'f1-score': 0.9305648014771316, 'support': 1852.0}}  |
| 0.0436        | 15.0  | 1470 | 0.4521          | {'0': {'precision': 0.9595267745952677, 'recall': 0.967964824120603, 'f1-score': 0.9637273295809882, 'support': 1592.0}, '1': {'precision': 0.7926829268292683, 'recall': 0.75, 'f1-score': 0.7707509881422925, 'support': 260.0}, 'accuracy': 0.937365010799136, 'macro avg': {'precision': 0.876104850712268, 'recall': 0.8589824120603016, 'f1-score': 0.8672391588616404, 'support': 1852.0}, 'weighted avg': {'precision': 0.9361037722091122, 'recall': 0.937365010799136, 'f1-score': 0.9366356185798754, 'support': 1852.0}}                   |
| 0.0034        | 16.0  | 1568 | 0.4702          | {'0': {'precision': 0.9539877300613497, 'recall': 0.9767587939698492, 'f1-score': 0.9652389819987586, 'support': 1592.0}, '1': {'precision': 0.8333333333333334, 'recall': 0.7115384615384616, 'f1-score': 0.7676348547717843, 'support': 260.0}, 'accuracy': 0.9395248380129589, 'macro avg': {'precision': 0.8936605316973416, 'recall': 0.8441486277541554, 'f1-score': 0.8664369183852714, 'support': 1852.0}, 'weighted avg': {'precision': 0.9370492078425138, 'recall': 0.9395248380129589, 'f1-score': 0.9374975818481035, 'support': 1852.0}} |
| 0.0034        | 17.0  | 1666 | 0.4387          | {'0': {'precision': 0.9544334975369458, 'recall': 0.9736180904522613, 'f1-score': 0.9639303482587065, 'support': 1592.0}, '1': {'precision': 0.8157894736842105, 'recall': 0.7153846153846154, 'f1-score': 0.7622950819672131, 'support': 260.0}, 'accuracy': 0.937365010799136, 'macro avg': {'precision': 0.8851114856105782, 'recall': 0.8445013529184384, 'f1-score': 0.8631127151129598, 'support': 1852.0}, 'weighted avg': {'precision': 0.9349694337131277, 'recall': 0.937365010799136, 'f1-score': 0.935623021457525, 'support': 1852.0}}    |
| 0.0034        | 18.0  | 1764 | 0.4214          | {'0': {'precision': 0.9576587795765878, 'recall': 0.9660804020100503, 'f1-score': 0.9618511569731082, 'support': 1592.0}, '1': {'precision': 0.7804878048780488, 'recall': 0.7384615384615385, 'f1-score': 0.758893280632411, 'support': 260.0}, 'accuracy': 0.9341252699784017, 'macro avg': {'precision': 0.8690732922273183, 'recall': 0.8522709702357945, 'f1-score': 0.8603722188027596, 'support': 1852.0}, 'weighted avg': {'precision': 0.9327859645541148, 'recall': 0.9341252699784017, 'f1-score': 0.9333581505753863, 'support': 1852.0}}  |
| 0.0034        | 19.0  | 1862 | 0.4786          | {'0': {'precision': 0.9513546798029556, 'recall': 0.9704773869346733, 'f1-score': 0.960820895522388, 'support': 1592.0}, '1': {'precision': 0.793859649122807, 'recall': 0.6961538461538461, 'f1-score': 0.7418032786885246, 'support': 260.0}, 'accuracy': 0.9319654427645788, 'macro avg': {'precision': 0.8726071644628813, 'recall': 0.8333156165442597, 'f1-score': 0.8513120871054562, 'support': 1852.0}, 'weighted avg': {'precision': 0.9292441463381399, 'recall': 0.9319654427645788, 'f1-score': 0.9300732819280012, 'support': 1852.0}}   |
| 0.0034        | 20.0  | 1960 | 0.4685          | {'0': {'precision': 0.9592220828105396, 'recall': 0.960427135678392, 'f1-score': 0.9598242310106717, 'support': 1592.0}, '1': {'precision': 0.7558139534883721, 'recall': 0.75, 'f1-score': 0.752895752895753, 'support': 260.0}, 'accuracy': 0.9308855291576674, 'macro avg': {'precision': 0.8575180181494558, 'recall': 0.855213567839196, 'f1-score': 0.8563599919532123, 'support': 1852.0}, 'weighted avg': {'precision': 0.9306658659510562, 'recall': 0.9308855291576674, 'f1-score': 0.9307737967180806, 'support': 1852.0}}                  |
| 0.0014        | 21.0  | 2058 | 0.4487          | {'0': {'precision': 0.956386292834891, 'recall': 0.9641959798994975, 'f1-score': 0.960275258054426, 'support': 1592.0}, '1': {'precision': 0.7692307692307693, 'recall': 0.7307692307692307, 'f1-score': 0.7495069033530573, 'support': 260.0}, 'accuracy': 0.9314254859611231, 'macro avg': {'precision': 0.8628085310328302, 'recall': 0.847482605334364, 'f1-score': 0.8548910807037416, 'support': 1852.0}, 'weighted avg': {'precision': 0.9301117592835564, 'recall': 0.9314254859611231, 'f1-score': 0.930685748215141, 'support': 1852.0}}     |
| 0.0014        | 22.0  | 2156 | 0.4918          | {'0': {'precision': 0.947560975609756, 'recall': 0.9761306532663316, 'f1-score': 0.9616336633663366, 'support': 1592.0}, '1': {'precision': 0.8207547169811321, 'recall': 0.6692307692307692, 'f1-score': 0.7372881355932204, 'support': 260.0}, 'accuracy': 0.9330453563714903, 'macro avg': {'precision': 0.884157846295444, 'recall': 0.8226807112485504, 'f1-score': 0.8494608994797785, 'support': 1852.0}, 'weighted avg': {'precision': 0.9297588010722603, 'recall': 0.9330453563714903, 'f1-score': 0.9301380709143872, 'support': 1852.0}}   |
| 0.0014        | 23.0  | 2254 | 0.4848          | {'0': {'precision': 0.9486866218692731, 'recall': 0.9755025125628141, 'f1-score': 0.9619077113657479, 'support': 1592.0}, '1': {'precision': 0.8186046511627907, 'recall': 0.676923076923077, 'f1-score': 0.7410526315789474, 'support': 260.0}, 'accuracy': 0.933585313174946, 'macro avg': {'precision': 0.8836456365160319, 'recall': 0.8262127947429455, 'f1-score': 0.8514801714723477, 'support': 1852.0}, 'weighted avg': {'precision': 0.9304245741459009, 'recall': 0.933585313174946, 'f1-score': 0.9309021386095017, 'support': 1852.0}}    |
| 0.0014        | 24.0  | 2352 | 0.4883          | {'0': {'precision': 0.950337216431637, 'recall': 0.9736180904522613, 'f1-score': 0.9618367980142725, 'support': 1592.0}, '1': {'precision': 0.8099547511312217, 'recall': 0.6884615384615385, 'f1-score': 0.7442827442827443, 'support': 260.0}, 'accuracy': 0.933585313174946, 'macro avg': {'precision': 0.8801459837814294, 'recall': 0.8310398144568999, 'f1-score': 0.8530597711485084, 'support': 1852.0}, 'weighted avg': {'precision': 0.930629094953177, 'recall': 0.933585313174946, 'f1-score': 0.9312946522420277, 'support': 1852.0}}     |
| 0.0014        | 25.0  | 2450 | 0.5083          | {'0': {'precision': 0.947528981086028, 'recall': 0.9755025125628141, 'f1-score': 0.9613122872175797, 'support': 1592.0}, '1': {'precision': 0.8169014084507042, 'recall': 0.6692307692307692, 'f1-score': 0.7357293868921776, 'support': 260.0}, 'accuracy': 0.9325053995680346, 'macro avg': {'precision': 0.8822151947683661, 'recall': 0.8223666408967916, 'f1-score': 0.8485208370548787, 'support': 1852.0}, 'weighted avg': {'precision': 0.9291903369795571, 'recall': 0.9325053995680346, 'f1-score': 0.9296429815563462, 'support': 1852.0}}  |
| 0.0011        | 26.0  | 2548 | 0.4622          | {'0': {'precision': 0.9547707558859975, 'recall': 0.967964824120603, 'f1-score': 0.9613225202744854, 'support': 1592.0}, '1': {'precision': 0.7857142857142857, 'recall': 0.7192307692307692, 'f1-score': 0.751004016064257, 'support': 260.0}, 'accuracy': 0.9330453563714903, 'macro avg': {'precision': 0.8702425208001416, 'recall': 0.8435977966756861, 'f1-score': 0.8561632681693712, 'support': 1852.0}, 'weighted avg': {'precision': 0.9310371261642669, 'recall': 0.9330453563714903, 'f1-score': 0.93179616439184, 'support': 1852.0}}     |
| 0.0011        | 27.0  | 2646 | 0.5078          | {'0': {'precision': 0.9498164014687882, 'recall': 0.9748743718592965, 'f1-score': 0.9621822690638562, 'support': 1592.0}, '1': {'precision': 0.8165137614678899, 'recall': 0.6846153846153846, 'f1-score': 0.7447698744769874, 'support': 260.0}, 'accuracy': 0.9341252699784017, 'macro avg': {'precision': 0.883165081468339, 'recall': 0.8297448782373406, 'f1-score': 0.8534760717704217, 'support': 1852.0}, 'weighted avg': {'precision': 0.9311022079481438, 'recall': 0.9341252699784017, 'f1-score': 0.9316600106445333, 'support': 1852.0}}  |
| 0.0011        | 28.0  | 2744 | 0.5243          | {'0': {'precision': 0.947528981086028, 'recall': 0.9755025125628141, 'f1-score': 0.9613122872175797, 'support': 1592.0}, '1': {'precision': 0.8169014084507042, 'recall': 0.6692307692307692, 'f1-score': 0.7357293868921776, 'support': 260.0}, 'accuracy': 0.9325053995680346, 'macro avg': {'precision': 0.8822151947683661, 'recall': 0.8223666408967916, 'f1-score': 0.8485208370548787, 'support': 1852.0}, 'weighted avg': {'precision': 0.9291903369795571, 'recall': 0.9325053995680346, 'f1-score': 0.9296429815563462, 'support': 1852.0}}  |
| 0.0011        | 29.0  | 2842 | 0.4909          | {'0': {'precision': 0.9502762430939227, 'recall': 0.9723618090452262, 'f1-score': 0.9611921763427507, 'support': 1592.0}, '1': {'precision': 0.8026905829596412, 'recall': 0.6884615384615385, 'f1-score': 0.7412008281573499, 'support': 260.0}, 'accuracy': 0.9325053995680346, 'macro avg': {'precision': 0.876483413026782, 'recall': 0.8304116737533823, 'f1-score': 0.8511965022500503, 'support': 1852.0}, 'weighted avg': {'precision': 0.9295568739606002, 'recall': 0.9325053995680346, 'f1-score': 0.9303078618026835, 'support': 1852.0}}  |
| 0.0011        | 30.0  | 2940 | 0.4770          | {'0': {'precision': 0.9525277435265105, 'recall': 0.9704773869346733, 'f1-score': 0.9614187927815806, 'support': 1592.0}, '1': {'precision': 0.7956521739130434, 'recall': 0.7038461538461539, 'f1-score': 0.746938775510204, 'support': 260.0}, 'accuracy': 0.9330453563714903, 'macro avg': {'precision': 0.874089958719777, 'recall': 0.8371617703904136, 'f1-score': 0.8541787841458923, 'support': 1852.0}, 'weighted avg': {'precision': 0.9305041754382267, 'recall': 0.9330453563714903, 'f1-score': 0.9313082072035255, 'support': 1852.0}}   |
| 0.0012        | 31.0  | 3038 | 0.5137          | {'0': {'precision': 0.9464068209500609, 'recall': 0.9761306532663316, 'f1-score': 0.961038961038961, 'support': 1592.0}, '1': {'precision': 0.819047619047619, 'recall': 0.6615384615384615, 'f1-score': 0.7319148936170212, 'support': 260.0}, 'accuracy': 0.9319654427645788, 'macro avg': {'precision': 0.88272721999884, 'recall': 0.8188345574023965, 'f1-score': 0.8464769273279911, 'support': 1852.0}, 'weighted avg': {'precision': 0.9285270193870832, 'recall': 0.9319654427645788, 'f1-score': 0.9288725152885808, 'support': 1852.0}}     |
| 0.0012        | 32.0  | 3136 | 0.4797          | {'0': {'precision': 0.9553349875930521, 'recall': 0.9673366834170855, 'f1-score': 0.9612983770287141, 'support': 1592.0}, '1': {'precision': 0.7833333333333333, 'recall': 0.7230769230769231, 'f1-score': 0.752, 'support': 260.0}, 'accuracy': 0.9330453563714903, 'macro avg': {'precision': 0.8693341604631928, 'recall': 0.8452068032470043, 'f1-score': 0.856649188514357, 'support': 1852.0}, 'weighted avg': {'precision': 0.9311878871030268, 'recall': 0.9330453563714903, 'f1-score': 0.9319152355452013, 'support': 1852.0}}               |
| 0.0012        | 33.0  | 3234 | 0.5103          | {'0': {'precision': 0.947528981086028, 'recall': 0.9755025125628141, 'f1-score': 0.9613122872175797, 'support': 1592.0}, '1': {'precision': 0.8169014084507042, 'recall': 0.6692307692307692, 'f1-score': 0.7357293868921776, 'support': 260.0}, 'accuracy': 0.9325053995680346, 'macro avg': {'precision': 0.8822151947683661, 'recall': 0.8223666408967916, 'f1-score': 0.8485208370548787, 'support': 1852.0}, 'weighted avg': {'precision': 0.9291903369795571, 'recall': 0.9325053995680346, 'f1-score': 0.9296429815563462, 'support': 1852.0}}  |
| 0.0012        | 34.0  | 3332 | 0.5018          | {'0': {'precision': 0.9497549019607843, 'recall': 0.9736180904522613, 'f1-score': 0.9615384615384616, 'support': 1592.0}, '1': {'precision': 0.8090909090909091, 'recall': 0.6846153846153846, 'f1-score': 0.7416666666666667, 'support': 260.0}, 'accuracy': 0.9330453563714903, 'macro avg': {'precision': 0.8794229055258467, 'recall': 0.829116737533823, 'f1-score': 0.8516025641025642, 'support': 1852.0}, 'weighted avg': {'precision': 0.9300072571734369, 'recall': 0.9330453563714903, 'f1-score': 0.9306709309409092, 'support': 1852.0}}  |
| 0.0012        | 35.0  | 3430 | 0.5222          | {'0': {'precision': 0.9457978075517661, 'recall': 0.9755025125628141, 'f1-score': 0.9604205318491033, 'support': 1592.0}, '1': {'precision': 0.8142857142857143, 'recall': 0.6576923076923077, 'f1-score': 0.7276595744680852, 'support': 260.0}, 'accuracy': 0.9308855291576674, 'macro avg': {'precision': 0.8800417609187402, 'recall': 0.8165974101275608, 'f1-score': 0.8440400531585942, 'support': 1852.0}, 'weighted avg': {'precision': 0.927334986682882, 'recall': 0.9308855291576674, 'f1-score': 0.9277435075947487, 'support': 1852.0}}  |
| 0.001         | 36.0  | 3528 | 0.5128          | {'0': {'precision': 0.9498164014687882, 'recall': 0.9748743718592965, 'f1-score': 0.9621822690638562, 'support': 1592.0}, '1': {'precision': 0.8165137614678899, 'recall': 0.6846153846153846, 'f1-score': 0.7447698744769874, 'support': 260.0}, 'accuracy': 0.9341252699784017, 'macro avg': {'precision': 0.883165081468339, 'recall': 0.8297448782373406, 'f1-score': 0.8534760717704217, 'support': 1852.0}, 'weighted avg': {'precision': 0.9311022079481438, 'recall': 0.9341252699784017, 'f1-score': 0.9316600106445333, 'support': 1852.0}}  |
| 0.001         | 37.0  | 3626 | 0.5022          | {'0': {'precision': 0.9513846153846154, 'recall': 0.9711055276381909, 'f1-score': 0.9611439229095431, 'support': 1592.0}, '1': {'precision': 0.7973568281938326, 'recall': 0.6961538461538461, 'f1-score': 0.7433264887063655, 'support': 260.0}, 'accuracy': 0.9325053995680346, 'macro avg': {'precision': 0.874370721789224, 'recall': 0.8336296868960185, 'f1-score': 0.8522352058079543, 'support': 1852.0}, 'weighted avg': {'precision': 0.9297608439647431, 'recall': 0.9325053995680346, 'f1-score': 0.9305648014771316, 'support': 1852.0}}  |
| 0.001         | 38.0  | 3724 | 0.4909          | {'0': {'precision': 0.9525277435265105, 'recall': 0.9704773869346733, 'f1-score': 0.9614187927815806, 'support': 1592.0}, '1': {'precision': 0.7956521739130434, 'recall': 0.7038461538461539, 'f1-score': 0.746938775510204, 'support': 260.0}, 'accuracy': 0.9330453563714903, 'macro avg': {'precision': 0.874089958719777, 'recall': 0.8371617703904136, 'f1-score': 0.8541787841458923, 'support': 1852.0}, 'weighted avg': {'precision': 0.9305041754382267, 'recall': 0.9330453563714903, 'f1-score': 0.9313082072035255, 'support': 1852.0}}   |
| 0.001         | 39.0  | 3822 | 0.4992          | {'0': {'precision': 0.9508599508599509, 'recall': 0.9723618090452262, 'f1-score': 0.9614906832298137, 'support': 1592.0}, '1': {'precision': 0.8035714285714286, 'recall': 0.6923076923076923, 'f1-score': 0.743801652892562, 'support': 260.0}, 'accuracy': 0.9330453563714903, 'macro avg': {'precision': 0.8772156897156898, 'recall': 0.8323347506764592, 'f1-score': 0.8526461680611879, 'support': 1852.0}, 'weighted avg': {'precision': 0.9301822965429878, 'recall': 0.9330453563714903, 'f1-score': 0.9309295882580612, 'support': 1852.0}}  |
| 0.001         | 40.0  | 3920 | 0.4993          | {'0': {'precision': 0.9503067484662576, 'recall': 0.9729899497487438, 'f1-score': 0.9615145872129113, 'support': 1592.0}, '1': {'precision': 0.8063063063063063, 'recall': 0.6884615384615385, 'f1-score': 0.7427385892116183, 'support': 260.0}, 'accuracy': 0.9330453563714903, 'macro avg': {'precision': 0.878306527386282, 'recall': 0.8307257441051411, 'f1-score': 0.8521265882122648, 'support': 1852.0}, 'weighted avg': {'precision': 0.9300907036705841, 'recall': 0.9330453563714903, 'f1-score': 0.9308008941889717, 'support': 1852.0}}  |
| 0.0009        | 41.0  | 4018 | 0.5086          | {'0': {'precision': 0.9503676470588235, 'recall': 0.9742462311557789, 'f1-score': 0.9621588089330024, 'support': 1592.0}, '1': {'precision': 0.8136363636363636, 'recall': 0.6884615384615385, 'f1-score': 0.7458333333333333, 'support': 260.0}, 'accuracy': 0.9341252699784017, 'macro avg': {'precision': 0.8820020053475935, 'recall': 0.8313538848086587, 'f1-score': 0.8539960711331679, 'support': 1852.0}, 'weighted avg': {'precision': 0.9311721105092341, 'recall': 0.9341252699784017, 'f1-score': 0.931789141732185, 'support': 1852.0}}  |
| 0.0009        | 42.0  | 4116 | 0.5173          | {'0': {'precision': 0.9469188529591214, 'recall': 0.9748743718592965, 'f1-score': 0.960693283813061, 'support': 1592.0}, '1': {'precision': 0.812206572769953, 'recall': 0.6653846153846154, 'f1-score': 0.7315010570824524, 'support': 260.0}, 'accuracy': 0.9314254859611231, 'macro avg': {'precision': 0.8795627128645371, 'recall': 0.8201294936219559, 'f1-score': 0.8460971704477567, 'support': 1852.0}, 'weighted avg': {'precision': 0.9280067617878558, 'recall': 0.9314254859611231, 'f1-score': 0.9285172692612478, 'support': 1852.0}}   |
| 0.0009        | 43.0  | 4214 | 0.5039          | {'0': {'precision': 0.9514443761524278, 'recall': 0.9723618090452262, 'f1-score': 0.961789375582479, 'support': 1592.0}, '1': {'precision': 0.8044444444444444, 'recall': 0.6961538461538461, 'f1-score': 0.7463917525773196, 'support': 260.0}, 'accuracy': 0.933585313174946, 'macro avg': {'precision': 0.8779444102984362, 'recall': 0.8342578275995362, 'f1-score': 0.8540905640798993, 'support': 1852.0}, 'weighted avg': {'precision': 0.9308072367117822, 'recall': 0.933585313174946, 'f1-score': 0.9315499684651241, 'support': 1852.0}}    |
| 0.0009        | 44.0  | 4312 | 0.4986          | {'0': {'precision': 0.952, 'recall': 0.9717336683417085, 'f1-score': 0.9617656201429904, 'support': 1592.0}, '1': {'precision': 0.801762114537445, 'recall': 0.7, 'f1-score': 0.7474332648870636, 'support': 260.0}, 'accuracy': 0.933585313174946, 'macro avg': {'precision': 0.8768810572687225, 'recall': 0.8358668341708542, 'f1-score': 0.854599442515027, 'support': 1852.0}, 'weighted avg': {'precision': 0.9309082882179998, 'recall': 0.933585313174946, 'f1-score': 0.9316757646534974, 'support': 1852.0}}                                 |
| 0.0009        | 45.0  | 4410 | 0.4987          | {'0': {'precision': 0.9513846153846154, 'recall': 0.9711055276381909, 'f1-score': 0.9611439229095431, 'support': 1592.0}, '1': {'precision': 0.7973568281938326, 'recall': 0.6961538461538461, 'f1-score': 0.7433264887063655, 'support': 260.0}, 'accuracy': 0.9325053995680346, 'macro avg': {'precision': 0.874370721789224, 'recall': 0.8336296868960185, 'f1-score': 0.8522352058079543, 'support': 1852.0}, 'weighted avg': {'precision': 0.9297608439647431, 'recall': 0.9325053995680346, 'f1-score': 0.9305648014771316, 'support': 1852.0}}  |
| 0.0009        | 46.0  | 4508 | 0.4933          | {'0': {'precision': 0.9524984577421345, 'recall': 0.9698492462311558, 'f1-score': 0.9610955493308434, 'support': 1592.0}, '1': {'precision': 0.7922077922077922, 'recall': 0.7038461538461539, 'f1-score': 0.745417515274949, 'support': 260.0}, 'accuracy': 0.9325053995680346, 'macro avg': {'precision': 0.8723531249749634, 'recall': 0.8368477000386548, 'f1-score': 0.8532565323028962, 'support': 1852.0}, 'weighted avg': {'precision': 0.9299954485418489, 'recall': 0.9325053995680346, 'f1-score': 0.9308167756512902, 'support': 1852.0}}  |
| 0.0009        | 47.0  | 4606 | 0.4932          | {'0': {'precision': 0.9530864197530864, 'recall': 0.9698492462311558, 'f1-score': 0.9613947696139477, 'support': 1592.0}, '1': {'precision': 0.7931034482758621, 'recall': 0.7076923076923077, 'f1-score': 0.7479674796747967, 'support': 260.0}, 'accuracy': 0.9330453563714903, 'macro avg': {'precision': 0.8730949340144742, 'recall': 0.8387707769617317, 'f1-score': 0.8546811246443722, 'support': 1852.0}, 'weighted avg': {'precision': 0.9306266073426769, 'recall': 0.9330453563714903, 'f1-score': 0.9314319751300496, 'support': 1852.0}} |
| 0.0009        | 48.0  | 4704 | 0.4988          | {'0': {'precision': 0.9508599508599509, 'recall': 0.9723618090452262, 'f1-score': 0.9614906832298137, 'support': 1592.0}, '1': {'precision': 0.8035714285714286, 'recall': 0.6923076923076923, 'f1-score': 0.743801652892562, 'support': 260.0}, 'accuracy': 0.9330453563714903, 'macro avg': {'precision': 0.8772156897156898, 'recall': 0.8323347506764592, 'f1-score': 0.8526461680611879, 'support': 1852.0}, 'weighted avg': {'precision': 0.9301822965429878, 'recall': 0.9330453563714903, 'f1-score': 0.9309295882580612, 'support': 1852.0}}  |
| 0.0009        | 49.0  | 4802 | 0.4958          | {'0': {'precision': 0.95079950799508, 'recall': 0.9711055276381909, 'f1-score': 0.9608452454940957, 'support': 1592.0}, '1': {'precision': 0.7964601769911505, 'recall': 0.6923076923076923, 'f1-score': 0.7407407407407407, 'support': 260.0}, 'accuracy': 0.9319654427645788, 'macro avg': {'precision': 0.8736298424931153, 'recall': 0.8317066099729415, 'f1-score': 0.8507929931174182, 'support': 1852.0}, 'weighted avg': {'precision': 0.9291319993228221, 'recall': 0.9319654427645788, 'f1-score': 0.9299450450427608, 'support': 1852.0}}   |
| 0.0009        | 50.0  | 4900 | 0.4963          | {'0': {'precision': 0.9531153608883405, 'recall': 0.9704773869346733, 'f1-score': 0.96171802054155, 'support': 1592.0}, '1': {'precision': 0.7965367965367965, 'recall': 0.7076923076923077, 'f1-score': 0.7494908350305499, 'support': 260.0}, 'accuracy': 0.933585313174946, 'macro avg': {'precision': 0.8748260787125686, 'recall': 0.8390848473134905, 'f1-score': 0.85560442778605, 'support': 1852.0}, 'weighted avg': {'precision': 0.9311334890031345, 'recall': 0.933585313174946, 'f1-score': 0.9319237072408696, 'support': 1852.0}}       |


### Framework versions

- Transformers 4.53.1
- Pytorch 2.6.0+cu124
- Datasets 3.5.0
- Tokenizers 0.21.1