ccaug
/

results

@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1729
-- Accuracy: 0.9439
-- F1: 0.9439
-- Precision: 0.9461
-- Recall: 0.9439
 ## Model description
@@ -44,7 +44,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 5e-05
 - train_batch_size: 6
 - eval_batch_size: 6
 - seed: 42
@@ -52,22 +52,25 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 12
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
 |:-------------:|:------:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
-| 3.3687        | 0.1111 | 25   | 1.3275          | 0.425    | 0.3703 | 0.6006    | 0.425  |
-| 1.9545        | 0.2222 | 50   | 0.8983          | 0.6839   | 0.6536 | 0.6803    | 0.6839 |
-| 1.2368        | 0.3333 | 75   | 0.4575          | 0.8367   | 0.8323 | 0.8547    | 0.8367 |
-| 0.9007        | 0.4444 | 100  | 0.4360          | 0.8578   | 0.8436 | 0.8784    | 0.8578 |
-| 0.8601        | 0.5556 | 125  | 0.2811          | 0.8856   | 0.8869 | 0.8931    | 0.8856 |
-| 0.5962        | 0.6667 | 150  | 0.3887          | 0.8817   | 0.8717 | 0.9074    | 0.8817 |
-| 0.5623        | 0.7778 | 175  | 0.2442          | 0.9128   | 0.9102 | 0.9233    | 0.9128 |
-| 0.295         | 0.8889 | 200  | 0.2014          | 0.9283   | 0.9264 | 0.9361    | 0.9283 |
-| 0.6196        | 1.0    | 225  | 0.1729          | 0.9439   | 0.9439 | 0.9461    | 0.9439 |
 ### Framework versions

 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2000
+- Accuracy: 0.9433
+- F1: 0.9429
+- Precision: 0.9508
+- Recall: 0.9433
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 4e-05
 - train_batch_size: 6
 - eval_batch_size: 6
 - seed: 42
 - total_train_batch_size: 12
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 2
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
 |:-------------:|:------:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
+| 3.8028        | 0.0833 | 25   | 1.5191          | 0.3944   | 0.2893 | 0.4598    | 0.3944 |
+| 2.2046        | 0.1667 | 50   | 0.7147          | 0.75     | 0.7423 | 0.7685    | 0.75   |
+| 1.2172        | 0.25   | 75   | 0.6074          | 0.7989   | 0.7727 | 0.8508    | 0.7989 |
+| 0.9054        | 0.3333 | 100  | 0.3817          | 0.8656   | 0.8637 | 0.8907    | 0.8656 |
+| 0.873         | 0.4167 | 125  | 0.3460          | 0.8678   | 0.8665 | 0.8810    | 0.8678 |
+| 0.7074        | 0.5    | 150  | 0.2918          | 0.8889   | 0.8848 | 0.9159    | 0.8889 |
+| 1.0552        | 0.5833 | 175  | 0.2550          | 0.89     | 0.8868 | 0.9130    | 0.89   |
+| 0.5167        | 0.6667 | 200  | 0.2660          | 0.9044   | 0.9043 | 0.9071    | 0.9044 |
+| 0.3174        | 0.75   | 225  | 0.2641          | 0.8956   | 0.8882 | 0.9235    | 0.8956 |
+| 0.3369        | 0.8333 | 250  | 0.1745          | 0.9489   | 0.9490 | 0.9520    | 0.9489 |
+| 0.2966        | 0.9167 | 275  | 0.1484          | 0.9567   | 0.9568 | 0.9589    | 0.9567 |
+| 0.5544        | 1.0    | 300  | 0.2000          | 0.9433   | 0.9429 | 0.9508    | 0.9433 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:73a22c47896844149915e77622eabcec9a3cfd867243f68d5bd0687b11a00497
 size 598461316

 version https://git-lfs.github.com/spec/v1
+oid sha256:b2f89595e496e2c31773377d1e8822a1b55bbae0d268e0e4fd74fe00bbac3f01
 size 598461316

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f7ec83b42e86ca222902aeff78581a6aecbaa9bd51002bddc3f3eb0fae99720f
 size 5777

 version https://git-lfs.github.com/spec/v1
+oid sha256:1338d801601b58554a849c9ae5a59dae0269bba5783fe44b7da195e0dbd8ab49
 size 5777