End of training

Browse files

Files changed (4) hide show

README.md +24 -14
model.safetensors +1 -1
runs/Mar21_00-44-52_5eacb8eb673a/events.out.tfevents.1742517902.5eacb8eb673a.358.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3537
-- F1: 0.9000
 ## Model description
@@ -44,22 +44,32 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| No log        | 1.0   | 15   | 0.5482          | 0.7467 |
-| No log        | 2.0   | 30   | 0.4953          | 0.7600 |
-| No log        | 3.0   | 45   | 0.7658          | 0.7068 |
-| No log        | 4.0   | 60   | 0.2946          | 0.8628 |
-| No log        | 5.0   | 75   | 0.3272          | 0.8878 |
-| No log        | 6.0   | 90   | 0.6832          | 0.8348 |
-| 0.2242        | 7.0   | 105  | 0.4030          | 0.8740 |
-| 0.2242        | 8.0   | 120  | 0.3450          | 0.8997 |
-| 0.2242        | 9.0   | 135  | 0.3476          | 0.9126 |
-| 0.2242        | 10.0  | 150  | 0.3537          | 0.9000 |
 ### Framework versions
@@ -67,4 +77,4 @@ The following hyperparameters were used during training:
 - Transformers 4.48.0.dev0
 - Pytorch 2.6.0+cu124
 - Datasets 3.1.0
-- Tokenizers 0.21.0

 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3695
+- F1: 0.9252
 ## Model description
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 20
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| No log        | 1.0   | 15   | 0.5181          | 0.7481 |
+| No log        | 2.0   | 30   | 0.3811          | 0.8373 |
+| No log        | 3.0   | 45   | 0.6849          | 0.6865 |
+| No log        | 4.0   | 60   | 0.4782          | 0.8611 |
+| No log        | 5.0   | 75   | 0.2552          | 0.9376 |
+| No log        | 6.0   | 90   | 0.3630          | 0.9127 |
+| 0.2889        | 7.0   | 105  | 0.4094          | 0.8618 |
+| 0.2889        | 8.0   | 120  | 0.3934          | 0.8997 |
+| 0.2889        | 9.0   | 135  | 0.3548          | 0.9376 |
+| 0.2889        | 10.0  | 150  | 0.4377          | 0.8746 |
+| 0.2889        | 11.0  | 165  | 0.4106          | 0.9126 |
+| 0.2889        | 12.0  | 180  | 0.4450          | 0.8997 |
+| 0.2889        | 13.0  | 195  | 0.3728          | 0.9376 |
+| 0.0041        | 14.0  | 210  | 0.3698          | 0.9252 |
+| 0.0041        | 15.0  | 225  | 0.3708          | 0.9252 |
+| 0.0041        | 16.0  | 240  | 0.3696          | 0.9252 |
+| 0.0041        | 17.0  | 255  | 0.3703          | 0.9252 |
+| 0.0041        | 18.0  | 270  | 0.3718          | 0.9252 |
+| 0.0041        | 19.0  | 285  | 0.3722          | 0.9252 |
+| 0.0           | 20.0  | 300  | 0.3695          | 0.9252 |
 ### Framework versions
 - Transformers 4.48.0.dev0
 - Pytorch 2.6.0+cu124
 - Datasets 3.1.0
+- Tokenizers 0.21.1

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3e7048e1483b0fc90269afb947fcb1bc86c85c238a64a9a7357df6d900b77627
 size 598439784

 version https://git-lfs.github.com/spec/v1
+oid sha256:39f93fbfbf957dccd47c76d47e05b09d36ac0afa9bce35ab4617f610fa516a53
 size 598439784

runs/Mar21_00-44-52_5eacb8eb673a/events.out.tfevents.1742517902.5eacb8eb673a.358.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1bb392a93355440173f15fe59e93dad90dd4f970506154c63d027514b869e5c3
+size 13123

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:644b890d805bb15322d99222e55de6b1bf2ff6e72809a2db256fa0600ad02511
 size 5432

 version https://git-lfs.github.com/spec/v1
+oid sha256:cc8eead68c3a2092b62da56f6c48de35ace65a8f2c6694f348004d1d51d4b70e
 size 5432