travis-simon commited on
Commit
7b8d560
·
verified ·
1 Parent(s): bb98b54

End of training

Browse files
README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.3537
22
- - F1: 0.9000
23
 
24
  ## Model description
25
 
@@ -44,22 +44,32 @@ The following hyperparameters were used during training:
44
  - seed: 42
45
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
47
- - num_epochs: 10
48
 
49
  ### Training results
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | F1 |
52
  |:-------------:|:-----:|:----:|:---------------:|:------:|
53
- | No log | 1.0 | 15 | 0.5482 | 0.7467 |
54
- | No log | 2.0 | 30 | 0.4953 | 0.7600 |
55
- | No log | 3.0 | 45 | 0.7658 | 0.7068 |
56
- | No log | 4.0 | 60 | 0.2946 | 0.8628 |
57
- | No log | 5.0 | 75 | 0.3272 | 0.8878 |
58
- | No log | 6.0 | 90 | 0.6832 | 0.8348 |
59
- | 0.2242 | 7.0 | 105 | 0.4030 | 0.8740 |
60
- | 0.2242 | 8.0 | 120 | 0.3450 | 0.8997 |
61
- | 0.2242 | 9.0 | 135 | 0.3476 | 0.9126 |
62
- | 0.2242 | 10.0 | 150 | 0.3537 | 0.9000 |
 
 
 
 
 
 
 
 
 
 
63
 
64
 
65
  ### Framework versions
@@ -67,4 +77,4 @@ The following hyperparameters were used during training:
67
  - Transformers 4.48.0.dev0
68
  - Pytorch 2.6.0+cu124
69
  - Datasets 3.1.0
70
- - Tokenizers 0.21.0
 
18
 
19
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.3695
22
+ - F1: 0.9252
23
 
24
  ## Model description
25
 
 
44
  - seed: 42
45
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
47
+ - num_epochs: 20
48
 
49
  ### Training results
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | F1 |
52
  |:-------------:|:-----:|:----:|:---------------:|:------:|
53
+ | No log | 1.0 | 15 | 0.5181 | 0.7481 |
54
+ | No log | 2.0 | 30 | 0.3811 | 0.8373 |
55
+ | No log | 3.0 | 45 | 0.6849 | 0.6865 |
56
+ | No log | 4.0 | 60 | 0.4782 | 0.8611 |
57
+ | No log | 5.0 | 75 | 0.2552 | 0.9376 |
58
+ | No log | 6.0 | 90 | 0.3630 | 0.9127 |
59
+ | 0.2889 | 7.0 | 105 | 0.4094 | 0.8618 |
60
+ | 0.2889 | 8.0 | 120 | 0.3934 | 0.8997 |
61
+ | 0.2889 | 9.0 | 135 | 0.3548 | 0.9376 |
62
+ | 0.2889 | 10.0 | 150 | 0.4377 | 0.8746 |
63
+ | 0.2889 | 11.0 | 165 | 0.4106 | 0.9126 |
64
+ | 0.2889 | 12.0 | 180 | 0.4450 | 0.8997 |
65
+ | 0.2889 | 13.0 | 195 | 0.3728 | 0.9376 |
66
+ | 0.0041 | 14.0 | 210 | 0.3698 | 0.9252 |
67
+ | 0.0041 | 15.0 | 225 | 0.3708 | 0.9252 |
68
+ | 0.0041 | 16.0 | 240 | 0.3696 | 0.9252 |
69
+ | 0.0041 | 17.0 | 255 | 0.3703 | 0.9252 |
70
+ | 0.0041 | 18.0 | 270 | 0.3718 | 0.9252 |
71
+ | 0.0041 | 19.0 | 285 | 0.3722 | 0.9252 |
72
+ | 0.0 | 20.0 | 300 | 0.3695 | 0.9252 |
73
 
74
 
75
  ### Framework versions
 
77
  - Transformers 4.48.0.dev0
78
  - Pytorch 2.6.0+cu124
79
  - Datasets 3.1.0
80
+ - Tokenizers 0.21.1
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3e7048e1483b0fc90269afb947fcb1bc86c85c238a64a9a7357df6d900b77627
3
  size 598439784
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:39f93fbfbf957dccd47c76d47e05b09d36ac0afa9bce35ab4617f610fa516a53
3
  size 598439784
runs/Mar21_00-44-52_5eacb8eb673a/events.out.tfevents.1742517902.5eacb8eb673a.358.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1bb392a93355440173f15fe59e93dad90dd4f970506154c63d027514b869e5c3
3
+ size 13123
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:644b890d805bb15322d99222e55de6b1bf2ff6e72809a2db256fa0600ad02511
3
  size 5432
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cc8eead68c3a2092b62da56f6c48de35ace65a8f2c6694f348004d1d51d4b70e
3
  size 5432