drcoool commited on
Commit
a3e38c6
·
verified ·
1 Parent(s): 73c0d65

End of training

Browse files
README.md CHANGED
@@ -5,7 +5,7 @@ base_model: answerdotai/ModernBERT-base
5
  tags:
6
  - generated_from_trainer
7
  metrics:
8
- - f1
9
  model-index:
10
  - name: featured-articles
11
  results: []
@@ -18,8 +18,15 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.8059
22
- - F1: 0.6817
 
 
 
 
 
 
 
23
 
24
  ## Model description
25
 
@@ -38,21 +45,22 @@ More information needed
38
  ### Training hyperparameters
39
 
40
  The following hyperparameters were used during training:
41
- - learning_rate: 5e-05
42
  - train_batch_size: 8
43
- - eval_batch_size: 4
44
  - seed: 42
45
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
47
- - num_epochs: 3
48
 
49
  ### Training results
50
 
51
- | Training Loss | Epoch | Step | Validation Loss | F1 |
52
- |:-------------:|:-----:|:----:|:---------------:|:------:|
53
- | 0.6394 | 1.0 | 269 | 0.5860 | 0.6517 |
54
- | 0.4479 | 2.0 | 538 | 0.7460 | 0.5421 |
55
- | 0.2307 | 3.0 | 807 | 0.8059 | 0.6817 |
 
56
 
57
 
58
  ### Framework versions
 
5
  tags:
6
  - generated_from_trainer
7
  metrics:
8
+ - accuracy
9
  model-index:
10
  - name: featured-articles
11
  results: []
 
18
 
19
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 1.9620
22
+ - Weighted F1: 0.6740
23
+ - Accepted Precision: 0.7453
24
+ - Accepted Recall: 0.7790
25
+ - Accepted F1: 0.7618
26
+ - Rejected Precision: 0.5273
27
+ - Rejected Recall: 0.4807
28
+ - Rejected F1: 0.5029
29
+ - Accuracy: 0.6779
30
 
31
  ## Model description
32
 
 
45
  ### Training hyperparameters
46
 
47
  The following hyperparameters were used during training:
48
+ - learning_rate: 3e-05
49
  - train_batch_size: 8
50
+ - eval_batch_size: 8
51
  - seed: 42
52
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
53
  - lr_scheduler_type: linear
54
+ - num_epochs: 4
55
 
56
  ### Training results
57
 
58
+ | Training Loss | Epoch | Step | Validation Loss | Weighted F1 | Accepted Precision | Accepted Recall | Accepted F1 | Rejected Precision | Rejected Recall | Rejected F1 | Accuracy |
59
+ |:-------------:|:-----:|:----:|:---------------:|:-----------:|:------------------:|:---------------:|:-----------:|:------------------:|:---------------:|:-----------:|:--------:|
60
+ | 0.6595 | 1.0 | 267 | 0.6187 | 0.6876 | 0.752 | 0.7989 | 0.7747 | 0.5535 | 0.4862 | 0.5176 | 0.6929 |
61
+ | 0.4807 | 2.0 | 534 | 0.7625 | 0.5677 | 0.8030 | 0.4504 | 0.5771 | 0.4226 | 0.7845 | 0.5493 | 0.5637 |
62
+ | 0.3013 | 3.0 | 801 | 1.7444 | 0.6577 | 0.7105 | 0.9178 | 0.8010 | 0.6282 | 0.2707 | 0.3784 | 0.6985 |
63
+ | 0.0381 | 4.0 | 1068 | 1.9620 | 0.6740 | 0.7453 | 0.7790 | 0.7618 | 0.5273 | 0.4807 | 0.5029 | 0.6779 |
64
 
65
 
66
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7b06955d17eb3060058ba75233716f6518816e4c36f5e0321629a16004c7b79c
3
  size 598439784
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:33c134cd604d021b99a9709616af5571ce8f59accfec9bf1f13522604980e99c
3
  size 598439784
runs/May29_12-36-49_MacBook-Pro-2.local/events.out.tfevents.1748540209.MacBook-Pro-2.local.97121.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a927e1d7ec4b42f7e4184966906d83adde07e7afdac3fc425c47668e66499f33
3
+ size 5680
runs/May29_12-37-56_MacBook-Pro-2.local/events.out.tfevents.1748540276.MacBook-Pro-2.local.97336.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:763c214d7befa551ba9fa668c54819db43ff4a6b2d59652818bd5864d287ecbc
3
+ size 20113
tokenizer.json CHANGED
@@ -2,13 +2,13 @@
2
  "version": "1.0",
3
  "truncation": {
4
  "direction": "Right",
5
- "max_length": 512,
6
  "strategy": "LongestFirst",
7
  "stride": 0
8
  },
9
  "padding": {
10
  "strategy": {
11
- "Fixed": 512
12
  },
13
  "direction": "Right",
14
  "pad_to_multiple_of": null,
 
2
  "version": "1.0",
3
  "truncation": {
4
  "direction": "Right",
5
+ "max_length": 384,
6
  "strategy": "LongestFirst",
7
  "stride": 0
8
  },
9
  "padding": {
10
  "strategy": {
11
+ "Fixed": 384
12
  },
13
  "direction": "Right",
14
  "pad_to_multiple_of": null,
tokenizer_config.json CHANGED
@@ -937,7 +937,7 @@
937
  "input_ids",
938
  "attention_mask"
939
  ],
940
- "model_max_length": 512,
941
  "pad_token": "[PAD]",
942
  "sep_token": "[SEP]",
943
  "tokenizer_class": "PreTrainedTokenizerFast",
 
937
  "input_ids",
938
  "attention_mask"
939
  ],
940
+ "model_max_length": 384,
941
  "pad_token": "[PAD]",
942
  "sep_token": "[SEP]",
943
  "tokenizer_class": "PreTrainedTokenizerFast",
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:770636106f5c8bc29870b48203e9f7053a6437e8e5524a68401c30aa0a7dd1ae
3
  size 5368
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:25219de5de47050884cc875b8c970f49a500b1cb3003d95c150add87360bc54e
3
  size 5368