End of training

Files changed (7)

README.md +19 -11
model.safetensors +1 -1
runs/May29_12-36-49_MacBook-Pro-2.local/events.out.tfevents.1748540209.MacBook-Pro-2.local.97121.0 +3 -0
runs/May29_12-37-56_MacBook-Pro-2.local/events.out.tfevents.1748540276.MacBook-Pro-2.local.97336.0 +3 -0
tokenizer.json +2 -2
tokenizer_config.json +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -5,7 +5,7 @@ base_model: answerdotai/ModernBERT-base
 tags:
 - generated_from_trainer
 metrics:
-- f1
 model-index:
 - name: featured-articles
   results: []
@@ -18,8 +18,15 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.8059
-- F1: 0.6817
 ## Model description
@@ -38,21 +45,22 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 5e-05
 - train_batch_size: 8
-- eval_batch_size: 4
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 3
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | F1     |
-|:-------------:|:-----:|:----:|:---------------:|:------:|
-| 0.6394        | 1.0   | 269  | 0.5860          | 0.6517 |
-| 0.4479        | 2.0   | 538  | 0.7460          | 0.5421 |
-| 0.2307        | 3.0   | 807  | 0.8059          | 0.6817 |
 ### Framework versions

 tags:
 - generated_from_trainer
 metrics:
+- accuracy
 model-index:
 - name: featured-articles
   results: []
 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.9620
+- Weighted F1: 0.6740
+- Accepted Precision: 0.7453
+- Accepted Recall: 0.7790
+- Accepted F1: 0.7618
+- Rejected Precision: 0.5273
+- Rejected Recall: 0.4807
+- Rejected F1: 0.5029
+- Accuracy: 0.6779
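
The per-class F1 values listed above should equal the harmonic mean of the reported precision and recall. A minimal sketch for sanity-checking them; the helper name `f1_from_pr` is illustrative and is not taken from the training code:

```python
def f1_from_pr(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Final-epoch precision/recall copied from the evaluation results above.
accepted_f1 = f1_from_pr(0.7453, 0.7790)
rejected_f1 = f1_from_pr(0.5273, 0.4807)

# Compare against the reported Accepted F1 (0.7618) and Rejected F1 (0.5029).
print(round(accepted_f1, 4), round(rejected_f1, 4))
```
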
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 3e-05
 - train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 4
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Weighted F1 | Accepted Precision | Accepted Recall | Accepted F1 | Rejected Precision | Rejected Recall | Rejected F1 | Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:-----------:|:------------------:|:---------------:|:-----------:|:------------------:|:---------------:|:-----------:|:--------:|
+| 0.6595        | 1.0   | 267  | 0.6187          | 0.6876      | 0.752              | 0.7989          | 0.7747      | 0.5535             | 0.4862          | 0.5176      | 0.6929   |
+| 0.4807        | 2.0   | 534  | 0.7625          | 0.5677      | 0.8030             | 0.4504          | 0.5771      | 0.4226             | 0.7845          | 0.5493      | 0.5637   |
+| 0.3013        | 3.0   | 801  | 1.7444          | 0.6577      | 0.7105             | 0.9178          | 0.8010      | 0.6282             | 0.2707          | 0.3784      | 0.6985   |
+| 0.0381        | 4.0   | 1068 | 1.9620          | 0.6740      | 0.7453             | 0.7790          | 0.7618      | 0.5273             | 0.4807          | 0.5029      | 0.6779   |
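
In the table above, validation loss rises from 0.6187 to 1.9620 while training loss falls to 0.0381, so the final epoch is not necessarily the best checkpoint. A small sketch, using a few columns copied from the table, that picks an epoch by a chosen metric; the `rows` structure and the `best_epoch` helper are illustrative only:

```python
# A subset of the columns from the training results table.
rows = [
    {"epoch": 1, "train_loss": 0.6595, "val_loss": 0.6187, "weighted_f1": 0.6876, "accuracy": 0.6929},
    {"epoch": 2, "train_loss": 0.4807, "val_loss": 0.7625, "weighted_f1": 0.5677, "accuracy": 0.5637},
    {"epoch": 3, "train_loss": 0.3013, "val_loss": 1.7444, "weighted_f1": 0.6577, "accuracy": 0.6985},
    {"epoch": 4, "train_loss": 0.0381, "val_loss": 1.9620, "weighted_f1": 0.6740, "accuracy": 0.6779},
]


def best_epoch(rows, metric, higher_is_better=True):
    """Return the epoch whose row has the best value for `metric`."""
    pick = max if higher_is_better else min
    return pick(rows, key=lambda row: row[metric])["epoch"]


print(best_epoch(rows, "val_loss", higher_is_better=False))
print(best_epoch(rows, "weighted_f1"))
print(best_epoch(rows, "accuracy"))
```

With the Hugging Face `Trainer`, the `load_best_model_at_end` and `metric_for_best_model` options of `TrainingArguments` serve this purpose; the diff does not show whether this run set them.
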
 ### Framework versions

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7b06955d17eb3060058ba75233716f6518816e4c36f5e0321629a16004c7b79c
 size 598439784

 version https://git-lfs.github.com/spec/v1
+oid sha256:33c134cd604d021b99a9709616af5571ce8f59accfec9bf1f13522604980e99c
 size 598439784

runs/May29_12-36-49_MacBook-Pro-2.local/events.out.tfevents.1748540209.MacBook-Pro-2.local.97121.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a927e1d7ec4b42f7e4184966906d83adde07e7afdac3fc425c47668e66499f33
+size 5680

runs/May29_12-37-56_MacBook-Pro-2.local/events.out.tfevents.1748540276.MacBook-Pro-2.local.97336.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:763c214d7befa551ba9fa668c54819db43ff4a6b2d59652818bd5864d287ecbc
+size 20113

tokenizer.json CHANGED

@@ -2,13 +2,13 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length": 512,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
-      "Fixed": 512
     },
     "direction": "Right",
     "pad_to_multiple_of": null,

   "version": "1.0",
   "truncation": {
     "direction": "Right",
+    "max_length": 384,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
+      "Fixed": 384
     },
     "direction": "Right",
     "pad_to_multiple_of": null,
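
The change above lowers both the truncation length and the fixed padding length from 512 to 384, so each encoded sequence should come out exactly 384 tokens long. A rough pure-Python sketch of that right-truncate, right-pad behaviour; `pad_to_fixed` and the `pad_id=0` default are placeholders rather than the tokenizer's actual implementation or pad token id, and special-token handling is ignored:

```python
def pad_to_fixed(ids, max_length=384, pad_id=0):
    """Right-truncate to max_length, then right-pad to exactly max_length.

    Returns (input_ids, attention_mask). pad_id is a hypothetical value.
    """
    ids = list(ids)[:max_length]              # "direction": "Right" truncation
    attention_mask = [1] * len(ids)           # real tokens are attended to
    padding = max_length - len(ids)
    return ids + [pad_id] * padding, attention_mask + [0] * padding


short_ids, short_mask = pad_to_fixed(range(1, 11))    # 10 tokens, padded up
long_ids, long_mask = pad_to_fixed(range(1, 601))     # 600 tokens, truncated
print(len(short_ids), sum(short_mask), len(long_ids), sum(long_mask))
```
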

tokenizer_config.json CHANGED

@@ -937,7 +937,7 @@
     "input_ids",
     "attention_mask"
   ],
-  "model_max_length": 512,
   "pad_token": "[PAD]",
   "sep_token": "[SEP]",
   "tokenizer_class": "PreTrainedTokenizerFast",

     "input_ids",
     "attention_mask"
   ],
+  "model_max_length": 384,
   "pad_token": "[PAD]",
   "sep_token": "[SEP]",
   "tokenizer_class": "PreTrainedTokenizerFast",

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:770636106f5c8bc29870b48203e9f7053a6437e8e5524a68401c30aa0a7dd1ae
 size 5368

 version https://git-lfs.github.com/spec/v1
+oid sha256:25219de5de47050884cc875b8c970f49a500b1cb3003d95c150add87360bc54e
 size 5368