maud-dr
/

model_2_stage2-seed_2025

@@ -9,21 +9,21 @@ metrics:
 - recall
 - f1
 model-index:
-- name: model_2_stage2-seed_123
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# model_2_stage2-seed_123
 This model is a fine-tuned version of [maud-dr/model_2_stage1](https://huggingface.co/maud-dr/model_2_stage1) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.7954
-- Precision: 0.6352
-- Recall: 0.7319
-- F1: 0.6801
 ## Model description
@@ -45,7 +45,7 @@ The following hyperparameters were used during training:
 - learning_rate: 0.0003
 - train_batch_size: 8
 - eval_batch_size: 8
-- seed: 123
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 15
@@ -54,21 +54,21 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|
-| 0.5075        | 1.0   | 447  | 0.6591          | 0.6296    | 0.6159 | 0.6227 |
-| 0.4241        | 2.0   | 894  | 0.8133          | 0.6012    | 0.7536 | 0.6688 |
-| 0.3651        | 3.0   | 1341 | 0.8972          | 0.6092    | 0.7174 | 0.6589 |
-| 0.3389        | 4.0   | 1788 | 1.3235          | 0.6040    | 0.7572 | 0.6720 |
-| 0.2572        | 5.0   | 2235 | 1.2850          | 0.6378    | 0.7464 | 0.6878 |
-| 0.187         | 6.0   | 2682 | 1.4055          | 0.6114    | 0.7754 | 0.6837 |
-| 0.1456        | 7.0   | 3129 | 1.8037          | 0.6464    | 0.6558 | 0.6511 |
-| 0.1386        | 8.0   | 3576 | 1.8962          | 0.6181    | 0.6920 | 0.6530 |
-| 0.1003        | 9.0   | 4023 | 2.1076          | 0.6198    | 0.7029 | 0.6587 |
-| 0.0738        | 10.0  | 4470 | 2.4260          | 0.6463    | 0.7283 | 0.6848 |
-| 0.0233        | 11.0  | 4917 | 2.5047          | 0.6242    | 0.7464 | 0.6799 |
-| 0.0677        | 12.0  | 5364 | 2.6329          | 0.6238    | 0.7029 | 0.6610 |
-| 0.0249        | 13.0  | 5811 | 2.5839          | 0.6429    | 0.7174 | 0.6781 |
-| 0.0249        | 14.0  | 6258 | 2.7944          | 0.6347    | 0.7428 | 0.6845 |
-| 0.0228        | 15.0  | 6705 | 2.7954          | 0.6352    | 0.7319 | 0.6801 |
 ### Framework versions

 - recall
 - f1
 model-index:
+- name: model_2_stage2-seed_2025
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# model_2_stage2-seed_2025
 This model is a fine-tuned version of [maud-dr/model_2_stage1](https://huggingface.co/maud-dr/model_2_stage1) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.6600
+- Precision: 0.6346
+- Recall: 0.7174
+- F1: 0.6735
 ## Model description
 - learning_rate: 0.0003
 - train_batch_size: 8
 - eval_batch_size: 8
+- seed: 2025
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 15
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|
+| 0.2914        | 1.0   | 447  | 1.5544          | 0.5815    | 0.7754 | 0.6646 |
+| 0.2523        | 2.0   | 894  | 1.6443          | 0.6469    | 0.6703 | 0.6584 |
+| 0.16          | 3.0   | 1341 | 1.8783          | 0.6144    | 0.6812 | 0.6460 |
+| 0.1354        | 4.0   | 1788 | 1.5711          | 0.6287    | 0.7790 | 0.6958 |
+| 0.1321        | 5.0   | 2235 | 1.7032          | 0.6607    | 0.6703 | 0.6655 |
+| 0.1108        | 6.0   | 2682 | 1.9982          | 0.6144    | 0.6812 | 0.6460 |
+| 0.103         | 7.0   | 3129 | 2.2463          | 0.6146    | 0.6993 | 0.6542 |
+| 0.0778        | 8.0   | 3576 | 2.3003          | 0.6304    | 0.6920 | 0.6598 |
+| 0.0428        | 9.0   | 4023 | 2.6554          | 0.6226    | 0.6993 | 0.6587 |
+| 0.0589        | 10.0  | 4470 | 2.4618          | 0.6237    | 0.6667 | 0.6445 |
+| 0.046         | 11.0  | 4917 | 2.5882          | 0.6242    | 0.7101 | 0.6644 |
+| 0.0311        | 12.0  | 5364 | 2.5561          | 0.6321    | 0.7283 | 0.6768 |
+| 0.0288        | 13.0  | 5811 | 2.6707          | 0.6410    | 0.7246 | 0.6803 |
+| 0.0296        | 14.0  | 6258 | 2.6000          | 0.6343    | 0.7101 | 0.6701 |
+| 0.0002        | 15.0  | 6705 | 2.6600          | 0.6346    | 0.7174 | 0.6735 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4781f1f99cd9fc34133949dd4639cdec472382438d6e0001e08d4f63ec3262c6
 size 894020048

 version https://git-lfs.github.com/spec/v1
+oid sha256:3d05e3f87dbaba9506739173909dbf483ea55ebd18081b1d19d9c1e43702379e
 size 894020048