aged-colt-222
This model is a fine-tuned version of google-bert/bert-base-cased on an unspecified dataset. It achieves the following results on the evaluation set:
- Loss: 0.2040
- Hamming Loss: 0.0629
- Zero One Loss: 0.3725
- Jaccard Score: 0.3164
- Hamming Loss Optimised: 0.0602
- Hamming Loss Threshold: 0.6941
- Zero One Loss Optimised: 0.3712
- Zero One Loss Threshold: 0.5690
- Jaccard Score Optimised: 0.3027
- Jaccard Score Threshold: 0.3189
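The card does not say how these metrics were computed or how the thresholds were chosen. As a rough sketch, assume the model emits one logit per label, a sigmoid turns logits into probabilities, and a single global threshold is swept over a grid. Under those assumptions the standard definitions look like the following. The helper names (`binarize`, `best_threshold`) and the toy data are hypothetical, and `jaccard_similarity` returns a similarity (higher is better), which may not match the convention behind the Jaccard values reported above.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def binarize(logits, threshold=0.5):
    # Assumption: a label is predicted when its sigmoid probability >= threshold.
    return [[1 if sigmoid(z) >= threshold else 0 for z in row] for row in logits]

def hamming_loss(y_true, y_pred):
    # Fraction of individual label slots that are wrong.
    total = sum(len(row) for row in y_true)
    wrong = sum(t != p for tr, pr in zip(y_true, y_pred) for t, p in zip(tr, pr))
    return wrong / total

def zero_one_loss(y_true, y_pred):
    # Fraction of samples whose full label vector is not exactly right.
    return sum(tr != pr for tr, pr in zip(y_true, y_pred)) / len(y_true)

def jaccard_similarity(y_true, y_pred):
    # Per-sample intersection over union, averaged over samples.
    # Convention chosen here: an empty union counts as a perfect match.
    scores = []
    for tr, pr in zip(y_true, y_pred):
        inter = sum(t and p for t, p in zip(tr, pr))
        union = sum(t or p for t, p in zip(tr, pr))
        scores.append(inter / union if union else 1.0)
    return sum(scores) / len(scores)

def best_threshold(metric, y_true, logits, grid, lower_is_better=True):
    # Hypothetical tuning procedure: evaluate the metric at every grid point
    # and return (best metric value, threshold that achieved it).
    scored = [(metric(y_true, binarize(logits, t)), t) for t in grid]
    return min(scored) if lower_is_better else max(scored)

# Toy data (made up for illustration): 4 samples, 3 labels.
y_true = [[1, 0, 1], [0, 1, 0], [1, 1, 0], [0, 0, 1]]
logits = [[2.0, -1.0, 0.3], [-2.0, 1.5, -0.5], [0.2, -0.3, -3.0], [-1.0, -2.0, 0.9]]
grid = [i / 20 for i in range(1, 20)]  # 0.05, 0.10, ..., 0.95

preds = binarize(logits, 0.5)
print(hamming_loss(y_true, preds), zero_one_loss(y_true, preds))
print(best_threshold(hamming_loss, y_true, logits, grid))
```

In this reading, each "Optimised" value is the metric at its best threshold, and the matching "Threshold" entry is the threshold that produced it.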
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 1.8777284034581645e-05
- train_batch_size: 4
- eval_batch_size: 4
- seed: 2024
- optimizer: AdamW (OptimizerNames.ADAMW_TORCH) with betas=(0.9, 0.999) and epsilon=1e-08; no additional optimizer arguments
- lr_scheduler_type: linear
- num_epochs: 8
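For anyone trying to reproduce a similar run, the hyperparameters above map onto `transformers.TrainingArguments` roughly as sketched below. This is not the original training script. The output directory is a hypothetical placeholder, and mapping `train_batch_size` to `per_device_train_batch_size` assumes a single device with no gradient accumulation.

```python
# Keyword arguments mirroring the hyperparameters listed above.
training_kwargs = {
    "output_dir": "aged-colt-222",       # hypothetical output path
    "learning_rate": 1.8777284034581645e-05,
    "per_device_train_batch_size": 4,    # assumes a single device
    "per_device_eval_batch_size": 4,
    "seed": 2024,
    "optim": "adamw_torch",
    "adam_beta1": 0.9,
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-08,
    "lr_scheduler_type": "linear",
    "num_train_epochs": 8,
}

def build_training_arguments():
    # Imported lazily so the dictionary above can be inspected
    # without transformers installed.
    from transformers import TrainingArguments
    return TrainingArguments(**training_kwargs)

# The results table reports 800 optimizer steps per epoch. Under the
# single-device, no-accumulation assumption that implies roughly:
steps_per_epoch = 800
approx_train_examples = steps_per_epoch * training_kwargs["per_device_train_batch_size"]
total_steps = steps_per_epoch * training_kwargs["num_train_epochs"]
print(approx_train_examples, total_steps)
```

The implied figure of about 3200 training examples is an estimate only, since the final batch of an epoch may be partial.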
Training results
| Training Loss | Epoch | Step | Validation Loss | Hamming Loss | Zero One Loss | Jaccard Score | Hamming Loss Optimised | Hamming Loss Threshold | Zero One Loss Optimised | Zero One Loss Threshold | Jaccard Score Optimised | Jaccard Score Threshold |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.2942 | 1.0 | 800 | 0.1817 | 0.0614 | 0.515 | 0.4782 | 0.0599 | 0.4035 | 0.4463 | 0.3226 | 0.3475 | 0.2729 |
| 0.1593 | 2.0 | 1600 | 0.1695 | 0.0586 | 0.4062 | 0.3618 | 0.0585 | 0.4832 | 0.395 | 0.4651 | 0.3226 | 0.2867 |
| 0.1236 | 3.0 | 2400 | 0.1682 | 0.0594 | 0.3888 | 0.3403 | 0.0564 | 0.6322 | 0.375 | 0.4467 | 0.3003 | 0.2734 |
| 0.1007 | 4.0 | 3200 | 0.1784 | 0.0574 | 0.3625 | 0.3143 | 0.0571 | 0.5063 | 0.3612 | 0.4823 | 0.3035 | 0.2923 |
| 0.0775 | 5.0 | 4000 | 0.1822 | 0.0615 | 0.3662 | 0.3159 | 0.0581 | 0.7021 | 0.3675 | 0.4750 | 0.3041 | 0.3292 |
| 0.059 | 6.0 | 4800 | 0.1951 | 0.0633 | 0.3688 | 0.3113 | 0.06 | 0.7800 | 0.3675 | 0.5584 | 0.2981 | 0.3377 |
| 0.0474 | 7.0 | 5600 | 0.2031 | 0.0636 | 0.38 | 0.3242 | 0.06 | 0.7791 | 0.375 | 0.5376 | 0.3068 | 0.1991 |
| 0.0406 | 8.0 | 6400 | 0.2040 | 0.0629 | 0.3725 | 0.3164 | 0.0602 | 0.6941 | 0.3712 | 0.5690 | 0.3027 | 0.3189 |
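The table shows that the final epoch is not the best one on every metric. A small sketch that picks the best epoch per metric, using values copied from the table, is given below. Only metrics where lower is clearly better are included, and the variable names are illustrative.

```python
# (epoch, validation loss, Hamming loss, zero-one loss), copied from the table.
rows = [
    (1, 0.1817, 0.0614, 0.5150),
    (2, 0.1695, 0.0586, 0.4062),
    (3, 0.1682, 0.0594, 0.3888),
    (4, 0.1784, 0.0574, 0.3625),
    (5, 0.1822, 0.0615, 0.3662),
    (6, 0.1951, 0.0633, 0.3688),
    (7, 0.2031, 0.0636, 0.3800),
    (8, 0.2040, 0.0629, 0.3725),
]
columns = {"validation_loss": 1, "hamming_loss": 2, "zero_one_loss": 3}

# For each metric, find the epoch with the smallest value.
best_epoch = {
    name: min(rows, key=lambda r: r[idx])[0] for name, idx in columns.items()
}
print(best_epoch)
```

Validation loss bottoms out at epoch 3 and then rises while training loss keeps falling, which is the usual pattern of overfitting. Anyone re-running this recipe might consider keeping an earlier checkpoint.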
Framework versions
- Transformers 4.47.0
- Pytorch 2.5.1+cu124
- Datasets 3.1.0
- Tokenizers 0.21.0
Model tree for ElMad/aged-colt-222
Base model
google-bert/bert-base-cased