KanWasTaken's picture
Model save
cef68dc verified
|
raw
history blame
4.32 kB
metadata
library_name: transformers
tags:
  - generated_from_trainer
model-index:
  - name: WhartonDS_RegressionModel
    results: []

WhartonDS_RegressionModel

This model is a fine-tuned version of on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 0.0095

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 1e-05
  • train_batch_size: 256
  • eval_batch_size: 64
  • seed: 42
  • optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  • lr_scheduler_type: cosine
  • num_epochs: 60

Training results

Training Loss Epoch Step Validation Loss
0.0691 1.0 24 0.0664
0.0556 2.0 48 0.0614
0.0443 3.0 72 0.0559
0.0359 4.0 96 0.0493
0.0287 5.0 120 0.0287
0.0233 6.0 144 0.0206
0.0196 7.0 168 0.0170
0.0174 8.0 192 0.0155
0.0158 9.0 216 0.0156
0.0146 10.0 240 0.0135
0.0137 11.0 264 0.0123
0.0133 12.0 288 0.0122
0.013 13.0 312 0.0120
0.0125 14.0 336 0.0117
0.0123 15.0 360 0.0113
0.0119 16.0 384 0.0112
0.0119 17.0 408 0.0110
0.0118 18.0 432 0.0109
0.0116 19.0 456 0.0108
0.0113 20.0 480 0.0106
0.0113 21.0 504 0.0103
0.0111 22.0 528 0.0103
0.0111 23.0 552 0.0103
0.011 24.0 576 0.0102
0.0108 25.0 600 0.0101
0.0107 26.0 624 0.0100
0.0107 27.0 648 0.0100
0.0107 28.0 672 0.0099
0.0105 29.0 696 0.0097
0.0104 30.0 720 0.0098
0.0104 31.0 744 0.0097
0.0104 32.0 768 0.0096
0.0103 33.0 792 0.0097
0.0103 34.0 816 0.0098
0.0103 35.0 840 0.0097
0.0102 36.0 864 0.0098
0.0103 37.0 888 0.0096
0.0103 38.0 912 0.0095
0.0102 39.0 936 0.0095
0.0101 40.0 960 0.0096
0.0101 41.0 984 0.0096
0.0101 42.0 1008 0.0094
0.0102 43.0 1032 0.0096
0.0101 44.0 1056 0.0094
0.0101 45.0 1080 0.0095
0.01 46.0 1104 0.0095
0.01 47.0 1128 0.0094
0.01 48.0 1152 0.0094
0.01 49.0 1176 0.0094
0.0101 50.0 1200 0.0094
0.0099 51.0 1224 0.0094
0.01 52.0 1248 0.0094
0.0101 53.0 1272 0.0094
0.01 54.0 1296 0.0093
0.01 55.0 1320 0.0094
0.01 56.0 1344 0.0093
0.0101 57.0 1368 0.0095
0.0099 58.0 1392 0.0095
0.0099 59.0 1416 0.0095
0.01 60.0 1440 0.0095

Framework versions

  • Transformers 4.47.0
  • Pytorch 2.5.1+cu121
  • Datasets 3.2.0
  • Tokenizers 0.21.0