segformer-b0-scene-parse-150

This model is a fine-tuned version of nvidia/mit-b0 on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 1.9881
  • Mean Iou: 0.0898
  • Mean Accuracy: 0.1508
  • Overall Accuracy: 0.6092

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 6e-05
  • train_batch_size: 2
  • eval_batch_size: 2
  • seed: 42
  • optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  • lr_scheduler_type: linear
  • num_epochs: 50

Training results

Training Loss Epoch Step Validation Loss Mean Iou Mean Accuracy Overall Accuracy
4.8641 1.0 20 4.9027 0.0114 0.0472 0.2362
4.4897 2.0 40 4.5156 0.0223 0.0976 0.3997
4.1506 3.0 60 3.9759 0.0415 0.1049 0.4639
3.9239 4.0 80 3.5152 0.0513 0.1151 0.4998
3.7073 5.0 100 3.1904 0.0662 0.1180 0.5333
3.4644 6.0 120 3.0165 0.0694 0.1181 0.5548
3.3538 7.0 140 2.9451 0.0698 0.1189 0.5437
2.9297 8.0 160 2.8302 0.0794 0.1252 0.5704
3.0257 9.0 180 2.6410 0.0801 0.1307 0.5749
2.8081 10.0 200 2.4851 0.0803 0.1259 0.5883
2.6384 11.0 220 2.4904 0.0853 0.1318 0.5817
2.5555 12.0 240 2.3738 0.0874 0.1296 0.6095
2.3822 13.0 260 2.3239 0.0864 0.1317 0.5924
2.4267 14.0 280 2.2778 0.0843 0.1286 0.6019
2.3553 15.0 300 2.2917 0.0836 0.1301 0.5899
2.1810 16.0 320 2.2187 0.0873 0.1388 0.5972
2.2971 17.0 340 2.1589 0.0890 0.1423 0.6107
2.2366 18.0 360 2.2463 0.0866 0.1484 0.5969
2.2299 19.0 380 2.0700 0.0900 0.1415 0.6184
1.9433 20.0 400 2.0822 0.0891 0.1457 0.6177
1.9851 21.0 420 2.1299 0.0885 0.1484 0.5968
1.9428 22.0 440 2.0912 0.0856 0.1416 0.6082
1.6539 23.0 460 2.2378 0.0866 0.1464 0.5902
1.9056 24.0 480 2.0942 0.0896 0.1423 0.6142
1.6493 25.0 500 2.0635 0.0915 0.1477 0.6160
1.7495 26.0 520 2.0396 0.0920 0.1483 0.6179
1.5518 27.0 540 2.0270 0.0904 0.1485 0.6098
1.6620 28.0 560 2.0564 0.0931 0.1585 0.6098
1.5614 29.0 580 2.0542 0.0942 0.1536 0.6106
1.5037 30.0 600 2.0698 0.0874 0.1477 0.5995
1.3493 31.0 620 2.0744 0.0880 0.1539 0.5926
1.6179 32.0 640 2.0404 0.0906 0.1493 0.6081
1.5231 33.0 660 2.0153 0.0907 0.1506 0.6085
1.4016 34.0 680 2.1269 0.0860 0.1513 0.5846
1.5147 35.0 700 2.0160 0.0920 0.1472 0.6120
1.4723 36.0 720 1.9838 0.0948 0.1525 0.6211
1.3963 37.0 740 2.0161 0.0886 0.1514 0.6108
1.5678 38.0 760 2.0203 0.0887 0.1526 0.6133
1.4418 39.0 780 1.9610 0.0932 0.1470 0.6112
1.4737 40.0 800 1.9609 0.0912 0.1466 0.6106
1.3634 41.0 820 1.9471 0.0942 0.1509 0.6175
1.4730 42.0 840 1.9611 0.0939 0.1520 0.6152
1.4209 43.0 860 1.9680 0.0901 0.1465 0.6044
1.2518 44.0 880 1.9577 0.0918 0.1502 0.6121
1.2380 45.0 900 1.9631 0.0909 0.1516 0.6097
1.3753 46.0 920 1.9620 0.0919 0.1489 0.6090
1.2756 47.0 940 1.9659 0.0909 0.1499 0.6104
1.3696 48.0 960 1.9568 0.0898 0.1495 0.6098
1.2820 49.0 980 2.0069 0.0911 0.1524 0.6072
1.3792 50.0 1000 1.9881 0.0898 0.1508 0.6092

Framework versions

  • Transformers 5.8.0
  • Pytorch 2.10.0+cu128
  • Datasets 4.8.5
  • Tokenizers 0.22.2
Downloads last month
785
Safetensors
Model size
3.75M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for dltpdn/segformer-b0-scene-parse-150

Base model

nvidia/mit-b0
Finetuned
(459)
this model