wi1vdcj9_20250706_183307
This model is a fine-tuned version of meta-llama/Llama-3.2-1B on an unknown dataset. It achieves the following results on the evaluation set:
- Loss: 0.8078
- Model Preparation Time: 0.0074
- Move Accuracy: 0.1523
- Token Accuracy: 0.6926
- Accuracy: 0.1523
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 0.001
- train_batch_size: 128
- eval_batch_size: 256
- seed: 42
- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
- lr_scheduler_type: constant_with_warmup
- lr_scheduler_warmup_ratio: 0.001
- num_epochs: 100
Training results
| Training Loss | Epoch | Step | Validation Loss | Model Preparation Time | Move Accuracy | Token Accuracy | Accuracy |
|---|---|---|---|---|---|---|---|
| No log | 0 | 0 | 11.9474 | 0.0074 | 0.0 | 0.0000 | 0.0 |
| 3.24 | 0.0098 | 100 | 3.2207 | 0.0074 | 0.0 | 0.2334 | 0.0 |
| 1.6407 | 0.0196 | 200 | 1.6433 | 0.0074 | 0.0014 | 0.3720 | 0.0014 |
| 1.501 | 0.0295 | 300 | 1.5145 | 0.0074 | 0.0098 | 0.4154 | 0.0098 |
| 1.4178 | 0.0393 | 400 | 1.4807 | 0.0074 | 0.0028 | 0.4266 | 0.0028 |
| 1.399 | 0.0491 | 500 | 1.4411 | 0.0074 | 0.0034 | 0.4417 | 0.0034 |
| 1.3539 | 0.0589 | 600 | 1.4110 | 0.0074 | 0.0071 | 0.4568 | 0.0071 |
| 1.3825 | 0.0687 | 700 | 1.3934 | 0.0074 | 0.0071 | 0.4667 | 0.0071 |
| 1.4135 | 0.0785 | 800 | 1.3575 | 0.0074 | 0.0162 | 0.4832 | 0.0162 |
| 1.3352 | 0.0884 | 900 | 1.3415 | 0.0074 | 0.0237 | 0.4885 | 0.0237 |
| 1.3507 | 0.0982 | 1000 | 1.3371 | 0.0074 | 0.0217 | 0.4912 | 0.0217 |
| 1.2986 | 0.1080 | 1100 | 1.3064 | 0.0074 | 0.0262 | 0.4990 | 0.0262 |
| 1.2421 | 0.1178 | 1200 | 1.2852 | 0.0074 | 0.0289 | 0.5069 | 0.0289 |
| 1.279 | 0.1276 | 1300 | 1.2777 | 0.0074 | 0.0316 | 0.5145 | 0.0316 |
| 1.2612 | 0.1374 | 1400 | 1.2643 | 0.0074 | 0.0304 | 0.5142 | 0.0304 |
| 1.2101 | 0.1473 | 1500 | 1.2590 | 0.0074 | 0.0389 | 0.5189 | 0.0389 |
| 1.2314 | 0.1571 | 1600 | 1.2229 | 0.0074 | 0.0390 | 0.5327 | 0.0390 |
| 1.1841 | 0.1669 | 1700 | 1.2078 | 0.0074 | 0.0404 | 0.5371 | 0.0404 |
| 1.2232 | 0.1767 | 1800 | 1.2045 | 0.0074 | 0.0425 | 0.5405 | 0.0425 |
| 1.1652 | 0.1865 | 1900 | 1.1870 | 0.0074 | 0.0500 | 0.5457 | 0.0500 |
| 1.149 | 0.1963 | 2000 | 1.1765 | 0.0074 | 0.0508 | 0.5496 | 0.0508 |
| 1.14 | 0.2062 | 2100 | 1.1662 | 0.0074 | 0.0492 | 0.5502 | 0.0492 |
| 1.1436 | 0.2160 | 2200 | 1.1622 | 0.0074 | 0.0537 | 0.5542 | 0.0537 |
| 1.1889 | 0.2258 | 2300 | 1.1567 | 0.0074 | 0.0467 | 0.5554 | 0.0467 |
| 1.0404 | 0.2356 | 2400 | 1.1339 | 0.0074 | 0.0566 | 0.5657 | 0.0566 |
| 1.0835 | 0.2454 | 2500 | 1.1196 | 0.0074 | 0.0585 | 0.5713 | 0.0585 |
| 1.1243 | 0.2553 | 2600 | 1.1185 | 0.0074 | 0.0631 | 0.5715 | 0.0631 |
| 1.1253 | 0.2651 | 2700 | 1.0997 | 0.0074 | 0.0671 | 0.5794 | 0.0671 |
| 1.0546 | 0.2749 | 2800 | 1.0907 | 0.0074 | 0.0725 | 0.5822 | 0.0725 |
| 1.0784 | 0.2847 | 2900 | 1.0893 | 0.0074 | 0.0660 | 0.5794 | 0.0660 |
| 1.0447 | 0.2945 | 3000 | 1.0600 | 0.0074 | 0.0714 | 0.5934 | 0.0714 |
| 1.0217 | 0.3043 | 3100 | 1.0520 | 0.0074 | 0.0710 | 0.5944 | 0.0710 |
| 1.0453 | 0.3142 | 3200 | 1.0415 | 0.0074 | 0.0778 | 0.6021 | 0.0778 |
| 1.0655 | 0.3240 | 3300 | 1.0294 | 0.0074 | 0.0801 | 0.6047 | 0.0801 |
| 0.9568 | 0.3338 | 3400 | 1.0192 | 0.0074 | 0.0774 | 0.6097 | 0.0774 |
| 0.968 | 0.3436 | 3500 | 1.0053 | 0.0074 | 0.0824 | 0.6141 | 0.0824 |
| 1.012 | 0.3534 | 3600 | 1.0100 | 0.0074 | 0.0820 | 0.6123 | 0.0820 |
| 0.9446 | 0.3632 | 3700 | 0.9983 | 0.0074 | 0.0800 | 0.6162 | 0.0800 |
| 1.0252 | 0.3731 | 3800 | 0.9838 | 0.0074 | 0.0865 | 0.6207 | 0.0865 |
| 0.9559 | 0.3829 | 3900 | 0.9691 | 0.0074 | 0.0905 | 0.6271 | 0.0905 |
| 0.9896 | 0.3927 | 4000 | 0.9663 | 0.0074 | 0.0896 | 0.6280 | 0.0896 |
| 0.9288 | 0.4025 | 4100 | 0.9559 | 0.0074 | 0.0914 | 0.6307 | 0.0914 |
| 0.8857 | 0.4123 | 4200 | 0.9645 | 0.0074 | 0.0928 | 0.6306 | 0.0928 |
| 0.9259 | 0.4221 | 4300 | 0.9336 | 0.0074 | 0.0987 | 0.6434 | 0.0987 |
| 0.9785 | 0.4320 | 4400 | 0.9329 | 0.0074 | 0.1040 | 0.6438 | 0.1040 |
| 0.9197 | 0.4418 | 4500 | 0.9349 | 0.0074 | 0.1048 | 0.6426 | 0.1048 |
| 0.8824 | 0.4516 | 4600 | 0.9130 | 0.0074 | 0.1067 | 0.6514 | 0.1067 |
| 0.9393 | 0.4614 | 4700 | 0.9109 | 0.0074 | 0.1027 | 0.6506 | 0.1027 |
| 0.9137 | 0.4712 | 4800 | 0.9032 | 0.0074 | 0.1078 | 0.6548 | 0.1078 |
| 0.8685 | 0.4811 | 4900 | 0.9086 | 0.0074 | 0.1089 | 0.6537 | 0.1089 |
| 0.8717 | 0.4909 | 5000 | 0.8973 | 0.0074 | 0.1163 | 0.6573 | 0.1163 |
| 0.9079 | 0.5007 | 5100 | 0.8854 | 0.0074 | 0.1167 | 0.6637 | 0.1167 |
| 0.9103 | 0.5105 | 5200 | 0.8983 | 0.0074 | 0.1096 | 0.6561 | 0.1096 |
| 0.913 | 0.5203 | 5300 | 0.8824 | 0.0074 | 0.1178 | 0.6638 | 0.1178 |
| 0.8891 | 0.5301 | 5400 | 0.8857 | 0.0074 | 0.1157 | 0.6624 | 0.1157 |
| 0.8637 | 0.5400 | 5500 | 0.8689 | 0.0074 | 0.1235 | 0.6689 | 0.1235 |
| 0.896 | 0.5498 | 5600 | 0.8837 | 0.0074 | 0.1193 | 0.6642 | 0.1193 |
| 0.8577 | 0.5596 | 5700 | 0.8708 | 0.0074 | 0.1221 | 0.6671 | 0.1221 |
| 0.8727 | 0.5694 | 5800 | 0.8728 | 0.0074 | 0.1176 | 0.6675 | 0.1176 |
| 0.9231 | 0.5792 | 5900 | 0.8652 | 0.0074 | 0.1299 | 0.6723 | 0.1299 |
| 0.8357 | 0.5890 | 6000 | 0.8636 | 0.0074 | 0.1221 | 0.6726 | 0.1221 |
| 0.8333 | 0.5989 | 6100 | 0.8590 | 0.0074 | 0.1247 | 0.6731 | 0.1247 |
| 0.8644 | 0.6087 | 6200 | 0.8640 | 0.0074 | 0.1272 | 0.6736 | 0.1272 |
| 0.8731 | 0.6185 | 6300 | 0.8508 | 0.0074 | 0.1272 | 0.6758 | 0.1272 |
| 0.8028 | 0.6283 | 6400 | 0.8445 | 0.0074 | 0.1317 | 0.6779 | 0.1317 |
| 0.8734 | 0.6381 | 6500 | 0.8440 | 0.0074 | 0.1348 | 0.6821 | 0.1348 |
| 0.8652 | 0.6479 | 6600 | 0.8452 | 0.0074 | 0.1359 | 0.6777 | 0.1359 |
| 0.7194 | 0.6578 | 6700 | 0.8416 | 0.0074 | 0.1342 | 0.6813 | 0.1342 |
| 0.8938 | 0.6676 | 6800 | 0.8475 | 0.0074 | 0.1305 | 0.6793 | 0.1305 |
| 0.8168 | 0.6774 | 6900 | 0.8397 | 0.0074 | 0.1350 | 0.6818 | 0.1350 |
| 0.8047 | 0.6872 | 7000 | 0.8368 | 0.0074 | 0.1363 | 0.6819 | 0.1363 |
| 0.8406 | 0.6970 | 7100 | 0.8378 | 0.0074 | 0.1386 | 0.6813 | 0.1386 |
| 0.7679 | 0.7069 | 7200 | 0.8268 | 0.0074 | 0.1455 | 0.6872 | 0.1455 |
| 0.8119 | 0.7167 | 7300 | 0.8265 | 0.0074 | 0.1396 | 0.6846 | 0.1396 |
| 0.8604 | 0.7265 | 7400 | 0.8292 | 0.0074 | 0.1455 | 0.6836 | 0.1455 |
| 0.7682 | 0.7363 | 7500 | 0.8299 | 0.0074 | 0.1467 | 0.6858 | 0.1467 |
| 0.8098 | 0.7461 | 7600 | 0.8267 | 0.0074 | 0.1420 | 0.6861 | 0.1420 |
| 0.8451 | 0.7559 | 7700 | 0.8307 | 0.0074 | 0.1421 | 0.6844 | 0.1421 |
| 0.816 | 0.7658 | 7800 | 0.8256 | 0.0074 | 0.1421 | 0.6866 | 0.1421 |
| 0.77 | 0.7756 | 7900 | 0.8297 | 0.0074 | 0.1413 | 0.6867 | 0.1413 |
| 0.7973 | 0.7854 | 8000 | 0.8242 | 0.0074 | 0.1464 | 0.6882 | 0.1464 |
| 0.7655 | 0.7952 | 8100 | 0.8109 | 0.0074 | 0.1466 | 0.6919 | 0.1466 |
| 0.8402 | 0.8050 | 8200 | 0.8158 | 0.0074 | 0.1452 | 0.6891 | 0.1452 |
| 0.7909 | 0.8148 | 8300 | 0.8240 | 0.0074 | 0.1408 | 0.6870 | 0.1408 |
| 0.7969 | 0.8247 | 8400 | 0.8135 | 0.0074 | 0.1409 | 0.6909 | 0.1409 |
| 0.8035 | 0.8345 | 8500 | 0.8164 | 0.0074 | 0.1431 | 0.6887 | 0.1431 |
| 0.8178 | 0.8443 | 8600 | 0.8250 | 0.0074 | 0.1423 | 0.6878 | 0.1423 |
| 0.795 | 0.8541 | 8700 | 0.8158 | 0.0074 | 0.1461 | 0.6899 | 0.1461 |
| 0.8431 | 0.8639 | 8800 | 0.8215 | 0.0074 | 0.1494 | 0.6896 | 0.1494 |
| 0.856 | 0.8737 | 8900 | 0.8096 | 0.0074 | 0.1471 | 0.6912 | 0.1471 |
| 0.7861 | 0.8836 | 9000 | 0.8123 | 0.0074 | 0.1441 | 0.6908 | 0.1441 |
| 0.7866 | 0.8934 | 9100 | 0.8252 | 0.0074 | 0.1423 | 0.6862 | 0.1423 |
| 0.855 | 0.9032 | 9200 | 0.8310 | 0.0074 | 0.1328 | 0.6852 | 0.1328 |
| 0.8046 | 0.9130 | 9300 | 0.8099 | 0.0074 | 0.1501 | 0.6939 | 0.1501 |
| 0.7777 | 0.9228 | 9400 | 0.8054 | 0.0074 | 0.1492 | 0.6946 | 0.1492 |
| 0.8388 | 0.9327 | 9500 | 0.8241 | 0.0074 | 0.1477 | 0.6869 | 0.1477 |
| 0.8031 | 0.9425 | 9600 | 0.8078 | 0.0074 | 0.1523 | 0.6926 | 0.1523 |
| 0.8637 | 0.9523 | 9700 | 0.8258 | 0.0074 | 0.1429 | 0.6863 | 0.1429 |
| 0.8029 | 0.9621 | 9800 | 0.8280 | 0.0074 | 0.1497 | 0.6872 | 0.1497 |
| 0.8676 | 0.9719 | 9900 | 0.8267 | 0.0074 | 0.1350 | 0.6836 | 0.1350 |
| 0.8323 | 0.9817 | 10000 | 0.8216 | 0.0074 | 0.1432 | 0.6872 | 0.1432 |
| 0.8044 | 0.9916 | 10100 | 0.8606 | 0.0074 | 0.1273 | 0.6750 | 0.1273 |
| 0.8753 | 1.0014 | 10200 | 0.8952 | 0.0074 | 0.1173 | 0.6642 | 0.1173 |
| 0.8331 | 1.0112 | 10300 | 0.8443 | 0.0074 | 0.1259 | 0.6794 | 0.1259 |
| 0.8309 | 1.0210 | 10400 | 0.8640 | 0.0074 | 0.1245 | 0.6733 | 0.1245 |
| 0.8116 | 1.0308 | 10500 | 0.8522 | 0.0074 | 0.1334 | 0.6788 | 0.1334 |
| 0.8556 | 1.0406 | 10600 | 0.8523 | 0.0074 | 0.1290 | 0.6773 | 0.1290 |
| 0.8961 | 1.0505 | 10700 | 0.9135 | 0.0074 | 0.1109 | 0.6558 | 0.1109 |
| 0.872 | 1.0603 | 10800 | 0.8709 | 0.0074 | 0.1226 | 0.6694 | 0.1226 |
| 0.9388 | 1.0701 | 10900 | 0.9032 | 0.0074 | 0.1117 | 0.6598 | 0.1117 |
| 0.8977 | 1.0799 | 11000 | 0.8771 | 0.0074 | 0.1164 | 0.6669 | 0.1164 |
| 0.8306 | 1.0897 | 11100 | 0.8883 | 0.0074 | 0.1151 | 0.6633 | 0.1151 |
| 0.9442 | 1.0995 | 11200 | 0.8849 | 0.0074 | 0.1182 | 0.6661 | 0.1182 |
| 0.873 | 1.1094 | 11300 | 0.8737 | 0.0074 | 0.1278 | 0.6694 | 0.1278 |
| 0.9291 | 1.1192 | 11400 | 0.9105 | 0.0074 | 0.1066 | 0.6549 | 0.1066 |
| 0.9171 | 1.1290 | 11500 | 0.9212 | 0.0074 | 0.1029 | 0.6537 | 0.1029 |
| 0.9219 | 1.1388 | 11600 | 0.9536 | 0.0074 | 0.0912 | 0.6370 | 0.0912 |
| 0.9688 | 1.1486 | 11700 | 0.9418 | 0.0074 | 0.0926 | 0.6439 | 0.0926 |
| 0.9282 | 1.1585 | 11800 | 0.9435 | 0.0074 | 0.0948 | 0.6426 | 0.0948 |
| 0.9858 | 1.1683 | 11900 | 0.9537 | 0.0074 | 0.0914 | 0.6415 | 0.0914 |
| 0.9413 | 1.1781 | 12000 | 0.9470 | 0.0074 | 0.0886 | 0.6392 | 0.0886 |
| 0.9603 | 1.1879 | 12100 | 0.9642 | 0.0074 | 0.0789 | 0.6334 | 0.0789 |
| 0.941 | 1.1977 | 12200 | 0.9631 | 0.0074 | 0.0818 | 0.6374 | 0.0818 |
| 0.9748 | 1.2075 | 12300 | 0.9894 | 0.0074 | 0.0876 | 0.6244 | 0.0876 |
| 1.0202 | 1.2174 | 12400 | 1.0030 | 0.0074 | 0.0769 | 0.6208 | 0.0769 |
| 0.9572 | 1.2272 | 12500 | 0.9575 | 0.0074 | 0.0866 | 0.6401 | 0.0866 |
| 0.8971 | 1.2370 | 12600 | 0.9700 | 0.0074 | 0.0778 | 0.6324 | 0.0778 |
| 0.9748 | 1.2468 | 12700 | 1.0161 | 0.0074 | 0.0688 | 0.6163 | 0.0688 |
| 1.0526 | 1.2566 | 12800 | 1.0049 | 0.0074 | 0.0725 | 0.6204 | 0.0725 |
| 1.0469 | 1.2664 | 12900 | 1.0372 | 0.0074 | 0.0655 | 0.6084 | 0.0655 |
| 0.9927 | 1.2763 | 13000 | 0.9946 | 0.0074 | 0.0703 | 0.6218 | 0.0703 |
| 1.0123 | 1.2861 | 13100 | 0.9582 | 0.0074 | 0.0868 | 0.6350 | 0.0868 |
| 1.2317 | 1.2959 | 13200 | 1.2106 | 0.0074 | 0.0293 | 0.5333 | 0.0293 |
| 1.1082 | 1.3057 | 13300 | 1.1069 | 0.0074 | 0.0449 | 0.5769 | 0.0449 |
| 1.1547 | 1.3155 | 13400 | 1.1249 | 0.0074 | 0.0446 | 0.5726 | 0.0446 |
| 0.9867 | 1.3253 | 13500 | 1.0435 | 0.0074 | 0.0626 | 0.6012 | 0.0626 |
| 1.0453 | 1.3352 | 13600 | 1.0407 | 0.0074 | 0.0581 | 0.6013 | 0.0581 |
| 1.0585 | 1.3450 | 13700 | 1.0425 | 0.0074 | 0.0592 | 0.6032 | 0.0592 |
| 1.0436 | 1.3548 | 13800 | 1.0303 | 0.0074 | 0.0646 | 0.6083 | 0.0646 |
| 0.9745 | 1.3646 | 13900 | 1.0415 | 0.0074 | 0.0559 | 0.6050 | 0.0559 |
| 1.1399 | 1.3744 | 14000 | 1.1701 | 0.0074 | 0.0351 | 0.5558 | 0.0351 |
| 1.1231 | 1.3843 | 14100 | 1.0773 | 0.0074 | 0.0521 | 0.5927 | 0.0521 |
| 0.968 | 1.3941 | 14200 | 1.0427 | 0.0074 | 0.0621 | 0.6044 | 0.0621 |
| 1.1768 | 1.4039 | 14300 | 1.1917 | 0.0074 | 0.0272 | 0.5405 | 0.0272 |
| 1.0553 | 1.4137 | 14400 | 1.0930 | 0.0074 | 0.0492 | 0.5811 | 0.0492 |
| 1.2772 | 1.4235 | 14500 | 1.2570 | 0.0074 | 0.0209 | 0.5092 | 0.0209 |
| 1.2338 | 1.4333 | 14600 | 1.2319 | 0.0074 | 0.0226 | 0.5195 | 0.0226 |
| 1.6905 | 1.4432 | 14700 | 1.6638 | 0.0074 | 0.0010 | 0.3531 | 0.0010 |
| 1.619 | 1.4530 | 14800 | 1.5834 | 0.0074 | 0.0034 | 0.3704 | 0.0034 |
| 1.644 | 1.4628 | 14900 | 1.6462 | 0.0074 | 0.0021 | 0.3564 | 0.0021 |
| 1.6394 | 1.4726 | 15000 | 1.5897 | 0.0074 | 0.0040 | 0.3739 | 0.0040 |
| 1.5479 | 1.4824 | 15100 | 1.5391 | 0.0074 | 0.0026 | 0.3850 | 0.0026 |
| 1.536 | 1.4922 | 15200 | 1.5667 | 0.0074 | 0.0006 | 0.3593 | 0.0006 |
| 1.5497 | 1.5021 | 15300 | 1.5620 | 0.0074 | 0.0027 | 0.3758 | 0.0027 |
| 1.5733 | 1.5119 | 15400 | 1.5300 | 0.0074 | 0.0007 | 0.3846 | 0.0007 |
| 1.5289 | 1.5217 | 15500 | 1.5208 | 0.0074 | 0.0052 | 0.3951 | 0.0052 |
| 1.544 | 1.5315 | 15600 | 1.5446 | 0.0074 | 0.0033 | 0.3837 | 0.0033 |
| 1.6313 | 1.5413 | 15700 | 1.6249 | 0.0074 | 0.0013 | 0.3553 | 0.0013 |
| 1.6409 | 1.5511 | 15800 | 1.6350 | 0.0074 | 0.0008 | 0.3404 | 0.0008 |
| 1.6881 | 1.5610 | 15900 | 1.6802 | 0.0074 | 0.0023 | 0.3618 | 0.0023 |
| 1.5856 | 1.5708 | 16000 | 1.5956 | 0.0074 | 0.0003 | 0.3740 | 0.0003 |
| 1.5598 | 1.5806 | 16100 | 1.5856 | 0.0074 | 0.0015 | 0.3784 | 0.0015 |
| 1.5692 | 1.5904 | 16200 | 1.6013 | 0.0074 | 0.0 | 0.3508 | 0.0 |
| 1.552 | 1.6002 | 16300 | 1.5620 | 0.0074 | 0.0014 | 0.3811 | 0.0014 |
| 1.5708 | 1.6101 | 16400 | 1.5635 | 0.0074 | 0.0013 | 0.3837 | 0.0013 |
| 1.5673 | 1.6199 | 16500 | 1.5936 | 0.0074 | 0.0019 | 0.3648 | 0.0019 |
| 1.555 | 1.6297 | 16600 | 1.5752 | 0.0074 | 0.0019 | 0.3757 | 0.0019 |
| 1.5661 | 1.6395 | 16700 | 1.5705 | 0.0074 | 0.0021 | 0.3662 | 0.0021 |
| 1.569 | 1.6493 | 16800 | 1.5806 | 0.0074 | 0.0008 | 0.3759 | 0.0008 |
| 1.6772 | 1.6591 | 16900 | 1.6513 | 0.0074 | 0.0008 | 0.3471 | 0.0008 |
| 1.6117 | 1.6690 | 17000 | 1.5943 | 0.0074 | 0.0035 | 0.3649 | 0.0035 |
| 1.6281 | 1.6788 | 17100 | 1.6273 | 0.0074 | 0.0006 | 0.3506 | 0.0006 |
| 1.6285 | 1.6886 | 17200 | 1.6462 | 0.0074 | 0.0019 | 0.3402 | 0.0019 |
| 1.6368 | 1.6984 | 17300 | 1.6529 | 0.0074 | 0.0004 | 0.3305 | 0.0004 |
| 1.9079 | 1.7082 | 17400 | 1.7858 | 0.0074 | 0.0006 | 0.3388 | 0.0006 |
| 1.6186 | 1.7180 | 17500 | 1.6339 | 0.0074 | 0.0003 | 0.3589 | 0.0003 |
| 1.6363 | 1.7279 | 17600 | 1.6601 | 0.0074 | 0.0 | 0.3406 | 0.0 |
| 1.6297 | 1.7377 | 17700 | 1.6429 | 0.0074 | 0.0 | 0.3561 | 0.0 |
| 1.6109 | 1.7475 | 17800 | 1.6086 | 0.0074 | 0.0010 | 0.3564 | 0.0010 |
| 1.6253 | 1.7573 | 17900 | 1.6136 | 0.0074 | 0.0020 | 0.3619 | 0.0020 |
| 1.6469 | 1.7671 | 18000 | 1.6048 | 0.0074 | 0.0019 | 0.3520 | 0.0019 |
| 1.6646 | 1.7769 | 18100 | 1.5883 | 0.0074 | 0.0 | 0.3773 | 0.0 |
| 1.634 | 1.7868 | 18200 | 1.6503 | 0.0074 | 0.0007 | 0.3550 | 0.0007 |
| 1.5912 | 1.7966 | 18300 | 1.5754 | 0.0074 | 0.0009 | 0.3713 | 0.0009 |
| 1.5395 | 1.8064 | 18400 | 1.5825 | 0.0074 | 0.0012 | 0.3721 | 0.0012 |
| 1.5949 | 1.8162 | 18500 | 1.5672 | 0.0074 | 0.0028 | 0.3715 | 0.0028 |
| 1.5803 | 1.8260 | 18600 | 1.5623 | 0.0074 | 0.0017 | 0.3741 | 0.0017 |
| 1.5247 | 1.8359 | 18700 | 1.5613 | 0.0074 | 0.0 | 0.3794 | 0.0 |
| 1.5456 | 1.8457 | 18800 | 1.5595 | 0.0074 | 0.0011 | 0.3766 | 0.0011 |
| 1.6075 | 1.8555 | 18900 | 1.5897 | 0.0074 | 0.0 | 0.3614 | 0.0 |
| 1.6064 | 1.8653 | 19000 | 1.7312 | 0.0074 | 0.0021 | 0.3680 | 0.0021 |
| 1.5555 | 1.8751 | 19100 | 1.5758 | 0.0074 | 0.0001 | 0.3677 | 0.0001 |
| 1.5675 | 1.8849 | 19200 | 1.5762 | 0.0074 | 0.0050 | 0.3707 | 0.0050 |
| 1.5514 | 1.8948 | 19300 | 1.5750 | 0.0074 | 0.0007 | 0.3698 | 0.0007 |
| 1.562 | 1.9046 | 19400 | 1.6466 | 0.0074 | 0.0012 | 0.3527 | 0.0012 |
| 1.5988 | 1.9144 | 19500 | 1.5863 | 0.0074 | 0.0021 | 0.3677 | 0.0021 |
| 1.5904 | 1.9242 | 19600 | 1.6006 | 0.0074 | 0.0032 | 0.3610 | 0.0032 |
| 1.5849 | 1.9340 | 19700 | 1.5916 | 0.0074 | 0.0003 | 0.3604 | 0.0003 |
Framework versions
- PEFT 0.15.2
- Transformers 4.51.3
- Pytorch 2.6.0+cu124
- Datasets 3.5.0
- Tokenizers 0.21.1
- Downloads last month
- 1
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for donoway/wi1vdcj9_20250706_183307
Base model
meta-llama/Llama-3.2-1B