slac-new-palate-none
This model is a fine-tuned version of on an unknown dataset. It achieves the following results on the evaluation set:
- Loss: 0.4204
- Accuracy: 0.9418
- F1 Macro: 0.8572
- Precision Macro: 0.8700
- Recall Macro: 0.8455
- F1 Micro: 0.9418
- Precision Micro: 0.9418
- Recall Micro: 0.9418
- Total Tf: [1457, 90, 1457, 90]
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 64
- eval_batch_size: 64
- seed: 42
- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 188
- num_epochs: 15
Training results
| Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro | Precision Macro | Recall Macro | F1 Micro | Precision Micro | Recall Micro | Total Tf |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.1783 | 1.0 | 189 | 0.1800 | 0.9367 | 0.8292 | 0.8849 | 0.7912 | 0.9367 | 0.9367 | 0.9367 | [1449, 98, 1449, 98] |
| 0.1334 | 2.0 | 378 | 0.1565 | 0.9451 | 0.8698 | 0.8690 | 0.8707 | 0.9451 | 0.9451 | 0.9451 | [1462, 85, 1462, 85] |
| 0.1027 | 3.0 | 567 | 0.1633 | 0.9438 | 0.8636 | 0.8717 | 0.8560 | 0.9438 | 0.9438 | 0.9438 | [1460, 87, 1460, 87] |
| 0.0829 | 4.0 | 756 | 0.2096 | 0.9354 | 0.8500 | 0.8423 | 0.8582 | 0.9354 | 0.9354 | 0.9354 | [1447, 100, 1447, 100] |
| 0.0553 | 5.0 | 945 | 0.2599 | 0.9412 | 0.8538 | 0.8717 | 0.8381 | 0.9412 | 0.9412 | 0.9412 | [1456, 91, 1456, 91] |
| 0.0346 | 6.0 | 1134 | 0.2825 | 0.9379 | 0.8520 | 0.8536 | 0.8503 | 0.9379 | 0.9379 | 0.9379 | [1451, 96, 1451, 96] |
| 0.0269 | 7.0 | 1323 | 0.3381 | 0.9392 | 0.8530 | 0.8599 | 0.8464 | 0.9392 | 0.9392 | 0.9392 | [1453, 94, 1453, 94] |
| 0.015 | 8.0 | 1512 | 0.3315 | 0.9405 | 0.8496 | 0.8750 | 0.8284 | 0.9405 | 0.9405 | 0.9405 | [1455, 92, 1455, 92] |
| 0.0101 | 9.0 | 1701 | 0.3932 | 0.9367 | 0.8373 | 0.8688 | 0.8122 | 0.9367 | 0.9367 | 0.9367 | [1449, 98, 1449, 98] |
| 0.0065 | 10.0 | 1890 | 0.3801 | 0.9431 | 0.8583 | 0.8774 | 0.8416 | 0.9431 | 0.9431 | 0.9431 | [1459, 88, 1459, 88] |
| 0.0069 | 11.0 | 2079 | 0.3929 | 0.9425 | 0.8556 | 0.8779 | 0.8365 | 0.9425 | 0.9425 | 0.9425 | [1458, 89, 1458, 89] |
| 0.0054 | 12.0 | 2268 | 0.4073 | 0.9412 | 0.8523 | 0.8745 | 0.8335 | 0.9412 | 0.9412 | 0.9412 | [1456, 91, 1456, 91] |
| 0.0055 | 13.0 | 2457 | 0.4138 | 0.9431 | 0.8575 | 0.8789 | 0.8392 | 0.9431 | 0.9431 | 0.9431 | [1459, 88, 1459, 88] |
| 0.0046 | 14.0 | 2646 | 0.4186 | 0.9418 | 0.8557 | 0.8727 | 0.8408 | 0.9418 | 0.9418 | 0.9418 | [1457, 90, 1457, 90] |
| 0.002 | 15.0 | 2835 | 0.4204 | 0.9418 | 0.8572 | 0.8700 | 0.8455 | 0.9418 | 0.9418 | 0.9418 | [1457, 90, 1457, 90] |
Framework versions
- Transformers 4.52.4
- Pytorch 2.6.0+cu124
- Datasets 3.6.0
- Tokenizers 0.21.2
- Downloads last month
- -
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support