ssc-koo-mms-model-mix-adapt-max3
This model was trained from scratch on an unknown dataset. It achieves the following results on the evaluation set:
- Loss: 0.9431
- Cer: 0.2157
- Wer: 0.7053
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 0.0005
- train_batch_size: 1
- eval_batch_size: 6
- seed: 42
- gradient_accumulation_steps: 2
- total_train_batch_size: 2
- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 100
- num_epochs: 5
- mixed_precision_training: Native AMP
Training results
| Training Loss | Epoch | Step | Validation Loss | Cer | Wer |
|---|---|---|---|---|---|
| 0.5135 | 0.2380 | 200 | 1.0034 | 0.2243 | 0.7293 |
| 0.5991 | 0.4759 | 400 | 1.0588 | 0.2352 | 0.7711 |
| 0.6066 | 0.7139 | 600 | 0.9370 | 0.2254 | 0.7299 |
| 0.8816 | 0.9518 | 800 | 0.9418 | 0.2289 | 0.7431 |
| 0.7057 | 1.1892 | 1000 | 0.9382 | 0.2275 | 0.7372 |
| 0.5411 | 1.4271 | 1200 | 0.9608 | 0.2203 | 0.7210 |
| 0.7028 | 1.6651 | 1400 | 0.9315 | 0.2225 | 0.7227 |
| 0.4847 | 1.9030 | 1600 | 0.9501 | 0.2188 | 0.7142 |
| 0.6112 | 2.1404 | 1800 | 0.9569 | 0.2203 | 0.7210 |
| 0.4671 | 2.3783 | 2000 | 0.9296 | 0.2213 | 0.7068 |
| 0.5675 | 2.6163 | 2200 | 0.9371 | 0.2188 | 0.7181 |
| 0.4673 | 2.8543 | 2400 | 0.9604 | 0.2200 | 0.7149 |
| 0.5106 | 3.0916 | 2600 | 0.9521 | 0.2164 | 0.7073 |
| 0.4248 | 3.3296 | 2800 | 0.9527 | 0.2208 | 0.7176 |
| 0.4779 | 3.5675 | 3000 | 0.9301 | 0.2215 | 0.7177 |
| 0.4251 | 3.8055 | 3200 | 0.9468 | 0.2178 | 0.7106 |
| 0.5291 | 4.0428 | 3400 | 0.9580 | 0.2143 | 0.7068 |
| 0.5338 | 4.2808 | 3600 | 0.9324 | 0.2171 | 0.7092 |
| 0.4777 | 4.5187 | 3800 | 0.9244 | 0.2192 | 0.7096 |
| 0.4777 | 4.7567 | 4000 | 0.9326 | 0.2166 | 0.7070 |
| 0.4889 | 4.9946 | 4200 | 0.9431 | 0.2157 | 0.7053 |
Framework versions
- Transformers 4.52.1
- Pytorch 2.9.1+cu128
- Datasets 3.6.0
- Tokenizers 0.21.4
- Downloads last month
- -