ssc-aln-mms-model
This model is a fine-tuned version of facebook/mms-1b-all on an unknown dataset. It achieves the following results on the evaluation set:
- Loss: 1.1100
- Cer: 0.2283
- Wer: 0.5834
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 0.0003
- train_batch_size: 8
- eval_batch_size: 12
- seed: 42
- gradient_accumulation_steps: 2
- total_train_batch_size: 16
- optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 100
- num_epochs: 10
- mixed_precision_training: Native AMP
Training results
| Training Loss | Epoch | Step | Validation Loss | Cer | Wer |
|---|---|---|---|---|---|
| 2.3582 | 0.2851 | 200 | 1.4872 | 0.2751 | 0.6849 |
| 2.1108 | 0.5702 | 400 | 1.3452 | 0.2502 | 0.6423 |
| 2.056 | 0.8553 | 600 | 1.2937 | 0.2430 | 0.6208 |
| 2.0388 | 1.1397 | 800 | 1.1876 | 0.2569 | 0.6355 |
| 2.0228 | 1.4248 | 1000 | 1.1998 | 0.2405 | 0.6055 |
| 2.0217 | 1.7099 | 1200 | 1.1975 | 0.2359 | 0.6023 |
| 1.9443 | 1.9950 | 1400 | 1.1995 | 0.2350 | 0.5977 |
| 1.9195 | 2.2794 | 1600 | 1.1800 | 0.2352 | 0.5950 |
| 1.9821 | 2.5645 | 1800 | 1.1510 | 0.2352 | 0.5921 |
| 1.9628 | 2.8496 | 2000 | 1.1757 | 0.2327 | 0.5916 |
| 1.9479 | 3.1340 | 2200 | 1.1498 | 0.2340 | 0.5928 |
| 1.9345 | 3.4191 | 2400 | 1.1608 | 0.2310 | 0.5865 |
| 1.9219 | 3.7042 | 2600 | 1.1495 | 0.2314 | 0.5861 |
| 1.9985 | 3.9893 | 2800 | 1.1387 | 0.2329 | 0.5903 |
| 2.0339 | 4.2737 | 3000 | 1.1173 | 0.2365 | 0.5936 |
| 1.9431 | 4.5588 | 3200 | 1.1311 | 0.2320 | 0.5868 |
| 1.9065 | 4.8439 | 3400 | 1.1352 | 0.2301 | 0.5853 |
| 1.9288 | 5.1283 | 3600 | 1.1312 | 0.2301 | 0.5832 |
| 1.9472 | 5.4134 | 3800 | 1.1270 | 0.2302 | 0.5843 |
| 2.0217 | 5.6985 | 4000 | 1.1364 | 0.2279 | 0.5820 |
| 1.9219 | 5.9836 | 4200 | 1.1106 | 0.2303 | 0.5842 |
| 1.8896 | 6.2680 | 4400 | 1.1137 | 0.2303 | 0.5840 |
| 1.8987 | 6.5531 | 4600 | 1.1149 | 0.2298 | 0.5835 |
| 1.9852 | 6.8382 | 4800 | 1.0969 | 0.2351 | 0.5942 |
| 1.9597 | 7.1226 | 5000 | 1.1094 | 0.2308 | 0.5860 |
| 1.9146 | 7.4077 | 5200 | 1.1121 | 0.2290 | 0.5830 |
| 1.9448 | 7.6928 | 5400 | 1.1201 | 0.2279 | 0.5816 |
| 1.9027 | 7.9779 | 5600 | 1.1121 | 0.2283 | 0.5811 |
| 1.8896 | 8.2623 | 5800 | 1.1168 | 0.2282 | 0.5828 |
| 1.962 | 8.5474 | 6000 | 1.1039 | 0.2301 | 0.5854 |
| 1.9398 | 8.8325 | 6200 | 1.1078 | 0.2289 | 0.5833 |
| 1.9436 | 9.1169 | 6400 | 1.1099 | 0.2284 | 0.5824 |
| 1.8984 | 9.4020 | 6600 | 1.1051 | 0.2289 | 0.5845 |
| 1.9456 | 9.6871 | 6800 | 1.1091 | 0.2282 | 0.5828 |
| 1.8893 | 9.9722 | 7000 | 1.1100 | 0.2283 | 0.5834 |
Framework versions
- Transformers 4.57.2
- Pytorch 2.9.1+cu128
- Datasets 3.6.0
- Tokenizers 0.22.0
- Downloads last month
- -
Model tree for ctaguchi/ssc-aln-mms-model
Base model
facebook/mms-1b-all