Whisper large-v3 kh - Sethisak San

This model is a fine-tuned version of openai/whisper-large-v3 on the KhmerAsrDataset dataset. It achieves the following results on the evaluation set:

  • Loss: 0.0830
  • Wer: 81.8018
  • Cer: 20.7231

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 2
  • eval_batch_size: 8
  • seed: 42
  • gradient_accumulation_steps: 8
  • total_train_batch_size: 16
  • optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 500
  • training_steps: 10000
  • mixed_precision_training: Native AMP

Training results

Training Loss Epoch Step Cer Validation Loss Wer
0.0232 1.6086 1000 24.9919 0.1027 92.9730
0.0044 3.2157 2000 22.4125 0.0933 88.8288
0.0015 4.8243 3000 22.8354 0.0882 89.5495
0.0001 6.4315 4000 21.4830 0.0924 86.6667
0.2061 8.0499 5000 0.1793 97.1171 33.1714
0.1139 9.6585 6000 0.1184 93.3333 25.9980
0.0739 11.2656 7000 0.0932 87.5676 22.9656
0.0515 12.8742 8000 0.0838 86.4865 22.1825
0.0384 14.4814 9000 0.0823 81.4414 21.5597
0.0131 16.0885 10000 0.0830 81.8018 20.7231

Framework versions

  • Transformers 4.48.0
  • Pytorch 2.8.0+cu126
  • Datasets 2.14.7
  • Tokenizers 0.21.4
Downloads last month
6
Safetensors
Model size
2B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for S-Sethisak/whisper-largev3-kh

Finetuned
(760)
this model

Dataset used to train S-Sethisak/whisper-largev3-kh

Evaluation results