wme_30s_Static_atWall_1.1

This model is a fine-tuned version of openai/whisper-medium.en on the Common Voice 17.0 dataset. It achieves the following results on the evaluation set:

Loss: 0.9036
Wer: 28.7814

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 3e-05
train_batch_size: 48
eval_batch_size: 32
seed: 42
optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: linear
lr_scheduler_warmup_steps: 20
training_steps: 176
mixed_precision_training: Native AMP

Training results

Training Loss	Epoch	Step	Validation Loss	Wer
No log	0	0	1.5148	34.2009
0.8025	0.25	44	0.9935	31.2921
0.5599	0.5	88	0.9522	28.6589
0.5354	0.75	132	0.9182	28.5671
0.4127	1.0057	176	0.9036	28.7814

Framework versions

Transformers 4.51.3
Pytorch 2.6.0+cu124
Datasets 3.6.0
Tokenizers 0.21.1

Downloads last month: 1

Safetensors

Model size

0.8B params

Tensor type

F32

Model tree for xbilek25/wme_30s_Static_atWall_1.1

Base model

openai/whisper-medium.en

Finetuned

(93)

this model

Dataset used to train xbilek25/wme_30s_Static_atWall_1.1

Evaluation results

Wer on Common Voice 17.0
self-reported

28.781