FanatikSpeechToText

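This model is a fine-tuned version of openai/whisper-large-v3. The training dataset is not specified (the card records it as None). At the final checkpoint (step 4000) it achieves the following results on the evaluation set:

Loss: 2.2409
Wer: 93.8013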

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

The following hyperparameters were used during training:

learning_rate: 1e-05
train_batch_size: 16
eval_batch_size: 8
seed: 42
optimizer: AdamW (OptimizerNames.ADAMW_TORCH) with betas=(0.9, 0.999) and epsilon=1e-08; no additional optimizer arguments
lr_scheduler_type: linear
lr_scheduler_warmup_steps: 500
training_steps: 4000
mixed_precision_training: Native AMP
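
To make the schedule concrete, here is a minimal sketch of how the learning rate would evolve over the 4000 training steps. It assumes the linear scheduler has the usual shape (linear warmup from 0 to the peak rate over the warmup steps, then linear decay to 0 at the last step); the function name `linear_schedule_lr` is hypothetical and is not part of the training code.

```python
def linear_schedule_lr(step, base_lr=1e-5, warmup_steps=500, total_steps=4000):
    """Learning rate at a given optimizer step.

    Assumed shape: linear warmup to base_lr, then linear decay to zero.
    Defaults mirror the hyperparameters listed in this card.
    """
    if step < warmup_steps:
        # Warmup: scale linearly from 0 up to base_lr.
        return base_lr * step / warmup_steps
    # Decay: scale linearly from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / (total_steps - warmup_steps)


if __name__ == "__main__":
    # Print the rate at warmup, at the peak, and at the logged checkpoints.
    for step in (0, 250, 500, 1000, 2000, 3000, 4000):
        print(f"step {step:>4}: lr = {linear_schedule_lr(step):.2e}")
```

Under these assumptions the rate peaks at 1e-05 at step 500 and has already fallen to roughly 2.9e-06 by step 3000, so the later checkpoints are trained with much smaller updates than the earlier ones.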

Training results

Training Loss	Epoch	Step	Validation Loss	Wer
1.6988	1.9646	1000	1.7485	152.7657
1.3704	3.9293	2000	1.7755	146.4441
0.8549	5.8939	3000	2.0022	100.8839
0.4885	7.8585	4000	2.2409	93.8013
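
Wer values above 100 in this table are possible because word error rate divides the number of word-level edits by the length of the reference, and inserted words count as edits. The sketch below illustrates this with a plain word-level edit distance. It assumes Wer is reported as a percentage and that no text normalization is applied; the card states neither, and the function name `word_error_rate` is hypothetical.

```python
def word_error_rate(reference, hypothesis):
    """Word error rate in percent: word-level edit distance / reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return 100.0 * prev[-1] / len(ref)


if __name__ == "__main__":
    print(word_error_rate("hello world", "hello world"))                 # 0.0
    print(word_error_rate("hello world", "hello there big wide world"))  # 150.0
```

In the second call the hypothesis contains three extra words against a two-word reference, which gives 150. A model that hallucinates or repeats text can therefore score above 100 even when some of its words are correct.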

Model size: 2B params (Safetensors)
Tensor type: F32
