Whisper Large V3 Turbo - Uzbek

Fine-tuned openai/whisper-large-v3-turbo for Uzbek automatic speech recognition.

Usage

from transformers import WhisperForConditionalGeneration, WhisperProcessor
import librosa

model_id = "idrock/piyola-v1"

# Load the processor (feature extractor + tokenizer) and the fine-tuned model.
processor = WhisperProcessor.from_pretrained(model_id)
model = WhisperForConditionalGeneration.from_pretrained(model_id)

# Whisper expects 16 kHz mono audio; librosa resamples on load.
audio, sr = librosa.load("audio.wav", sr=16000)

# Convert the waveform to log-mel input features.
inputs = processor(audio, sampling_rate=sr, return_tensors="pt")
predicted_ids = model.generate(
    inputs.input_features,
    language="uz",
    task="transcribe",
    max_new_tokens=225,
)

# Decode the generated token IDs to text, dropping special tokens.
text = processor.batch_decode(predicted_ids, skip_special_tokens=True)[0]
print(text)
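
The snippet above passes the whole file to the processor in one call. By default, Whisper's feature extractor pads or truncates its input to a 30-second window, so longer recordings would need to be split first. The following is a minimal sketch of one way to do that, not part of the original model card: `split_into_windows` is a hypothetical helper, and cutting at fixed offsets is an assumption made for simplicity (it can split a word at a boundary).

```python
def split_into_windows(samples, sample_rate=16000, window_seconds=30.0):
    """Split a 1-D sequence of audio samples into consecutive windows.

    Hypothetical helper for illustration. Works with any sliceable
    sequence (a Python list or a NumPy array such as the one returned
    by librosa.load). The last window may be shorter than the others.
    """
    if sample_rate <= 0 or window_seconds <= 0:
        raise ValueError("sample_rate and window_seconds must be positive")
    window = int(sample_rate * window_seconds)  # samples per window
    return [
        samples[start:start + window]
        for start in range(0, len(samples), window)
    ]
```

Each returned window could then be passed through `processor(...)` and `model.generate(...)` exactly as in the usage snippet, and the decoded strings joined with spaces. Overlapping windows or silence-based splitting would likely give cleaner boundaries, at the cost of extra code.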

Training

  • Base model: openai/whisper-large-v3-turbo
  • Language: Uzbek (uz)
  • Task: Transcribe
  • Precision: BF16

License

Apache 2.0
