vhdm/persian-voice-v1
Viewer • Updated • 28.9k • 315 • 28
How to use DevMehdip/whisper-small-fa-lora with PEFT:
from peft import PeftModel
from transformers import AutoModelForSeq2SeqLM
base_model = AutoModelForSeq2SeqLM.from_pretrained("openai/whisper-small")
model = PeftModel.from_pretrained(base_model, "DevMehdip/whisper-small-fa-lora")How to use DevMehdip/whisper-small-fa-lora with Transformers:
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("DevMehdip/whisper-small-fa-lora", dtype="auto")A fine-tuned version of openai/whisper-small for Persian speech-to-text (ASR) using LoRA.
This model is optimized for persian conversational speech and dataset-quality audio.
This model is a LoRA fine-tuned Whisper Small focused on Persian (fa) speech recognition.
It improves transcription accuracy on standard Persian audio segments (16kHz, mono, normalized WAV).
openai/whisper-smallpersian-voice-v1 (single dataset)Users should:
Base model
openai/whisper-small
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("DevMehdip/whisper-small-fa-lora", dtype="auto")