You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Whisper Dhivehi Code-Switching ASR

Whisper-small fine-tuned for code-switched Dhivehi (with English and Arabic). Adds a custom <|dv|> language token to the tokenizer.

Usage

from transformers import pipeline

asr = pipeline(
    task="automatic-speech-recognition",
    model="Serialtechlab/whisper-dhivehi-code-switch-v2",
    device=0,
    chunk_length_s=10,
    stride_length_s=(1, 1),
    generate_kwargs={"num_beams": 3, "repetition_penalty": 1.05},
)

result = asr("audio.wav")
print(result["text"])

Training data

Fine-tuned on a synthetic code-switched dataset combining:

Dhivehi: Serialtechlab/dhivehi-mms-v5-combined, dhivehi-tts-preprocessed, dv-syn-female2-for-tts
English/Arabic loan words: google/fleurs (en_us, ar_eg)

Trained for 20,000 steps from whisper-small base, with a custom <|dv|> language token added.

Downloads last month: -

Safetensors

Model size

0.2B params

Tensor type

F32

Model tree for Serialtechlab/whisper-dhivehi-code-switch-v2

Base model

openai/whisper-small

Finetuned

(3578)

this model