Nagamese ASR

Nagamese is a widely used creole language of Northeast India, spoken daily across communities as a bridge of communication.
Despite its reach and importance, it has long remained largely absent from modern speech technology.

This model demonstrates that Nagamese works with contemporary ASR systems-that spoken Nagamese can be recognized and transcribed in practical, real-world settings.

The goal of this project is to create the much needed presence and usability: making Nagamese visible, usable, and supported in today’s AI ecosystem with Nagamese Automatic Speech Recognition (ASR).

Model Details

Base model: openai/whisper-small
Language: Nagamese
Task: Automatic Speech Recognition (ASR)
Framework: Transformers (PyTorch)
Checkpoint format: safetensors

Training Strategy

Stage 1 – Real Speech Training

Dataset: Vaani (real Nagamese speech)
~12k utterances
Byte-based audio loading (no HF audio decoding)
Result: Strong language understanding, robust to noise

Stage 2 – Synthetic Polishing (Final Model)

Dataset: Synthetic Nagamese TTS (~2k samples)
Purpose:
- Improve fluency
- Fix incomplete word endings
- Reduce hallucinations and instability
Training:
- Low learning rate
- Short run (800 steps)
Evaluation was always done on real Nagamese speech

This staged approach avoids synthetic bias while improving real-world usability.

Evaluation Notes

Raw WER is high due to noise tags and annotation artifacts in real data
Qualitative evaluation shows:
- Correct semantics
- Improved fluency over Stage‑1
- Better robustness to spontaneous speech

This model was selected based on human evaluation, not raw WER alone.

Usage

from transformers import pipeline

asr = pipeline(
    "automatic-speech-recognition",
    model="MWirelabs/nagamese-asr"
)

asr("audio.wav")

Limitations

Designed specifically for Nagamese
Not intended for multilingual or code-switched ASR
Raw WER should not be used as the sole quality metric

Acknowledgements

OpenAI Whisper
ARTPARK / IISc Vaani datasets
Synthetic speech generated for research purposes

Citation

Citation: https://huggingface.co/MWirelabs/nagamese-asr

@misc{mwirlabs_nagamese_asr_2026, title = {Nagamese ASR}, author = {MWire Labs}, year = {2026}, howpublished = {\url{https://huggingface.co/MWirelabs/nagamese-asr}}, note = {Stage-2 fine-tuned Whisper model for Nagamese speech recognition} }

Downloads last month: 4

Safetensors

Model size

0.2B params

Tensor type

F32

Model tree for MWirelabs/nagamese-asr

Base model

openai/whisper-small

Finetuned

(3591)

this model

MWirelabs
/

nagamese-asr