Spaces:

PlotweaverModel
/

ncair-asr-api

Sleeping

App Files Files Community

ncair-asr-api / README.md

PlotweaverModel

Upload 4 files

fe3947d verified 5 days ago

preview code

Raw

History Blame Contribute Delete

2.56 kB

	---
	title: NCAIR ASR API
	emoji: 🎙️
	colorFrom: green
	colorTo: blue
	sdk: docker
	app_port: 7860
	pinned: false
	---

	## NCAIR Nigerian ASR — API backend

	An OpenAI-compatible speech-to-text service for the voice assistant. It loads the
	right NCAIR Whisper model based on the request's `language` field, so one endpoint
	covers Yorùbá, Igbo, Hausa, and Nigerian-accented English.

	```
	POST /v1/audio/transcriptions multipart: file [, model] [, language] -> {"text": "..."}
	GET /health
	```

	\| language value \| Model \|
	\|----------------\|-------\|
	\| `yo` / `yoruba` \| NCAIR1/Yoruba-ASR \|
	\| `ig` / `igbo` \| NCAIR1/Igbo-ASR \|
	\| `ha` / `hausa` \| NCAIR1/Hausa-ASR \|
	\| `en` / `english` / `nigerian english` \| NCAIR1/NigerianAccentedEnglish \|

	Models load lazily on first use and are cached, so startup is fast and only the
	languages you use get loaded. CPU is enough (Whisper Small).

	---

	### Before you deploy

	These models are gated and license-restricted:

	1. Accept the terms for each model you'll use on Hugging Face (Yoruba-ASR,
	Igbo-ASR, Hausa-ASR, NigerianAccentedEnglish) with the account whose token
	you'll use.
	2. Add an `HF_TOKEN` Secret so the gated weights can download.
	3. License: research/innovation license — not for commercial or large-scale
	use (>1000 active users) without a separate Awarri agreement. Attribution:
	*"developed by Awarri Technologies in partnership with the Federal Government of
	Nigeria."* (Not legal advice.)

	---

	### Deploy

	1. Create a Docker → Blank Space, push `app.py`, `requirements.txt`,
	`Dockerfile`, `README.md`.
	2. Add the `HF_TOKEN` Secret.
	3. Wait for the build; the first call per language downloads that model.

	### Wire it into voice-ai-demo

	On the voice-ai-demo Space, set per language (Yoruba shown):

	\| Secret \| Value \|
	\|--------\|-------\|
	\| `ASR_YORUBA_BASE_URL` \| `https://<this-space>.hf.space/v1` \|

	The demo sends `language=yo`, so this service returns Yoruba. For Igbo/Hausa/English,
	add those languages to the demo's `config.json` and set `ASR_<LANG>_BASE_URL` to the
	same URL — the service routes by the language hint.

	### Local test

	```bash
	curl -X POST http://localhost:7860/v1/audio/transcriptions \
	-F "file=@sample.wav" -F "language=yo"
	```

	### Project Structure

	```
	ncair-asr-api/
	├── app.py # FastAPI OpenAI-compatible ASR (multi-language)
	├── requirements.txt # transformers 5.x + serving deps
	├── Dockerfile # CPU image with ffmpeg
	└── README.md # this file
	```