--- title: Kashmiri Streaming ASR Zipformer emoji: ๐Ÿ’ป colorFrom: purple colorTo: gray sdk: docker pinned: false short_description: 'Online Streaming ASR System for Kashmiri Language' license: apache-2.0 --- # ๐ŸŽ™๏ธ Real-Time Kashmiri Streaming ASR (FastAPI + Sherpa-ONNX) This project demonstrates a real-time speech-to-text (ASR) web application with: * ๐ŸŽ›๏ธ Hugging Face Deployment taken from [Luigi](https://huggingface.co/spaces/Luigi/Streaming-Zipformer) * ๐Ÿง  [Sherpa-ONNX](https://github.com/k2-fsa/sherpa-onnx) streaming Zipformer model * ๐Ÿš€ FastAPI backend with WebSocket support * โ˜๏ธ Docker-compatible deployment (CPU-only) on Hugging Face Spaces ## ๐Ÿค– Training * Model: [Zipformer Small](https://github.com/k2-fsa/icefall) * Dataset: [IndicVoices](https://huggingface.co/datasets/ai4bharat/IndicVoices) * WER: 36% ## ๐Ÿงช Local Development 1. **Install dependencies** ```bash pip install -r requirements.txt ``` 2. **Run the app locally** ```bash uvicorn app.main:app --reload --host 0.0.0.0 --port 8501 ``` Open [http://localhost:8501](http://localhost:8501) in your browser. [https://k2-fsa.github.io/sherpa/ncnn/endpoint.html](https://k2-fsa.github.io/sherpa/ncnn/endpoint.html) ## ๐Ÿ“ Project Structure ``` . โ”œโ”€โ”€ app โ”‚ โ”œโ”€โ”€ main.py โ”‚ โ”œโ”€โ”€ asr.py โ”‚ โ””โ”€โ”€ model parts โ””โ”€โ”€ All Model parts here (encoder, decoder, joiner, tokens) โ”œโ”€โ”€ index.html โ”œโ”€โ”€ requirements.txt โ”œโ”€โ”€ Dockerfile โ””โ”€โ”€ README.md ``` ## ๐Ÿ”ง Credits * [Sherpa-ONNX](https://github.com/k2-fsa/sherpa-onnx) * [OpenCC](https://github.com/BYVoid/OpenCC) * [FastAPI](https://fastapi.tiangolo.com/) * [Hugging Face Spaces](https://huggingface.co/docs/hub/spaces) * [AI4Bharat](https://ai4bharat.iitm.ac.in/) * [Icefall-K2](https://github.com/k2-fsa/icefall) Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference