Add calendar-month hotwords to test spoken-date recognition ce538a0 Chirag-Bhardwaj commited on 6 days ago
Preserve \n through whitespace cleanup so "new line" survives strip d23bfc4 Chirag-Bhardwaj commited on 9 days ago
Lower default KENLM_ALPHA 0.5β0.2, BETA 1.5β1.0 (#5) 18192a6 deepakkaura chirag18 commited on 11 days ago
Use Python 3.10 (pypi-kenlm incompatible with 3.11) (#4) cd595e5 deepakkaura chirag18 commited on 11 days ago
Add C++ build deps so pypi-kenlm can compile in slim image (#3) 5f8798a deepakkaura chirag18 commited on 11 days ago
Download KenLM at server startup, not Docker build (#2) d1fc078 deepakkaura chirag18 commited on 12 days ago
Enable KenLM domain-LM shallow fusion at decode time (#1) 810b118 deepakkaura chirag18 commited on 12 days ago
Run model fp32 on GPU (fp16 produced degenerate logits) ea3dae0 verified deepakkaura commited on 12 days ago
Upcast fp16 logits to fp32 before pyctcdecode (fixes empty-beam error) 8f824ff verified deepakkaura commited on 12 days ago
Tune for speed: beam_width 8->4, hotword_weight 10->5 046871c verified deepakkaura commited on 12 days ago
Fix IndentationError: collapse duplicated regex lines e62dce0 verified deepakkaura commited on 12 days ago
Debug log + handle SentencePiece-prefixed brace tokens b03b022 verified deepakkaura commited on 12 days ago
Convert MedASR </s> sentence-boundary token to period e52390b verified deepakkaura commited on 12 days ago
Mark pad token as CTC blank for pyctcdecode (fix off-by-one) d6abe69 verified deepakkaura commited on 12 days ago
Fix decoder vocab-size mismatch (use model.config.vocab_size) 5ab66c5 verified deepakkaura commited on 12 days ago
Beam search + radiology hotwords + /transcribe-pcm endpoint 7ad44fe verified deepakkaura commited on 12 days ago