Keep structured date extraction opt-in 58bb624 Running Chirag-Bhardwaj commited on about 14 hours ago
Add calendar-month hotwords to test spoken-date recognition ce538a0 Chirag-Bhardwaj commited on 1 day ago
Preserve \n through whitespace cleanup so "new line" survives strip d23bfc4 Chirag-Bhardwaj commited on 4 days ago
Lower default KENLM_ALPHA 0.5β0.2, BETA 1.5β1.0 (#5) 18192a6 deepakkaura chirag18 commited on 6 days ago
Download KenLM at server startup, not Docker build (#2) d1fc078 deepakkaura chirag18 commited on 7 days ago
Enable KenLM domain-LM shallow fusion at decode time (#1) 810b118 deepakkaura chirag18 commited on 7 days ago
Run model fp32 on GPU (fp16 produced degenerate logits) ea3dae0 verified deepakkaura commited on 8 days ago
Upcast fp16 logits to fp32 before pyctcdecode (fixes empty-beam error) 8f824ff verified deepakkaura commited on 8 days ago
Tune for speed: beam_width 8->4, hotword_weight 10->5 046871c verified deepakkaura commited on 8 days ago
Fix IndentationError: collapse duplicated regex lines e62dce0 verified deepakkaura commited on 8 days ago
Debug log + handle SentencePiece-prefixed brace tokens b03b022 verified deepakkaura commited on 8 days ago
Convert MedASR </s> sentence-boundary token to period e52390b verified deepakkaura commited on 8 days ago
Mark pad token as CTC blank for pyctcdecode (fix off-by-one) d6abe69 verified deepakkaura commited on 8 days ago
Fix decoder vocab-size mismatch (use model.config.vocab_size) 5ab66c5 verified deepakkaura commited on 8 days ago
Beam search + radiology hotwords + /transcribe-pcm endpoint 7ad44fe verified deepakkaura commited on 8 days ago