medasr-server / server.py

Commit History

Keep structured date extraction opt-in
58bb624
Running

Chirag-Bhardwaj commited on

Reject inferred day-level structured dates
0816810

Chirag-Bhardwaj commited on

Guard structured dates against inferred parts
02757e2

Chirag-Bhardwaj commited on

Add structured date extractor
fc5ec37

Chirag-Bhardwaj commited on

Add experimental OpenAI date sidecar endpoint
1279cfd

Chirag-Bhardwaj commited on

Add calendar-month hotwords to test spoken-date recognition
ce538a0

Chirag-Bhardwaj commited on

Preserve \n through whitespace cleanup so "new line" survives strip
d23bfc4

Chirag-Bhardwaj commited on

Fix NameError on CORPUS_HOTWORDS
a67f0ce
verified

deepakkaura commited on

Add 86 corpus-mined hotwords
4134cad
verified

deepakkaura commited on

Always re-download LM on startup
cd70be3
verified

deepakkaura commited on

Add N-best LM rescoring path
1f324e5
verified

deepakkaura commited on

Truly bypass LM when KENLM_ALPHA<=0
98cb0ee
verified

deepakkaura commited on

Lower default KENLM_ALPHA 0.5β†’0.2, BETA 1.5β†’1.0 (#5)
18192a6

deepakkaura chirag18 commited on

Download KenLM at server startup, not Docker build (#2)
d1fc078

deepakkaura chirag18 commited on

Enable KenLM domain-LM shallow fusion at decode time (#1)
810b118

deepakkaura chirag18 commited on

Run model fp32 on GPU (fp16 produced degenerate logits)
ea3dae0
verified

deepakkaura commited on

Upcast fp16 logits to fp32 before pyctcdecode (fixes empty-beam error)
8f824ff
verified

deepakkaura commited on

Use GPU when available (fp16 + autocast for T4)
c5fe792
verified

deepakkaura commited on

Tune for speed: beam_width 8->4, hotword_weight 10->5
046871c
verified

deepakkaura commited on

Fix IndentationError: collapse duplicated regex lines
e62dce0
verified

deepakkaura commited on

Debug log + handle SentencePiece-prefixed brace tokens
b03b022
verified

deepakkaura commited on

Convert MedASR </s> sentence-boundary token to period
e52390b
verified

deepakkaura commited on

Mark pad token as CTC blank for pyctcdecode (fix off-by-one)
d6abe69
verified

deepakkaura commited on

Fix decoder vocab-size mismatch (use model.config.vocab_size)
5ab66c5
verified

deepakkaura commited on

Beam search + radiology hotwords + /transcribe-pcm endpoint
7ad44fe
verified

deepakkaura commited on

Upload server.py with huggingface_hub
9fbb166
verified

deepakkaura commited on

Upload server.py with huggingface_hub
031189e
verified

deepakkaura commited on

Upload server.py with huggingface_hub
7007888
verified

deepakkaura commited on

Upload folder using huggingface_hub
52300cf
verified

deepakkaura commited on