Reduce Whisper hallucinations: condition_on_previous_text=False, temperature fallback, light VAD, no_speech filter cd96775 verified benhadjermed commited on 13 days ago
Migrate to faster-whisper with INT8 quantization for ~4x speedup 90b0434 verified benhadjermed commited on 29 days ago
fix: run partial inference in background to stop blocking WebSocket receive loop 8cb52fb verified benhadjermed commited on Apr 11
fix: skip partial inference if CPU is locked to prevent timeouts ac95f56 verified benhadjermed commited on Apr 11
feat: migrate to streaming transcriptions via WebSockets 3f4cf11 verified benhadjermed commited on Apr 11