perf: greedy decoding + inference_mode for coherent fast answers on CIF path 8ca39a4 Running verified thagnitti commited on 11 minutes ago
fix: ThreadingHTTPServer so health checks never block during inference + all CPU threads for generation 94c87ce verified thagnitti commited on 22 minutes ago
fix: increase inference timeout 30s->180s, reduce nSamples 3->1 and maxTokens 400->100 for CPU feasibility 47ceb69 verified thagnitti commited on 33 minutes ago
fix: complete model server HTTP handler, remove duplicate daemon, extend health timeout to 900s 93b88f0 verified thagnitti commited on about 1 hour ago
fix: extend daemon health-probe timeout 90s→300s for 515MB model download+load bf78de1 verified thagnitti commited on about 3 hours ago
fix: remove circular isCheckpointReady() guard from startModelDaemon — daemon now self-starts 836bf7d verified thagnitti commited on about 3 hours ago
fix: aiScore ReferenceError — declare variables before if/else block cb9df6f verified thagnitti commited on about 3 hours ago
feat: AI/Search % badge, search-conditioned model, daemon self-fetches checkpoint via hf_hub_download 34cc9fe verified thagnitti commited on about 3 hours ago
search-guided model: fetch first, condition model on results, maxTokens=400, usedModel flag 62de457 verified thagnitti commited on about 3 hours ago
model-first: generate tokens directly, web search for grounding only 2f7211a verified thagnitti commited on about 3 hours ago
Fix: remove broken getCodeIntro ref, restore working code template flow 67b5700 verified thagnitti commited on 2 days ago