feat: custom_hf without CUDA falls back to offline baseline d1f6a06 sanjay7676 commited on 29 days ago
fix(custom_hf): prefer dtype over deprecated torch_dtype in from_pretrained 5a67c2d sanjay7676 commited on 29 days ago
perf(ui): fewer Gradio steps + live STEPS_PER_EPISODE, tighter token cap 26e10b8 sanjay7676 commited on 29 days ago
perf(space-ui): faster Gradio custom_hf — UI candidates=1, cap tokens, GenerationConfig f37f4f8 sanjay7676 commited on 29 days ago
Finalize eval-friendly defaults: offline baseline, deterministic API reset, docs cleanup 0c741d9 sanjay7676 commited on 29 days ago
docs+router: HF Space secret names; auto includes custom_hf only when HF_TOKEN set a874a8f sanjay7676 commited on 29 days ago
fix(router): exclude custom_hf from auto chain so Space auto stays fast 9e4a6fa sanjay7676 commited on 29 days ago
perf(ui): default mock provider; auto tries NIM/OpenRouter before HF; cap HF router timeout 90s 665aab6 sanjay7676 commited on 29 days ago
feat: inference router (HF/NIM/OpenRouter/mock); README aligned with judge rubric; chart axis labels 1546bc7 sanjay7676 commited on 30 days ago