Fix Nano: call @spaces.GPU function directly from Gradio handler 6498d62 Julien Simon Claude Opus 4.6 committed on about 23 hours ago
Disable SSR mode — fixes button not working in Gradio 6 10a750b Julien Simon Claude Opus 4.6 committed on about 23 hours ago
Fix GPU task aborted: call @spaces.GPU synchronously, not via to_thread eceb156 Julien Simon Claude Opus 4.6 committed on about 23 hours ago
Fix ZeroGPU: follow documented pattern — model.to('cuda') at module level 2c9d67a Julien Simon Claude Opus 4.6 committed on about 24 hours ago
Fix Trinity Nano ZeroGPU: load on CPU, explicit CUDA move, no asyncio.to_thread fe3afee Julien Simon Claude Opus 4.6 committed on about 24 hours ago
Remove judge/scoring entirely — show responses and costs only 1eeb2cf Julien Simon Claude Opus 4.6 committed on about 24 hours ago
Remove Tab 2 (Pick the Right Model) — single-page layout 438b4b9 Julien Simon Claude Opus 4.6 committed on about 24 hours ago
feat: show each model response as it arrives, not waiting for all three cce524d Julien Simon Claude Opus 4.6 committed on about 24 hours ago
fix: eager import of local_model to prevent ZeroGPU abort 92b09a7 Julien Simon Claude Opus 4.6 committed on 1 day ago
fix: import spaces before gradio to patch torch.cuda correctly 891fc91 Julien Simon Claude Opus 4.6 committed on 1 day ago
fix: load model to CUDA at module level, increase GPU duration to 120s 8caf4df Julien Simon Claude Opus 4.6 committed on 1 day ago
fix: proper ZeroGPU pattern — load CPU at startup, move to CUDA on call 7747fc1 Julien Simon Claude Opus 4.6 committed on 1 day ago
fix: pin transformers<5.0 for Trinity Nano compatibility d12eb30 Julien Simon Claude Opus 4.6 committed on 1 day ago
fix: patch missing pad_token_id in AfmoeConfig for Trinity Nano d44489d Julien Simon Claude Opus 4.6 committed on 1 day ago
fix: import local_model at startup so ZeroGPU detects @spaces.GPU 7e3f24d Julien Simon Claude Opus 4.6 committed on 1 day ago
chore: switch judge to Gemini 2.5 Flash for better scoring calibration 97f7b97 Julien Simon Claude Opus 4.6 committed on 1 day ago
feat: run Trinity Nano locally via ZeroGPU, Mini/Large via OpenRouter d98b8b5 Julien Simon Claude Opus 4.6 committed on 1 day ago
feat: switch all models to HF Inference API, drop OpenRouter f08e25e Julien Simon Claude Opus 4.6 committed on 1 day ago
feat: use HF Inference API for Trinity Nano instead of OpenRouter 15de015 Julien Simon Claude Opus 4.6 committed on 1 day ago
feat: add streaming progress indicator, fix judge model, add debug logging 56ef1e6 Julien Simon Claude Opus 4.6 committed on 1 day ago
fix: add css to Blocks for HF Spaces compat, improve auth error message 21278e8 Julien Simon committed on 1 day ago
fix: move css/theme to launch() for Gradio 6 compatibility bbd3d9c Julien Simon committed on 1 day ago
feat: add sample prompts highlighting different model strengths d6ed563 Julien Simon committed on 1 day ago
feat: add Gradio UI with two-tab layout and custom dark theme ac361a6 Julien Simon Claude Sonnet 4.6 committed on 1 day ago
feat: add OpenRouter async client with retry logic and cost calculation c047ce3 Julien Simon Claude Sonnet 4.6 committed on 1 day ago