Commit History

Fix Nano: call @spaces.GPU function directly from Gradio handler
6498d62

Julien Simon, Claude Opus 4.6 committed on

Disable SSR mode — fixes button not working in Gradio 6
10a750b

Julien Simon, Claude Opus 4.6 committed on

Fix GPU task aborted: call @spaces.GPU synchronously, not via to_thread
eceb156

Julien Simon, Claude Opus 4.6 committed on

Fix ZeroGPU: follow documented pattern — model.to('cuda') at module level
2c9d67a

Julien Simon, Claude Opus 4.6 committed on

Fix Trinity Nano ZeroGPU: load on CPU, explicit CUDA move, no asyncio.to_thread
fe3afee

Julien Simon, Claude Opus 4.6 committed on

Remove judge/scoring entirely — show responses and costs only
1eeb2cf

Julien Simon, Claude Opus 4.6 committed on

Remove Tab 2 (Pick the Right Model) — single-page layout
438b4b9

Julien Simon, Claude Opus 4.6 committed on

feat: show each model response as it arrives, not waiting for all three
cce524d

Julien Simon, Claude Opus 4.6 committed on

fix: eager import of local_model to prevent ZeroGPU abort
92b09a7

Julien Simon, Claude Opus 4.6 committed on

fix: import spaces before gradio to patch torch.cuda correctly
891fc91

Julien Simon, Claude Opus 4.6 committed on

fix: load model to CUDA at module level, increase GPU duration to 120s
8caf4df

Julien Simon, Claude Opus 4.6 committed on

fix: proper ZeroGPU pattern — load CPU at startup, move to CUDA on call
7747fc1

Julien Simon, Claude Opus 4.6 committed on

fix: pin transformers<5.0 for Trinity Nano compatibility
d12eb30

Julien Simon, Claude Opus 4.6 committed on

fix: patch missing pad_token_id in AfmoeConfig for Trinity Nano
d44489d

Julien Simon, Claude Opus 4.6 committed on

fix: import local_model at startup so ZeroGPU detects @spaces.GPU
7e3f24d

Julien Simon, Claude Opus 4.6 committed on

chore: switch judge to Gemini 2.5 Flash for better scoring calibration
97f7b97

Julien Simon, Claude Opus 4.6 committed on

feat: run Trinity Nano locally via ZeroGPU, Mini/Large via OpenRouter
d98b8b5

Julien Simon, Claude Opus 4.6 committed on

feat: switch all models to HF Inference API, drop OpenRouter
f08e25e

Julien Simon, Claude Opus 4.6 committed on

feat: use HF Inference API for Trinity Nano instead of OpenRouter
15de015

Julien Simon, Claude Opus 4.6 committed on

feat: add streaming progress indicator, fix judge model, add debug logging
56ef1e6

Julien Simon, Claude Opus 4.6 committed on

fix: add css to Blocks for HF Spaces compat, improve auth error message
21278e8

Julien Simon committed on

fix: move css/theme to launch() for Gradio 6 compatibility
bbd3d9c

Julien Simon committed on

feat: add sample prompts highlighting different model strengths
d6ed563

Julien Simon committed on

fix: shorten Space description to meet 60 char limit
1f0d613

Julien Simon committed on

chore: add .gitignore
e0be116

Julien Simon committed on

feat: add Gradio UI with two-tab layout and custom dark theme
ac361a6

Julien Simon, Claude Sonnet 4.6 committed on

test: add end-to-end integration tests
d9feb8c

Julien Simon committed on

feat: add HF Space metadata and README
67eaf8f

Julien Simon committed on

feat: add comparison engine with best-value selection
3be2d14

Julien Simon committed on

feat: add judge system with anonymized shuffled scoring
50f7094

Julien Simon committed on

feat: add IP-based rate limiter with configurable window
7fb0e51

Julien Simon committed on

feat: add OpenRouter async client with retry logic and cost calculation
c047ce3

Julien Simon, Claude Sonnet 4.6 committed on

feat: add project config and dependencies
88b43a8

Julien Simon committed on

docs: add design spec and implementation plan
7268819

Julien Simon committed on