leaderboard: admin rescore (selected + all) reusing the eval pipeline 2893b22 Michael Rabinovich Cursor commited on about 3 hours ago
leaderboard: rename tabs, relabel fixtures as samples, inline gallery row stats 461547b Michael Rabinovich Cursor commited on about 10 hours ago
leaderboard: serve GT report assets via proxy; link them in hosted report 5140b0a Michael Rabinovich Cursor commited on about 11 hours ago
leaderboard: serve renders from the public bucket, not the dataset proxy d2161b1 Michael Rabinovich Cursor commited on about 18 hours ago
leaderboard: show edit-diff turntable in gallery grid for editing fixtures e611f15 Michael Rabinovich Cursor commited on 1 day ago
leaderboard: move Tasks tab after Leaderboard 31854f7 Michael Rabinovich Cursor commited on 1 day ago
leaderboard: add Tasks tab to browse benchmark fixtures f4924d6 Michael Rabinovich Cursor commited on 1 day ago
Serve rotating WebP turntables + GT generator c1cb5e4 Michael Rabinovich Cursor commited on 1 day ago
gallery: searchable fixture picker, URL-persisted picks, cached render proxies 4a9408a Michael Rabinovich Cursor commited on 1 day ago
submit: retry Hub commits on 429/5xx + persistent status panel b224eee Michael Rabinovich Cursor commited on 1 day ago
gallery: stop negative-caching render fetches 8eb8954 Michael Rabinovich Cursor commited on 3 days ago
add visual Gallery tab (top-10 verified, sticky GT, fixture picker) 01d67e9 Michael Rabinovich Cursor commited on 4 days ago
leaderboard: drop silent fallback; boot resilient on Hub read failure a662bfa Michael Rabinovich commited on 4 days ago
app: use a colon in the submit-tab system-agnostic note 8a21dae Michael Rabinovich commited on 6 days ago
app: credit Mecado as CAD data source in About tab 0c44305 Michael Rabinovich Cursor commited on 7 days ago
leaderboard: format submitted_at as `YYYY-MM-DD HH:MM UTC`; lock tables read-only c4e21b3 Michael Rabinovich commited on 8 days ago
submit: serialize cadgenbench evaluate to dodge cpu-upgrade contention 6e3ab50 Michael Rabinovich commited on 8 days ago
debug: add /debug/render-bench route for one-shot Chromium timing 37e45d8 Michael Rabinovich commited on 8 days ago
submit+app: gate Submit on HF OAuth; populate hf_username from profile c87b253 Michael Rabinovich commited on 8 days ago
submit+app: toast-based feedback (validating / queued / refreshed) 6facf47 Michael Rabinovich commited on 8 days ago
leaderboard+app: combined CSV download with validation_status discriminator f585077 Michael Rabinovich commited on 8 days ago
app: Citation + Validation Guidelines accordions, validation link in About 97b9a4a Michael Rabinovich commited on 8 days ago
app: render reports inline via iframe srcdoc (private-Space safe) 0e3b21f Michael Rabinovich commited on 8 days ago
app+leaderboard: link submission_name to a Space-side report proxy 77edebf Michael Rabinovich commited on 8 days ago
app+leaderboard: detail panel polish (rename link columns, fix broken report links) 1a8f331 Michael Rabinovich commited on 9 days ago
app: row-click detail panel below the leaderboard tables 3112173 Michael Rabinovich commited on 9 days ago
leaderboard: markdown link columns for agent_url, submission, report 53de73a Michael Rabinovich commited on 9 days ago
app: split Leaderboard into Validated + Unvalidated tables 046548a Michael Rabinovich commited on 9 days ago
app: move theme= from Blocks.launch() to Blocks() ctor (Gradio 5 API) 76f0611 Michael Rabinovich commited on 9 days ago
app+requirements: pin Gradio 5 + gradio_leaderboard, auto-refresh now works 4e86f82 Michael Rabinovich commited on 9 days ago
app: fix leaderboard auto-refresh via @gr .render(key=...) f2f35be Michael Rabinovich commited on 9 days ago
app: drive auto-refresh via gr.Timer().tick instead of every= on Dataframe 4ee70ef Michael Rabinovich commited on 9 days ago
submit: drop the agree checkbox, meta.json is the only consent gate 9e84592 Michael Rabinovich commited on 9 days ago
submit: drop dead form fields, meta.json is the only source f571140 Michael Rabinovich commited on 9 days ago
submit: move handle_submit into its own module + add validation pipeline 0501689 Michael Rabinovich commited on 9 days ago
refactor: split leaderboard read path into its own module b5ad973 Michael Rabinovich commited on 9 days ago
docs: rewrite Space README + About tab as declarative current state 628bc9e Michael Rabinovich commited on 9 days ago
docs: refresh stale status strings after org transfer 7dc3bc3 Michael Rabinovich Cursor commited on 10 days ago
ui: render validity_rate as percentage in the leaderboard table 00d6e0c Michael Rabinovich Cursor commited on 11 days ago