Commit History

Add researcher join analysis to eval detail
8c16960

Yanan Long committed on

Deploy DuckDB-backed frontend to
da8db3e

Jenny Chim committed on

Add DuckDB shadow-read backend with source-metadata fix
2fcae3f

Jenny Chim, Claude Opus 4.7 (1M context) committed on

Separate policy and researcher views
9b4cdbb

evijit HF Staff committed on

Add interpretive signals, corpus dashboard, and slice browser
bca888a

evijit HF Staff, Claude Opus 4.7 (1M context) committed on

Preserve evaluator_relationship when flattening model hierarchy
431b0cc

evijit HF Staff committed on

improve ux
8058fce

evijit HF Staff committed on

Differentiate audience modes and tighten eval navigation
d8c2856

evijit HF Staff committed on

Aggregate setup aliases and clarify benchmark variants
dd0b4fc

evijit HF Staff committed on

Fix RewardBench2 key normalization for matrix leaderboard routing
8821e18

evijit HF Staff committed on

Improve eval/model UX, lite data paths, and leaderboard clarity
436ada0

evijit HF Staff committed on

Add per-benchmark comparison histograms on model detail
415ac43

evijit HF Staff, Claude Opus 4.6 (1M context) committed on

Improve eval score displays and summary fallbacks
bd8cbe8

evijit HF Staff committed on

Refresh eval cards UI and backend data flow
c1f2130

evijit HF Staff committed on

Add survey submission and update survey text for public use
516ec04

evijit HF Staff, Claude Opus 4.6 (1M context) committed on

fix bugs
ae1dc39

evijit HF Staff committed on

fix bugs
04b4cff

evijit HF Staff committed on

ux changes
5f59721

evijit HF Staff committed on

Add survey
e7123f0

evijit HF Staff committed on

fix: align reporting cues and developer slugs
5ca5561

evijit HF Staff committed on

feat: refine model and benchmark exploration
03e2430

evijit HF Staff committed on

redesigned
3a12290

evijit HF Staff committed on

fix data
ddfc163

Avijit Ghosh committed on

Refactor: Update benchmarks with realistic data, fix UI stats, and improve About page
2554366

Avijit Ghosh committed on

new ux
6978d97

Avijit Ghosh committed on

fixed a lot of bugs, centralized schema
49d5ba7

evijit HF Staff committed on

unit tests added
a58dac7

evijit HF Staff committed on

added all the new files
509e21e

evijit HF Staff committed on