general-eval-card / lib /eval-processing.ts

Commit History

Refactor to align on benchmark hierarchy
2ed4959

j-chim commited on

Update with datafix v2
11542d9

j-chim commited on

Tighten eval cards UI and clean up stale local data
32864b0

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Separate policy and researcher views
9b4cdbb

evijit HF Staff commited on

Add interpretive signals, corpus dashboard, and slice browser
bca888a

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Aggregate setup aliases and clarify benchmark variants
dd0b4fc

evijit HF Staff commited on

Add per-benchmark comparison histograms on model detail
415ac43

evijit HF Staff Claude Opus 4.6 (1M context) commited on

Refresh eval cards UI and backend data flow
c1f2130

evijit HF Staff commited on

fix bugs
04b4cff

evijit HF Staff commited on

ux changes
5f59721

evijit HF Staff commited on

feat: refine model and benchmark exploration
03e2430

evijit HF Staff commited on

redesigned
3a12290

evijit HF Staff commited on

fix data
ddfc163

Avijit Ghosh commited on

Refactor: Update benchmarks with realistic data, fix UI stats, and improve About page
2554366

Avijit Ghosh commited on

new ux
6978d97

Avijit Ghosh commited on