Spaces:

evaleval
/

general-eval-card

Running

App Files Files Community

general-eval-card / components

Commit History

Merge remote-tracking branch 'origin/main' into feat/use-new-backend-data

25ba6d0

j-chim commited on 26 days ago

Tighten eval cards UI and clean up stale local data

32864b0

evijit HF Staff Claude Opus 4.7 (1M context) commited on 27 days ago

Integrate with test backend data

7635aee

j-chim commited on 27 days ago

Add new component files and align app to EvalEval design system

dbdd6d1

evijit HF Staff Claude Sonnet 4.6 commited on 27 days ago

Replace shadcn-styled UI elements with design system primitives

187ffe6

evijit HF Staff Claude Sonnet 4.6 commited on 27 days ago

Add plain-language captions and mode-aware framing for policy readers

3ad47c6

evijit HF Staff Claude Opus 4.7 (1M context) commited on 30 days ago

Align user-facing labels with paper terminology

4be62f9

evijit HF Staff Claude Opus 4.7 (1M context) commited on 30 days ago

Merge corpus dashboard into home as paper-aligned landing

5279156

evijit HF Staff Claude Opus 4.7 (1M context) commited on 30 days ago

Deploy DuckDB-backed frontend to

da8db3e

Jenny Chim commited on Apr 29

Separate policy and researcher views

9b4cdbb

evijit HF Staff commited on Apr 29

Add interpretive signals, corpus dashboard, and slice browser

bca888a

evijit HF Staff Claude Opus 4.7 (1M context) commited on Apr 27

improve ux

8058fce

evijit HF Staff commited on Apr 15

Differentiate audience modes and tighten eval navigation

d8c2856

evijit HF Staff commited on Apr 15

Aggregate setup aliases and clarify benchmark variants

dd0b4fc

evijit HF Staff commited on Apr 14

Improve eval/model UX, lite data paths, and leaderboard clarity

436ada0

evijit HF Staff commited on Apr 14

Improve homepage loading and eval grouping

26a0d2d

evijit HF Staff commited on Apr 14

Add per-benchmark comparison histograms on model detail

415ac43

evijit HF Staff Claude Opus 4.6 (1M context) commited on Apr 13

Improve eval score displays and summary fallbacks

bd8cbe8

evijit HF Staff commited on Apr 13

Refine evaluation browsing UX

a0dd44e

evijit HF Staff commited on Apr 13

Refresh eval cards UI and backend data flow

c1f2130

evijit HF Staff commited on Apr 10

fix bugs

29afc21

evijit HF Staff commited on Apr 7

fix bugs

ae1dc39

evijit HF Staff commited on Apr 7

fix bugs

04b4cff

evijit HF Staff commited on Apr 7

ux changes

5f59721

evijit HF Staff commited on Apr 6

Add survey

e7123f0

evijit HF Staff commited on Apr 6

fix: align reporting cues and developer slugs

5ca5561

evijit HF Staff commited on Mar 28

feat: refine model and benchmark exploration

03e2430

evijit HF Staff commited on Mar 28

redesigned

3a12290

evijit HF Staff commited on Mar 27

rename benchmark to eval

0eafde7

Avijit Ghosh commited on Dec 17, 2025