Commit History

Add researcher join analysis to eval detail
8c16960

Yanan Long committed on

Bake DuckDB envs into runner stage
051fa16

j-chim, Claude Opus 4.7 (1M context) committed on

Restore DuckDB-aware build cache logic
154e1d8

j-chim, Claude Opus 4.7 (1M context) committed on

Point Dockerfile at production card_backend dataset
db192b0

j-chim, Claude Opus 4.7 (1M context) committed on

Fix Fibble Arena (and similar) suite link routing
c569d0f

j-chim, Claude Opus 4.7 (1M context) committed on

Sort eval-detail filenames by codepoint for parity
819e7c9

Jenny Chim, Claude Opus 4.7 (1M context) committed on

Set LOCAL_PIPELINE_OUTPUT/HF_DATA_OFFLINE at Docker build time
fe5af86

Jenny Chim, Claude Opus 4.7 (1M context) committed on

Bake DuckDB build-time defaults into Dockerfile
34ddba0

Jenny Chim, Claude Opus 4.7 (1M context) committed on

Deploy DuckDB-backed frontend to
da8db3e

Jenny Chim committed on

Add three-tier test infrastructure for migration safety
d3cbe09

Jenny Chim, Claude Opus 4.7 (1M context) committed on

Add DuckDB shadow-read backend with source-metadata fix
2fcae3f

Jenny Chim, Claude Opus 4.7 (1M context) committed on

Separate policy and researcher views
9b4cdbb

evijit HF Staff committed on

Add interpretive signals, corpus dashboard, and slice browser
bca888a

evijit HF Staff, Claude Opus 4.7 (1M context) committed on

Preserve evaluator_relationship when flattening model hierarchy
431b0cc

evijit HF Staff committed on

improve ux
8058fce

evijit HF Staff committed on

Differentiate audience modes and tighten eval navigation
d8c2856

evijit HF Staff committed on

Aggregate setup aliases and clarify benchmark variants
dd0b4fc

evijit HF Staff committed on

Fix RewardBench2 key normalization for matrix leaderboard routing
8821e18

evijit HF Staff committed on

Improve eval/model UX, lite data paths, and leaderboard clarity
436ada0

evijit HF Staff committed on

Improve homepage loading and eval grouping
26a0d2d

evijit HF Staff committed on

Use HF dataset's peer-ranks.json instead of local recomputation
6a6446b

evijit HF Staff, Claude Opus 4.6 (1M context) committed on

Add per-benchmark comparison histograms on model detail
415ac43

evijit HF Staff, Claude Opus 4.6 (1M context) committed on

Add site favicon metadata
35729f5

evijit HF Staff committed on

Improve eval score displays and summary fallbacks
bd8cbe8

evijit HF Staff committed on

Harden aggregate evals and cache refresh
9d14977

evijit HF Staff committed on

Refine evaluation browsing UX
a0dd44e

evijit HF Staff committed on

Ignore local cache and data artifacts
0e12c7f

evijit HF Staff committed on

Refresh eval cards UI and backend data flow
c1f2130

evijit HF Staff committed on

Fix survey submit: use correct HF commit API JSON format with files array
c5372a8

evijit HF Staff, Claude Opus 4.6 (1M context) committed on

Fix survey submit: use multipart form data for HF commit API
9481599

evijit HF Staff, Claude Opus 4.6 (1M context) committed on

Add alert feedback on survey submit success/failure
872607f

evijit HF Staff, Claude Opus 4.6 (1M context) committed on

Fix HF commit API field: summary not commit_message
023694a

evijit HF Staff, Claude Opus 4.6 (1M context) committed on

Fix survey submission: use HF commit API instead of deprecated upload
ddf16f4

evijit HF Staff, Claude Opus 4.6 (1M context) committed on

Add survey submission and update survey text for public use
516ec04

evijit HF Staff, Claude Opus 4.6 (1M context) committed on

fix bugs
29afc21

evijit HF Staff committed on

fix bugs
ae1dc39

evijit HF Staff committed on

fix bugs
04b4cff

evijit HF Staff committed on

chore: sync EEE pipeline output [2026-04-07 05:13 UTC]
ddebd57

GitHub Actions committed on

ux changes
5f59721

evijit HF Staff committed on

Add survey
e7123f0

evijit HF Staff committed on

chore: sync EEE pipeline output [2026-04-06 05:29 UTC]
969ee51

GitHub Actions committed on

chore: sync EEE pipeline output [2026-04-05 05:17 UTC]
8bb81ee

GitHub Actions committed on

chore: sync EEE pipeline output [2026-04-04 04:53 UTC]
04e134c

GitHub Actions committed on

chore: sync EEE pipeline output [2026-04-03 05:09 UTC]
85f9d10

GitHub Actions committed on

chore: sync EEE pipeline output [2026-04-02 05:07 UTC]
49596d9

GitHub Actions committed on

chore: sync EEE pipeline output [2026-04-01 05:28 UTC]
d8be99e

GitHub Actions committed on

chore: sync EEE pipeline output [2026-03-31 05:13 UTC]
abb1ee6

GitHub Actions committed on

chore: sync EEE pipeline output [2026-03-30 05:29 UTC]
523c970

GitHub Actions committed on

chore: sync EEE pipeline output [2026-03-29 05:14 UTC]
b279bb2

GitHub Actions committed on

npm warning
a4b6a85

evijit HF Staff committed on