Commit History

Add Official/Community/All scope filter for developers; drop bar
4ba8d73
Running

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Add rule-based policy-mode summaries for model & eval views
aacebd7

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Cross-source dedup, plotbox polish, pretty URLs, eval page fallbacks
0b45710

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Auto-purge sidecar bucket when Next.js BUILD_ID changes
e9dae58

evijit HF Staff commited on

Bump clean-hierarchy cache version to v13 to drop stale blob
4d3de5c

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Merge cross-source benchmark families; tidy leaderboard panel + table chrome
8ef4cbc

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Drop alias-only single-bench families without merging them
cb0db40

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Restore curated benchmark families; polish frontier panel UX
ca20f78

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Precompute eval matrices for multi-metric + per-slice leaderboards
553b175

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Restore HF Open LLM v2 composite and dedup vals.ai aliases
6db4f51

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Add local parquet read support
aa29970

j-chim commited on

Sort evals list by family name; add sortable columns; use cleaned display names
919a75f

evijit HF Staff Claude Sonnet 4.6 commited on

Dedup logic to counts
aac276a

j-chim commited on

Compute and apply cleaned benchmark counts per model
c2e86ea

evijit HF Staff Claude Sonnet 4.6 commited on

Remove raw-hierarchy fallback — only ever serve cleaned hierarchy
b5fa10d

evijit HF Staff Claude Sonnet 4.6 commited on

Harden cleanHierarchy fallback and add family-name filter chips
8529a4b

evijit HF Staff Claude Sonnet 4.6 commited on

Bump clean-hierarchy cache version to v10 to bust stale HF Space cache
4bf0591

evijit HF Staff Claude Sonnet 4.6 commited on

Restructure model details + extend cleanHierarchy for split families and aggregator dedup
06313c1

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Add option to purge cache
f2e3a0a

j-chim commited on

stats change
f816900

j-chim commited on

Prefer /data persistent bucket for sidecar cache when available
dc95237

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Disk-cache snapshot sidecars to skip cold-start re-downloads
40339dc

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Switch family/model views to curated category tags
bc08b3b

evijit HF Staff commited on

Route peer-ranks fetch through SNAPSHOT_URL sidecar
6cc7b0b

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Hotfix: categories
a80dd9f

j-chim commited on

Group model/eval-detail benchmarks by hierarchy.json families
f073e7a

evijit HF Staff commited on

Refactor to align on benchmark hierarchy
2ed4959

j-chim commited on

Update with datafix v2
11542d9

j-chim commited on

Swap backend data (#3)
fe99ffa

evijit HF Staff j-chim commited on

Tighten eval cards UI and clean up stale local data
32864b0

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Merge corpus dashboard into home as paper-aligned landing
5279156

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Deploy DuckDB-backed frontend to
da8db3e

Jenny Chim commited on

Add DuckDB shadow-read backend with source-metadata fix
2fcae3f

Jenny Chim Claude Opus 4.7 (1M context) commited on

Separate policy and researcher views
9b4cdbb

evijit HF Staff commited on

Add interpretive signals, corpus dashboard, and slice browser
bca888a

evijit HF Staff Claude Opus 4.7 (1M context) commited on

Preserve evaluator_relationship when flattening model hierarchy
431b0cc

evijit HF Staff commited on

improve ux
8058fce

evijit HF Staff commited on

Differentiate audience modes and tighten eval navigation
d8c2856

evijit HF Staff commited on

Aggregate setup aliases and clarify benchmark variants
dd0b4fc

evijit HF Staff commited on

Fix RewardBench2 key normalization for matrix leaderboard routing
8821e18

evijit HF Staff commited on

Improve eval/model UX, lite data paths, and leaderboard clarity
436ada0

evijit HF Staff commited on

Add per-benchmark comparison histograms on model detail
415ac43

evijit HF Staff Claude Opus 4.6 (1M context) commited on

Improve eval score displays and summary fallbacks
bd8cbe8

evijit HF Staff commited on

Refresh eval cards UI and backend data flow
c1f2130

evijit HF Staff commited on

Add survey submission and update survey text for public use
516ec04

evijit HF Staff Claude Opus 4.6 (1M context) commited on

fix bugs
ae1dc39

evijit HF Staff commited on

fix bugs
04b4cff

evijit HF Staff commited on

ux changes
5f59721

evijit HF Staff commited on

Add survey
e7123f0

evijit HF Staff commited on

fix: align reporting cues and developer slugs
5ca5561

evijit HF Staff commited on