Commit History

Add Official/Community/All scope filter for developers; drop bar
4ba8d73

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Ignore local whisker-render.mjs probe script
b02b887

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Simplify interpretive signals heading
8494e4c

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Cross-source dedup, plotbox polish, pretty URLs, eval page fallbacks
6b39d1f

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Cross-suite signals, sortable leaderboard, theme cleanup
0314721

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Mount comparability panel above leaderboard, restyle, drop empty promises
02691ce

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Fix "null–null (null%)" confidence interval rendering
ae31eaf

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Add rule-based policy-mode summaries for model & eval views
aacebd7

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Cross-source dedup, plotbox polish, pretty URLs, eval page fallbacks
0b45710

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Match nested benchmarks in /evals search; auto-expand families with hits
26f932a

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Eval detail polish: hide empty fields, redesign splits, surface evaluator
c8aca27

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Move reader-mode toggle to detail pages; theme banners + apples-to-apples
4629534

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Auto-purge sidecar bucket when Next.js BUILD_ID changes
e9dae58

evijit HF Staff committed on

Bump clean-hierarchy cache version to v13 to drop stale blob
4d3de5c

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Merge cross-source benchmark families; tidy leaderboard panel + table chrome
8ef4cbc

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Drop alias-only single-bench families without merging them
cb0db40

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Restore curated benchmark families; polish frontier panel UX
ca20f78

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Live snapshot date, hide empty Updated col, clean slice contamination
cb0ce7c

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Humanize family names whose display matches the key under different separators
b763f91

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Make /models tables column-sortable; rebalance /evals + /models toolbars
5a2d59c

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Clean up Source column and per-row dataset label noise
eec1852

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Hide subtask-scope metrics from chips by default in matrix view
4cb8b56

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Render score-distribution metric picker as chips, not a dropdown
1303965

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Treat single-root-metric subtask evals as slice-pickable, not matrix
4ac3a9b

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Precompute eval matrices for multi-metric + per-slice leaderboards
553b175

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Restore HF Open LLM v2 composite and dedup vals.ai aliases
6db4f51

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Move split selector below the reporting comparison heading
629a612

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Add local parquet read support
aa29970

j-chim committed on

Fix sort toggle direction and remove categories as sortable column
c9c5a30

evijit HF Staff Claude Sonnet 4.6 committed on

Sort evals list by family name; add sortable columns; use cleaned display names
919a75f

evijit HF Staff Claude Sonnet 4.6 committed on

Dedup logic to counts
aac276a

j-chim committed on

Fix ranks-high/low-in using only sidecar ordinal data
970fdbe

evijit HF Staff Claude Sonnet 4.6 committed on

Wire search bar to overlaps table and hide chips in overlaps view
0f5fb5f

evijit HF Staff Claude Sonnet 4.6 committed on

Compute and apply cleaned benchmark counts per model
c2e86ea

evijit HF Staff Claude Sonnet 4.6 committed on

Remove raw-hierarchy fallback — only ever serve cleaned hierarchy
b5fa10d

evijit HF Staff Claude Sonnet 4.6 committed on

Harden cleanHierarchy fallback and add family-name filter chips
8529a4b

evijit HF Staff Claude Sonnet 4.6 committed on

Bump clean-hierarchy cache version to v10 to bust stale HF Space cache
4bf0591

evijit HF Staff Claude Sonnet 4.6 committed on

Restructure model details + extend cleanHierarchy for split families and aggregator dedup
06313c1

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Add list-view toggle to consolidate cross-family duplicate benchmarks
26eb09f

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Square off deep-dive theme and surface cross-family duplicates
b75f4c3

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Add option to purge cache
f2e3a0a

j-chim committed on

stats change
f816900

j-chim committed on

Prefer /data persistent bucket for sidecar cache when available
dc95237

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Disk-cache snapshot sidecars to skip cold-start re-downloads
40339dc

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Switch family/model views to curated category tags
bc08b3b

evijit HF Staff committed on

Route peer-ranks fetch through SNAPSHOT_URL sidecar
6cc7b0b

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Hotfix: categories
a80dd9f

j-chim committed on

Group model/eval-detail benchmarks by hierarchy.json families
f073e7a

evijit HF Staff committed on

Drop latest_timestamp fallback for release_date display
8717cca

evijit HF Staff Claude Opus 4.7 (1M context) committed on

Guard summaryText against null in PolicyOverview
c3a3598

j-chim Claude Opus 4.7 (1M context) committed on