evaleval/general-eval-card

general-eval-card / components

Commit History

Tolerate missing localStorage in sandboxed embed iframes

01f29b6

evijit HF Staff Claude Opus 4.8 (1M context) committed 6 days ago

Fix model comparison + add evaluator logo

4945098

j-chim committed 19 days ago

Stop propagating verified into detail route

66f4cb7

j-chim committed 20 days ago

Update verified counts for grey validated orgs

2b5a024

j-chim committed 20 days ago

Fix embeds and duplicates

7260042

j-chim committed 21 days ago

Fix embeds to filter by canonical benchmark only (no splits in embed until we decide how to display splits) + suppress popup on embed routes

e74e7be

j-chim committed 21 days ago

Fix embed - missing model name in benchmark details

b12c58d

j-chim committed 22 days ago

Update mobile table rendering to stack

9d7e3c7

j-chim committed 22 days ago

Make sure embeds dont have intro popup

1c48b8f

j-chim committed 25 days ago

Update embeds title and move toggle order

4e96330

j-chim committed 25 days ago

Update wording

6d05aa6

j-chim committed 25 days ago

Letter-square signal flags on the overlaps expanded rows

6cc3c39

j-chim Claude Fable 5 committed 26 days ago

Add the grey recognized tier to the verified badge

4a8e900

j-chim Claude Fable 5 committed 26 days ago

Overlaps tab v2: single-source rows, per-source detail, slice-title contract

dae359e

j-chim Claude Fable 5 committed 26 days ago

Open reported metrics in Overlaps view by default

abef456

Anka committed 26 days ago

Rework Help/About: tutorials, signal docs, contribute/cite sharing, terminology, screenshots; default researcher view to full result set

f2a39b0

Anka committed 27 days ago

Fix validated evaluator displays

4bce289

j-chim committed 27 days ago

Add Help: intro tour, tutorials, stakeholder guides, citation info, footer

f737def

Anka committed 27 days ago

format fixes - validated evals

2355bb3

j-chim committed 27 days ago

Add validated evaluator badge

478ae6c

j-chim committed 27 days ago

Add Feedback page and nav item

294d3a2

Anka committed 28 days ago

Add eee source record (#9)

da4309d

j-chim committed about 1 month ago

Update about (#8)

599471d

j-chim committed about 1 month ago

Fix mobile leaderboard model link to use model_route_id fallback

b9bdca2

j-chim committed on May 28

Adapt upstream category-based code to v2 tag taxonomy

bfb71af

j-chim committed on May 28

Merge remote-tracking branch 'origin/main' into merge/main-into-v2-cleanup

6e90b4d

j-chim committed on May 28

WIP: v2 cleanup checkpoint before merging origin/main

d249d5b

j-chim committed on May 28

Bump @duckdb/node-api 1.5.2-r.1 → 1.5.3-r.2; suppress nested embed button

247b1c5

evijit HF Staff Claude Opus 4.7 (1M context) committed on May 28

Homepage: drop duplicate "Evaluation Cards · Beta" hero kicker; accent corpus date

abb939f

evijit HF Staff Claude Opus 4.7 (1M context) committed on May 28

Embeds: histogram route, leaderboard slices + sort, brand mark; cross-source row dedup

7a54021

evijit HF Staff Claude Opus 4.7 (1M context) committed on May 28

Internal-feedback pass: rename to "Evaluation Cards", rework Summary view, simplify §4 metrics

faa9b3f

evijit HF Staff Claude Opus 4.7 (1M context) committed on May 27