Spaces:

evaleval
/

general-eval-card

Running

App Files Files Community

general-eval-card / lib

Commit History

Tolerate missing localStorage in sandboxed embed iframes

01f29b6

Running

evijit HF Staff Claude Opus 4.8 (1M context) commited on 5 days ago

Add dynamic thumbnails for orgs

e068beb

j-chim commited on 17 days ago

Fix model comparison + add evaluator logo

4945098

j-chim commited on 18 days ago

Update verified counts for grey validated orgs

2b5a024

j-chim commited on 19 days ago

Update verified list

8a710d4

j-chim commited on 19 days ago

Help docs: verified-checkmark gif + refreshed cross-post tutorial

f7f3c2c

evijit HF Staff Claude Opus 4.8 (1M context) commited on 20 days ago

Fix embeds to filter by canonical benchmark only (no splits in embed until we decide how to display splits) + suppress popup on embed routes

e74e7be

j-chim commited on 21 days ago

Update embeds title and move toggle order

4e96330

j-chim commited on 25 days ago

Update wording

6d05aa6

j-chim commited on 25 days ago

Retarget three stale model redirects for the 657f24c2 registry

b337bd8

j-chim Claude Fable 5 commited on 25 days ago

Letter-square signal flags on the overlaps expanded rows

6cc3c39

j-chim Claude Fable 5 commited on 25 days ago

Add the grey recognized tier to the verified badge

4a8e900

j-chim Claude Fable 5 commited on 25 days ago

Overlaps tab v2: single-source rows, per-source detail, slice-title contract

dae359e

j-chim Claude Fable 5 commited on 25 days ago

Rework Help/About: tutorials, signal docs, contribute/cite sharing, terminology, screenshots; default researcher view to full result set

f2a39b0

Anka commited on 26 days ago

Render screenshot blocks indented under list items

600100f

Anka commited on 26 days ago

Fix validated evaluator displays

4bce289

j-chim commited on 26 days ago

Add Help: intro tour, tutorials, stakeholder guides, citation info, footer

f737def

Anka commited on 26 days ago

format fixes - validated evals

2355bb3

j-chim commited on 26 days ago

Add validated evaluator badge

478ae6c

j-chim commited on 26 days ago

Add eee source record (#9)

da4309d

j-chim commited on about 1 month ago

Update about (#8)

599471d

j-chim commited on about 1 month ago

Resilience + linux gate: connection-failure reset + pre-push DuckDB read-path gate

3fd5483

j-chim Claude Opus 4.8 (1M context) commited on Jun 2

Cache + gzip the /models and /evals index payloads; remove temp diag route

7888b0e

evijit HF Staff Claude Opus 4.8 (1M context) commited on Jun 2

Load parquet snapshots into in-memory DuckDB tables instead of /data cache

4d4ea55

evijit HF Staff Claude Opus 4.8 (1M context) commited on Jun 2

diag: add file-integrity hash + /tmp-vs-/data read probes

368cea5

evijit HF Staff Claude Opus 4.8 (1M context) commited on Jun 2

Split model and eval DuckDB projections

b7845a7

evijit HF Staff commited on Jun 1

Log failing model and eval context

01f05f9

evijit HF Staff commited on Jun 1

Trim DuckDB detail projection

70ffe59

evijit HF Staff commited on Jun 1

Serialize DuckDB detail queries

90f739e

evijit HF Staff commited on Jun 1

Warm sidecar cache and show loading progress

83ef54a

evijit HF Staff commited on Jun 1

Add eval query diagnostics

46e8ac0

evijit HF Staff commited on Jun 1

Fix DuckDB generation config projection

5f7e258

evijit HF Staff commited on Jun 1

Wrap derived tags for DuckDB space

f8940f7

evijit HF Staff commited on May 29

Fix DuckDB HF Space eval queries

0641374

evijit HF Staff commited on May 29

view-data: split runAndRead/readAll and log column types on failure

9b2a4b8

evijit HF Staff Claude Opus 4.7 (1M context) commited on May 29

Revert duckdb INSTALL/LOAD json — getRowObjectsJson() is the lib's documented path

d93d898

evijit HF Staff Claude Opus 4.7 (1M context) commited on May 29

duckdb: explicit INSTALL/LOAD json so linux-x64 binding can materialise JSON columns

fa8b4f1

evijit HF Staff Claude Opus 4.7 (1M context) commited on May 29

Adapt upstream category-based code to v2 tag taxonomy

bfb71af

j-chim commited on May 28

Merge remote-tracking branch 'origin/main' into merge/main-into-v2-cleanup

6e90b4d

j-chim commited on May 28