Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Spaces:
yananlong
/
general-eval-card
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
general-eval-card
/
lib
370 kB
Ctrl+K
Ctrl+K
8 contributors
History:
57 commits
Yanan Long
fix joins
fbbeb45
16 days ago
backend-artifacts.ts
14.8 kB
Compute and apply cleaned benchmark counts per model
17 days ago
benchmark-metadata-utils.ts
1.47 kB
Fix RewardBench2 key normalization for matrix leaderboard routing
about 1 month ago
benchmark-metadata.ts
1.65 kB
Differentiate audience modes and tighten eval navigation
about 1 month ago
benchmark-schema.ts
10.8 kB
Merge origin/main and integrate research joins
16 days ago
benchmark-tags.ts
11.7 kB
Restructure model details + extend cleanHierarchy for split families and aggregator dedup
17 days ago
clean-hierarchy.ts
63.6 kB
Merge cross-source benchmark families; tidy leaderboard panel + table chrome
16 days ago
dashboard-data-client.ts
3.46 kB
fix joins
16 days ago
data-backend.ts
5.06 kB
Compute and apply cleaned benchmark counts per model
17 days ago
duckdb-data.ts
9.26 kB
Deploy DuckDB-backed frontend to
23 days ago
duckdb.ts
5.87 kB
Add local parquet read support
16 days ago
eval-processing.ts
36 kB
Refactor to align on benchmark hierarchy
17 days ago
glossary.ts
5.19 kB
Separate policy and researcher views
23 days ago
hf-data.ts
50.9 kB
Merge origin/main and integrate research joins
16 days ago
hierarchy-lookup.ts
5.54 kB
Restructure model details + extend cleanHierarchy for split families and aggregator dedup
17 days ago
known-issues.ts
1.67 kB
Separate policy and researcher views
23 days ago
model-data.ts
59.6 kB
Merge origin/main and integrate research joins
16 days ago
model-family.ts
5.25 kB
Update with datafix v2
18 days ago
na-utils.ts
1.87 kB
Deploy DuckDB-backed frontend to
23 days ago
param-range.ts
3.59 kB
Tighten eval cards UI and clean up stale local data
18 days ago
research-join-types.ts
2.84 kB
fix joins
16 days ago
research-joins.ts
24.8 kB
fix joins
16 days ago
sidecars.ts
9.84 kB
Bump clean-hierarchy cache version to v13 to drop stale blob
16 days ago
survey-content.ts
7.05 kB
Add survey submission and update survey text for public use
about 1 month ago
utils.ts
1.88 kB
Update with datafix v2
18 days ago
view-data.ts
25.9 kB
Precompute eval matrices for multi-metric + per-slice leaderboards
16 days ago