Spaces: Snowflake/MADQA-Leaderboard
MADQA-Leaderboard/eval (branch main, 328 kB)
3 contributors, History: 11 commits
Latest commit: Borchmann, dfedb16, about 2 months ago: Add effort validation for agentic submissions and uniform-effort display
Name                              Size      Updated             Last commit
reevaluated_results               -         3 months ago        Upload folder using huggingface_hub
.DS_Store                         6.15 kB   3 months ago        Upload folder using huggingface_hub
Archive.zip [xet]                 114 kB    3 months ago        Upload folder using huggingface_hub
README.md                         2.26 kB   4 months ago        Upload folder using huggingface_hub
batch_reevaluate.py               17.7 kB   4 months ago        Upload folder using huggingface_hub
cleanup_submissions.py            6.05 kB   4 months ago        Upload folder using huggingface_hub
delete_unlinked.py                2.67 kB   4 months ago        Upload folder using huggingface_hub
evaluate.py                       14.9 kB   4 months ago        Upload folder using huggingface_hub
link_file_search_predictions.py   3.59 kB   4 months ago        Upload folder using huggingface_hub
metrics.py [Safe]                 25.2 kB   about 2 months ago  Add effort validation for agentic submissions and uniform-effort display
reevaluate_submissions.py         8.57 kB   4 months ago        Upload folder using huggingface_hub
requirements.txt                  77 Bytes  4 months ago        Upload folder using huggingface_hub