Spaces: smolagents/ml-agent
refs/pr/1
ml-agent/eval
297 kB
9 contributors
History: 20 commits
Latest commit: akseljoonas (HF Staff), "functioning frontend and docker", 32f62c3, about 1 month ago
Name | Size | Last commit message | Last updated
scrape_discussions | - | thinking if we want eval or not | 4 months ago
README.md | 5.18 kB | eval readme update | 2 months ago
__init__.py | 89 Bytes | updated eval | 3 months ago
check_completeness.py | 5.33 kB | generated, filled in and verfied 250 eval questions | about 2 months ago
claude_batch_solve.py | 7.75 kB | generated, filled in and verfied 250 eval questions | about 2 months ago
create_eval_dataset.py | 5.11 kB | dataset creation script | 3 months ago
eval_set.ipynb | 182 kB | generated, filled in and verfied 250 eval questions | about 2 months ago
generate_rubrics.py | 14.7 kB | generated, filled in and verfied 250 eval questions | about 2 months ago
generated_tasks_with_difficulty.json | 37.5 kB | intermediate commit until i let amp loose | 2 months ago
hf_agent_connector.py | 2.86 kB | fixing tracing | 3 months ago
hf_io.py | 7.2 kB | updated eval | 3 months ago
leaderboard.py | 4.95 kB | rename | 3 months ago
models.py | 1.51 kB | eval runs | 4 months ago
rubric_eval.py | 3.82 kB | gpt 5 nano judge | 3 months ago
run_eval_with_leaderboard.py | 6.79 kB | rename | 3 months ago
solvers.py | 5.39 kB | fixing tracing | 3 months ago
task.py | 3.94 kB | gpt 5 nano judge | 3 months ago