Spaces: smolagents/ml-agent
ml-agent/eval (branch: main)
8 contributors
History: 24 commits
Latest commit: c68afb6 by akseljoonas (HF Staff), 6 days ago: "fix: properly close SDK message on error, show tool errorText"
Name | Size | Last commit message | Last updated
scrape_discussions | - | thinking if we want eval or not | 5 months ago
README.md | 5.18 kB | eval readme update | 3 months ago
__init__.py | 89 Bytes | updated eval | 4 months ago
check_completeness.py | 5.33 kB | generated, filled in and verfied 250 eval questions | 3 months ago
claude_batch_solve.py | 7.75 kB | generated, filled in and verfied 250 eval questions | 3 months ago
create_eval_dataset.py | 5.11 kB | dataset creation script | 4 months ago
eval_set.ipynb | 182 kB | generated, filled in and verfied 250 eval questions | 3 months ago
generate_rubrics.py | 14.7 kB | generated, filled in and verfied 250 eval questions | 3 months ago
generated_tasks_with_difficulty.json | 37.5 kB | intermediate commit until i let amp loose | 3 months ago
hf_agent_connector.py | 2.81 kB | fix: properly close SDK message on error, show tool errorText | 6 days ago
hf_io.py | 7.2 kB | updated eval | 4 months ago
leaderboard.py | 4.95 kB | rename | 4 months ago
models.py | 1.51 kB | eval runs | 5 months ago
rubric_eval.py | 3.82 kB | gpt 5 nano judge | 4 months ago
run_eval_with_leaderboard.py | 6.79 kB | rename | 4 months ago
solvers.py | 5.15 kB | remove lmnr dependency | 6 days ago
task.py | 3.94 kB | gpt 5 nano judge | 4 months ago