Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:
smolagents
/
ml-agent
Running

App Files Files Community
1
Fetching metadata from the HF Docker repository...
ml-agent / eval
297 kB
  • 9 contributors
History: 20 commits
akseljoonas's picture
akseljoonas HF Staff
functioning frontend and docker
32f62c3 about 1 month ago
  • scrape_discussions
    thinking if we want eval or not 4 months ago
  • README.md
    5.18 kB
    eval readme update 2 months ago
  • __init__.py
    89 Bytes
    updated eval 3 months ago
  • check_completeness.py
    5.33 kB
    generated, filled in and verfied 250 eval questions about 2 months ago
  • claude_batch_solve.py
    7.75 kB
    generated, filled in and verfied 250 eval questions about 2 months ago
  • create_eval_dataset.py
    5.11 kB
    dataset creation script 3 months ago
  • eval_set.ipynb
    182 kB
    generated, filled in and verfied 250 eval questions about 2 months ago
  • generate_rubrics.py
    14.7 kB
    generated, filled in and verfied 250 eval questions about 2 months ago
  • generated_tasks_with_difficulty.json
    37.5 kB
    intermediate commit until i let amp loose 2 months ago
  • hf_agent_connector.py
    2.86 kB
    fixing tracing 3 months ago
  • hf_io.py
    7.2 kB
    updated eval 3 months ago
  • leaderboard.py
    4.95 kB
    rename 3 months ago
  • models.py
    1.51 kB
    eval runs 4 months ago
  • rubric_eval.py
    3.82 kB
    gpt 5 nano judge 3 months ago
  • run_eval_with_leaderboard.py
    6.79 kB
    rename 3 months ago
  • solvers.py
    5.39 kB
    fixing tracing 3 months ago
  • task.py
    3.94 kB
    gpt 5 nano judge 3 months ago