Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:

Duplicated fromย  OpenHandsCommunity/evaluation

3rdn4
/
openhands_official_evaluation
Build error

App Files Files Community
Fetching metadata from the HF Docker repository...
openhands_official_evaluation / outputs
Ctrl+K
Ctrl+K
  • 6 contributors
History: 30 commits
Xingyao Wang
add gpt-4-1106 results for codeact swe
bb237c5 almost 2 years ago
  • agent_bench
    agentbench (#3) almost 2 years ago
  • humanevalfix
    humanevalfix (#4) almost 2 years ago
  • miniwob
    Update outputs/miniwob/README.md almost 2 years ago
  • mint
    Add MINT results (#6) almost 2 years ago
  • swe_bench_lite
    add gpt-4-1106 results for codeact swe almost 2 years ago
  • webarena
    Update outputs/webarena/README.md almost 2 years ago