Adityax-07/AgentBench

AgentBench / bench_runner.py

Commit History

feat: add LLM-as-judge evaluator and benchmark runner

b5122b0

Adityax-07 committed on Apr 26