evaluation / outputs /agent_bench

Commit History

Update results
0e161f7

Boxuan Li commited on

Add AgentBench evaluation results
b58d2c4

Boxuan Li commited on