Commit History

yes
dba3d2e

Junzhe Li committed on

revamped benchmarking suite
89321e2

Junzhe Li committed on

added concurrency to benchmarking
aa37a55

victorli committed on

final updates
044eaf7

VictorLJZ committed on

refactored tools
25fe9b8

VictorLJZ committed on

Fix prompt load
a7d0aad

Adibvafa committed on

removed anthropic provider
f350e79

VictorLJZ committed on

added anthropic provider
b0f54ed

VictorLJZ committed on

added openrouter provider
f60c51c

VictorLJZ committed on

rexvqa now works
c7b65ec

VictorLJZ committed on

added chestagentbench
2b26ed4

VictorLJZ committed on

simplifying evaluations
a8f2960

VictorLJZ committed on

first working version
9bb904a

VictorLJZ committed on

changes so far
99f2cbc

VictorLJZ committed on