leaderboard / src / metrics / data_utils.py

Commit History

feat: include tag in experiment ID
c830a77
unverified

tareknaser committed on

refactor: migrate to integrate with inspect evals
f3d287f
unverified

tareknaser committed on

feat: add support for filtering by invalid JSON in leaderboard
8b1dabc
unverified

tareknaser committed on

feat: a simple leaderboard with filtering
7e7fbbc
unverified

tareknaser committed on