Switch to real putnam_small benchmark + statement-match guard f946490 mikeljl commited on 17 days ago
Leaderboard: score JSONL submissions by token-level proof reduction 945c26d mikeljl commited on 18 days ago