Phase A: integrate LLM judge panel for hybrid scoring 8e08ed6 verified Jasonkim8652 commited on 27 days ago
update leaderboard with rescored results and fair diversity formula c59de83 verified Jasonkim8652 commited on Mar 10
feat: add submission & scoring infrastructure (eval_scorer, dispatcher, boltz, queue, tasks) + fix gradio 5.x for Python 3.13 6205b94 verified Jasonkim8652 commited on Mar 3