Spaces:
Sleeping
Sleeping
Peiran
Pairing improvements: filter already-evaluated pairs from /data, round-robin schedule across test_ids, alternate A/B order per pair; ensure submit maps scores to correct model columns and auto-advance
88f2a10