refactor: update judge model to Qwen2.5-72B, standardize environment variables, and enhance task configuration schema 2d3fda8
TM23-sanji committed on