feat: Implement multi-round dispute lifecycle with arbitration scoring and related tests b7aa1f0 pauldebanshu19 commited on Apr 19
Refactor evidence building and improve code readability in iso_adapter.py 37bfd28 mitudrudutta commited on Apr 12
fix: squash inflated evidence scores for wrongly contested concedable cases 7eba019 mitudrudutta commited on Apr 6
feat: harden grader to penalise shallow operational behaviour 544c8b2 mitudrudutta commited on Mar 31
feat: harden grading, expand task catalog, add episode persistence 87c40c2 mitudrudutta commited on Mar 30
refactor: reorganize source files into core/, evaluation/, runners/, scenarios/ directories 3816847 mitudrudutta commited on Mar 29