Phase 1.3: eval results, test scripts, gap filter reverted β no improvement, changelog update 10f9a75 MukulRay commited on 22 days ago
Phase 1.3: smoke test scripts, eval runner, check_progress, fix unicode in run_eval.py f95672f MukulRay commited on 22 days ago
Phase 1.1: archive patch_contradiction.py β research integrity fix cd9075d MukulRay commited on 22 days ago
docs: commit eval summary; clarify critic as LLM-assisted-judge; fix test imports 7624a2f MukulRay commited on 22 days ago
Phase 13: HF Spaces deploy ready - verdict logging, clean requirements 6f237d6 MukulRay commited on Mar 29