docs: replace mermaid diagrams with PNG images for architecture and dispute lifecycle cb1aeae pauldebanshu19 commited on Apr 26
Enhance documentation and address specification gaming in ChargebackOps a92af86 mitudrudutta commited on Apr 25
fix(eval): sequential per-checkpoint base load + product-grade docs bb2cdb9 mitudrudutta commited on Apr 25
feat: Implement wait_for_updates action for handling delayed cases and evidence 2dedffd mitudrudutta commited on Apr 23
feat: enhance completion parsing to handle truncated JSON and `<think>` blocks 71f1fe0 mitudrudutta commited on Apr 20
feat: add per-family evaluation and plotting for training curves a79d430 mitudrudutta commited on Apr 20
feat: tighten EscalationROI, add ambiguous medium case, LLM note judge wrapper e32a33b mitudrudutta commited on Apr 19
feat: Add training curve evaluation and plotting utilities with unit tests 8fe3b35 pauldebanshu19 commited on Apr 19
Add training notebook and benchmark runner for ChargebackOps bd00c06 pauldebanshu19 commited on Apr 19
feat: Implement Issuer agent for multi-round dispute lifecycle b105545 mitudrudutta commited on Apr 19
refactor: tighten rubric discrimination + LLM path + add running doc 0054f7f mitudrudutta commited on Apr 15
refactor: update difficulty levels and enhance scoring rubrics in documentation and code 3149b7e mitudrudutta commited on Apr 14
Add documentation for core modules, data assets, evaluation components, runners, and tests 693f44e mitudrudutta commited on Apr 14
Refactor evidence building and improve code readability in iso_adapter.py 37bfd28 mitudrudutta commited on Apr 12