feat: add canonical demo URLs and update root endpoint response 34a93bb mitudrudutta commited on Apr 26
feat(training): outcome-based RLVR reward + clean Colab T4 notebook 1f49d52 mitudrudutta commited on Apr 23
feat: Implement wait_for_updates action for handling delayed cases and evidence 2dedffd mitudrudutta commited on Apr 23
feat(training): SFT dataset + stall detection in eval rollout 02a6a9f mitudrudutta commited on Apr 21
feat: enhance completion parsing to handle truncated JSON and `<think>` blocks 71f1fe0 mitudrudutta commited on Apr 20
feat: add per-family evaluation and plotting for training curves a79d430 mitudrudutta commited on Apr 20
feat: tighten EscalationROI, add ambiguous medium case, LLM note judge wrapper e32a33b mitudrudutta commited on Apr 19
feat: Add training curve evaluation and plotting utilities with unit tests 8fe3b35 pauldebanshu19 commited on Apr 19
Add training notebook and benchmark runner for ChargebackOps bd00c06 pauldebanshu19 commited on Apr 19
feat: Add LLM softening support to IssuerAgent and implement related tests 06abe10 pauldebanshu19 commited on Apr 19
feat: Implement multi-round dispute lifecycle with arbitration scoring and related tests b7aa1f0 pauldebanshu19 commited on Apr 19
feat: Implement Issuer agent for multi-round dispute lifecycle b105545 mitudrudutta commited on Apr 19
feat: add Gradio demo UI, realistic metadata, info dict, and metrics 9ae9432 mitudrudutta commited on Mar 31
feat: add adversarial evidence, nightmare difficulty, and benchmark splits 9e6686d mitudrudutta commited on Mar 31
feat: harden grading, expand task catalog, add episode persistence 87c40c2 mitudrudutta commited on Mar 30