docs: update image links in BLOG.md to point to raw GitHub URLs for better accessibility e5da154 mitudrudutta commited on Apr 27
feat: add canonical demo URLs and update root endpoint response 34a93bb mitudrudutta commited on Apr 26
docs: update previous training run link in README for accuracy bd4f36c mitudrudutta commited on Apr 26
Remove training notebook for ChargebackOps Merchant Agent, consolidating code and documentation for outcome-based RL on Qwen2.5-3B. 73ff1b0 mitudrudutta commited on Apr 26
docs: replace mermaid diagrams with PNG images for architecture and dispute lifecycle cb1aeae pauldebanshu19 commited on Apr 26
Implement code changes to enhance functionality and improve performance 862cfc4 mitudrudutta commited on Apr 26
docs: enhance README with additional resources and demo links 4d7c179 mitudrudutta commited on Apr 25
Enhance documentation and address specification gaming in ChargebackOps a92af86 mitudrudutta commited on Apr 25
fix(eval): sequential per-checkpoint base load + product-grade docs bb2cdb9 mitudrudutta commited on Apr 25
perf(training): v5 - shorter SFT, longer GRPO, hard/nightmare oversample d66354e mitudrudutta commited on Apr 25
fix(eval): pass offload_folder to from_pretrained on T4 OOM path c86ad46 mitudrudutta commited on Apr 25
fix(grpo): revert num_generations to 8 (TRL gen_batch divisibility) 2dce087 mitudrudutta commited on Apr 25
fix(eval): preload bases, swap adapters, fix peft offload_dir crash 8fcb6cf mitudrudutta commited on Apr 24
fix(grpo): unblock learning - widen sampling + raise lr + keep dropout e26c2ac mitudrudutta commited on Apr 24
fix(notebook): pin accelerate==1.0.1 to keep huggingface-hub at 0.26.x 5674079 mitudrudutta commited on Apr 23
feat(training): outcome-based RLVR reward + clean Colab T4 notebook 1f49d52 mitudrudutta commited on Apr 23
Implement code changes to enhance functionality and improve performance 30a9e6e mitudrudutta commited on Apr 23
feat: Implement wait_for_updates action for handling delayed cases and evidence 2dedffd mitudrudutta commited on Apr 23
feat(training): SFT dataset + stall detection in eval rollout 02a6a9f mitudrudutta commited on Apr 21
feat: enhance completion parsing to handle truncated JSON and `<think>` blocks 71f1fe0 mitudrudutta commited on Apr 20
feat: add per-family evaluation and plotting for training curves a79d430 mitudrudutta commited on Apr 20
feat: tighten EscalationROI, add ambiguous medium case, LLM note judge wrapper e32a33b mitudrudutta commited on Apr 19
feat: Add training curve evaluation and plotting utilities with unit tests 8fe3b35 pauldebanshu19 commited on Apr 19
Add training notebook and benchmark runner for ChargebackOps bd00c06 pauldebanshu19 commited on Apr 19
feat: Add LLM softening support to IssuerAgent and implement related tests 06abe10 pauldebanshu19 commited on Apr 19
feat: Implement multi-round dispute lifecycle with arbitration scoring and related tests b7aa1f0 pauldebanshu19 commited on Apr 19
feat: Implement Issuer agent for multi-round dispute lifecycle b105545 mitudrudutta commited on Apr 19
refactor: tighten rubric discrimination + LLM path + add running doc 0054f7f mitudrudutta commited on Apr 15
refactor: update difficulty levels and enhance scoring rubrics in documentation and code 3149b7e mitudrudutta commited on Apr 14
Add documentation for core modules, data assets, evaluation components, runners, and tests 693f44e mitudrudutta commited on Apr 14
Refactor evidence building and improve code readability in iso_adapter.py 37bfd28 mitudrudutta commited on Apr 12
fix: correct list_reports ordering and track ISO dataset in git 8b70d95 mitudrudutta commited on Apr 11
fix: add [START]/[STEP]/[END] structured output to inference.py 388e3b8 mitudrudutta commited on Apr 7
fix: squash inflated evidence scores for wrongly contested concedable cases 7eba019 mitudrudutta commited on Apr 6
feat: redesign Gradio demo with visual dashboard and update OPENENV.md 78830e5 mitudrudutta commited on Apr 6
feat: add card network metadata, trim README, add limitations section a1089c9 mitudrudutta commited on Apr 6
docs: sync AGENT.md and README with current grading values, add LICENSE 5edab41 mitudrudutta commited on Apr 2
fix: replace silent suppress with logged warning for Gradio mount failure 3af94fa mitudrudutta commited on Mar 31
feat: add Gradio demo UI, realistic metadata, info dict, and metrics 9ae9432 mitudrudutta commited on Mar 31