Commit History

docs: update image links in BLOG.md to point to raw GitHub URLs for better accessibility
e5da154

mitudrudutta committed on

feat: add canonical demo URLs and update root endpoint response
34a93bb

mitudrudutta committed on

docs: update previous training run link in README for accuracy
bd4f36c

mitudrudutta committed on

Remove training notebook for ChargebackOps Merchant Agent, consolidating code and documentation for outcome-based RL on Qwen2.5-3B.
73ff1b0

mitudrudutta committed on

docs: replace mermaid diagrams with PNG images for architecture and dispute lifecycle
cb1aeae

pauldebanshu19 committed on

Implement code changes to enhance functionality and improve performance
862cfc4

mitudrudutta committed on

docs: enhance README with additional resources and demo links
4d7c179

mitudrudutta committed on

chore: remove ChargebackOps project paper source file
f5a8e9d

pauldebanshu19 committed on

Add Overleaf research paper draft
1e823ee

mitudrudutta committed on

docs: correct HF Space URL to mitudrudutta/ChargeBackOps
15eeac0

mitudrudutta committed on

Add Hugging Face blog writeup
2f6f026

mitudrudutta committed on

Enhance documentation and address specification gaming in ChargebackOps
a92af86

mitudrudutta committed on

fix(notebook): rewrite diag-code as self-contained loader
0421766

mitudrudutta committed on

fix(eval): sequential per-checkpoint base load + product-grade docs
bb2cdb9

mitudrudutta committed on

perf(training): v5 - shorter SFT, longer GRPO, hard/nightmare oversample
d66354e

mitudrudutta committed on

fix(eval): pass offload_folder to from_pretrained on T4 OOM path
c86ad46

mitudrudutta committed on

fix(grpo): revert num_generations to 8 (TRL gen_batch divisibility)
2dce087

mitudrudutta committed on

chore(deps): add matplotlib to dev extras + sync uv.lock
6022558

mitudrudutta committed on

perf(notebook): shrink SFT+GRPO budget for Colab free T4
451f087

mitudrudutta committed on

fix(eval): preload bases, swap adapters, fix peft offload_dir crash
8fcb6cf

mitudrudutta committed on

fix(grpo): unblock learning - widen sampling + raise lr + keep dropout
e26c2ac

mitudrudutta committed on

fix(notebook): pin accelerate==1.0.1 to keep huggingface-hub at 0.26.x
5674079

mitudrudutta committed on

feat(training): outcome-based RLVR reward + clean Colab T4 notebook
1f49d52

mitudrudutta committed on

Implement code changes to enhance functionality and improve performance
30a9e6e

mitudrudutta committed on

feat: Implement wait_for_updates action for handling delayed cases and evidence
2dedffd

mitudrudutta committed on

feat(training): SFT dataset + stall detection in eval rollout
02a6a9f

mitudrudutta committed on

fix(training): per-action reward scoring vs heuristic oracle
243aa68

mitudrudutta committed on

feat: enhance completion parsing to handle truncated JSON and `<think>` blocks
71f1fe0

mitudrudutta committed on

feat: add per-family evaluation and plotting for training curves
a79d430

mitudrudutta committed on

feat: tighten EscalationROI, add ambiguous medium case, LLM note judge wrapper
e32a33b

mitudrudutta committed on

feat: Add training curve evaluation and plotting utilities with unit tests
8fe3b35

pauldebanshu19 committed on

Add training notebook and benchmark runner for ChargebackOps
bd00c06

pauldebanshu19 committed on

feat: Add LLM softening support to IssuerAgent and implement related tests
06abe10

pauldebanshu19 committed on

feat: Implement multi-round dispute lifecycle with arbitration scoring and related tests
b7aa1f0

pauldebanshu19 committed on

feat: Implement Issuer agent for multi-round dispute lifecycle
b105545

mitudrudutta committed on

refactor: tighten rubric discrimination + LLM path + add running doc
0054f7f

mitudrudutta committed on

refactor: update difficulty levels and enhance scoring rubrics in documentation and code
3149b7e

mitudrudutta committed on

Add documentation for core modules, data assets, evaluation components, runners, and tests
693f44e

mitudrudutta committed on

Refactor evidence building and improve code readability in iso_adapter.py
37bfd28

mitudrudutta committed on

refactor: build grading on OpenEnv Rubric system
c8ebaee

mitudrudutta committed on

fix: correct list_reports ordering and track ISO dataset in git
8b70d95

mitudrudutta committed on

fix: match structured output to hackathon spec format
0f980e9

mitudrudutta committed on

fix: add [START]/[STEP]/[END] structured output to inference.py
388e3b8

mitudrudutta committed on

fix: squash inflated evidence scores for wrongly contested concedable cases
7eba019

mitudrudutta committed on

feat: redesign Gradio demo with visual dashboard and update OPENENV.md
78830e5

mitudrudutta committed on

feat: add card network metadata, trim README, add limitations section
a1089c9

mitudrudutta committed on

docs: sync AGENT.md and README with current grading values, add LICENSE
5edab41

mitudrudutta committed on

fix: replace silent suppress with logged warning for Gradio mount failure
3af94fa

mitudrudutta committed on

fix: address Codex adversarial review findings
52e9e29

mitudrudutta committed on

feat: add Gradio demo UI, realistic metadata, info dict, and metrics
9ae9432

mitudrudutta committed on