ChargeBackOps / docs /RESULTS.md

Commit History

Enhance documentation and address specification gaming in ChargebackOps
a92af86

mitudrudutta commited on

fix(eval): sequential per-checkpoint base load + product-grade docs
bb2cdb9

mitudrudutta commited on

feat: Implement wait_for_updates action for handling delayed cases and evidence
2dedffd

mitudrudutta commited on

feat: enhance completion parsing to handle truncated JSON and `<think>` blocks
71f1fe0

mitudrudutta commited on

feat: add per-family evaluation and plotting for training curves
a79d430

mitudrudutta commited on

feat: tighten EscalationROI, add ambiguous medium case, LLM note judge wrapper
e32a33b

mitudrudutta commited on

feat: Add training curve evaluation and plotting utilities with unit tests
8fe3b35

pauldebanshu19 commited on

Add training notebook and benchmark runner for ChargebackOps
bd00c06

pauldebanshu19 commited on

refactor: tighten rubric discrimination + LLM path + add running doc
0054f7f

mitudrudutta commited on

refactor: update difficulty levels and enhance scoring rubrics in documentation and code
3149b7e

mitudrudutta commited on

Refactor evidence building and improve code readability in iso_adapter.py
37bfd28

mitudrudutta commited on