Add long_horizon/personalized tasks + GitHub-hosted training curves 473ab10 ps2181 Claude Sonnet 4.6 commited on 18 days ago
Implement all audit fixes: base class, curves, README, code quality b02956e ps2181 Claude Sonnet 4.6 commited on 18 days ago
Remove show_download_button — unsupported in this Gradio version e422a3e ps2181 Claude Sonnet 4.6 commited on 18 days ago
Fix training curves: switch gr.Plot → gr.Image with PNG bytes 7f1e860 ps2181 Claude Sonnet 4.6 commited on 18 days ago
Fix /web 404: guard matplotlib import and harden Gradio mount 707a8d9 ps2181 Claude Sonnet 4.6 commited on 18 days ago
Add Training Results tab with GRPO reward curves for all 3 agents 5a9c33c ps2181 Claude Sonnet 4.6 commited on 18 days ago
Fix pipeline UI: total regex case-insensitive, deduplicate invoice IDs aa15f22 ps2181 Claude Sonnet 4.6 commited on 18 days ago
Wire trained LoRA agents into pipeline demo UI e2f0d06 ps2181 Claude Sonnet 4.6 commited on 18 days ago
Add Multi-Agent Pipeline tab — live 5-agent episode trace e595317 ps2181 Claude Sonnet 4.6 commited on 18 days ago
Add 3 novelty upgrades: predictive Regulator, compound fraud, confidence calibration 48cc8c7 ps2181 Claude Sonnet 4.6 commited on 18 days ago
Add multi-agent architecture: Regulator, biased Generator, Auditor rewards 02b8804 ps2181 Claude Sonnet 4.6 commited on 18 days ago
Add Gradio web UI mounted at /web for interactive agent testing 8afb151 ps2181 Claude Sonnet 4.6 commited on Apr 7