Commit History

Maximize environment: curriculum task, metrics endpoint, 5 bug fixes, notebook fix
4890422

ps2181 Claude Sonnet 4.6 committed on

Add long_horizon/personalized tasks + GitHub-hosted training curves
473ab10

ps2181 Claude Sonnet 4.6 committed on

Implement all audit fixes: base class, curves, README, code quality
b02956e

ps2181 Claude Sonnet 4.6 committed on

Fix: score formatting was rounding 0.9999 to 1.000 in stdout logs
8dc2806

ps2181 Claude Sonnet 4.6 committed on

Fix: clamp all remaining hardcoded 0.0/1.0 score returns
af66f63

ps2181 Claude Sonnet 4.6 committed on

Fix: clamp all task scores to strictly open interval (0, 1)
b9b7965

ps2181 Claude Sonnet 4.6 committed on

Add adversarial, negotiate, supply_chain tasks + dynamic difficulty + richer rewards
59a05a5

ps2181 Claude Sonnet 4.6 committed on

Add expert fraud audit task and improve inference feedback loop
c0c1e0e

ps2181 Claude Sonnet 4.6 committed on

Add full invoice processing pipeline environment
0bf71ce

ps2181 Claude Sonnet 4.6 committed on