ppo active return penalized turnovers 1e-3 pi_lr + ppo nstep 20 reward acc with grad clip 3fcbda5 bobotsalos commited on Mar 8
ppo with act ret w/ turnovers 3e-4 pi_lr + 100x scaled reward + ppo nstep 20 reward accumulation 528779c bobotsalos commited on Mar 6
training logs ppo act ret clipped actions [0, 1] + ppo stability update (clipped actions, clip ration 0.1, bptt 256, ent bonus) 93f1ec8 bobotsalos commited on Mar 3
ppo max exposure lstm cidl remove actions mean + log std variations 7be7e38 bobotsalos commited on Feb 18
ppo max exposure lstm cidl remove actions mean + log std variations 7c4058a bobotsalos commited on Feb 18
ppo lstm no leverage env variations + sentiment no action scaling 2b20920 bobotsalos commited on Jan 29
agents ppo clipped actions [-1, 1] + fixed implicit leverage d51d834 bobotsalos commited on Dec 23, 2025