finish training ppo rnn pnl/active return multi-scale features combos f5dbbf5 bobotsalos committed on Mar 31
ppo rnn pnl/active reward base momentum features no double actor hidden state advancing + no floor on buys with not enough cash 7d6b0f3 bobotsalos committed on Mar 30
ppo rnn with cidl features no cyclical, no adv normalization, and with ent coef cdd52c4 bobotsalos committed on Mar 20
ppo rnn with no cyclical features + with cidl features not normalized 25d0dde bobotsalos committed on Mar 19
ppo rnn cidl features with only week sin/cos sharpe reward + cidl features different window lengths and envs cbfbfee bobotsalos committed on Mar 17
ppo rnn with bootstrap last value on episode end and 1k window length: 1) active return 3e-4 pi lr, 2) active return penalized turnovers 3e-4 pi lr, 3) sharpe based reward 0a12bae bobotsalos committed on Mar 16
finish with ppo rnn with sharpe ratio reward base/cidl features 9ac6e20 bobotsalos committed on Mar 12
trained ppo 5 versions adv normalization, rew norm w/ critic grad clip 1.0, rew norm w/ vf_lr 1e-5, vf_lr 1e-5, clip_ratio 0.1 fe0915d bobotsalos committed on Mar 12
progress (1/5) train ppo rnn with sharpe ratio reward base/cidl features f166bce bobotsalos committed on Mar 12
ppo active return 1e-3 pi_lr rewards std advantage normalization bbd410b bobotsalos committed on Mar 10