Ctrl+K
ppo rnn with bootstrap last value on episode end and 1k window length: 1) active return 3e-4 pi lr, 2) active return penalized turnovers 3e-4 pi lr, 3) sharpe based reward
0a12bae - ppo_sharpe_reward_1
- ppo_sharpe_reward_2
- ppo_sharpe_reward_3
- ppo_sharpe_reward_4
- ppo_sharpe_reward_5
- ppo_sharpe_reward_bootstrap_1
- ppo_sharpe_reward_bootstrap_1k_len_1
- ppo_sharpe_reward_bootstrap_1k_len_2
- ppo_sharpe_reward_bootstrap_1k_len_3
- ppo_sharpe_reward_bootstrap_1k_len_4
- ppo_sharpe_reward_bootstrap_1k_len_5
- ppo_sharpe_reward_bootstrap_2
- ppo_sharpe_reward_bootstrap_3
- ppo_sharpe_reward_bootstrap_4
- ppo_sharpe_reward_bootstrap_5