Battu007/V4_PPO2_LunarLander_v2

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Eval Results (legacy)

V4_PPO2_LunarLander_v2/V4_PPO_LL/_stable_baselines3_version

ASBattu

PPO Hyperparameter tune 1M steps LL-2 agent

825d67c about 4 years ago

5 Bytes

1.5.0