Battu007
/

V4_PPO2_LunarLander_v2

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Eval Results (legacy)


V4_PPO2_LunarLander_v2 / V4_PPO_LL

146 kB


2 contributors

History: 1 commit

ASBattu

PPO Hyperparemeter tune 1M steps LL-2 agent

825d67c about 4 years ago

_stable_baselines3_version

5 Bytes
PPO Hyperparemeter tune 1M steps LL-2 agent about 4 years ago
data

17.5 kB
PPO Hyperparemeter tune 1M steps LL-2 agent about 4 years ago
policy.optimizer.pth
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "torch.FloatStorage"
84.6 kB

PPO Hyperparemeter tune 1M steps LL-2 agent about 4 years ago
policy.pth
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "torch.FloatStorage"
43.1 kB

PPO Hyperparemeter tune 1M steps LL-2 agent about 4 years ago
pytorch_variables.pth
Pickle imports
- No problematic imports detected
431 Bytes

PPO Hyperparemeter tune 1M steps LL-2 agent about 4 years ago
system_info.txt

146 Bytes
PPO Hyperparemeter tune 1M steps LL-2 agent about 4 years ago
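
The "Detected Pickle imports" entries above name the globals a pickle stream would import when loaded. As a rough illustration of how such a list could be produced without unpickling anything, here is a minimal sketch using only the Python standard library. The function name `list_pickle_imports` is hypothetical, the `STACK_GLOBAL` handling is a simple heuristic, and this is not the scanner the hosting site actually runs. It works on raw pickle bytes; extracting those bytes from a `.pth` checkpoint is outside the sketch.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set:
    """Return 'module.name' strings referenced by a pickle stream.

    The stream is only disassembled, never executed, so no object
    is constructed and no module is imported.
    """
    imports = set()
    strings = []  # string arguments seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name" in one string.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name are taken from the stack.
            # Heuristic (an assumption): they are the two most recent
            # string literals. Strings fetched from the memo are missed.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


# Demo on a small pickle built in memory, not on the files listed above.
payload = pickle.dumps(
    collections.OrderedDict(a=1), protocol=2, fix_imports=False
)
found = list_pickle_imports(payload)
print(sorted(found))
```

Because the stream is never loaded, a scan like this can be run on an untrusted file before deciding whether to open it with a real unpickler.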