mgfrantz
/

ppo-LunarLander-v2

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Eval Results (legacy)

Model card Files Files and versions

ppo-LunarLander-v2 / mikes_first_lander-1

143 kB

Ctrl+K

Ctrl+K

1 contributor

History: 12 commits

mgfrantz's picture

Trained LunarLander-v2-PPO-0 with a reduced learning rate by a factor of 10

a2102ab almost 4 years ago

_stable_baselines3_version

5 Bytes
This LunarLander-v2-PPO-0 is my first submission to the HF Hub! almost 4 years ago
data

14.6 kB
Trained LunarLander-v2-PPO-0 with a reduced learning rate by a factor of 10 almost 4 years ago
policy.optimizer.pth
Detected Pickle imports (3)
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict"
What is a pickle import?
84.9 kB
xet

Trained LunarLander-v2-PPO-0 with a reduced learning rate by a factor of 10 almost 4 years ago
policy.pth
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "torch.FloatStorage"
What is a pickle import?
43.2 kB
xet

Trained LunarLander-v2-PPO-0 with a reduced learning rate by a factor of 10 almost 4 years ago
pytorch_variables.pth
Pickle imports
- No problematic imports detected
What is a pickle import?
431 Bytes
xet

This LunarLander-v2-PPO-0 is my first submission to the HF Hub! almost 4 years ago
system_info.txt

193 Bytes
This LunarLander-v2-PPO-0 is my first submission to the HF Hub! almost 4 years ago