alanwsx
/

Rl-Unit1-CoLab

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Eval Results (legacy)

Model card Files Files and versions

Rl-Unit1-CoLab / ppo-LunarLander-v2

Ctrl+K

Ctrl+K

1 contributor

History: 1 commit

alanwsx's picture

First try of Unit 1 CoLab with PPO trained with 1 million steps

33ba93c verified over 1 year ago

_stable_baselines3_version

7 Bytes
First try of Unit 1 CoLab with PPO trained with 1 million steps over 1 year ago
data

14 kB
First try of Unit 1 CoLab with PPO trained with 1 million steps over 1 year ago
policy.optimizer.pth
Detected Pickle imports (3)
- "collections.OrderedDict",
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage"
What is a pickle import?
88.4 kB
xet

First try of Unit 1 CoLab with PPO trained with 1 million steps over 1 year ago
policy.pth
Detected Pickle imports (3)
- "collections.OrderedDict",
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage"
What is a pickle import?
43.8 kB
xet

First try of Unit 1 CoLab with PPO trained with 1 million steps over 1 year ago
pytorch_variables.pth
Pickle imports
- No problematic imports detected
What is a pickle import?
864 Bytes
xet

First try of Unit 1 CoLab with PPO trained with 1 million steps over 1 year ago
system_info.txt

263 Bytes
First try of Unit 1 CoLab with PPO trained with 1 million steps over 1 year ago