Hans14
/

LunarLander-v2

Reinforcement Learning

deep-reinforcement-learning

custom-implementation

Eval Results (legacy)

Model card Files Files and versions

Metrics Training metrics Community

3.45 MB

Ctrl+K

Ctrl+K

1 contributor

History: 10 commits

Hans14's picture

Push agent to the Hub

d9e7b0c almost 3 years ago

logs
Push agent to the Hub almost 3 years ago
pop-lunar-lander-test-2
UPLOAD Model version 1 : no hyperparameter trained on 1M step PPO architecture. Mean_reward 263.07035025211746 +/- std_reward 15.52574254837321 about 3 years ago
.gitattributes

1.48 kB
initial commit about 3 years ago
README.md

1.15 kB
Push agent to the Hub almost 3 years ago
config.json

12.8 kB
UPLOAD Model version 1 : no hyperparameter trained on 1M step PPO architecture. Mean_reward 263.07035025211746 +/- std_reward 15.52574254837321 about 3 years ago
model.pt
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage",
- "collections.OrderedDict"
What is a pickle import?
42.6 kB
xet

Push agent to the Hub almost 3 years ago
pop-lunar-lander-test-2.zip
Pickle imports
- No problematic imports detected
What is a pickle import?
146 kB
xet

UPLOAD Model version 1 : no hyperparameter trained on 1M step PPO architecture. Mean_reward 263.07035025211746 +/- std_reward 15.52574254837321 about 3 years ago
replay.mp4

25.3 kB
Push agent to the Hub almost 3 years ago
results.json

172 Bytes
Push agent to the Hub almost 3 years ago