Cheekydave/PPO-LL2
Tags: Reinforcement Learning, stable-baselines3, LunarLander-v2, deep-reinforcement-learning, Eval Results
2d40278
PPO-LL2/results.json
Cheekydave
PPO model trained on 5m steps
2d40278
verified
over 1 year ago
158 Bytes
{
  "mean_reward": 287.2246749,
  "std_reward": 11.812396363998744,
  "is_deterministic": true,
  "n_eval_episodes": 10,
  "eval_datetime": "2024-04-23T11:14:57.402104"
}
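
As a rough illustration of how these evaluation results might be read, the sketch below parses the JSON and derives a conservative score as `mean_reward - std_reward`. That formula is an assumption here (it is a common convention for ranking RL agents, not something stated in this file), and the `summarize` helper and the inlined `RESULTS_JSON` string are hypothetical names used only for this example.

```python
import json
from datetime import datetime

# Contents of results.json, inlined so the example is self-contained.
RESULTS_JSON = """
{
  "mean_reward": 287.2246749,
  "std_reward": 11.812396363998744,
  "is_deterministic": true,
  "n_eval_episodes": 10,
  "eval_datetime": "2024-04-23T11:14:57.402104"
}
"""


def summarize(raw: str) -> dict:
    """Parse an evaluation-results JSON string and add derived fields.

    The 'lower_bound' score (mean minus one standard deviation) is an
    assumed convention, not a field defined by the file itself.
    """
    data = json.loads(raw)
    return {
        "mean_reward": data["mean_reward"],
        "std_reward": data["std_reward"],
        "lower_bound": data["mean_reward"] - data["std_reward"],
        "n_eval_episodes": data["n_eval_episodes"],
        "deterministic": data["is_deterministic"],
        # The timestamp is ISO 8601 without a timezone, so this is a naive datetime.
        "evaluated_at": datetime.fromisoformat(data["eval_datetime"]),
    }


summary = summarize(RESULTS_JSON)
print(f"mean - std over {summary['n_eval_episodes']} episodes: "
      f"{summary['lower_bound']:.2f}")
```

With only 10 evaluation episodes, the standard deviation is itself a noisy estimate, so the derived score is best treated as indicative rather than precise.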