J3/PPO-Lunar-trial
Tags: Reinforcement Learning, TensorBoard, LunarLander-v2, ppo, deep-reinforcement-learning, custom-implementation, deep-rl-course, Eval Results (legacy)
PPO Agent Playing LunarLander-v2
This is a trained model of a PPO agent playing LunarLander-v2.
Hyperparameters
Evaluation results
mean_reward on LunarLander-v2 (self-reported): 263.55 +/- 75.91
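A figure in the form "mean +/- spread" is typically obtained by running the agent for a number of evaluation episodes and summarizing the per-episode returns. The sketch below shows one way such a summary could be computed. The helper names `summarize_returns` and `format_result` are hypothetical, the sample returns are made-up illustration values, and the use of the population standard deviation is an assumption: the card does not say how the reported spread was calculated or how many episodes were evaluated.

```python
import math


def summarize_returns(episode_returns):
    """Return (mean, std) of a list of per-episode returns.

    Assumption: the spread is the population standard deviation
    (divide by n), which is not confirmed by the model card.
    """
    n = len(episode_returns)
    if n == 0:
        raise ValueError("need at least one episode return")
    mean = sum(episode_returns) / n
    variance = sum((r - mean) ** 2 for r in episode_returns) / n
    return mean, math.sqrt(variance)


def format_result(mean, std):
    """Format a summary in the 'mean +/- std' style used on the card."""
    return f"{mean:.2f} +/- {std:.2f}"


if __name__ == "__main__":
    # Illustrative values only; these are not the model's actual returns.
    sample_returns = [200.0, 300.0]
    mean, std = summarize_returns(sample_returns)
    print(format_result(mean, std))
```

A standard deviation of about 76 against a mean of about 264 means that individual episode returns vary considerably, so comparing this figure with another agent's is only meaningful when both were evaluated the same way.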