Serotina/First-agent
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results (legacy)
Branch: main
Path: First-agent/First-Agent
Size: 146 kB
1 contributor
History: 1 commit
Latest commit: daa83f3 by Serotina, over 2 years ago
Commit message: "Uproad First Agent using PPO with environment LunarLander"
File                        Size       Storage  Last commit                                                Updated
_stable_baselines3_version  7 Bytes    -        Uproad First Agent using PPO with environment LunarLander  over 2 years ago
data                        14.1 kB    -        Uproad First Agent using PPO with environment LunarLander  over 2 years ago
policy.optimizer.pth        87.9 kB    xet      Uproad First Agent using PPO with environment LunarLander  over 2 years ago
policy.pth                  43.3 kB    xet      Uproad First Agent using PPO with environment LunarLander  over 2 years ago
pytorch_variables.pth       431 Bytes  xet      Uproad First Agent using PPO with environment LunarLander  over 2 years ago
system_info.txt             248 Bytes  -        Uproad First Agent using PPO with environment LunarLander  over 2 years ago