xiamoqiu/unit8-cleanrl-lunarlander-gpu-course


LunarLander-v3 PPO Run

Run name: ppo-LunarLander-gpu-course
Run time (UTC): 2026-04-18T08:51:55Z
Device: cuda
Total timesteps: 50000
Num envs: 4
Num steps: 128
Learning rate: 0.00025
Mean eval reward: -1054.7867
Std eval reward: 1284.7071
Evaluation video: replay.mp4
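
The rollout settings above imply a fixed batch size and number of policy updates. The following is a minimal sketch of that arithmetic, assuming the run follows the usual CleanRL PPO convention in which each update consumes one rollout of `Num steps` from every environment. The variable names are illustrative and are not taken from the training script.

```python
# Values copied from the run summary above.
total_timesteps = 50_000
num_envs = 4
num_steps = 128

# Assumption: one policy update per full rollout across all envs
# (the convention used in CleanRL's PPO script).
batch_size = num_envs * num_steps              # transitions per update
num_updates = total_timesteps // batch_size    # whole updates that fit in the budget
timesteps_used = num_updates * batch_size      # timesteps actually consumed

print(f"batch_size={batch_size}, num_updates={num_updates}, timesteps_used={timesteps_used}")
```

Under that assumption the run performs 97 updates of 512 transitions each and uses 49,664 of the 50,000 requested timesteps.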
