YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

LunarLander-v3 PPO Run

  • Run name: ppo-LunarLander-gpu-course
  • Run time (UTC): 2026-04-18T08:51:55Z
  • Device: cuda
  • Total timesteps: 50000
  • Num envs: 4
  • Num steps: 128
  • Learning rate: 0.00025
  • Mean eval reward: -1054.7867
  • Std eval reward: 1284.7071
  • Evaluation video: replay.mp4
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support