VarmaHF/ppo-LunarLander-v2

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning


VarmaHF committed on Sep 25, 2025

Commit 6c9bd7b · verified · 1 Parent(s): 81aec79

Update README.md

Files changed (1)

README.md +0 -3


@@ -26,9 +26,6 @@ This is a trained model of a **PPO** agent playing **LunarLander-v2**
 using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
 ## **Metrics**
-| Metric        | Value       | Std  |
-|---------------|------------|------|
-| mean_reward   | 370.25     | 8.94 |
 > The trained PPO agent achieves a mean reward of *370.24* ± *8.90* on **LunarLander-v2**. This means it consistently lands successfully, demonstrating both high performance and stability across multiple episodes.
 ## Usage (with Stable-baselines3)

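
The metrics line in the README reports a mean reward with a ± spread across evaluation episodes. The snippet below is a minimal sketch of how such a summary is usually formed. It assumes the two figures are the mean and the population standard deviation of per-episode returns, which is what stable-baselines3's `evaluate_policy` returns by default. The `summarize_rewards` helper and the sample returns are hypothetical and are not taken from this repository.

```python
from statistics import fmean, pstdev


def summarize_rewards(episode_rewards):
    """Return (mean, population std) of a sequence of per-episode returns."""
    if not episode_rewards:
        raise ValueError("need at least one episode return")
    return fmean(episode_rewards), pstdev(episode_rewards)


# Made-up returns for illustration only; these are not results from this model.
sample_returns = [360.0, 370.0, 380.0]
mean_reward, std_reward = summarize_rewards(sample_returns)
print(f"mean_reward={mean_reward:.2f} +/- {std_reward:.2f}")
```

A small spread relative to the mean is what supports a claim of stability across episodes. The number of evaluation episodes is not stated in the README, so it is left open here.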