Vishand03
/

lunarlander-ppo

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Eval Results (legacy)

Model card Files Files and versions

Vishand03 commited on Aug 24, 2025

Commit

8051f91

·

verified ·

1 Parent(s): 8d82a27

Update README.md

Files changed (1) hide show

README.md +32 -3

README.md CHANGED Viewed

@@ -8,7 +8,18 @@ tags:
 - lunar-lander
 model-index:
 - name: lunarlander-ppo
-  results: []
 license: apache-2.0
 ---
@@ -24,5 +35,23 @@ This is a PPO-trained agent for the **LunarLander-v3** environment using Stable-
 - Timesteps: 2.5M
 - Mean Reward: ~290
----
-🤖 Trained by [@Vishand03](https://huggingface.co/Vishand03)

 - lunar-lander
 model-index:
 - name: lunarlander-ppo
+  results:
+  - task:
+      type: reinforcement-learning
+      name: Reinforcement Learning
+    dataset:
+      name: LunarLander-v3
+      type: gymnasium
+    metrics:
+    - name: Mean Reward
+      type: reward
+      value: 290.40
+      verified: false
 license: apache-2.0
 ---
 - Timesteps: 2.5M
 - Mean Reward: ~290
+## 🛠 Usage
+You can load and test the trained agent like this:
+```python
+import gymnasium as gym
+from stable_baselines3 import PPO
+# Load environment
+env = gym.make("LunarLander-v3", render_mode="human")
+# Load the pretrained model from Hugging Face Hub
+model = PPO.load("Vishand03/lunarlander-ppo")
+# Run a single episode
+obs, _ = env.reset()
+done = False
+while not done:
+    action, _ = model.predict(obs, deterministic=True)
+    obs, reward, terminated, truncated, _ =_