Vishand03/lunarlander-ppo

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Vishand03 committed on Aug 25, 2025

Commit f9dcddc · verified · 1 Parent(s): 311d5f4

Update README.md

Files changed (1)

README.md +15 -8

@@ -40,9 +40,19 @@ This is a trained **PPO agent** for the **LunarLander-v2** environment using Sta
 - Learning rate: 3e-4
 - Optimizer: Adam
-## 🎥 Demo
 ![LunarLander](lunarlander.gif)
 ## 🛠 Usage
 ```python
@@ -55,20 +65,17 @@ from huggingface_hub import hf_hub_download
 # -------------------------
 # Environment Setup
 # -------------------------
-# Environment for human rendering
-env = gym.make("LunarLander-v2", render_mode="human")
-# Environment for evaluation (no render)
-eval_env = Monitor(gym.make("LunarLander-v2"))
 # -------------------------
-# Load pretrained model from Hugging Face Hub
 # -------------------------
 model_path = hf_hub_download("Vishand03/lunarlander-ppo", "model.zip")
 model = PPO.load(model_path)
 # -------------------------
-# Run a single episode
 # -------------------------
 obs, _ = env.reset()
 done = False

 - Learning rate: 3e-4
 - Optimizer: Adam
+---
+## 🎥 Demo (Preview)
 ![LunarLander](lunarlander.gif)
+---
+## 🎬 Full Demo Video
+👉 [Watch the full video here](replay.mp4)
+---
 ## 🛠 Usage
 ```python
 # -------------------------
 # Environment Setup
 # -------------------------
+env = gym.make("LunarLander-v2", render_mode="human")       # Human render
+eval_env = Monitor(gym.make("LunarLander-v2"))              # Evaluation (no render)
 # -------------------------
+# Load pretrained model
 # -------------------------
 model_path = hf_hub_download("Vishand03/lunarlander-ppo", "model.zip")
 model = PPO.load(model_path)
 # -------------------------
+# Run one episode
 # -------------------------
 obs, _ = env.reset()
 done = False