ImaghT commited on
Commit
47ae5cb
·
verified ·
1 Parent(s): 3b01e38

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +41 -0
README.md ADDED
@@ -0,0 +1,41 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - deep-reinforcement-learning
4
+ - reinforcement-learning
5
+ - stable-baselines3
6
+ - LunarLander-v2
7
+ model-index:
8
+ - name: PPO
9
+ results:
10
+ - task:
11
+ type: reinforcement-learning
12
+ name: LunarLander-v2
13
+ dataset:
14
+ name: LunarLander-v2
15
+ type: LunarLander-v2
16
+ metrics:
17
+ - type: mean_reward
18
+ value: 273 +/- 9.50 # <--- 请修改这里的数字:你的均值 +/- 你的标准差
19
+ name: mean_reward
20
+ ---
21
+
22
+ # PPO Agent for LunarLander-v3 (Optimized)
23
+
24
+ This is a pre-trained model for **LunarLander-v3** using Stable-Baselines3.
25
+
26
+ ## Usage
27
+
28
+ ```python
29
+ import gymnasium as gym
30
+ from stable_baselines3 import PPO
31
+ from stable_baselines3.common.env_util import make_vec_env
32
+ from stable_baselines3.common.vec_env import VecNormalize
33
+
34
+ # Load the environment
35
+ env = make_vec_env("LunarLander-v3", n_envs=1)
36
+ env = VecNormalize.load("vec_normalize.pkl", env)
37
+ env.training = False
38
+ env.norm_reward = False
39
+
40
+ # Load the model
41
+ model = PPO.load("ppo_lunar_optimized", env=env)