chrisjcc commited on
Commit
d5334ec
·
verified ·
1 Parent(s): 2d9e9a9

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +48 -0
README.md ADDED
@@ -0,0 +1,48 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: en
3
+ license: mit
4
+ library_name: stable-baselines3
5
+ tags:
6
+ - reinforcement-learning
7
+ - stable-baselines3
8
+ - gymnasium
9
+ - maskable-ppo
10
+ datasets:
11
+ - custom-utdg-env
12
+ metrics:
13
+ - episode_reward
14
+ ---
15
+
16
+ # UTDG Maskable PPO Policy
17
+
18
+ This model is trained on the UTDG (Untitled Tower Defense Game) environment using Stable-Baselines3 MaskablePPO.
19
+
20
+ ## Model Details
21
+
22
+ - **Algorithm**: MaskablePPO (Proximal Policy Optimization with invalid action masking)
23
+ - **Framework**: Stable-Baselines3
24
+ - **Environment**: Custom UTDG Gymnasium environment
25
+ - **Task**: Tower defense game AI agent
26
+
27
+ ## Usage
28
+ ```python
29
+ from huggingface_hub import hf_hub_download
30
+ from sb3_contrib import MaskablePPO
31
+
32
+ # Download the model
33
+ model_path = hf_hub_download(
34
+ repo_id="chrisjcc/utdg-maskableppo-policy",
35
+ filename="maskableppo_utdg_policy.zip"
36
+ )
37
+
38
+ # Load the model
39
+ model = MaskablePPO.load(model_path)
40
+
41
+ # Use for inference
42
+ # obs, info = env.reset()
43
+ # action, _states = model.predict(obs, action_masks=info["action_mask"])
44
+ ```
45
+
46
+ ## Training
47
+
48
+ The model was trained using reinforcement learning on the UTDG environment.