JoshuaFreeman commited on
Commit
32836a6
·
verified ·
1 Parent(s): 61ffce0

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +43 -0
README.md ADDED
@@ -0,0 +1,43 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ tags:
4
+ - reinforcement-learning
5
+ - ppo
6
+ - openfront
7
+ - game-ai
8
+ ---
9
+
10
+ # OpenFront RL Agent
11
+
12
+ PPO-trained agent for [OpenFront.io](https://openfront.io), a multiplayer territory control game.
13
+
14
+ ## Training Details
15
+
16
+ - **Algorithm:** PPO (Proximal Policy Optimization)
17
+ - **Architecture:** Actor-Critic with shared backbone (256→256→128)
18
+ - **Map:** world
19
+ - **Opponents:** 5 bots
20
+ - **Episodes trained:** 20
21
+ - **Global steps:** 57109
22
+ - **Best mean reward:** 151.78631000000004
23
+
24
+ ## Final Training Metrics
25
+
26
+ - **Mean reward:** 151.78631000000004
27
+ - **Mean episode length:** 2855.45
28
+ - **Loss:** 4.396657466888428
29
+
30
+ ## Usage
31
+
32
+ ```python
33
+ from train import ActorCritic
34
+ import torch
35
+
36
+ model = ActorCritic(obs_dim=32, max_neighbors=8)
37
+ model.load_state_dict(torch.load("best_model.pt", weights_only=True))
38
+ model.eval()
39
+ ```
40
+
41
+ ## Repository
42
+
43
+ Trained from [josh-freeman/openfront-rl](https://github.com/josh-freeman/openfront-rl).