JoshuaFreeman commited on
Commit
9484826
·
verified ·
1 Parent(s): cc8e933

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +14 -14
README.md CHANGED
@@ -15,22 +15,22 @@ PPO-trained agent for [OpenFront.io](https://openfront.io), a multiplayer territ
15
 
16
  - **Algorithm:** PPO (Proximal Policy Optimization)
17
  - **Architecture:** Actor-Critic with shared backbone (256→256→128)
18
- - **Observation dim:** N/A
19
- - **Max neighbors:** N/A
20
- - **Maps:** N/A (random per episode)
21
- - **Opponents:** N/A N/A bots
22
- - **Parallel envs:** N/A
23
- - **Learning rate:** N/A
24
- - **Rollout steps:** N/A
25
- - **Updates trained:** N/A
26
- - **Global steps:** N/A
27
- - **Best mean reward:** N/A
28
 
29
  ## Final Training Metrics
30
 
31
- - **Mean reward:** 29.898164215087892
32
- - **Mean episode length:** 3657.29
33
- - **Loss:** 0.8671517372131348
34
 
35
  ## Usage
36
 
@@ -38,7 +38,7 @@ PPO-trained agent for [OpenFront.io](https://openfront.io), a multiplayer territ
38
  from train import ActorCritic
39
  import torch
40
 
41
- model = ActorCritic(obs_dim=N/A, max_neighbors=N/A)
42
  model.load_state_dict(torch.load("best_model.pt", weights_only=True))
43
  model.eval()
44
  ```
 
15
 
16
  - **Algorithm:** PPO (Proximal Policy Optimization)
17
  - **Architecture:** Actor-Critic with shared backbone (256→256→128)
18
+ - **Observation dim:** 80
19
+ - **Max neighbors:** 16
20
+ - **Maps:** plains, big_plains, world, giantworldmap, ocean_and_land, half_land_half_ocean (random per episode)
21
+ - **Opponents:** 2 Easy bots
22
+ - **Parallel envs:** 8
23
+ - **Learning rate:** 0.0002
24
+ - **Rollout steps:** 512
25
+ - **Updates trained:** 1400
26
+ - **Global steps:** 5734400
27
+ - **Best mean reward:** 468.54246531009676
28
 
29
  ## Final Training Metrics
30
 
31
+ - **Mean reward:** 178.37687824249267
32
+ - **Mean episode length:** 6926.31
33
+ - **Loss:** 0.08463311195373535
34
 
35
  ## Usage
36
 
 
38
  from train import ActorCritic
39
  import torch
40
 
41
+ model = ActorCritic(obs_dim=80, max_neighbors=16)
42
  model.load_state_dict(torch.load("best_model.pt", weights_only=True))
43
  model.eval()
44
  ```