JoshuaFreeman commited on
Commit
6ef48d3
·
verified ·
1 Parent(s): fcaf321

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +7 -7
README.md CHANGED
@@ -17,15 +17,15 @@ PPO-trained agent for [OpenFront.io](https://openfront.io), a multiplayer territ
17
  - **Architecture:** Actor-Critic with shared backbone (256→256→128)
18
  - **Map:** world
19
  - **Opponents:** 5 bots
20
- - **Episodes trained:** 20
21
- - **Global steps:** 57109
22
- - **Best mean reward:** 151.78631000000004
23
 
24
  ## Final Training Metrics
25
 
26
- - **Mean reward:** 151.78631000000004
27
- - **Mean episode length:** 2855.45
28
- - **Loss:** 4.396657466888428
29
 
30
  ## Usage
31
 
@@ -33,7 +33,7 @@ PPO-trained agent for [OpenFront.io](https://openfront.io), a multiplayer territ
33
  from train import ActorCritic
34
  import torch
35
 
36
- model = ActorCritic(obs_dim=32, max_neighbors=8)
37
  model.load_state_dict(torch.load("best_model.pt", weights_only=True))
38
  model.eval()
39
  ```
 
17
  - **Architecture:** Actor-Critic with shared backbone (256→256→128)
18
  - **Map:** world
19
  - **Opponents:** 5 bots
20
+ - **Episodes trained:** N/A
21
+ - **Global steps:** 1638400
22
+ - **Best mean reward:** 31.003479852676392
23
 
24
  ## Final Training Metrics
25
 
26
+ - **Mean reward:** 29.898164215087892
27
+ - **Mean episode length:** 3657.29
28
+ - **Loss:** 0.8671517372131348
29
 
30
  ## Usage
31
 
 
33
  from train import ActorCritic
34
  import torch
35
 
36
+ model = ActorCritic(obs_dim=78, max_neighbors=16)
37
  model.load_state_dict(torch.load("best_model.pt", weights_only=True))
38
  model.eval()
39
  ```