mohamednabil500 commited on
Commit
0d335f7
·
verified ·
1 Parent(s): 3326dc2

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +35 -0
README.md ADDED
@@ -0,0 +1,35 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+ tags:
4
+ - reinforcement-learning
5
+ - ppo
6
+ - space-invaders
7
+ - atari
8
+ - ale
9
+ - gymnasium
10
+ ---
11
+
12
+ # PPO Expert Agent for ALE/SpaceInvaders-v5 (10M Steps)
13
+
14
+ This is a Proximal Policy Optimization (PPO) agent trained on Atari Space Invaders (v5) using a vectorized environment setup.
15
+
16
+ ## Training Details
17
+ - **Algorithm:** PPO
18
+ - **Environment:** ALE/SpaceInvaders-v5 (with sticky actions)
19
+ - **Total Timesteps:** 10,000,000
20
+ - **Frame Stacking:** 4 frames
21
+ - **Terminal on Life Loss:** True (during training)
22
+
23
+ ## Performance
24
+ - **Peak Score observed:** 615.0
25
+ - **Average Reward (approx):** ~300-450 range at 10M steps.
26
+ - **Behavior:** Learned to clear multiple waves, use shields for cover, and target the Mystery Ship.
27
+
28
+ ## Usage
29
+ ```python
30
+ import torch
31
+ # Assumes you have the ActorCritic class defined in your script
32
+ config = Config() # Using your existing Config class
33
+ model = ActorCritic(input_channels=4, action_dim=6) # Space Invaders has 6 actions
34
+ model.load_state_dict(torch.load('ppo_final_10M.pt', map_location='cpu'))
35
+ model.eval()