xboy-352 commited on
Commit
a276bf7
·
verified ·
1 Parent(s): f6f6990

Upload DQN SpaceInvaders model

Browse files
README.md CHANGED
@@ -16,72 +16,22 @@ model-index:
16
  type: SpaceInvadersNoFrameskip-v4
17
  metrics:
18
  - type: mean_reward
19
- value: 329.00 +/- 157.97
20
  name: mean_reward
21
  verified: false
22
  ---
23
 
24
  # **DQN** Agent playing **SpaceInvadersNoFrameskip-v4**
25
  This is a trained model of a **DQN** agent playing **SpaceInvadersNoFrameskip-v4**
26
- using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3)
27
- and the [RL Zoo](https://github.com/DLR-RM/rl-baselines3-zoo).
28
 
29
- The RL Zoo is a training framework for Stable Baselines3
30
- reinforcement learning agents,
31
- with hyperparameter optimization and pre-trained agents included.
32
 
33
- ## Usage (with SB3 RL Zoo)
34
 
35
- RL Zoo: https://github.com/DLR-RM/rl-baselines3-zoo<br/>
36
- SB3: https://github.com/DLR-RM/stable-baselines3<br/>
37
- SB3 Contrib: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib
38
- SBX (SB3 + Jax): https://github.com/araffin/sbx
39
-
40
- Install the RL Zoo (with SB3 and SB3-Contrib):
41
- ```bash
42
- pip install rl_zoo3
43
- ```
44
-
45
- ```
46
- # Download model and save it into the logs/ folder
47
- python -m rl_zoo3.load_from_hub --algo dqn --env SpaceInvadersNoFrameskip-v4 -orga xboy-352 -f logs/
48
- python -m rl_zoo3.enjoy --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/
49
- ```
50
-
51
- If you installed the RL Zoo3 via pip (`pip install rl_zoo3`), from anywhere you can do:
52
- ```
53
- python -m rl_zoo3.load_from_hub --algo dqn --env SpaceInvadersNoFrameskip-v4 -orga xboy-352 -f logs/
54
- python -m rl_zoo3.enjoy --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/
55
- ```
56
-
57
- ## Training (with the RL Zoo)
58
- ```
59
- python -m rl_zoo3.train --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/
60
- # Upload the model and generate video (when possible)
61
- python -m rl_zoo3.push_to_hub --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/ -orga xboy-352
62
- ```
63
-
64
- ## Hyperparameters
65
  ```python
66
- OrderedDict([('batch_size', 32),
67
- ('buffer_size', 100000),
68
- ('env_wrapper',
69
- ['stable_baselines3.common.atari_wrappers.AtariWrapper']),
70
- ('exploration_final_eps', 0.01),
71
- ('exploration_fraction', 0.1),
72
- ('frame_stack', 4),
73
- ('gradient_steps', 1),
74
- ('learning_rate', 0.0001),
75
- ('learning_starts', 100000),
76
- ('n_timesteps', 100000.0),
77
- ('optimize_memory_usage', False),
78
- ('policy', 'CnnPolicy'),
79
- ('target_update_interval', 1000),
80
- ('train_freq', 4),
81
- ('normalize', False)])
82
- ```
83
 
84
- # Environment Arguments
85
- ```python
86
- {'render_mode': 'rgb_array'}
87
  ```
 
16
  type: SpaceInvadersNoFrameskip-v4
17
  metrics:
18
  - type: mean_reward
19
+ value: -4983.57 +/- 15.29
20
  name: mean_reward
21
  verified: false
22
  ---
23
 
24
  # **DQN** Agent playing **SpaceInvadersNoFrameskip-v4**
25
  This is a trained model of a **DQN** agent playing **SpaceInvadersNoFrameskip-v4**
26
+ using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
 
27
 
28
+ ## Usage (with Stable-baselines3)
29
+ TODO: Add your code
 
30
 
 
31
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
32
  ```python
33
+ from stable_baselines3 import ...
34
+ from huggingface_sb3 import load_from_hub
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
35
 
36
+ ...
 
 
37
  ```
config.json ADDED
The diff for this file is too large to render. See raw diff
 
dqn-SpaceInvadersNoFrameskip-v4.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:44f6c551bf2348ea08dc119791a38b5fa15cae14dde8e6f79c18b5cf415ee867
3
- size 13713364
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:59357a84ad1bdf7cbd2d40daf0385a90a6e529d706bf94cd0d6942b3733af2d3
3
+ size 27484691
dqn-SpaceInvadersNoFrameskip-v4/data CHANGED
The diff for this file is too large to render. See raw diff
 
dqn-SpaceInvadersNoFrameskip-v4/policy.optimizer.pth CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:215ea7d8898faa9284464c7109532cf36390d330fccab8d77eeba20628a32876
3
- size 1120
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:60b24b03f0fb18db08801380d65aebc6ea612f46e1f2b7167b7f03a488344a4c
3
+ size 13506441
dqn-SpaceInvadersNoFrameskip-v4/policy.pth CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f9758ff08f9365b03fa63d1816c092684ed2b77c1da3508166555ee2ad6857c9
3
- size 13505370
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:408d7539ed854bde02ce729a3e365590ab682cc6a69efbe712b473c4496d452a
3
+ size 13505767
dqn-SpaceInvadersNoFrameskip-v4/pytorch_variables.pth CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0c35cea3b2e60fb5e7e162d3592df775cd400e575a31c72f359fb9e654ab00c5
3
- size 864
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:07c7431cf6005e7d8f367d79e995f63e2f9b981a37e3437b795d058f9af4308b
3
+ size 1261
dqn-SpaceInvadersNoFrameskip-v4/system_info.txt CHANGED
@@ -1,7 +1,7 @@
1
  - OS: Linux-6.6.87.2-microsoft-standard-WSL2-x86_64-with-glibc2.35 # 1 SMP PREEMPT_DYNAMIC Thu Jun 5 18:30:46 UTC 2025
2
- - Python: 3.10.12
3
  - Stable-Baselines3: 2.7.0
4
- - PyTorch: 2.5.1+cu121
5
  - GPU Enabled: True
6
  - Numpy: 2.2.6
7
  - Cloudpickle: 3.1.2
 
1
  - OS: Linux-6.6.87.2-microsoft-standard-WSL2-x86_64-with-glibc2.35 # 1 SMP PREEMPT_DYNAMIC Thu Jun 5 18:30:46 UTC 2025
2
+ - Python: 3.10.19
3
  - Stable-Baselines3: 2.7.0
4
+ - PyTorch: 2.9.1+cu128
5
  - GPU Enabled: True
6
  - Numpy: 2.2.6
7
  - Cloudpickle: 3.1.2
results.json CHANGED
@@ -1 +1 @@
1
- {"mean_reward": 329.0, "std_reward": 157.96835126062436, "is_deterministic": false, "n_eval_episodes": 10, "eval_datetime": "2025-11-29T21:48:11.373607"}
 
1
+ {"mean_reward": -4983.571299999999, "std_reward": 15.290488095871863, "is_deterministic": false, "n_eval_episodes": 10, "eval_datetime": "2025-11-30T11:53:25.909221"}