Upload DQN SpaceInvaders model
Browse files- README.md +7 -57
- config.json +0 -0
- dqn-SpaceInvadersNoFrameskip-v4.zip +2 -2
- dqn-SpaceInvadersNoFrameskip-v4/data +0 -0
- dqn-SpaceInvadersNoFrameskip-v4/policy.optimizer.pth +2 -2
- dqn-SpaceInvadersNoFrameskip-v4/policy.pth +2 -2
- dqn-SpaceInvadersNoFrameskip-v4/pytorch_variables.pth +2 -2
- dqn-SpaceInvadersNoFrameskip-v4/system_info.txt +2 -2
- results.json +1 -1
README.md
CHANGED
|
@@ -16,72 +16,22 @@ model-index:
|
|
| 16 |
type: SpaceInvadersNoFrameskip-v4
|
| 17 |
metrics:
|
| 18 |
- type: mean_reward
|
| 19 |
-
value:
|
| 20 |
name: mean_reward
|
| 21 |
verified: false
|
| 22 |
---
|
| 23 |
|
| 24 |
# **DQN** Agent playing **SpaceInvadersNoFrameskip-v4**
|
| 25 |
This is a trained model of a **DQN** agent playing **SpaceInvadersNoFrameskip-v4**
|
| 26 |
-
using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3)
|
| 27 |
-
and the [RL Zoo](https://github.com/DLR-RM/rl-baselines3-zoo).
|
| 28 |
|
| 29 |
-
|
| 30 |
-
|
| 31 |
-
with hyperparameter optimization and pre-trained agents included.
|
| 32 |
|
| 33 |
-
## Usage (with SB3 RL Zoo)
|
| 34 |
|
| 35 |
-
RL Zoo: https://github.com/DLR-RM/rl-baselines3-zoo<br/>
|
| 36 |
-
SB3: https://github.com/DLR-RM/stable-baselines3<br/>
|
| 37 |
-
SB3 Contrib: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib
|
| 38 |
-
SBX (SB3 + Jax): https://github.com/araffin/sbx
|
| 39 |
-
|
| 40 |
-
Install the RL Zoo (with SB3 and SB3-Contrib):
|
| 41 |
-
```bash
|
| 42 |
-
pip install rl_zoo3
|
| 43 |
-
```
|
| 44 |
-
|
| 45 |
-
```
|
| 46 |
-
# Download model and save it into the logs/ folder
|
| 47 |
-
python -m rl_zoo3.load_from_hub --algo dqn --env SpaceInvadersNoFrameskip-v4 -orga xboy-352 -f logs/
|
| 48 |
-
python -m rl_zoo3.enjoy --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/
|
| 49 |
-
```
|
| 50 |
-
|
| 51 |
-
If you installed the RL Zoo3 via pip (`pip install rl_zoo3`), from anywhere you can do:
|
| 52 |
-
```
|
| 53 |
-
python -m rl_zoo3.load_from_hub --algo dqn --env SpaceInvadersNoFrameskip-v4 -orga xboy-352 -f logs/
|
| 54 |
-
python -m rl_zoo3.enjoy --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/
|
| 55 |
-
```
|
| 56 |
-
|
| 57 |
-
## Training (with the RL Zoo)
|
| 58 |
-
```
|
| 59 |
-
python -m rl_zoo3.train --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/
|
| 60 |
-
# Upload the model and generate video (when possible)
|
| 61 |
-
python -m rl_zoo3.push_to_hub --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/ -orga xboy-352
|
| 62 |
-
```
|
| 63 |
-
|
| 64 |
-
## Hyperparameters
|
| 65 |
```python
|
| 66 |
-
|
| 67 |
-
|
| 68 |
-
('env_wrapper',
|
| 69 |
-
['stable_baselines3.common.atari_wrappers.AtariWrapper']),
|
| 70 |
-
('exploration_final_eps', 0.01),
|
| 71 |
-
('exploration_fraction', 0.1),
|
| 72 |
-
('frame_stack', 4),
|
| 73 |
-
('gradient_steps', 1),
|
| 74 |
-
('learning_rate', 0.0001),
|
| 75 |
-
('learning_starts', 100000),
|
| 76 |
-
('n_timesteps', 100000.0),
|
| 77 |
-
('optimize_memory_usage', False),
|
| 78 |
-
('policy', 'CnnPolicy'),
|
| 79 |
-
('target_update_interval', 1000),
|
| 80 |
-
('train_freq', 4),
|
| 81 |
-
('normalize', False)])
|
| 82 |
-
```
|
| 83 |
|
| 84 |
-
|
| 85 |
-
```python
|
| 86 |
-
{'render_mode': 'rgb_array'}
|
| 87 |
```
|
|
|
|
| 16 |
type: SpaceInvadersNoFrameskip-v4
|
| 17 |
metrics:
|
| 18 |
- type: mean_reward
|
| 19 |
+
value: -4983.57 +/- 15.29
|
| 20 |
name: mean_reward
|
| 21 |
verified: false
|
| 22 |
---
|
| 23 |
|
| 24 |
# **DQN** Agent playing **SpaceInvadersNoFrameskip-v4**
|
| 25 |
This is a trained model of a **DQN** agent playing **SpaceInvadersNoFrameskip-v4**
|
| 26 |
+
using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
|
|
|
|
| 27 |
|
| 28 |
+
## Usage (with Stable-baselines3)
|
| 29 |
+
TODO: Add your code
|
|
|
|
| 30 |
|
|
|
|
| 31 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 32 |
```python
|
| 33 |
+
from stable_baselines3 import ...
|
| 34 |
+
from huggingface_sb3 import load_from_hub
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 35 |
|
| 36 |
+
...
|
|
|
|
|
|
|
| 37 |
```
|
config.json
ADDED
|
The diff for this file is too large to render.
See raw diff
|
|
|
dqn-SpaceInvadersNoFrameskip-v4.zip
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:59357a84ad1bdf7cbd2d40daf0385a90a6e529d706bf94cd0d6942b3733af2d3
|
| 3 |
+
size 27484691
|
dqn-SpaceInvadersNoFrameskip-v4/data
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|
dqn-SpaceInvadersNoFrameskip-v4/policy.optimizer.pth
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:60b24b03f0fb18db08801380d65aebc6ea612f46e1f2b7167b7f03a488344a4c
|
| 3 |
+
size 13506441
|
dqn-SpaceInvadersNoFrameskip-v4/policy.pth
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:408d7539ed854bde02ce729a3e365590ab682cc6a69efbe712b473c4496d452a
|
| 3 |
+
size 13505767
|
dqn-SpaceInvadersNoFrameskip-v4/pytorch_variables.pth
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:07c7431cf6005e7d8f367d79e995f63e2f9b981a37e3437b795d058f9af4308b
|
| 3 |
+
size 1261
|
dqn-SpaceInvadersNoFrameskip-v4/system_info.txt
CHANGED
|
@@ -1,7 +1,7 @@
|
|
| 1 |
- OS: Linux-6.6.87.2-microsoft-standard-WSL2-x86_64-with-glibc2.35 # 1 SMP PREEMPT_DYNAMIC Thu Jun 5 18:30:46 UTC 2025
|
| 2 |
-
- Python: 3.10.
|
| 3 |
- Stable-Baselines3: 2.7.0
|
| 4 |
-
- PyTorch: 2.
|
| 5 |
- GPU Enabled: True
|
| 6 |
- Numpy: 2.2.6
|
| 7 |
- Cloudpickle: 3.1.2
|
|
|
|
| 1 |
- OS: Linux-6.6.87.2-microsoft-standard-WSL2-x86_64-with-glibc2.35 # 1 SMP PREEMPT_DYNAMIC Thu Jun 5 18:30:46 UTC 2025
|
| 2 |
+
- Python: 3.10.19
|
| 3 |
- Stable-Baselines3: 2.7.0
|
| 4 |
+
- PyTorch: 2.9.1+cu128
|
| 5 |
- GPU Enabled: True
|
| 6 |
- Numpy: 2.2.6
|
| 7 |
- Cloudpickle: 3.1.2
|
results.json
CHANGED
|
@@ -1 +1 @@
|
|
| 1 |
-
{"mean_reward":
|
|
|
|
| 1 |
+
{"mean_reward": -4983.571299999999, "std_reward": 15.290488095871863, "is_deterministic": false, "n_eval_episodes": 10, "eval_datetime": "2025-11-30T11:53:25.909221"}
|