Upload DQN SpaceInvaders model


Files changed (9)

README.md +7 -57
config.json +0 -0
dqn-SpaceInvadersNoFrameskip-v4.zip +2 -2
dqn-SpaceInvadersNoFrameskip-v4/data +0 -0
dqn-SpaceInvadersNoFrameskip-v4/policy.optimizer.pth +2 -2
dqn-SpaceInvadersNoFrameskip-v4/policy.pth +2 -2
dqn-SpaceInvadersNoFrameskip-v4/pytorch_variables.pth +2 -2
dqn-SpaceInvadersNoFrameskip-v4/system_info.txt +2 -2
results.json +1 -1

README.md CHANGED

@@ -16,72 +16,22 @@ model-index:
       type: SpaceInvadersNoFrameskip-v4
     metrics:
     - type: mean_reward
-      value: 329.00 +/- 157.97
       name: mean_reward
       verified: false
 ---
 # **DQN** Agent playing **SpaceInvadersNoFrameskip-v4**
 This is a trained model of a **DQN** agent playing **SpaceInvadersNoFrameskip-v4**
-using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3)
-and the [RL Zoo](https://github.com/DLR-RM/rl-baselines3-zoo).
-The RL Zoo is a training framework for Stable Baselines3
-reinforcement learning agents,
-with hyperparameter optimization and pre-trained agents included.
-## Usage (with SB3 RL Zoo)
-RL Zoo: https://github.com/DLR-RM/rl-baselines3-zoo<br/>
-SB3: https://github.com/DLR-RM/stable-baselines3<br/>
-SB3 Contrib: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib
-SBX (SB3 + Jax): https://github.com/araffin/sbx
-Install the RL Zoo (with SB3 and SB3-Contrib):
-```bash
-pip install rl_zoo3
-```
-```
-# Download model and save it into the logs/ folder
-python -m rl_zoo3.load_from_hub --algo dqn --env SpaceInvadersNoFrameskip-v4 -orga xboy-352 -f logs/
-python -m rl_zoo3.enjoy --algo dqn --env SpaceInvadersNoFrameskip-v4  -f logs/
-```
-If you installed the RL Zoo3 via pip (`pip install rl_zoo3`), from anywhere you can do:
-```
-python -m rl_zoo3.load_from_hub --algo dqn --env SpaceInvadersNoFrameskip-v4 -orga xboy-352 -f logs/
-python -m rl_zoo3.enjoy --algo dqn --env SpaceInvadersNoFrameskip-v4  -f logs/
-```
-## Training (with the RL Zoo)
-```
-python -m rl_zoo3.train --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/
-# Upload the model and generate video (when possible)
-python -m rl_zoo3.push_to_hub --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/ -orga xboy-352
-```
-## Hyperparameters
 ```python
-OrderedDict([('batch_size', 32),
-             ('buffer_size', 100000),
-             ('env_wrapper',
-              ['stable_baselines3.common.atari_wrappers.AtariWrapper']),
-             ('exploration_final_eps', 0.01),
-             ('exploration_fraction', 0.1),
-             ('frame_stack', 4),
-             ('gradient_steps', 1),
-             ('learning_rate', 0.0001),
-             ('learning_starts', 100000),
-             ('n_timesteps', 100000.0),
-             ('optimize_memory_usage', False),
-             ('policy', 'CnnPolicy'),
-             ('target_update_interval', 1000),
-             ('train_freq', 4),
-             ('normalize', False)])
-```
-# Environment Arguments
-```python
-{'render_mode': 'rgb_array'}
 ```

       type: SpaceInvadersNoFrameskip-v4
     metrics:
     - type: mean_reward
+      value: -4983.57 +/- 15.29
       name: mean_reward
       verified: false
 ---
 # **DQN** Agent playing **SpaceInvadersNoFrameskip-v4**
 This is a trained model of a **DQN** agent playing **SpaceInvadersNoFrameskip-v4**
+using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
+## Usage (with Stable-baselines3)
+TODO: Add your code
 ```python
+from stable_baselines3 import ...
+from huggingface_sb3 import load_from_hub
+...
 ```
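The new README leaves its usage block as a template placeholder. A minimal sketch of what it might contain is shown here. It assumes the repository id is `xboy-352/dqn-SpaceInvadersNoFrameskip-v4` (inferred from the `-orga xboy-352` flag in the removed lines, not confirmed by this commit) and that the checkpoint is the `dqn-SpaceInvadersNoFrameskip-v4.zip` file listed under the changed files. `load_agent` is a hypothetical helper name, and the sketch requires `stable-baselines3` and `huggingface_sb3` to be installed.

```python
# Assumed repository id: inferred from "-orga xboy-352" in the removed README lines.
REPO_ID = "xboy-352/dqn-SpaceInvadersNoFrameskip-v4"
# Checkpoint archive listed among the changed files of this commit.
FILENAME = "dqn-SpaceInvadersNoFrameskip-v4.zip"


def load_agent(repo_id: str = REPO_ID, filename: str = FILENAME):
    """Download the checkpoint from the Hub and load it as an SB3 DQN model.

    Hypothetical helper; imports are deferred so the module can be imported
    without the RL dependencies installed.
    """
    from huggingface_sb3 import load_from_hub
    from stable_baselines3 import DQN

    # load_from_hub returns the local path of the downloaded file.
    checkpoint_path = load_from_hub(repo_id=repo_id, filename=filename)
    return DQN.load(checkpoint_path)
```

Calling `load_agent()` downloads the archive on first use. Evaluating the agent would additionally need an Atari environment wrapped the same way as in training (`AtariWrapper` with a frame stack of 4, per the removed hyperparameters).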

config.json ADDED

The diff for this file is too large to render. See raw diff

dqn-SpaceInvadersNoFrameskip-v4.zip CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:44f6c551bf2348ea08dc119791a38b5fa15cae14dde8e6f79c18b5cf415ee867
-size 13713364

 version https://git-lfs.github.com/spec/v1
+oid sha256:59357a84ad1bdf7cbd2d40daf0385a90a6e529d706bf94cd0d6942b3733af2d3
+size 27484691
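The binary files in this commit appear only as Git LFS pointers: three `key value` lines giving the spec version, the object id (hash algorithm and hex digest), and the size in bytes. As an illustration, the sketch below parses such a pointer and checks a downloaded file's bytes against it. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, not part of any Git LFS tooling.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Return True if the bytes have the size and digest the pointer records."""
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algo"], data).hexdigest() == pointer["digest"]


# Pointer for the new dqn-SpaceInvadersNoFrameskip-v4.zip, copied from the diff.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:59357a84ad1bdf7cbd2d40daf0385a90a6e529d706bf94cd0d6942b3733af2d3\n"
    "size 27484691\n"
)
```

For a downloaded archive, `matches_pointer(open(path, "rb").read(), pointer)` would confirm that the local copy is the object this commit refers to.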

dqn-SpaceInvadersNoFrameskip-v4/data CHANGED

The diff for this file is too large to render. See raw diff

dqn-SpaceInvadersNoFrameskip-v4/policy.optimizer.pth CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:215ea7d8898faa9284464c7109532cf36390d330fccab8d77eeba20628a32876
-size 1120

 version https://git-lfs.github.com/spec/v1
+oid sha256:60b24b03f0fb18db08801380d65aebc6ea612f46e1f2b7167b7f03a488344a4c
+size 13506441

dqn-SpaceInvadersNoFrameskip-v4/policy.pth CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f9758ff08f9365b03fa63d1816c092684ed2b77c1da3508166555ee2ad6857c9
-size 13505370

 version https://git-lfs.github.com/spec/v1
+oid sha256:408d7539ed854bde02ce729a3e365590ab682cc6a69efbe712b473c4496d452a
+size 13505767

dqn-SpaceInvadersNoFrameskip-v4/pytorch_variables.pth CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0c35cea3b2e60fb5e7e162d3592df775cd400e575a31c72f359fb9e654ab00c5
-size 864

 version https://git-lfs.github.com/spec/v1
+oid sha256:07c7431cf6005e7d8f367d79e995f63e2f9b981a37e3437b795d058f9af4308b
+size 1261

dqn-SpaceInvadersNoFrameskip-v4/system_info.txt CHANGED

@@ -1,7 +1,7 @@
 - OS: Linux-6.6.87.2-microsoft-standard-WSL2-x86_64-with-glibc2.35 # 1 SMP PREEMPT_DYNAMIC Thu Jun  5 18:30:46 UTC 2025
-- Python: 3.10.12
 - Stable-Baselines3: 2.7.0
-- PyTorch: 2.5.1+cu121
 - GPU Enabled: True
 - Numpy: 2.2.6
 - Cloudpickle: 3.1.2

 - OS: Linux-6.6.87.2-microsoft-standard-WSL2-x86_64-with-glibc2.35 # 1 SMP PREEMPT_DYNAMIC Thu Jun  5 18:30:46 UTC 2025
+- Python: 3.10.19
 - Stable-Baselines3: 2.7.0
+- PyTorch: 2.9.1+cu128
 - GPU Enabled: True
 - Numpy: 2.2.6
 - Cloudpickle: 3.1.2

results.json CHANGED

@@ -1 +1 @@
-{"mean_reward": 329.0, "std_reward": 157.96835126062436, "is_deterministic": false, "n_eval_episodes": 10, "eval_datetime": "2025-11-29T21:48:11.373607"}

+{"mean_reward": -4983.571299999999, "std_reward": 15.290488095871863, "is_deterministic": false, "n_eval_episodes": 10, "eval_datetime": "2025-11-30T11:53:25.909221"}
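The `mean_reward` value in the README metadata matches the `results.json` fields rounded to two decimals. A small sketch of that formatting follows; `format_mean_reward` is a hypothetical helper, and the rounding rule is an assumption read off the two values in this diff rather than taken from the tooling's source.

```python
import json


def format_mean_reward(results_json: str) -> str:
    """Render results.json as the 'mean +/- std' string used in the README metadata."""
    results = json.loads(results_json)
    return f"{results['mean_reward']:.2f} +/- {results['std_reward']:.2f}"


# Values copied from the old and new results.json in this commit.
old = format_mean_reward('{"mean_reward": 329.0, "std_reward": 157.96835126062436}')
new = format_mean_reward('{"mean_reward": -4983.571299999999, "std_reward": 15.290488095871863}')
print(old)  # 329.00 +/- 157.97
print(new)  # -4983.57 +/- 15.29
```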