Reinforcement Learning
stable-baselines3
SpaceInvadersNoFrameskip-v4
deep-reinforcement-learning
Eval Results (legacy)
Instructions to use webjdi/SpaceInvadersNoFrameskip-v4 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- stable-baselines3
How to use webjdi/SpaceInvadersNoFrameskip-v4 with stable-baselines3:
from huggingface_sb3 import load_from_hub checkpoint = load_from_hub( repo_id="webjdi/SpaceInvadersNoFrameskip-v4", filename="{MODEL FILENAME}.zip", ) - Notebooks
- Google Colab
- Kaggle
| library_name: stable-baselines3 | |
| tags: | |
| - SpaceInvadersNoFrameskip-v4 | |
| - deep-reinforcement-learning | |
| - reinforcement-learning | |
| - stable-baselines3 | |
| model-index: | |
| - name: DQN | |
| results: | |
| - task: | |
| type: reinforcement-learning | |
| name: reinforcement-learning | |
| dataset: | |
| name: SpaceInvadersNoFrameskip-v4 | |
| type: SpaceInvadersNoFrameskip-v4 | |
| metrics: | |
| - type: mean_reward | |
| value: 681.00 +/- 204.50 | |
| name: mean_reward | |
| verified: false | |
| # **DQN** Agent playing **SpaceInvadersNoFrameskip-v4** | |
| This is a trained model of a **DQN** agent playing **SpaceInvadersNoFrameskip-v4** | |
| using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3) | |
| and the [RL Zoo](https://github.com/DLR-RM/rl-baselines3-zoo). | |
| The RL Zoo is a training framework for Stable Baselines3 | |
| reinforcement learning agents, | |
| with hyperparameter optimization and pre-trained agents included. | |
| ## Usage (with SB3 RL Zoo) | |
| RL Zoo: https://github.com/DLR-RM/rl-baselines3-zoo<br/> | |
| SB3: https://github.com/DLR-RM/stable-baselines3<br/> | |
| SB3 Contrib: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib | |
| SBX (SB3 + Jax): https://github.com/araffin/sbx | |
| Install the RL Zoo (with SB3 and SB3-Contrib): | |
| ```bash | |
| pip install rl_zoo3 | |
| ``` | |
| ``` | |
| # Download model and save it into the logs/ folder | |
| python -m rl_zoo3.load_from_hub --algo dqn --env SpaceInvadersNoFrameskip-v4 -orga webjdi -f logs/ | |
| python -m rl_zoo3.enjoy --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/ | |
| ``` | |
| If you installed the RL Zoo3 via pip (`pip install rl_zoo3`), from anywhere you can do: | |
| ``` | |
| python -m rl_zoo3.load_from_hub --algo dqn --env SpaceInvadersNoFrameskip-v4 -orga webjdi -f logs/ | |
| python -m rl_zoo3.enjoy --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/ | |
| ``` | |
| ## Training (with the RL Zoo) | |
| ``` | |
| python -m rl_zoo3.train --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/ | |
| # Upload the model and generate video (when possible) | |
| python -m rl_zoo3.push_to_hub --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/ -orga webjdi | |
| ``` | |
| ## Hyperparameters | |
| ```python | |
| OrderedDict([('batch_size', 32), | |
| ('buffer_size', 100000), | |
| ('env_wrapper', | |
| ['stable_baselines3.common.atari_wrappers.AtariWrapper']), | |
| ('exploration_final_eps', 0.01), | |
| ('exploration_fraction', 0.1), | |
| ('frame_stack', 4), | |
| ('gradient_steps', 1), | |
| ('learning_rate', 0.0001), | |
| ('learning_starts', 100000), | |
| ('n_timesteps', 1000000.0), | |
| ('optimize_memory_usage', False), | |
| ('policy', 'CnnPolicy'), | |
| ('target_update_interval', 1000), | |
| ('train_freq', 4), | |
| ('normalize', False)]) | |
| ``` | |
| # Environment Arguments | |
| ```python | |
| {'render_mode': 'rgb_array'} | |
| ``` | |