Update README.md
README.md
CHANGED
|
@@ -21,7 +21,7 @@ model-index:
|
|
| 21 |
verified: false
|
| 22 |
---
|
| 23 |
|
| 24 |
-
# **DQN** Agent playing **SpaceInvadersNoFrameskip-v4**
|
| 25 |
This is a trained model of a **DQN** agent playing **SpaceInvadersNoFrameskip-v4**
|
| 26 |
using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3)
|
| 27 |
and the [RL Zoo](https://github.com/DLR-RM/rl-baselines3-zoo).
|
|
@@ -30,7 +30,7 @@ The RL Zoo is a training framework for Stable Baselines3
|
|
| 30 |
reinforcement learning agents,
|
| 31 |
with hyperparameter optimization and pre-trained agents included.
|
| 32 |
|
| 33 |
-
## Usage (with SB3 RL Zoo)
|
| 34 |
|
| 35 |
RL Zoo: https://github.com/DLR-RM/rl-baselines3-zoo<br/>
|
| 36 |
SB3: https://github.com/DLR-RM/stable-baselines3<br/>
|
|
@@ -54,14 +54,19 @@ python -m rl_zoo3.load_from_hub --algo dqn --env SpaceInvadersNoFrameskip-v4 -or
|
|
| 54 |
python -m rl_zoo3.enjoy --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/
|
| 55 |
```
|
| 56 |
|
| 57 |
-
## Training (with the RL Zoo)
|
| 58 |
```
|
| 59 |
python -m rl_zoo3.train --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/
|
| 60 |
# Upload the model and generate video (when possible)
|
| 61 |
python -m rl_zoo3.push_to_hub --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/ -orga LimTara
|
| 62 |
```
|
| 63 |
|
| 64 |
-
## Hyperparameters
|
| 65 |
```python
|
| 66 |
OrderedDict([('batch_size', 32),
|
| 67 |
('buffer_size', 100000),
|
|
|
|
| 21 |
verified: false
|
| 22 |
---
|
| 23 |
|
| 24 |
+
# **DQN** Agent playing **SpaceInvadersNoFrameskip-v4** 🚀
|
| 25 |
This is a trained model of a **DQN** agent playing **SpaceInvadersNoFrameskip-v4**
|
| 26 |
using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3)
|
| 27 |
and the [RL Zoo](https://github.com/DLR-RM/rl-baselines3-zoo).
|
|
|
|
| 30 |
reinforcement learning agents,
|
| 31 |
with hyperparameter optimization and pre-trained agents included.
|
| 32 |
|
| 33 |
+
## Usage (with SB3 RL Zoo) 📚
|
| 34 |
|
| 35 |
RL Zoo: https://github.com/DLR-RM/rl-baselines3-zoo<br/>
|
| 36 |
SB3: https://github.com/DLR-RM/stable-baselines3<br/>
|
|
|
|
| 54 |
python -m rl_zoo3.enjoy --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/
|
| 55 |
```
|
| 56 |
|
| 57 |
+
## Training (with the RL Zoo) 🖥️
|
| 58 |
```
|
| 59 |
python -m rl_zoo3.train --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/
|
| 60 |
# Upload the model and generate video (when possible)
|
| 61 |
python -m rl_zoo3.push_to_hub --algo dqn --env SpaceInvadersNoFrameskip-v4 -f logs/ -orga LimTara
|
| 62 |
```
|
| 63 |
|
| 64 |
+
## Hyperparameters 🔗
|
| 65 |
+
Hyperparameters are settings chosen before training that control how the agent learns.
|
| 66 |
+
In this project, one of the hyperparameters is the total number of environment steps the agent is trained for (n_timesteps). The higher the number, the more experience the agent collects and the better it usually scores, but the longer training takes to run. ⏰
|
| 67 |
+
You can also control how much the agent keeps exploring (exploration_final_eps). This is the probability of taking a random action once the exploration phase has ended. The higher the value, the more the agent keeps trying new actions and stepping out of its comfort zone! 💥
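
As a rough illustration of how exploration_final_eps is typically used, here is a minimal sketch of a linearly decaying epsilon-greedy rule. The function names `linear_epsilon` and `choose_action` are hypothetical, and the default values are placeholders for illustration, not this model's settings. The parameter names mirror SB3's DQN options (exploration_initial_eps, exploration_final_eps, exploration_fraction), but this is not the library's own code.

```python
import random


def linear_epsilon(progress, initial_eps=1.0, final_eps=0.01, exploration_fraction=0.1):
    """Return the random-action probability at a given point in training.

    progress is the fraction of training completed (0.0 to 1.0).
    Epsilon decays linearly from initial_eps to final_eps over the first
    exploration_fraction of training, then stays at final_eps.
    The default values are illustrative assumptions.
    """
    if progress >= exploration_fraction:
        return final_eps
    return initial_eps + (progress / exploration_fraction) * (final_eps - initial_eps)


def choose_action(q_values, epsilon, rng=random):
    """Epsilon-greedy: random action with probability epsilon, else the best one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])
```

With a larger final epsilon, `choose_action` keeps picking random actions more often late in training, which is the "risk" described above.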
|
| 68 |
+
|
| 69 |
+
To create and edit the hyperparameters, you need to create a file called "dqn.yml": go to the left bar and click the file icon, which is right below the key icon. Then simply paste the following into the "dqn.yml" file! 📁
|
| 70 |
```python
|
| 71 |
OrderedDict([('batch_size', 32),
|
| 72 |
('buffer_size', 100000),
|