Update README.md
Browse filesAdded link to paper
README.md
CHANGED
|
@@ -25,13 +25,28 @@ model-index:
|
|
| 25 |
This is a trained model of a **A2C** agent playing **PandaReachDense-v2**
|
| 26 |
using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
|
| 27 |
|
| 28 |
-
|
| 29 |
-
|
|
|
|
| 30 |
|
|
|
|
| 31 |
|
| 32 |
-
|
| 33 |
-
|
| 34 |
-
|
| 35 |
|
| 36 |
-
...
|
| 37 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 25 |
This is a trained model of a **A2C** agent playing **PandaReachDense-v2**
|
| 26 |
using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
|
| 27 |
|
| 28 |
+
The RL Zoo is a training framework for Stable Baselines3
|
| 29 |
+
reinforcement learning agents,
|
| 30 |
+
with hyperparameter optimization and pre-trained agents included.
|
| 31 |
|
| 32 |
+
## Usage (with SB3 RL Zoo)
|
| 33 |
|
| 34 |
+
RL Zoo: https://github.com/DLR-RM/rl-baselines3-zoo<br/>
|
| 35 |
+
SB3: https://github.com/DLR-RM/stable-baselines3<br/>
|
| 36 |
+
SB3 Contrib: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib
|
| 37 |
|
|
|
|
| 38 |
```
|
| 39 |
+
# Download model and save it into the logs/ folder
|
| 40 |
+
python -m rl_zoo3.load_from_hub --algo tqc --env PandaReachDense-v2 -orga sb3 -f logs/
|
| 41 |
+
python enjoy.py --algo a2c --env PandaReachDense-v2 -f logs/
|
| 42 |
+
```
|
| 43 |
+
|
| 44 |
+
## Training (with the RL Zoo)
|
| 45 |
+
```
|
| 46 |
+
python train.py --algo a2c --env PandaReachDense-v2 -f logs/
|
| 47 |
+
# Upload the model and generate video (when possible)
|
| 48 |
+
python -m rl_zoo3.push_to_hub --algo a2c --env PandaReachDense-v2 -f logs/ -orga sb3
|
| 49 |
+
```
|
| 50 |
+
|
| 51 |
+
|
| 52 |
+
Panda Gym environments: [arxiv.org/abs/2106.13687](https://arxiv.org/abs/2106.13687)
|