Antonio Serrano Muñoz commited on
Commit ·
becde8a
1
Parent(s): b437373
Add files
Browse files- README.md +88 -0
- agent.pickle +3 -0
- agent.pt +3 -0
README.md
ADDED
|
@@ -0,0 +1,88 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
library_name: skrl
|
| 3 |
+
tags:
|
| 4 |
+
- deep-reinforcement-learning
|
| 5 |
+
- reinforcement-learning
|
| 6 |
+
- skrl
|
| 7 |
+
model-index:
|
| 8 |
+
- name: PPO
|
| 9 |
+
results:
|
| 10 |
+
- metrics:
|
| 11 |
+
- type: mean_reward
|
| 12 |
+
value: 1265.87 +/- 49.12
|
| 13 |
+
name: Total reward (mean)
|
| 14 |
+
task:
|
| 15 |
+
type: reinforcement-learning
|
| 16 |
+
name: reinforcement-learning
|
| 17 |
+
dataset:
|
| 18 |
+
name: IsaacGymEnvs-Quadcopter
|
| 19 |
+
type: IsaacGymEnvs-Quadcopter
|
| 20 |
+
---
|
| 21 |
+
|
| 22 |
+
<!-- ---
|
| 23 |
+
torch: 1265.87 +/- 49.12
|
| 24 |
+
jax: 1224.24 +/- 53.15
|
| 25 |
+
numpy: 1204.61 +/- 49.65
|
| 26 |
+
--- -->
|
| 27 |
+
|
| 28 |
+
# IsaacGymEnvs-Quadcopter-PPO
|
| 29 |
+
|
| 30 |
+
Trained agent for [NVIDIA Isaac Gym Preview](https://github.com/NVIDIA-Omniverse/IsaacGymEnvs) environments.
|
| 31 |
+
|
| 32 |
+
- **Task:** Quadcopter
|
| 33 |
+
- **Agent:** [PPO](https://skrl.readthedocs.io/en/latest/api/agents/ppo.html)
|
| 34 |
+
|
| 35 |
+
# Usage (with skrl)
|
| 36 |
+
|
| 37 |
+
Note: Visit the skrl [Examples](https://skrl.readthedocs.io/en/latest/intro/examples.html) section to access the scripts.
|
| 38 |
+
|
| 39 |
+
* PyTorch
|
| 40 |
+
|
| 41 |
+
```python
|
| 42 |
+
from skrl.utils.huggingface import download_model_from_huggingface
|
| 43 |
+
|
| 44 |
+
# assuming that there is an agent named `agent`
|
| 45 |
+
path = download_model_from_huggingface("skrl/IsaacGymEnvs-Quadcopter-PPO", filename="agent.pt")
|
| 46 |
+
agent.load(path)
|
| 47 |
+
```
|
| 48 |
+
|
| 49 |
+
* JAX
|
| 50 |
+
|
| 51 |
+
```python
|
| 52 |
+
from skrl.utils.huggingface import download_model_from_huggingface
|
| 53 |
+
|
| 54 |
+
# assuming that there is an agent named `agent`
|
| 55 |
+
path = download_model_from_huggingface("skrl/IsaacGymEnvs-Quadcopter-PPO", filename="agent.pickle")
|
| 56 |
+
agent.load(path)
|
| 57 |
+
```
|
| 58 |
+
|
| 59 |
+
# Hyperparameters
|
| 60 |
+
|
| 61 |
+
Note: Undefined parameters keep their values by default.
|
| 62 |
+
|
| 63 |
+
```python
|
| 64 |
+
# https://skrl.readthedocs.io/en/latest/api/agents/ppo.html#configuration-and-hyperparameters
|
| 65 |
+
cfg = PPO_DEFAULT_CONFIG.copy()
|
| 66 |
+
cfg["rollouts"] = 8 # memory_size
|
| 67 |
+
cfg["learning_epochs"] = 8
|
| 68 |
+
cfg["mini_batches"] = 4 # 8 * 8192 / 16384
|
| 69 |
+
cfg["discount_factor"] = 0.99
|
| 70 |
+
cfg["lambda"] = 0.95
|
| 71 |
+
cfg["learning_rate"] = 1e-3
|
| 72 |
+
cfg["learning_rate_scheduler"] = KLAdaptiveRL
|
| 73 |
+
cfg["learning_rate_scheduler_kwargs"] = {"kl_threshold": 0.016}
|
| 74 |
+
cfg["random_timesteps"] = 0
|
| 75 |
+
cfg["learning_starts"] = 0
|
| 76 |
+
cfg["grad_norm_clip"] = 1.0
|
| 77 |
+
cfg["ratio_clip"] = 0.2
|
| 78 |
+
cfg["value_clip"] = 0.2
|
| 79 |
+
cfg["clip_predicted_values"] = True
|
| 80 |
+
cfg["entropy_loss_scale"] = 0.0
|
| 81 |
+
cfg["value_loss_scale"] = 1.0
|
| 82 |
+
cfg["kl_threshold"] = 0
|
| 83 |
+
cfg["rewards_shaper"] = lambda rewards, timestep, timesteps: rewards * 0.1
|
| 84 |
+
cfg["state_preprocessor"] = RunningStandardScaler
|
| 85 |
+
cfg["state_preprocessor_kwargs"] = {"size": env.observation_space, "device": device}
|
| 86 |
+
cfg["value_preprocessor"] = RunningStandardScaler
|
| 87 |
+
cfg["value_preprocessor_kwargs"] = {"size": 1, "device": device}
|
| 88 |
+
```
|
agent.pickle
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3028f2a6618ccd9f26dd8047c5586fad9525d6ee290246efe9e64c6699c17177
|
| 3 |
+
size 2526229
|
agent.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f9baa2295d0d98add54329ec0b7f746e7cc6537f96d612cfec67662928662cb8
|
| 3 |
+
size 1288802
|