Upload README.md with huggingface_hub
README.md
CHANGED
@@ -21,7 +21,7 @@ model-index:
 type: OpenAI/Gym/Atari-QbertNoFrameskip-v4
 metrics:
 - type: mean_reward
-value:
+value: 12350.0 +/- 0.0
 name: mean_reward
 ---
 
@@ -166,6 +166,7 @@ exp_config = {
 'mode': 'train_iter'
 },
 'figure_path': None,
+'return_env_info': True,
 'cfg_type': 'InteractionSerialEvaluatorDict',
 'stop_value': 30000,
 'n_episode': 8
@@ -210,7 +211,7 @@ exp_config = {
 
 **Training Procedure**
 <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
-- **Weights & Biases (wandb):** [monitor link](https://wandb.ai/
+- **Weights & Biases (wandb):** [monitor link](https://wandb.ai/zjowowen/QbertNoFrameskip-v4-C51)
 
 ## Model Information
 <!-- Provide the basic links for the model. -->
@@ -220,7 +221,7 @@ exp_config = {
 - **Demo:** [video](https://huggingface.co/OpenDILabCommunity/QbertNoFrameskip-v4-C51/blob/main/replay.mp4)
 <!-- Provide the size information for the model. -->
 - **Parameters total size:** 55276.2 KB
-- **Last Update Date:** 2023-
+- **Last Update Date:** 2023-08-04
 
 ## Environments
 <!-- Address questions around what environment the model is intended to be trained and deployed at, including the necessary information needed to be provided for future users. -->