OpenDILabCommunity/Lunarlander-v2-C51

@@ -21,7 +21,7 @@ model-index:
       type: OpenAI/Gym/Box2d-LunarLander-v2
     metrics:
     - type: mean_reward
-      value: 163.0 +/- 77.34
+      value: 196.19 +/- 78.51
       name: mean_reward
 ---
@@ -114,7 +114,7 @@ exp_config = {
             'retry_waiting_time': 0.1,
             'cfg_type': 'BaseEnvManagerDict'
         },
-        'stop_value': 200,
+        'stop_value': 260,
         'n_evaluator_episode': 8,
         'collector_env_num': 8,
         'evaluator_env_num': 8,
@@ -164,8 +164,9 @@ exp_config = {
                     'mode': 'train_iter'
                 },
                 'figure_path': None,
+                'return_env_info': True,
                 'cfg_type': 'InteractionSerialEvaluatorDict',
-                'stop_value': 200,
+                'stop_value': 260,
                 'n_episode': 8
             }
         },
@@ -208,7 +209,7 @@ exp_config = {
 **Training Procedure**
 <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
-- **Weights & Biases (wandb):** [monitor link](https://wandb.ai/anony-moose-281353441759581725/LunarLander-v2-C51?apiKey=d148cead9d59fbdabf4ef34f646a7ed95795e5bb)
+- **Weights & Biases (wandb):** [monitor link](https://wandb.ai/zjowowen/Lunarlander-v2-C51)
 ## Model Information
 <!-- Provide the basic links for the model. -->
@@ -218,7 +219,7 @@ exp_config = {
 - **Demo:** [video](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-C51/blob/main/replay.mp4)
 <!-- Provide the size information for the model. -->
 - **Parameters total size:** 214.3 KB
-- **Last Update Date:** 2023-07-23
+- **Last Update Date:** 2023-08-03
 ## Environments
 <!-- Address questions around what environment the model is intended to be trained and deployed at, including the necessary information needed to be provided for future users. -->