| [2023-07-16 21:11:30,304][236218] Saving configuration to /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/config.json... |
| [2023-07-16 21:11:30,320][236218] Rollout worker 0 uses device cpu |
| [2023-07-16 21:11:30,320][236218] Rollout worker 1 uses device cpu |
| [2023-07-16 21:11:30,320][236218] Rollout worker 2 uses device cpu |
| [2023-07-16 21:11:30,320][236218] Rollout worker 3 uses device cpu |
| [2023-07-16 21:11:30,320][236218] Rollout worker 4 uses device cpu |
| [2023-07-16 21:11:30,320][236218] Rollout worker 5 uses device cpu |
| [2023-07-16 21:11:30,321][236218] Rollout worker 6 uses device cpu |
| [2023-07-16 21:11:30,321][236218] Rollout worker 7 uses device cpu |
| [2023-07-16 21:11:30,321][236218] In synchronous mode, we only accumulate one batch. Setting num_batches_to_accumulate to 1 |
| [2023-07-16 21:11:30,332][236218] InferenceWorker_p0-w0: min num requests: 2 |
| [2023-07-16 21:11:30,351][236218] Starting all processes... |
| [2023-07-16 21:11:30,351][236218] Starting process learner_proc0 |
| [2023-07-16 21:11:30,400][236218] Starting all processes... |
| [2023-07-16 21:11:30,444][236218] Starting process inference_proc0-0 |
| [2023-07-16 21:11:30,453][236218] Starting process rollout_proc0 |
| [2023-07-16 21:11:30,454][236218] Starting process rollout_proc1 |
| [2023-07-16 21:11:30,454][236218] Starting process rollout_proc2 |
| [2023-07-16 21:11:30,454][236218] Starting process rollout_proc3 |
| [2023-07-16 21:11:30,454][236218] Starting process rollout_proc4 |
| [2023-07-16 21:11:30,454][236218] Starting process rollout_proc5 |
| [2023-07-16 21:11:30,454][236218] Starting process rollout_proc6 |
| [2023-07-16 21:11:30,454][236218] Starting process rollout_proc7 |
| [2023-07-16 21:11:32,172][236459] Starting seed is not provided |
| [2023-07-16 21:11:32,172][236459] Initializing actor-critic model on device cpu |
| [2023-07-16 21:11:32,172][236459] RunningMeanStd input shape: (39,) |
| [2023-07-16 21:11:32,172][236459] RunningMeanStd input shape: (1,) |
| [2023-07-16 21:11:32,227][236459] Created Actor Critic model with architecture: |
| [2023-07-16 21:11:32,227][236459] ActorCriticSharedWeights( |
| (obs_normalizer): ObservationNormalizer( |
| (running_mean_std): RunningMeanStdDictInPlace( |
| (running_mean_std): ModuleDict( |
| (obs): RunningMeanStdInPlace() |
| ) |
| ) |
| ) |
| (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) |
| (encoder): MultiInputEncoder( |
| (encoders): ModuleDict( |
| (obs): MlpEncoder( |
| (mlp_head): RecursiveScriptModule( |
| original_name=Sequential |
| (0): RecursiveScriptModule(original_name=Linear) |
| (1): RecursiveScriptModule(original_name=Tanh) |
| (2): RecursiveScriptModule(original_name=Linear) |
| (3): RecursiveScriptModule(original_name=Tanh) |
| ) |
| ) |
| ) |
| ) |
| (core): ModelCoreIdentity() |
| (decoder): MlpDecoder( |
| (mlp): Identity() |
| ) |
| (critic_linear): Linear(in_features=64, out_features=1, bias=True) |
| (action_parameterization): ActionParameterizationContinuousNonAdaptiveStddev( |
| (distribution_linear): Linear(in_features=64, out_features=4, bias=True) |
| ) |
| ) |
| [2023-07-16 21:11:32,422][236558] Worker 7 uses CPU cores [28, 29, 30, 31] |
| [2023-07-16 21:11:32,423][236588] Worker 6 uses CPU cores [24, 25, 26, 27] |
| [2023-07-16 21:11:32,428][236504] Worker 1 uses CPU cores [4, 5, 6, 7] |
| [2023-07-16 21:11:32,542][236459] Using optimizer <class 'torch.optim.adam.Adam'> |
| [2023-07-16 21:11:32,543][236459] No checkpoints found |
| [2023-07-16 21:11:32,543][236459] Did not load from checkpoint, starting from scratch! |
| [2023-07-16 21:11:32,543][236459] Initialized policy 0 weights for model version 0 |
| [2023-07-16 21:11:32,544][236459] LearnerWorker_p0 finished initialization! |
| [2023-07-16 21:11:32,597][236509] Worker 5 uses CPU cores [20, 21, 22, 23] |
| [2023-07-16 21:11:32,617][236506] Worker 2 uses CPU cores [8, 9, 10, 11] |
| [2023-07-16 21:11:32,781][236505] Worker 0 uses CPU cores [0, 1, 2, 3] |
| [2023-07-16 21:11:32,850][236508] Worker 4 uses CPU cores [16, 17, 18, 19] |
| [2023-07-16 21:11:32,961][236503] RunningMeanStd input shape: (39,) |
| [2023-07-16 21:11:32,961][236503] RunningMeanStd input shape: (1,) |
| [2023-07-16 21:11:33,008][236507] Worker 3 uses CPU cores [12, 13, 14, 15] |
| [2023-07-16 21:11:33,016][236218] Inference worker 0-0 is ready! |
| [2023-07-16 21:11:33,017][236218] All inference workers are ready! Signal rollout workers to start! |
| [2023-07-16 21:11:33,345][236218] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) |
| [2023-07-16 21:11:34,271][236588] Decorrelating experience for 0 frames... |
| [2023-07-16 21:11:34,279][236588] Decorrelating experience for 64 frames... |
| [2023-07-16 21:11:34,279][236558] Decorrelating experience for 0 frames... |
| [2023-07-16 21:11:34,287][236558] Decorrelating experience for 64 frames... |
| [2023-07-16 21:11:34,288][236504] Decorrelating experience for 0 frames... |
| [2023-07-16 21:11:34,296][236504] Decorrelating experience for 64 frames... |
| [2023-07-16 21:11:34,307][236508] Decorrelating experience for 0 frames... |
| [2023-07-16 21:11:34,309][236506] Decorrelating experience for 0 frames... |
| [2023-07-16 21:11:34,312][236505] Decorrelating experience for 0 frames... |
| [2023-07-16 21:11:34,314][236509] Decorrelating experience for 0 frames... |
| [2023-07-16 21:11:34,314][236588] Decorrelating experience for 128 frames... |
| [2023-07-16 21:11:34,315][236507] Decorrelating experience for 0 frames... |
| [2023-07-16 21:11:34,315][236508] Decorrelating experience for 64 frames... |
| [2023-07-16 21:11:34,317][236506] Decorrelating experience for 64 frames... |
| [2023-07-16 21:11:34,320][236505] Decorrelating experience for 64 frames... |
| [2023-07-16 21:11:34,322][236509] Decorrelating experience for 64 frames... |
| [2023-07-16 21:11:34,323][236507] Decorrelating experience for 64 frames... |
| [2023-07-16 21:11:34,323][236558] Decorrelating experience for 128 frames... |
| [2023-07-16 21:11:34,331][236504] Decorrelating experience for 128 frames... |
| [2023-07-16 21:11:34,350][236508] Decorrelating experience for 128 frames... |
| [2023-07-16 21:11:34,352][236506] Decorrelating experience for 128 frames... |
| [2023-07-16 21:11:34,355][236505] Decorrelating experience for 128 frames... |
| [2023-07-16 21:11:34,356][236509] Decorrelating experience for 128 frames... |
| [2023-07-16 21:11:34,358][236507] Decorrelating experience for 128 frames... |
| [2023-07-16 21:11:34,384][236588] Decorrelating experience for 192 frames... |
| [2023-07-16 21:11:34,392][236558] Decorrelating experience for 192 frames... |
| [2023-07-16 21:11:34,400][236504] Decorrelating experience for 192 frames... |
| [2023-07-16 21:11:34,418][236508] Decorrelating experience for 192 frames... |
| [2023-07-16 21:11:34,420][236506] Decorrelating experience for 192 frames... |
| [2023-07-16 21:11:34,424][236505] Decorrelating experience for 192 frames... |
| [2023-07-16 21:11:34,425][236509] Decorrelating experience for 192 frames... |
| [2023-07-16 21:11:34,427][236507] Decorrelating experience for 192 frames... |
| [2023-07-16 21:11:35,612][236588] Decorrelating experience for 256 frames... |
| [2023-07-16 21:11:35,620][236558] Decorrelating experience for 256 frames... |
| [2023-07-16 21:11:35,624][236504] Decorrelating experience for 256 frames... |
| [2023-07-16 21:11:35,653][236506] Decorrelating experience for 256 frames... |
| [2023-07-16 21:11:35,654][236508] Decorrelating experience for 256 frames... |
| [2023-07-16 21:11:35,668][236509] Decorrelating experience for 256 frames... |
| [2023-07-16 21:11:35,670][236505] Decorrelating experience for 256 frames... |
| [2023-07-16 21:11:35,670][236507] Decorrelating experience for 256 frames... |
| [2023-07-16 21:11:35,743][236588] Decorrelating experience for 320 frames... |
| [2023-07-16 21:11:35,749][236558] Decorrelating experience for 320 frames... |
| [2023-07-16 21:11:35,754][236504] Decorrelating experience for 320 frames... |
| [2023-07-16 21:11:35,781][236506] Decorrelating experience for 320 frames... |
| [2023-07-16 21:11:35,783][236508] Decorrelating experience for 320 frames... |
| [2023-07-16 21:11:35,798][236509] Decorrelating experience for 320 frames... |
| [2023-07-16 21:11:35,799][236507] Decorrelating experience for 320 frames... |
| [2023-07-16 21:11:35,800][236505] Decorrelating experience for 320 frames... |
| [2023-07-16 21:11:35,907][236588] Decorrelating experience for 384 frames... |
| [2023-07-16 21:11:35,913][236558] Decorrelating experience for 384 frames... |
| [2023-07-16 21:11:35,917][236504] Decorrelating experience for 384 frames... |
| [2023-07-16 21:11:35,943][236506] Decorrelating experience for 384 frames... |
| [2023-07-16 21:11:35,946][236508] Decorrelating experience for 384 frames... |
| [2023-07-16 21:11:35,961][236509] Decorrelating experience for 384 frames... |
| [2023-07-16 21:11:35,962][236507] Decorrelating experience for 384 frames... |
| [2023-07-16 21:11:35,964][236505] Decorrelating experience for 384 frames... |
| [2023-07-16 21:11:36,098][236588] Decorrelating experience for 448 frames... |
| [2023-07-16 21:11:36,102][236558] Decorrelating experience for 448 frames... |
| [2023-07-16 21:11:36,107][236504] Decorrelating experience for 448 frames... |
| [2023-07-16 21:11:36,130][236506] Decorrelating experience for 448 frames... |
| [2023-07-16 21:11:36,135][236508] Decorrelating experience for 448 frames... |
| [2023-07-16 21:11:36,151][236507] Decorrelating experience for 448 frames... |
| [2023-07-16 21:11:36,151][236509] Decorrelating experience for 448 frames... |
| [2023-07-16 21:11:36,154][236505] Decorrelating experience for 448 frames... |
| [2023-07-16 21:11:38,345][236218] Fps is (10 sec: 2457.7, 60 sec: 2457.7, 300 sec: 2457.7). Total num frames: 12288. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:11:38,345][236218] Avg episode reward: [(0, '4.380')] |
| [2023-07-16 21:11:40,634][236503] Updated weights for policy 0, policy_version 80 (0.0005) |
| [2023-07-16 21:11:43,345][236218] Fps is (10 sec: 6963.3, 60 sec: 6963.3, 300 sec: 6963.3). Total num frames: 69632. Throughput: 0: 5735.2. Samples: 57352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:11:43,345][236218] Avg episode reward: [(0, '23.155')] |
| [2023-07-16 21:11:43,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000000136_69632.pth... |
| [2023-07-16 21:11:44,287][236503] Updated weights for policy 0, policy_version 160 (0.0005) |
| [2023-07-16 21:11:48,023][236503] Updated weights for policy 0, policy_version 240 (0.0005) |
| [2023-07-16 21:11:48,345][236218] Fps is (10 sec: 11059.2, 60 sec: 8192.1, 300 sec: 8192.1). Total num frames: 122880. Throughput: 0: 8196.9. Samples: 122952. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-16 21:11:48,345][236218] Avg episode reward: [(0, '44.575')] |
| [2023-07-16 21:11:48,345][236459] Saving new best policy, reward=44.575! |
| [2023-07-16 21:11:50,327][236218] Heartbeat connected on Batcher_0 |
| [2023-07-16 21:11:50,329][236218] Heartbeat connected on LearnerWorker_p0 |
| [2023-07-16 21:11:50,338][236218] Heartbeat connected on RolloutWorker_w0 |
| [2023-07-16 21:11:50,339][236218] Heartbeat connected on InferenceWorker_p0-w0 |
| [2023-07-16 21:11:50,340][236218] Heartbeat connected on RolloutWorker_w2 |
| [2023-07-16 21:11:50,341][236218] Heartbeat connected on RolloutWorker_w1 |
| [2023-07-16 21:11:50,342][236218] Heartbeat connected on RolloutWorker_w3 |
| [2023-07-16 21:11:50,344][236218] Heartbeat connected on RolloutWorker_w4 |
| [2023-07-16 21:11:50,346][236218] Heartbeat connected on RolloutWorker_w5 |
| [2023-07-16 21:11:50,350][236218] Heartbeat connected on RolloutWorker_w6 |
| [2023-07-16 21:11:50,353][236218] Heartbeat connected on RolloutWorker_w7 |
| [2023-07-16 21:11:51,812][236503] Updated weights for policy 0, policy_version 320 (0.0005) |
| [2023-07-16 21:11:53,345][236218] Fps is (10 sec: 11059.2, 60 sec: 9011.2, 300 sec: 9011.2). Total num frames: 180224. Throughput: 0: 7785.6. Samples: 155712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:11:53,345][236218] Avg episode reward: [(0, '87.194')] |
| [2023-07-16 21:11:53,346][236459] Saving new best policy, reward=87.194! |
| [2023-07-16 21:11:55,596][236503] Updated weights for policy 0, policy_version 400 (0.0005) |
| [2023-07-16 21:11:58,345][236218] Fps is (10 sec: 11059.1, 60 sec: 9338.9, 300 sec: 9338.9). Total num frames: 233472. Throughput: 0: 8844.3. Samples: 221108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:11:58,345][236218] Avg episode reward: [(0, '60.293')] |
| [2023-07-16 21:11:58,349][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000000456_233472.pth... |
| [2023-07-16 21:11:59,390][236503] Updated weights for policy 0, policy_version 480 (0.0005) |
| [2023-07-16 21:12:03,338][236503] Updated weights for policy 0, policy_version 560 (0.0004) |
| [2023-07-16 21:12:03,345][236218] Fps is (10 sec: 10649.7, 60 sec: 9557.4, 300 sec: 9557.4). Total num frames: 286720. Throughput: 0: 9461.2. Samples: 283836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:12:03,345][236218] Avg episode reward: [(0, '82.228')] |
| [2023-07-16 21:12:07,204][236503] Updated weights for policy 0, policy_version 640 (0.0005) |
| [2023-07-16 21:12:08,345][236218] Fps is (10 sec: 10240.1, 60 sec: 9596.4, 300 sec: 9596.4). Total num frames: 335872. Throughput: 0: 9019.5. Samples: 315680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:12:08,345][236218] Avg episode reward: [(0, '100.220')] |
| [2023-07-16 21:12:08,396][236459] Saving new best policy, reward=100.220! |
| [2023-07-16 21:12:11,118][236503] Updated weights for policy 0, policy_version 720 (0.0005) |
| [2023-07-16 21:12:13,345][236218] Fps is (10 sec: 10240.0, 60 sec: 9728.0, 300 sec: 9728.0). Total num frames: 389120. Throughput: 0: 9461.7. Samples: 378468. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-16 21:12:13,345][236218] Avg episode reward: [(0, '96.227')] |
| [2023-07-16 21:12:13,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000000760_389120.pth... |
| [2023-07-16 21:12:13,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000000136_69632.pth |
| [2023-07-16 21:12:14,897][236503] Updated weights for policy 0, policy_version 800 (0.0005) |
| [2023-07-16 21:12:18,345][236218] Fps is (10 sec: 11059.1, 60 sec: 9921.4, 300 sec: 9921.4). Total num frames: 446464. Throughput: 0: 9888.6. Samples: 444984. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-16 21:12:18,345][236218] Avg episode reward: [(0, '117.128')] |
| [2023-07-16 21:12:18,346][236459] Saving new best policy, reward=117.128! |
| [2023-07-16 21:12:18,509][236503] Updated weights for policy 0, policy_version 880 (0.0004) |
| [2023-07-16 21:12:22,122][236503] Updated weights for policy 0, policy_version 960 (0.0004) |
| [2023-07-16 21:12:23,345][236218] Fps is (10 sec: 11468.8, 60 sec: 10076.2, 300 sec: 10076.2). Total num frames: 503808. Throughput: 0: 10651.2. Samples: 479304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:12:23,345][236218] Avg episode reward: [(0, '144.006')] |
| [2023-07-16 21:12:23,346][236459] Saving new best policy, reward=144.006! |
| [2023-07-16 21:12:25,856][236503] Updated weights for policy 0, policy_version 1040 (0.0004) |
| [2023-07-16 21:12:28,345][236218] Fps is (10 sec: 11059.1, 60 sec: 10128.3, 300 sec: 10128.3). Total num frames: 557056. Throughput: 0: 10845.5. Samples: 545400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:12:28,345][236218] Avg episode reward: [(0, '130.227')] |
| [2023-07-16 21:12:28,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000001088_557056.pth... |
| [2023-07-16 21:12:28,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000000456_233472.pth |
| [2023-07-16 21:12:29,515][236503] Updated weights for policy 0, policy_version 1120 (0.0004) |
| [2023-07-16 21:12:33,044][236503] Updated weights for policy 0, policy_version 1200 (0.0004) |
| [2023-07-16 21:12:33,345][236218] Fps is (10 sec: 11059.2, 60 sec: 10240.0, 300 sec: 10240.0). Total num frames: 614400. Throughput: 0: 10922.5. Samples: 614464. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-16 21:12:33,345][236218] Avg episode reward: [(0, '186.984')] |
| [2023-07-16 21:12:33,346][236459] Saving new best policy, reward=186.984! |
| [2023-07-16 21:12:36,655][236503] Updated weights for policy 0, policy_version 1280 (0.0004) |
| [2023-07-16 21:12:38,345][236218] Fps is (10 sec: 11468.9, 60 sec: 10990.9, 300 sec: 10334.5). Total num frames: 671744. Throughput: 0: 10941.9. Samples: 648100. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-16 21:12:38,345][236218] Avg episode reward: [(0, '210.890')] |
| [2023-07-16 21:12:38,346][236459] Saving new best policy, reward=210.890! |
| [2023-07-16 21:12:40,505][236503] Updated weights for policy 0, policy_version 1360 (0.0005) |
| [2023-07-16 21:12:43,345][236218] Fps is (10 sec: 11059.1, 60 sec: 10922.7, 300 sec: 10357.0). Total num frames: 724992. Throughput: 0: 10937.1. Samples: 713276. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-16 21:12:43,345][236218] Avg episode reward: [(0, '208.509')] |
| [2023-07-16 21:12:43,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000001416_724992.pth... |
| [2023-07-16 21:12:43,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000000760_389120.pth |
| [2023-07-16 21:12:44,225][236503] Updated weights for policy 0, policy_version 1440 (0.0005) |
| [2023-07-16 21:12:48,080][236503] Updated weights for policy 0, policy_version 1520 (0.0005) |
| [2023-07-16 21:12:48,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10376.5). Total num frames: 778240. Throughput: 0: 10986.7. Samples: 778240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:12:48,345][236218] Avg episode reward: [(0, '221.326')] |
| [2023-07-16 21:12:48,346][236459] Saving new best policy, reward=221.326! |
| [2023-07-16 21:12:51,909][236503] Updated weights for policy 0, policy_version 1600 (0.0005) |
| [2023-07-16 21:12:53,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10854.4, 300 sec: 10393.6). Total num frames: 831488. Throughput: 0: 10987.2. Samples: 810104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:12:53,345][236218] Avg episode reward: [(0, '198.209')] |
| [2023-07-16 21:12:55,747][236503] Updated weights for policy 0, policy_version 1680 (0.0005) |
| [2023-07-16 21:12:58,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 10408.7). Total num frames: 884736. Throughput: 0: 11006.2. Samples: 873748. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-16 21:12:58,345][236218] Avg episode reward: [(0, '261.028')] |
| [2023-07-16 21:12:58,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000001728_884736.pth... |
| [2023-07-16 21:12:58,349][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000001088_557056.pth |
| [2023-07-16 21:12:58,350][236459] Saving new best policy, reward=261.028! |
| [2023-07-16 21:12:59,664][236503] Updated weights for policy 0, policy_version 1760 (0.0005) |
| [2023-07-16 21:13:03,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 10422.1). Total num frames: 937984. Throughput: 0: 10955.6. Samples: 937984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:13:03,345][236218] Avg episode reward: [(0, '251.913')] |
| [2023-07-16 21:13:03,474][236503] Updated weights for policy 0, policy_version 1840 (0.0005) |
| [2023-07-16 21:13:07,328][236503] Updated weights for policy 0, policy_version 1920 (0.0005) |
| [2023-07-16 21:13:08,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10922.7, 300 sec: 10434.0). Total num frames: 991232. Throughput: 0: 10900.5. Samples: 969828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:13:08,345][236218] Avg episode reward: [(0, '254.969')] |
| [2023-07-16 21:13:11,062][236503] Updated weights for policy 0, policy_version 2000 (0.0005) |
| [2023-07-16 21:13:13,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10922.7, 300 sec: 10444.8). Total num frames: 1044480. Throughput: 0: 10873.7. Samples: 1034716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:13:13,345][236218] Avg episode reward: [(0, '254.354')] |
| [2023-07-16 21:13:13,353][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000002048_1048576.pth... |
| [2023-07-16 21:13:13,355][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000001416_724992.pth |
| [2023-07-16 21:13:14,863][236503] Updated weights for policy 0, policy_version 2080 (0.0005) |
| [2023-07-16 21:13:18,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10854.4, 300 sec: 10454.6). Total num frames: 1097728. Throughput: 0: 10747.7. Samples: 1098112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:13:18,345][236218] Avg episode reward: [(0, '251.905')] |
| [2023-07-16 21:13:18,751][236503] Updated weights for policy 0, policy_version 2160 (0.0005) |
| [2023-07-16 21:13:22,578][236503] Updated weights for policy 0, policy_version 2240 (0.0005) |
| [2023-07-16 21:13:23,345][236218] Fps is (10 sec: 11059.3, 60 sec: 10854.4, 300 sec: 10500.7). Total num frames: 1155072. Throughput: 0: 10725.3. Samples: 1130736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:13:23,345][236218] Avg episode reward: [(0, '273.838')] |
| [2023-07-16 21:13:23,346][236459] Saving new best policy, reward=273.838! |
| [2023-07-16 21:13:26,333][236503] Updated weights for policy 0, policy_version 2320 (0.0005) |
| [2023-07-16 21:13:28,345][236218] Fps is (10 sec: 11059.1, 60 sec: 10854.4, 300 sec: 10507.1). Total num frames: 1208320. Throughput: 0: 10723.0. Samples: 1195812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:13:28,345][236218] Avg episode reward: [(0, '296.819')] |
| [2023-07-16 21:13:28,349][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000002360_1208320.pth... |
| [2023-07-16 21:13:28,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000001728_884736.pth |
| [2023-07-16 21:13:28,352][236459] Saving new best policy, reward=296.819! |
| [2023-07-16 21:13:30,206][236503] Updated weights for policy 0, policy_version 2400 (0.0005) |
| [2023-07-16 21:13:33,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10786.1, 300 sec: 10513.1). Total num frames: 1261568. Throughput: 0: 10698.7. Samples: 1259680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:13:33,345][236218] Avg episode reward: [(0, '314.495')] |
| [2023-07-16 21:13:33,346][236459] Saving new best policy, reward=314.495! |
| [2023-07-16 21:13:33,994][236503] Updated weights for policy 0, policy_version 2480 (0.0005) |
| [2023-07-16 21:13:37,862][236503] Updated weights for policy 0, policy_version 2560 (0.0005) |
| [2023-07-16 21:13:38,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10518.5). Total num frames: 1314816. Throughput: 0: 10692.3. Samples: 1291260. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-16 21:13:38,345][236218] Avg episode reward: [(0, '273.589')] |
| [2023-07-16 21:13:41,762][236503] Updated weights for policy 0, policy_version 2640 (0.0005) |
| [2023-07-16 21:13:43,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10523.6). Total num frames: 1368064. Throughput: 0: 10697.3. Samples: 1355124. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-16 21:13:43,345][236218] Avg episode reward: [(0, '297.244')] |
| [2023-07-16 21:13:43,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000002672_1368064.pth... |
| [2023-07-16 21:13:43,350][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000002048_1048576.pth |
| [2023-07-16 21:13:45,680][236503] Updated weights for policy 0, policy_version 2720 (0.0005) |
| [2023-07-16 21:13:48,345][236218] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10497.9). Total num frames: 1417216. Throughput: 0: 10662.3. Samples: 1417788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:13:48,345][236218] Avg episode reward: [(0, '316.556')] |
| [2023-07-16 21:13:48,356][236459] Saving new best policy, reward=316.556! |
| [2023-07-16 21:13:49,516][236503] Updated weights for policy 0, policy_version 2800 (0.0005) |
| [2023-07-16 21:13:53,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10503.3). Total num frames: 1470464. Throughput: 0: 10670.1. Samples: 1449984. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-16 21:13:53,345][236218] Avg episode reward: [(0, '299.647')] |
| [2023-07-16 21:13:53,432][236503] Updated weights for policy 0, policy_version 2880 (0.0005) |
| [2023-07-16 21:13:57,393][236503] Updated weights for policy 0, policy_version 2960 (0.0005) |
| [2023-07-16 21:13:58,345][236218] Fps is (10 sec: 10649.4, 60 sec: 10649.6, 300 sec: 10508.4). Total num frames: 1523712. Throughput: 0: 10595.1. Samples: 1511496. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-16 21:13:58,345][236218] Avg episode reward: [(0, '323.290')] |
| [2023-07-16 21:13:58,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000002976_1523712.pth... |
| [2023-07-16 21:13:58,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000002360_1208320.pth |
| [2023-07-16 21:13:58,352][236459] Saving new best policy, reward=323.290! |
| [2023-07-16 21:14:01,296][236503] Updated weights for policy 0, policy_version 3040 (0.0006) |
| [2023-07-16 21:14:03,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10513.1). Total num frames: 1576960. Throughput: 0: 10606.7. Samples: 1575416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:14:03,345][236218] Avg episode reward: [(0, '327.036')] |
| [2023-07-16 21:14:03,346][236459] Saving new best policy, reward=327.036! |
| [2023-07-16 21:14:05,165][236503] Updated weights for policy 0, policy_version 3120 (0.0005) |
| [2023-07-16 21:14:08,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10517.5). Total num frames: 1630208. Throughput: 0: 10580.6. Samples: 1606864. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-16 21:14:08,345][236218] Avg episode reward: [(0, '294.752')] |
| [2023-07-16 21:14:09,010][236503] Updated weights for policy 0, policy_version 3200 (0.0005) |
| [2023-07-16 21:14:12,870][236503] Updated weights for policy 0, policy_version 3280 (0.0005) |
| [2023-07-16 21:14:13,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10521.6). Total num frames: 1683456. Throughput: 0: 10563.7. Samples: 1671180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:14:13,345][236218] Avg episode reward: [(0, '306.829')] |
| [2023-07-16 21:14:13,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000003288_1683456.pth... |
| [2023-07-16 21:14:13,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000002672_1368064.pth |
| [2023-07-16 21:14:16,789][236503] Updated weights for policy 0, policy_version 3360 (0.0005) |
| [2023-07-16 21:14:18,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10525.5). Total num frames: 1736704. Throughput: 0: 10528.4. Samples: 1733456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:14:18,345][236218] Avg episode reward: [(0, '343.020')] |
| [2023-07-16 21:14:18,345][236459] Saving new best policy, reward=343.020! |
| [2023-07-16 21:14:20,672][236503] Updated weights for policy 0, policy_version 3440 (0.0005) |
| [2023-07-16 21:14:23,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10505.0). Total num frames: 1785856. Throughput: 0: 10537.3. Samples: 1765440. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-16 21:14:23,345][236218] Avg episode reward: [(0, '286.041')] |
| [2023-07-16 21:14:24,508][236503] Updated weights for policy 0, policy_version 3520 (0.0005) |
| [2023-07-16 21:14:28,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10509.2). Total num frames: 1839104. Throughput: 0: 10522.3. Samples: 1828628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:14:28,345][236218] Avg episode reward: [(0, '315.810')] |
| [2023-07-16 21:14:28,404][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000003600_1843200.pth... |
| [2023-07-16 21:14:28,404][236503] Updated weights for policy 0, policy_version 3600 (0.0006) |
| [2023-07-16 21:14:28,406][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000002976_1523712.pth |
| [2023-07-16 21:14:32,279][236503] Updated weights for policy 0, policy_version 3680 (0.0005) |
| [2023-07-16 21:14:33,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10513.1). Total num frames: 1892352. Throughput: 0: 10546.0. Samples: 1892360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:14:33,345][236218] Avg episode reward: [(0, '306.572')] |
| [2023-07-16 21:14:36,190][236503] Updated weights for policy 0, policy_version 3760 (0.0005) |
| [2023-07-16 21:14:38,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10516.8). Total num frames: 1945600. Throughput: 0: 10535.9. Samples: 1924100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:14:38,345][236218] Avg episode reward: [(0, '289.305')] |
| [2023-07-16 21:14:40,068][236503] Updated weights for policy 0, policy_version 3840 (0.0005) |
| [2023-07-16 21:14:43,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10520.3). Total num frames: 1998848. Throughput: 0: 10558.6. Samples: 1986632. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-16 21:14:43,345][236218] Avg episode reward: [(0, '320.637')] |
| [2023-07-16 21:14:43,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000003904_1998848.pth... |
| [2023-07-16 21:14:43,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000003288_1683456.pth |
| [2023-07-16 21:14:43,933][236503] Updated weights for policy 0, policy_version 3920 (0.0005) |
| [2023-07-16 21:14:47,824][236503] Updated weights for policy 0, policy_version 4000 (0.0005) |
| [2023-07-16 21:14:48,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10523.6). Total num frames: 2052096. Throughput: 0: 10559.3. Samples: 2050584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:14:48,346][236218] Avg episode reward: [(0, '325.632')] |
| [2023-07-16 21:14:51,721][236503] Updated weights for policy 0, policy_version 4080 (0.0005) |
| [2023-07-16 21:14:53,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10526.7). Total num frames: 2105344. Throughput: 0: 10546.3. Samples: 2081448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:14:53,345][236218] Avg episode reward: [(0, '331.527')] |
| [2023-07-16 21:14:55,549][236503] Updated weights for policy 0, policy_version 4160 (0.0005) |
| [2023-07-16 21:14:58,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10529.7). Total num frames: 2158592. Throughput: 0: 10559.7. Samples: 2146368. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-16 21:14:58,345][236218] Avg episode reward: [(0, '337.688')] |
| [2023-07-16 21:14:58,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000004216_2158592.pth... |
| [2023-07-16 21:14:58,350][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000003600_1843200.pth |
| [2023-07-16 21:14:59,227][236503] Updated weights for policy 0, policy_version 4240 (0.0005) |
| [2023-07-16 21:15:02,807][236503] Updated weights for policy 0, policy_version 4320 (0.0004) |
| [2023-07-16 21:15:03,345][236218] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10552.1). Total num frames: 2215936. Throughput: 0: 10694.7. Samples: 2214720. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-16 21:15:03,345][236218] Avg episode reward: [(0, '307.850')] |
| [2023-07-16 21:15:06,380][236503] Updated weights for policy 0, policy_version 4400 (0.0004) |
| [2023-07-16 21:15:08,345][236218] Fps is (10 sec: 11468.8, 60 sec: 10717.9, 300 sec: 10573.4). Total num frames: 2273280. Throughput: 0: 10741.3. Samples: 2248800. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-16 21:15:08,345][236218] Avg episode reward: [(0, '317.490')] |
| [2023-07-16 21:15:09,918][236503] Updated weights for policy 0, policy_version 4480 (0.0004) |
| [2023-07-16 21:15:13,345][236218] Fps is (10 sec: 11468.8, 60 sec: 10786.1, 300 sec: 10593.8). Total num frames: 2330624. Throughput: 0: 10890.1. Samples: 2318684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:15:13,345][236218] Avg episode reward: [(0, '312.388')] |
| [2023-07-16 21:15:13,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000004552_2330624.pth... |
| [2023-07-16 21:15:13,350][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000003904_1998848.pth |
| [2023-07-16 21:15:13,414][236503] Updated weights for policy 0, policy_version 4560 (0.0004) |
| [2023-07-16 21:15:17,019][236503] Updated weights for policy 0, policy_version 4640 (0.0004) |
| [2023-07-16 21:15:18,345][236218] Fps is (10 sec: 11468.8, 60 sec: 10854.4, 300 sec: 10613.2). Total num frames: 2387968. Throughput: 0: 10997.0. Samples: 2387224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:15:18,345][236218] Avg episode reward: [(0, '310.394')] |
| [2023-07-16 21:15:20,906][236503] Updated weights for policy 0, policy_version 4720 (0.0005) |
| [2023-07-16 21:15:23,345][236218] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 10614.0). Total num frames: 2441216. Throughput: 0: 10977.9. Samples: 2418104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:15:23,345][236218] Avg episode reward: [(0, '312.096')] |
| [2023-07-16 21:15:24,735][236503] Updated weights for policy 0, policy_version 4800 (0.0006) |
| [2023-07-16 21:15:28,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10922.7, 300 sec: 10614.7). Total num frames: 2494464. Throughput: 0: 11012.1. Samples: 2482176. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-16 21:15:28,345][236218] Avg episode reward: [(0, '291.815')] |
| [2023-07-16 21:15:28,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000004872_2494464.pth... |
| [2023-07-16 21:15:28,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000004216_2158592.pth |
| [2023-07-16 21:15:28,616][236503] Updated weights for policy 0, policy_version 4880 (0.0005) |
| [2023-07-16 21:15:32,480][236503] Updated weights for policy 0, policy_version 4960 (0.0005) |
| [2023-07-16 21:15:33,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10615.5). Total num frames: 2547712. Throughput: 0: 11005.8. Samples: 2545844. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-16 21:15:33,345][236218] Avg episode reward: [(0, '311.902')] |
| [2023-07-16 21:15:36,350][236503] Updated weights for policy 0, policy_version 5040 (0.0005) |
| [2023-07-16 21:15:38,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10616.2). Total num frames: 2600960. Throughput: 0: 11021.7. Samples: 2577424. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-16 21:15:38,345][236218] Avg episode reward: [(0, '328.719')] |
| [2023-07-16 21:15:40,243][236503] Updated weights for policy 0, policy_version 5120 (0.0005) |
| [2023-07-16 21:15:43,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10616.8). Total num frames: 2654208. Throughput: 0: 10973.8. Samples: 2640188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:15:43,345][236218] Avg episode reward: [(0, '303.130')] |
| [2023-07-16 21:15:43,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000005184_2654208.pth... |
| [2023-07-16 21:15:43,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000004552_2330624.pth |
| [2023-07-16 21:15:44,041][236503] Updated weights for policy 0, policy_version 5200 (0.0005) |
| [2023-07-16 21:15:47,828][236503] Updated weights for policy 0, policy_version 5280 (0.0004) |
| [2023-07-16 21:15:48,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10617.5). Total num frames: 2707456. Throughput: 0: 10915.0. Samples: 2705896. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-16 21:15:48,345][236218] Avg episode reward: [(0, '324.067')] |
| [2023-07-16 21:15:51,713][236503] Updated weights for policy 0, policy_version 5360 (0.0005) |
| [2023-07-16 21:15:53,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10618.1). Total num frames: 2760704. Throughput: 0: 10849.4. Samples: 2737024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:15:53,345][236218] Avg episode reward: [(0, '322.452')] |
| [2023-07-16 21:15:55,523][236503] Updated weights for policy 0, policy_version 5440 (0.0005) |
| [2023-07-16 21:15:58,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10618.7). Total num frames: 2813952. Throughput: 0: 10734.3. Samples: 2801728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:15:58,345][236218] Avg episode reward: [(0, '291.760')] |
| [2023-07-16 21:15:58,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000005496_2813952.pth... |
| [2023-07-16 21:15:58,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000004872_2494464.pth |
| [2023-07-16 21:15:59,342][236503] Updated weights for policy 0, policy_version 5520 (0.0005) |
| [2023-07-16 21:16:03,213][236503] Updated weights for policy 0, policy_version 5600 (0.0005) |
| [2023-07-16 21:16:03,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 10619.3). Total num frames: 2867200. Throughput: 0: 10632.4. Samples: 2865684. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-16 21:16:03,345][236218] Avg episode reward: [(0, '315.793')] |
| [2023-07-16 21:16:07,156][236503] Updated weights for policy 0, policy_version 5680 (0.0005) |
| [2023-07-16 21:16:08,345][236218] Fps is (10 sec: 10649.4, 60 sec: 10786.1, 300 sec: 10619.8). Total num frames: 2920448. Throughput: 0: 10627.4. Samples: 2896340. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-16 21:16:08,345][236218] Avg episode reward: [(0, '312.489')] |
| [2023-07-16 21:16:10,968][236503] Updated weights for policy 0, policy_version 5760 (0.0005) |
| [2023-07-16 21:16:13,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10620.3). Total num frames: 2973696. Throughput: 0: 10642.6. Samples: 2961092. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-16 21:16:13,345][236218] Avg episode reward: [(0, '348.978')] |
| [2023-07-16 21:16:13,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000005808_2973696.pth... |
| [2023-07-16 21:16:13,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000005184_2654208.pth |
| [2023-07-16 21:16:13,352][236459] Saving new best policy, reward=348.978! |
| [2023-07-16 21:16:14,787][236503] Updated weights for policy 0, policy_version 5840 (0.0005) |
| [2023-07-16 21:16:18,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10620.9). Total num frames: 3026944. Throughput: 0: 10627.3. Samples: 3024072. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-16 21:16:18,345][236218] Avg episode reward: [(0, '318.332')] |
| [2023-07-16 21:16:18,702][236503] Updated weights for policy 0, policy_version 5920 (0.0005) |
| [2023-07-16 21:16:22,591][236503] Updated weights for policy 0, policy_version 6000 (0.0006) |
| [2023-07-16 21:16:23,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10607.2). Total num frames: 3076096. Throughput: 0: 10627.9. Samples: 3055680. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-16 21:16:23,345][236218] Avg episode reward: [(0, '317.397')] |
| [2023-07-16 21:16:26,512][236503] Updated weights for policy 0, policy_version 6080 (0.0005) |
| [2023-07-16 21:16:28,345][236218] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10608.0). Total num frames: 3129344. Throughput: 0: 10636.7. Samples: 3118840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:16:28,345][236218] Avg episode reward: [(0, '311.370')] |
| [2023-07-16 21:16:28,403][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000006120_3133440.pth... |
| [2023-07-16 21:16:28,405][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000005496_2813952.pth |
| [2023-07-16 21:16:30,328][236503] Updated weights for policy 0, policy_version 6160 (0.0005) |
| [2023-07-16 21:16:33,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10746.8). Total num frames: 3182592. Throughput: 0: 10595.4. Samples: 3182688. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-16 21:16:33,345][236218] Avg episode reward: [(0, '307.929')] |
| [2023-07-16 21:16:34,110][236503] Updated weights for policy 0, policy_version 6240 (0.0005) |
| [2023-07-16 21:16:37,879][236503] Updated weights for policy 0, policy_version 6320 (0.0005) |
| [2023-07-16 21:16:38,345][236218] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10746.8). Total num frames: 3239936. Throughput: 0: 10648.0. Samples: 3216184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:16:38,345][236218] Avg episode reward: [(0, '302.525')] |
| [2023-07-16 21:16:41,739][236503] Updated weights for policy 0, policy_version 6400 (0.0005) |
| [2023-07-16 21:16:43,345][236218] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10746.8). Total num frames: 3293184. Throughput: 0: 10638.8. Samples: 3280476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:16:43,345][236218] Avg episode reward: [(0, '276.074')] |
| [2023-07-16 21:16:43,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000006432_3293184.pth... |
| [2023-07-16 21:16:43,350][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000005808_2973696.pth |
| [2023-07-16 21:16:45,567][236503] Updated weights for policy 0, policy_version 6480 (0.0005) |
| [2023-07-16 21:16:48,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10732.9). Total num frames: 3346432. Throughput: 0: 10671.7. Samples: 3345912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:16:48,345][236218] Avg episode reward: [(0, '341.587')] |
| [2023-07-16 21:16:49,323][236503] Updated weights for policy 0, policy_version 6560 (0.0005) |
| [2023-07-16 21:16:53,084][236503] Updated weights for policy 0, policy_version 6640 (0.0005) |
| [2023-07-16 21:16:53,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10732.9). Total num frames: 3399680. Throughput: 0: 10705.7. Samples: 3378096. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-16 21:16:53,345][236218] Avg episode reward: [(0, '353.120')] |
| [2023-07-16 21:16:53,346][236459] Saving new best policy, reward=353.120! |
| [2023-07-16 21:16:56,982][236503] Updated weights for policy 0, policy_version 6720 (0.0005) |
| [2023-07-16 21:16:58,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10732.9). Total num frames: 3452928. Throughput: 0: 10669.2. Samples: 3441208. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-16 21:16:58,345][236218] Avg episode reward: [(0, '351.068')] |
| [2023-07-16 21:16:58,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000006744_3452928.pth... |
| [2023-07-16 21:16:58,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000006120_3133440.pth |
| [2023-07-16 21:17:00,844][236503] Updated weights for policy 0, policy_version 6800 (0.0005) |
| [2023-07-16 21:17:03,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10746.8). Total num frames: 3506176. Throughput: 0: 10700.6. Samples: 3505600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:17:03,345][236218] Avg episode reward: [(0, '322.202')] |
| [2023-07-16 21:17:04,729][236503] Updated weights for policy 0, policy_version 6880 (0.0005) |
| [2023-07-16 21:17:08,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10746.8). Total num frames: 3559424. Throughput: 0: 10687.7. Samples: 3536628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:17:08,345][236218] Avg episode reward: [(0, '378.356')] |
| [2023-07-16 21:17:08,346][236459] Saving new best policy, reward=378.356! |
| [2023-07-16 21:17:08,649][236503] Updated weights for policy 0, policy_version 6960 (0.0005) |
| [2023-07-16 21:17:12,560][236503] Updated weights for policy 0, policy_version 7040 (0.0005) |
| [2023-07-16 21:17:13,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10732.9). Total num frames: 3612672. Throughput: 0: 10685.3. Samples: 3599680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:17:13,345][236218] Avg episode reward: [(0, '345.346')] |
| [2023-07-16 21:17:13,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000007056_3612672.pth... |
| [2023-07-16 21:17:13,352][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000006432_3293184.pth |
| [2023-07-16 21:17:16,453][236503] Updated weights for policy 0, policy_version 7120 (0.0005) |
| [2023-07-16 21:17:18,345][236218] Fps is (10 sec: 10240.1, 60 sec: 10581.4, 300 sec: 10705.1). Total num frames: 3661824. Throughput: 0: 10649.1. Samples: 3661896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:17:18,345][236218] Avg episode reward: [(0, '330.749')] |
| [2023-07-16 21:17:20,383][236503] Updated weights for policy 0, policy_version 7200 (0.0005) |
| [2023-07-16 21:17:23,345][236218] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10705.1). Total num frames: 3715072. Throughput: 0: 10616.4. Samples: 3693920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:17:23,345][236218] Avg episode reward: [(0, '352.672')] |
| [2023-07-16 21:17:24,228][236503] Updated weights for policy 0, policy_version 7280 (0.0005) |
| [2023-07-16 21:17:28,124][236503] Updated weights for policy 0, policy_version 7360 (0.0005) |
| [2023-07-16 21:17:28,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10691.3). Total num frames: 3768320. Throughput: 0: 10591.5. Samples: 3757092. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-16 21:17:28,345][236218] Avg episode reward: [(0, '365.181')] |
| [2023-07-16 21:17:28,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000007360_3768320.pth... |
| [2023-07-16 21:17:28,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000006744_3452928.pth |
| [2023-07-16 21:17:31,922][236503] Updated weights for policy 0, policy_version 7440 (0.0005) |
| [2023-07-16 21:17:33,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10677.4). Total num frames: 3821568. Throughput: 0: 10571.7. Samples: 3821640. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-16 21:17:33,345][236218] Avg episode reward: [(0, '357.987')] |
| [2023-07-16 21:17:35,731][236503] Updated weights for policy 0, policy_version 7520 (0.0005) |
| [2023-07-16 21:17:38,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10677.4). Total num frames: 3874816. Throughput: 0: 10574.9. Samples: 3853968. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-16 21:17:38,345][236218] Avg episode reward: [(0, '363.330')] |
| [2023-07-16 21:17:39,604][236503] Updated weights for policy 0, policy_version 7600 (0.0005) |
| [2023-07-16 21:17:43,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10677.4). Total num frames: 3928064. Throughput: 0: 10569.2. Samples: 3916820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:17:43,345][236218] Avg episode reward: [(0, '344.849')] |
| [2023-07-16 21:17:43,349][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000007672_3928064.pth... |
| [2023-07-16 21:17:43,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000007056_3612672.pth |
| [2023-07-16 21:17:43,531][236503] Updated weights for policy 0, policy_version 7680 (0.0005) |
| [2023-07-16 21:17:47,263][236503] Updated weights for policy 0, policy_version 7760 (0.0004) |
| [2023-07-16 21:17:48,345][236218] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10691.3). Total num frames: 3985408. Throughput: 0: 10590.8. Samples: 3982188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:17:48,345][236218] Avg episode reward: [(0, '318.030')] |
| [2023-07-16 21:17:50,866][236503] Updated weights for policy 0, policy_version 7840 (0.0004) |
| [2023-07-16 21:17:53,345][236218] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10691.3). Total num frames: 4038656. Throughput: 0: 10655.8. Samples: 4016140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:17:53,345][236218] Avg episode reward: [(0, '340.550')] |
| [2023-07-16 21:17:54,455][236503] Updated weights for policy 0, policy_version 7920 (0.0004) |
| [2023-07-16 21:17:58,064][236503] Updated weights for policy 0, policy_version 8000 (0.0004) |
| [2023-07-16 21:17:58,345][236218] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10705.1). Total num frames: 4096000. Throughput: 0: 10757.9. Samples: 4083784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:17:58,345][236218] Avg episode reward: [(0, '362.147')] |
| [2023-07-16 21:17:58,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000008000_4096000.pth... |
| [2023-07-16 21:17:58,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000007360_3768320.pth |
| [2023-07-16 21:18:01,666][236503] Updated weights for policy 0, policy_version 8080 (0.0004) |
| [2023-07-16 21:18:03,345][236218] Fps is (10 sec: 11468.7, 60 sec: 10786.1, 300 sec: 10719.0). Total num frames: 4153344. Throughput: 0: 10907.5. Samples: 4152736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:18:03,345][236218] Avg episode reward: [(0, '333.864')] |
| [2023-07-16 21:18:05,262][236503] Updated weights for policy 0, policy_version 8160 (0.0004) |
| [2023-07-16 21:18:08,345][236218] Fps is (10 sec: 11468.8, 60 sec: 10854.4, 300 sec: 10732.9). Total num frames: 4210688. Throughput: 0: 10939.0. Samples: 4186176. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-16 21:18:08,346][236218] Avg episode reward: [(0, '335.565')] |
| [2023-07-16 21:18:09,108][236503] Updated weights for policy 0, policy_version 8240 (0.0005) |
| [2023-07-16 21:18:12,960][236503] Updated weights for policy 0, policy_version 8320 (0.0005) |
| [2023-07-16 21:18:13,345][236218] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10732.9). Total num frames: 4263936. Throughput: 0: 10968.8. Samples: 4250688. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-16 21:18:13,346][236218] Avg episode reward: [(0, '342.407')] |
| [2023-07-16 21:18:13,349][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000008328_4263936.pth... |
| [2023-07-16 21:18:13,352][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000007672_3928064.pth |
| [2023-07-16 21:18:16,814][236503] Updated weights for policy 0, policy_version 8400 (0.0005) |
| [2023-07-16 21:18:18,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10854.4, 300 sec: 10705.1). Total num frames: 4313088. Throughput: 0: 10932.2. Samples: 4313588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:18:18,346][236218] Avg episode reward: [(0, '344.939')] |
| [2023-07-16 21:18:20,666][236503] Updated weights for policy 0, policy_version 8480 (0.0005) |
| [2023-07-16 21:18:23,345][236218] Fps is (10 sec: 10240.1, 60 sec: 10854.4, 300 sec: 10705.1). Total num frames: 4366336. Throughput: 0: 10932.3. Samples: 4345920. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-16 21:18:23,346][236218] Avg episode reward: [(0, '373.959')] |
| [2023-07-16 21:18:24,529][236503] Updated weights for policy 0, policy_version 8560 (0.0005) |
| [2023-07-16 21:18:28,164][236503] Updated weights for policy 0, policy_version 8640 (0.0004) |
| [2023-07-16 21:18:28,345][236218] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 10719.0). Total num frames: 4423680. Throughput: 0: 10980.1. Samples: 4410924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:18:28,422][236218] Avg episode reward: [(0, '366.736')] |
| [2023-07-16 21:18:28,425][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000008640_4423680.pth... |
| [2023-07-16 21:18:28,428][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000008000_4096000.pth |
| [2023-07-16 21:18:31,795][236503] Updated weights for policy 0, policy_version 8720 (0.0004) |
| [2023-07-16 21:18:33,345][236218] Fps is (10 sec: 11468.8, 60 sec: 10990.9, 300 sec: 10732.9). Total num frames: 4481024. Throughput: 0: 11039.9. Samples: 4478984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:18:33,345][236218] Avg episode reward: [(0, '356.037')] |
| [2023-07-16 21:18:35,399][236503] Updated weights for policy 0, policy_version 8800 (0.0004) |
| [2023-07-16 21:18:38,345][236218] Fps is (10 sec: 11059.1, 60 sec: 10990.9, 300 sec: 10732.9). Total num frames: 4534272. Throughput: 0: 11047.4. Samples: 4513276. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-16 21:18:38,345][236218] Avg episode reward: [(0, '359.136')] |
| [2023-07-16 21:18:39,136][236503] Updated weights for policy 0, policy_version 8880 (0.0005) |
| [2023-07-16 21:18:42,698][236503] Updated weights for policy 0, policy_version 8960 (0.0004) |
| [2023-07-16 21:18:43,345][236218] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10760.7). Total num frames: 4591616. Throughput: 0: 11016.6. Samples: 4579532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:18:43,345][236218] Avg episode reward: [(0, '341.233')] |
| [2023-07-16 21:18:43,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000008968_4591616.pth... |
| [2023-07-16 21:18:43,350][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000008328_4263936.pth |
| [2023-07-16 21:18:46,342][236503] Updated weights for policy 0, policy_version 9040 (0.0004) |
| [2023-07-16 21:18:48,345][236218] Fps is (10 sec: 11469.0, 60 sec: 11059.2, 300 sec: 10774.6). Total num frames: 4648960. Throughput: 0: 10993.7. Samples: 4647452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:18:48,345][236218] Avg episode reward: [(0, '353.108')] |
| [2023-07-16 21:18:49,991][236503] Updated weights for policy 0, policy_version 9120 (0.0004) |
| [2023-07-16 21:18:53,345][236218] Fps is (10 sec: 11059.3, 60 sec: 11059.2, 300 sec: 10774.6). Total num frames: 4702208. Throughput: 0: 11002.4. Samples: 4681284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:18:53,345][236218] Avg episode reward: [(0, '372.501')] |
| [2023-07-16 21:18:53,791][236503] Updated weights for policy 0, policy_version 9200 (0.0005) |
| [2023-07-16 21:18:57,646][236503] Updated weights for policy 0, policy_version 9280 (0.0005) |
| [2023-07-16 21:18:58,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10990.9, 300 sec: 10774.6). Total num frames: 4755456. Throughput: 0: 10990.2. Samples: 4745248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:18:58,345][236218] Avg episode reward: [(0, '355.035')] |
| [2023-07-16 21:18:58,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000009288_4755456.pth... |
| [2023-07-16 21:18:58,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000008640_4423680.pth |
| [2023-07-16 21:19:01,526][236503] Updated weights for policy 0, policy_version 9360 (0.0004) |
| [2023-07-16 21:19:03,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10922.7, 300 sec: 10774.6). Total num frames: 4808704. Throughput: 0: 11002.3. Samples: 4808692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:19:03,346][236218] Avg episode reward: [(0, '348.041')] |
| [2023-07-16 21:19:05,396][236503] Updated weights for policy 0, policy_version 9440 (0.0005) |
| [2023-07-16 21:19:08,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 10774.6). Total num frames: 4861952. Throughput: 0: 10996.2. Samples: 4840748. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-16 21:19:08,346][236218] Avg episode reward: [(0, '361.707')] |
| [2023-07-16 21:19:09,178][236503] Updated weights for policy 0, policy_version 9520 (0.0005) |
| [2023-07-16 21:19:12,835][236503] Updated weights for policy 0, policy_version 9600 (0.0005) |
| [2023-07-16 21:19:13,345][236218] Fps is (10 sec: 11059.1, 60 sec: 10922.7, 300 sec: 10788.4). Total num frames: 4919296. Throughput: 0: 11016.4. Samples: 4906664. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-16 21:19:13,440][236218] Avg episode reward: [(0, '349.418')] |
| [2023-07-16 21:19:13,444][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000009608_4919296.pth... |
| [2023-07-16 21:19:13,446][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000008968_4591616.pth |
| [2023-07-16 21:19:16,687][236503] Updated weights for policy 0, policy_version 9680 (0.0004) |
| [2023-07-16 21:19:18,345][236218] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10802.3). Total num frames: 4972544. Throughput: 0: 10911.1. Samples: 4969984. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-16 21:19:18,346][236218] Avg episode reward: [(0, '358.805')] |
| [2023-07-16 21:19:20,610][236503] Updated weights for policy 0, policy_version 9760 (0.0006) |
| [2023-07-16 21:19:23,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10990.9, 300 sec: 10802.3). Total num frames: 5025792. Throughput: 0: 10852.0. Samples: 5001616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:19:23,345][236218] Avg episode reward: [(0, '365.747')] |
| [2023-07-16 21:19:24,461][236503] Updated weights for policy 0, policy_version 9840 (0.0006) |
| [2023-07-16 21:19:28,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10854.4, 300 sec: 10788.4). Total num frames: 5074944. Throughput: 0: 10785.3. Samples: 5064872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:19:28,345][236218] Avg episode reward: [(0, '360.695')] |
| [2023-07-16 21:19:28,380][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000009920_5079040.pth... |
| [2023-07-16 21:19:28,381][236503] Updated weights for policy 0, policy_version 9920 (0.0006) |
| [2023-07-16 21:19:28,382][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000009288_4755456.pth |
| [2023-07-16 21:19:32,209][236503] Updated weights for policy 0, policy_version 10000 (0.0005) |
| [2023-07-16 21:19:33,345][236218] Fps is (10 sec: 10240.1, 60 sec: 10786.2, 300 sec: 10788.4). Total num frames: 5128192. Throughput: 0: 10695.2. Samples: 5128736. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-16 21:19:33,345][236218] Avg episode reward: [(0, '368.424')] |
| [2023-07-16 21:19:36,025][236503] Updated weights for policy 0, policy_version 10080 (0.0005) |
| [2023-07-16 21:19:38,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10786.2, 300 sec: 10788.4). Total num frames: 5181440. Throughput: 0: 10661.0. Samples: 5161032. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-16 21:19:38,346][236218] Avg episode reward: [(0, '340.016')] |
| [2023-07-16 21:19:39,903][236503] Updated weights for policy 0, policy_version 10160 (0.0005) |
| [2023-07-16 21:19:43,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10788.4). Total num frames: 5234688. Throughput: 0: 10660.5. Samples: 5224968. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-16 21:19:43,345][236218] Avg episode reward: [(0, '412.824')] |
| [2023-07-16 21:19:43,352][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000010232_5238784.pth... |
| [2023-07-16 21:19:43,354][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000009608_4919296.pth |
| [2023-07-16 21:19:43,355][236459] Saving new best policy, reward=412.824! |
| [2023-07-16 21:19:43,727][236503] Updated weights for policy 0, policy_version 10240 (0.0005) |
| [2023-07-16 21:19:47,667][236503] Updated weights for policy 0, policy_version 10320 (0.0005) |
| [2023-07-16 21:19:48,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10788.4). Total num frames: 5287936. Throughput: 0: 10650.0. Samples: 5287944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:19:48,346][236218] Avg episode reward: [(0, '380.486')] |
| [2023-07-16 21:19:51,577][236503] Updated weights for policy 0, policy_version 10400 (0.0005) |
| [2023-07-16 21:19:53,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10788.4). Total num frames: 5341184. Throughput: 0: 10631.2. Samples: 5319152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:19:53,345][236218] Avg episode reward: [(0, '340.372')] |
| [2023-07-16 21:19:55,469][236503] Updated weights for policy 0, policy_version 10480 (0.0005) |
| [2023-07-16 21:19:58,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10774.6). Total num frames: 5394432. Throughput: 0: 10566.5. Samples: 5382156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:19:58,346][236218] Avg episode reward: [(0, '384.036')] |
| [2023-07-16 21:19:58,349][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000010536_5394432.pth... |
| [2023-07-16 21:19:58,352][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000009920_5079040.pth |
| [2023-07-16 21:19:59,348][236503] Updated weights for policy 0, policy_version 10560 (0.0004) |
| [2023-07-16 21:20:03,178][236503] Updated weights for policy 0, policy_version 10640 (0.0005) |
| [2023-07-16 21:20:03,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10760.7). Total num frames: 5447680. Throughput: 0: 10591.4. Samples: 5446596. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-16 21:20:03,346][236218] Avg episode reward: [(0, '354.059')] |
| [2023-07-16 21:20:07,100][236503] Updated weights for policy 0, policy_version 10720 (0.0005) |
| [2023-07-16 21:20:08,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10746.8). Total num frames: 5500928. Throughput: 0: 10569.7. Samples: 5477252. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-16 21:20:08,346][236218] Avg episode reward: [(0, '402.893')] |
| [2023-07-16 21:20:10,977][236503] Updated weights for policy 0, policy_version 10800 (0.0005) |
| [2023-07-16 21:20:13,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10732.9). Total num frames: 5554176. Throughput: 0: 10595.8. Samples: 5541684. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-16 21:20:13,345][236218] Avg episode reward: [(0, '384.704')] |
| [2023-07-16 21:20:13,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000010848_5554176.pth... |
| [2023-07-16 21:20:13,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000010232_5238784.pth |
| [2023-07-16 21:20:14,783][236503] Updated weights for policy 0, policy_version 10880 (0.0005) |
| [2023-07-16 21:20:18,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10732.9). Total num frames: 5607424. Throughput: 0: 10584.3. Samples: 5605032. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-16 21:20:18,345][236218] Avg episode reward: [(0, '374.301')] |
| [2023-07-16 21:20:18,649][236503] Updated weights for policy 0, policy_version 10960 (0.0005) |
| [2023-07-16 21:20:22,509][236503] Updated weights for policy 0, policy_version 11040 (0.0005) |
| [2023-07-16 21:20:23,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10732.9). Total num frames: 5660672. Throughput: 0: 10571.6. Samples: 5636752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:20:23,345][236218] Avg episode reward: [(0, '348.431')] |
| [2023-07-16 21:20:26,403][236503] Updated weights for policy 0, policy_version 11120 (0.0005) |
| [2023-07-16 21:20:28,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10581.4, 300 sec: 10719.0). Total num frames: 5709824. Throughput: 0: 10571.4. Samples: 5700680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:20:28,345][236218] Avg episode reward: [(0, '346.847')] |
| [2023-07-16 21:20:28,356][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000011160_5713920.pth... |
| [2023-07-16 21:20:28,358][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000010536_5394432.pth |
| [2023-07-16 21:20:30,261][236503] Updated weights for policy 0, policy_version 11200 (0.0005) |
| [2023-07-16 21:20:33,345][236218] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10719.0). Total num frames: 5763072. Throughput: 0: 10565.7. Samples: 5763400. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-16 21:20:33,345][236218] Avg episode reward: [(0, '330.010')] |
| [2023-07-16 21:20:34,143][236503] Updated weights for policy 0, policy_version 11280 (0.0005) |
| [2023-07-16 21:20:38,027][236503] Updated weights for policy 0, policy_version 11360 (0.0005) |
| [2023-07-16 21:20:38,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10719.0). Total num frames: 5816320. Throughput: 0: 10593.2. Samples: 5795848. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-16 21:20:38,345][236218] Avg episode reward: [(0, '343.456')] |
| [2023-07-16 21:20:41,847][236503] Updated weights for policy 0, policy_version 11440 (0.0005) |
| [2023-07-16 21:20:43,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10719.0). Total num frames: 5869568. Throughput: 0: 10608.3. Samples: 5859528. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-16 21:20:43,345][236218] Avg episode reward: [(0, '388.275')] |
| [2023-07-16 21:20:43,373][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000011472_5873664.pth... |
| [2023-07-16 21:20:43,375][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000010848_5554176.pth |
| [2023-07-16 21:20:45,718][236503] Updated weights for policy 0, policy_version 11520 (0.0005) |
| [2023-07-16 21:20:48,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10719.0). Total num frames: 5922816. Throughput: 0: 10584.1. Samples: 5922880. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-16 21:20:48,345][236218] Avg episode reward: [(0, '380.116')] |
| [2023-07-16 21:20:49,559][236503] Updated weights for policy 0, policy_version 11600 (0.0005) |
| [2023-07-16 21:20:53,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10719.0). Total num frames: 5976064. Throughput: 0: 10617.9. Samples: 5955056. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-16 21:20:53,345][236218] Avg episode reward: [(0, '377.738')] |
| [2023-07-16 21:20:53,428][236503] Updated weights for policy 0, policy_version 11680 (0.0005) |
| [2023-07-16 21:20:57,337][236503] Updated weights for policy 0, policy_version 11760 (0.0005) |
| [2023-07-16 21:20:58,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10719.0). Total num frames: 6029312. Throughput: 0: 10583.2. Samples: 6017928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:20:58,345][236218] Avg episode reward: [(0, '344.007')] |
| [2023-07-16 21:20:58,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000011776_6029312.pth... |
| [2023-07-16 21:20:58,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000011160_5713920.pth |
| [2023-07-16 21:21:01,193][236503] Updated weights for policy 0, policy_version 11840 (0.0005) |
| [2023-07-16 21:21:03,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10719.0). Total num frames: 6082560. Throughput: 0: 10604.5. Samples: 6082232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:21:03,345][236218] Avg episode reward: [(0, '351.864')] |
| [2023-07-16 21:21:05,102][236503] Updated weights for policy 0, policy_version 11920 (0.0005) |
| [2023-07-16 21:21:08,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10719.0). Total num frames: 6135808. Throughput: 0: 10581.4. Samples: 6112916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:21:08,345][236218] Avg episode reward: [(0, '340.063')] |
| [2023-07-16 21:21:08,972][236503] Updated weights for policy 0, policy_version 12000 (0.0005) |
| [2023-07-16 21:21:12,859][236503] Updated weights for policy 0, policy_version 12080 (0.0005) |
| [2023-07-16 21:21:13,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10719.0). Total num frames: 6189056. Throughput: 0: 10578.9. Samples: 6176732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:21:13,345][236218] Avg episode reward: [(0, '355.050')] |
| [2023-07-16 21:21:13,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000012088_6189056.pth... |
| [2023-07-16 21:21:13,350][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000011472_5873664.pth |
| [2023-07-16 21:21:16,834][236503] Updated weights for policy 0, policy_version 12160 (0.0005) |
| [2023-07-16 21:21:18,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10719.0). Total num frames: 6238208. Throughput: 0: 10560.1. Samples: 6238604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:21:18,345][236218] Avg episode reward: [(0, '371.653')] |
| [2023-07-16 21:21:20,740][236503] Updated weights for policy 0, policy_version 12240 (0.0005) |
| [2023-07-16 21:21:23,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10719.0). Total num frames: 6291456. Throughput: 0: 10550.0. Samples: 6270596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:21:23,346][236218] Avg episode reward: [(0, '342.909')] |
| [2023-07-16 21:21:24,659][236503] Updated weights for policy 0, policy_version 12320 (0.0005) |
| [2023-07-16 21:21:28,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10719.0). Total num frames: 6344704. Throughput: 0: 10510.0. Samples: 6332480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:21:28,345][236218] Avg episode reward: [(0, '372.273')] |
| [2023-07-16 21:21:28,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000012392_6344704.pth... |
| [2023-07-16 21:21:28,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000011776_6029312.pth |
| [2023-07-16 21:21:28,605][236503] Updated weights for policy 0, policy_version 12400 (0.0005) |
| [2023-07-16 21:21:32,464][236503] Updated weights for policy 0, policy_version 12480 (0.0005) |
| [2023-07-16 21:21:33,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10705.1). Total num frames: 6397952. Throughput: 0: 10517.2. Samples: 6396156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:21:33,347][236218] Avg episode reward: [(0, '396.764')] |
| [2023-07-16 21:21:36,356][236503] Updated weights for policy 0, policy_version 12560 (0.0005) |
| [2023-07-16 21:21:38,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10705.1). Total num frames: 6451200. Throughput: 0: 10500.8. Samples: 6427592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:21:38,345][236218] Avg episode reward: [(0, '371.003')] |
| [2023-07-16 21:21:40,220][236503] Updated weights for policy 0, policy_version 12640 (0.0005) |
| [2023-07-16 21:21:43,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10705.1). Total num frames: 6504448. Throughput: 0: 10521.5. Samples: 6491396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:21:43,345][236218] Avg episode reward: [(0, '397.593')] |
| [2023-07-16 21:21:43,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000012704_6504448.pth... |
| [2023-07-16 21:21:43,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000012088_6189056.pth |
| [2023-07-16 21:21:44,123][236503] Updated weights for policy 0, policy_version 12720 (0.0005) |
| [2023-07-16 21:21:48,053][236503] Updated weights for policy 0, policy_version 12800 (0.0005) |
| [2023-07-16 21:21:48,345][236218] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10691.3). Total num frames: 6553600. Throughput: 0: 10475.3. Samples: 6553620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:21:48,346][236218] Avg episode reward: [(0, '399.547')] |
| [2023-07-16 21:21:51,978][236503] Updated weights for policy 0, policy_version 12880 (0.0005) |
| [2023-07-16 21:21:53,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10691.3). Total num frames: 6606848. Throughput: 0: 10497.0. Samples: 6585280. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-16 21:21:53,346][236218] Avg episode reward: [(0, '395.588')] |
| [2023-07-16 21:21:55,968][236503] Updated weights for policy 0, policy_version 12960 (0.0005) |
| [2023-07-16 21:21:58,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10691.3). Total num frames: 6660096. Throughput: 0: 10453.3. Samples: 6647132. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-16 21:21:58,345][236218] Avg episode reward: [(0, '367.028')] |
| [2023-07-16 21:21:58,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000013008_6660096.pth... |
| [2023-07-16 21:21:58,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000012392_6344704.pth |
| [2023-07-16 21:21:59,859][236503] Updated weights for policy 0, policy_version 13040 (0.0005) |
| [2023-07-16 21:22:03,345][236218] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10677.4). Total num frames: 6709248. Throughput: 0: 10463.4. Samples: 6709456. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-16 21:22:03,345][236218] Avg episode reward: [(0, '390.694')] |
| [2023-07-16 21:22:03,792][236503] Updated weights for policy 0, policy_version 13120 (0.0005) |
| [2023-07-16 21:22:07,737][236503] Updated weights for policy 0, policy_version 13200 (0.0005) |
| [2023-07-16 21:22:08,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10677.4). Total num frames: 6762496. Throughput: 0: 10448.2. Samples: 6740764. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-16 21:22:08,345][236218] Avg episode reward: [(0, '382.731')] |
| [2023-07-16 21:22:11,580][236503] Updated weights for policy 0, policy_version 13280 (0.0005) |
| [2023-07-16 21:22:13,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10691.3). Total num frames: 6815744. Throughput: 0: 10476.5. Samples: 6803924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:22:13,345][236218] Avg episode reward: [(0, '390.087')] |
| [2023-07-16 21:22:13,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000013312_6815744.pth... |
| [2023-07-16 21:22:13,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000012704_6504448.pth |
| [2023-07-16 21:22:15,524][236503] Updated weights for policy 0, policy_version 13360 (0.0005) |
| [2023-07-16 21:22:18,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10691.3). Total num frames: 6868992. Throughput: 0: 10450.5. Samples: 6866428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:22:18,345][236218] Avg episode reward: [(0, '384.290')] |
| [2023-07-16 21:22:19,506][236503] Updated weights for policy 0, policy_version 13440 (0.0005) |
| [2023-07-16 21:22:23,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10677.4). Total num frames: 6918144. Throughput: 0: 10446.2. Samples: 6897672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:22:23,345][236218] Avg episode reward: [(0, '396.863')] |
| [2023-07-16 21:22:23,456][236503] Updated weights for policy 0, policy_version 13520 (0.0005) |
| [2023-07-16 21:22:27,256][236503] Updated weights for policy 0, policy_version 13600 (0.0005) |
| [2023-07-16 21:22:28,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10677.4). Total num frames: 6971392. Throughput: 0: 10441.2. Samples: 6961252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:22:28,345][236218] Avg episode reward: [(0, '391.764')] |
| [2023-07-16 21:22:28,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000013616_6971392.pth... |
| [2023-07-16 21:22:28,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000013008_6660096.pth |
| [2023-07-16 21:22:31,147][236503] Updated weights for policy 0, policy_version 13680 (0.0005) |
| [2023-07-16 21:22:33,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10677.4). Total num frames: 7024640. Throughput: 0: 10467.1. Samples: 7024640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:22:33,345][236218] Avg episode reward: [(0, '382.425')] |
| [2023-07-16 21:22:35,028][236503] Updated weights for policy 0, policy_version 13760 (0.0005) |
| [2023-07-16 21:22:38,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10677.4). Total num frames: 7077888. Throughput: 0: 10454.0. Samples: 7055708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:22:38,345][236218] Avg episode reward: [(0, '431.316')] |
| [2023-07-16 21:22:38,346][236459] Saving new best policy, reward=431.316! |
| [2023-07-16 21:22:39,021][236503] Updated weights for policy 0, policy_version 13840 (0.0005) |
| [2023-07-16 21:22:42,987][236503] Updated weights for policy 0, policy_version 13920 (0.0005) |
| [2023-07-16 21:22:43,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10649.6). Total num frames: 7127040. Throughput: 0: 10454.1. Samples: 7117568. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-16 21:22:43,345][236218] Avg episode reward: [(0, '417.502')] |
| [2023-07-16 21:22:43,399][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000013928_7131136.pth... |
| [2023-07-16 21:22:43,401][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000013312_6815744.pth |
| [2023-07-16 21:22:46,899][236503] Updated weights for policy 0, policy_version 14000 (0.0005) |
| [2023-07-16 21:22:48,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10649.6). Total num frames: 7180288. Throughput: 0: 10463.1. Samples: 7180296. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-16 21:22:48,345][236218] Avg episode reward: [(0, '435.193')] |
| [2023-07-16 21:22:48,346][236459] Saving new best policy, reward=435.193! |
| [2023-07-16 21:22:50,792][236503] Updated weights for policy 0, policy_version 14080 (0.0005) |
| [2023-07-16 21:22:53,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10635.7). Total num frames: 7233536. Throughput: 0: 10468.6. Samples: 7211848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:22:53,345][236218] Avg episode reward: [(0, '445.993')] |
| [2023-07-16 21:22:53,345][236459] Saving new best policy, reward=445.993! |
| [2023-07-16 21:22:54,703][236503] Updated weights for policy 0, policy_version 14160 (0.0005) |
| [2023-07-16 21:22:58,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10621.8). Total num frames: 7286784. Throughput: 0: 10458.6. Samples: 7274560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:22:58,345][236218] Avg episode reward: [(0, '445.761')] |
| [2023-07-16 21:22:58,349][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000014232_7286784.pth... |
| [2023-07-16 21:22:58,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000013616_6971392.pth |
| [2023-07-16 21:22:58,589][236503] Updated weights for policy 0, policy_version 14240 (0.0005) |
| [2023-07-16 21:23:02,576][236503] Updated weights for policy 0, policy_version 14320 (0.0005) |
| [2023-07-16 21:23:03,345][236218] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10594.1). Total num frames: 7335936. Throughput: 0: 10446.1. Samples: 7336504. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-16 21:23:03,345][236218] Avg episode reward: [(0, '418.633')] |
| [2023-07-16 21:23:06,519][236503] Updated weights for policy 0, policy_version 14400 (0.0005) |
| [2023-07-16 21:23:08,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10594.1). Total num frames: 7389184. Throughput: 0: 10460.4. Samples: 7368388. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-16 21:23:08,345][236218] Avg episode reward: [(0, '424.401')] |
| [2023-07-16 21:23:10,428][236503] Updated weights for policy 0, policy_version 14480 (0.0005) |
| [2023-07-16 21:23:13,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10607.9). Total num frames: 7442432. Throughput: 0: 10433.8. Samples: 7430772. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-16 21:23:13,345][236218] Avg episode reward: [(0, '430.765')] |
| [2023-07-16 21:23:13,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000014536_7442432.pth... |
| [2023-07-16 21:23:13,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000013928_7131136.pth |
| [2023-07-16 21:23:14,311][236503] Updated weights for policy 0, policy_version 14560 (0.0005) |
| [2023-07-16 21:23:18,212][236503] Updated weights for policy 0, policy_version 14640 (0.0005) |
| [2023-07-16 21:23:18,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10607.9). Total num frames: 7495680. Throughput: 0: 10439.6. Samples: 7494420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:23:18,345][236218] Avg episode reward: [(0, '439.078')] |
| [2023-07-16 21:23:22,107][236503] Updated weights for policy 0, policy_version 14720 (0.0005) |
| [2023-07-16 21:23:23,345][236218] Fps is (10 sec: 10649.8, 60 sec: 10513.1, 300 sec: 10594.1). Total num frames: 7548928. Throughput: 0: 10430.9. Samples: 7525096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:23:23,345][236218] Avg episode reward: [(0, '403.690')] |
| [2023-07-16 21:23:25,913][236503] Updated weights for policy 0, policy_version 14800 (0.0006) |
| [2023-07-16 21:23:28,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10580.2). Total num frames: 7602176. Throughput: 0: 10496.0. Samples: 7589888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:23:28,345][236218] Avg episode reward: [(0, '439.102')] |
| [2023-07-16 21:23:28,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000014848_7602176.pth... |
| [2023-07-16 21:23:28,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000014232_7286784.pth |
| [2023-07-16 21:23:29,847][236503] Updated weights for policy 0, policy_version 14880 (0.0005) |
| [2023-07-16 21:23:33,345][236218] Fps is (10 sec: 10239.8, 60 sec: 10444.8, 300 sec: 10566.3). Total num frames: 7651328. Throughput: 0: 10467.7. Samples: 7651344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:23:33,346][236218] Avg episode reward: [(0, '425.573')] |
| [2023-07-16 21:23:33,834][236503] Updated weights for policy 0, policy_version 14960 (0.0005) |
| [2023-07-16 21:23:37,708][236503] Updated weights for policy 0, policy_version 15040 (0.0005) |
| [2023-07-16 21:23:38,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10552.4). Total num frames: 7704576. Throughput: 0: 10479.3. Samples: 7683416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:23:38,346][236218] Avg episode reward: [(0, '424.691')] |
| [2023-07-16 21:23:41,663][236503] Updated weights for policy 0, policy_version 15120 (0.0005) |
| [2023-07-16 21:23:43,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 7757824. Throughput: 0: 10466.6. Samples: 7745556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:23:43,345][236218] Avg episode reward: [(0, '462.567')] |
| [2023-07-16 21:23:43,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000015152_7757824.pth... |
| [2023-07-16 21:23:43,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000014536_7442432.pth |
| [2023-07-16 21:23:43,351][236459] Saving new best policy, reward=462.567! |
| [2023-07-16 21:23:45,584][236503] Updated weights for policy 0, policy_version 15200 (0.0005) |
| [2023-07-16 21:23:48,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 7806976. Throughput: 0: 10464.7. Samples: 7807416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:23:48,346][236218] Avg episode reward: [(0, '418.925')] |
| [2023-07-16 21:23:49,554][236503] Updated weights for policy 0, policy_version 15280 (0.0005) |
| [2023-07-16 21:23:53,345][236218] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 7860224. Throughput: 0: 10471.2. Samples: 7839592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:23:53,345][236218] Avg episode reward: [(0, '457.062')] |
| [2023-07-16 21:23:53,521][236503] Updated weights for policy 0, policy_version 15360 (0.0005) |
| [2023-07-16 21:23:57,431][236503] Updated weights for policy 0, policy_version 15440 (0.0005) |
| [2023-07-16 21:23:58,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 7913472. Throughput: 0: 10455.0. Samples: 7901248. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-16 21:23:58,345][236218] Avg episode reward: [(0, '454.163')] |
| [2023-07-16 21:23:58,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000015456_7913472.pth... |
| [2023-07-16 21:23:58,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000014848_7602176.pth |
| [2023-07-16 21:24:01,302][236503] Updated weights for policy 0, policy_version 15520 (0.0005) |
| [2023-07-16 21:24:03,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 7966720. Throughput: 0: 10445.5. Samples: 7964468. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-16 21:24:03,345][236218] Avg episode reward: [(0, '454.024')] |
| [2023-07-16 21:24:05,219][236503] Updated weights for policy 0, policy_version 15600 (0.0005) |
| [2023-07-16 21:24:08,345][236218] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 8015872. Throughput: 0: 10456.2. Samples: 7995624. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-16 21:24:08,345][236218] Avg episode reward: [(0, '439.621')] |
| [2023-07-16 21:24:09,109][236503] Updated weights for policy 0, policy_version 15680 (0.0005) |
| [2023-07-16 21:24:12,969][236503] Updated weights for policy 0, policy_version 15760 (0.0005) |
| [2023-07-16 21:24:13,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 8069120. Throughput: 0: 10449.2. Samples: 8060100. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-16 21:24:13,345][236218] Avg episode reward: [(0, '462.221')] |
| [2023-07-16 21:24:13,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000015768_8073216.pth... |
| [2023-07-16 21:24:13,350][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000015152_7757824.pth |
| [2023-07-16 21:24:16,756][236503] Updated weights for policy 0, policy_version 15840 (0.0005) |
| [2023-07-16 21:24:18,345][236218] Fps is (10 sec: 11059.1, 60 sec: 10513.1, 300 sec: 10510.8). Total num frames: 8126464. Throughput: 0: 10501.8. Samples: 8123924. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-16 21:24:18,345][236218] Avg episode reward: [(0, '433.368')] |
| [2023-07-16 21:24:20,640][236503] Updated weights for policy 0, policy_version 15920 (0.0005) |
| [2023-07-16 21:24:23,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 8175616. Throughput: 0: 10484.1. Samples: 8155200. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-16 21:24:23,345][236218] Avg episode reward: [(0, '461.254')] |
| [2023-07-16 21:24:24,597][236503] Updated weights for policy 0, policy_version 16000 (0.0005) |
| [2023-07-16 21:24:28,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10510.7). Total num frames: 8228864. Throughput: 0: 10493.9. Samples: 8217784. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-16 21:24:28,345][236218] Avg episode reward: [(0, '458.637')] |
| [2023-07-16 21:24:28,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000016072_8228864.pth... |
| [2023-07-16 21:24:28,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000015456_7913472.pth |
| [2023-07-16 21:24:28,508][236503] Updated weights for policy 0, policy_version 16080 (0.0005) |
| [2023-07-16 21:24:32,440][236503] Updated weights for policy 0, policy_version 16160 (0.0005) |
| [2023-07-16 21:24:33,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10510.8). Total num frames: 8282112. Throughput: 0: 10515.8. Samples: 8280628. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-16 21:24:33,345][236218] Avg episode reward: [(0, '451.608')] |
| [2023-07-16 21:24:36,359][236503] Updated weights for policy 0, policy_version 16240 (0.0005) |
| [2023-07-16 21:24:38,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10510.7). Total num frames: 8335360. Throughput: 0: 10493.0. Samples: 8311776. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-16 21:24:38,345][236218] Avg episode reward: [(0, '460.612')] |
| [2023-07-16 21:24:40,291][236503] Updated weights for policy 0, policy_version 16320 (0.0005) |
| [2023-07-16 21:24:43,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 8384512. Throughput: 0: 10502.2. Samples: 8373848. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-16 21:24:43,345][236218] Avg episode reward: [(0, '461.732')] |
| [2023-07-16 21:24:43,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000016376_8384512.pth... |
| [2023-07-16 21:24:43,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000015768_8073216.pth |
| [2023-07-16 21:24:44,266][236503] Updated weights for policy 0, policy_version 16400 (0.0005) |
| [2023-07-16 21:24:48,171][236503] Updated weights for policy 0, policy_version 16480 (0.0005) |
| [2023-07-16 21:24:48,345][236218] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 8437760. Throughput: 0: 10493.5. Samples: 8436676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:24:48,345][236218] Avg episode reward: [(0, '498.210')] |
| [2023-07-16 21:24:48,346][236459] Saving new best policy, reward=498.210! |
| [2023-07-16 21:24:52,123][236503] Updated weights for policy 0, policy_version 16560 (0.0005) |
| [2023-07-16 21:24:53,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 8491008. Throughput: 0: 10480.2. Samples: 8467232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:24:53,345][236218] Avg episode reward: [(0, '475.365')] |
| [2023-07-16 21:24:56,071][236503] Updated weights for policy 0, policy_version 16640 (0.0005) |
| [2023-07-16 21:24:58,345][236218] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 8540160. Throughput: 0: 10442.2. Samples: 8530000. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-16 21:24:58,345][236218] Avg episode reward: [(0, '461.049')] |
| [2023-07-16 21:24:58,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000016680_8540160.pth... |
| [2023-07-16 21:24:58,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000016072_8228864.pth |
| [2023-07-16 21:24:59,992][236503] Updated weights for policy 0, policy_version 16720 (0.0005) |
| [2023-07-16 21:25:03,345][236218] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 8593408. Throughput: 0: 10411.6. Samples: 8592444. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-16 21:25:03,346][236218] Avg episode reward: [(0, '460.292')] |
| [2023-07-16 21:25:03,953][236503] Updated weights for policy 0, policy_version 16800 (0.0005) |
| [2023-07-16 21:25:07,963][236503] Updated weights for policy 0, policy_version 16880 (0.0005) |
| [2023-07-16 21:25:08,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 8642560. Throughput: 0: 10381.1. Samples: 8622348. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-16 21:25:08,345][236218] Avg episode reward: [(0, '472.691')] |
| [2023-07-16 21:25:11,920][236503] Updated weights for policy 0, policy_version 16960 (0.0005) |
| [2023-07-16 21:25:13,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 8695808. Throughput: 0: 10379.8. Samples: 8684876. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-16 21:25:13,345][236218] Avg episode reward: [(0, '484.159')] |
| [2023-07-16 21:25:13,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000016984_8695808.pth... |
| [2023-07-16 21:25:13,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000016376_8384512.pth |
| [2023-07-16 21:25:15,853][236503] Updated weights for policy 0, policy_version 17040 (0.0005) |
| [2023-07-16 21:25:18,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10469.1). Total num frames: 8749056. Throughput: 0: 10365.5. Samples: 8747076. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-16 21:25:18,346][236218] Avg episode reward: [(0, '484.186')] |
| [2023-07-16 21:25:19,818][236503] Updated weights for policy 0, policy_version 17120 (0.0005) |
| [2023-07-16 21:25:23,345][236218] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10469.1). Total num frames: 8798208. Throughput: 0: 10356.1. Samples: 8777800. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-16 21:25:23,345][236218] Avg episode reward: [(0, '452.661')] |
| [2023-07-16 21:25:23,784][236503] Updated weights for policy 0, policy_version 17200 (0.0005) |
| [2023-07-16 21:25:27,668][236503] Updated weights for policy 0, policy_version 17280 (0.0005) |
| [2023-07-16 21:25:28,345][236218] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10469.1). Total num frames: 8851456. Throughput: 0: 10374.9. Samples: 8840720. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-16 21:25:28,360][236218] Avg episode reward: [(0, '474.451')] |
| [2023-07-16 21:25:28,363][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000017288_8851456.pth... |
| [2023-07-16 21:25:28,366][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000016680_8540160.pth |
| [2023-07-16 21:25:31,531][236503] Updated weights for policy 0, policy_version 17360 (0.0005) |
| [2023-07-16 21:25:33,345][236218] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10469.1). Total num frames: 8904704. Throughput: 0: 10400.8. Samples: 8904712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:25:33,346][236218] Avg episode reward: [(0, '488.374')] |
| [2023-07-16 21:25:35,416][236503] Updated weights for policy 0, policy_version 17440 (0.0005) |
| [2023-07-16 21:25:38,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10376.6, 300 sec: 10469.1). Total num frames: 8957952. Throughput: 0: 10417.4. Samples: 8936016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:25:38,345][236218] Avg episode reward: [(0, '462.400')] |
| [2023-07-16 21:25:39,333][236503] Updated weights for policy 0, policy_version 17520 (0.0005) |
| [2023-07-16 21:25:43,199][236503] Updated weights for policy 0, policy_version 17600 (0.0005) |
| [2023-07-16 21:25:43,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 9011200. Throughput: 0: 10421.7. Samples: 8998976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:25:43,345][236218] Avg episode reward: [(0, '451.940')] |
| [2023-07-16 21:25:43,349][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000017600_9011200.pth... |
| [2023-07-16 21:25:43,352][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000016984_8695808.pth |
| [2023-07-16 21:25:47,053][236503] Updated weights for policy 0, policy_version 17680 (0.0005) |
| [2023-07-16 21:25:48,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 9064448. Throughput: 0: 10448.6. Samples: 9062628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:25:48,345][236218] Avg episode reward: [(0, '456.482')] |
| [2023-07-16 21:25:50,981][236503] Updated weights for policy 0, policy_version 17760 (0.0005) |
| [2023-07-16 21:25:53,345][236218] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10455.2). Total num frames: 9113600. Throughput: 0: 10474.0. Samples: 9093676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:25:53,345][236218] Avg episode reward: [(0, '492.713')] |
| [2023-07-16 21:25:54,945][236503] Updated weights for policy 0, policy_version 17840 (0.0005) |
| [2023-07-16 21:25:58,345][236218] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 9166848. Throughput: 0: 10470.0. Samples: 9156024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:25:58,345][236218] Avg episode reward: [(0, '467.744')] |
| [2023-07-16 21:25:58,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000017904_9166848.pth... |
| [2023-07-16 21:25:58,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000017288_8851456.pth |
| [2023-07-16 21:25:58,849][236503] Updated weights for policy 0, policy_version 17920 (0.0005) |
| [2023-07-16 21:26:02,768][236503] Updated weights for policy 0, policy_version 18000 (0.0005) |
| [2023-07-16 21:26:03,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 9220096. Throughput: 0: 10492.6. Samples: 9219244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:26:03,345][236218] Avg episode reward: [(0, '481.240')] |
| [2023-07-16 21:26:06,651][236503] Updated weights for policy 0, policy_version 18080 (0.0005) |
| [2023-07-16 21:26:08,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10455.2). Total num frames: 9273344. Throughput: 0: 10503.6. Samples: 9250464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:26:08,346][236218] Avg episode reward: [(0, '473.945')] |
| [2023-07-16 21:26:10,575][236503] Updated weights for policy 0, policy_version 18160 (0.0005) |
| [2023-07-16 21:26:13,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 9326592. Throughput: 0: 10519.4. Samples: 9314092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:26:13,345][236218] Avg episode reward: [(0, '472.976')] |
| [2023-07-16 21:26:13,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000018216_9326592.pth... |
| [2023-07-16 21:26:13,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000017600_9011200.pth |
| [2023-07-16 21:26:14,431][236503] Updated weights for policy 0, policy_version 18240 (0.0006) |
| [2023-07-16 21:26:18,311][236503] Updated weights for policy 0, policy_version 18320 (0.0005) |
| [2023-07-16 21:26:18,345][236218] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 9379840. Throughput: 0: 10493.6. Samples: 9376924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:26:18,345][236218] Avg episode reward: [(0, '458.604')] |
| [2023-07-16 21:26:22,040][236503] Updated weights for policy 0, policy_version 18400 (0.0004) |
| [2023-07-16 21:26:23,345][236218] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10469.1). Total num frames: 9433088. Throughput: 0: 10523.8. Samples: 9409588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:26:23,346][236218] Avg episode reward: [(0, '487.591')] |
| [2023-07-16 21:26:25,690][236503] Updated weights for policy 0, policy_version 18480 (0.0004) |
| [2023-07-16 21:26:28,345][236218] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10483.0). Total num frames: 9490432. Throughput: 0: 10610.0. Samples: 9476424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:26:28,345][236218] Avg episode reward: [(0, '468.132')] |
| [2023-07-16 21:26:28,349][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000018536_9490432.pth... |
| [2023-07-16 21:26:28,353][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000017904_9166848.pth |
| [2023-07-16 21:26:29,386][236503] Updated weights for policy 0, policy_version 18560 (0.0004) |
| [2023-07-16 21:26:33,154][236503] Updated weights for policy 0, policy_version 18640 (0.0005) |
| [2023-07-16 21:26:33,345][236218] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10483.0). Total num frames: 9543680. Throughput: 0: 10663.5. Samples: 9542484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:26:33,345][236218] Avg episode reward: [(0, '501.164')] |
| [2023-07-16 21:26:33,346][236459] Saving new best policy, reward=501.164! |
| [2023-07-16 21:26:36,759][236503] Updated weights for policy 0, policy_version 18720 (0.0004) |
| [2023-07-16 21:26:38,345][236218] Fps is (10 sec: 11059.3, 60 sec: 10717.9, 300 sec: 10496.9). Total num frames: 9601024. Throughput: 0: 10729.7. Samples: 9576512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:26:38,345][236218] Avg episode reward: [(0, '500.825')] |
| [2023-07-16 21:26:40,409][236503] Updated weights for policy 0, policy_version 18800 (0.0004) |
| [2023-07-16 21:26:43,345][236218] Fps is (10 sec: 11468.8, 60 sec: 10786.1, 300 sec: 10524.6). Total num frames: 9658368. Throughput: 0: 10839.8. Samples: 9643816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:26:43,345][236218] Avg episode reward: [(0, '475.058')] |
| [2023-07-16 21:26:43,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000018864_9658368.pth... |
| [2023-07-16 21:26:43,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000018216_9326592.pth |
| [2023-07-16 21:26:44,067][236503] Updated weights for policy 0, policy_version 18880 (0.0004) |
| [2023-07-16 21:26:47,755][236503] Updated weights for policy 0, policy_version 18960 (0.0005) |
| [2023-07-16 21:26:48,345][236218] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10524.6). Total num frames: 9711616. Throughput: 0: 10929.7. Samples: 9711080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:26:48,345][236218] Avg episode reward: [(0, '460.733')] |
| [2023-07-16 21:26:51,379][236503] Updated weights for policy 0, policy_version 19040 (0.0005) |
| [2023-07-16 21:26:53,345][236218] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 10538.5). Total num frames: 9768960. Throughput: 0: 10978.4. Samples: 9744492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:26:53,345][236218] Avg episode reward: [(0, '460.987')] |
| [2023-07-16 21:26:55,060][236503] Updated weights for policy 0, policy_version 19120 (0.0005) |
| [2023-07-16 21:26:58,345][236218] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 10552.4). Total num frames: 9822208. Throughput: 0: 11050.2. Samples: 9811352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-16 21:26:58,345][236218] Avg episode reward: [(0, '476.742')] |
| [2023-07-16 21:26:58,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000019184_9822208.pth... |
| [2023-07-16 21:26:58,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000018536_9490432.pth |
| [2023-07-16 21:26:58,768][236503] Updated weights for policy 0, policy_version 19200 (0.0005) |
| [2023-07-16 21:27:02,430][236503] Updated weights for policy 0, policy_version 19280 (0.0004) |
| [2023-07-16 21:27:03,345][236218] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10566.3). Total num frames: 9879552. Throughput: 0: 11145.6. Samples: 9878476. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-16 21:27:03,345][236218] Avg episode reward: [(0, '489.723')] |
| [2023-07-16 21:27:06,127][236503] Updated weights for policy 0, policy_version 19360 (0.0005) |
| [2023-07-16 21:27:08,345][236218] Fps is (10 sec: 11468.9, 60 sec: 11059.2, 300 sec: 10580.2). Total num frames: 9936896. Throughput: 0: 11160.9. Samples: 9911828. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-16 21:27:08,345][236218] Avg episode reward: [(0, '467.622')] |
| [2023-07-16 21:27:09,800][236503] Updated weights for policy 0, policy_version 19440 (0.0005) |
| [2023-07-16 21:27:13,345][236218] Fps is (10 sec: 11059.1, 60 sec: 11059.2, 300 sec: 10580.2). Total num frames: 9990144. Throughput: 0: 11144.4. Samples: 9977920. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-16 21:27:13,345][236218] Avg episode reward: [(0, '494.151')] |
| [2023-07-16 21:27:13,348][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000019512_9990144.pth... |
| [2023-07-16 21:27:13,351][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000018864_9658368.pth |
| [2023-07-16 21:27:13,562][236503] Updated weights for policy 0, policy_version 19520 (0.0005) |
| [2023-07-16 21:27:14,658][236459] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 |
| [2023-07-16 21:27:14,659][236588] Stopping RolloutWorker_w6... |
| [2023-07-16 21:27:14,659][236504] Stopping RolloutWorker_w1... |
| [2023-07-16 21:27:14,659][236508] Stopping RolloutWorker_w4... |
| [2023-07-16 21:27:14,659][236505] Stopping RolloutWorker_w0... |
| [2023-07-16 21:27:14,659][236506] Stopping RolloutWorker_w2... |
| [2023-07-16 21:27:14,659][236509] Stopping RolloutWorker_w5... |
| [2023-07-16 21:27:14,659][236588] Loop rollout_proc6_evt_loop terminating... |
| [2023-07-16 21:27:14,659][236504] Loop rollout_proc1_evt_loop terminating... |
| [2023-07-16 21:27:14,659][236558] Stopping RolloutWorker_w7... |
| [2023-07-16 21:27:14,659][236508] Loop rollout_proc4_evt_loop terminating... |
| [2023-07-16 21:27:14,659][236505] Loop rollout_proc0_evt_loop terminating... |
| [2023-07-16 21:27:14,659][236506] Loop rollout_proc2_evt_loop terminating... |
| [2023-07-16 21:27:14,659][236509] Loop rollout_proc5_evt_loop terminating... |
| [2023-07-16 21:27:14,659][236507] Stopping RolloutWorker_w3... |
| [2023-07-16 21:27:14,659][236558] Loop rollout_proc7_evt_loop terminating... |
| [2023-07-16 21:27:14,659][236459] Stopping Batcher_0... |
| [2023-07-16 21:27:14,659][236507] Loop rollout_proc3_evt_loop terminating... |
| [2023-07-16 21:27:14,659][236218] Component RolloutWorker_w6 stopped! |
| [2023-07-16 21:27:14,659][236459] Loop batcher_evt_loop terminating... |
| [2023-07-16 21:27:14,659][236218] Component RolloutWorker_w4 stopped! |
| [2023-07-16 21:27:14,660][236218] Component RolloutWorker_w1 stopped! |
| [2023-07-16 21:27:14,660][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000019544_10006528.pth... |
| [2023-07-16 21:27:14,660][236218] Component RolloutWorker_w0 stopped! |
| [2023-07-16 21:27:14,660][236218] Component RolloutWorker_w2 stopped! |
| [2023-07-16 21:27:14,661][236218] Component RolloutWorker_w5 stopped! |
| [2023-07-16 21:27:14,661][236218] Component RolloutWorker_w7 stopped! |
| [2023-07-16 21:27:14,661][236218] Component RolloutWorker_w3 stopped! |
| [2023-07-16 21:27:14,661][236218] Component Batcher_0 stopped! |
| [2023-07-16 21:27:14,662][236459] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000019184_9822208.pth |
| [2023-07-16 21:27:14,663][236459] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-push-v2/checkpoint_p0/checkpoint_000019544_10006528.pth... |
| [2023-07-16 21:27:14,665][236459] Stopping LearnerWorker_p0... |
| [2023-07-16 21:27:14,665][236459] Loop learner_proc0_evt_loop terminating... |
| [2023-07-16 21:27:14,665][236218] Component LearnerWorker_p0 stopped! |
| [2023-07-16 21:27:14,726][236503] Weights refcount: 2 0 |
| [2023-07-16 21:27:14,727][236503] Stopping InferenceWorker_p0-w0... |
| [2023-07-16 21:27:14,727][236503] Loop inference_proc0-0_evt_loop terminating... |
| [2023-07-16 21:27:14,727][236218] Component InferenceWorker_p0-w0 stopped! |
| [2023-07-16 21:27:14,728][236218] Waiting for process learner_proc0 to stop... |
| [2023-07-16 21:27:15,258][236218] Waiting for process inference_proc0-0 to join... |
| [2023-07-16 21:27:15,270][236218] Waiting for process rollout_proc0 to join... |
| [2023-07-16 21:27:15,270][236218] Waiting for process rollout_proc1 to join... |
| [2023-07-16 21:27:15,271][236218] Waiting for process rollout_proc2 to join... |
| [2023-07-16 21:27:15,271][236218] Waiting for process rollout_proc3 to join... |
| [2023-07-16 21:27:15,271][236218] Waiting for process rollout_proc4 to join... |
| [2023-07-16 21:27:15,271][236218] Waiting for process rollout_proc5 to join... |
| [2023-07-16 21:27:15,271][236218] Waiting for process rollout_proc6 to join... |
| [2023-07-16 21:27:15,271][236218] Waiting for process rollout_proc7 to join... |
| [2023-07-16 21:27:15,272][236218] Batcher 0 profile tree view: |
| batching: 1.8805, releasing_batches: 1.6861 |
| [2023-07-16 21:27:15,272][236218] InferenceWorker_p0-w0 profile tree view: |
| wait_policy: 0.0051 |
| wait_policy_total: 346.5286 |
| update_model: 11.7488 |
| weight_update: 0.0004 |
| one_step: 0.0007 |
| handle_policy_step: 520.8934 |
| deserialize: 21.4553, stack: 5.5601, obs_to_device_normalize: 93.9153, forward: 257.3785, send_messages: 38.1053 |
| prepare_outputs: 59.7044 |
| to_cpu: 9.3519 |
| [2023-07-16 21:27:15,272][236218] Learner 0 profile tree view: |
| misc: 0.0102, prepare_batch: 10.9585 |
| train: 113.5252 |
| epoch_init: 0.0425, minibatch_init: 1.5614, losses_postprocess: 1.4950, kl_divergence: 0.5129, after_optimizer: 0.7143 |
| calculate_losses: 48.6036 |
| losses_init: 0.0369, forward_head: 19.2476, bptt_initial: 0.1617, bptt: 0.1479, tail: 13.6297, advantages_returns: 1.0375, losses: 12.6554 |
| update: 58.7304 |
| clip: 6.9427 |
| [2023-07-16 21:27:15,272][236218] RolloutWorker_w0 profile tree view: |
| wait_for_trajectories: 0.2998, enqueue_policy_requests: 12.6264, env_step: 662.8779, overhead: 19.8516, complete_rollouts: 0.3133 |
| save_policy_outputs: 38.8166 |
| split_output_tensors: 13.3700 |
| [2023-07-16 21:27:15,272][236218] RolloutWorker_w7 profile tree view: |
| wait_for_trajectories: 0.2744, enqueue_policy_requests: 12.6005, env_step: 660.2719, overhead: 19.5688, complete_rollouts: 0.3217 |
| save_policy_outputs: 38.6968 |
| split_output_tensors: 13.6183 |
| [2023-07-16 21:27:15,272][236218] Loop Runner_EvtLoop terminating... |
| [2023-07-16 21:27:15,273][236218] Runner profile tree view: |
| main_loop: 944.9228 |
| [2023-07-16 21:27:15,273][236218] Collected {0: 10006528}, FPS: 10589.8 |
|
|
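
The final `FPS: 10589.8` figure appears consistent with dividing the collected frame count by the Runner's `main_loop` time, both taken from the last records of the log. The sketch below is a sanity check under that assumption, not a description of how the training framework computes the value internally. The variable names are illustrative, and the two strings are copied from the log records above.

```python
import re

# Two records copied from the end of the log above.
summary_line = "[2023-07-16 21:27:15,273][236218] Collected {0: 10006528}, FPS: 10589.8"
main_loop_line = "main_loop: 944.9228"

# Extract the total frames for policy 0, the reported FPS, and the main loop duration.
frames = int(re.search(r"Collected \{0: (\d+)\}", summary_line).group(1))
reported_fps = float(re.search(r"FPS: ([\d.]+)", summary_line).group(1))
main_loop_s = float(re.search(r"main_loop: ([\d.]+)", main_loop_line).group(1))

# Assumption: overall FPS = total frames / wall-clock seconds spent in the main loop.
computed_fps = frames / main_loop_s

print(f"frames={frames}, main_loop={main_loop_s}s, computed FPS={computed_fps:.1f}")
print(f"difference from reported FPS: {abs(computed_fps - reported_fps):.3f}")
```

The same two-regex approach can be pointed at the periodic `Fps is (...)` records if a per-interval throughput curve is wanted. Those records report sliding-window averages, so they are not expected to match the overall figure exactly.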