diff --git "a/sf_log.txt" "b/sf_log.txt" new file mode 100644--- /dev/null +++ "b/sf_log.txt" @@ -0,0 +1,8111 @@ +[2023-03-11 18:07:36,357][65744] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/config.json... +[2023-03-11 18:07:36,383][65744] Rollout worker 0 uses device cpu +[2023-03-11 18:07:36,383][65744] Rollout worker 1 uses device cpu +[2023-03-11 18:07:36,384][65744] Rollout worker 2 uses device cpu +[2023-03-11 18:07:36,384][65744] Rollout worker 3 uses device cpu +[2023-03-11 18:07:36,384][65744] Rollout worker 4 uses device cpu +[2023-03-11 18:07:36,384][65744] Rollout worker 5 uses device cpu +[2023-03-11 18:07:36,384][65744] Rollout worker 6 uses device cpu +[2023-03-11 18:07:36,384][65744] Rollout worker 7 uses device cpu +[2023-03-11 18:07:36,384][65744] In synchronous mode, we only accumulate one batch. Setting num_batches_to_accumulate to 1 +[2023-03-11 18:07:36,395][65744] InferenceWorker_p0-w0: min num requests: 2 +[2023-03-11 18:07:36,410][65744] Starting all processes... +[2023-03-11 18:07:36,411][65744] Starting process learner_proc0 +[2023-03-11 18:07:36,461][65744] Starting all processes... +[2023-03-11 18:07:36,502][65744] Starting process inference_proc0-0 +[2023-03-11 18:07:36,509][65744] Starting process rollout_proc0 +[2023-03-11 18:07:36,510][65744] Starting process rollout_proc1 +[2023-03-11 18:07:36,510][65744] Starting process rollout_proc2 +[2023-03-11 18:07:36,510][65744] Starting process rollout_proc3 +[2023-03-11 18:07:36,510][65744] Starting process rollout_proc4 +[2023-03-11 18:07:36,510][65744] Starting process rollout_proc5 +[2023-03-11 18:07:36,510][65744] Starting process rollout_proc6 +[2023-03-11 18:07:36,510][65744] Starting process rollout_proc7 +[2023-03-11 18:07:37,941][66034] Worker 2 uses CPU cores [8, 9, 10, 11] +[2023-03-11 18:07:37,981][65987] Starting seed is not provided +[2023-03-11 18:07:37,981][65987] Initializing actor-critic model on device cpu +[2023-03-11 18:07:37,981][65987] RunningMeanStd input shape: (39,) +[2023-03-11 18:07:37,982][65987] RunningMeanStd input shape: (1,) +[2023-03-11 18:07:38,046][65987] Created Actor Critic model with architecture: +[2023-03-11 18:07:38,046][65987] ActorCriticSharedWeights( + (obs_normalizer): ObservationNormalizer( + (running_mean_std): RunningMeanStdDictInPlace( + (running_mean_std): ModuleDict( + (obs): RunningMeanStdInPlace() + ) + ) + ) + (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) + (encoder): MultiInputEncoder( + (encoders): ModuleDict( + (obs): MlpEncoder( + (mlp_head): RecursiveScriptModule( + original_name=Sequential + (0): RecursiveScriptModule(original_name=Linear) + (1): RecursiveScriptModule(original_name=Tanh) + (2): RecursiveScriptModule(original_name=Linear) + (3): RecursiveScriptModule(original_name=Tanh) + ) + ) + ) + ) + (core): ModelCoreIdentity() + (decoder): MlpDecoder( + (mlp): Identity() + ) + (critic_linear): Linear(in_features=64, out_features=1, bias=True) + (action_parameterization): ActionParameterizationContinuousNonAdaptiveStddev( + (distribution_linear): Linear(in_features=64, out_features=4, bias=True) + ) +) +[2023-03-11 18:07:38,058][66037] Worker 4 uses CPU cores [16, 17, 18, 19] +[2023-03-11 18:07:38,123][66036] Worker 5 uses CPU cores [20, 21, 22, 23] +[2023-03-11 18:07:38,182][66033] Worker 0 uses CPU cores [0, 1, 2, 3] +[2023-03-11 18:07:38,351][65987] Using optimizer +[2023-03-11 18:07:38,352][65987] No checkpoints found +[2023-03-11 18:07:38,352][65987] Did not load from checkpoint, starting from scratch! +[2023-03-11 18:07:38,353][65987] Initialized policy 0 weights for model version 0 +[2023-03-11 18:07:38,354][65987] LearnerWorker_p0 finished initialization! +[2023-03-11 18:07:38,355][66031] RunningMeanStd input shape: (39,) +[2023-03-11 18:07:38,355][66031] RunningMeanStd input shape: (1,) +[2023-03-11 18:07:38,411][65744] Inference worker 0-0 is ready! +[2023-03-11 18:07:38,412][65744] All inference workers are ready! Signal rollout workers to start! +[2023-03-11 18:07:38,458][66101] Worker 7 uses CPU cores [28, 29, 30, 31] +[2023-03-11 18:07:38,498][66032] Worker 1 uses CPU cores [4, 5, 6, 7] +[2023-03-11 18:07:38,655][66035] Worker 3 uses CPU cores [12, 13, 14, 15] +[2023-03-11 18:07:38,716][66038] Worker 6 uses CPU cores [24, 25, 26, 27] +[2023-03-11 18:07:39,012][65744] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) +[2023-03-11 18:07:43,155][66033] Decorrelating experience for 0 frames... +[2023-03-11 18:07:43,160][66036] Decorrelating experience for 0 frames... +[2023-03-11 18:07:43,167][66037] Decorrelating experience for 0 frames... +[2023-03-11 18:07:43,172][66033] Decorrelating experience for 64 frames... +[2023-03-11 18:07:43,177][66036] Decorrelating experience for 64 frames... +[2023-03-11 18:07:43,185][66037] Decorrelating experience for 64 frames... +[2023-03-11 18:07:43,195][66034] Decorrelating experience for 0 frames... +[2023-03-11 18:07:43,212][66034] Decorrelating experience for 64 frames... +[2023-03-11 18:07:43,221][66033] Decorrelating experience for 128 frames... +[2023-03-11 18:07:43,226][66036] Decorrelating experience for 128 frames... +[2023-03-11 18:07:43,234][66037] Decorrelating experience for 128 frames... +[2023-03-11 18:07:43,262][66034] Decorrelating experience for 128 frames... +[2023-03-11 18:07:43,267][66101] Decorrelating experience for 0 frames... +[2023-03-11 18:07:43,275][66032] Decorrelating experience for 0 frames... +[2023-03-11 18:07:43,284][66101] Decorrelating experience for 64 frames... +[2023-03-11 18:07:43,292][66032] Decorrelating experience for 64 frames... +[2023-03-11 18:07:43,300][66033] Decorrelating experience for 192 frames... +[2023-03-11 18:07:43,306][66036] Decorrelating experience for 192 frames... +[2023-03-11 18:07:43,314][66037] Decorrelating experience for 192 frames... +[2023-03-11 18:07:43,333][66101] Decorrelating experience for 128 frames... +[2023-03-11 18:07:43,341][66032] Decorrelating experience for 128 frames... +[2023-03-11 18:07:43,342][66034] Decorrelating experience for 192 frames... +[2023-03-11 18:07:43,414][66101] Decorrelating experience for 192 frames... +[2023-03-11 18:07:43,418][66035] Decorrelating experience for 0 frames... +[2023-03-11 18:07:43,419][66032] Decorrelating experience for 192 frames... +[2023-03-11 18:07:43,435][66035] Decorrelating experience for 64 frames... +[2023-03-11 18:07:43,483][66038] Decorrelating experience for 0 frames... +[2023-03-11 18:07:43,484][66035] Decorrelating experience for 128 frames... +[2023-03-11 18:07:43,501][66038] Decorrelating experience for 64 frames... +[2023-03-11 18:07:43,550][66038] Decorrelating experience for 128 frames... +[2023-03-11 18:07:43,562][66035] Decorrelating experience for 192 frames... +[2023-03-11 18:07:43,630][66038] Decorrelating experience for 192 frames... +[2023-03-11 18:07:44,012][65744] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) +[2023-03-11 18:07:48,076][66036] Decorrelating experience for 256 frames... +[2023-03-11 18:07:48,078][66033] Decorrelating experience for 256 frames... +[2023-03-11 18:07:48,083][66037] Decorrelating experience for 256 frames... +[2023-03-11 18:07:48,117][66034] Decorrelating experience for 256 frames... +[2023-03-11 18:07:48,190][66032] Decorrelating experience for 256 frames... +[2023-03-11 18:07:48,196][66101] Decorrelating experience for 256 frames... +[2023-03-11 18:07:48,217][66036] Decorrelating experience for 320 frames... +[2023-03-11 18:07:48,228][66037] Decorrelating experience for 320 frames... +[2023-03-11 18:07:48,231][66033] Decorrelating experience for 320 frames... +[2023-03-11 18:07:48,261][66034] Decorrelating experience for 320 frames... +[2023-03-11 18:07:48,314][66035] Decorrelating experience for 256 frames... +[2023-03-11 18:07:48,329][66032] Decorrelating experience for 320 frames... +[2023-03-11 18:07:48,337][66101] Decorrelating experience for 320 frames... +[2023-03-11 18:07:48,389][66036] Decorrelating experience for 384 frames... +[2023-03-11 18:07:48,399][66038] Decorrelating experience for 256 frames... +[2023-03-11 18:07:48,399][66037] Decorrelating experience for 384 frames... +[2023-03-11 18:07:48,403][66033] Decorrelating experience for 384 frames... +[2023-03-11 18:07:48,450][66034] Decorrelating experience for 384 frames... +[2023-03-11 18:07:48,454][66035] Decorrelating experience for 320 frames... +[2023-03-11 18:07:48,500][66032] Decorrelating experience for 384 frames... +[2023-03-11 18:07:48,509][66101] Decorrelating experience for 384 frames... +[2023-03-11 18:07:48,540][66038] Decorrelating experience for 320 frames... +[2023-03-11 18:07:48,594][66036] Decorrelating experience for 448 frames... +[2023-03-11 18:07:48,601][66037] Decorrelating experience for 448 frames... +[2023-03-11 18:07:48,603][66033] Decorrelating experience for 448 frames... +[2023-03-11 18:07:48,622][66035] Decorrelating experience for 384 frames... +[2023-03-11 18:07:48,654][66034] Decorrelating experience for 448 frames... +[2023-03-11 18:07:48,708][66032] Decorrelating experience for 448 frames... +[2023-03-11 18:07:48,713][66101] Decorrelating experience for 448 frames... +[2023-03-11 18:07:48,713][66038] Decorrelating experience for 384 frames... +[2023-03-11 18:07:48,820][66035] Decorrelating experience for 448 frames... +[2023-03-11 18:07:48,916][66038] Decorrelating experience for 448 frames... +[2023-03-11 18:07:49,012][65744] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) +[2023-03-11 18:07:49,013][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000000000_0.pth... +[2023-03-11 18:07:53,360][66031] Updated weights for policy 0, policy_version 80 (0.0005) +[2023-03-11 18:07:54,012][65744] Fps is (10 sec: 4505.6, 60 sec: 3003.7, 300 sec: 3003.7). Total num frames: 45056. Throughput: 0: 2782.9. Samples: 41744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:07:54,012][65744] Avg episode reward: [(0, '9.414')] +[2023-03-11 18:07:56,391][65744] Heartbeat connected on Batcher_0 +[2023-03-11 18:07:56,397][65744] Heartbeat connected on RolloutWorker_w0 +[2023-03-11 18:07:56,399][65744] Heartbeat connected on RolloutWorker_w1 +[2023-03-11 18:07:56,401][65744] Heartbeat connected on RolloutWorker_w2 +[2023-03-11 18:07:56,403][65744] Heartbeat connected on RolloutWorker_w3 +[2023-03-11 18:07:56,405][65744] Heartbeat connected on RolloutWorker_w4 +[2023-03-11 18:07:56,406][65744] Heartbeat connected on RolloutWorker_w5 +[2023-03-11 18:07:56,408][65744] Heartbeat connected on RolloutWorker_w6 +[2023-03-11 18:07:56,410][65744] Heartbeat connected on RolloutWorker_w7 +[2023-03-11 18:07:56,410][65744] Heartbeat connected on LearnerWorker_p0 +[2023-03-11 18:07:56,414][65744] Heartbeat connected on InferenceWorker_p0-w0 +[2023-03-11 18:07:57,166][66031] Updated weights for policy 0, policy_version 160 (0.0005) +[2023-03-11 18:07:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 4915.2, 300 sec: 4915.2). Total num frames: 98304. Throughput: 0: 3690.0. Samples: 73800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:07:59,012][65744] Avg episode reward: [(0, '27.175')] +[2023-03-11 18:08:01,065][66031] Updated weights for policy 0, policy_version 240 (0.0005) +[2023-03-11 18:08:04,012][65744] Fps is (10 sec: 10649.5, 60 sec: 6062.1, 300 sec: 6062.1). Total num frames: 151552. Throughput: 0: 5484.6. Samples: 137116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:08:04,012][65744] Avg episode reward: [(0, '70.748')] +[2023-03-11 18:08:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000000296_151552.pth... +[2023-03-11 18:08:04,019][65987] Saving new best policy, reward=70.748! +[2023-03-11 18:08:05,082][66031] Updated weights for policy 0, policy_version 320 (0.0004) +[2023-03-11 18:08:09,012][65744] Fps is (10 sec: 10239.9, 60 sec: 6690.1, 300 sec: 6690.1). Total num frames: 200704. Throughput: 0: 6656.8. Samples: 199704. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:08:09,012][65744] Avg episode reward: [(0, '102.876')] +[2023-03-11 18:08:09,013][65987] Saving new best policy, reward=102.876! +[2023-03-11 18:08:09,013][66031] Updated weights for policy 0, policy_version 400 (0.0005) +[2023-03-11 18:08:12,857][66031] Updated weights for policy 0, policy_version 480 (0.0005) +[2023-03-11 18:08:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 7255.8, 300 sec: 7255.8). Total num frames: 253952. Throughput: 0: 6592.6. Samples: 230740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:08:14,012][65744] Avg episode reward: [(0, '145.669')] +[2023-03-11 18:08:14,039][65987] Saving new best policy, reward=145.669! +[2023-03-11 18:08:16,792][66031] Updated weights for policy 0, policy_version 560 (0.0005) +[2023-03-11 18:08:19,012][65744] Fps is (10 sec: 10649.7, 60 sec: 7680.0, 300 sec: 7680.0). Total num frames: 307200. Throughput: 0: 7343.6. Samples: 293744. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:08:19,012][65744] Avg episode reward: [(0, '160.917')] +[2023-03-11 18:08:19,068][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000000608_311296.pth... +[2023-03-11 18:08:19,070][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000000000_0.pth +[2023-03-11 18:08:19,070][65987] Saving new best policy, reward=160.917! +[2023-03-11 18:08:20,636][66031] Updated weights for policy 0, policy_version 640 (0.0005) +[2023-03-11 18:08:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 8010.0, 300 sec: 8010.0). Total num frames: 360448. Throughput: 0: 7933.2. Samples: 356992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:08:24,012][65744] Avg episode reward: [(0, '193.557')] +[2023-03-11 18:08:24,013][65987] Saving new best policy, reward=193.557! +[2023-03-11 18:08:24,476][66031] Updated weights for policy 0, policy_version 720 (0.0005) +[2023-03-11 18:08:28,152][66031] Updated weights for policy 0, policy_version 800 (0.0004) +[2023-03-11 18:08:29,012][65744] Fps is (10 sec: 11059.2, 60 sec: 8355.8, 300 sec: 8355.8). Total num frames: 417792. Throughput: 0: 8685.3. Samples: 390836. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:08:29,012][65744] Avg episode reward: [(0, '194.933')] +[2023-03-11 18:08:29,013][65987] Saving new best policy, reward=194.933! +[2023-03-11 18:08:31,755][66031] Updated weights for policy 0, policy_version 880 (0.0003) +[2023-03-11 18:08:34,012][65744] Fps is (10 sec: 11468.7, 60 sec: 8638.8, 300 sec: 8638.8). Total num frames: 475136. Throughput: 0: 10194.5. Samples: 458752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:08:34,012][65744] Avg episode reward: [(0, '129.479')] +[2023-03-11 18:08:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000000928_475136.pth... +[2023-03-11 18:08:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000000296_151552.pth +[2023-03-11 18:08:35,415][66031] Updated weights for policy 0, policy_version 960 (0.0004) +[2023-03-11 18:08:38,941][66031] Updated weights for policy 0, policy_version 1040 (0.0004) +[2023-03-11 18:08:39,012][65744] Fps is (10 sec: 11468.7, 60 sec: 8874.7, 300 sec: 8874.7). Total num frames: 532480. Throughput: 0: 10774.8. Samples: 526612. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:08:39,012][65744] Avg episode reward: [(0, '199.391')] +[2023-03-11 18:08:39,013][65987] Saving new best policy, reward=199.391! +[2023-03-11 18:08:42,544][66031] Updated weights for policy 0, policy_version 1120 (0.0004) +[2023-03-11 18:08:44,012][65744] Fps is (10 sec: 11059.2, 60 sec: 9762.1, 300 sec: 9011.2). Total num frames: 585728. Throughput: 0: 10830.0. Samples: 561152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:08:44,012][65744] Avg episode reward: [(0, '201.687')] +[2023-03-11 18:08:44,016][65987] Saving new best policy, reward=201.687! +[2023-03-11 18:08:46,222][66031] Updated weights for policy 0, policy_version 1200 (0.0004) +[2023-03-11 18:08:49,012][65744] Fps is (10 sec: 11059.2, 60 sec: 10717.8, 300 sec: 9186.7). Total num frames: 643072. Throughput: 0: 10906.7. Samples: 627916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:08:49,012][65744] Avg episode reward: [(0, '222.059')] +[2023-03-11 18:08:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000001256_643072.pth... +[2023-03-11 18:08:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000000608_311296.pth +[2023-03-11 18:08:49,018][65987] Saving new best policy, reward=222.059! +[2023-03-11 18:08:49,799][66031] Updated weights for policy 0, policy_version 1280 (0.0004) +[2023-03-11 18:08:53,348][66031] Updated weights for policy 0, policy_version 1360 (0.0004) +[2023-03-11 18:08:54,012][65744] Fps is (10 sec: 11468.9, 60 sec: 10922.7, 300 sec: 9338.9). Total num frames: 700416. Throughput: 0: 11057.2. Samples: 697276. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:08:54,012][65744] Avg episode reward: [(0, '256.688')] +[2023-03-11 18:08:54,013][65987] Saving new best policy, reward=256.688! +[2023-03-11 18:08:56,926][66031] Updated weights for policy 0, policy_version 1440 (0.0004) +[2023-03-11 18:08:59,012][65744] Fps is (10 sec: 11468.9, 60 sec: 10990.9, 300 sec: 9472.0). Total num frames: 757760. Throughput: 0: 11129.9. Samples: 731584. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:08:59,012][65744] Avg episode reward: [(0, '274.846')] +[2023-03-11 18:08:59,013][65987] Saving new best policy, reward=274.846! +[2023-03-11 18:09:00,521][66031] Updated weights for policy 0, policy_version 1520 (0.0004) +[2023-03-11 18:09:04,012][65744] Fps is (10 sec: 11468.7, 60 sec: 11059.2, 300 sec: 9589.4). Total num frames: 815104. Throughput: 0: 11230.6. Samples: 799124. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:09:04,012][65744] Avg episode reward: [(0, '269.543')] +[2023-03-11 18:09:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000001592_815104.pth... +[2023-03-11 18:09:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000000928_475136.pth +[2023-03-11 18:09:04,162][66031] Updated weights for policy 0, policy_version 1600 (0.0004) +[2023-03-11 18:09:07,695][66031] Updated weights for policy 0, policy_version 1680 (0.0004) +[2023-03-11 18:09:09,012][65744] Fps is (10 sec: 11468.8, 60 sec: 11195.7, 300 sec: 9693.9). Total num frames: 872448. Throughput: 0: 11367.3. Samples: 868520. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:09:09,012][65744] Avg episode reward: [(0, '290.265')] +[2023-03-11 18:09:09,013][65987] Saving new best policy, reward=290.265! +[2023-03-11 18:09:11,252][66031] Updated weights for policy 0, policy_version 1760 (0.0004) +[2023-03-11 18:09:14,012][65744] Fps is (10 sec: 11468.9, 60 sec: 11264.0, 300 sec: 9787.3). Total num frames: 929792. Throughput: 0: 11392.0. Samples: 903476. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:09:14,012][65744] Avg episode reward: [(0, '307.267')] +[2023-03-11 18:09:14,013][65987] Saving new best policy, reward=307.267! +[2023-03-11 18:09:15,137][66031] Updated weights for policy 0, policy_version 1840 (0.0005) +[2023-03-11 18:09:18,996][66031] Updated weights for policy 0, policy_version 1920 (0.0005) +[2023-03-11 18:09:19,012][65744] Fps is (10 sec: 11059.2, 60 sec: 11264.0, 300 sec: 9830.4). Total num frames: 983040. Throughput: 0: 11288.2. Samples: 966720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:09:19,012][65744] Avg episode reward: [(0, '301.562')] +[2023-03-11 18:09:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000001920_983040.pth... +[2023-03-11 18:09:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000001256_643072.pth +[2023-03-11 18:09:22,746][66031] Updated weights for policy 0, policy_version 2000 (0.0005) +[2023-03-11 18:09:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 11264.0, 300 sec: 9869.4). Total num frames: 1036288. Throughput: 0: 11227.0. Samples: 1031828. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:09:24,012][65744] Avg episode reward: [(0, '310.664')] +[2023-03-11 18:09:24,013][65987] Saving new best policy, reward=310.664! +[2023-03-11 18:09:26,563][66031] Updated weights for policy 0, policy_version 2080 (0.0005) +[2023-03-11 18:09:29,012][65744] Fps is (10 sec: 10649.6, 60 sec: 11195.7, 300 sec: 9904.9). Total num frames: 1089536. Throughput: 0: 11174.9. Samples: 1064020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:09:29,012][65744] Avg episode reward: [(0, '325.440')] +[2023-03-11 18:09:29,013][65987] Saving new best policy, reward=325.440! +[2023-03-11 18:09:30,404][66031] Updated weights for policy 0, policy_version 2160 (0.0005) +[2023-03-11 18:09:34,012][65744] Fps is (10 sec: 10649.6, 60 sec: 11127.5, 300 sec: 9937.2). Total num frames: 1142784. Throughput: 0: 11107.8. Samples: 1127768. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:09:34,012][65744] Avg episode reward: [(0, '330.741')] +[2023-03-11 18:09:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000002232_1142784.pth... +[2023-03-11 18:09:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000001592_815104.pth +[2023-03-11 18:09:34,018][65987] Saving new best policy, reward=330.741! +[2023-03-11 18:09:34,200][66031] Updated weights for policy 0, policy_version 2240 (0.0005) +[2023-03-11 18:09:38,028][66031] Updated weights for policy 0, policy_version 2320 (0.0005) +[2023-03-11 18:09:39,012][65744] Fps is (10 sec: 10649.6, 60 sec: 11059.2, 300 sec: 9966.9). Total num frames: 1196032. Throughput: 0: 10994.0. Samples: 1192008. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:09:39,012][65744] Avg episode reward: [(0, '324.430')] +[2023-03-11 18:09:41,957][66031] Updated weights for policy 0, policy_version 2400 (0.0005) +[2023-03-11 18:09:44,012][65744] Fps is (10 sec: 10649.6, 60 sec: 11059.2, 300 sec: 9994.2). Total num frames: 1249280. Throughput: 0: 10937.2. Samples: 1223760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:09:44,012][65744] Avg episode reward: [(0, '304.488')] +[2023-03-11 18:09:45,735][66031] Updated weights for policy 0, policy_version 2480 (0.0005) +[2023-03-11 18:09:49,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10990.9, 300 sec: 10019.4). Total num frames: 1302528. Throughput: 0: 10852.5. Samples: 1287484. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:09:49,012][65744] Avg episode reward: [(0, '314.225')] +[2023-03-11 18:09:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000002544_1302528.pth... +[2023-03-11 18:09:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000001920_983040.pth +[2023-03-11 18:09:49,660][66031] Updated weights for policy 0, policy_version 2560 (0.0005) +[2023-03-11 18:09:53,601][66031] Updated weights for policy 0, policy_version 2640 (0.0005) +[2023-03-11 18:09:54,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10854.4, 300 sec: 10012.4). Total num frames: 1351680. Throughput: 0: 10707.7. Samples: 1350368. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:09:54,012][65744] Avg episode reward: [(0, '295.956')] +[2023-03-11 18:09:57,580][66031] Updated weights for policy 0, policy_version 2720 (0.0005) +[2023-03-11 18:09:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10786.1, 300 sec: 10035.2). Total num frames: 1404928. Throughput: 0: 10608.6. Samples: 1380864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:09:59,012][65744] Avg episode reward: [(0, '257.125')] +[2023-03-11 18:10:01,425][66031] Updated weights for policy 0, policy_version 2800 (0.0004) +[2023-03-11 18:10:04,012][65744] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10056.4). Total num frames: 1458176. Throughput: 0: 10625.7. Samples: 1444876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:10:04,012][65744] Avg episode reward: [(0, '362.321')] +[2023-03-11 18:10:04,028][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000002856_1462272.pth... +[2023-03-11 18:10:04,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000002232_1142784.pth +[2023-03-11 18:10:04,030][65987] Saving new best policy, reward=362.321! +[2023-03-11 18:10:05,168][66031] Updated weights for policy 0, policy_version 2880 (0.0004) +[2023-03-11 18:10:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10076.2). Total num frames: 1511424. Throughput: 0: 10611.5. Samples: 1509344. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:10:09,012][65744] Avg episode reward: [(0, '505.525')] +[2023-03-11 18:10:09,024][65987] Saving new best policy, reward=505.525! +[2023-03-11 18:10:09,026][66031] Updated weights for policy 0, policy_version 2960 (0.0004) +[2023-03-11 18:10:12,740][66031] Updated weights for policy 0, policy_version 3040 (0.0005) +[2023-03-11 18:10:14,012][65744] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10121.1). Total num frames: 1568768. Throughput: 0: 10626.0. Samples: 1542192. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:10:14,012][65744] Avg episode reward: [(0, '633.992')] +[2023-03-11 18:10:14,013][65987] Saving new best policy, reward=633.992! +[2023-03-11 18:10:16,548][66031] Updated weights for policy 0, policy_version 3120 (0.0005) +[2023-03-11 18:10:19,012][65744] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10137.6). Total num frames: 1622016. Throughput: 0: 10644.4. Samples: 1606764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:10:19,012][65744] Avg episode reward: [(0, '771.082')] +[2023-03-11 18:10:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000003168_1622016.pth... +[2023-03-11 18:10:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000002544_1302528.pth +[2023-03-11 18:10:19,018][65987] Saving new best policy, reward=771.082! +[2023-03-11 18:10:20,349][66031] Updated weights for policy 0, policy_version 3200 (0.0004) +[2023-03-11 18:10:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10153.1). Total num frames: 1675264. Throughput: 0: 10649.4. Samples: 1671232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:10:24,012][65744] Avg episode reward: [(0, '849.815')] +[2023-03-11 18:10:24,013][65987] Saving new best policy, reward=849.815! +[2023-03-11 18:10:24,198][66031] Updated weights for policy 0, policy_version 3280 (0.0004) +[2023-03-11 18:10:28,087][66031] Updated weights for policy 0, policy_version 3360 (0.0004) +[2023-03-11 18:10:29,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10167.7). Total num frames: 1728512. Throughput: 0: 10658.7. Samples: 1703404. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:10:29,012][65744] Avg episode reward: [(0, '821.902')] +[2023-03-11 18:10:31,943][66031] Updated weights for policy 0, policy_version 3440 (0.0004) +[2023-03-11 18:10:34,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10181.5). Total num frames: 1781760. Throughput: 0: 10641.4. Samples: 1766348. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:10:34,012][65744] Avg episode reward: [(0, '822.862')] +[2023-03-11 18:10:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000003480_1781760.pth... +[2023-03-11 18:10:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000002856_1462272.pth +[2023-03-11 18:10:35,830][66031] Updated weights for policy 0, policy_version 3520 (0.0004) +[2023-03-11 18:10:39,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10581.3, 300 sec: 10171.7). Total num frames: 1830912. Throughput: 0: 10637.6. Samples: 1829060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:10:39,013][65744] Avg episode reward: [(0, '860.833')] +[2023-03-11 18:10:39,053][65987] Saving new best policy, reward=860.833! +[2023-03-11 18:10:39,869][66031] Updated weights for policy 0, policy_version 3600 (0.0004) +[2023-03-11 18:10:43,977][66031] Updated weights for policy 0, policy_version 3680 (0.0005) +[2023-03-11 18:10:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10184.6). Total num frames: 1884160. Throughput: 0: 10631.0. Samples: 1859260. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:10:44,012][65744] Avg episode reward: [(0, '918.438')] +[2023-03-11 18:10:44,012][65987] Saving new best policy, reward=918.438! +[2023-03-11 18:10:48,120][66031] Updated weights for policy 0, policy_version 3760 (0.0005) +[2023-03-11 18:10:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10175.3). Total num frames: 1933312. Throughput: 0: 10525.3. Samples: 1918516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:10:49,012][65744] Avg episode reward: [(0, '864.188')] +[2023-03-11 18:10:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000003776_1933312.pth... +[2023-03-11 18:10:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000003168_1622016.pth +[2023-03-11 18:10:52,304][66031] Updated weights for policy 0, policy_version 3840 (0.0005) +[2023-03-11 18:10:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10513.1, 300 sec: 10166.5). Total num frames: 1982464. Throughput: 0: 10415.9. Samples: 1978060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:10:54,012][65744] Avg episode reward: [(0, '736.250')] +[2023-03-11 18:10:56,337][66031] Updated weights for policy 0, policy_version 3920 (0.0005) +[2023-03-11 18:10:59,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10444.8, 300 sec: 10158.1). Total num frames: 2031616. Throughput: 0: 10347.6. Samples: 2007836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:10:59,012][65744] Avg episode reward: [(0, '717.532')] +[2023-03-11 18:11:00,443][66031] Updated weights for policy 0, policy_version 4000 (0.0005) +[2023-03-11 18:11:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10376.5, 300 sec: 10150.1). Total num frames: 2080768. Throughput: 0: 10260.6. Samples: 2068492. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:11:04,012][65744] Avg episode reward: [(0, '841.911')] +[2023-03-11 18:11:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000004064_2080768.pth... +[2023-03-11 18:11:04,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000003480_1781760.pth +[2023-03-11 18:11:04,523][66031] Updated weights for policy 0, policy_version 4080 (0.0005) +[2023-03-11 18:11:08,651][66031] Updated weights for policy 0, policy_version 4160 (0.0005) +[2023-03-11 18:11:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10142.5). Total num frames: 2129920. Throughput: 0: 10152.1. Samples: 2128076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:11:09,012][65744] Avg episode reward: [(0, '1065.431')] +[2023-03-11 18:11:09,072][65987] Saving new best policy, reward=1065.431! +[2023-03-11 18:11:12,773][66031] Updated weights for policy 0, policy_version 4240 (0.0005) +[2023-03-11 18:11:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10135.2). Total num frames: 2179072. Throughput: 0: 10112.2. Samples: 2158452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:11:14,012][65744] Avg episode reward: [(0, '1128.214')] +[2023-03-11 18:11:14,026][65987] Saving new best policy, reward=1128.214! +[2023-03-11 18:11:16,993][66031] Updated weights for policy 0, policy_version 4320 (0.0005) +[2023-03-11 18:11:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10128.3). Total num frames: 2228224. Throughput: 0: 9994.1. Samples: 2216084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:11:19,012][65744] Avg episode reward: [(0, '1105.877')] +[2023-03-11 18:11:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000004352_2228224.pth... +[2023-03-11 18:11:19,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000003776_1933312.pth +[2023-03-11 18:11:21,224][66031] Updated weights for policy 0, policy_version 4400 (0.0005) +[2023-03-11 18:11:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10121.7). Total num frames: 2277376. Throughput: 0: 9899.7. Samples: 2274544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:11:24,012][65744] Avg episode reward: [(0, '838.879')] +[2023-03-11 18:11:25,414][66031] Updated weights for policy 0, policy_version 4480 (0.0005) +[2023-03-11 18:11:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10115.3). Total num frames: 2326528. Throughput: 0: 9883.3. Samples: 2304008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:11:29,012][65744] Avg episode reward: [(0, '887.167')] +[2023-03-11 18:11:29,593][66031] Updated weights for policy 0, policy_version 4560 (0.0004) +[2023-03-11 18:11:33,786][66031] Updated weights for policy 0, policy_version 4640 (0.0005) +[2023-03-11 18:11:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10109.3). Total num frames: 2375680. Throughput: 0: 9874.8. Samples: 2362880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:11:34,012][65744] Avg episode reward: [(0, '964.114')] +[2023-03-11 18:11:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000004640_2375680.pth... +[2023-03-11 18:11:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000004064_2080768.pth +[2023-03-11 18:11:37,934][66031] Updated weights for policy 0, policy_version 4720 (0.0004) +[2023-03-11 18:11:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10103.5). Total num frames: 2424832. Throughput: 0: 9851.1. Samples: 2421360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:11:39,012][65744] Avg episode reward: [(0, '1122.962')] +[2023-03-11 18:11:42,196][66031] Updated weights for policy 0, policy_version 4800 (0.0005) +[2023-03-11 18:11:44,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 10097.9). Total num frames: 2473984. Throughput: 0: 9832.1. Samples: 2450280. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:11:44,012][65744] Avg episode reward: [(0, '996.711')] +[2023-03-11 18:11:46,489][66031] Updated weights for policy 0, policy_version 4880 (0.0005) +[2023-03-11 18:11:49,012][65744] Fps is (10 sec: 9830.2, 60 sec: 9830.4, 300 sec: 10092.5). Total num frames: 2523136. Throughput: 0: 9757.3. Samples: 2507572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:11:49,013][65744] Avg episode reward: [(0, '1101.505')] +[2023-03-11 18:11:49,017][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000004928_2523136.pth... +[2023-03-11 18:11:49,020][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000004352_2228224.pth +[2023-03-11 18:11:50,640][66031] Updated weights for policy 0, policy_version 4960 (0.0005) +[2023-03-11 18:11:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10087.4). Total num frames: 2572288. Throughput: 0: 9776.0. Samples: 2567996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:11:54,012][65744] Avg episode reward: [(0, '1501.650')] +[2023-03-11 18:11:54,013][65987] Saving new best policy, reward=1501.650! +[2023-03-11 18:11:54,777][66031] Updated weights for policy 0, policy_version 5040 (0.0005) +[2023-03-11 18:11:58,909][66031] Updated weights for policy 0, policy_version 5120 (0.0005) +[2023-03-11 18:11:59,012][65744] Fps is (10 sec: 9830.6, 60 sec: 9830.4, 300 sec: 10082.5). Total num frames: 2621440. Throughput: 0: 9744.1. Samples: 2596936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:11:59,012][65744] Avg episode reward: [(0, '1466.158')] +[2023-03-11 18:12:03,059][66031] Updated weights for policy 0, policy_version 5200 (0.0005) +[2023-03-11 18:12:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 10077.7). Total num frames: 2670592. Throughput: 0: 9789.9. Samples: 2656628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:12:04,012][65744] Avg episode reward: [(0, '1713.147')] +[2023-03-11 18:12:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000005216_2670592.pth... +[2023-03-11 18:12:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000004640_2375680.pth +[2023-03-11 18:12:04,019][65987] Saving new best policy, reward=1713.147! +[2023-03-11 18:12:07,027][66031] Updated weights for policy 0, policy_version 5280 (0.0004) +[2023-03-11 18:12:09,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 10073.1). Total num frames: 2719744. Throughput: 0: 9832.9. Samples: 2717024. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:12:09,012][65744] Avg episode reward: [(0, '1385.773')] +[2023-03-11 18:12:11,263][66031] Updated weights for policy 0, policy_version 5360 (0.0005) +[2023-03-11 18:12:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 10068.7). Total num frames: 2768896. Throughput: 0: 9828.4. Samples: 2746288. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:12:14,012][65744] Avg episode reward: [(0, '1460.425')] +[2023-03-11 18:12:15,476][66031] Updated weights for policy 0, policy_version 5440 (0.0005) +[2023-03-11 18:12:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10064.5). Total num frames: 2818048. Throughput: 0: 9802.4. Samples: 2803988. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:12:19,012][65744] Avg episode reward: [(0, '1604.223')] +[2023-03-11 18:12:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000005504_2818048.pth... +[2023-03-11 18:12:19,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000004928_2523136.pth +[2023-03-11 18:12:19,693][66031] Updated weights for policy 0, policy_version 5520 (0.0005) +[2023-03-11 18:12:23,777][66031] Updated weights for policy 0, policy_version 5600 (0.0005) +[2023-03-11 18:12:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10060.3). Total num frames: 2867200. Throughput: 0: 9828.9. Samples: 2863660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:12:24,012][65744] Avg episode reward: [(0, '1713.469')] +[2023-03-11 18:12:24,013][65987] Saving new best policy, reward=1713.469! +[2023-03-11 18:12:27,852][66031] Updated weights for policy 0, policy_version 5680 (0.0005) +[2023-03-11 18:12:29,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 10070.5). Total num frames: 2920448. Throughput: 0: 9863.9. Samples: 2894156. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:12:29,012][65744] Avg episode reward: [(0, '1871.068')] +[2023-03-11 18:12:29,012][65987] Saving new best policy, reward=1871.068! +[2023-03-11 18:12:31,916][66031] Updated weights for policy 0, policy_version 5760 (0.0005) +[2023-03-11 18:12:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10052.6). Total num frames: 2965504. Throughput: 0: 9919.5. Samples: 2953948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:12:34,012][65744] Avg episode reward: [(0, '1906.319')] +[2023-03-11 18:12:34,056][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000005800_2969600.pth... +[2023-03-11 18:12:34,058][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000005216_2670592.pth +[2023-03-11 18:12:34,059][65987] Saving new best policy, reward=1906.319! +[2023-03-11 18:12:36,116][66031] Updated weights for policy 0, policy_version 5840 (0.0005) +[2023-03-11 18:12:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 10219.2). Total num frames: 3014656. Throughput: 0: 9894.8. Samples: 3013260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:12:39,012][65744] Avg episode reward: [(0, '1549.934')] +[2023-03-11 18:12:40,324][66031] Updated weights for policy 0, policy_version 5920 (0.0005) +[2023-03-11 18:12:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10399.7). Total num frames: 3067904. Throughput: 0: 9905.3. Samples: 3042676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:12:44,023][65744] Avg episode reward: [(0, '1756.990')] +[2023-03-11 18:12:44,411][66031] Updated weights for policy 0, policy_version 6000 (0.0005) +[2023-03-11 18:12:48,427][66031] Updated weights for policy 0, policy_version 6080 (0.0004) +[2023-03-11 18:12:49,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 10413.6). Total num frames: 3117056. Throughput: 0: 9917.5. Samples: 3102916. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:12:49,012][65744] Avg episode reward: [(0, '2060.751')] +[2023-03-11 18:12:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000006088_3117056.pth... +[2023-03-11 18:12:49,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000005504_2818048.pth +[2023-03-11 18:12:49,029][65987] Saving new best policy, reward=2060.751! +[2023-03-11 18:12:52,489][66031] Updated weights for policy 0, policy_version 6160 (0.0005) +[2023-03-11 18:12:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10399.7). Total num frames: 3166208. Throughput: 0: 9908.9. Samples: 3162924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:12:54,012][65744] Avg episode reward: [(0, '2097.000')] +[2023-03-11 18:12:54,023][65987] Saving new best policy, reward=2097.000! +[2023-03-11 18:12:56,784][66031] Updated weights for policy 0, policy_version 6240 (0.0005) +[2023-03-11 18:12:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10385.8). Total num frames: 3215360. Throughput: 0: 9896.4. Samples: 3191624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:12:59,012][65744] Avg episode reward: [(0, '1830.340')] +[2023-03-11 18:13:01,023][66031] Updated weights for policy 0, policy_version 6320 (0.0005) +[2023-03-11 18:13:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10385.8). Total num frames: 3264512. Throughput: 0: 9903.0. Samples: 3249620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:13:04,012][65744] Avg episode reward: [(0, '2027.250')] +[2023-03-11 18:13:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000006376_3264512.pth... +[2023-03-11 18:13:04,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000005800_2969600.pth +[2023-03-11 18:13:05,232][66031] Updated weights for policy 0, policy_version 6400 (0.0005) +[2023-03-11 18:13:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 10358.0). Total num frames: 3309568. Throughput: 0: 9864.9. Samples: 3307580. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:13:09,012][65744] Avg episode reward: [(0, '2153.467')] +[2023-03-11 18:13:09,074][65987] Saving new best policy, reward=2153.467! +[2023-03-11 18:13:09,521][66031] Updated weights for policy 0, policy_version 6480 (0.0005) +[2023-03-11 18:13:13,797][66031] Updated weights for policy 0, policy_version 6560 (0.0005) +[2023-03-11 18:13:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 10344.1). Total num frames: 3358720. Throughput: 0: 9825.3. Samples: 3336296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:13:14,012][65744] Avg episode reward: [(0, '2370.795')] +[2023-03-11 18:13:14,013][65987] Saving new best policy, reward=2370.795! +[2023-03-11 18:13:18,203][66031] Updated weights for policy 0, policy_version 6640 (0.0005) +[2023-03-11 18:13:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.2, 300 sec: 10316.4). Total num frames: 3403776. Throughput: 0: 9743.3. Samples: 3392396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:13:19,012][65744] Avg episode reward: [(0, '2088.631')] +[2023-03-11 18:13:19,077][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000006656_3407872.pth... +[2023-03-11 18:13:19,079][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000006088_3117056.pth +[2023-03-11 18:13:22,496][66031] Updated weights for policy 0, policy_version 6720 (0.0005) +[2023-03-11 18:13:24,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 10288.6). Total num frames: 3452928. Throughput: 0: 9692.3. Samples: 3449416. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:13:24,012][65744] Avg episode reward: [(0, '1671.237')] +[2023-03-11 18:13:26,853][66031] Updated weights for policy 0, policy_version 6800 (0.0005) +[2023-03-11 18:13:29,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 10260.8). Total num frames: 3502080. Throughput: 0: 9664.6. Samples: 3477584. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:13:29,023][65744] Avg episode reward: [(0, '1703.991')] +[2023-03-11 18:13:31,014][66031] Updated weights for policy 0, policy_version 6880 (0.0004) +[2023-03-11 18:13:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 10233.1). Total num frames: 3551232. Throughput: 0: 9638.0. Samples: 3536628. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:13:34,012][65744] Avg episode reward: [(0, '2034.997')] +[2023-03-11 18:13:34,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000006936_3551232.pth... +[2023-03-11 18:13:34,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000006376_3264512.pth +[2023-03-11 18:13:35,180][66031] Updated weights for policy 0, policy_version 6960 (0.0004) +[2023-03-11 18:13:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10219.2). Total num frames: 3600384. Throughput: 0: 9631.9. Samples: 3596360. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:13:39,012][65744] Avg episode reward: [(0, '2020.592')] +[2023-03-11 18:13:39,206][66031] Updated weights for policy 0, policy_version 7040 (0.0003) +[2023-03-11 18:13:43,383][66031] Updated weights for policy 0, policy_version 7120 (0.0005) +[2023-03-11 18:13:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 10191.4). Total num frames: 3649536. Throughput: 0: 9667.1. Samples: 3626644. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:13:44,012][65744] Avg episode reward: [(0, '2085.415')] +[2023-03-11 18:13:47,632][66031] Updated weights for policy 0, policy_version 7200 (0.0005) +[2023-03-11 18:13:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 10163.6). Total num frames: 3698688. Throughput: 0: 9667.8. Samples: 3684672. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:13:49,012][65744] Avg episode reward: [(0, '2181.455')] +[2023-03-11 18:13:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000007224_3698688.pth... +[2023-03-11 18:13:49,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000006656_3407872.pth +[2023-03-11 18:13:51,955][66031] Updated weights for policy 0, policy_version 7280 (0.0005) +[2023-03-11 18:13:54,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 10122.0). Total num frames: 3743744. Throughput: 0: 9648.2. Samples: 3741748. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:13:54,012][65744] Avg episode reward: [(0, '1851.959')] +[2023-03-11 18:13:56,306][66031] Updated weights for policy 0, policy_version 7360 (0.0005) +[2023-03-11 18:13:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 10094.2). Total num frames: 3792896. Throughput: 0: 9630.6. Samples: 3769672. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:13:59,012][65744] Avg episode reward: [(0, '1938.425')] +[2023-03-11 18:14:00,624][66031] Updated weights for policy 0, policy_version 7440 (0.0005) +[2023-03-11 18:14:04,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 10052.6). Total num frames: 3837952. Throughput: 0: 9639.4. Samples: 3826172. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:14:04,012][65744] Avg episode reward: [(0, '2055.072')] +[2023-03-11 18:14:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000007496_3837952.pth... +[2023-03-11 18:14:04,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000006936_3551232.pth +[2023-03-11 18:14:04,862][66031] Updated weights for policy 0, policy_version 7520 (0.0005) +[2023-03-11 18:14:08,886][66031] Updated weights for policy 0, policy_version 7600 (0.0005) +[2023-03-11 18:14:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 10038.7). Total num frames: 3891200. Throughput: 0: 9726.6. Samples: 3887112. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:14:09,012][65744] Avg episode reward: [(0, '1841.343')] +[2023-03-11 18:14:13,070][66031] Updated weights for policy 0, policy_version 7680 (0.0004) +[2023-03-11 18:14:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9693.9, 300 sec: 10024.8). Total num frames: 3940352. Throughput: 0: 9746.2. Samples: 3916164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:14:14,012][65744] Avg episode reward: [(0, '1870.025')] +[2023-03-11 18:14:17,320][66031] Updated weights for policy 0, policy_version 7760 (0.0005) +[2023-03-11 18:14:19,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9693.8, 300 sec: 9997.0). Total num frames: 3985408. Throughput: 0: 9726.6. Samples: 3974324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:14:19,012][65744] Avg episode reward: [(0, '2093.355')] +[2023-03-11 18:14:19,019][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000007792_3989504.pth... +[2023-03-11 18:14:19,020][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000007224_3698688.pth +[2023-03-11 18:14:21,402][66031] Updated weights for policy 0, policy_version 7840 (0.0004) +[2023-03-11 18:14:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9997.0). Total num frames: 4038656. Throughput: 0: 9728.8. Samples: 4034156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:14:24,012][65744] Avg episode reward: [(0, '1565.012')] +[2023-03-11 18:14:25,678][66031] Updated weights for policy 0, policy_version 7920 (0.0005) +[2023-03-11 18:14:29,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9969.2). Total num frames: 4083712. Throughput: 0: 9695.0. Samples: 4062920. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:14:29,012][65744] Avg episode reward: [(0, '2131.427')] +[2023-03-11 18:14:29,959][66031] Updated weights for policy 0, policy_version 8000 (0.0005) +[2023-03-11 18:14:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9955.4). Total num frames: 4132864. Throughput: 0: 9680.6. Samples: 4120300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:14:34,012][65744] Avg episode reward: [(0, '2160.505')] +[2023-03-11 18:14:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000008072_4132864.pth... +[2023-03-11 18:14:34,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000007496_3837952.pth +[2023-03-11 18:14:34,178][66031] Updated weights for policy 0, policy_version 8080 (0.0005) +[2023-03-11 18:14:38,371][66031] Updated weights for policy 0, policy_version 8160 (0.0005) +[2023-03-11 18:14:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9941.5). Total num frames: 4182016. Throughput: 0: 9705.1. Samples: 4178476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:14:39,012][65744] Avg episode reward: [(0, '1730.126')] +[2023-03-11 18:14:42,659][66031] Updated weights for policy 0, policy_version 8240 (0.0005) +[2023-03-11 18:14:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9927.6). Total num frames: 4231168. Throughput: 0: 9716.5. Samples: 4206912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:14:44,012][65744] Avg episode reward: [(0, '2177.313')] +[2023-03-11 18:14:46,971][66031] Updated weights for policy 0, policy_version 8320 (0.0005) +[2023-03-11 18:14:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9913.7). Total num frames: 4276224. Throughput: 0: 9736.6. Samples: 4264320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:14:49,012][65744] Avg episode reward: [(0, '2014.978')] +[2023-03-11 18:14:49,060][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000008360_4280320.pth... +[2023-03-11 18:14:49,061][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000007792_3989504.pth +[2023-03-11 18:14:51,216][66031] Updated weights for policy 0, policy_version 8400 (0.0005) +[2023-03-11 18:14:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9899.8). Total num frames: 4325376. Throughput: 0: 9654.4. Samples: 4321560. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:14:54,012][65744] Avg episode reward: [(0, '2197.016')] +[2023-03-11 18:14:55,527][66031] Updated weights for policy 0, policy_version 8480 (0.0005) +[2023-03-11 18:14:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9885.9). Total num frames: 4374528. Throughput: 0: 9657.2. Samples: 4350736. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:14:59,012][65744] Avg episode reward: [(0, '2230.043')] +[2023-03-11 18:14:59,700][66031] Updated weights for policy 0, policy_version 8560 (0.0005) +[2023-03-11 18:15:03,931][66031] Updated weights for policy 0, policy_version 8640 (0.0005) +[2023-03-11 18:15:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9872.0). Total num frames: 4423680. Throughput: 0: 9670.9. Samples: 4409516. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:15:04,012][65744] Avg episode reward: [(0, '2078.381')] +[2023-03-11 18:15:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000008640_4423680.pth... +[2023-03-11 18:15:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000008072_4132864.pth +[2023-03-11 18:15:07,865][66031] Updated weights for policy 0, policy_version 8720 (0.0004) +[2023-03-11 18:15:09,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 9858.2). Total num frames: 4476928. Throughput: 0: 9704.8. Samples: 4470872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:15:09,012][65744] Avg episode reward: [(0, '2031.194')] +[2023-03-11 18:15:12,031][66031] Updated weights for policy 0, policy_version 8800 (0.0004) +[2023-03-11 18:15:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9830.4). Total num frames: 4521984. Throughput: 0: 9722.6. Samples: 4500436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:15:14,012][65744] Avg episode reward: [(0, '1682.843')] +[2023-03-11 18:15:16,173][66031] Updated weights for policy 0, policy_version 8880 (0.0005) +[2023-03-11 18:15:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 4575232. Throughput: 0: 9759.4. Samples: 4559472. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:15:19,012][65744] Avg episode reward: [(0, '2263.570')] +[2023-03-11 18:15:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000008936_4575232.pth... +[2023-03-11 18:15:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000008360_4280320.pth +[2023-03-11 18:15:20,182][66031] Updated weights for policy 0, policy_version 8960 (0.0005) +[2023-03-11 18:15:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9816.5). Total num frames: 4624384. Throughput: 0: 9854.1. Samples: 4621912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:15:24,012][65744] Avg episode reward: [(0, '1596.711')] +[2023-03-11 18:15:24,082][66031] Updated weights for policy 0, policy_version 9040 (0.0005) +[2023-03-11 18:15:27,984][66031] Updated weights for policy 0, policy_version 9120 (0.0005) +[2023-03-11 18:15:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9816.5). Total num frames: 4677632. Throughput: 0: 9915.9. Samples: 4653128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:15:29,012][65744] Avg episode reward: [(0, '1917.407')] +[2023-03-11 18:15:31,942][66031] Updated weights for policy 0, policy_version 9200 (0.0005) +[2023-03-11 18:15:34,012][65744] Fps is (10 sec: 10649.6, 60 sec: 9966.9, 300 sec: 9830.4). Total num frames: 4730880. Throughput: 0: 10027.8. Samples: 4715572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:15:34,012][65744] Avg episode reward: [(0, '1612.477')] +[2023-03-11 18:15:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000009240_4730880.pth... +[2023-03-11 18:15:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000008640_4423680.pth +[2023-03-11 18:15:35,908][66031] Updated weights for policy 0, policy_version 9280 (0.0005) +[2023-03-11 18:15:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9816.5). Total num frames: 4780032. Throughput: 0: 10098.8. Samples: 4776008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:15:39,012][65744] Avg episode reward: [(0, '2119.669')] +[2023-03-11 18:15:40,100][66031] Updated weights for policy 0, policy_version 9360 (0.0005) +[2023-03-11 18:15:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9816.5). Total num frames: 4829184. Throughput: 0: 10099.9. Samples: 4805232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:15:44,012][65744] Avg episode reward: [(0, '2161.369')] +[2023-03-11 18:15:44,363][66031] Updated weights for policy 0, policy_version 9440 (0.0005) +[2023-03-11 18:15:48,628][66031] Updated weights for policy 0, policy_version 9520 (0.0005) +[2023-03-11 18:15:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9802.6). Total num frames: 4874240. Throughput: 0: 10073.6. Samples: 4862828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:15:49,012][65744] Avg episode reward: [(0, '2026.963')] +[2023-03-11 18:15:49,065][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000009528_4878336.pth... +[2023-03-11 18:15:49,066][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000008936_4575232.pth +[2023-03-11 18:15:52,922][66031] Updated weights for policy 0, policy_version 9600 (0.0005) +[2023-03-11 18:15:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9802.6). Total num frames: 4923392. Throughput: 0: 9991.2. Samples: 4920476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:15:54,012][65744] Avg episode reward: [(0, '2000.851')] +[2023-03-11 18:15:56,849][66031] Updated weights for policy 0, policy_version 9680 (0.0005) +[2023-03-11 18:15:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9816.5). Total num frames: 4976640. Throughput: 0: 10036.6. Samples: 4952084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:15:59,023][65744] Avg episode reward: [(0, '1993.722')] +[2023-03-11 18:16:00,830][66031] Updated weights for policy 0, policy_version 9760 (0.0005) +[2023-03-11 18:16:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9816.5). Total num frames: 5025792. Throughput: 0: 10090.0. Samples: 5013524. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:16:04,012][65744] Avg episode reward: [(0, '2268.767')] +[2023-03-11 18:16:04,045][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000009824_5029888.pth... +[2023-03-11 18:16:04,047][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000009240_4730880.pth +[2023-03-11 18:16:04,861][66031] Updated weights for policy 0, policy_version 9840 (0.0005) +[2023-03-11 18:16:08,892][66031] Updated weights for policy 0, policy_version 9920 (0.0005) +[2023-03-11 18:16:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9830.4). Total num frames: 5079040. Throughput: 0: 10066.0. Samples: 5074880. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:16:09,012][65744] Avg episode reward: [(0, '2029.921')] +[2023-03-11 18:16:12,786][66031] Updated weights for policy 0, policy_version 10000 (0.0005) +[2023-03-11 18:16:14,012][65744] Fps is (10 sec: 10649.7, 60 sec: 10171.7, 300 sec: 9844.3). Total num frames: 5132288. Throughput: 0: 10061.2. Samples: 5105880. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:16:14,012][65744] Avg episode reward: [(0, '2252.879')] +[2023-03-11 18:16:16,691][66031] Updated weights for policy 0, policy_version 10080 (0.0004) +[2023-03-11 18:16:19,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9844.3). Total num frames: 5181440. Throughput: 0: 10079.4. Samples: 5169144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:16:19,012][65744] Avg episode reward: [(0, '2053.258')] +[2023-03-11 18:16:19,048][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000010128_5185536.pth... +[2023-03-11 18:16:19,050][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000009528_4878336.pth +[2023-03-11 18:16:20,628][66031] Updated weights for policy 0, policy_version 10160 (0.0005) +[2023-03-11 18:16:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9858.2). Total num frames: 5234688. Throughput: 0: 10116.4. Samples: 5231244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:16:24,012][65744] Avg episode reward: [(0, '2157.023')] +[2023-03-11 18:16:24,531][66031] Updated weights for policy 0, policy_version 10240 (0.0004) +[2023-03-11 18:16:28,502][66031] Updated weights for policy 0, policy_version 10320 (0.0005) +[2023-03-11 18:16:29,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 9872.1). Total num frames: 5287936. Throughput: 0: 10169.7. Samples: 5262868. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:16:29,012][65744] Avg episode reward: [(0, '2235.112')] +[2023-03-11 18:16:32,509][66031] Updated weights for policy 0, policy_version 10400 (0.0005) +[2023-03-11 18:16:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9872.1). Total num frames: 5337088. Throughput: 0: 10258.3. Samples: 5324452. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:16:34,012][65744] Avg episode reward: [(0, '2079.557')] +[2023-03-11 18:16:34,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000010424_5337088.pth... +[2023-03-11 18:16:34,016][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000009824_5029888.pth +[2023-03-11 18:16:36,452][66031] Updated weights for policy 0, policy_version 10480 (0.0005) +[2023-03-11 18:16:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9885.9). Total num frames: 5390336. Throughput: 0: 10351.7. Samples: 5386304. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:16:39,023][65744] Avg episode reward: [(0, '1901.357')] +[2023-03-11 18:16:40,385][66031] Updated weights for policy 0, policy_version 10560 (0.0004) +[2023-03-11 18:16:44,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 9899.8). Total num frames: 5443584. Throughput: 0: 10355.3. Samples: 5418072. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:16:44,012][65744] Avg episode reward: [(0, '2310.364')] +[2023-03-11 18:16:44,337][66031] Updated weights for policy 0, policy_version 10640 (0.0004) +[2023-03-11 18:16:48,286][66031] Updated weights for policy 0, policy_version 10720 (0.0005) +[2023-03-11 18:16:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9899.8). Total num frames: 5492736. Throughput: 0: 10366.4. Samples: 5480012. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:16:49,012][65744] Avg episode reward: [(0, '2559.012')] +[2023-03-11 18:16:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000010728_5492736.pth... +[2023-03-11 18:16:49,027][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000010128_5185536.pth +[2023-03-11 18:16:49,027][65987] Saving new best policy, reward=2559.012! +[2023-03-11 18:16:52,561][66031] Updated weights for policy 0, policy_version 10800 (0.0005) +[2023-03-11 18:16:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 9899.8). Total num frames: 5541888. Throughput: 0: 10288.4. Samples: 5537856. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:16:54,012][65744] Avg episode reward: [(0, '2076.139')] +[2023-03-11 18:16:56,832][66031] Updated weights for policy 0, policy_version 10880 (0.0005) +[2023-03-11 18:16:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 9899.8). Total num frames: 5591040. Throughput: 0: 10241.6. Samples: 5566752. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:16:59,012][65744] Avg episode reward: [(0, '2134.770')] +[2023-03-11 18:17:01,117][66031] Updated weights for policy 0, policy_version 10960 (0.0005) +[2023-03-11 18:17:04,012][65744] Fps is (10 sec: 9420.7, 60 sec: 10171.7, 300 sec: 9885.9). Total num frames: 5636096. Throughput: 0: 10116.3. Samples: 5624376. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:17:04,012][65744] Avg episode reward: [(0, '2512.705')] +[2023-03-11 18:17:04,044][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000011016_5640192.pth... +[2023-03-11 18:17:04,046][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000010424_5337088.pth +[2023-03-11 18:17:05,319][66031] Updated weights for policy 0, policy_version 11040 (0.0005) +[2023-03-11 18:17:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10103.5, 300 sec: 9885.9). Total num frames: 5685248. Throughput: 0: 10015.3. Samples: 5681932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:17:09,012][65744] Avg episode reward: [(0, '2373.957')] +[2023-03-11 18:17:09,635][66031] Updated weights for policy 0, policy_version 11120 (0.0005) +[2023-03-11 18:17:14,010][66031] Updated weights for policy 0, policy_version 11200 (0.0005) +[2023-03-11 18:17:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 9885.9). Total num frames: 5734400. Throughput: 0: 9946.5. Samples: 5710460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:17:14,012][65744] Avg episode reward: [(0, '2159.898')] +[2023-03-11 18:17:18,219][66031] Updated weights for policy 0, policy_version 11280 (0.0005) +[2023-03-11 18:17:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9872.1). Total num frames: 5779456. Throughput: 0: 9846.0. Samples: 5767520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:17:19,012][65744] Avg episode reward: [(0, '2554.439')] +[2023-03-11 18:17:19,068][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000011296_5783552.pth... +[2023-03-11 18:17:19,070][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000010728_5492736.pth +[2023-03-11 18:17:22,493][66031] Updated weights for policy 0, policy_version 11360 (0.0005) +[2023-03-11 18:17:24,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9898.7, 300 sec: 9858.2). Total num frames: 5828608. Throughput: 0: 9758.5. Samples: 5825436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:17:24,012][65744] Avg episode reward: [(0, '1908.560')] +[2023-03-11 18:17:26,783][66031] Updated weights for policy 0, policy_version 11440 (0.0005) +[2023-03-11 18:17:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9872.1). Total num frames: 5877760. Throughput: 0: 9688.4. Samples: 5854048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:17:29,012][65744] Avg episode reward: [(0, '2372.217')] +[2023-03-11 18:17:31,025][66031] Updated weights for policy 0, policy_version 11520 (0.0005) +[2023-03-11 18:17:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9872.1). Total num frames: 5926912. Throughput: 0: 9603.7. Samples: 5912180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:17:34,012][65744] Avg episode reward: [(0, '2317.484')] +[2023-03-11 18:17:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000011576_5926912.pth... +[2023-03-11 18:17:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000011016_5640192.pth +[2023-03-11 18:17:35,224][66031] Updated weights for policy 0, policy_version 11600 (0.0005) +[2023-03-11 18:17:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9844.3). Total num frames: 5971968. Throughput: 0: 9617.8. Samples: 5970656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:17:39,012][65744] Avg episode reward: [(0, '2543.829')] +[2023-03-11 18:17:39,466][66031] Updated weights for policy 0, policy_version 11680 (0.0004) +[2023-03-11 18:17:43,685][66031] Updated weights for policy 0, policy_version 11760 (0.0005) +[2023-03-11 18:17:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9844.3). Total num frames: 6021120. Throughput: 0: 9615.2. Samples: 5999436. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:17:44,012][65744] Avg episode reward: [(0, '2519.797')] +[2023-03-11 18:17:47,852][66031] Updated weights for policy 0, policy_version 11840 (0.0005) +[2023-03-11 18:17:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9844.3). Total num frames: 6070272. Throughput: 0: 9637.3. Samples: 6058056. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:17:49,012][65744] Avg episode reward: [(0, '2289.114')] +[2023-03-11 18:17:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000011856_6070272.pth... +[2023-03-11 18:17:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000011296_5783552.pth +[2023-03-11 18:17:52,042][66031] Updated weights for policy 0, policy_version 11920 (0.0005) +[2023-03-11 18:17:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9844.3). Total num frames: 6119424. Throughput: 0: 9652.0. Samples: 6116272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:17:54,012][65744] Avg episode reward: [(0, '2525.081')] +[2023-03-11 18:17:56,310][66031] Updated weights for policy 0, policy_version 12000 (0.0004) +[2023-03-11 18:17:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9844.3). Total num frames: 6168576. Throughput: 0: 9664.4. Samples: 6145360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:17:59,012][65744] Avg episode reward: [(0, '2704.366')] +[2023-03-11 18:17:59,013][65987] Saving new best policy, reward=2704.366! +[2023-03-11 18:18:00,565][66031] Updated weights for policy 0, policy_version 12080 (0.0005) +[2023-03-11 18:18:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9858.2). Total num frames: 6217728. Throughput: 0: 9683.3. Samples: 6203268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:18:04,012][65744] Avg episode reward: [(0, '2384.054')] +[2023-03-11 18:18:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000012144_6217728.pth... +[2023-03-11 18:18:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000011576_5926912.pth +[2023-03-11 18:18:04,828][66031] Updated weights for policy 0, policy_version 12160 (0.0005) +[2023-03-11 18:18:09,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9625.6, 300 sec: 9844.3). Total num frames: 6262784. Throughput: 0: 9669.1. Samples: 6260544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:18:09,012][65744] Avg episode reward: [(0, '2570.343')] +[2023-03-11 18:18:09,077][66031] Updated weights for policy 0, policy_version 12240 (0.0005) +[2023-03-11 18:18:13,376][66031] Updated weights for policy 0, policy_version 12320 (0.0005) +[2023-03-11 18:18:14,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9858.2). Total num frames: 6311936. Throughput: 0: 9682.8. Samples: 6289772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:18:14,012][65744] Avg episode reward: [(0, '2706.929')] +[2023-03-11 18:18:14,013][65987] Saving new best policy, reward=2706.929! +[2023-03-11 18:18:17,695][66031] Updated weights for policy 0, policy_version 12400 (0.0005) +[2023-03-11 18:18:19,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9858.2). Total num frames: 6361088. Throughput: 0: 9655.5. Samples: 6346676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:18:19,012][65744] Avg episode reward: [(0, '2434.043')] +[2023-03-11 18:18:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000012424_6361088.pth... +[2023-03-11 18:18:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000011856_6070272.pth +[2023-03-11 18:18:21,865][66031] Updated weights for policy 0, policy_version 12480 (0.0005) +[2023-03-11 18:18:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9858.2). Total num frames: 6410240. Throughput: 0: 9675.9. Samples: 6406072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:18:24,012][65744] Avg episode reward: [(0, '2468.658')] +[2023-03-11 18:18:26,013][66031] Updated weights for policy 0, policy_version 12560 (0.0005) +[2023-03-11 18:18:29,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9858.2). Total num frames: 6459392. Throughput: 0: 9676.5. Samples: 6434880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:18:29,023][65744] Avg episode reward: [(0, '2620.099')] +[2023-03-11 18:18:30,278][66031] Updated weights for policy 0, policy_version 12640 (0.0005) +[2023-03-11 18:18:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9844.3). Total num frames: 6504448. Throughput: 0: 9652.0. Samples: 6492396. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:18:34,012][65744] Avg episode reward: [(0, '2460.754')] +[2023-03-11 18:18:34,025][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000012704_6504448.pth... +[2023-03-11 18:18:34,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000012144_6217728.pth +[2023-03-11 18:18:34,504][66031] Updated weights for policy 0, policy_version 12720 (0.0005) +[2023-03-11 18:18:38,720][66031] Updated weights for policy 0, policy_version 12800 (0.0005) +[2023-03-11 18:18:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9844.3). Total num frames: 6553600. Throughput: 0: 9660.7. Samples: 6551004. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:18:39,012][65744] Avg episode reward: [(0, '2254.980')] +[2023-03-11 18:18:42,956][66031] Updated weights for policy 0, policy_version 12880 (0.0005) +[2023-03-11 18:18:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9844.3). Total num frames: 6602752. Throughput: 0: 9652.2. Samples: 6579708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:18:44,012][65744] Avg episode reward: [(0, '2380.559')] +[2023-03-11 18:18:47,216][66031] Updated weights for policy 0, policy_version 12960 (0.0006) +[2023-03-11 18:18:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9858.2). Total num frames: 6651904. Throughput: 0: 9661.7. Samples: 6638044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:18:49,012][65744] Avg episode reward: [(0, '2563.271')] +[2023-03-11 18:18:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000012992_6651904.pth... +[2023-03-11 18:18:49,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000012424_6361088.pth +[2023-03-11 18:18:51,213][66031] Updated weights for policy 0, policy_version 13040 (0.0005) +[2023-03-11 18:18:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9858.2). Total num frames: 6701056. Throughput: 0: 9745.2. Samples: 6699080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:18:54,012][65744] Avg episode reward: [(0, '2355.180')] +[2023-03-11 18:18:55,408][66031] Updated weights for policy 0, policy_version 13120 (0.0005) +[2023-03-11 18:18:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9872.1). Total num frames: 6750208. Throughput: 0: 9731.2. Samples: 6727676. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:18:59,012][65744] Avg episode reward: [(0, '2723.391')] +[2023-03-11 18:18:59,013][65987] Saving new best policy, reward=2723.391! +[2023-03-11 18:18:59,635][66031] Updated weights for policy 0, policy_version 13200 (0.0005) +[2023-03-11 18:19:03,923][66031] Updated weights for policy 0, policy_version 13280 (0.0005) +[2023-03-11 18:19:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9858.2). Total num frames: 6799360. Throughput: 0: 9739.8. Samples: 6784968. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:19:04,012][65744] Avg episode reward: [(0, '2739.840')] +[2023-03-11 18:19:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000013280_6799360.pth... +[2023-03-11 18:19:04,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000012704_6504448.pth +[2023-03-11 18:19:04,029][65987] Saving new best policy, reward=2739.840! +[2023-03-11 18:19:08,139][66031] Updated weights for policy 0, policy_version 13360 (0.0005) +[2023-03-11 18:19:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9844.3). Total num frames: 6844416. Throughput: 0: 9728.0. Samples: 6843832. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:19:09,012][65744] Avg episode reward: [(0, '2948.699')] +[2023-03-11 18:19:09,023][65987] Saving new best policy, reward=2948.699! +[2023-03-11 18:19:12,274][66031] Updated weights for policy 0, policy_version 13440 (0.0005) +[2023-03-11 18:19:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9872.1). Total num frames: 6897664. Throughput: 0: 9731.1. Samples: 6872780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:19:14,012][65744] Avg episode reward: [(0, '2673.189')] +[2023-03-11 18:19:16,409][66031] Updated weights for policy 0, policy_version 13520 (0.0005) +[2023-03-11 18:19:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9858.2). Total num frames: 6946816. Throughput: 0: 9774.4. Samples: 6932244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:19:19,012][65744] Avg episode reward: [(0, '2060.044')] +[2023-03-11 18:19:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000013568_6946816.pth... +[2023-03-11 18:19:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000012992_6651904.pth +[2023-03-11 18:19:20,441][66031] Updated weights for policy 0, policy_version 13600 (0.0005) +[2023-03-11 18:19:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9885.9). Total num frames: 7000064. Throughput: 0: 9855.9. Samples: 6994520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:19:24,012][65744] Avg episode reward: [(0, '2601.111')] +[2023-03-11 18:19:24,402][66031] Updated weights for policy 0, policy_version 13680 (0.0005) +[2023-03-11 18:19:28,419][66031] Updated weights for policy 0, policy_version 13760 (0.0005) +[2023-03-11 18:19:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9885.9). Total num frames: 7049216. Throughput: 0: 9889.0. Samples: 7024712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:19:29,012][65744] Avg episode reward: [(0, '2631.494')] +[2023-03-11 18:19:32,659][66031] Updated weights for policy 0, policy_version 13840 (0.0005) +[2023-03-11 18:19:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 7098368. Throughput: 0: 9909.2. Samples: 7083960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:19:34,012][65744] Avg episode reward: [(0, '2527.639')] +[2023-03-11 18:19:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000013864_7098368.pth... +[2023-03-11 18:19:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000013280_6799360.pth +[2023-03-11 18:19:36,945][66031] Updated weights for policy 0, policy_version 13920 (0.0005) +[2023-03-11 18:19:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 7147520. Throughput: 0: 9850.3. Samples: 7142344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:19:39,023][65744] Avg episode reward: [(0, '2769.207')] +[2023-03-11 18:19:40,874][66031] Updated weights for policy 0, policy_version 14000 (0.0005) +[2023-03-11 18:19:44,012][65744] Fps is (10 sec: 10240.2, 60 sec: 9967.0, 300 sec: 9913.7). Total num frames: 7200768. Throughput: 0: 9916.9. Samples: 7173936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:19:44,012][65744] Avg episode reward: [(0, '2279.933')] +[2023-03-11 18:19:44,783][66031] Updated weights for policy 0, policy_version 14080 (0.0004) +[2023-03-11 18:19:48,781][66031] Updated weights for policy 0, policy_version 14160 (0.0005) +[2023-03-11 18:19:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 7249920. Throughput: 0: 10022.3. Samples: 7235972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:19:49,012][65744] Avg episode reward: [(0, '2091.738')] +[2023-03-11 18:19:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000014160_7249920.pth... +[2023-03-11 18:19:49,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000013568_6946816.pth +[2023-03-11 18:19:52,782][66031] Updated weights for policy 0, policy_version 14240 (0.0005) +[2023-03-11 18:19:54,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 7303168. Throughput: 0: 10088.8. Samples: 7297828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:19:54,012][65744] Avg episode reward: [(0, '1658.718')] +[2023-03-11 18:19:56,802][66031] Updated weights for policy 0, policy_version 14320 (0.0005) +[2023-03-11 18:19:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 7352320. Throughput: 0: 10117.0. Samples: 7328044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:19:59,012][65744] Avg episode reward: [(0, '2203.092')] +[2023-03-11 18:20:00,784][66031] Updated weights for policy 0, policy_version 14400 (0.0005) +[2023-03-11 18:20:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9927.6). Total num frames: 7405568. Throughput: 0: 10160.6. Samples: 7389472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:20:04,012][65744] Avg episode reward: [(0, '2828.202')] +[2023-03-11 18:20:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000014464_7405568.pth... +[2023-03-11 18:20:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000013864_7098368.pth +[2023-03-11 18:20:04,773][66031] Updated weights for policy 0, policy_version 14480 (0.0005) +[2023-03-11 18:20:08,734][66031] Updated weights for policy 0, policy_version 14560 (0.0005) +[2023-03-11 18:20:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9941.5). Total num frames: 7454720. Throughput: 0: 10160.4. Samples: 7451736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:20:09,012][65744] Avg episode reward: [(0, '2318.684')] +[2023-03-11 18:20:12,746][66031] Updated weights for policy 0, policy_version 14640 (0.0005) +[2023-03-11 18:20:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9941.5). Total num frames: 7507968. Throughput: 0: 10181.7. Samples: 7482888. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:20:14,012][65744] Avg episode reward: [(0, '2933.706')] +[2023-03-11 18:20:16,939][66031] Updated weights for policy 0, policy_version 14720 (0.0005) +[2023-03-11 18:20:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9941.5). Total num frames: 7557120. Throughput: 0: 10167.8. Samples: 7541512. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:20:19,012][65744] Avg episode reward: [(0, '2622.100')] +[2023-03-11 18:20:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000014760_7557120.pth... +[2023-03-11 18:20:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000014160_7249920.pth +[2023-03-11 18:20:20,949][66031] Updated weights for policy 0, policy_version 14800 (0.0005) +[2023-03-11 18:20:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9927.6). Total num frames: 7606272. Throughput: 0: 10224.9. Samples: 7602464. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:20:24,012][65744] Avg episode reward: [(0, '2614.717')] +[2023-03-11 18:20:24,959][66031] Updated weights for policy 0, policy_version 14880 (0.0005) +[2023-03-11 18:20:28,963][66031] Updated weights for policy 0, policy_version 14960 (0.0005) +[2023-03-11 18:20:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9927.6). Total num frames: 7659520. Throughput: 0: 10233.7. Samples: 7634452. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:20:29,012][65744] Avg episode reward: [(0, '2771.579')] +[2023-03-11 18:20:32,988][66031] Updated weights for policy 0, policy_version 15040 (0.0005) +[2023-03-11 18:20:34,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10171.8, 300 sec: 9927.6). Total num frames: 7708672. Throughput: 0: 10203.4. Samples: 7695124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:20:34,012][65744] Avg episode reward: [(0, '2213.124')] +[2023-03-11 18:20:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000015056_7708672.pth... +[2023-03-11 18:20:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000014464_7405568.pth +[2023-03-11 18:20:37,024][66031] Updated weights for policy 0, policy_version 15120 (0.0005) +[2023-03-11 18:20:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9927.6). Total num frames: 7757824. Throughput: 0: 10161.2. Samples: 7755084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:20:39,012][65744] Avg episode reward: [(0, '2341.359')] +[2023-03-11 18:20:41,113][66031] Updated weights for policy 0, policy_version 15200 (0.0004) +[2023-03-11 18:20:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 7806976. Throughput: 0: 10177.0. Samples: 7786008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:20:44,012][65744] Avg episode reward: [(0, '2696.688')] +[2023-03-11 18:20:45,363][66031] Updated weights for policy 0, policy_version 15280 (0.0005) +[2023-03-11 18:20:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 7856128. Throughput: 0: 10098.5. Samples: 7843904. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:20:49,012][65744] Avg episode reward: [(0, '2992.599')] +[2023-03-11 18:20:49,079][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000015352_7860224.pth... +[2023-03-11 18:20:49,080][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000014760_7557120.pth +[2023-03-11 18:20:49,081][65987] Saving new best policy, reward=2992.599! +[2023-03-11 18:20:49,492][66031] Updated weights for policy 0, policy_version 15360 (0.0005) +[2023-03-11 18:20:53,640][66031] Updated weights for policy 0, policy_version 15440 (0.0005) +[2023-03-11 18:20:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 7905280. Throughput: 0: 10040.4. Samples: 7903552. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:20:54,012][65744] Avg episode reward: [(0, '3007.344')] +[2023-03-11 18:20:54,075][65987] Saving new best policy, reward=3007.344! +[2023-03-11 18:20:57,894][66031] Updated weights for policy 0, policy_version 15520 (0.0005) +[2023-03-11 18:20:59,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 7954432. Throughput: 0: 9982.9. Samples: 7932120. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:20:59,012][65744] Avg episode reward: [(0, '2338.979')] +[2023-03-11 18:21:01,961][66031] Updated weights for policy 0, policy_version 15600 (0.0004) +[2023-03-11 18:21:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 8007680. Throughput: 0: 10007.2. Samples: 7991836. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:21:04,012][65744] Avg episode reward: [(0, '2673.985')] +[2023-03-11 18:21:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000015640_8007680.pth... +[2023-03-11 18:21:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000015056_7708672.pth +[2023-03-11 18:21:05,961][66031] Updated weights for policy 0, policy_version 15680 (0.0005) +[2023-03-11 18:21:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 8056832. Throughput: 0: 10016.9. Samples: 8053224. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:21:09,012][65744] Avg episode reward: [(0, '2935.692')] +[2023-03-11 18:21:10,012][66031] Updated weights for policy 0, policy_version 15760 (0.0005) +[2023-03-11 18:21:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 8105984. Throughput: 0: 9979.7. Samples: 8083540. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:21:14,012][65744] Avg episode reward: [(0, '2726.800')] +[2023-03-11 18:21:14,064][66031] Updated weights for policy 0, policy_version 15840 (0.0005) +[2023-03-11 18:21:18,070][66031] Updated weights for policy 0, policy_version 15920 (0.0005) +[2023-03-11 18:21:19,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 8159232. Throughput: 0: 9994.8. Samples: 8144892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:21:19,012][65744] Avg episode reward: [(0, '3029.369')] +[2023-03-11 18:21:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000015936_8159232.pth... +[2023-03-11 18:21:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000015352_7860224.pth +[2023-03-11 18:21:19,018][65987] Saving new best policy, reward=3029.369! +[2023-03-11 18:21:22,304][66031] Updated weights for policy 0, policy_version 16000 (0.0005) +[2023-03-11 18:21:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 8204288. Throughput: 0: 9957.6. Samples: 8203176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:21:24,012][65744] Avg episode reward: [(0, '2797.583')] +[2023-03-11 18:21:26,693][66031] Updated weights for policy 0, policy_version 16080 (0.0005) +[2023-03-11 18:21:29,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 8253440. Throughput: 0: 9881.1. Samples: 8230656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:21:29,012][65744] Avg episode reward: [(0, '2676.984')] +[2023-03-11 18:21:30,974][66031] Updated weights for policy 0, policy_version 16160 (0.0005) +[2023-03-11 18:21:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 8302592. Throughput: 0: 9885.0. Samples: 8288728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:21:34,012][65744] Avg episode reward: [(0, '1944.955')] +[2023-03-11 18:21:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000016216_8302592.pth... +[2023-03-11 18:21:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000015640_8007680.pth +[2023-03-11 18:21:35,135][66031] Updated weights for policy 0, policy_version 16240 (0.0005) +[2023-03-11 18:21:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9858.2). Total num frames: 8351744. Throughput: 0: 9846.8. Samples: 8346660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:21:39,012][65744] Avg episode reward: [(0, '1782.696')] +[2023-03-11 18:21:39,477][66031] Updated weights for policy 0, policy_version 16320 (0.0005) +[2023-03-11 18:21:43,836][66031] Updated weights for policy 0, policy_version 16400 (0.0005) +[2023-03-11 18:21:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9844.3). Total num frames: 8396800. Throughput: 0: 9836.8. Samples: 8374776. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:21:44,012][65744] Avg episode reward: [(0, '2892.028')] +[2023-03-11 18:21:48,268][66031] Updated weights for policy 0, policy_version 16480 (0.0005) +[2023-03-11 18:21:49,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 8441856. Throughput: 0: 9733.8. Samples: 8429856. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:21:49,012][65744] Avg episode reward: [(0, '2743.702')] +[2023-03-11 18:21:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000016488_8441856.pth... +[2023-03-11 18:21:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000015936_8159232.pth +[2023-03-11 18:21:52,689][66031] Updated weights for policy 0, policy_version 16560 (0.0005) +[2023-03-11 18:21:54,017][65744] Fps is (10 sec: 9416.2, 60 sec: 9761.3, 300 sec: 9830.2). Total num frames: 8491008. Throughput: 0: 9622.1. Samples: 8486268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:21:54,017][65744] Avg episode reward: [(0, '3085.085')] +[2023-03-11 18:21:54,018][65987] Saving new best policy, reward=3085.085! +[2023-03-11 18:21:57,050][66031] Updated weights for policy 0, policy_version 16640 (0.0005) +[2023-03-11 18:21:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9830.4). Total num frames: 8536064. Throughput: 0: 9575.9. Samples: 8514456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:21:59,012][65744] Avg episode reward: [(0, '3118.556')] +[2023-03-11 18:21:59,013][65987] Saving new best policy, reward=3118.556! +[2023-03-11 18:22:01,404][66031] Updated weights for policy 0, policy_version 16720 (0.0005) +[2023-03-11 18:22:04,012][65744] Fps is (10 sec: 9425.5, 60 sec: 9625.6, 300 sec: 9830.4). Total num frames: 8585216. Throughput: 0: 9459.5. Samples: 8570568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:22:04,012][65744] Avg episode reward: [(0, '3190.215')] +[2023-03-11 18:22:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000016768_8585216.pth... +[2023-03-11 18:22:04,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000016216_8302592.pth +[2023-03-11 18:22:04,017][65987] Saving new best policy, reward=3190.215! +[2023-03-11 18:22:05,721][66031] Updated weights for policy 0, policy_version 16800 (0.0005) +[2023-03-11 18:22:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9816.5). Total num frames: 8630272. Throughput: 0: 9430.1. Samples: 8627528. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:22:09,012][65744] Avg episode reward: [(0, '2826.215')] +[2023-03-11 18:22:10,056][66031] Updated weights for policy 0, policy_version 16880 (0.0005) +[2023-03-11 18:22:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9830.4). Total num frames: 8679424. Throughput: 0: 9442.6. Samples: 8655572. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:22:14,012][65744] Avg episode reward: [(0, '3298.248')] +[2023-03-11 18:22:14,013][65987] Saving new best policy, reward=3298.248! +[2023-03-11 18:22:14,342][66031] Updated weights for policy 0, policy_version 16960 (0.0005) +[2023-03-11 18:22:18,631][66031] Updated weights for policy 0, policy_version 17040 (0.0005) +[2023-03-11 18:22:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9816.5). Total num frames: 8724480. Throughput: 0: 9424.0. Samples: 8712808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:22:19,012][65744] Avg episode reward: [(0, '3333.981')] +[2023-03-11 18:22:19,074][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000017048_8728576.pth... +[2023-03-11 18:22:19,075][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000016488_8441856.pth +[2023-03-11 18:22:19,075][65987] Saving new best policy, reward=3333.981! +[2023-03-11 18:22:22,840][66031] Updated weights for policy 0, policy_version 17120 (0.0005) +[2023-03-11 18:22:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9816.5). Total num frames: 8773632. Throughput: 0: 9437.1. Samples: 8771328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:22:24,012][65744] Avg episode reward: [(0, '2917.572')] +[2023-03-11 18:22:27,208][66031] Updated weights for policy 0, policy_version 17200 (0.0005) +[2023-03-11 18:22:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9816.5). Total num frames: 8822784. Throughput: 0: 9436.3. Samples: 8799408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:22:29,012][65744] Avg episode reward: [(0, '3125.399')] +[2023-03-11 18:22:31,281][66031] Updated weights for policy 0, policy_version 17280 (0.0004) +[2023-03-11 18:22:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9830.4). Total num frames: 8871936. Throughput: 0: 9548.3. Samples: 8859528. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:22:34,012][65744] Avg episode reward: [(0, '2928.492')] +[2023-03-11 18:22:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000017328_8871936.pth... +[2023-03-11 18:22:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000016768_8585216.pth +[2023-03-11 18:22:35,267][66031] Updated weights for policy 0, policy_version 17360 (0.0004) +[2023-03-11 18:22:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9557.3, 300 sec: 9844.3). Total num frames: 8925184. Throughput: 0: 9665.2. Samples: 8921152. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:22:39,012][65744] Avg episode reward: [(0, '3148.833')] +[2023-03-11 18:22:39,220][66031] Updated weights for policy 0, policy_version 17440 (0.0004) +[2023-03-11 18:22:43,300][66031] Updated weights for policy 0, policy_version 17520 (0.0005) +[2023-03-11 18:22:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9625.6, 300 sec: 9844.3). Total num frames: 8974336. Throughput: 0: 9716.9. Samples: 8951716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:22:44,012][65744] Avg episode reward: [(0, '2706.870')] +[2023-03-11 18:22:47,266][66031] Updated weights for policy 0, policy_version 17600 (0.0004) +[2023-03-11 18:22:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9858.2). Total num frames: 9027584. Throughput: 0: 9829.1. Samples: 9012880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:22:49,012][65744] Avg episode reward: [(0, '2469.917')] +[2023-03-11 18:22:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000017632_9027584.pth... +[2023-03-11 18:22:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000017048_8728576.pth +[2023-03-11 18:22:51,265][66031] Updated weights for policy 0, policy_version 17680 (0.0005) +[2023-03-11 18:22:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.9, 300 sec: 9858.2). Total num frames: 9076736. Throughput: 0: 9932.8. Samples: 9074504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:22:54,012][65744] Avg episode reward: [(0, '2636.260')] +[2023-03-11 18:22:55,223][66031] Updated weights for policy 0, policy_version 17760 (0.0005) +[2023-03-11 18:22:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 9129984. Throughput: 0: 9998.0. Samples: 9105480. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:22:59,012][65744] Avg episode reward: [(0, '2148.046')] +[2023-03-11 18:22:59,172][66031] Updated weights for policy 0, policy_version 17840 (0.0005) +[2023-03-11 18:23:03,336][66031] Updated weights for policy 0, policy_version 17920 (0.0005) +[2023-03-11 18:23:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 9179136. Throughput: 0: 10088.2. Samples: 9166780. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:23:04,012][65744] Avg episode reward: [(0, '2637.256')] +[2023-03-11 18:23:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000017928_9179136.pth... +[2023-03-11 18:23:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000017328_8871936.pth +[2023-03-11 18:23:07,593][66031] Updated weights for policy 0, policy_version 18000 (0.0005) +[2023-03-11 18:23:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 9228288. Throughput: 0: 10063.8. Samples: 9224200. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:23:09,012][65744] Avg episode reward: [(0, '2979.101')] +[2023-03-11 18:23:11,845][66031] Updated weights for policy 0, policy_version 18080 (0.0005) +[2023-03-11 18:23:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 9277440. Throughput: 0: 10078.2. Samples: 9252928. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:23:14,012][65744] Avg episode reward: [(0, '3091.383')] +[2023-03-11 18:23:15,981][66031] Updated weights for policy 0, policy_version 18160 (0.0005) +[2023-03-11 18:23:19,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9885.9). Total num frames: 9326592. Throughput: 0: 10069.6. Samples: 9312660. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:23:19,012][65744] Avg episode reward: [(0, '3013.334')] +[2023-03-11 18:23:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000018216_9326592.pth... +[2023-03-11 18:23:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000017632_9027584.pth +[2023-03-11 18:23:20,047][66031] Updated weights for policy 0, policy_version 18240 (0.0005) +[2023-03-11 18:23:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9885.9). Total num frames: 9375744. Throughput: 0: 10025.4. Samples: 9372296. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:23:24,012][65744] Avg episode reward: [(0, '2890.470')] +[2023-03-11 18:23:24,232][66031] Updated weights for policy 0, policy_version 18320 (0.0004) +[2023-03-11 18:23:28,440][66031] Updated weights for policy 0, policy_version 18400 (0.0005) +[2023-03-11 18:23:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 9424896. Throughput: 0: 9989.9. Samples: 9401260. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:23:29,012][65744] Avg episode reward: [(0, '2724.768')] +[2023-03-11 18:23:32,696][66031] Updated weights for policy 0, policy_version 18480 (0.0005) +[2023-03-11 18:23:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 9474048. Throughput: 0: 9921.7. Samples: 9459356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:23:34,012][65744] Avg episode reward: [(0, '2715.897')] +[2023-03-11 18:23:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000018504_9474048.pth... +[2023-03-11 18:23:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000017928_9179136.pth +[2023-03-11 18:23:36,973][66031] Updated weights for policy 0, policy_version 18560 (0.0005) +[2023-03-11 18:23:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 9519104. Throughput: 0: 9837.3. Samples: 9517184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:23:39,023][65744] Avg episode reward: [(0, '2844.116')] +[2023-03-11 18:23:41,219][66031] Updated weights for policy 0, policy_version 18640 (0.0005) +[2023-03-11 18:23:44,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 9568256. Throughput: 0: 9793.2. Samples: 9546176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:23:44,012][65744] Avg episode reward: [(0, '2484.295')] +[2023-03-11 18:23:45,345][66031] Updated weights for policy 0, policy_version 18720 (0.0005) +[2023-03-11 18:23:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9885.9). Total num frames: 9617408. Throughput: 0: 9741.2. Samples: 9605132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:23:49,012][65744] Avg episode reward: [(0, '3043.249')] +[2023-03-11 18:23:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000018784_9617408.pth... +[2023-03-11 18:23:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000018216_9326592.pth +[2023-03-11 18:23:49,599][66031] Updated weights for policy 0, policy_version 18800 (0.0005) +[2023-03-11 18:23:53,893][66031] Updated weights for policy 0, policy_version 18880 (0.0005) +[2023-03-11 18:23:54,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9885.9). Total num frames: 9666560. Throughput: 0: 9739.2. Samples: 9662464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:23:54,023][65744] Avg episode reward: [(0, '2778.584')] +[2023-03-11 18:23:58,213][66031] Updated weights for policy 0, policy_version 18960 (0.0005) +[2023-03-11 18:23:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9872.1). Total num frames: 9711616. Throughput: 0: 9738.2. Samples: 9691148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:23:59,012][65744] Avg episode reward: [(0, '2695.477')] +[2023-03-11 18:24:02,492][66031] Updated weights for policy 0, policy_version 19040 (0.0005) +[2023-03-11 18:24:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9885.9). Total num frames: 9760768. Throughput: 0: 9683.6. Samples: 9748424. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:24:04,012][65744] Avg episode reward: [(0, '2662.815')] +[2023-03-11 18:24:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000019064_9760768.pth... +[2023-03-11 18:24:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000018504_9474048.pth +[2023-03-11 18:24:06,777][66031] Updated weights for policy 0, policy_version 19120 (0.0005) +[2023-03-11 18:24:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9872.1). Total num frames: 9809920. Throughput: 0: 9634.2. Samples: 9805836. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:24:09,012][65744] Avg episode reward: [(0, '2740.560')] +[2023-03-11 18:24:11,070][66031] Updated weights for policy 0, policy_version 19200 (0.0005) +[2023-03-11 18:24:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9858.2). Total num frames: 9854976. Throughput: 0: 9627.5. Samples: 9834496. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:24:14,012][65744] Avg episode reward: [(0, '2474.929')] +[2023-03-11 18:24:15,396][66031] Updated weights for policy 0, policy_version 19280 (0.0005) +[2023-03-11 18:24:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9844.3). Total num frames: 9904128. Throughput: 0: 9610.8. Samples: 9891840. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:24:19,012][65744] Avg episode reward: [(0, '2724.779')] +[2023-03-11 18:24:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000019344_9904128.pth... +[2023-03-11 18:24:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000018784_9617408.pth +[2023-03-11 18:24:19,588][66031] Updated weights for policy 0, policy_version 19360 (0.0005) +[2023-03-11 18:24:23,869][66031] Updated weights for policy 0, policy_version 19440 (0.0005) +[2023-03-11 18:24:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9844.3). Total num frames: 9953280. Throughput: 0: 9601.4. Samples: 9949248. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:24:24,012][65744] Avg episode reward: [(0, '2853.124')] +[2023-03-11 18:24:28,159][66031] Updated weights for policy 0, policy_version 19520 (0.0005) +[2023-03-11 18:24:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9844.3). Total num frames: 10002432. Throughput: 0: 9594.3. Samples: 9977920. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:24:29,012][65744] Avg episode reward: [(0, '2771.816')] +[2023-03-11 18:24:32,317][66031] Updated weights for policy 0, policy_version 19600 (0.0005) +[2023-03-11 18:24:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9844.3). Total num frames: 10051584. Throughput: 0: 9580.2. Samples: 10036240. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:24:34,012][65744] Avg episode reward: [(0, '2623.424')] +[2023-03-11 18:24:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000019632_10051584.pth... +[2023-03-11 18:24:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000019064_9760768.pth +[2023-03-11 18:24:36,535][66031] Updated weights for policy 0, policy_version 19680 (0.0005) +[2023-03-11 18:24:39,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9816.5). Total num frames: 10096640. Throughput: 0: 9608.0. Samples: 10094824. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:24:39,012][65744] Avg episode reward: [(0, '2744.196')] +[2023-03-11 18:24:40,732][66031] Updated weights for policy 0, policy_version 19760 (0.0005) +[2023-03-11 18:24:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9816.5). Total num frames: 10145792. Throughput: 0: 9621.2. Samples: 10124100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:24:44,012][65744] Avg episode reward: [(0, '2908.614')] +[2023-03-11 18:24:44,985][66031] Updated weights for policy 0, policy_version 19840 (0.0005) +[2023-03-11 18:24:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9802.6). Total num frames: 10194944. Throughput: 0: 9632.1. Samples: 10181868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:24:49,012][65744] Avg episode reward: [(0, '2863.095')] +[2023-03-11 18:24:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000019912_10194944.pth... +[2023-03-11 18:24:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000019344_9904128.pth +[2023-03-11 18:24:49,260][66031] Updated weights for policy 0, policy_version 19920 (0.0005) +[2023-03-11 18:24:53,511][66031] Updated weights for policy 0, policy_version 20000 (0.0005) +[2023-03-11 18:24:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9802.6). Total num frames: 10244096. Throughput: 0: 9646.1. Samples: 10239912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:24:54,012][65744] Avg episode reward: [(0, '2623.588')] +[2023-03-11 18:24:57,796][66031] Updated weights for policy 0, policy_version 20080 (0.0005) +[2023-03-11 18:24:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 10289152. Throughput: 0: 9648.7. Samples: 10268688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:24:59,012][65744] Avg episode reward: [(0, '3216.453')] +[2023-03-11 18:25:01,894][66031] Updated weights for policy 0, policy_version 20160 (0.0004) +[2023-03-11 18:25:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 10342400. Throughput: 0: 9680.6. Samples: 10327468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:25:04,012][65744] Avg episode reward: [(0, '2780.926')] +[2023-03-11 18:25:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000020200_10342400.pth... +[2023-03-11 18:25:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000019632_10051584.pth +[2023-03-11 18:25:05,868][66031] Updated weights for policy 0, policy_version 20240 (0.0004) +[2023-03-11 18:25:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 10391552. Throughput: 0: 9778.9. Samples: 10389300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:25:09,012][65744] Avg episode reward: [(0, '2695.127')] +[2023-03-11 18:25:09,955][66031] Updated weights for policy 0, policy_version 20320 (0.0004) +[2023-03-11 18:25:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 10440704. Throughput: 0: 9811.1. Samples: 10419420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:25:14,012][65744] Avg episode reward: [(0, '2671.174')] +[2023-03-11 18:25:14,053][66031] Updated weights for policy 0, policy_version 20400 (0.0005) +[2023-03-11 18:25:18,262][66031] Updated weights for policy 0, policy_version 20480 (0.0004) +[2023-03-11 18:25:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 10489856. Throughput: 0: 9808.7. Samples: 10477632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:25:19,012][65744] Avg episode reward: [(0, '2945.593')] +[2023-03-11 18:25:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000020488_10489856.pth... +[2023-03-11 18:25:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000019912_10194944.pth +[2023-03-11 18:25:22,508][66031] Updated weights for policy 0, policy_version 20560 (0.0005) +[2023-03-11 18:25:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9761.0). Total num frames: 10539008. Throughput: 0: 9792.8. Samples: 10535500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:25:24,012][65744] Avg episode reward: [(0, '3008.711')] +[2023-03-11 18:25:26,719][66031] Updated weights for policy 0, policy_version 20640 (0.0004) +[2023-03-11 18:25:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9761.0). Total num frames: 10588160. Throughput: 0: 9798.0. Samples: 10565012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:25:29,012][65744] Avg episode reward: [(0, '2886.859')] +[2023-03-11 18:25:30,923][66031] Updated weights for policy 0, policy_version 20720 (0.0005) +[2023-03-11 18:25:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9761.0). Total num frames: 10637312. Throughput: 0: 9825.7. Samples: 10624024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:25:34,012][65744] Avg episode reward: [(0, '3082.005')] +[2023-03-11 18:25:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000020776_10637312.pth... +[2023-03-11 18:25:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000020200_10342400.pth +[2023-03-11 18:25:35,127][66031] Updated weights for policy 0, policy_version 20800 (0.0005) +[2023-03-11 18:25:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9761.0). Total num frames: 10686464. Throughput: 0: 9854.8. Samples: 10683380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:25:39,012][65744] Avg episode reward: [(0, '2866.750')] +[2023-03-11 18:25:39,104][66031] Updated weights for policy 0, policy_version 20880 (0.0004) +[2023-03-11 18:25:43,092][66031] Updated weights for policy 0, policy_version 20960 (0.0005) +[2023-03-11 18:25:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9774.9). Total num frames: 10739712. Throughput: 0: 9921.0. Samples: 10715132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:25:44,012][65744] Avg episode reward: [(0, '2930.029')] +[2023-03-11 18:25:46,997][66031] Updated weights for policy 0, policy_version 21040 (0.0005) +[2023-03-11 18:25:49,012][65744] Fps is (10 sec: 10649.6, 60 sec: 9966.9, 300 sec: 9788.7). Total num frames: 10792960. Throughput: 0: 9983.6. Samples: 10776732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:25:49,012][65744] Avg episode reward: [(0, '2856.127')] +[2023-03-11 18:25:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000021080_10792960.pth... +[2023-03-11 18:25:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000020488_10489856.pth +[2023-03-11 18:25:51,081][66031] Updated weights for policy 0, policy_version 21120 (0.0005) +[2023-03-11 18:25:54,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9898.6, 300 sec: 9774.9). Total num frames: 10838016. Throughput: 0: 9925.4. Samples: 10835944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:25:54,012][65744] Avg episode reward: [(0, '2869.094')] +[2023-03-11 18:25:55,324][66031] Updated weights for policy 0, policy_version 21200 (0.0005) +[2023-03-11 18:25:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9761.0). Total num frames: 10887168. Throughput: 0: 9910.4. Samples: 10865388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:25:59,012][65744] Avg episode reward: [(0, '2579.283')] +[2023-03-11 18:25:59,645][66031] Updated weights for policy 0, policy_version 21280 (0.0005) +[2023-03-11 18:26:03,912][66031] Updated weights for policy 0, policy_version 21360 (0.0005) +[2023-03-11 18:26:04,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9761.0). Total num frames: 10936320. Throughput: 0: 9890.1. Samples: 10922688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:26:04,012][65744] Avg episode reward: [(0, '2917.317')] +[2023-03-11 18:26:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000021360_10936320.pth... +[2023-03-11 18:26:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000020776_10637312.pth +[2023-03-11 18:26:08,149][66031] Updated weights for policy 0, policy_version 21440 (0.0005) +[2023-03-11 18:26:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9761.0). Total num frames: 10985472. Throughput: 0: 9887.9. Samples: 10980456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:26:09,012][65744] Avg episode reward: [(0, '2527.066')] +[2023-03-11 18:26:12,411][66031] Updated weights for policy 0, policy_version 21520 (0.0005) +[2023-03-11 18:26:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9733.2). Total num frames: 11030528. Throughput: 0: 9868.5. Samples: 11009096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:26:14,012][65744] Avg episode reward: [(0, '1995.674')] +[2023-03-11 18:26:16,700][66031] Updated weights for policy 0, policy_version 21600 (0.0005) +[2023-03-11 18:26:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9747.1). Total num frames: 11079680. Throughput: 0: 9841.3. Samples: 11066884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:26:19,012][65744] Avg episode reward: [(0, '2703.350')] +[2023-03-11 18:26:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000021640_11079680.pth... +[2023-03-11 18:26:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000021080_10792960.pth +[2023-03-11 18:26:20,981][66031] Updated weights for policy 0, policy_version 21680 (0.0005) +[2023-03-11 18:26:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9747.1). Total num frames: 11128832. Throughput: 0: 9808.1. Samples: 11124744. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:26:24,012][65744] Avg episode reward: [(0, '2815.889')] +[2023-03-11 18:26:25,116][66031] Updated weights for policy 0, policy_version 21760 (0.0005) +[2023-03-11 18:26:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9747.1). Total num frames: 11177984. Throughput: 0: 9753.2. Samples: 11154028. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:26:29,012][65744] Avg episode reward: [(0, '2179.588')] +[2023-03-11 18:26:29,384][66031] Updated weights for policy 0, policy_version 21840 (0.0005) +[2023-03-11 18:26:33,589][66031] Updated weights for policy 0, policy_version 21920 (0.0005) +[2023-03-11 18:26:34,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9733.2). Total num frames: 11223040. Throughput: 0: 9662.4. Samples: 11211540. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:26:34,012][65744] Avg episode reward: [(0, '2652.784')] +[2023-03-11 18:26:34,027][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000021928_11227136.pth... +[2023-03-11 18:26:34,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000021360_10936320.pth +[2023-03-11 18:26:37,897][66031] Updated weights for policy 0, policy_version 22000 (0.0005) +[2023-03-11 18:26:39,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 9747.1). Total num frames: 11272192. Throughput: 0: 9626.2. Samples: 11269120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:26:39,012][65744] Avg episode reward: [(0, '2854.134')] +[2023-03-11 18:26:42,200][66031] Updated weights for policy 0, policy_version 22080 (0.0005) +[2023-03-11 18:26:44,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9761.0). Total num frames: 11321344. Throughput: 0: 9614.0. Samples: 11298016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:26:44,012][65744] Avg episode reward: [(0, '2852.262')] +[2023-03-11 18:26:46,535][66031] Updated weights for policy 0, policy_version 22160 (0.0005) +[2023-03-11 18:26:49,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9747.3). Total num frames: 11366400. Throughput: 0: 9600.1. Samples: 11354692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:26:49,012][65744] Avg episode reward: [(0, '2515.036')] +[2023-03-11 18:26:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000022200_11366400.pth... +[2023-03-11 18:26:49,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000021640_11079680.pth +[2023-03-11 18:26:50,824][66031] Updated weights for policy 0, policy_version 22240 (0.0005) +[2023-03-11 18:26:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 11415552. Throughput: 0: 9584.3. Samples: 11411748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:26:54,012][65744] Avg episode reward: [(0, '2474.999')] +[2023-03-11 18:26:55,042][66031] Updated weights for policy 0, policy_version 22320 (0.0005) +[2023-03-11 18:26:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 11464704. Throughput: 0: 9614.6. Samples: 11441752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:26:59,012][65744] Avg episode reward: [(0, '2775.944')] +[2023-03-11 18:26:59,281][66031] Updated weights for policy 0, policy_version 22400 (0.0005) +[2023-03-11 18:27:03,550][66031] Updated weights for policy 0, policy_version 22480 (0.0005) +[2023-03-11 18:27:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 11513856. Throughput: 0: 9611.1. Samples: 11499384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:27:04,012][65744] Avg episode reward: [(0, '2788.237')] +[2023-03-11 18:27:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000022488_11513856.pth... +[2023-03-11 18:27:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000021928_11227136.pth +[2023-03-11 18:27:07,859][66031] Updated weights for policy 0, policy_version 22560 (0.0005) +[2023-03-11 18:27:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9761.0). Total num frames: 11558912. Throughput: 0: 9589.7. Samples: 11556280. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:27:09,012][65744] Avg episode reward: [(0, '2802.755')] +[2023-03-11 18:27:12,169][66031] Updated weights for policy 0, policy_version 22640 (0.0005) +[2023-03-11 18:27:14,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 11608064. Throughput: 0: 9578.1. Samples: 11585044. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:27:14,012][65744] Avg episode reward: [(0, '2887.995')] +[2023-03-11 18:27:16,323][66031] Updated weights for policy 0, policy_version 22720 (0.0005) +[2023-03-11 18:27:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 11657216. Throughput: 0: 9616.9. Samples: 11644300. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:27:19,012][65744] Avg episode reward: [(0, '2613.110')] +[2023-03-11 18:27:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000022768_11657216.pth... +[2023-03-11 18:27:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000022200_11366400.pth +[2023-03-11 18:27:20,384][66031] Updated weights for policy 0, policy_version 22800 (0.0005) +[2023-03-11 18:27:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 11710464. Throughput: 0: 9681.9. Samples: 11704808. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:27:24,012][65744] Avg episode reward: [(0, '2559.005')] +[2023-03-11 18:27:24,427][66031] Updated weights for policy 0, policy_version 22880 (0.0004) +[2023-03-11 18:27:28,573][66031] Updated weights for policy 0, policy_version 22960 (0.0005) +[2023-03-11 18:27:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 11759616. Throughput: 0: 9711.1. Samples: 11735016. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:27:29,012][65744] Avg episode reward: [(0, '2441.530')] +[2023-03-11 18:27:32,701][66031] Updated weights for policy 0, policy_version 23040 (0.0005) +[2023-03-11 18:27:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 11808768. Throughput: 0: 9757.9. Samples: 11793796. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:27:34,012][65744] Avg episode reward: [(0, '2551.323')] +[2023-03-11 18:27:34,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000023064_11808768.pth... +[2023-03-11 18:27:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000022488_11513856.pth +[2023-03-11 18:27:36,846][66031] Updated weights for policy 0, policy_version 23120 (0.0005) +[2023-03-11 18:27:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 11857920. Throughput: 0: 9823.9. Samples: 11853824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:27:39,012][65744] Avg episode reward: [(0, '2537.193')] +[2023-03-11 18:27:40,939][66031] Updated weights for policy 0, policy_version 23200 (0.0004) +[2023-03-11 18:27:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9761.0). Total num frames: 11907072. Throughput: 0: 9814.5. Samples: 11883404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:27:44,012][65744] Avg episode reward: [(0, '1991.978')] +[2023-03-11 18:27:45,040][66031] Updated weights for policy 0, policy_version 23280 (0.0004) +[2023-03-11 18:27:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9761.0). Total num frames: 11956224. Throughput: 0: 9869.2. Samples: 11943496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:27:49,012][65744] Avg episode reward: [(0, '2654.000')] +[2023-03-11 18:27:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000023352_11956224.pth... +[2023-03-11 18:27:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000022768_11657216.pth +[2023-03-11 18:27:49,249][66031] Updated weights for policy 0, policy_version 23360 (0.0004) +[2023-03-11 18:27:53,608][66031] Updated weights for policy 0, policy_version 23440 (0.0005) +[2023-03-11 18:27:54,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9747.1). Total num frames: 12005376. Throughput: 0: 9860.0. Samples: 11999980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:27:54,012][65744] Avg episode reward: [(0, '2607.814')] +[2023-03-11 18:27:57,746][66031] Updated weights for policy 0, policy_version 23520 (0.0005) +[2023-03-11 18:27:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9747.1). Total num frames: 12054528. Throughput: 0: 9873.7. Samples: 12029360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:27:59,012][65744] Avg episode reward: [(0, '2886.343')] +[2023-03-11 18:28:01,837][66031] Updated weights for policy 0, policy_version 23600 (0.0004) +[2023-03-11 18:28:04,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9733.2). Total num frames: 12099584. Throughput: 0: 9876.3. Samples: 12088732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:28:04,012][65744] Avg episode reward: [(0, '2659.082')] +[2023-03-11 18:28:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000023640_12103680.pth... +[2023-03-11 18:28:04,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000023064_11808768.pth +[2023-03-11 18:28:06,165][66031] Updated weights for policy 0, policy_version 23680 (0.0005) +[2023-03-11 18:28:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9733.2). Total num frames: 12148736. Throughput: 0: 9808.0. Samples: 12146168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:28:09,012][65744] Avg episode reward: [(0, '2978.257')] +[2023-03-11 18:28:10,364][66031] Updated weights for policy 0, policy_version 23760 (0.0005) +[2023-03-11 18:28:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9733.2). Total num frames: 12197888. Throughput: 0: 9793.6. Samples: 12175728. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:28:14,012][65744] Avg episode reward: [(0, '3091.121')] +[2023-03-11 18:28:14,634][66031] Updated weights for policy 0, policy_version 23840 (0.0005) +[2023-03-11 18:28:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9719.3). Total num frames: 12242944. Throughput: 0: 9729.5. Samples: 12231624. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:28:19,012][65744] Avg episode reward: [(0, '2913.079')] +[2023-03-11 18:28:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000023912_12242944.pth... +[2023-03-11 18:28:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000023352_11956224.pth +[2023-03-11 18:28:19,108][66031] Updated weights for policy 0, policy_version 23920 (0.0005) +[2023-03-11 18:28:23,550][66031] Updated weights for policy 0, policy_version 24000 (0.0005) +[2023-03-11 18:28:24,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9719.3). Total num frames: 12292096. Throughput: 0: 9632.4. Samples: 12287284. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:28:24,012][65744] Avg episode reward: [(0, '2178.108')] +[2023-03-11 18:28:28,188][66031] Updated weights for policy 0, policy_version 24080 (0.0005) +[2023-03-11 18:28:29,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9691.6). Total num frames: 12333056. Throughput: 0: 9553.4. Samples: 12313308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:28:29,012][65744] Avg episode reward: [(0, '2524.440')] +[2023-03-11 18:28:32,635][66031] Updated weights for policy 0, policy_version 24160 (0.0005) +[2023-03-11 18:28:34,012][65744] Fps is (10 sec: 9011.1, 60 sec: 9557.3, 300 sec: 9705.4). Total num frames: 12382208. Throughput: 0: 9442.0. Samples: 12368388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:28:34,012][65744] Avg episode reward: [(0, '2985.074')] +[2023-03-11 18:28:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000024184_12382208.pth... +[2023-03-11 18:28:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000023640_12103680.pth +[2023-03-11 18:28:36,934][66031] Updated weights for policy 0, policy_version 24240 (0.0005) +[2023-03-11 18:28:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9691.6). Total num frames: 12427264. Throughput: 0: 9432.9. Samples: 12424460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:28:39,012][65744] Avg episode reward: [(0, '2975.709')] +[2023-03-11 18:28:41,548][66031] Updated weights for policy 0, policy_version 24320 (0.0005) +[2023-03-11 18:28:44,012][65744] Fps is (10 sec: 9011.3, 60 sec: 9420.8, 300 sec: 9677.7). Total num frames: 12472320. Throughput: 0: 9379.8. Samples: 12451452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:28:44,012][65744] Avg episode reward: [(0, '3221.538')] +[2023-03-11 18:28:46,079][66031] Updated weights for policy 0, policy_version 24400 (0.0005) +[2023-03-11 18:28:49,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9663.8). Total num frames: 12517376. Throughput: 0: 9252.6. Samples: 12505100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:28:49,012][65744] Avg episode reward: [(0, '3294.503')] +[2023-03-11 18:28:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000024448_12517376.pth... +[2023-03-11 18:28:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000023912_12242944.pth +[2023-03-11 18:28:50,699][66031] Updated weights for policy 0, policy_version 24480 (0.0005) +[2023-03-11 18:28:54,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9663.8). Total num frames: 12562432. Throughput: 0: 9162.6. Samples: 12558484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:28:54,012][65744] Avg episode reward: [(0, '3811.058')] +[2023-03-11 18:28:54,013][65987] Saving new best policy, reward=3811.058! +[2023-03-11 18:28:55,226][66031] Updated weights for policy 0, policy_version 24560 (0.0005) +[2023-03-11 18:28:59,012][65744] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9649.9). Total num frames: 12607488. Throughput: 0: 9112.0. Samples: 12585768. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:28:59,012][65744] Avg episode reward: [(0, '3841.193')] +[2023-03-11 18:28:59,013][65987] Saving new best policy, reward=3841.193! +[2023-03-11 18:28:59,896][66031] Updated weights for policy 0, policy_version 24640 (0.0005) +[2023-03-11 18:29:04,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9636.0). Total num frames: 12652544. Throughput: 0: 9063.9. Samples: 12639500. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:29:04,012][65744] Avg episode reward: [(0, '4022.760')] +[2023-03-11 18:29:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000024712_12652544.pth... +[2023-03-11 18:29:04,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000024184_12382208.pth +[2023-03-11 18:29:04,017][65987] Saving new best policy, reward=4022.760! +[2023-03-11 18:29:04,381][66031] Updated weights for policy 0, policy_version 24720 (0.0005) +[2023-03-11 18:29:08,935][66031] Updated weights for policy 0, policy_version 24800 (0.0005) +[2023-03-11 18:29:09,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9636.0). Total num frames: 12697600. Throughput: 0: 9027.4. Samples: 12693516. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:29:09,023][65744] Avg episode reward: [(0, '4131.490')] +[2023-03-11 18:29:09,023][65987] Saving new best policy, reward=4131.490! +[2023-03-11 18:29:13,498][66031] Updated weights for policy 0, policy_version 24880 (0.0005) +[2023-03-11 18:29:14,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9622.1). Total num frames: 12742656. Throughput: 0: 9040.4. Samples: 12720124. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:29:14,012][65744] Avg episode reward: [(0, '3894.100')] +[2023-03-11 18:29:18,189][66031] Updated weights for policy 0, policy_version 24960 (0.0005) +[2023-03-11 18:29:19,012][65744] Fps is (10 sec: 8601.6, 60 sec: 9011.2, 300 sec: 9594.4). Total num frames: 12783616. Throughput: 0: 9000.1. Samples: 12773392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:29:19,012][65744] Avg episode reward: [(0, '3853.001')] +[2023-03-11 18:29:19,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000024968_12783616.pth... +[2023-03-11 18:29:19,025][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000024448_12517376.pth +[2023-03-11 18:29:22,759][66031] Updated weights for policy 0, policy_version 25040 (0.0005) +[2023-03-11 18:29:24,012][65744] Fps is (10 sec: 8601.6, 60 sec: 8942.9, 300 sec: 9580.5). Total num frames: 12828672. Throughput: 0: 8937.3. Samples: 12826636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:29:24,012][65744] Avg episode reward: [(0, '3795.884')] +[2023-03-11 18:29:27,404][66031] Updated weights for policy 0, policy_version 25120 (0.0005) +[2023-03-11 18:29:29,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9566.6). Total num frames: 12873728. Throughput: 0: 8928.0. Samples: 12853212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:29:29,012][65744] Avg episode reward: [(0, '2499.930')] +[2023-03-11 18:29:31,888][66031] Updated weights for policy 0, policy_version 25200 (0.0005) +[2023-03-11 18:29:34,012][65744] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9566.6). Total num frames: 12918784. Throughput: 0: 8943.8. Samples: 12907572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:29:34,012][65744] Avg episode reward: [(0, '3466.056')] +[2023-03-11 18:29:34,064][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000025240_12922880.pth... +[2023-03-11 18:29:34,065][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000024712_12652544.pth +[2023-03-11 18:29:36,283][66031] Updated weights for policy 0, policy_version 25280 (0.0005) +[2023-03-11 18:29:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9011.2, 300 sec: 9566.6). Total num frames: 12967936. Throughput: 0: 9008.1. Samples: 12963848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:29:39,012][65744] Avg episode reward: [(0, '3445.262')] +[2023-03-11 18:29:40,666][66031] Updated weights for policy 0, policy_version 25360 (0.0005) +[2023-03-11 18:29:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9011.2, 300 sec: 9552.7). Total num frames: 13012992. Throughput: 0: 9033.9. Samples: 12992292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:29:44,012][65744] Avg episode reward: [(0, '3224.085')] +[2023-03-11 18:29:44,949][66031] Updated weights for policy 0, policy_version 25440 (0.0005) +[2023-03-11 18:29:49,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9079.5, 300 sec: 9552.7). Total num frames: 13062144. Throughput: 0: 9109.2. Samples: 13049416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:29:49,023][65744] Avg episode reward: [(0, '3533.241')] +[2023-03-11 18:29:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000025512_13062144.pth... +[2023-03-11 18:29:49,027][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000024968_12783616.pth +[2023-03-11 18:29:49,152][66031] Updated weights for policy 0, policy_version 25520 (0.0005) +[2023-03-11 18:29:53,391][66031] Updated weights for policy 0, policy_version 25600 (0.0004) +[2023-03-11 18:29:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9147.7, 300 sec: 9566.6). Total num frames: 13111296. Throughput: 0: 9199.5. Samples: 13107492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:29:54,012][65744] Avg episode reward: [(0, '4212.187')] +[2023-03-11 18:29:54,023][65987] Saving new best policy, reward=4212.187! +[2023-03-11 18:29:57,610][66031] Updated weights for policy 0, policy_version 25680 (0.0005) +[2023-03-11 18:29:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9216.0, 300 sec: 9552.7). Total num frames: 13160448. Throughput: 0: 9253.2. Samples: 13136520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:29:59,012][65744] Avg episode reward: [(0, '3800.591')] +[2023-03-11 18:30:02,024][66031] Updated weights for policy 0, policy_version 25760 (0.0005) +[2023-03-11 18:30:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9538.8). Total num frames: 13205504. Throughput: 0: 9330.8. Samples: 13193280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:30:04,012][65744] Avg episode reward: [(0, '4210.410')] +[2023-03-11 18:30:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000025792_13205504.pth... +[2023-03-11 18:30:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000025240_12922880.pth +[2023-03-11 18:30:06,432][66031] Updated weights for policy 0, policy_version 25840 (0.0005) +[2023-03-11 18:30:09,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9524.9). Total num frames: 13250560. Throughput: 0: 9382.5. Samples: 13248848. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:30:09,012][65744] Avg episode reward: [(0, '4435.891')] +[2023-03-11 18:30:09,012][65987] Saving new best policy, reward=4435.891! +[2023-03-11 18:30:10,919][66031] Updated weights for policy 0, policy_version 25920 (0.0005) +[2023-03-11 18:30:14,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9511.1). Total num frames: 13295616. Throughput: 0: 9388.9. Samples: 13275712. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:30:14,012][65744] Avg episode reward: [(0, '4389.661')] +[2023-03-11 18:30:15,404][66031] Updated weights for policy 0, policy_version 26000 (0.0005) +[2023-03-11 18:30:19,012][65744] Fps is (10 sec: 9011.1, 60 sec: 9284.2, 300 sec: 9497.2). Total num frames: 13340672. Throughput: 0: 9383.6. Samples: 13329836. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:30:19,012][65744] Avg episode reward: [(0, '4120.568')] +[2023-03-11 18:30:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000026056_13340672.pth... +[2023-03-11 18:30:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000025512_13062144.pth +[2023-03-11 18:30:20,052][66031] Updated weights for policy 0, policy_version 26080 (0.0005) +[2023-03-11 18:30:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9497.2). Total num frames: 13389824. Throughput: 0: 9371.7. Samples: 13385576. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:30:24,012][65744] Avg episode reward: [(0, '4088.863')] +[2023-03-11 18:30:24,352][66031] Updated weights for policy 0, policy_version 26160 (0.0004) +[2023-03-11 18:30:28,710][66031] Updated weights for policy 0, policy_version 26240 (0.0005) +[2023-03-11 18:30:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9483.3). Total num frames: 13434880. Throughput: 0: 9371.7. Samples: 13414020. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:30:29,012][65744] Avg episode reward: [(0, '3828.129')] +[2023-03-11 18:30:33,044][66031] Updated weights for policy 0, policy_version 26320 (0.0005) +[2023-03-11 18:30:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 13484032. Throughput: 0: 9357.9. Samples: 13470520. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:30:34,012][65744] Avg episode reward: [(0, '4107.326')] +[2023-03-11 18:30:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000026336_13484032.pth... +[2023-03-11 18:30:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000025792_13205504.pth +[2023-03-11 18:30:37,334][66031] Updated weights for policy 0, policy_version 26400 (0.0004) +[2023-03-11 18:30:39,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9455.5). Total num frames: 13529088. Throughput: 0: 9330.4. Samples: 13527360. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:30:39,012][65744] Avg episode reward: [(0, '4156.180')] +[2023-03-11 18:30:41,552][66031] Updated weights for policy 0, policy_version 26480 (0.0003) +[2023-03-11 18:30:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9441.6). Total num frames: 13578240. Throughput: 0: 9337.7. Samples: 13556716. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:30:44,012][65744] Avg episode reward: [(0, '4354.637')] +[2023-03-11 18:30:45,835][66031] Updated weights for policy 0, policy_version 26560 (0.0005) +[2023-03-11 18:30:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 13627392. Throughput: 0: 9360.9. Samples: 13614520. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:30:49,012][65744] Avg episode reward: [(0, '3562.208')] +[2023-03-11 18:30:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000026616_13627392.pth... +[2023-03-11 18:30:49,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000026056_13340672.pth +[2023-03-11 18:30:50,159][66031] Updated weights for policy 0, policy_version 26640 (0.0005) +[2023-03-11 18:30:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 13676544. Throughput: 0: 9402.8. Samples: 13671976. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:30:54,012][65744] Avg episode reward: [(0, '3918.922')] +[2023-03-11 18:30:54,360][66031] Updated weights for policy 0, policy_version 26720 (0.0005) +[2023-03-11 18:30:58,724][66031] Updated weights for policy 0, policy_version 26800 (0.0005) +[2023-03-11 18:30:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 13721600. Throughput: 0: 9441.0. Samples: 13700556. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:30:59,012][65744] Avg episode reward: [(0, '3981.502')] +[2023-03-11 18:31:03,209][66031] Updated weights for policy 0, policy_version 26880 (0.0005) +[2023-03-11 18:31:04,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9427.7). Total num frames: 13766656. Throughput: 0: 9465.4. Samples: 13755780. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:31:04,012][65744] Avg episode reward: [(0, '3937.378')] +[2023-03-11 18:31:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000026888_13766656.pth... +[2023-03-11 18:31:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000026336_13484032.pth +[2023-03-11 18:31:07,770][66031] Updated weights for policy 0, policy_version 26960 (0.0005) +[2023-03-11 18:31:09,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9427.7). Total num frames: 13811712. Throughput: 0: 9427.6. Samples: 13809820. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:31:09,012][65744] Avg episode reward: [(0, '2905.403')] +[2023-03-11 18:31:12,245][66031] Updated weights for policy 0, policy_version 27040 (0.0005) +[2023-03-11 18:31:14,012][65744] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 13856768. Throughput: 0: 9394.2. Samples: 13836760. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:31:14,012][65744] Avg episode reward: [(0, '3527.732')] +[2023-03-11 18:31:16,728][66031] Updated weights for policy 0, policy_version 27120 (0.0005) +[2023-03-11 18:31:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 13905920. Throughput: 0: 9373.6. Samples: 13892332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:31:19,012][65744] Avg episode reward: [(0, '3438.188')] +[2023-03-11 18:31:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000027160_13905920.pth... +[2023-03-11 18:31:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000026616_13627392.pth +[2023-03-11 18:31:21,195][66031] Updated weights for policy 0, policy_version 27200 (0.0005) +[2023-03-11 18:31:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9400.0). Total num frames: 13950976. Throughput: 0: 9322.8. Samples: 13946888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:31:24,012][65744] Avg episode reward: [(0, '3702.390')] +[2023-03-11 18:31:25,777][66031] Updated weights for policy 0, policy_version 27280 (0.0005) +[2023-03-11 18:31:29,012][65744] Fps is (10 sec: 8601.7, 60 sec: 9284.3, 300 sec: 9386.1). Total num frames: 13991936. Throughput: 0: 9257.0. Samples: 13973280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:31:29,012][65744] Avg episode reward: [(0, '3263.627')] +[2023-03-11 18:31:30,408][66031] Updated weights for policy 0, policy_version 27360 (0.0005) +[2023-03-11 18:31:34,012][65744] Fps is (10 sec: 8601.5, 60 sec: 9216.0, 300 sec: 9372.2). Total num frames: 14036992. Throughput: 0: 9138.7. Samples: 14025764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:31:34,012][65744] Avg episode reward: [(0, '3189.260')] +[2023-03-11 18:31:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000027416_14036992.pth... +[2023-03-11 18:31:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000026888_13766656.pth +[2023-03-11 18:31:35,128][66031] Updated weights for policy 0, policy_version 27440 (0.0005) +[2023-03-11 18:31:39,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9358.3). Total num frames: 14082048. Throughput: 0: 9019.9. Samples: 14077872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:31:39,012][65744] Avg episode reward: [(0, '3140.560')] +[2023-03-11 18:31:39,933][66031] Updated weights for policy 0, policy_version 27520 (0.0005) +[2023-03-11 18:31:44,012][65744] Fps is (10 sec: 8601.6, 60 sec: 9079.5, 300 sec: 9344.4). Total num frames: 14123008. Throughput: 0: 8937.3. Samples: 14102736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:31:44,012][65744] Avg episode reward: [(0, '2723.030')] +[2023-03-11 18:31:44,782][66031] Updated weights for policy 0, policy_version 27600 (0.0004) +[2023-03-11 18:31:49,012][65744] Fps is (10 sec: 8601.6, 60 sec: 9011.2, 300 sec: 9330.5). Total num frames: 14168064. Throughput: 0: 8889.3. Samples: 14155796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:31:49,012][65744] Avg episode reward: [(0, '3326.628')] +[2023-03-11 18:31:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000027672_14168064.pth... +[2023-03-11 18:31:49,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000027160_13905920.pth +[2023-03-11 18:31:49,323][66031] Updated weights for policy 0, policy_version 27680 (0.0005) +[2023-03-11 18:31:53,867][66031] Updated weights for policy 0, policy_version 27760 (0.0005) +[2023-03-11 18:31:54,012][65744] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9316.7). Total num frames: 14213120. Throughput: 0: 8876.3. Samples: 14209252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:31:54,012][65744] Avg episode reward: [(0, '3334.544')] +[2023-03-11 18:31:58,317][66031] Updated weights for policy 0, policy_version 27840 (0.0005) +[2023-03-11 18:31:59,012][65744] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9302.8). Total num frames: 14258176. Throughput: 0: 8891.5. Samples: 14236880. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:31:59,012][65744] Avg episode reward: [(0, '3357.415')] +[2023-03-11 18:32:02,734][66031] Updated weights for policy 0, policy_version 27920 (0.0005) +[2023-03-11 18:32:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9011.2, 300 sec: 9316.7). Total num frames: 14307328. Throughput: 0: 8879.7. Samples: 14291920. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:32:04,012][65744] Avg episode reward: [(0, '3492.704')] +[2023-03-11 18:32:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000027944_14307328.pth... +[2023-03-11 18:32:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000027416_14036992.pth +[2023-03-11 18:32:06,662][66031] Updated weights for policy 0, policy_version 28000 (0.0005) +[2023-03-11 18:32:09,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9079.5, 300 sec: 9316.7). Total num frames: 14356480. Throughput: 0: 9042.1. Samples: 14353784. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:32:09,012][65744] Avg episode reward: [(0, '3305.053')] +[2023-03-11 18:32:10,734][66031] Updated weights for policy 0, policy_version 28080 (0.0005) +[2023-03-11 18:32:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9216.0, 300 sec: 9330.6). Total num frames: 14409728. Throughput: 0: 9129.8. Samples: 14384124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:32:14,012][65744] Avg episode reward: [(0, '3528.616')] +[2023-03-11 18:32:14,743][66031] Updated weights for policy 0, policy_version 28160 (0.0005) +[2023-03-11 18:32:18,827][66031] Updated weights for policy 0, policy_version 28240 (0.0005) +[2023-03-11 18:32:19,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9216.0, 300 sec: 9316.7). Total num frames: 14458880. Throughput: 0: 9317.7. Samples: 14445060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:32:19,012][65744] Avg episode reward: [(0, '3998.601')] +[2023-03-11 18:32:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000028240_14458880.pth... +[2023-03-11 18:32:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000027672_14168064.pth +[2023-03-11 18:32:23,025][66031] Updated weights for policy 0, policy_version 28320 (0.0005) +[2023-03-11 18:32:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9284.3, 300 sec: 9316.7). Total num frames: 14508032. Throughput: 0: 9469.5. Samples: 14504000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:32:24,012][65744] Avg episode reward: [(0, '3386.595')] +[2023-03-11 18:32:27,287][66031] Updated weights for policy 0, policy_version 28400 (0.0005) +[2023-03-11 18:32:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9302.8). Total num frames: 14553088. Throughput: 0: 9565.8. Samples: 14533196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:32:29,012][65744] Avg episode reward: [(0, '3596.527')] +[2023-03-11 18:32:32,101][66031] Updated weights for policy 0, policy_version 28480 (0.0005) +[2023-03-11 18:32:34,012][65744] Fps is (10 sec: 9011.1, 60 sec: 9352.5, 300 sec: 9288.9). Total num frames: 14598144. Throughput: 0: 9537.5. Samples: 14584984. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:32:34,012][65744] Avg episode reward: [(0, '2531.098')] +[2023-03-11 18:32:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000028512_14598144.pth... +[2023-03-11 18:32:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000027944_14307328.pth +[2023-03-11 18:32:36,659][66031] Updated weights for policy 0, policy_version 28560 (0.0005) +[2023-03-11 18:32:39,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9275.0). Total num frames: 14643200. Throughput: 0: 9551.7. Samples: 14639080. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:32:39,012][65744] Avg episode reward: [(0, '2714.956')] +[2023-03-11 18:32:41,204][66031] Updated weights for policy 0, policy_version 28640 (0.0005) +[2023-03-11 18:32:44,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9261.1). Total num frames: 14688256. Throughput: 0: 9542.9. Samples: 14666312. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:32:44,012][65744] Avg episode reward: [(0, '3353.972')] +[2023-03-11 18:32:45,851][66031] Updated weights for policy 0, policy_version 28720 (0.0005) +[2023-03-11 18:32:49,012][65744] Fps is (10 sec: 8601.6, 60 sec: 9352.5, 300 sec: 9233.4). Total num frames: 14729216. Throughput: 0: 9480.0. Samples: 14718520. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:32:49,012][65744] Avg episode reward: [(0, '1753.954')] +[2023-03-11 18:32:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000028768_14729216.pth... +[2023-03-11 18:32:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000028240_14458880.pth +[2023-03-11 18:32:50,652][66031] Updated weights for policy 0, policy_version 28800 (0.0005) +[2023-03-11 18:32:54,012][65744] Fps is (10 sec: 8601.6, 60 sec: 9352.5, 300 sec: 9219.5). Total num frames: 14774272. Throughput: 0: 9254.6. Samples: 14770240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:32:54,012][65744] Avg episode reward: [(0, '1775.148')] +[2023-03-11 18:32:55,359][66031] Updated weights for policy 0, policy_version 28880 (0.0005) +[2023-03-11 18:32:59,012][65744] Fps is (10 sec: 8601.6, 60 sec: 9284.3, 300 sec: 9205.6). Total num frames: 14815232. Throughput: 0: 9141.4. Samples: 14795488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:32:59,012][65744] Avg episode reward: [(0, '1848.438')] +[2023-03-11 18:33:00,341][66031] Updated weights for policy 0, policy_version 28960 (0.0005) +[2023-03-11 18:33:04,012][65744] Fps is (10 sec: 8191.9, 60 sec: 9147.7, 300 sec: 9177.8). Total num frames: 14856192. Throughput: 0: 8883.8. Samples: 14844832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:33:04,012][65744] Avg episode reward: [(0, '1830.077')] +[2023-03-11 18:33:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000029016_14856192.pth... +[2023-03-11 18:33:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000028512_14598144.pth +[2023-03-11 18:33:05,191][66031] Updated weights for policy 0, policy_version 29040 (0.0005) +[2023-03-11 18:33:09,012][65744] Fps is (10 sec: 8601.6, 60 sec: 9079.5, 300 sec: 9163.9). Total num frames: 14901248. Throughput: 0: 8773.3. Samples: 14898800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:33:09,012][65744] Avg episode reward: [(0, '2619.658')] +[2023-03-11 18:33:09,514][66031] Updated weights for policy 0, policy_version 29120 (0.0005) +[2023-03-11 18:33:13,686][66031] Updated weights for policy 0, policy_version 29200 (0.0005) +[2023-03-11 18:33:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9011.2, 300 sec: 9177.8). Total num frames: 14950400. Throughput: 0: 8785.5. Samples: 14928544. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:33:14,012][65744] Avg episode reward: [(0, '3461.618')] +[2023-03-11 18:33:18,248][66031] Updated weights for policy 0, policy_version 29280 (0.0005) +[2023-03-11 18:33:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 8942.9, 300 sec: 9163.9). Total num frames: 14995456. Throughput: 0: 8866.0. Samples: 14983952. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:33:19,012][65744] Avg episode reward: [(0, '3755.559')] +[2023-03-11 18:33:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000029288_14995456.pth... +[2023-03-11 18:33:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000028768_14729216.pth +[2023-03-11 18:33:22,726][66031] Updated weights for policy 0, policy_version 29360 (0.0005) +[2023-03-11 18:33:24,012][65744] Fps is (10 sec: 9011.2, 60 sec: 8874.7, 300 sec: 9177.8). Total num frames: 15040512. Throughput: 0: 8885.2. Samples: 15038912. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:33:24,012][65744] Avg episode reward: [(0, '3556.239')] +[2023-03-11 18:33:27,167][66031] Updated weights for policy 0, policy_version 29440 (0.0005) +[2023-03-11 18:33:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 8942.9, 300 sec: 9177.8). Total num frames: 15089664. Throughput: 0: 8892.3. Samples: 15066464. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:33:29,012][65744] Avg episode reward: [(0, '3395.663')] +[2023-03-11 18:33:31,291][66031] Updated weights for policy 0, policy_version 29520 (0.0005) +[2023-03-11 18:33:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9011.2, 300 sec: 9191.7). Total num frames: 15138816. Throughput: 0: 9059.8. Samples: 15126212. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:33:34,012][65744] Avg episode reward: [(0, '3708.382')] +[2023-03-11 18:33:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000029568_15138816.pth... +[2023-03-11 18:33:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000029016_14856192.pth +[2023-03-11 18:33:35,354][66031] Updated weights for policy 0, policy_version 29600 (0.0005) +[2023-03-11 18:33:39,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9079.5, 300 sec: 9205.6). Total num frames: 15187968. Throughput: 0: 9201.6. Samples: 15184312. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:33:39,012][65744] Avg episode reward: [(0, '3995.122')] +[2023-03-11 18:33:39,777][66031] Updated weights for policy 0, policy_version 29680 (0.0005) +[2023-03-11 18:33:44,012][65744] Fps is (10 sec: 9011.3, 60 sec: 9011.2, 300 sec: 9191.7). Total num frames: 15228928. Throughput: 0: 9236.0. Samples: 15211108. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:33:44,012][65744] Avg episode reward: [(0, '3942.125')] +[2023-03-11 18:33:44,538][66031] Updated weights for policy 0, policy_version 29760 (0.0006) +[2023-03-11 18:33:49,012][65744] Fps is (10 sec: 8601.6, 60 sec: 9079.5, 300 sec: 9191.7). Total num frames: 15273984. Throughput: 0: 9271.0. Samples: 15262028. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:33:49,012][65744] Avg episode reward: [(0, '3556.058')] +[2023-03-11 18:33:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000029832_15273984.pth... +[2023-03-11 18:33:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000029288_14995456.pth +[2023-03-11 18:33:49,205][66031] Updated weights for policy 0, policy_version 29840 (0.0006) +[2023-03-11 18:33:53,995][66031] Updated weights for policy 0, policy_version 29920 (0.0005) +[2023-03-11 18:33:54,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9191.7). Total num frames: 15319040. Throughput: 0: 9243.8. Samples: 15314772. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:33:54,012][65744] Avg episode reward: [(0, '3652.383')] +[2023-03-11 18:33:58,422][66031] Updated weights for policy 0, policy_version 30000 (0.0005) +[2023-03-11 18:33:59,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9191.7). Total num frames: 15364096. Throughput: 0: 9172.5. Samples: 15341304. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:33:59,012][65744] Avg episode reward: [(0, '3170.997')] +[2023-03-11 18:34:02,748][66031] Updated weights for policy 0, policy_version 30080 (0.0005) +[2023-03-11 18:34:04,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9191.7). Total num frames: 15409152. Throughput: 0: 9200.6. Samples: 15397976. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:34:04,012][65744] Avg episode reward: [(0, '3729.457')] +[2023-03-11 18:34:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000030104_15413248.pth... +[2023-03-11 18:34:04,016][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000029568_15138816.pth +[2023-03-11 18:34:07,099][66031] Updated weights for policy 0, policy_version 30160 (0.0005) +[2023-03-11 18:34:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9205.6). Total num frames: 15458304. Throughput: 0: 9230.4. Samples: 15454280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:34:09,012][65744] Avg episode reward: [(0, '3610.998')] +[2023-03-11 18:34:11,707][66031] Updated weights for policy 0, policy_version 30240 (0.0005) +[2023-03-11 18:34:14,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9205.6). Total num frames: 15499264. Throughput: 0: 9201.5. Samples: 15480532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:34:14,012][65744] Avg episode reward: [(0, '3116.135')] +[2023-03-11 18:34:16,600][66031] Updated weights for policy 0, policy_version 30320 (0.0005) +[2023-03-11 18:34:19,012][65744] Fps is (10 sec: 8601.6, 60 sec: 9147.7, 300 sec: 9205.6). Total num frames: 15544320. Throughput: 0: 9011.2. Samples: 15531716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:34:19,012][65744] Avg episode reward: [(0, '2399.387')] +[2023-03-11 18:34:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000030360_15544320.pth... +[2023-03-11 18:34:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000029832_15273984.pth +[2023-03-11 18:34:21,283][66031] Updated weights for policy 0, policy_version 30400 (0.0005) +[2023-03-11 18:34:24,012][65744] Fps is (10 sec: 8601.6, 60 sec: 9079.5, 300 sec: 9191.7). Total num frames: 15585280. Throughput: 0: 8851.7. Samples: 15582640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:34:24,023][65744] Avg episode reward: [(0, '3534.147')] +[2023-03-11 18:34:26,234][66031] Updated weights for policy 0, policy_version 30480 (0.0005) +[2023-03-11 18:34:29,012][65744] Fps is (10 sec: 8192.0, 60 sec: 8942.9, 300 sec: 9177.8). Total num frames: 15626240. Throughput: 0: 8814.5. Samples: 15607760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:34:29,012][65744] Avg episode reward: [(0, '3720.767')] +[2023-03-11 18:34:30,873][66031] Updated weights for policy 0, policy_version 30560 (0.0005) +[2023-03-11 18:34:34,012][65744] Fps is (10 sec: 8601.6, 60 sec: 8874.7, 300 sec: 9163.9). Total num frames: 15671296. Throughput: 0: 8859.1. Samples: 15660688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:34:34,012][65744] Avg episode reward: [(0, '3817.733')] +[2023-03-11 18:34:34,024][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000030616_15675392.pth... +[2023-03-11 18:34:34,026][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000030104_15413248.pth +[2023-03-11 18:34:35,319][66031] Updated weights for policy 0, policy_version 30640 (0.0004) +[2023-03-11 18:34:39,012][65744] Fps is (10 sec: 9011.2, 60 sec: 8806.4, 300 sec: 9163.9). Total num frames: 15716352. Throughput: 0: 8906.9. Samples: 15715584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:34:39,012][65744] Avg episode reward: [(0, '3313.665')] +[2023-03-11 18:34:39,968][66031] Updated weights for policy 0, policy_version 30720 (0.0004) +[2023-03-11 18:34:44,012][65744] Fps is (10 sec: 9011.2, 60 sec: 8874.7, 300 sec: 9150.0). Total num frames: 15761408. Throughput: 0: 8882.7. Samples: 15741024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:34:44,012][65744] Avg episode reward: [(0, '3202.722')] +[2023-03-11 18:34:44,797][66031] Updated weights for policy 0, policy_version 30800 (0.0003) +[2023-03-11 18:34:49,012][65744] Fps is (10 sec: 8601.6, 60 sec: 8806.4, 300 sec: 9122.3). Total num frames: 15802368. Throughput: 0: 8783.2. Samples: 15793220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:34:49,012][65744] Avg episode reward: [(0, '3072.417')] +[2023-03-11 18:34:49,045][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000030872_15806464.pth... +[2023-03-11 18:34:49,049][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000030360_15544320.pth +[2023-03-11 18:34:49,559][66031] Updated weights for policy 0, policy_version 30880 (0.0004) +[2023-03-11 18:34:54,012][65744] Fps is (10 sec: 8601.6, 60 sec: 8806.4, 300 sec: 9108.4). Total num frames: 15847424. Throughput: 0: 8657.7. Samples: 15843876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:34:54,012][65744] Avg episode reward: [(0, '3160.228')] +[2023-03-11 18:34:54,276][66031] Updated weights for policy 0, policy_version 30960 (0.0004) +[2023-03-11 18:34:58,705][66031] Updated weights for policy 0, policy_version 31040 (0.0003) +[2023-03-11 18:34:59,012][65744] Fps is (10 sec: 9011.2, 60 sec: 8806.4, 300 sec: 9108.4). Total num frames: 15892480. Throughput: 0: 8699.3. Samples: 15872000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:34:59,012][65744] Avg episode reward: [(0, '3706.460')] +[2023-03-11 18:35:02,965][66031] Updated weights for policy 0, policy_version 31120 (0.0003) +[2023-03-11 18:35:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 8874.7, 300 sec: 9122.3). Total num frames: 15941632. Throughput: 0: 8817.9. Samples: 15928520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:35:04,012][65744] Avg episode reward: [(0, '3710.351')] +[2023-03-11 18:35:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000031136_15941632.pth... +[2023-03-11 18:35:04,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000030616_15675392.pth +[2023-03-11 18:35:07,220][66031] Updated weights for policy 0, policy_version 31200 (0.0003) +[2023-03-11 18:35:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 8874.7, 300 sec: 9136.2). Total num frames: 15990784. Throughput: 0: 8972.4. Samples: 15986396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:35:09,012][65744] Avg episode reward: [(0, '3566.464')] +[2023-03-11 18:35:11,439][66031] Updated weights for policy 0, policy_version 31280 (0.0003) +[2023-03-11 18:35:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9011.2, 300 sec: 9150.1). Total num frames: 16039936. Throughput: 0: 9059.2. Samples: 16015424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:35:14,012][65744] Avg episode reward: [(0, '3714.735')] +[2023-03-11 18:35:15,718][66031] Updated weights for policy 0, policy_version 31360 (0.0003) +[2023-03-11 18:35:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9011.2, 300 sec: 9136.2). Total num frames: 16084992. Throughput: 0: 9164.3. Samples: 16073080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:35:19,012][65744] Avg episode reward: [(0, '3593.755')] +[2023-03-11 18:35:19,061][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000031424_16089088.pth... +[2023-03-11 18:35:19,063][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000030872_15806464.pth +[2023-03-11 18:35:19,948][66031] Updated weights for policy 0, policy_version 31440 (0.0004) +[2023-03-11 18:35:24,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9147.7, 300 sec: 9150.0). Total num frames: 16134144. Throughput: 0: 9250.8. Samples: 16131868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:35:24,012][65744] Avg episode reward: [(0, '3230.380')] +[2023-03-11 18:35:24,104][66031] Updated weights for policy 0, policy_version 31520 (0.0003) +[2023-03-11 18:35:28,426][66031] Updated weights for policy 0, policy_version 31600 (0.0003) +[2023-03-11 18:35:29,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9284.3, 300 sec: 9150.0). Total num frames: 16183296. Throughput: 0: 9332.5. Samples: 16160988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:35:29,012][65744] Avg episode reward: [(0, '3009.571')] +[2023-03-11 18:35:32,708][66031] Updated weights for policy 0, policy_version 31680 (0.0003) +[2023-03-11 18:35:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9150.0). Total num frames: 16228352. Throughput: 0: 9436.3. Samples: 16217852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:35:34,012][65744] Avg episode reward: [(0, '3389.621')] +[2023-03-11 18:35:34,038][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000031704_16232448.pth... +[2023-03-11 18:35:34,042][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000031136_15941632.pth +[2023-03-11 18:35:37,077][66031] Updated weights for policy 0, policy_version 31760 (0.0004) +[2023-03-11 18:35:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9150.0). Total num frames: 16277504. Throughput: 0: 9583.3. Samples: 16275124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:35:39,012][65744] Avg episode reward: [(0, '2965.298')] +[2023-03-11 18:35:41,243][66031] Updated weights for policy 0, policy_version 31840 (0.0004) +[2023-03-11 18:35:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9150.0). Total num frames: 16326656. Throughput: 0: 9598.5. Samples: 16303932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:35:44,012][65744] Avg episode reward: [(0, '3880.345')] +[2023-03-11 18:35:45,435][66031] Updated weights for policy 0, policy_version 31920 (0.0004) +[2023-03-11 18:35:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9557.3, 300 sec: 9150.0). Total num frames: 16375808. Throughput: 0: 9642.0. Samples: 16362408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:35:49,012][65744] Avg episode reward: [(0, '3930.909')] +[2023-03-11 18:35:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000031984_16375808.pth... +[2023-03-11 18:35:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000031424_16089088.pth +[2023-03-11 18:35:49,701][66031] Updated weights for policy 0, policy_version 32000 (0.0005) +[2023-03-11 18:35:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9150.0). Total num frames: 16420864. Throughput: 0: 9600.8. Samples: 16418432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:35:54,012][65744] Avg episode reward: [(0, '3737.417')] +[2023-03-11 18:35:54,133][66031] Updated weights for policy 0, policy_version 32080 (0.0003) +[2023-03-11 18:35:58,549][66031] Updated weights for policy 0, policy_version 32160 (0.0003) +[2023-03-11 18:35:59,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9163.9). Total num frames: 16470016. Throughput: 0: 9570.8. Samples: 16446108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:35:59,023][65744] Avg episode reward: [(0, '3457.474')] +[2023-03-11 18:36:02,744][66031] Updated weights for policy 0, policy_version 32240 (0.0003) +[2023-03-11 18:36:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9177.8). Total num frames: 16519168. Throughput: 0: 9574.8. Samples: 16503948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:36:04,012][65744] Avg episode reward: [(0, '3771.662')] +[2023-03-11 18:36:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000032264_16519168.pth... +[2023-03-11 18:36:04,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000031704_16232448.pth +[2023-03-11 18:36:06,852][66031] Updated weights for policy 0, policy_version 32320 (0.0004) +[2023-03-11 18:36:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9177.8). Total num frames: 16564224. Throughput: 0: 9577.8. Samples: 16562868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:36:09,012][65744] Avg episode reward: [(0, '2628.152')] +[2023-03-11 18:36:11,466][66031] Updated weights for policy 0, policy_version 32400 (0.0005) +[2023-03-11 18:36:14,012][65744] Fps is (10 sec: 9011.3, 60 sec: 9489.1, 300 sec: 9163.9). Total num frames: 16609280. Throughput: 0: 9507.1. Samples: 16588808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:36:14,012][65744] Avg episode reward: [(0, '3008.803')] +[2023-03-11 18:36:15,917][66031] Updated weights for policy 0, policy_version 32480 (0.0005) +[2023-03-11 18:36:19,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9177.8). Total num frames: 16658432. Throughput: 0: 9492.7. Samples: 16645024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:36:19,012][65744] Avg episode reward: [(0, '3291.014')] +[2023-03-11 18:36:19,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000032536_16658432.pth... +[2023-03-11 18:36:19,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000031984_16375808.pth +[2023-03-11 18:36:19,956][66031] Updated weights for policy 0, policy_version 32560 (0.0005) +[2023-03-11 18:36:23,962][66031] Updated weights for policy 0, policy_version 32640 (0.0004) +[2023-03-11 18:36:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9625.6, 300 sec: 9219.5). Total num frames: 16711680. Throughput: 0: 9591.5. Samples: 16706740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:36:24,012][65744] Avg episode reward: [(0, '3192.058')] +[2023-03-11 18:36:27,999][66031] Updated weights for policy 0, policy_version 32720 (0.0005) +[2023-03-11 18:36:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9625.6, 300 sec: 9233.4). Total num frames: 16760832. Throughput: 0: 9615.9. Samples: 16736648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:36:29,012][65744] Avg episode reward: [(0, '3293.774')] +[2023-03-11 18:36:32,089][66031] Updated weights for policy 0, policy_version 32800 (0.0005) +[2023-03-11 18:36:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9247.2). Total num frames: 16809984. Throughput: 0: 9667.1. Samples: 16797428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:36:34,012][65744] Avg episode reward: [(0, '3065.544')] +[2023-03-11 18:36:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000032832_16809984.pth... +[2023-03-11 18:36:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000032264_16519168.pth +[2023-03-11 18:36:36,172][66031] Updated weights for policy 0, policy_version 32880 (0.0005) +[2023-03-11 18:36:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9288.9). Total num frames: 16863232. Throughput: 0: 9763.2. Samples: 16857776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:36:39,012][65744] Avg episode reward: [(0, '2031.479')] +[2023-03-11 18:36:40,221][66031] Updated weights for policy 0, policy_version 32960 (0.0005) +[2023-03-11 18:36:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9762.1, 300 sec: 9302.8). Total num frames: 16912384. Throughput: 0: 9820.8. Samples: 16888044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:36:44,012][65744] Avg episode reward: [(0, '3432.259')] +[2023-03-11 18:36:44,164][66031] Updated weights for policy 0, policy_version 33040 (0.0005) +[2023-03-11 18:36:48,201][66031] Updated weights for policy 0, policy_version 33120 (0.0005) +[2023-03-11 18:36:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9330.5). Total num frames: 16965632. Throughput: 0: 9899.8. Samples: 16949440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:36:49,012][65744] Avg episode reward: [(0, '3530.422')] +[2023-03-11 18:36:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000033136_16965632.pth... +[2023-03-11 18:36:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000032536_16658432.pth +[2023-03-11 18:36:52,430][66031] Updated weights for policy 0, policy_version 33200 (0.0005) +[2023-03-11 18:36:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9330.5). Total num frames: 17010688. Throughput: 0: 9884.6. Samples: 17007676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:36:54,012][65744] Avg episode reward: [(0, '3291.717')] +[2023-03-11 18:36:56,755][66031] Updated weights for policy 0, policy_version 33280 (0.0006) +[2023-03-11 18:36:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9330.6). Total num frames: 17059840. Throughput: 0: 9939.4. Samples: 17036080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:36:59,012][65744] Avg episode reward: [(0, '2629.042')] +[2023-03-11 18:37:01,141][66031] Updated weights for policy 0, policy_version 33360 (0.0005) +[2023-03-11 18:37:04,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9316.7). Total num frames: 17104896. Throughput: 0: 9946.5. Samples: 17092616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:37:04,012][65744] Avg episode reward: [(0, '2762.655')] +[2023-03-11 18:37:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000033408_17104896.pth... +[2023-03-11 18:37:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000032832_16809984.pth +[2023-03-11 18:37:05,560][66031] Updated weights for policy 0, policy_version 33440 (0.0005) +[2023-03-11 18:37:09,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9762.1, 300 sec: 9288.9). Total num frames: 17149952. Throughput: 0: 9821.1. Samples: 17148688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:37:09,012][65744] Avg episode reward: [(0, '3354.503')] +[2023-03-11 18:37:10,015][66031] Updated weights for policy 0, policy_version 33520 (0.0005) +[2023-03-11 18:37:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9288.9). Total num frames: 17199104. Throughput: 0: 9748.7. Samples: 17175340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:37:14,012][65744] Avg episode reward: [(0, '3338.524')] +[2023-03-11 18:37:14,391][66031] Updated weights for policy 0, policy_version 33600 (0.0005) +[2023-03-11 18:37:18,740][66031] Updated weights for policy 0, policy_version 33680 (0.0006) +[2023-03-11 18:37:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9275.0). Total num frames: 17244160. Throughput: 0: 9661.6. Samples: 17232200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:37:19,012][65744] Avg episode reward: [(0, '2666.342')] +[2023-03-11 18:37:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000033680_17244160.pth... +[2023-03-11 18:37:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000033136_16965632.pth +[2023-03-11 18:37:23,077][66031] Updated weights for policy 0, policy_version 33760 (0.0005) +[2023-03-11 18:37:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9288.9). Total num frames: 17293312. Throughput: 0: 9584.3. Samples: 17289068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:37:24,012][65744] Avg episode reward: [(0, '2961.349')] +[2023-03-11 18:37:27,455][66031] Updated weights for policy 0, policy_version 33840 (0.0005) +[2023-03-11 18:37:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9288.9). Total num frames: 17338368. Throughput: 0: 9524.3. Samples: 17316636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:37:29,012][65744] Avg episode reward: [(0, '2811.825')] +[2023-03-11 18:37:31,845][66031] Updated weights for policy 0, policy_version 33920 (0.0005) +[2023-03-11 18:37:34,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9288.9). Total num frames: 17383424. Throughput: 0: 9402.3. Samples: 17372544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:37:34,012][65744] Avg episode reward: [(0, '3248.946')] +[2023-03-11 18:37:34,064][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000033960_17387520.pth... +[2023-03-11 18:37:34,067][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000033408_17104896.pth +[2023-03-11 18:37:36,084][66031] Updated weights for policy 0, policy_version 34000 (0.0004) +[2023-03-11 18:37:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9302.8). Total num frames: 17432576. Throughput: 0: 9411.6. Samples: 17431200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:37:39,012][65744] Avg episode reward: [(0, '3765.772')] +[2023-03-11 18:37:40,417][66031] Updated weights for policy 0, policy_version 34080 (0.0004) +[2023-03-11 18:37:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9330.6). Total num frames: 17481728. Throughput: 0: 9393.2. Samples: 17458772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:37:44,012][65744] Avg episode reward: [(0, '3556.484')] +[2023-03-11 18:37:44,905][66031] Updated weights for policy 0, policy_version 34160 (0.0006) +[2023-03-11 18:37:49,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9330.5). Total num frames: 17526784. Throughput: 0: 9366.1. Samples: 17514092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:37:49,012][65744] Avg episode reward: [(0, '3313.430')] +[2023-03-11 18:37:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000034232_17526784.pth... +[2023-03-11 18:37:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000033680_17244160.pth +[2023-03-11 18:37:49,315][66031] Updated weights for policy 0, policy_version 34240 (0.0005) +[2023-03-11 18:37:53,768][66031] Updated weights for policy 0, policy_version 34320 (0.0005) +[2023-03-11 18:37:54,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9344.4). Total num frames: 17571840. Throughput: 0: 9340.3. Samples: 17569000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:37:54,012][65744] Avg episode reward: [(0, '3950.001')] +[2023-03-11 18:37:58,119][66031] Updated weights for policy 0, policy_version 34400 (0.0005) +[2023-03-11 18:37:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 17620992. Throughput: 0: 9367.4. Samples: 17596872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:37:59,012][65744] Avg episode reward: [(0, '3990.567')] +[2023-03-11 18:38:02,324][66031] Updated weights for policy 0, policy_version 34480 (0.0005) +[2023-03-11 18:38:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 17666048. Throughput: 0: 9392.8. Samples: 17654876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:38:04,012][65744] Avg episode reward: [(0, '3727.799')] +[2023-03-11 18:38:04,058][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000034512_17670144.pth... +[2023-03-11 18:38:04,060][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000033960_17387520.pth +[2023-03-11 18:38:06,513][66031] Updated weights for policy 0, policy_version 34560 (0.0005) +[2023-03-11 18:38:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9386.1). Total num frames: 17719296. Throughput: 0: 9469.6. Samples: 17715200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:38:09,012][65744] Avg episode reward: [(0, '4164.006')] +[2023-03-11 18:38:10,511][66031] Updated weights for policy 0, policy_version 34640 (0.0004) +[2023-03-11 18:38:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9489.1, 300 sec: 9400.0). Total num frames: 17768448. Throughput: 0: 9522.6. Samples: 17745152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:38:14,012][65744] Avg episode reward: [(0, '4041.607')] +[2023-03-11 18:38:14,500][66031] Updated weights for policy 0, policy_version 34720 (0.0005) +[2023-03-11 18:38:18,480][66031] Updated weights for policy 0, policy_version 34800 (0.0004) +[2023-03-11 18:38:19,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9625.6, 300 sec: 9427.7). Total num frames: 17821696. Throughput: 0: 9650.7. Samples: 17806828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:38:19,012][65744] Avg episode reward: [(0, '3640.858')] +[2023-03-11 18:38:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000034808_17821696.pth... +[2023-03-11 18:38:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000034232_17526784.pth +[2023-03-11 18:38:22,450][66031] Updated weights for policy 0, policy_version 34880 (0.0004) +[2023-03-11 18:38:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9625.6, 300 sec: 9427.7). Total num frames: 17870848. Throughput: 0: 9713.5. Samples: 17868308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:38:24,012][65744] Avg episode reward: [(0, '3012.484')] +[2023-03-11 18:38:26,635][66031] Updated weights for policy 0, policy_version 34960 (0.0005) +[2023-03-11 18:38:29,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9427.7). Total num frames: 17920000. Throughput: 0: 9764.4. Samples: 17898172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:38:29,012][65744] Avg episode reward: [(0, '3198.195')] +[2023-03-11 18:38:30,900][66031] Updated weights for policy 0, policy_version 35040 (0.0005) +[2023-03-11 18:38:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9427.7). Total num frames: 17969152. Throughput: 0: 9808.8. Samples: 17955488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:38:34,012][65744] Avg episode reward: [(0, '3257.122')] +[2023-03-11 18:38:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000035096_17969152.pth... +[2023-03-11 18:38:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000034512_17670144.pth +[2023-03-11 18:38:35,135][66031] Updated weights for policy 0, policy_version 35120 (0.0005) +[2023-03-11 18:38:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9455.5). Total num frames: 18018304. Throughput: 0: 9906.5. Samples: 18014792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:38:39,012][65744] Avg episode reward: [(0, '3268.032')] +[2023-03-11 18:38:39,213][66031] Updated weights for policy 0, policy_version 35200 (0.0004) +[2023-03-11 18:38:43,326][66031] Updated weights for policy 0, policy_version 35280 (0.0004) +[2023-03-11 18:38:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9469.4). Total num frames: 18067456. Throughput: 0: 9943.9. Samples: 18044348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:38:44,012][65744] Avg episode reward: [(0, '3507.853')] +[2023-03-11 18:38:47,507][66031] Updated weights for policy 0, policy_version 35360 (0.0005) +[2023-03-11 18:38:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9483.3). Total num frames: 18116608. Throughput: 0: 9982.8. Samples: 18104104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:38:49,012][65744] Avg episode reward: [(0, '3581.420')] +[2023-03-11 18:38:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000035384_18116608.pth... +[2023-03-11 18:38:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000034808_17821696.pth +[2023-03-11 18:38:51,768][66031] Updated weights for policy 0, policy_version 35440 (0.0004) +[2023-03-11 18:38:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9497.2). Total num frames: 18165760. Throughput: 0: 9921.7. Samples: 18161676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:38:54,012][65744] Avg episode reward: [(0, '3477.927')] +[2023-03-11 18:38:56,092][66031] Updated weights for policy 0, policy_version 35520 (0.0005) +[2023-03-11 18:38:59,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 9497.2). Total num frames: 18210816. Throughput: 0: 9886.9. Samples: 18190064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:38:59,012][65744] Avg episode reward: [(0, '2898.709')] +[2023-03-11 18:39:00,328][66031] Updated weights for policy 0, policy_version 35600 (0.0005) +[2023-03-11 18:39:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9497.2). Total num frames: 18259968. Throughput: 0: 9795.3. Samples: 18247616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:39:04,012][65744] Avg episode reward: [(0, '3614.634')] +[2023-03-11 18:39:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000035664_18259968.pth... +[2023-03-11 18:39:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000035096_17969152.pth +[2023-03-11 18:39:04,664][66031] Updated weights for policy 0, policy_version 35680 (0.0005) +[2023-03-11 18:39:08,857][66031] Updated weights for policy 0, policy_version 35760 (0.0005) +[2023-03-11 18:39:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9524.9). Total num frames: 18309120. Throughput: 0: 9706.5. Samples: 18305100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:39:09,012][65744] Avg episode reward: [(0, '3809.268')] +[2023-03-11 18:39:13,277][66031] Updated weights for policy 0, policy_version 35840 (0.0005) +[2023-03-11 18:39:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9524.9). Total num frames: 18354176. Throughput: 0: 9677.3. Samples: 18333652. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:39:14,012][65744] Avg episode reward: [(0, '3379.300')] +[2023-03-11 18:39:17,608][66031] Updated weights for policy 0, policy_version 35920 (0.0005) +[2023-03-11 18:39:19,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9552.7). Total num frames: 18403328. Throughput: 0: 9646.5. Samples: 18389580. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:39:19,012][65744] Avg episode reward: [(0, '3732.154')] +[2023-03-11 18:39:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000035944_18403328.pth... +[2023-03-11 18:39:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000035384_18116608.pth +[2023-03-11 18:39:22,024][66031] Updated weights for policy 0, policy_version 36000 (0.0005) +[2023-03-11 18:39:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9566.6). Total num frames: 18448384. Throughput: 0: 9567.7. Samples: 18445340. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:39:24,012][65744] Avg episode reward: [(0, '3690.308')] +[2023-03-11 18:39:26,344][66031] Updated weights for policy 0, policy_version 36080 (0.0005) +[2023-03-11 18:39:29,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9580.5). Total num frames: 18497536. Throughput: 0: 9544.6. Samples: 18473856. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:39:29,012][65744] Avg episode reward: [(0, '3311.371')] +[2023-03-11 18:39:30,607][66031] Updated weights for policy 0, policy_version 36160 (0.0005) +[2023-03-11 18:39:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9594.4). Total num frames: 18546688. Throughput: 0: 9495.1. Samples: 18531384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:39:34,012][65744] Avg episode reward: [(0, '3418.839')] +[2023-03-11 18:39:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000036224_18546688.pth... +[2023-03-11 18:39:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000035664_18259968.pth +[2023-03-11 18:39:34,864][66031] Updated weights for policy 0, policy_version 36240 (0.0005) +[2023-03-11 18:39:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9594.4). Total num frames: 18591744. Throughput: 0: 9479.3. Samples: 18588244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:39:39,012][65744] Avg episode reward: [(0, '3824.040')] +[2023-03-11 18:39:39,227][66031] Updated weights for policy 0, policy_version 36320 (0.0005) +[2023-03-11 18:39:43,571][66031] Updated weights for policy 0, policy_version 36400 (0.0005) +[2023-03-11 18:39:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9622.1). Total num frames: 18640896. Throughput: 0: 9484.5. Samples: 18616868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:39:44,012][65744] Avg episode reward: [(0, '3363.833')] +[2023-03-11 18:39:47,915][66031] Updated weights for policy 0, policy_version 36480 (0.0005) +[2023-03-11 18:39:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9622.1). Total num frames: 18685952. Throughput: 0: 9468.0. Samples: 18673676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:39:49,012][65744] Avg episode reward: [(0, '3343.039')] +[2023-03-11 18:39:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000036496_18685952.pth... +[2023-03-11 18:39:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000035944_18403328.pth +[2023-03-11 18:39:52,276][66031] Updated weights for policy 0, policy_version 36560 (0.0005) +[2023-03-11 18:39:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9636.0). Total num frames: 18735104. Throughput: 0: 9446.8. Samples: 18730208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:39:54,012][65744] Avg episode reward: [(0, '2800.796')] +[2023-03-11 18:39:56,565][66031] Updated weights for policy 0, policy_version 36640 (0.0005) +[2023-03-11 18:39:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9622.1). Total num frames: 18780160. Throughput: 0: 9451.5. Samples: 18758968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:39:59,012][65744] Avg episode reward: [(0, '3233.357')] +[2023-03-11 18:40:00,869][66031] Updated weights for policy 0, policy_version 36720 (0.0005) +[2023-03-11 18:40:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9622.1). Total num frames: 18829312. Throughput: 0: 9486.4. Samples: 18816468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:40:04,012][65744] Avg episode reward: [(0, '3283.518')] +[2023-03-11 18:40:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000036776_18829312.pth... +[2023-03-11 18:40:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000036224_18546688.pth +[2023-03-11 18:40:05,109][66031] Updated weights for policy 0, policy_version 36800 (0.0005) +[2023-03-11 18:40:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9622.1). Total num frames: 18878464. Throughput: 0: 9528.5. Samples: 18874124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:40:09,012][65744] Avg episode reward: [(0, '3286.262')] +[2023-03-11 18:40:09,384][66031] Updated weights for policy 0, policy_version 36880 (0.0005) +[2023-03-11 18:40:13,665][66031] Updated weights for policy 0, policy_version 36960 (0.0005) +[2023-03-11 18:40:14,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9622.1). Total num frames: 18923520. Throughput: 0: 9527.8. Samples: 18902608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:40:14,012][65744] Avg episode reward: [(0, '3464.809')] +[2023-03-11 18:40:17,892][66031] Updated weights for policy 0, policy_version 37040 (0.0005) +[2023-03-11 18:40:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9622.1). Total num frames: 18972672. Throughput: 0: 9533.7. Samples: 18960400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:40:19,012][65744] Avg episode reward: [(0, '2953.480')] +[2023-03-11 18:40:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000037056_18972672.pth... +[2023-03-11 18:40:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000036496_18685952.pth +[2023-03-11 18:40:22,106][66031] Updated weights for policy 0, policy_version 37120 (0.0005) +[2023-03-11 18:40:24,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9557.3, 300 sec: 9622.1). Total num frames: 19021824. Throughput: 0: 9557.9. Samples: 19018348. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:40:24,012][65744] Avg episode reward: [(0, '2374.877')] +[2023-03-11 18:40:26,336][66031] Updated weights for policy 0, policy_version 37200 (0.0005) +[2023-03-11 18:40:29,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9636.0). Total num frames: 19070976. Throughput: 0: 9566.9. Samples: 19047380. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:40:29,012][65744] Avg episode reward: [(0, '2160.463')] +[2023-03-11 18:40:30,692][66031] Updated weights for policy 0, policy_version 37280 (0.0005) +[2023-03-11 18:40:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9622.1). Total num frames: 19116032. Throughput: 0: 9558.7. Samples: 19103816. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:40:34,012][65744] Avg episode reward: [(0, '2939.326')] +[2023-03-11 18:40:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000037336_19116032.pth... +[2023-03-11 18:40:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000036776_18829312.pth +[2023-03-11 18:40:34,961][66031] Updated weights for policy 0, policy_version 37360 (0.0005) +[2023-03-11 18:40:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9622.1). Total num frames: 19165184. Throughput: 0: 9583.8. Samples: 19161480. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:40:39,012][65744] Avg episode reward: [(0, '3300.833')] +[2023-03-11 18:40:39,260][66031] Updated weights for policy 0, policy_version 37440 (0.0005) +[2023-03-11 18:40:43,549][66031] Updated weights for policy 0, policy_version 37520 (0.0005) +[2023-03-11 18:40:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9622.1). Total num frames: 19214336. Throughput: 0: 9574.6. Samples: 19189824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:40:44,012][65744] Avg episode reward: [(0, '3745.349')] +[2023-03-11 18:40:47,830][66031] Updated weights for policy 0, policy_version 37600 (0.0005) +[2023-03-11 18:40:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9622.1). Total num frames: 19259392. Throughput: 0: 9577.2. Samples: 19247440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:40:49,012][65744] Avg episode reward: [(0, '3364.146')] +[2023-03-11 18:40:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000037616_19259392.pth... +[2023-03-11 18:40:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000037056_18972672.pth +[2023-03-11 18:40:52,080][66031] Updated weights for policy 0, policy_version 37680 (0.0005) +[2023-03-11 18:40:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9622.1). Total num frames: 19308544. Throughput: 0: 9586.9. Samples: 19305536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:40:54,012][65744] Avg episode reward: [(0, '3579.191')] +[2023-03-11 18:40:56,298][66031] Updated weights for policy 0, policy_version 37760 (0.0005) +[2023-03-11 18:40:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9622.1). Total num frames: 19357696. Throughput: 0: 9599.6. Samples: 19334592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:40:59,012][65744] Avg episode reward: [(0, '3610.707')] +[2023-03-11 18:41:00,512][66031] Updated weights for policy 0, policy_version 37840 (0.0005) +[2023-03-11 18:41:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9636.0). Total num frames: 19406848. Throughput: 0: 9640.5. Samples: 19394220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:41:04,012][65744] Avg episode reward: [(0, '3211.386')] +[2023-03-11 18:41:04,046][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000037912_19410944.pth... +[2023-03-11 18:41:04,047][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000037336_19116032.pth +[2023-03-11 18:41:04,451][66031] Updated weights for policy 0, policy_version 37920 (0.0004) +[2023-03-11 18:41:08,439][66031] Updated weights for policy 0, policy_version 38000 (0.0005) +[2023-03-11 18:41:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9663.8). Total num frames: 19460096. Throughput: 0: 9727.0. Samples: 19456064. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:41:09,012][65744] Avg episode reward: [(0, '3398.516')] +[2023-03-11 18:41:12,433][66031] Updated weights for policy 0, policy_version 38080 (0.0004) +[2023-03-11 18:41:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9663.8). Total num frames: 19509248. Throughput: 0: 9766.5. Samples: 19486872. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:41:14,012][65744] Avg episode reward: [(0, '3725.814')] +[2023-03-11 18:41:16,434][66031] Updated weights for policy 0, policy_version 38160 (0.0004) +[2023-03-11 18:41:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9663.8). Total num frames: 19562496. Throughput: 0: 9872.8. Samples: 19548092. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:41:19,012][65744] Avg episode reward: [(0, '3425.469')] +[2023-03-11 18:41:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000038208_19562496.pth... +[2023-03-11 18:41:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000037616_19259392.pth +[2023-03-11 18:41:20,432][66031] Updated weights for policy 0, policy_version 38240 (0.0004) +[2023-03-11 18:41:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9663.8). Total num frames: 19611648. Throughput: 0: 9951.2. Samples: 19609284. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:41:24,012][65744] Avg episode reward: [(0, '3198.412')] +[2023-03-11 18:41:24,517][66031] Updated weights for policy 0, policy_version 38320 (0.0005) +[2023-03-11 18:41:28,516][66031] Updated weights for policy 0, policy_version 38400 (0.0004) +[2023-03-11 18:41:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9677.7). Total num frames: 19664896. Throughput: 0: 10000.2. Samples: 19639832. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:41:29,012][65744] Avg episode reward: [(0, '3416.368')] +[2023-03-11 18:41:32,583][66031] Updated weights for policy 0, policy_version 38480 (0.0005) +[2023-03-11 18:41:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9663.8). Total num frames: 19714048. Throughput: 0: 10063.1. Samples: 19700280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:41:34,012][65744] Avg episode reward: [(0, '4097.383')] +[2023-03-11 18:41:34,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000038504_19714048.pth... +[2023-03-11 18:41:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000037912_19410944.pth +[2023-03-11 18:41:36,629][66031] Updated weights for policy 0, policy_version 38560 (0.0004) +[2023-03-11 18:41:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9663.8). Total num frames: 19763200. Throughput: 0: 10129.1. Samples: 19761344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:41:39,012][65744] Avg episode reward: [(0, '3852.146')] +[2023-03-11 18:41:40,580][66031] Updated weights for policy 0, policy_version 38640 (0.0004) +[2023-03-11 18:41:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9663.8). Total num frames: 19816448. Throughput: 0: 10170.7. Samples: 19792272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:41:44,012][65744] Avg episode reward: [(0, '3912.023')] +[2023-03-11 18:41:44,599][66031] Updated weights for policy 0, policy_version 38720 (0.0005) +[2023-03-11 18:41:48,681][66031] Updated weights for policy 0, policy_version 38800 (0.0005) +[2023-03-11 18:41:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9677.7). Total num frames: 19865600. Throughput: 0: 10202.2. Samples: 19853320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:41:49,012][65744] Avg episode reward: [(0, '4083.911')] +[2023-03-11 18:41:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000038800_19865600.pth... +[2023-03-11 18:41:49,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000038208_19562496.pth +[2023-03-11 18:41:52,672][66031] Updated weights for policy 0, policy_version 38880 (0.0004) +[2023-03-11 18:41:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9691.6). Total num frames: 19918848. Throughput: 0: 10193.2. Samples: 19914760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:41:54,012][65744] Avg episode reward: [(0, '3982.360')] +[2023-03-11 18:41:56,953][66031] Updated weights for policy 0, policy_version 38960 (0.0004) +[2023-03-11 18:41:59,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 9691.6). Total num frames: 19963904. Throughput: 0: 10144.8. Samples: 19943388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:41:59,012][65744] Avg episode reward: [(0, '3618.896')] +[2023-03-11 18:42:01,225][66031] Updated weights for policy 0, policy_version 39040 (0.0005) +[2023-03-11 18:42:04,012][65744] Fps is (10 sec: 9420.7, 60 sec: 10103.5, 300 sec: 9705.4). Total num frames: 20013056. Throughput: 0: 10058.0. Samples: 20000704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:42:04,012][65744] Avg episode reward: [(0, '3488.150')] +[2023-03-11 18:42:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000039088_20013056.pth... +[2023-03-11 18:42:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000038504_19714048.pth +[2023-03-11 18:42:05,552][66031] Updated weights for policy 0, policy_version 39120 (0.0005) +[2023-03-11 18:42:09,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9705.4). Total num frames: 20062208. Throughput: 0: 9966.8. Samples: 20057788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:42:09,012][65744] Avg episode reward: [(0, '3477.572')] +[2023-03-11 18:42:09,723][66031] Updated weights for policy 0, policy_version 39200 (0.0005) +[2023-03-11 18:42:13,768][66031] Updated weights for policy 0, policy_version 39280 (0.0003) +[2023-03-11 18:42:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 9719.3). Total num frames: 20111360. Throughput: 0: 9955.6. Samples: 20087836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:42:14,012][65744] Avg episode reward: [(0, '3842.485')] +[2023-03-11 18:42:17,734][66031] Updated weights for policy 0, policy_version 39360 (0.0003) +[2023-03-11 18:42:19,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9733.2). Total num frames: 20164608. Throughput: 0: 9985.0. Samples: 20149604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:42:19,012][65744] Avg episode reward: [(0, '3776.607')] +[2023-03-11 18:42:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000039384_20164608.pth... +[2023-03-11 18:42:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000038800_19865600.pth +[2023-03-11 18:42:21,709][66031] Updated weights for policy 0, policy_version 39440 (0.0003) +[2023-03-11 18:42:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9747.1). Total num frames: 20213760. Throughput: 0: 9984.4. Samples: 20210640. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:42:24,012][65744] Avg episode reward: [(0, '4101.063')] +[2023-03-11 18:42:25,698][66031] Updated weights for policy 0, policy_version 39520 (0.0004) +[2023-03-11 18:42:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9774.9). Total num frames: 20267008. Throughput: 0: 10000.2. Samples: 20242280. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:42:29,012][65744] Avg episode reward: [(0, '3935.873')] +[2023-03-11 18:42:29,726][66031] Updated weights for policy 0, policy_version 39600 (0.0004) +[2023-03-11 18:42:33,856][66031] Updated weights for policy 0, policy_version 39680 (0.0005) +[2023-03-11 18:42:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9774.9). Total num frames: 20316160. Throughput: 0: 9969.6. Samples: 20301952. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:42:34,012][65744] Avg episode reward: [(0, '3783.542')] +[2023-03-11 18:42:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000039680_20316160.pth... +[2023-03-11 18:42:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000039088_20013056.pth +[2023-03-11 18:42:37,863][66031] Updated weights for policy 0, policy_version 39760 (0.0005) +[2023-03-11 18:42:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9774.9). Total num frames: 20365312. Throughput: 0: 9959.3. Samples: 20362928. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:42:39,012][65744] Avg episode reward: [(0, '3732.326')] +[2023-03-11 18:42:41,947][66031] Updated weights for policy 0, policy_version 39840 (0.0004) +[2023-03-11 18:42:44,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9802.6). Total num frames: 20418560. Throughput: 0: 10001.0. Samples: 20393436. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:42:44,013][65744] Avg episode reward: [(0, '4088.394')] +[2023-03-11 18:42:45,928][66031] Updated weights for policy 0, policy_version 39920 (0.0004) +[2023-03-11 18:42:49,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9816.5). Total num frames: 20467712. Throughput: 0: 10086.8. Samples: 20454612. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:42:49,012][65744] Avg episode reward: [(0, '3579.261')] +[2023-03-11 18:42:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000039976_20467712.pth... +[2023-03-11 18:42:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000039384_20164608.pth +[2023-03-11 18:42:49,911][66031] Updated weights for policy 0, policy_version 40000 (0.0005) +[2023-03-11 18:42:53,889][66031] Updated weights for policy 0, policy_version 40080 (0.0004) +[2023-03-11 18:42:54,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9830.4). Total num frames: 20520960. Throughput: 0: 10195.9. Samples: 20516604. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:42:54,012][65744] Avg episode reward: [(0, '3869.182')] +[2023-03-11 18:42:57,802][66031] Updated weights for policy 0, policy_version 40160 (0.0004) +[2023-03-11 18:42:59,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 9844.3). Total num frames: 20570112. Throughput: 0: 10218.6. Samples: 20547672. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:42:59,012][65744] Avg episode reward: [(0, '3583.863')] +[2023-03-11 18:43:01,872][66031] Updated weights for policy 0, policy_version 40240 (0.0005) +[2023-03-11 18:43:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 9830.4). Total num frames: 20619264. Throughput: 0: 10187.1. Samples: 20608024. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:43:04,012][65744] Avg episode reward: [(0, '3596.012')] +[2023-03-11 18:43:04,024][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000040280_20623360.pth... +[2023-03-11 18:43:04,025][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000039680_20316160.pth +[2023-03-11 18:43:06,189][66031] Updated weights for policy 0, policy_version 40320 (0.0005) +[2023-03-11 18:43:09,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 9830.4). Total num frames: 20668416. Throughput: 0: 10114.4. Samples: 20665788. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 18:43:09,012][65744] Avg episode reward: [(0, '3972.772')] +[2023-03-11 18:43:10,458][66031] Updated weights for policy 0, policy_version 40400 (0.0005) +[2023-03-11 18:43:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 9816.5). Total num frames: 20717568. Throughput: 0: 10048.4. Samples: 20694456. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:43:14,012][65744] Avg episode reward: [(0, '3805.302')] +[2023-03-11 18:43:14,817][66031] Updated weights for policy 0, policy_version 40480 (0.0005) +[2023-03-11 18:43:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9802.6). Total num frames: 20762624. Throughput: 0: 9964.3. Samples: 20750344. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:43:19,012][65744] Avg episode reward: [(0, '3083.613')] +[2023-03-11 18:43:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000040552_20762624.pth... +[2023-03-11 18:43:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000039976_20467712.pth +[2023-03-11 18:43:19,225][66031] Updated weights for policy 0, policy_version 40560 (0.0005) +[2023-03-11 18:43:23,329][66031] Updated weights for policy 0, policy_version 40640 (0.0005) +[2023-03-11 18:43:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9802.6). Total num frames: 20811776. Throughput: 0: 9907.1. Samples: 20808748. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:43:24,012][65744] Avg episode reward: [(0, '3811.229')] +[2023-03-11 18:43:27,584][66031] Updated weights for policy 0, policy_version 40720 (0.0005) +[2023-03-11 18:43:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9802.6). Total num frames: 20860928. Throughput: 0: 9881.0. Samples: 20838080. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:43:29,012][65744] Avg episode reward: [(0, '3821.190')] +[2023-03-11 18:43:31,936][66031] Updated weights for policy 0, policy_version 40800 (0.0005) +[2023-03-11 18:43:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9788.7). Total num frames: 20905984. Throughput: 0: 9766.4. Samples: 20894100. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:43:34,012][65744] Avg episode reward: [(0, '3505.159')] +[2023-03-11 18:43:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000040832_20905984.pth... +[2023-03-11 18:43:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000040280_20623360.pth +[2023-03-11 18:43:36,364][66031] Updated weights for policy 0, policy_version 40880 (0.0005) +[2023-03-11 18:43:39,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 9788.7). Total num frames: 20955136. Throughput: 0: 9654.5. Samples: 20951056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:43:39,012][65744] Avg episode reward: [(0, '3785.197')] +[2023-03-11 18:43:40,650][66031] Updated weights for policy 0, policy_version 40960 (0.0005) +[2023-03-11 18:43:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 21000192. Throughput: 0: 9600.9. Samples: 20979712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:43:44,012][65744] Avg episode reward: [(0, '3810.121')] +[2023-03-11 18:43:44,882][66031] Updated weights for policy 0, policy_version 41040 (0.0005) +[2023-03-11 18:43:49,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 21049344. Throughput: 0: 9535.6. Samples: 21037128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:43:49,012][65744] Avg episode reward: [(0, '3342.308')] +[2023-03-11 18:43:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000041112_21049344.pth... +[2023-03-11 18:43:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000040552_20762624.pth +[2023-03-11 18:43:49,121][66031] Updated weights for policy 0, policy_version 41120 (0.0004) +[2023-03-11 18:43:53,311][66031] Updated weights for policy 0, policy_version 41200 (0.0005) +[2023-03-11 18:43:54,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9788.7). Total num frames: 21098496. Throughput: 0: 9548.8. Samples: 21095484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:43:54,012][65744] Avg episode reward: [(0, '3620.869')] +[2023-03-11 18:43:57,616][66031] Updated weights for policy 0, policy_version 41280 (0.0005) +[2023-03-11 18:43:59,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9788.7). Total num frames: 21147648. Throughput: 0: 9552.5. Samples: 21124320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:43:59,012][65744] Avg episode reward: [(0, '3840.208')] +[2023-03-11 18:44:01,961][66031] Updated weights for policy 0, policy_version 41360 (0.0005) +[2023-03-11 18:44:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9774.9). Total num frames: 21192704. Throughput: 0: 9562.4. Samples: 21180652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:44:04,012][65744] Avg episode reward: [(0, '3911.959')] +[2023-03-11 18:44:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000041392_21192704.pth... +[2023-03-11 18:44:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000040832_20905984.pth +[2023-03-11 18:44:06,192][66031] Updated weights for policy 0, policy_version 41440 (0.0005) +[2023-03-11 18:44:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9788.7). Total num frames: 21241856. Throughput: 0: 9549.9. Samples: 21238492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:44:09,012][65744] Avg episode reward: [(0, '3093.477')] +[2023-03-11 18:44:10,512][66031] Updated weights for policy 0, policy_version 41520 (0.0005) +[2023-03-11 18:44:14,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9774.9). Total num frames: 21286912. Throughput: 0: 9525.5. Samples: 21266728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:44:14,012][65744] Avg episode reward: [(0, '3507.384')] +[2023-03-11 18:44:14,841][66031] Updated weights for policy 0, policy_version 41600 (0.0005) +[2023-03-11 18:44:18,927][66031] Updated weights for policy 0, policy_version 41680 (0.0005) +[2023-03-11 18:44:19,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9802.6). Total num frames: 21340160. Throughput: 0: 9575.3. Samples: 21324988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:44:19,012][65744] Avg episode reward: [(0, '3874.360')] +[2023-03-11 18:44:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000041680_21340160.pth... +[2023-03-11 18:44:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000041112_21049344.pth +[2023-03-11 18:44:22,987][66031] Updated weights for policy 0, policy_version 41760 (0.0004) +[2023-03-11 18:44:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9625.6, 300 sec: 9802.6). Total num frames: 21389312. Throughput: 0: 9651.7. Samples: 21385384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:44:24,012][65744] Avg episode reward: [(0, '4137.573')] +[2023-03-11 18:44:27,122][66031] Updated weights for policy 0, policy_version 41840 (0.0005) +[2023-03-11 18:44:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9802.6). Total num frames: 21438464. Throughput: 0: 9682.8. Samples: 21415436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:44:29,012][65744] Avg episode reward: [(0, '3789.211')] +[2023-03-11 18:44:31,122][66031] Updated weights for policy 0, policy_version 41920 (0.0004) +[2023-03-11 18:44:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 21491712. Throughput: 0: 9758.3. Samples: 21476252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:44:34,012][65744] Avg episode reward: [(0, '3325.705')] +[2023-03-11 18:44:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000041976_21491712.pth... +[2023-03-11 18:44:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000041392_21192704.pth +[2023-03-11 18:44:35,122][66031] Updated weights for policy 0, policy_version 42000 (0.0005) +[2023-03-11 18:44:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 21540864. Throughput: 0: 9835.5. Samples: 21538080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:44:39,012][65744] Avg episode reward: [(0, '4064.741')] +[2023-03-11 18:44:39,090][66031] Updated weights for policy 0, policy_version 42080 (0.0004) +[2023-03-11 18:44:43,134][66031] Updated weights for policy 0, policy_version 42160 (0.0004) +[2023-03-11 18:44:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9858.2). Total num frames: 21594112. Throughput: 0: 9885.8. Samples: 21569184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:44:44,012][65744] Avg episode reward: [(0, '3970.092')] +[2023-03-11 18:44:47,189][66031] Updated weights for policy 0, policy_version 42240 (0.0004) +[2023-03-11 18:44:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9858.2). Total num frames: 21643264. Throughput: 0: 9973.7. Samples: 21629468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:44:49,012][65744] Avg episode reward: [(0, '3593.911')] +[2023-03-11 18:44:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000042272_21643264.pth... +[2023-03-11 18:44:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000041680_21340160.pth +[2023-03-11 18:44:51,323][66031] Updated weights for policy 0, policy_version 42320 (0.0005) +[2023-03-11 18:44:54,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 21692416. Throughput: 0: 10023.3. Samples: 21689540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:44:54,012][65744] Avg episode reward: [(0, '3770.064')] +[2023-03-11 18:44:55,294][66031] Updated weights for policy 0, policy_version 42400 (0.0004) +[2023-03-11 18:44:59,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 21745664. Throughput: 0: 10094.1. Samples: 21720960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:44:59,012][65744] Avg episode reward: [(0, '3334.132')] +[2023-03-11 18:44:59,248][66031] Updated weights for policy 0, policy_version 42480 (0.0004) +[2023-03-11 18:45:03,328][66031] Updated weights for policy 0, policy_version 42560 (0.0004) +[2023-03-11 18:45:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9885.9). Total num frames: 21794816. Throughput: 0: 10161.0. Samples: 21782232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:45:04,012][65744] Avg episode reward: [(0, '3360.298')] +[2023-03-11 18:45:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000042568_21794816.pth... +[2023-03-11 18:45:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000041976_21491712.pth +[2023-03-11 18:45:07,369][66031] Updated weights for policy 0, policy_version 42640 (0.0004) +[2023-03-11 18:45:09,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9913.7). Total num frames: 21848064. Throughput: 0: 10162.7. Samples: 21842704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:45:09,012][65744] Avg episode reward: [(0, '3091.021')] +[2023-03-11 18:45:11,424][66031] Updated weights for policy 0, policy_version 42720 (0.0005) +[2023-03-11 18:45:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9913.7). Total num frames: 21897216. Throughput: 0: 10161.7. Samples: 21872712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:45:14,012][65744] Avg episode reward: [(0, '3784.552')] +[2023-03-11 18:45:15,441][66031] Updated weights for policy 0, policy_version 42800 (0.0004) +[2023-03-11 18:45:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9927.6). Total num frames: 21950464. Throughput: 0: 10175.4. Samples: 21934144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:45:19,012][65744] Avg episode reward: [(0, '3524.580')] +[2023-03-11 18:45:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000042872_21950464.pth... +[2023-03-11 18:45:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000042272_21643264.pth +[2023-03-11 18:45:19,397][66031] Updated weights for policy 0, policy_version 42880 (0.0004) +[2023-03-11 18:45:23,371][66031] Updated weights for policy 0, policy_version 42960 (0.0005) +[2023-03-11 18:45:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9927.6). Total num frames: 21999616. Throughput: 0: 10177.5. Samples: 21996068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:45:24,012][65744] Avg episode reward: [(0, '4020.356')] +[2023-03-11 18:45:27,353][66031] Updated weights for policy 0, policy_version 43040 (0.0005) +[2023-03-11 18:45:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9941.5). Total num frames: 22048768. Throughput: 0: 10182.0. Samples: 22027372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:45:29,012][65744] Avg episode reward: [(0, '4046.476')] +[2023-03-11 18:45:31,746][66031] Updated weights for policy 0, policy_version 43120 (0.0005) +[2023-03-11 18:45:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 22097920. Throughput: 0: 10107.7. Samples: 22084316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:45:34,012][65744] Avg episode reward: [(0, '3595.319')] +[2023-03-11 18:45:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000043160_22097920.pth... +[2023-03-11 18:45:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000042568_21794816.pth +[2023-03-11 18:45:36,029][66031] Updated weights for policy 0, policy_version 43200 (0.0005) +[2023-03-11 18:45:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 22142976. Throughput: 0: 10041.3. Samples: 22141400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:45:39,012][65744] Avg episode reward: [(0, '3891.368')] +[2023-03-11 18:45:40,348][66031] Updated weights for policy 0, policy_version 43280 (0.0005) +[2023-03-11 18:45:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 22192128. Throughput: 0: 9983.8. Samples: 22170232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:45:44,012][65744] Avg episode reward: [(0, '3708.495')] +[2023-03-11 18:45:44,604][66031] Updated weights for policy 0, policy_version 43360 (0.0005) +[2023-03-11 18:45:48,893][66031] Updated weights for policy 0, policy_version 43440 (0.0005) +[2023-03-11 18:45:49,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 22241280. Throughput: 0: 9903.3. Samples: 22227880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:45:49,012][65744] Avg episode reward: [(0, '4187.927')] +[2023-03-11 18:45:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000043440_22241280.pth... +[2023-03-11 18:45:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000042872_21950464.pth +[2023-03-11 18:45:53,106][66031] Updated weights for policy 0, policy_version 43520 (0.0004) +[2023-03-11 18:45:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 22290432. Throughput: 0: 9848.4. Samples: 22285880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:45:54,023][65744] Avg episode reward: [(0, '3920.040')] +[2023-03-11 18:45:57,102][66031] Updated weights for policy 0, policy_version 43600 (0.0004) +[2023-03-11 18:45:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 22339584. Throughput: 0: 9856.4. Samples: 22316248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:45:59,012][65744] Avg episode reward: [(0, '3950.502')] +[2023-03-11 18:46:01,146][66031] Updated weights for policy 0, policy_version 43680 (0.0004) +[2023-03-11 18:46:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9927.6). Total num frames: 22388736. Throughput: 0: 9840.1. Samples: 22376948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:46:04,012][65744] Avg episode reward: [(0, '4180.032')] +[2023-03-11 18:46:04,027][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000043736_22392832.pth... +[2023-03-11 18:46:04,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000043160_22097920.pth +[2023-03-11 18:46:05,221][66031] Updated weights for policy 0, policy_version 43760 (0.0004) +[2023-03-11 18:46:09,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 22441984. Throughput: 0: 9802.2. Samples: 22437168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:46:09,012][65744] Avg episode reward: [(0, '4190.972')] +[2023-03-11 18:46:09,372][66031] Updated weights for policy 0, policy_version 43840 (0.0005) +[2023-03-11 18:46:13,403][66031] Updated weights for policy 0, policy_version 43920 (0.0004) +[2023-03-11 18:46:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9927.6). Total num frames: 22491136. Throughput: 0: 9766.1. Samples: 22466848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:46:14,012][65744] Avg episode reward: [(0, '4091.409')] +[2023-03-11 18:46:17,526][66031] Updated weights for policy 0, policy_version 44000 (0.0005) +[2023-03-11 18:46:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9927.6). Total num frames: 22540288. Throughput: 0: 9855.7. Samples: 22527824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:46:19,012][65744] Avg episode reward: [(0, '4016.299')] +[2023-03-11 18:46:19,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000044024_22540288.pth... +[2023-03-11 18:46:19,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000043440_22241280.pth +[2023-03-11 18:46:21,619][66031] Updated weights for policy 0, policy_version 44080 (0.0005) +[2023-03-11 18:46:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9913.7). Total num frames: 22589440. Throughput: 0: 9904.9. Samples: 22587120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:46:24,012][65744] Avg episode reward: [(0, '4013.111')] +[2023-03-11 18:46:25,676][66031] Updated weights for policy 0, policy_version 44160 (0.0004) +[2023-03-11 18:46:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9927.6). Total num frames: 22642688. Throughput: 0: 9952.8. Samples: 22618108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:46:29,012][65744] Avg episode reward: [(0, '3495.213')] +[2023-03-11 18:46:29,773][66031] Updated weights for policy 0, policy_version 44240 (0.0004) +[2023-03-11 18:46:33,793][66031] Updated weights for policy 0, policy_version 44320 (0.0004) +[2023-03-11 18:46:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9927.6). Total num frames: 22691840. Throughput: 0: 9997.1. Samples: 22677752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:46:34,012][65744] Avg episode reward: [(0, '3576.538')] +[2023-03-11 18:46:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000044320_22691840.pth... +[2023-03-11 18:46:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000043736_22392832.pth +[2023-03-11 18:46:37,768][66031] Updated weights for policy 0, policy_version 44400 (0.0004) +[2023-03-11 18:46:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 22745088. Throughput: 0: 10085.6. Samples: 22739732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:46:39,012][65744] Avg episode reward: [(0, '3996.632')] +[2023-03-11 18:46:41,818][66031] Updated weights for policy 0, policy_version 44480 (0.0005) +[2023-03-11 18:46:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 22794240. Throughput: 0: 10079.8. Samples: 22769840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:46:44,012][65744] Avg episode reward: [(0, '3854.907')] +[2023-03-11 18:46:46,068][66031] Updated weights for policy 0, policy_version 44560 (0.0004) +[2023-03-11 18:46:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 22839296. Throughput: 0: 10003.5. Samples: 22827104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:46:49,012][65744] Avg episode reward: [(0, '3816.745')] +[2023-03-11 18:46:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000044608_22839296.pth... +[2023-03-11 18:46:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000044024_22540288.pth +[2023-03-11 18:46:50,479][66031] Updated weights for policy 0, policy_version 44640 (0.0005) +[2023-03-11 18:46:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 22888448. Throughput: 0: 9931.8. Samples: 22884100. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:46:54,012][65744] Avg episode reward: [(0, '3724.527')] +[2023-03-11 18:46:54,788][66031] Updated weights for policy 0, policy_version 44720 (0.0005) +[2023-03-11 18:46:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9899.8). Total num frames: 22933504. Throughput: 0: 9911.2. Samples: 22912852. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:46:59,012][65744] Avg episode reward: [(0, '3685.954')] +[2023-03-11 18:46:59,095][66031] Updated weights for policy 0, policy_version 44800 (0.0005) +[2023-03-11 18:47:03,452][66031] Updated weights for policy 0, policy_version 44880 (0.0005) +[2023-03-11 18:47:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9899.8). Total num frames: 22982656. Throughput: 0: 9801.6. Samples: 22968896. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:47:04,012][65744] Avg episode reward: [(0, '3927.946')] +[2023-03-11 18:47:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000044888_22982656.pth... +[2023-03-11 18:47:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000044320_22691840.pth +[2023-03-11 18:47:07,791][66031] Updated weights for policy 0, policy_version 44960 (0.0005) +[2023-03-11 18:47:09,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9762.2, 300 sec: 9885.9). Total num frames: 23027712. Throughput: 0: 9758.2. Samples: 23026236. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:47:09,012][65744] Avg episode reward: [(0, '4071.031')] +[2023-03-11 18:47:12,067][66031] Updated weights for policy 0, policy_version 45040 (0.0005) +[2023-03-11 18:47:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9872.1). Total num frames: 23076864. Throughput: 0: 9708.4. Samples: 23054988. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:47:14,012][65744] Avg episode reward: [(0, '3874.038')] +[2023-03-11 18:47:16,423][66031] Updated weights for policy 0, policy_version 45120 (0.0005) +[2023-03-11 18:47:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.2, 300 sec: 9872.1). Total num frames: 23126016. Throughput: 0: 9632.0. Samples: 23111192. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:47:19,012][65744] Avg episode reward: [(0, '4276.330')] +[2023-03-11 18:47:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000045168_23126016.pth... +[2023-03-11 18:47:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000044608_22839296.pth +[2023-03-11 18:47:20,729][66031] Updated weights for policy 0, policy_version 45200 (0.0005) +[2023-03-11 18:47:24,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9844.3). Total num frames: 23171072. Throughput: 0: 9517.7. Samples: 23168028. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:47:24,012][65744] Avg episode reward: [(0, '4158.993')] +[2023-03-11 18:47:25,030][66031] Updated weights for policy 0, policy_version 45280 (0.0005) +[2023-03-11 18:47:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9844.3). Total num frames: 23220224. Throughput: 0: 9477.0. Samples: 23196304. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:47:29,012][65744] Avg episode reward: [(0, '3801.142')] +[2023-03-11 18:47:29,416][66031] Updated weights for policy 0, policy_version 45360 (0.0005) +[2023-03-11 18:47:33,780][66031] Updated weights for policy 0, policy_version 45440 (0.0005) +[2023-03-11 18:47:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9830.4). Total num frames: 23265280. Throughput: 0: 9465.6. Samples: 23253056. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:47:34,012][65744] Avg episode reward: [(0, '3845.833')] +[2023-03-11 18:47:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000045440_23265280.pth... +[2023-03-11 18:47:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000044888_22982656.pth +[2023-03-11 18:47:38,065][66031] Updated weights for policy 0, policy_version 45520 (0.0005) +[2023-03-11 18:47:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9816.5). Total num frames: 23314432. Throughput: 0: 9472.1. Samples: 23310344. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:47:39,012][65744] Avg episode reward: [(0, '3968.513')] +[2023-03-11 18:47:42,341][66031] Updated weights for policy 0, policy_version 45600 (0.0005) +[2023-03-11 18:47:44,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9802.6). Total num frames: 23359488. Throughput: 0: 9470.1. Samples: 23339008. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 18:47:44,012][65744] Avg episode reward: [(0, '3719.641')] +[2023-03-11 18:47:46,588][66031] Updated weights for policy 0, policy_version 45680 (0.0005) +[2023-03-11 18:47:49,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9788.7). Total num frames: 23408640. Throughput: 0: 9500.4. Samples: 23396416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:47:49,012][65744] Avg episode reward: [(0, '3817.006')] +[2023-03-11 18:47:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000045720_23408640.pth... +[2023-03-11 18:47:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000045168_23126016.pth +[2023-03-11 18:47:50,770][66031] Updated weights for policy 0, policy_version 45760 (0.0004) +[2023-03-11 18:47:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9557.3, 300 sec: 9802.6). Total num frames: 23461888. Throughput: 0: 9558.0. Samples: 23456348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:47:54,012][65744] Avg episode reward: [(0, '3670.894')] +[2023-03-11 18:47:54,826][66031] Updated weights for policy 0, policy_version 45840 (0.0004) +[2023-03-11 18:47:58,806][66031] Updated weights for policy 0, policy_version 45920 (0.0004) +[2023-03-11 18:47:59,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9625.6, 300 sec: 9802.6). Total num frames: 23511040. Throughput: 0: 9589.8. Samples: 23486528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:47:59,012][65744] Avg episode reward: [(0, '4005.146')] +[2023-03-11 18:48:02,874][66031] Updated weights for policy 0, policy_version 46000 (0.0005) +[2023-03-11 18:48:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9802.6). Total num frames: 23560192. Throughput: 0: 9706.1. Samples: 23547968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:48:04,012][65744] Avg episode reward: [(0, '4076.339')] +[2023-03-11 18:48:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000046016_23560192.pth... +[2023-03-11 18:48:04,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000045440_23265280.pth +[2023-03-11 18:48:06,956][66031] Updated weights for policy 0, policy_version 46080 (0.0004) +[2023-03-11 18:48:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9816.5). Total num frames: 23613440. Throughput: 0: 9786.7. Samples: 23608428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:48:09,012][65744] Avg episode reward: [(0, '3748.989')] +[2023-03-11 18:48:11,063][66031] Updated weights for policy 0, policy_version 46160 (0.0004) +[2023-03-11 18:48:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9816.5). Total num frames: 23658496. Throughput: 0: 9815.8. Samples: 23638016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:48:14,012][65744] Avg episode reward: [(0, '3423.638')] +[2023-03-11 18:48:15,325][66031] Updated weights for policy 0, policy_version 46240 (0.0005) +[2023-03-11 18:48:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9816.5). Total num frames: 23707648. Throughput: 0: 9830.4. Samples: 23695424. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:48:19,012][65744] Avg episode reward: [(0, '3944.340')] +[2023-03-11 18:48:19,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000046304_23707648.pth... +[2023-03-11 18:48:19,016][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000045720_23408640.pth +[2023-03-11 18:48:19,462][66031] Updated weights for policy 0, policy_version 46320 (0.0005) +[2023-03-11 18:48:23,579][66031] Updated weights for policy 0, policy_version 46400 (0.0005) +[2023-03-11 18:48:24,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 23760896. Throughput: 0: 9901.4. Samples: 23755908. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:48:24,012][65744] Avg episode reward: [(0, '4005.181')] +[2023-03-11 18:48:27,775][66031] Updated weights for policy 0, policy_version 46480 (0.0005) +[2023-03-11 18:48:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 23805952. Throughput: 0: 9918.8. Samples: 23785356. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:48:29,012][65744] Avg episode reward: [(0, '3176.984')] +[2023-03-11 18:48:32,054][66031] Updated weights for policy 0, policy_version 46560 (0.0005) +[2023-03-11 18:48:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 23855104. Throughput: 0: 9920.0. Samples: 23842816. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:48:34,012][65744] Avg episode reward: [(0, '2977.383')] +[2023-03-11 18:48:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000046592_23855104.pth... +[2023-03-11 18:48:34,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000046016_23560192.pth +[2023-03-11 18:48:36,368][66031] Updated weights for policy 0, policy_version 46640 (0.0005) +[2023-03-11 18:48:39,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9844.3). Total num frames: 23904256. Throughput: 0: 9856.7. Samples: 23899900. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:48:39,012][65744] Avg episode reward: [(0, '3217.574')] +[2023-03-11 18:48:40,632][66031] Updated weights for policy 0, policy_version 46720 (0.0005) +[2023-03-11 18:48:44,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9844.3). Total num frames: 23953408. Throughput: 0: 9827.4. Samples: 23928760. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:48:44,012][65744] Avg episode reward: [(0, '3614.303')] +[2023-03-11 18:48:44,753][66031] Updated weights for policy 0, policy_version 46800 (0.0004) +[2023-03-11 18:48:48,720][66031] Updated weights for policy 0, policy_version 46880 (0.0004) +[2023-03-11 18:48:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9844.3). Total num frames: 24002560. Throughput: 0: 9806.0. Samples: 23989240. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:48:49,012][65744] Avg episode reward: [(0, '3064.124')] +[2023-03-11 18:48:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000046880_24002560.pth... +[2023-03-11 18:48:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000046304_23707648.pth +[2023-03-11 18:48:52,830][66031] Updated weights for policy 0, policy_version 46960 (0.0005) +[2023-03-11 18:48:54,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9844.3). Total num frames: 24051712. Throughput: 0: 9801.8. Samples: 24049512. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:48:54,012][65744] Avg episode reward: [(0, '3751.802')] +[2023-03-11 18:48:56,913][66031] Updated weights for policy 0, policy_version 47040 (0.0004) +[2023-03-11 18:48:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 24104960. Throughput: 0: 9814.1. Samples: 24079652. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:48:59,012][65744] Avg episode reward: [(0, '3919.897')] +[2023-03-11 18:49:01,008][66031] Updated weights for policy 0, policy_version 47120 (0.0005) +[2023-03-11 18:49:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 24154112. Throughput: 0: 9859.3. Samples: 24139092. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:49:04,012][65744] Avg episode reward: [(0, '3962.489')] +[2023-03-11 18:49:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000047176_24154112.pth... +[2023-03-11 18:49:04,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000046592_23855104.pth +[2023-03-11 18:49:05,297][66031] Updated weights for policy 0, policy_version 47200 (0.0005) +[2023-03-11 18:49:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9872.1). Total num frames: 24199168. Throughput: 0: 9780.7. Samples: 24196040. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:49:09,012][65744] Avg episode reward: [(0, '3429.409')] +[2023-03-11 18:49:09,601][66031] Updated weights for policy 0, policy_version 47280 (0.0005) +[2023-03-11 18:49:13,902][66031] Updated weights for policy 0, policy_version 47360 (0.0005) +[2023-03-11 18:49:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9858.2). Total num frames: 24248320. Throughput: 0: 9748.9. Samples: 24224056. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:49:14,012][65744] Avg episode reward: [(0, '3855.694')] +[2023-03-11 18:49:17,923][66031] Updated weights for policy 0, policy_version 47440 (0.0004) +[2023-03-11 18:49:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9858.2). Total num frames: 24297472. Throughput: 0: 9808.9. Samples: 24284216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:49:19,012][65744] Avg episode reward: [(0, '3814.215')] +[2023-03-11 18:49:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000047456_24297472.pth... +[2023-03-11 18:49:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000046880_24002560.pth +[2023-03-11 18:49:21,920][66031] Updated weights for policy 0, policy_version 47520 (0.0004) +[2023-03-11 18:49:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9872.1). Total num frames: 24350720. Throughput: 0: 9908.1. Samples: 24345764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:49:24,012][65744] Avg episode reward: [(0, '3708.025')] +[2023-03-11 18:49:26,066][66031] Updated weights for policy 0, policy_version 47600 (0.0004) +[2023-03-11 18:49:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9844.3). Total num frames: 24395776. Throughput: 0: 9923.0. Samples: 24375296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:49:29,012][65744] Avg episode reward: [(0, '3639.182')] +[2023-03-11 18:49:30,368][66031] Updated weights for policy 0, policy_version 47680 (0.0005) +[2023-03-11 18:49:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9844.3). Total num frames: 24444928. Throughput: 0: 9846.8. Samples: 24432344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:49:34,012][65744] Avg episode reward: [(0, '3731.420')] +[2023-03-11 18:49:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000047744_24444928.pth... +[2023-03-11 18:49:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000047176_24154112.pth +[2023-03-11 18:49:34,645][66031] Updated weights for policy 0, policy_version 47760 (0.0005) +[2023-03-11 18:49:38,958][66031] Updated weights for policy 0, policy_version 47840 (0.0005) +[2023-03-11 18:49:39,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 24494080. Throughput: 0: 9783.0. Samples: 24489748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:49:39,012][65744] Avg episode reward: [(0, '2737.575')] +[2023-03-11 18:49:43,283][66031] Updated weights for policy 0, policy_version 47920 (0.0005) +[2023-03-11 18:49:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9816.5). Total num frames: 24539136. Throughput: 0: 9737.1. Samples: 24517820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:49:44,012][65744] Avg episode reward: [(0, '3807.621')] +[2023-03-11 18:49:47,597][66031] Updated weights for policy 0, policy_version 48000 (0.0005) +[2023-03-11 18:49:49,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9816.5). Total num frames: 24588288. Throughput: 0: 9686.1. Samples: 24574968. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:49:49,012][65744] Avg episode reward: [(0, '3650.039')] +[2023-03-11 18:49:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000048024_24588288.pth... +[2023-03-11 18:49:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000047456_24297472.pth +[2023-03-11 18:49:51,837][66031] Updated weights for policy 0, policy_version 48080 (0.0004) +[2023-03-11 18:49:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.2, 300 sec: 9802.6). Total num frames: 24637440. Throughput: 0: 9703.5. Samples: 24632696. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:49:54,012][65744] Avg episode reward: [(0, '3982.041')] +[2023-03-11 18:49:56,172][66031] Updated weights for policy 0, policy_version 48160 (0.0005) +[2023-03-11 18:49:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9788.7). Total num frames: 24682496. Throughput: 0: 9703.9. Samples: 24660732. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:49:59,012][65744] Avg episode reward: [(0, '4110.659')] +[2023-03-11 18:50:00,432][66031] Updated weights for policy 0, policy_version 48240 (0.0005) +[2023-03-11 18:50:04,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 24731648. Throughput: 0: 9648.1. Samples: 24718380. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:50:04,012][65744] Avg episode reward: [(0, '4059.361')] +[2023-03-11 18:50:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000048304_24731648.pth... +[2023-03-11 18:50:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000047744_24444928.pth +[2023-03-11 18:50:04,769][66031] Updated weights for policy 0, policy_version 48320 (0.0005) +[2023-03-11 18:50:08,872][66031] Updated weights for policy 0, policy_version 48400 (0.0004) +[2023-03-11 18:50:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 24780800. Throughput: 0: 9576.6. Samples: 24776712. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:50:09,012][65744] Avg episode reward: [(0, '3714.987')] +[2023-03-11 18:50:12,856][66031] Updated weights for policy 0, policy_version 48480 (0.0004) +[2023-03-11 18:50:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9761.0). Total num frames: 24829952. Throughput: 0: 9598.6. Samples: 24807232. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:50:14,012][65744] Avg episode reward: [(0, '3892.265')] +[2023-03-11 18:50:16,871][66031] Updated weights for policy 0, policy_version 48560 (0.0004) +[2023-03-11 18:50:19,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 24883200. Throughput: 0: 9691.7. Samples: 24868472. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:50:19,012][65744] Avg episode reward: [(0, '4038.566')] +[2023-03-11 18:50:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000048600_24883200.pth... +[2023-03-11 18:50:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000048024_24588288.pth +[2023-03-11 18:50:21,154][66031] Updated weights for policy 0, policy_version 48640 (0.0005) +[2023-03-11 18:50:24,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 24928256. Throughput: 0: 9684.8. Samples: 24925564. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:50:24,012][65744] Avg episode reward: [(0, '4242.197')] +[2023-03-11 18:50:25,510][66031] Updated weights for policy 0, policy_version 48720 (0.0005) +[2023-03-11 18:50:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9761.0). Total num frames: 24977408. Throughput: 0: 9670.8. Samples: 24953008. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:50:29,012][65744] Avg episode reward: [(0, '3464.506')] +[2023-03-11 18:50:29,790][66031] Updated weights for policy 0, policy_version 48800 (0.0005) +[2023-03-11 18:50:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 25022464. Throughput: 0: 9694.0. Samples: 25011200. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:50:34,012][65744] Avg episode reward: [(0, '3562.520')] +[2023-03-11 18:50:34,024][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000048880_25026560.pth... +[2023-03-11 18:50:34,024][66031] Updated weights for policy 0, policy_version 48880 (0.0005) +[2023-03-11 18:50:34,025][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000048304_24731648.pth +[2023-03-11 18:50:38,369][66031] Updated weights for policy 0, policy_version 48960 (0.0004) +[2023-03-11 18:50:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 25071616. Throughput: 0: 9675.4. Samples: 25068088. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:50:39,012][65744] Avg episode reward: [(0, '3999.245')] +[2023-03-11 18:50:42,516][66031] Updated weights for policy 0, policy_version 49040 (0.0004) +[2023-03-11 18:50:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9761.0). Total num frames: 25120768. Throughput: 0: 9708.3. Samples: 25097604. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:50:44,012][65744] Avg episode reward: [(0, '4300.301')] +[2023-03-11 18:50:46,761][66031] Updated weights for policy 0, policy_version 49120 (0.0003) +[2023-03-11 18:50:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9761.0). Total num frames: 25169920. Throughput: 0: 9736.9. Samples: 25156540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:50:49,012][65744] Avg episode reward: [(0, '4049.561')] +[2023-03-11 18:50:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000049160_25169920.pth... +[2023-03-11 18:50:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000048600_24883200.pth +[2023-03-11 18:50:50,864][66031] Updated weights for policy 0, policy_version 49200 (0.0004) +[2023-03-11 18:50:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9761.0). Total num frames: 25219072. Throughput: 0: 9739.2. Samples: 25214976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:50:54,012][65744] Avg episode reward: [(0, '4159.770')] +[2023-03-11 18:50:55,215][66031] Updated weights for policy 0, policy_version 49280 (0.0005) +[2023-03-11 18:50:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9747.1). Total num frames: 25264128. Throughput: 0: 9696.2. Samples: 25243560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:50:59,012][65744] Avg episode reward: [(0, '4157.585')] +[2023-03-11 18:50:59,548][66031] Updated weights for policy 0, policy_version 49360 (0.0005) +[2023-03-11 18:51:03,887][66031] Updated weights for policy 0, policy_version 49440 (0.0006) +[2023-03-11 18:51:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9733.2). Total num frames: 25313280. Throughput: 0: 9599.8. Samples: 25300464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:51:04,012][65744] Avg episode reward: [(0, '4077.283')] +[2023-03-11 18:51:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000049440_25313280.pth... +[2023-03-11 18:51:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000048880_25026560.pth +[2023-03-11 18:51:07,903][66031] Updated weights for policy 0, policy_version 49520 (0.0005) +[2023-03-11 18:51:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9733.2). Total num frames: 25362432. Throughput: 0: 9643.9. Samples: 25359540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:51:09,023][65744] Avg episode reward: [(0, '3591.697')] +[2023-03-11 18:51:11,902][66031] Updated weights for policy 0, policy_version 49600 (0.0005) +[2023-03-11 18:51:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9747.1). Total num frames: 25415680. Throughput: 0: 9727.4. Samples: 25390740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:51:14,012][65744] Avg episode reward: [(0, '4012.491')] +[2023-03-11 18:51:15,948][66031] Updated weights for policy 0, policy_version 49680 (0.0005) +[2023-03-11 18:51:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9747.1). Total num frames: 25464832. Throughput: 0: 9778.4. Samples: 25451228. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:51:19,012][65744] Avg episode reward: [(0, '3927.140')] +[2023-03-11 18:51:19,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000049736_25464832.pth... +[2023-03-11 18:51:19,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000049160_25169920.pth +[2023-03-11 18:51:20,173][66031] Updated weights for policy 0, policy_version 49760 (0.0005) +[2023-03-11 18:51:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9719.3). Total num frames: 25509888. Throughput: 0: 9781.6. Samples: 25508260. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:51:24,012][65744] Avg episode reward: [(0, '3314.576')] +[2023-03-11 18:51:24,516][66031] Updated weights for policy 0, policy_version 49840 (0.0005) +[2023-03-11 18:51:28,826][66031] Updated weights for policy 0, policy_version 49920 (0.0005) +[2023-03-11 18:51:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9719.3). Total num frames: 25559040. Throughput: 0: 9751.0. Samples: 25536400. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:51:29,012][65744] Avg episode reward: [(0, '3305.172')] +[2023-03-11 18:51:32,871][66031] Updated weights for policy 0, policy_version 50000 (0.0005) +[2023-03-11 18:51:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9705.4). Total num frames: 25608192. Throughput: 0: 9764.0. Samples: 25595920. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:51:34,012][65744] Avg episode reward: [(0, '3683.878')] +[2023-03-11 18:51:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000050016_25608192.pth... +[2023-03-11 18:51:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000049440_25313280.pth +[2023-03-11 18:51:37,080][66031] Updated weights for policy 0, policy_version 50080 (0.0005) +[2023-03-11 18:51:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9705.4). Total num frames: 25657344. Throughput: 0: 9759.8. Samples: 25654168. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:51:39,023][65744] Avg episode reward: [(0, '3976.973')] +[2023-03-11 18:51:41,397][66031] Updated weights for policy 0, policy_version 50160 (0.0005) +[2023-03-11 18:51:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9705.4). Total num frames: 25702400. Throughput: 0: 9746.3. Samples: 25682144. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 18:51:44,012][65744] Avg episode reward: [(0, '4211.253')] +[2023-03-11 18:51:45,676][66031] Updated weights for policy 0, policy_version 50240 (0.0005) +[2023-03-11 18:51:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9719.3). Total num frames: 25755648. Throughput: 0: 9781.6. Samples: 25740636. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:51:49,012][65744] Avg episode reward: [(0, '4249.128')] +[2023-03-11 18:51:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000050304_25755648.pth... +[2023-03-11 18:51:49,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000049736_25464832.pth +[2023-03-11 18:51:49,725][66031] Updated weights for policy 0, policy_version 50320 (0.0005) +[2023-03-11 18:51:53,781][66031] Updated weights for policy 0, policy_version 50400 (0.0005) +[2023-03-11 18:51:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9733.2). Total num frames: 25804800. Throughput: 0: 9817.8. Samples: 25801340. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:51:54,012][65744] Avg episode reward: [(0, '3882.528')] +[2023-03-11 18:51:58,023][66031] Updated weights for policy 0, policy_version 50480 (0.0005) +[2023-03-11 18:51:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9733.2). Total num frames: 25853952. Throughput: 0: 9780.5. Samples: 25830864. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:51:59,012][65744] Avg episode reward: [(0, '3420.232')] +[2023-03-11 18:52:02,415][66031] Updated weights for policy 0, policy_version 50560 (0.0005) +[2023-03-11 18:52:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9733.2). Total num frames: 25899008. Throughput: 0: 9679.2. Samples: 25886792. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:52:04,012][65744] Avg episode reward: [(0, '3745.970')] +[2023-03-11 18:52:04,027][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000050584_25899008.pth... +[2023-03-11 18:52:04,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000050016_25608192.pth +[2023-03-11 18:52:06,541][66031] Updated weights for policy 0, policy_version 50640 (0.0005) +[2023-03-11 18:52:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9747.1). Total num frames: 25952256. Throughput: 0: 9760.5. Samples: 25947484. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:52:09,012][65744] Avg episode reward: [(0, '3852.426')] +[2023-03-11 18:52:10,586][66031] Updated weights for policy 0, policy_version 50720 (0.0005) +[2023-03-11 18:52:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9747.1). Total num frames: 26001408. Throughput: 0: 9793.0. Samples: 25977084. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:52:14,012][65744] Avg episode reward: [(0, '3958.916')] +[2023-03-11 18:52:14,562][66031] Updated weights for policy 0, policy_version 50800 (0.0005) +[2023-03-11 18:52:18,810][66031] Updated weights for policy 0, policy_version 50880 (0.0004) +[2023-03-11 18:52:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9761.0). Total num frames: 26050560. Throughput: 0: 9801.6. Samples: 26036992. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:52:19,012][65744] Avg episode reward: [(0, '3637.305')] +[2023-03-11 18:52:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000050880_26050560.pth... +[2023-03-11 18:52:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000050304_25755648.pth +[2023-03-11 18:52:22,777][66031] Updated weights for policy 0, policy_version 50960 (0.0004) +[2023-03-11 18:52:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9774.9). Total num frames: 26103808. Throughput: 0: 9872.9. Samples: 26098448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:52:24,012][65744] Avg episode reward: [(0, '3692.848')] +[2023-03-11 18:52:26,800][66031] Updated weights for policy 0, policy_version 51040 (0.0005) +[2023-03-11 18:52:29,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 9788.7). Total num frames: 26152960. Throughput: 0: 9918.2. Samples: 26128464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:52:29,012][65744] Avg episode reward: [(0, '3832.968')] +[2023-03-11 18:52:30,759][66031] Updated weights for policy 0, policy_version 51120 (0.0004) +[2023-03-11 18:52:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9802.6). Total num frames: 26206208. Throughput: 0: 9996.1. Samples: 26190460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:52:34,012][65744] Avg episode reward: [(0, '3925.543')] +[2023-03-11 18:52:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000051184_26206208.pth... +[2023-03-11 18:52:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000050584_25899008.pth +[2023-03-11 18:52:34,744][66031] Updated weights for policy 0, policy_version 51200 (0.0004) +[2023-03-11 18:52:38,848][66031] Updated weights for policy 0, policy_version 51280 (0.0003) +[2023-03-11 18:52:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9816.5). Total num frames: 26255360. Throughput: 0: 9999.9. Samples: 26251336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:52:39,012][65744] Avg episode reward: [(0, '3812.034')] +[2023-03-11 18:52:43,031][66031] Updated weights for policy 0, policy_version 51360 (0.0003) +[2023-03-11 18:52:44,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 9816.5). Total num frames: 26304512. Throughput: 0: 9995.0. Samples: 26280640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:52:44,012][65744] Avg episode reward: [(0, '3981.259')] +[2023-03-11 18:52:47,105][66031] Updated weights for policy 0, policy_version 51440 (0.0003) +[2023-03-11 18:52:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9802.6). Total num frames: 26353664. Throughput: 0: 10092.7. Samples: 26340964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:52:49,012][65744] Avg episode reward: [(0, '3156.030')] +[2023-03-11 18:52:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000051472_26353664.pth... +[2023-03-11 18:52:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000050880_26050560.pth +[2023-03-11 18:52:51,151][66031] Updated weights for policy 0, policy_version 51520 (0.0004) +[2023-03-11 18:52:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9816.5). Total num frames: 26406912. Throughput: 0: 10086.9. Samples: 26401392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:52:54,012][65744] Avg episode reward: [(0, '2945.569')] +[2023-03-11 18:52:55,185][66031] Updated weights for policy 0, policy_version 51600 (0.0005) +[2023-03-11 18:52:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9802.6). Total num frames: 26451968. Throughput: 0: 10098.1. Samples: 26431496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:52:59,012][65744] Avg episode reward: [(0, '3259.510')] +[2023-03-11 18:52:59,506][66031] Updated weights for policy 0, policy_version 51680 (0.0004) +[2023-03-11 18:53:03,660][66031] Updated weights for policy 0, policy_version 51760 (0.0003) +[2023-03-11 18:53:04,012][65744] Fps is (10 sec: 9420.7, 60 sec: 10035.2, 300 sec: 9788.7). Total num frames: 26501120. Throughput: 0: 10053.2. Samples: 26489388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:53:04,012][65744] Avg episode reward: [(0, '3090.467')] +[2023-03-11 18:53:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000051760_26501120.pth... +[2023-03-11 18:53:04,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000051184_26206208.pth +[2023-03-11 18:53:07,904][66031] Updated weights for policy 0, policy_version 51840 (0.0003) +[2023-03-11 18:53:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9802.6). Total num frames: 26550272. Throughput: 0: 9974.0. Samples: 26547276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:53:09,012][65744] Avg episode reward: [(0, '2833.232')] +[2023-03-11 18:53:11,932][66031] Updated weights for policy 0, policy_version 51920 (0.0003) +[2023-03-11 18:53:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9802.6). Total num frames: 26599424. Throughput: 0: 9989.8. Samples: 26578004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:53:14,012][65744] Avg episode reward: [(0, '3293.419')] +[2023-03-11 18:53:16,221][66031] Updated weights for policy 0, policy_version 52000 (0.0005) +[2023-03-11 18:53:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9788.7). Total num frames: 26648576. Throughput: 0: 9908.7. Samples: 26636352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:53:19,012][65744] Avg episode reward: [(0, '3885.897')] +[2023-03-11 18:53:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000052048_26648576.pth... +[2023-03-11 18:53:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000051472_26353664.pth +[2023-03-11 18:53:20,413][66031] Updated weights for policy 0, policy_version 52080 (0.0003) +[2023-03-11 18:53:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9802.6). Total num frames: 26697728. Throughput: 0: 9869.9. Samples: 26695480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:53:24,012][65744] Avg episode reward: [(0, '3915.424')] +[2023-03-11 18:53:24,479][66031] Updated weights for policy 0, policy_version 52160 (0.0003) +[2023-03-11 18:53:28,606][66031] Updated weights for policy 0, policy_version 52240 (0.0003) +[2023-03-11 18:53:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9816.5). Total num frames: 26750976. Throughput: 0: 9885.8. Samples: 26725500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:53:29,012][65744] Avg episode reward: [(0, '3722.200')] +[2023-03-11 18:53:32,581][66031] Updated weights for policy 0, policy_version 52320 (0.0003) +[2023-03-11 18:53:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9816.5). Total num frames: 26800128. Throughput: 0: 9902.9. Samples: 26786596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:53:34,012][65744] Avg episode reward: [(0, '3553.456')] +[2023-03-11 18:53:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000052344_26800128.pth... +[2023-03-11 18:53:34,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000051760_26501120.pth +[2023-03-11 18:53:36,720][66031] Updated weights for policy 0, policy_version 52400 (0.0003) +[2023-03-11 18:53:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9816.5). Total num frames: 26849280. Throughput: 0: 9879.8. Samples: 26845984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:53:39,012][65744] Avg episode reward: [(0, '3512.843')] +[2023-03-11 18:53:40,805][66031] Updated weights for policy 0, policy_version 52480 (0.0003) +[2023-03-11 18:53:44,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9816.5). Total num frames: 26898432. Throughput: 0: 9886.5. Samples: 26876388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:53:44,012][65744] Avg episode reward: [(0, '3725.122')] +[2023-03-11 18:53:44,856][66031] Updated weights for policy 0, policy_version 52560 (0.0003) +[2023-03-11 18:53:48,900][66031] Updated weights for policy 0, policy_version 52640 (0.0003) +[2023-03-11 18:53:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9830.4). Total num frames: 26951680. Throughput: 0: 9953.8. Samples: 26937308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:53:49,012][65744] Avg episode reward: [(0, '3935.818')] +[2023-03-11 18:53:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000052640_26951680.pth... +[2023-03-11 18:53:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000052048_26648576.pth +[2023-03-11 18:53:52,900][66031] Updated weights for policy 0, policy_version 52720 (0.0004) +[2023-03-11 18:53:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9816.5). Total num frames: 27000832. Throughput: 0: 10023.4. Samples: 26998328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:53:54,012][65744] Avg episode reward: [(0, '3727.507')] +[2023-03-11 18:53:56,936][66031] Updated weights for policy 0, policy_version 52800 (0.0005) +[2023-03-11 18:53:59,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9816.5). Total num frames: 27049984. Throughput: 0: 10023.0. Samples: 27029040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:53:59,012][65744] Avg episode reward: [(0, '3556.457')] +[2023-03-11 18:54:01,218][66031] Updated weights for policy 0, policy_version 52880 (0.0005) +[2023-03-11 18:54:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9830.4). Total num frames: 27099136. Throughput: 0: 10011.5. Samples: 27086868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:54:04,012][65744] Avg episode reward: [(0, '3727.322')] +[2023-03-11 18:54:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000052928_27099136.pth... +[2023-03-11 18:54:04,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000052344_26800128.pth +[2023-03-11 18:54:05,448][66031] Updated weights for policy 0, policy_version 52960 (0.0005) +[2023-03-11 18:54:09,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9830.4). Total num frames: 27148288. Throughput: 0: 10031.7. Samples: 27146908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:54:09,012][65744] Avg episode reward: [(0, '2970.068')] +[2023-03-11 18:54:09,416][66031] Updated weights for policy 0, policy_version 53040 (0.0005) +[2023-03-11 18:54:13,396][66031] Updated weights for policy 0, policy_version 53120 (0.0005) +[2023-03-11 18:54:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9844.3). Total num frames: 27201536. Throughput: 0: 10034.6. Samples: 27177056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:54:14,012][65744] Avg episode reward: [(0, '3822.401')] +[2023-03-11 18:54:17,374][66031] Updated weights for policy 0, policy_version 53200 (0.0004) +[2023-03-11 18:54:19,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 9844.3). Total num frames: 27254784. Throughput: 0: 10051.3. Samples: 27238904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:54:19,012][65744] Avg episode reward: [(0, '3940.654')] +[2023-03-11 18:54:19,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000053232_27254784.pth... +[2023-03-11 18:54:19,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000052640_26951680.pth +[2023-03-11 18:54:21,354][66031] Updated weights for policy 0, policy_version 53280 (0.0005) +[2023-03-11 18:54:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9858.2). Total num frames: 27303936. Throughput: 0: 10099.0. Samples: 27300440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:54:24,012][65744] Avg episode reward: [(0, '3881.162')] +[2023-03-11 18:54:25,391][66031] Updated weights for policy 0, policy_version 53360 (0.0005) +[2023-03-11 18:54:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9858.2). Total num frames: 27353088. Throughput: 0: 10108.3. Samples: 27331260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:54:29,012][65744] Avg episode reward: [(0, '4276.704')] +[2023-03-11 18:54:29,481][66031] Updated weights for policy 0, policy_version 53440 (0.0005) +[2023-03-11 18:54:33,445][66031] Updated weights for policy 0, policy_version 53520 (0.0004) +[2023-03-11 18:54:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9872.1). Total num frames: 27406336. Throughput: 0: 10105.0. Samples: 27392032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:54:34,012][65744] Avg episode reward: [(0, '2766.441')] +[2023-03-11 18:54:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000053528_27406336.pth... +[2023-03-11 18:54:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000052928_27099136.pth +[2023-03-11 18:54:37,480][66031] Updated weights for policy 0, policy_version 53600 (0.0005) +[2023-03-11 18:54:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9885.9). Total num frames: 27455488. Throughput: 0: 10093.0. Samples: 27452512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:54:39,012][65744] Avg episode reward: [(0, '4028.170')] +[2023-03-11 18:54:41,582][66031] Updated weights for policy 0, policy_version 53680 (0.0005) +[2023-03-11 18:54:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 9899.8). Total num frames: 27508736. Throughput: 0: 10085.1. Samples: 27482868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:54:44,027][65744] Avg episode reward: [(0, '3926.363')] +[2023-03-11 18:54:45,588][66031] Updated weights for policy 0, policy_version 53760 (0.0005) +[2023-03-11 18:54:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9899.8). Total num frames: 27557888. Throughput: 0: 10155.0. Samples: 27543844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:54:49,012][65744] Avg episode reward: [(0, '3696.454')] +[2023-03-11 18:54:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000053824_27557888.pth... +[2023-03-11 18:54:49,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000053232_27254784.pth +[2023-03-11 18:54:49,664][66031] Updated weights for policy 0, policy_version 53840 (0.0005) +[2023-03-11 18:54:53,937][66031] Updated weights for policy 0, policy_version 53920 (0.0005) +[2023-03-11 18:54:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9913.7). Total num frames: 27607040. Throughput: 0: 10133.9. Samples: 27602932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:54:54,012][65744] Avg episode reward: [(0, '3880.608')] +[2023-03-11 18:54:58,231][66031] Updated weights for policy 0, policy_version 54000 (0.0005) +[2023-03-11 18:54:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 27652096. Throughput: 0: 10098.7. Samples: 27631496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:54:59,012][65744] Avg episode reward: [(0, '4222.011')] +[2023-03-11 18:55:02,480][66031] Updated weights for policy 0, policy_version 54080 (0.0004) +[2023-03-11 18:55:04,012][65744] Fps is (10 sec: 9420.7, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 27701248. Throughput: 0: 10001.2. Samples: 27688960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:55:04,012][65744] Avg episode reward: [(0, '3802.808')] +[2023-03-11 18:55:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000054104_27701248.pth... +[2023-03-11 18:55:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000053528_27406336.pth +[2023-03-11 18:55:06,549][66031] Updated weights for policy 0, policy_version 54160 (0.0003) +[2023-03-11 18:55:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9913.7). Total num frames: 27754496. Throughput: 0: 9981.7. Samples: 27749616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:55:09,012][65744] Avg episode reward: [(0, '3371.907')] +[2023-03-11 18:55:10,556][66031] Updated weights for policy 0, policy_version 54240 (0.0004) +[2023-03-11 18:55:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 27803648. Throughput: 0: 9960.3. Samples: 27779472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:55:14,012][65744] Avg episode reward: [(0, '3956.992')] +[2023-03-11 18:55:14,626][66031] Updated weights for policy 0, policy_version 54320 (0.0005) +[2023-03-11 18:55:18,700][66031] Updated weights for policy 0, policy_version 54400 (0.0004) +[2023-03-11 18:55:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 27852800. Throughput: 0: 9960.0. Samples: 27840232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:55:19,012][65744] Avg episode reward: [(0, '3649.661')] +[2023-03-11 18:55:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000054400_27852800.pth... +[2023-03-11 18:55:19,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000053824_27557888.pth +[2023-03-11 18:55:22,842][66031] Updated weights for policy 0, policy_version 54480 (0.0005) +[2023-03-11 18:55:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 27901952. Throughput: 0: 9939.7. Samples: 27899800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:55:24,012][65744] Avg episode reward: [(0, '3889.901')] +[2023-03-11 18:55:27,132][66031] Updated weights for policy 0, policy_version 54560 (0.0005) +[2023-03-11 18:55:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 27951104. Throughput: 0: 9892.4. Samples: 27928028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:55:29,012][65744] Avg episode reward: [(0, '3973.384')] +[2023-03-11 18:55:31,416][66031] Updated weights for policy 0, policy_version 54640 (0.0005) +[2023-03-11 18:55:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9927.6). Total num frames: 28000256. Throughput: 0: 9808.5. Samples: 27985228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:55:34,012][65744] Avg episode reward: [(0, '3982.079')] +[2023-03-11 18:55:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000054688_28000256.pth... +[2023-03-11 18:55:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000054104_27701248.pth +[2023-03-11 18:55:35,656][66031] Updated weights for policy 0, policy_version 54720 (0.0004) +[2023-03-11 18:55:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9913.7). Total num frames: 28045312. Throughput: 0: 9782.4. Samples: 28043140. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:55:39,012][65744] Avg episode reward: [(0, '4051.971')] +[2023-03-11 18:55:39,960][66031] Updated weights for policy 0, policy_version 54800 (0.0005) +[2023-03-11 18:55:44,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 9913.7). Total num frames: 28094464. Throughput: 0: 9783.8. Samples: 28071768. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:55:44,012][65744] Avg episode reward: [(0, '4245.294')] +[2023-03-11 18:55:44,252][66031] Updated weights for policy 0, policy_version 54880 (0.0005) +[2023-03-11 18:55:48,532][66031] Updated weights for policy 0, policy_version 54960 (0.0005) +[2023-03-11 18:55:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9913.7). Total num frames: 28143616. Throughput: 0: 9785.3. Samples: 28129300. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:55:49,012][65744] Avg episode reward: [(0, '4099.822')] +[2023-03-11 18:55:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000054968_28143616.pth... +[2023-03-11 18:55:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000054400_27852800.pth +[2023-03-11 18:55:52,826][66031] Updated weights for policy 0, policy_version 55040 (0.0005) +[2023-03-11 18:55:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9913.7). Total num frames: 28188672. Throughput: 0: 9713.2. Samples: 28186712. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:55:54,012][65744] Avg episode reward: [(0, '4234.432')] +[2023-03-11 18:55:57,064][66031] Updated weights for policy 0, policy_version 55120 (0.0005) +[2023-03-11 18:55:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9913.7). Total num frames: 28237824. Throughput: 0: 9691.9. Samples: 28215608. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:55:59,023][65744] Avg episode reward: [(0, '4432.738')] +[2023-03-11 18:56:01,372][66031] Updated weights for policy 0, policy_version 55200 (0.0005) +[2023-03-11 18:56:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9913.7). Total num frames: 28286976. Throughput: 0: 9600.6. Samples: 28272260. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:56:04,012][65744] Avg episode reward: [(0, '3929.114')] +[2023-03-11 18:56:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000055248_28286976.pth... +[2023-03-11 18:56:04,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000054688_28000256.pth +[2023-03-11 18:56:05,721][66031] Updated weights for policy 0, policy_version 55280 (0.0005) +[2023-03-11 18:56:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9885.9). Total num frames: 28332032. Throughput: 0: 9544.3. Samples: 28329292. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:56:09,012][65744] Avg episode reward: [(0, '4128.006')] +[2023-03-11 18:56:09,950][66031] Updated weights for policy 0, policy_version 55360 (0.0005) +[2023-03-11 18:56:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9885.9). Total num frames: 28381184. Throughput: 0: 9572.9. Samples: 28358808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:56:14,012][65744] Avg episode reward: [(0, '4285.248')] +[2023-03-11 18:56:14,153][66031] Updated weights for policy 0, policy_version 55440 (0.0005) +[2023-03-11 18:56:18,220][66031] Updated weights for policy 0, policy_version 55520 (0.0004) +[2023-03-11 18:56:19,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9899.8). Total num frames: 28430336. Throughput: 0: 9619.8. Samples: 28418120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:56:19,012][65744] Avg episode reward: [(0, '4089.108')] +[2023-03-11 18:56:19,045][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000055536_28434432.pth... +[2023-03-11 18:56:19,046][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000054968_28143616.pth +[2023-03-11 18:56:22,257][66031] Updated weights for policy 0, policy_version 55600 (0.0004) +[2023-03-11 18:56:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9913.7). Total num frames: 28483584. Throughput: 0: 9696.9. Samples: 28479500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:56:24,012][65744] Avg episode reward: [(0, '4014.265')] +[2023-03-11 18:56:26,255][66031] Updated weights for policy 0, policy_version 55680 (0.0004) +[2023-03-11 18:56:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9913.7). Total num frames: 28532736. Throughput: 0: 9742.3. Samples: 28510172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:56:29,012][65744] Avg episode reward: [(0, '4414.029')] +[2023-03-11 18:56:30,233][66031] Updated weights for policy 0, policy_version 55760 (0.0004) +[2023-03-11 18:56:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 28585984. Throughput: 0: 9836.2. Samples: 28571928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:56:34,012][65744] Avg episode reward: [(0, '3987.957')] +[2023-03-11 18:56:34,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000055832_28585984.pth... +[2023-03-11 18:56:34,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000055248_28286976.pth +[2023-03-11 18:56:34,183][66031] Updated weights for policy 0, policy_version 55840 (0.0004) +[2023-03-11 18:56:38,216][66031] Updated weights for policy 0, policy_version 55920 (0.0004) +[2023-03-11 18:56:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9941.5). Total num frames: 28635136. Throughput: 0: 9924.4. Samples: 28633308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:56:39,012][65744] Avg episode reward: [(0, '4270.070')] +[2023-03-11 18:56:42,208][66031] Updated weights for policy 0, policy_version 56000 (0.0005) +[2023-03-11 18:56:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 28688384. Throughput: 0: 9961.4. Samples: 28663872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:56:44,012][65744] Avg episode reward: [(0, '4352.845')] +[2023-03-11 18:56:46,184][66031] Updated weights for policy 0, policy_version 56080 (0.0004) +[2023-03-11 18:56:49,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 28737536. Throughput: 0: 10067.8. Samples: 28725312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:56:49,012][65744] Avg episode reward: [(0, '4261.626')] +[2023-03-11 18:56:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000056128_28737536.pth... +[2023-03-11 18:56:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000055536_28434432.pth +[2023-03-11 18:56:50,460][66031] Updated weights for policy 0, policy_version 56160 (0.0005) +[2023-03-11 18:56:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 28786688. Throughput: 0: 10092.7. Samples: 28783464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:56:54,012][65744] Avg episode reward: [(0, '4141.151')] +[2023-03-11 18:56:54,648][66031] Updated weights for policy 0, policy_version 56240 (0.0004) +[2023-03-11 18:56:59,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 28831744. Throughput: 0: 10057.5. Samples: 28811396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:56:59,012][65744] Avg episode reward: [(0, '4242.360')] +[2023-03-11 18:56:59,013][66031] Updated weights for policy 0, policy_version 56320 (0.0005) +[2023-03-11 18:57:03,287][66031] Updated weights for policy 0, policy_version 56400 (0.0005) +[2023-03-11 18:57:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9927.6). Total num frames: 28880896. Throughput: 0: 10012.4. Samples: 28868680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:57:04,012][65744] Avg episode reward: [(0, '3910.160')] +[2023-03-11 18:57:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000056408_28880896.pth... +[2023-03-11 18:57:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000055832_28585984.pth +[2023-03-11 18:57:07,582][66031] Updated weights for policy 0, policy_version 56480 (0.0005) +[2023-03-11 18:57:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 28930048. Throughput: 0: 9921.3. Samples: 28925960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:57:09,012][65744] Avg episode reward: [(0, '3946.540')] +[2023-03-11 18:57:11,941][66031] Updated weights for policy 0, policy_version 56560 (0.0005) +[2023-03-11 18:57:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9913.7). Total num frames: 28975104. Throughput: 0: 9874.0. Samples: 28954504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:57:14,012][65744] Avg episode reward: [(0, '4258.228')] +[2023-03-11 18:57:16,324][66031] Updated weights for policy 0, policy_version 56640 (0.0005) +[2023-03-11 18:57:19,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9898.7, 300 sec: 9899.8). Total num frames: 29024256. Throughput: 0: 9750.3. Samples: 29010692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:57:19,012][65744] Avg episode reward: [(0, '4172.797')] +[2023-03-11 18:57:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000056688_29024256.pth... +[2023-03-11 18:57:19,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000056128_28737536.pth +[2023-03-11 18:57:20,621][66031] Updated weights for policy 0, policy_version 56720 (0.0005) +[2023-03-11 18:57:24,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9885.9). Total num frames: 29069312. Throughput: 0: 9650.2. Samples: 29067568. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:57:24,012][65744] Avg episode reward: [(0, '3743.011')] +[2023-03-11 18:57:24,897][66031] Updated weights for policy 0, policy_version 56800 (0.0005) +[2023-03-11 18:57:29,003][66031] Updated weights for policy 0, policy_version 56880 (0.0004) +[2023-03-11 18:57:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9885.9). Total num frames: 29122560. Throughput: 0: 9610.0. Samples: 29096324. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:57:29,012][65744] Avg episode reward: [(0, '3555.588')] +[2023-03-11 18:57:33,029][66031] Updated weights for policy 0, policy_version 56960 (0.0004) +[2023-03-11 18:57:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9885.9). Total num frames: 29171712. Throughput: 0: 9609.1. Samples: 29157720. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:57:34,012][65744] Avg episode reward: [(0, '3855.116')] +[2023-03-11 18:57:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000056976_29171712.pth... +[2023-03-11 18:57:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000056408_28880896.pth +[2023-03-11 18:57:37,064][66031] Updated weights for policy 0, policy_version 57040 (0.0005) +[2023-03-11 18:57:39,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9885.9). Total num frames: 29220864. Throughput: 0: 9673.9. Samples: 29218788. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:57:39,012][65744] Avg episode reward: [(0, '3850.274')] +[2023-03-11 18:57:41,054][66031] Updated weights for policy 0, policy_version 57120 (0.0004) +[2023-03-11 18:57:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 29274112. Throughput: 0: 9737.9. Samples: 29249600. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:57:44,012][65744] Avg episode reward: [(0, '3643.660')] +[2023-03-11 18:57:45,061][66031] Updated weights for policy 0, policy_version 57200 (0.0004) +[2023-03-11 18:57:49,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 9885.9). Total num frames: 29323264. Throughput: 0: 9830.2. Samples: 29311040. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:57:49,012][65744] Avg episode reward: [(0, '3905.737')] +[2023-03-11 18:57:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000057272_29323264.pth... +[2023-03-11 18:57:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000056688_29024256.pth +[2023-03-11 18:57:49,068][66031] Updated weights for policy 0, policy_version 57280 (0.0004) +[2023-03-11 18:57:53,344][66031] Updated weights for policy 0, policy_version 57360 (0.0004) +[2023-03-11 18:57:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 29372416. Throughput: 0: 9850.1. Samples: 29369216. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 18:57:54,012][65744] Avg episode reward: [(0, '3892.494')] +[2023-03-11 18:57:57,482][66031] Updated weights for policy 0, policy_version 57440 (0.0004) +[2023-03-11 18:57:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9899.8). Total num frames: 29421568. Throughput: 0: 9893.8. Samples: 29399724. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:57:59,012][65744] Avg episode reward: [(0, '2901.420')] +[2023-03-11 18:58:01,781][66031] Updated weights for policy 0, policy_version 57520 (0.0005) +[2023-03-11 18:58:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9899.8). Total num frames: 29470720. Throughput: 0: 9912.4. Samples: 29456752. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:58:04,012][65744] Avg episode reward: [(0, '2993.763')] +[2023-03-11 18:58:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000057560_29470720.pth... +[2023-03-11 18:58:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000056976_29171712.pth +[2023-03-11 18:58:06,133][66031] Updated weights for policy 0, policy_version 57600 (0.0005) +[2023-03-11 18:58:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9885.9). Total num frames: 29515776. Throughput: 0: 9924.0. Samples: 29514148. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:58:09,012][65744] Avg episode reward: [(0, '3029.933')] +[2023-03-11 18:58:10,415][66031] Updated weights for policy 0, policy_version 57680 (0.0005) +[2023-03-11 18:58:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9885.9). Total num frames: 29564928. Throughput: 0: 9902.8. Samples: 29541948. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:58:14,012][65744] Avg episode reward: [(0, '2376.079')] +[2023-03-11 18:58:14,780][66031] Updated weights for policy 0, policy_version 57760 (0.0005) +[2023-03-11 18:58:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9872.1). Total num frames: 29609984. Throughput: 0: 9778.8. Samples: 29597768. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:58:19,012][65744] Avg episode reward: [(0, '1897.851')] +[2023-03-11 18:58:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000057832_29609984.pth... +[2023-03-11 18:58:19,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000057272_29323264.pth +[2023-03-11 18:58:19,137][66031] Updated weights for policy 0, policy_version 57840 (0.0004) +[2023-03-11 18:58:23,440][66031] Updated weights for policy 0, policy_version 57920 (0.0005) +[2023-03-11 18:58:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9858.2). Total num frames: 29659136. Throughput: 0: 9695.9. Samples: 29655104. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:58:24,012][65744] Avg episode reward: [(0, '2157.801')] +[2023-03-11 18:58:27,716][66031] Updated weights for policy 0, policy_version 58000 (0.0005) +[2023-03-11 18:58:29,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9858.2). Total num frames: 29708288. Throughput: 0: 9648.4. Samples: 29683776. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:58:29,012][65744] Avg episode reward: [(0, '2447.192')] +[2023-03-11 18:58:31,997][66031] Updated weights for policy 0, policy_version 58080 (0.0005) +[2023-03-11 18:58:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9844.3). Total num frames: 29753344. Throughput: 0: 9557.5. Samples: 29741128. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:58:34,012][65744] Avg episode reward: [(0, '2413.033')] +[2023-03-11 18:58:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000058112_29753344.pth... +[2023-03-11 18:58:34,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000057560_29470720.pth +[2023-03-11 18:58:36,205][66031] Updated weights for policy 0, policy_version 58160 (0.0005) +[2023-03-11 18:58:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9844.3). Total num frames: 29802496. Throughput: 0: 9576.2. Samples: 29800144. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:58:39,012][65744] Avg episode reward: [(0, '2340.465')] +[2023-03-11 18:58:40,272][66031] Updated weights for policy 0, policy_version 58240 (0.0004) +[2023-03-11 18:58:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9693.9, 300 sec: 9844.3). Total num frames: 29855744. Throughput: 0: 9586.5. Samples: 29831116. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:58:44,012][65744] Avg episode reward: [(0, '2172.443')] +[2023-03-11 18:58:44,277][66031] Updated weights for policy 0, policy_version 58320 (0.0004) +[2023-03-11 18:58:48,414][66031] Updated weights for policy 0, policy_version 58400 (0.0004) +[2023-03-11 18:58:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9844.3). Total num frames: 29904896. Throughput: 0: 9655.3. Samples: 29891240. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:58:49,012][65744] Avg episode reward: [(0, '2875.913')] +[2023-03-11 18:58:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000058408_29904896.pth... +[2023-03-11 18:58:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000057832_29609984.pth +[2023-03-11 18:58:52,706][66031] Updated weights for policy 0, policy_version 58480 (0.0005) +[2023-03-11 18:58:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9844.3). Total num frames: 29954048. Throughput: 0: 9667.6. Samples: 29949188. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:58:54,012][65744] Avg episode reward: [(0, '2719.305')] +[2023-03-11 18:58:56,967][66031] Updated weights for policy 0, policy_version 58560 (0.0005) +[2023-03-11 18:58:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9830.4). Total num frames: 29999104. Throughput: 0: 9692.3. Samples: 29978100. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:58:59,012][65744] Avg episode reward: [(0, '3523.276')] +[2023-03-11 18:59:01,190][66031] Updated weights for policy 0, policy_version 58640 (0.0005) +[2023-03-11 18:59:04,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9625.6, 300 sec: 9830.4). Total num frames: 30048256. Throughput: 0: 9738.0. Samples: 30035976. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:59:04,012][65744] Avg episode reward: [(0, '4024.810')] +[2023-03-11 18:59:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000058688_30048256.pth... +[2023-03-11 18:59:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000058112_29753344.pth +[2023-03-11 18:59:05,452][66031] Updated weights for policy 0, policy_version 58720 (0.0005) +[2023-03-11 18:59:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9816.5). Total num frames: 30097408. Throughput: 0: 9757.9. Samples: 30094208. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 18:59:09,012][65744] Avg episode reward: [(0, '3514.362')] +[2023-03-11 18:59:09,575][66031] Updated weights for policy 0, policy_version 58800 (0.0004) +[2023-03-11 18:59:13,978][66031] Updated weights for policy 0, policy_version 58880 (0.0004) +[2023-03-11 18:59:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9802.6). Total num frames: 30146560. Throughput: 0: 9747.2. Samples: 30122400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:59:14,012][65744] Avg episode reward: [(0, '2878.941')] +[2023-03-11 18:59:18,293][66031] Updated weights for policy 0, policy_version 58960 (0.0005) +[2023-03-11 18:59:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 30191616. Throughput: 0: 9739.9. Samples: 30179424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:59:19,012][65744] Avg episode reward: [(0, '3617.592')] +[2023-03-11 18:59:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000058968_30191616.pth... +[2023-03-11 18:59:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000058408_29904896.pth +[2023-03-11 18:59:22,602][66031] Updated weights for policy 0, policy_version 59040 (0.0004) +[2023-03-11 18:59:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 30240768. Throughput: 0: 9700.8. Samples: 30236680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:59:24,012][65744] Avg episode reward: [(0, '3356.875')] +[2023-03-11 18:59:26,856][66031] Updated weights for policy 0, policy_version 59120 (0.0005) +[2023-03-11 18:59:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 30289920. Throughput: 0: 9651.1. Samples: 30265416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:59:29,012][65744] Avg episode reward: [(0, '3117.623')] +[2023-03-11 18:59:31,144][66031] Updated weights for policy 0, policy_version 59200 (0.0005) +[2023-03-11 18:59:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9761.0). Total num frames: 30334976. Throughput: 0: 9587.9. Samples: 30322696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:59:34,012][65744] Avg episode reward: [(0, '3721.930')] +[2023-03-11 18:59:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000059248_30334976.pth... +[2023-03-11 18:59:34,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000058688_30048256.pth +[2023-03-11 18:59:35,491][66031] Updated weights for policy 0, policy_version 59280 (0.0005) +[2023-03-11 18:59:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9747.1). Total num frames: 30384128. Throughput: 0: 9575.7. Samples: 30380096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:59:39,012][65744] Avg episode reward: [(0, '4195.254')] +[2023-03-11 18:59:39,692][66031] Updated weights for policy 0, policy_version 59360 (0.0005) +[2023-03-11 18:59:43,956][66031] Updated weights for policy 0, policy_version 59440 (0.0005) +[2023-03-11 18:59:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9747.1). Total num frames: 30433280. Throughput: 0: 9574.9. Samples: 30408972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:59:44,012][65744] Avg episode reward: [(0, '3990.446')] +[2023-03-11 18:59:48,313][66031] Updated weights for policy 0, policy_version 59520 (0.0005) +[2023-03-11 18:59:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9733.2). Total num frames: 30478336. Throughput: 0: 9558.8. Samples: 30466120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:59:49,012][65744] Avg episode reward: [(0, '3909.227')] +[2023-03-11 18:59:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000059528_30478336.pth... +[2023-03-11 18:59:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000058968_30191616.pth +[2023-03-11 18:59:52,542][66031] Updated weights for policy 0, policy_version 59600 (0.0005) +[2023-03-11 18:59:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9747.1). Total num frames: 30527488. Throughput: 0: 9547.9. Samples: 30523864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:59:54,012][65744] Avg episode reward: [(0, '4047.254')] +[2023-03-11 18:59:56,793][66031] Updated weights for policy 0, policy_version 59680 (0.0005) +[2023-03-11 18:59:59,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9747.1). Total num frames: 30576640. Throughput: 0: 9565.4. Samples: 30552844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:59:59,012][65744] Avg episode reward: [(0, '3744.665')] +[2023-03-11 19:00:01,024][66031] Updated weights for policy 0, policy_version 59760 (0.0005) +[2023-03-11 19:00:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9719.3). Total num frames: 30621696. Throughput: 0: 9576.6. Samples: 30610372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:00:04,012][65744] Avg episode reward: [(0, '3756.599')] +[2023-03-11 19:00:04,060][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000059816_30625792.pth... +[2023-03-11 19:00:04,062][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000059248_30334976.pth +[2023-03-11 19:00:05,411][66031] Updated weights for policy 0, policy_version 59840 (0.0005) +[2023-03-11 19:00:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9719.3). Total num frames: 30670848. Throughput: 0: 9569.9. Samples: 30667324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:00:09,012][65744] Avg episode reward: [(0, '4174.037')] +[2023-03-11 19:00:09,671][66031] Updated weights for policy 0, policy_version 59920 (0.0005) +[2023-03-11 19:00:13,890][66031] Updated weights for policy 0, policy_version 60000 (0.0005) +[2023-03-11 19:00:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9719.3). Total num frames: 30720000. Throughput: 0: 9557.3. Samples: 30695496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:00:14,012][65744] Avg episode reward: [(0, '4196.070')] +[2023-03-11 19:00:18,131][66031] Updated weights for policy 0, policy_version 60080 (0.0005) +[2023-03-11 19:00:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9719.3). Total num frames: 30769152. Throughput: 0: 9596.6. Samples: 30754544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:00:19,012][65744] Avg episode reward: [(0, '3953.034')] +[2023-03-11 19:00:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000060096_30769152.pth... +[2023-03-11 19:00:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000059528_30478336.pth +[2023-03-11 19:00:22,301][66031] Updated weights for policy 0, policy_version 60160 (0.0005) +[2023-03-11 19:00:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9719.3). Total num frames: 30818304. Throughput: 0: 9627.7. Samples: 30813340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:00:24,012][65744] Avg episode reward: [(0, '3592.148')] +[2023-03-11 19:00:26,553][66031] Updated weights for policy 0, policy_version 60240 (0.0005) +[2023-03-11 19:00:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9705.4). Total num frames: 30863360. Throughput: 0: 9628.6. Samples: 30842260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:00:29,012][65744] Avg episode reward: [(0, '2817.899')] +[2023-03-11 19:00:30,860][66031] Updated weights for policy 0, policy_version 60320 (0.0005) +[2023-03-11 19:00:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9719.3). Total num frames: 30912512. Throughput: 0: 9633.0. Samples: 30899604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:00:34,012][65744] Avg episode reward: [(0, '3060.065')] +[2023-03-11 19:00:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000060376_30912512.pth... +[2023-03-11 19:00:34,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000059816_30625792.pth +[2023-03-11 19:00:35,052][66031] Updated weights for policy 0, policy_version 60400 (0.0004) +[2023-03-11 19:00:39,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9719.3). Total num frames: 30961664. Throughput: 0: 9644.0. Samples: 30957844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:00:39,012][65744] Avg episode reward: [(0, '3747.023')] +[2023-03-11 19:00:39,272][66031] Updated weights for policy 0, policy_version 60480 (0.0004) +[2023-03-11 19:00:43,464][66031] Updated weights for policy 0, policy_version 60560 (0.0005) +[2023-03-11 19:00:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9719.3). Total num frames: 31010816. Throughput: 0: 9638.2. Samples: 30986564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:00:44,012][65744] Avg episode reward: [(0, '3328.894')] +[2023-03-11 19:00:47,736][66031] Updated weights for policy 0, policy_version 60640 (0.0005) +[2023-03-11 19:00:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9733.2). Total num frames: 31059968. Throughput: 0: 9653.0. Samples: 31044756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:00:49,012][65744] Avg episode reward: [(0, '3539.088')] +[2023-03-11 19:00:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000060664_31059968.pth... +[2023-03-11 19:00:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000060096_30769152.pth +[2023-03-11 19:00:51,972][66031] Updated weights for policy 0, policy_version 60720 (0.0005) +[2023-03-11 19:00:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9719.3). Total num frames: 31105024. Throughput: 0: 9685.2. Samples: 31103160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:00:54,012][65744] Avg episode reward: [(0, '3427.212')] +[2023-03-11 19:00:56,321][66031] Updated weights for policy 0, policy_version 60800 (0.0005) +[2023-03-11 19:00:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9719.3). Total num frames: 31154176. Throughput: 0: 9672.9. Samples: 31130776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:00:59,012][65744] Avg episode reward: [(0, '3818.650')] +[2023-03-11 19:01:00,636][66031] Updated weights for policy 0, policy_version 60880 (0.0005) +[2023-03-11 19:01:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9719.3). Total num frames: 31199232. Throughput: 0: 9626.6. Samples: 31187740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:01:04,012][65744] Avg episode reward: [(0, '3921.182')] +[2023-03-11 19:01:04,021][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000060944_31203328.pth... +[2023-03-11 19:01:04,022][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000060376_30912512.pth +[2023-03-11 19:01:04,882][66031] Updated weights for policy 0, policy_version 60960 (0.0005) +[2023-03-11 19:01:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9719.3). Total num frames: 31248384. Throughput: 0: 9622.0. Samples: 31246332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:01:09,012][65744] Avg episode reward: [(0, '4020.927')] +[2023-03-11 19:01:09,133][66031] Updated weights for policy 0, policy_version 61040 (0.0005) +[2023-03-11 19:01:13,356][66031] Updated weights for policy 0, policy_version 61120 (0.0005) +[2023-03-11 19:01:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9719.3). Total num frames: 31297536. Throughput: 0: 9622.0. Samples: 31275252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:01:14,012][65744] Avg episode reward: [(0, '3393.121')] +[2023-03-11 19:01:17,589][66031] Updated weights for policy 0, policy_version 61200 (0.0005) +[2023-03-11 19:01:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9705.4). Total num frames: 31346688. Throughput: 0: 9633.1. Samples: 31333092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:01:19,012][65744] Avg episode reward: [(0, '3373.697')] +[2023-03-11 19:01:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000061224_31346688.pth... +[2023-03-11 19:01:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000060664_31059968.pth +[2023-03-11 19:01:21,902][66031] Updated weights for policy 0, policy_version 61280 (0.0005) +[2023-03-11 19:01:24,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9691.6). Total num frames: 31391744. Throughput: 0: 9602.7. Samples: 31389964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:01:24,012][65744] Avg episode reward: [(0, '4087.043')] +[2023-03-11 19:01:26,163][66031] Updated weights for policy 0, policy_version 61360 (0.0005) +[2023-03-11 19:01:29,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9677.7). Total num frames: 31440896. Throughput: 0: 9621.0. Samples: 31419508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:01:29,012][65744] Avg episode reward: [(0, '4179.339')] +[2023-03-11 19:01:30,383][66031] Updated weights for policy 0, policy_version 61440 (0.0005) +[2023-03-11 19:01:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9677.7). Total num frames: 31490048. Throughput: 0: 9617.9. Samples: 31477564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:01:34,012][65744] Avg episode reward: [(0, '3401.283')] +[2023-03-11 19:01:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000061504_31490048.pth... +[2023-03-11 19:01:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000060944_31203328.pth +[2023-03-11 19:01:34,621][66031] Updated weights for policy 0, policy_version 61520 (0.0005) +[2023-03-11 19:01:38,848][66031] Updated weights for policy 0, policy_version 61600 (0.0005) +[2023-03-11 19:01:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9663.8). Total num frames: 31539200. Throughput: 0: 9600.2. Samples: 31535168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:01:39,012][65744] Avg episode reward: [(0, '3638.527')] +[2023-03-11 19:01:43,077][66031] Updated weights for policy 0, policy_version 61680 (0.0005) +[2023-03-11 19:01:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9663.8). Total num frames: 31588352. Throughput: 0: 9624.3. Samples: 31563868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:01:44,012][65744] Avg episode reward: [(0, '3455.652')] +[2023-03-11 19:01:47,332][66031] Updated weights for policy 0, policy_version 61760 (0.0005) +[2023-03-11 19:01:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9649.9). Total num frames: 31633408. Throughput: 0: 9654.1. Samples: 31622176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:01:49,012][65744] Avg episode reward: [(0, '2394.467')] +[2023-03-11 19:01:49,078][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000061792_31637504.pth... +[2023-03-11 19:01:49,080][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000061224_31346688.pth +[2023-03-11 19:01:51,557][66031] Updated weights for policy 0, policy_version 61840 (0.0005) +[2023-03-11 19:01:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9663.8). Total num frames: 31682560. Throughput: 0: 9656.3. Samples: 31680864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:01:54,012][65744] Avg episode reward: [(0, '3839.404')] +[2023-03-11 19:01:55,701][66031] Updated weights for policy 0, policy_version 61920 (0.0005) +[2023-03-11 19:01:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9663.8). Total num frames: 31731712. Throughput: 0: 9676.8. Samples: 31710708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:01:59,012][65744] Avg episode reward: [(0, '3312.121')] +[2023-03-11 19:01:59,937][66031] Updated weights for policy 0, policy_version 62000 (0.0005) +[2023-03-11 19:02:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9663.8). Total num frames: 31780864. Throughput: 0: 9677.4. Samples: 31768576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:02:04,012][65744] Avg episode reward: [(0, '3584.397')] +[2023-03-11 19:02:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000062072_31780864.pth... +[2023-03-11 19:02:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000061504_31490048.pth +[2023-03-11 19:02:04,201][66031] Updated weights for policy 0, policy_version 62080 (0.0005) +[2023-03-11 19:02:08,465][66031] Updated weights for policy 0, policy_version 62160 (0.0005) +[2023-03-11 19:02:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9677.7). Total num frames: 31830016. Throughput: 0: 9688.3. Samples: 31825936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:02:09,012][65744] Avg episode reward: [(0, '3665.393')] +[2023-03-11 19:02:12,768][66031] Updated weights for policy 0, policy_version 62240 (0.0006) +[2023-03-11 19:02:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9677.7). Total num frames: 31879168. Throughput: 0: 9668.5. Samples: 31854592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:02:14,012][65744] Avg episode reward: [(0, '3864.368')] +[2023-03-11 19:02:16,796][66031] Updated weights for policy 0, policy_version 62320 (0.0005) +[2023-03-11 19:02:19,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9691.6). Total num frames: 31928320. Throughput: 0: 9710.9. Samples: 31914556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:02:19,012][65744] Avg episode reward: [(0, '3589.129')] +[2023-03-11 19:02:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000062360_31928320.pth... +[2023-03-11 19:02:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000061792_31637504.pth +[2023-03-11 19:02:20,752][66031] Updated weights for policy 0, policy_version 62400 (0.0005) +[2023-03-11 19:02:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9691.6). Total num frames: 31981568. Throughput: 0: 9811.6. Samples: 31976692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:02:24,012][65744] Avg episode reward: [(0, '3799.910')] +[2023-03-11 19:02:24,793][66031] Updated weights for policy 0, policy_version 62480 (0.0005) +[2023-03-11 19:02:29,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9677.7). Total num frames: 32026624. Throughput: 0: 9828.5. Samples: 32006152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:02:29,012][65744] Avg episode reward: [(0, '4016.581')] +[2023-03-11 19:02:29,028][66031] Updated weights for policy 0, policy_version 62560 (0.0005) +[2023-03-11 19:02:33,385][66031] Updated weights for policy 0, policy_version 62640 (0.0005) +[2023-03-11 19:02:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9677.7). Total num frames: 32075776. Throughput: 0: 9805.6. Samples: 32063428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:02:34,012][65744] Avg episode reward: [(0, '4068.139')] +[2023-03-11 19:02:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000062648_32075776.pth... +[2023-03-11 19:02:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000062072_31780864.pth +[2023-03-11 19:02:37,655][66031] Updated weights for policy 0, policy_version 62720 (0.0005) +[2023-03-11 19:02:39,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9663.8). Total num frames: 32124928. Throughput: 0: 9773.3. Samples: 32120664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:02:39,012][65744] Avg episode reward: [(0, '4292.199')] +[2023-03-11 19:02:41,902][66031] Updated weights for policy 0, policy_version 62800 (0.0005) +[2023-03-11 19:02:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9663.8). Total num frames: 32174080. Throughput: 0: 9751.0. Samples: 32149504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:02:44,012][65744] Avg episode reward: [(0, '4062.726')] +[2023-03-11 19:02:46,158][66031] Updated weights for policy 0, policy_version 62880 (0.0005) +[2023-03-11 19:02:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9649.9). Total num frames: 32219136. Throughput: 0: 9741.0. Samples: 32206920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:02:49,012][65744] Avg episode reward: [(0, '3904.535')] +[2023-03-11 19:02:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000062928_32219136.pth... +[2023-03-11 19:02:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000062360_31928320.pth +[2023-03-11 19:02:50,375][66031] Updated weights for policy 0, policy_version 62960 (0.0005) +[2023-03-11 19:02:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9649.9). Total num frames: 32268288. Throughput: 0: 9753.8. Samples: 32264856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:02:54,012][65744] Avg episode reward: [(0, '3359.331')] +[2023-03-11 19:02:54,622][66031] Updated weights for policy 0, policy_version 63040 (0.0005) +[2023-03-11 19:02:58,836][66031] Updated weights for policy 0, policy_version 63120 (0.0005) +[2023-03-11 19:02:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9649.9). Total num frames: 32317440. Throughput: 0: 9765.8. Samples: 32294052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:02:59,012][65744] Avg episode reward: [(0, '4274.238')] +[2023-03-11 19:03:03,043][66031] Updated weights for policy 0, policy_version 63200 (0.0005) +[2023-03-11 19:03:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9663.8). Total num frames: 32366592. Throughput: 0: 9733.5. Samples: 32352564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:03:04,012][65744] Avg episode reward: [(0, '3891.029')] +[2023-03-11 19:03:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000063216_32366592.pth... +[2023-03-11 19:03:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000062648_32075776.pth +[2023-03-11 19:03:07,343][66031] Updated weights for policy 0, policy_version 63280 (0.0005) +[2023-03-11 19:03:09,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9663.8). Total num frames: 32415744. Throughput: 0: 9640.9. Samples: 32410532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:03:09,012][65744] Avg episode reward: [(0, '3980.343')] +[2023-03-11 19:03:11,560][66031] Updated weights for policy 0, policy_version 63360 (0.0005) +[2023-03-11 19:03:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9663.8). Total num frames: 32460800. Throughput: 0: 9635.4. Samples: 32439744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:03:14,012][65744] Avg episode reward: [(0, '3800.212')] +[2023-03-11 19:03:15,820][66031] Updated weights for policy 0, policy_version 63440 (0.0005) +[2023-03-11 19:03:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9663.8). Total num frames: 32509952. Throughput: 0: 9648.3. Samples: 32497600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:03:19,012][65744] Avg episode reward: [(0, '3034.460')] +[2023-03-11 19:03:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000063496_32509952.pth... +[2023-03-11 19:03:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000062928_32219136.pth +[2023-03-11 19:03:20,040][66031] Updated weights for policy 0, policy_version 63520 (0.0005) +[2023-03-11 19:03:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9663.8). Total num frames: 32559104. Throughput: 0: 9653.7. Samples: 32555080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:03:24,012][65744] Avg episode reward: [(0, '4247.748')] +[2023-03-11 19:03:24,231][66031] Updated weights for policy 0, policy_version 63600 (0.0005) +[2023-03-11 19:03:28,244][66031] Updated weights for policy 0, policy_version 63680 (0.0005) +[2023-03-11 19:03:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9677.7). Total num frames: 32608256. Throughput: 0: 9690.1. Samples: 32585560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:03:29,012][65744] Avg episode reward: [(0, '3920.600')] +[2023-03-11 19:03:32,081][66031] Updated weights for policy 0, policy_version 63760 (0.0004) +[2023-03-11 19:03:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 9691.6). Total num frames: 32661504. Throughput: 0: 9827.9. Samples: 32649176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:03:34,012][65744] Avg episode reward: [(0, '3496.240')] +[2023-03-11 19:03:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000063792_32661504.pth... +[2023-03-11 19:03:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000063216_32366592.pth +[2023-03-11 19:03:36,023][66031] Updated weights for policy 0, policy_version 63840 (0.0004) +[2023-03-11 19:03:39,012][65744] Fps is (10 sec: 10649.7, 60 sec: 9830.4, 300 sec: 9691.6). Total num frames: 32714752. Throughput: 0: 9896.5. Samples: 32710200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:03:39,012][65744] Avg episode reward: [(0, '2893.271')] +[2023-03-11 19:03:40,271][66031] Updated weights for policy 0, policy_version 63920 (0.0005) +[2023-03-11 19:03:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 9691.6). Total num frames: 32763904. Throughput: 0: 9895.0. Samples: 32739328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:03:44,012][65744] Avg episode reward: [(0, '4194.923')] +[2023-03-11 19:03:44,350][66031] Updated weights for policy 0, policy_version 64000 (0.0005) +[2023-03-11 19:03:48,537][66031] Updated weights for policy 0, policy_version 64080 (0.0005) +[2023-03-11 19:03:49,012][65744] Fps is (10 sec: 9830.2, 60 sec: 9898.6, 300 sec: 9691.5). Total num frames: 32813056. Throughput: 0: 9913.5. Samples: 32798672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:03:49,012][65744] Avg episode reward: [(0, '4069.827')] +[2023-03-11 19:03:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000064088_32813056.pth... +[2023-03-11 19:03:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000063496_32509952.pth +[2023-03-11 19:03:52,752][66031] Updated weights for policy 0, policy_version 64160 (0.0005) +[2023-03-11 19:03:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9705.4). Total num frames: 32862208. Throughput: 0: 9924.1. Samples: 32857116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:03:54,012][65744] Avg episode reward: [(0, '3696.277')] +[2023-03-11 19:03:56,917][66031] Updated weights for policy 0, policy_version 64240 (0.0005) +[2023-03-11 19:03:59,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 9691.6). Total num frames: 32907264. Throughput: 0: 9934.2. Samples: 32886784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:03:59,012][65744] Avg episode reward: [(0, '4120.774')] +[2023-03-11 19:04:01,052][66031] Updated weights for policy 0, policy_version 64320 (0.0005) +[2023-03-11 19:04:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9705.4). Total num frames: 32960512. Throughput: 0: 9975.5. Samples: 32946496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:04:04,012][65744] Avg episode reward: [(0, '3706.246')] +[2023-03-11 19:04:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000064376_32960512.pth... +[2023-03-11 19:04:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000063792_32661504.pth +[2023-03-11 19:04:04,973][66031] Updated weights for policy 0, policy_version 64400 (0.0005) +[2023-03-11 19:04:08,868][66031] Updated weights for policy 0, policy_version 64480 (0.0004) +[2023-03-11 19:04:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 9966.9, 300 sec: 9719.3). Total num frames: 33013760. Throughput: 0: 10101.0. Samples: 33009624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:04:09,012][65744] Avg episode reward: [(0, '4087.893')] +[2023-03-11 19:04:12,868][66031] Updated weights for policy 0, policy_version 64560 (0.0005) +[2023-03-11 19:04:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9733.2). Total num frames: 33062912. Throughput: 0: 10104.5. Samples: 33040264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:04:14,012][65744] Avg episode reward: [(0, '3612.317')] +[2023-03-11 19:04:16,803][66031] Updated weights for policy 0, policy_version 64640 (0.0005) +[2023-03-11 19:04:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9747.1). Total num frames: 33116160. Throughput: 0: 10072.8. Samples: 33102452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:04:19,012][65744] Avg episode reward: [(0, '3977.093')] +[2023-03-11 19:04:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000064680_33116160.pth... +[2023-03-11 19:04:19,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000064088_32813056.pth +[2023-03-11 19:04:20,744][66031] Updated weights for policy 0, policy_version 64720 (0.0005) +[2023-03-11 19:04:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 9761.0). Total num frames: 33169408. Throughput: 0: 10098.5. Samples: 33164632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:04:24,012][65744] Avg episode reward: [(0, '3934.517')] +[2023-03-11 19:04:24,764][66031] Updated weights for policy 0, policy_version 64800 (0.0005) +[2023-03-11 19:04:29,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 9761.0). Total num frames: 33214464. Throughput: 0: 10103.5. Samples: 33193984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:04:29,012][65744] Avg episode reward: [(0, '3622.862')] +[2023-03-11 19:04:29,013][66031] Updated weights for policy 0, policy_version 64880 (0.0005) +[2023-03-11 19:04:33,273][66031] Updated weights for policy 0, policy_version 64960 (0.0005) +[2023-03-11 19:04:34,012][65744] Fps is (10 sec: 9420.7, 60 sec: 10035.2, 300 sec: 9761.0). Total num frames: 33263616. Throughput: 0: 10060.9. Samples: 33251412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:04:34,012][65744] Avg episode reward: [(0, '4026.058')] +[2023-03-11 19:04:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000064968_33263616.pth... +[2023-03-11 19:04:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000064376_32960512.pth +[2023-03-11 19:04:37,188][66031] Updated weights for policy 0, policy_version 65040 (0.0005) +[2023-03-11 19:04:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9774.9). Total num frames: 33316864. Throughput: 0: 10138.9. Samples: 33313368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:04:39,012][65744] Avg episode reward: [(0, '4215.220')] +[2023-03-11 19:04:41,328][66031] Updated weights for policy 0, policy_version 65120 (0.0005) +[2023-03-11 19:04:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9788.7). Total num frames: 33366016. Throughput: 0: 10128.5. Samples: 33342564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:04:44,012][65744] Avg episode reward: [(0, '3557.038')] +[2023-03-11 19:04:45,664][66031] Updated weights for policy 0, policy_version 65200 (0.0005) +[2023-03-11 19:04:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9774.9). Total num frames: 33411072. Throughput: 0: 10060.8. Samples: 33399232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:04:49,012][65744] Avg episode reward: [(0, '3789.138')] +[2023-03-11 19:04:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000065256_33411072.pth... +[2023-03-11 19:04:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000064680_33116160.pth +[2023-03-11 19:04:49,889][66031] Updated weights for policy 0, policy_version 65280 (0.0005) +[2023-03-11 19:04:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9774.9). Total num frames: 33460224. Throughput: 0: 9960.9. Samples: 33457864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:04:54,012][65744] Avg episode reward: [(0, '4175.726')] +[2023-03-11 19:04:54,095][66031] Updated weights for policy 0, policy_version 65360 (0.0005) +[2023-03-11 19:04:58,296][66031] Updated weights for policy 0, policy_version 65440 (0.0005) +[2023-03-11 19:04:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9788.7). Total num frames: 33509376. Throughput: 0: 9931.1. Samples: 33487164. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:04:59,012][65744] Avg episode reward: [(0, '3998.617')] +[2023-03-11 19:05:02,629][66031] Updated weights for policy 0, policy_version 65520 (0.0005) +[2023-03-11 19:05:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9788.7). Total num frames: 33558528. Throughput: 0: 9830.4. Samples: 33544820. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:05:04,012][65744] Avg episode reward: [(0, '3719.590')] +[2023-03-11 19:05:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000065544_33558528.pth... +[2023-03-11 19:05:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000064968_33263616.pth +[2023-03-11 19:05:06,810][66031] Updated weights for policy 0, policy_version 65600 (0.0005) +[2023-03-11 19:05:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9788.7). Total num frames: 33607680. Throughput: 0: 9751.0. Samples: 33603428. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:05:09,012][65744] Avg episode reward: [(0, '3750.735')] +[2023-03-11 19:05:11,001][66031] Updated weights for policy 0, policy_version 65680 (0.0005) +[2023-03-11 19:05:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9788.7). Total num frames: 33656832. Throughput: 0: 9741.0. Samples: 33632328. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:05:14,012][65744] Avg episode reward: [(0, '3655.208')] +[2023-03-11 19:05:15,009][66031] Updated weights for policy 0, policy_version 65760 (0.0005) +[2023-03-11 19:05:18,941][66031] Updated weights for policy 0, policy_version 65840 (0.0005) +[2023-03-11 19:05:19,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9802.6). Total num frames: 33710080. Throughput: 0: 9843.5. Samples: 33694368. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:05:19,012][65744] Avg episode reward: [(0, '3895.260')] +[2023-03-11 19:05:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000065840_33710080.pth... +[2023-03-11 19:05:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000065256_33411072.pth +[2023-03-11 19:05:22,927][66031] Updated weights for policy 0, policy_version 65920 (0.0005) +[2023-03-11 19:05:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9816.5). Total num frames: 33759232. Throughput: 0: 9841.2. Samples: 33756224. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:05:24,012][65744] Avg episode reward: [(0, '3684.704')] +[2023-03-11 19:05:26,818][66031] Updated weights for policy 0, policy_version 66000 (0.0005) +[2023-03-11 19:05:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9830.4). Total num frames: 33812480. Throughput: 0: 9897.9. Samples: 33787968. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:05:29,012][65744] Avg episode reward: [(0, '4268.487')] +[2023-03-11 19:05:30,738][66031] Updated weights for policy 0, policy_version 66080 (0.0004) +[2023-03-11 19:05:34,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10035.2, 300 sec: 9844.3). Total num frames: 33865728. Throughput: 0: 10040.3. Samples: 33851044. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:05:34,012][65744] Avg episode reward: [(0, '2815.417')] +[2023-03-11 19:05:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000066144_33865728.pth... +[2023-03-11 19:05:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000065544_33558528.pth +[2023-03-11 19:05:34,645][66031] Updated weights for policy 0, policy_version 66160 (0.0004) +[2023-03-11 19:05:38,465][66031] Updated weights for policy 0, policy_version 66240 (0.0003) +[2023-03-11 19:05:39,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10035.2, 300 sec: 9858.2). Total num frames: 33918976. Throughput: 0: 10153.4. Samples: 33914768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:05:39,012][65744] Avg episode reward: [(0, '4110.152')] +[2023-03-11 19:05:42,403][66031] Updated weights for policy 0, policy_version 66320 (0.0004) +[2023-03-11 19:05:44,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 9872.1). Total num frames: 33972224. Throughput: 0: 10202.5. Samples: 33946276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:05:44,012][65744] Avg episode reward: [(0, '4044.901')] +[2023-03-11 19:05:46,280][66031] Updated weights for policy 0, policy_version 66400 (0.0004) +[2023-03-11 19:05:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9885.9). Total num frames: 34021376. Throughput: 0: 10318.5. Samples: 34009152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:05:49,012][65744] Avg episode reward: [(0, '3570.427')] +[2023-03-11 19:05:49,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000066448_34021376.pth... +[2023-03-11 19:05:49,016][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000065840_33710080.pth +[2023-03-11 19:05:50,247][66031] Updated weights for policy 0, policy_version 66480 (0.0005) +[2023-03-11 19:05:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9899.8). Total num frames: 34074624. Throughput: 0: 10385.2. Samples: 34070760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:05:54,023][65744] Avg episode reward: [(0, '3733.883')] +[2023-03-11 19:05:54,180][66031] Updated weights for policy 0, policy_version 66560 (0.0004) +[2023-03-11 19:05:58,185][66031] Updated weights for policy 0, policy_version 66640 (0.0005) +[2023-03-11 19:05:59,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 9927.6). Total num frames: 34127872. Throughput: 0: 10444.8. Samples: 34102344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:05:59,023][65744] Avg episode reward: [(0, '3726.037')] +[2023-03-11 19:06:02,122][66031] Updated weights for policy 0, policy_version 66720 (0.0005) +[2023-03-11 19:06:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9927.6). Total num frames: 34177024. Throughput: 0: 10439.7. Samples: 34164156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:06:04,012][65744] Avg episode reward: [(0, '3956.460')] +[2023-03-11 19:06:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000066752_34177024.pth... +[2023-03-11 19:06:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000066144_33865728.pth +[2023-03-11 19:06:05,996][66031] Updated weights for policy 0, policy_version 66800 (0.0004) +[2023-03-11 19:06:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 9941.5). Total num frames: 34230272. Throughput: 0: 10474.7. Samples: 34227584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:06:09,012][65744] Avg episode reward: [(0, '3118.314')] +[2023-03-11 19:06:09,921][66031] Updated weights for policy 0, policy_version 66880 (0.0004) +[2023-03-11 19:06:13,859][66031] Updated weights for policy 0, policy_version 66960 (0.0004) +[2023-03-11 19:06:14,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 9955.4). Total num frames: 34283520. Throughput: 0: 10456.1. Samples: 34258492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:06:14,012][65744] Avg episode reward: [(0, '3610.419')] +[2023-03-11 19:06:17,793][66031] Updated weights for policy 0, policy_version 67040 (0.0005) +[2023-03-11 19:06:19,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 9983.1). Total num frames: 34336768. Throughput: 0: 10437.1. Samples: 34320712. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:06:19,012][65744] Avg episode reward: [(0, '3609.075')] +[2023-03-11 19:06:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000067064_34336768.pth... +[2023-03-11 19:06:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000066448_34021376.pth +[2023-03-11 19:06:21,669][66031] Updated weights for policy 0, policy_version 67120 (0.0004) +[2023-03-11 19:06:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 9997.0). Total num frames: 34390016. Throughput: 0: 10424.9. Samples: 34383888. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:06:24,012][65744] Avg episode reward: [(0, '4024.565')] +[2023-03-11 19:06:25,672][66031] Updated weights for policy 0, policy_version 67200 (0.0005) +[2023-03-11 19:06:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 9983.1). Total num frames: 34435072. Throughput: 0: 10401.4. Samples: 34414340. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:06:29,012][65744] Avg episode reward: [(0, '4153.887')] +[2023-03-11 19:06:29,893][66031] Updated weights for policy 0, policy_version 67280 (0.0005) +[2023-03-11 19:06:34,012][65744] Fps is (10 sec: 9420.7, 60 sec: 10308.3, 300 sec: 9983.1). Total num frames: 34484224. Throughput: 0: 10285.7. Samples: 34472008. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:06:34,012][65744] Avg episode reward: [(0, '4021.297')] +[2023-03-11 19:06:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000067352_34484224.pth... +[2023-03-11 19:06:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000066752_34177024.pth +[2023-03-11 19:06:34,112][66031] Updated weights for policy 0, policy_version 67360 (0.0005) +[2023-03-11 19:06:38,303][66031] Updated weights for policy 0, policy_version 67440 (0.0005) +[2023-03-11 19:06:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 9983.1). Total num frames: 34533376. Throughput: 0: 10221.8. Samples: 34530740. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:06:39,012][65744] Avg episode reward: [(0, '4012.126')] +[2023-03-11 19:06:42,457][66031] Updated weights for policy 0, policy_version 67520 (0.0005) +[2023-03-11 19:06:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9997.0). Total num frames: 34582528. Throughput: 0: 10185.4. Samples: 34560688. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:06:44,012][65744] Avg episode reward: [(0, '3752.506')] +[2023-03-11 19:06:46,622][66031] Updated weights for policy 0, policy_version 67600 (0.0005) +[2023-03-11 19:06:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10010.9). Total num frames: 34635776. Throughput: 0: 10120.8. Samples: 34619592. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:06:49,012][65744] Avg episode reward: [(0, '3989.158')] +[2023-03-11 19:06:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000067648_34635776.pth... +[2023-03-11 19:06:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000067064_34336768.pth +[2023-03-11 19:06:50,525][66031] Updated weights for policy 0, policy_version 67680 (0.0004) +[2023-03-11 19:06:54,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10010.9). Total num frames: 34684928. Throughput: 0: 10116.6. Samples: 34682832. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:06:54,012][65744] Avg episode reward: [(0, '3601.167')] +[2023-03-11 19:06:54,448][66031] Updated weights for policy 0, policy_version 67760 (0.0005) +[2023-03-11 19:06:58,323][66031] Updated weights for policy 0, policy_version 67840 (0.0004) +[2023-03-11 19:06:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10024.8). Total num frames: 34738176. Throughput: 0: 10124.2. Samples: 34714080. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:06:59,012][65744] Avg episode reward: [(0, '3633.544')] +[2023-03-11 19:07:02,207][66031] Updated weights for policy 0, policy_version 67920 (0.0004) +[2023-03-11 19:07:04,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10240.0, 300 sec: 10038.7). Total num frames: 34791424. Throughput: 0: 10150.8. Samples: 34777500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:07:04,012][65744] Avg episode reward: [(0, '4017.636')] +[2023-03-11 19:07:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000067952_34791424.pth... +[2023-03-11 19:07:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000067352_34484224.pth +[2023-03-11 19:07:06,096][66031] Updated weights for policy 0, policy_version 68000 (0.0005) +[2023-03-11 19:07:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10052.6). Total num frames: 34844672. Throughput: 0: 10148.8. Samples: 34840584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:07:09,012][65744] Avg episode reward: [(0, '3790.521')] +[2023-03-11 19:07:10,057][66031] Updated weights for policy 0, policy_version 68080 (0.0005) +[2023-03-11 19:07:13,945][66031] Updated weights for policy 0, policy_version 68160 (0.0004) +[2023-03-11 19:07:14,012][65744] Fps is (10 sec: 10649.7, 60 sec: 10240.0, 300 sec: 10066.4). Total num frames: 34897920. Throughput: 0: 10156.7. Samples: 34871392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:07:14,012][65744] Avg episode reward: [(0, '3438.279')] +[2023-03-11 19:07:17,857][66031] Updated weights for policy 0, policy_version 68240 (0.0004) +[2023-03-11 19:07:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 34947072. Throughput: 0: 10284.2. Samples: 34934796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:07:19,012][65744] Avg episode reward: [(0, '3819.051')] +[2023-03-11 19:07:19,048][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000068264_34951168.pth... +[2023-03-11 19:07:19,049][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000067648_34635776.pth +[2023-03-11 19:07:21,767][66031] Updated weights for policy 0, policy_version 68320 (0.0004) +[2023-03-11 19:07:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10080.3). Total num frames: 35000320. Throughput: 0: 10372.0. Samples: 34997480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:07:24,012][65744] Avg episode reward: [(0, '4128.890')] +[2023-03-11 19:07:25,662][66031] Updated weights for policy 0, policy_version 68400 (0.0004) +[2023-03-11 19:07:29,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10094.2). Total num frames: 35053568. Throughput: 0: 10408.2. Samples: 35029056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:07:29,012][65744] Avg episode reward: [(0, '3966.473')] +[2023-03-11 19:07:29,533][66031] Updated weights for policy 0, policy_version 68480 (0.0004) +[2023-03-11 19:07:33,494][66031] Updated weights for policy 0, policy_version 68560 (0.0004) +[2023-03-11 19:07:34,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10108.1). Total num frames: 35106816. Throughput: 0: 10492.3. Samples: 35091744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:07:34,012][65744] Avg episode reward: [(0, '4223.594')] +[2023-03-11 19:07:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000068568_35106816.pth... +[2023-03-11 19:07:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000067952_34791424.pth +[2023-03-11 19:07:37,379][66031] Updated weights for policy 0, policy_version 68640 (0.0004) +[2023-03-11 19:07:39,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10122.0). Total num frames: 35160064. Throughput: 0: 10497.3. Samples: 35155212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:07:39,012][65744] Avg episode reward: [(0, '4018.447')] +[2023-03-11 19:07:41,240][66031] Updated weights for policy 0, policy_version 68720 (0.0004) +[2023-03-11 19:07:44,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10149.7). Total num frames: 35213312. Throughput: 0: 10506.9. Samples: 35186892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:07:44,012][65744] Avg episode reward: [(0, '4352.926')] +[2023-03-11 19:07:45,081][66031] Updated weights for policy 0, policy_version 68800 (0.0004) +[2023-03-11 19:07:48,947][66031] Updated weights for policy 0, policy_version 68880 (0.0004) +[2023-03-11 19:07:49,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10163.6). Total num frames: 35266560. Throughput: 0: 10513.7. Samples: 35250616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:07:49,012][65744] Avg episode reward: [(0, '3665.006')] +[2023-03-11 19:07:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000068880_35266560.pth... +[2023-03-11 19:07:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000068264_34951168.pth +[2023-03-11 19:07:52,876][66031] Updated weights for policy 0, policy_version 68960 (0.0004) +[2023-03-11 19:07:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10163.6). Total num frames: 35315712. Throughput: 0: 10505.1. Samples: 35313316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:07:54,012][65744] Avg episode reward: [(0, '4159.384')] +[2023-03-11 19:07:56,807][66031] Updated weights for policy 0, policy_version 69040 (0.0005) +[2023-03-11 19:07:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10177.5). Total num frames: 35368960. Throughput: 0: 10514.4. Samples: 35344540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:07:59,012][65744] Avg episode reward: [(0, '3993.171')] +[2023-03-11 19:08:00,762][66031] Updated weights for policy 0, policy_version 69120 (0.0004) +[2023-03-11 19:08:04,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10191.4). Total num frames: 35422208. Throughput: 0: 10495.6. Samples: 35407100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:08:04,012][65744] Avg episode reward: [(0, '3755.159')] +[2023-03-11 19:08:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000069184_35422208.pth... +[2023-03-11 19:08:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000068568_35106816.pth +[2023-03-11 19:08:04,647][66031] Updated weights for policy 0, policy_version 69200 (0.0004) +[2023-03-11 19:08:08,571][66031] Updated weights for policy 0, policy_version 69280 (0.0004) +[2023-03-11 19:08:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10219.2). Total num frames: 35475456. Throughput: 0: 10498.9. Samples: 35469932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:08:09,012][65744] Avg episode reward: [(0, '4096.590')] +[2023-03-11 19:08:12,471][66031] Updated weights for policy 0, policy_version 69360 (0.0004) +[2023-03-11 19:08:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10219.2). Total num frames: 35524608. Throughput: 0: 10495.7. Samples: 35501364. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:08:14,012][65744] Avg episode reward: [(0, '4226.834')] +[2023-03-11 19:08:16,349][66031] Updated weights for policy 0, policy_version 69440 (0.0004) +[2023-03-11 19:08:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10513.0, 300 sec: 10233.1). Total num frames: 35577856. Throughput: 0: 10510.2. Samples: 35564704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:08:19,012][65744] Avg episode reward: [(0, '3674.332')] +[2023-03-11 19:08:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000069488_35577856.pth... +[2023-03-11 19:08:19,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000068880_35266560.pth +[2023-03-11 19:08:20,296][66031] Updated weights for policy 0, policy_version 69520 (0.0004) +[2023-03-11 19:08:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10246.9). Total num frames: 35631104. Throughput: 0: 10491.6. Samples: 35627336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:08:24,012][65744] Avg episode reward: [(0, '4333.684')] +[2023-03-11 19:08:24,196][66031] Updated weights for policy 0, policy_version 69600 (0.0004) +[2023-03-11 19:08:28,361][66031] Updated weights for policy 0, policy_version 69680 (0.0004) +[2023-03-11 19:08:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10233.1). Total num frames: 35680256. Throughput: 0: 10473.6. Samples: 35658204. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:08:29,012][65744] Avg episode reward: [(0, '3991.582')] +[2023-03-11 19:08:32,535][66031] Updated weights for policy 0, policy_version 69760 (0.0005) +[2023-03-11 19:08:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10219.2). Total num frames: 35729408. Throughput: 0: 10353.3. Samples: 35716516. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:08:34,012][65744] Avg episode reward: [(0, '3731.930')] +[2023-03-11 19:08:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000069784_35729408.pth... +[2023-03-11 19:08:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000069184_35422208.pth +[2023-03-11 19:08:36,659][66031] Updated weights for policy 0, policy_version 69840 (0.0005) +[2023-03-11 19:08:39,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10219.2). Total num frames: 35778560. Throughput: 0: 10259.7. Samples: 35775004. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:08:39,012][65744] Avg episode reward: [(0, '3323.445')] +[2023-03-11 19:08:40,863][66031] Updated weights for policy 0, policy_version 69920 (0.0005) +[2023-03-11 19:08:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 35827712. Throughput: 0: 10219.4. Samples: 35804412. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:08:44,012][65744] Avg episode reward: [(0, '3772.670')] +[2023-03-11 19:08:45,072][66031] Updated weights for policy 0, policy_version 70000 (0.0005) +[2023-03-11 19:08:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 35876864. Throughput: 0: 10146.0. Samples: 35863672. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:08:49,012][65744] Avg episode reward: [(0, '3786.864')] +[2023-03-11 19:08:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000070072_35876864.pth... +[2023-03-11 19:08:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000069488_35577856.pth +[2023-03-11 19:08:49,308][66031] Updated weights for policy 0, policy_version 70080 (0.0005) +[2023-03-11 19:08:53,496][66031] Updated weights for policy 0, policy_version 70160 (0.0005) +[2023-03-11 19:08:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 35926016. Throughput: 0: 10041.4. Samples: 35921796. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:08:54,012][65744] Avg episode reward: [(0, '4035.573')] +[2023-03-11 19:08:57,668][66031] Updated weights for policy 0, policy_version 70240 (0.0005) +[2023-03-11 19:08:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10219.2). Total num frames: 35975168. Throughput: 0: 9984.3. Samples: 35950656. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:08:59,012][65744] Avg episode reward: [(0, '4180.623')] +[2023-03-11 19:09:01,527][66031] Updated weights for policy 0, policy_version 70320 (0.0004) +[2023-03-11 19:09:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10219.2). Total num frames: 36028416. Throughput: 0: 9969.0. Samples: 36013308. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:09:04,012][65744] Avg episode reward: [(0, '4274.315')] +[2023-03-11 19:09:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000070368_36028416.pth... +[2023-03-11 19:09:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000069784_35729408.pth +[2023-03-11 19:09:05,439][66031] Updated weights for policy 0, policy_version 70400 (0.0004) +[2023-03-11 19:09:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 10233.1). Total num frames: 36081664. Throughput: 0: 9975.7. Samples: 36076244. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:09:09,012][65744] Avg episode reward: [(0, '3835.121')] +[2023-03-11 19:09:09,365][66031] Updated weights for policy 0, policy_version 70480 (0.0004) +[2023-03-11 19:09:13,612][66031] Updated weights for policy 0, policy_version 70560 (0.0005) +[2023-03-11 19:09:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 36126720. Throughput: 0: 9956.5. Samples: 36106248. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:09:14,012][65744] Avg episode reward: [(0, '4368.907')] +[2023-03-11 19:09:17,760][66031] Updated weights for policy 0, policy_version 70640 (0.0005) +[2023-03-11 19:09:19,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 36179968. Throughput: 0: 9955.5. Samples: 36164512. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:09:19,012][65744] Avg episode reward: [(0, '4286.328')] +[2023-03-11 19:09:19,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000070664_36179968.pth... +[2023-03-11 19:09:19,016][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000070072_35876864.pth +[2023-03-11 19:09:21,885][66031] Updated weights for policy 0, policy_version 70720 (0.0005) +[2023-03-11 19:09:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10219.2). Total num frames: 36229120. Throughput: 0: 9978.2. Samples: 36224024. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:09:24,012][65744] Avg episode reward: [(0, '4355.496')] +[2023-03-11 19:09:26,064][66031] Updated weights for policy 0, policy_version 70800 (0.0005) +[2023-03-11 19:09:29,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9898.7, 300 sec: 10205.3). Total num frames: 36274176. Throughput: 0: 9984.1. Samples: 36253696. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:09:29,012][65744] Avg episode reward: [(0, '4289.461')] +[2023-03-11 19:09:30,293][66031] Updated weights for policy 0, policy_version 70880 (0.0005) +[2023-03-11 19:09:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 10191.4). Total num frames: 36323328. Throughput: 0: 9943.1. Samples: 36311112. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:09:34,012][65744] Avg episode reward: [(0, '4417.642')] +[2023-03-11 19:09:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000070944_36323328.pth... +[2023-03-11 19:09:34,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000070368_36028416.pth +[2023-03-11 19:09:34,536][66031] Updated weights for policy 0, policy_version 70960 (0.0005) +[2023-03-11 19:09:38,742][66031] Updated weights for policy 0, policy_version 71040 (0.0005) +[2023-03-11 19:09:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10191.4). Total num frames: 36372480. Throughput: 0: 9954.4. Samples: 36369744. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:09:39,012][65744] Avg episode reward: [(0, '4381.735')] +[2023-03-11 19:09:42,722][66031] Updated weights for policy 0, policy_version 71120 (0.0004) +[2023-03-11 19:09:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10219.2). Total num frames: 36425728. Throughput: 0: 9982.4. Samples: 36399864. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:09:44,012][65744] Avg episode reward: [(0, '4443.433')] +[2023-03-11 19:09:44,013][65987] Saving new best policy, reward=4443.433! +[2023-03-11 19:09:46,685][66031] Updated weights for policy 0, policy_version 71200 (0.0005) +[2023-03-11 19:09:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10219.2). Total num frames: 36474880. Throughput: 0: 9980.8. Samples: 36462444. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:09:49,012][65744] Avg episode reward: [(0, '4360.474')] +[2023-03-11 19:09:49,043][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000071248_36478976.pth... +[2023-03-11 19:09:49,044][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000070664_36179968.pth +[2023-03-11 19:09:50,569][66031] Updated weights for policy 0, policy_version 71280 (0.0004) +[2023-03-11 19:09:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10233.1). Total num frames: 36528128. Throughput: 0: 9952.3. Samples: 36524096. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:09:54,012][65744] Avg episode reward: [(0, '4339.230')] +[2023-03-11 19:09:54,709][66031] Updated weights for policy 0, policy_version 71360 (0.0005) +[2023-03-11 19:09:58,923][66031] Updated weights for policy 0, policy_version 71440 (0.0005) +[2023-03-11 19:09:59,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10233.1). Total num frames: 36577280. Throughput: 0: 9922.7. Samples: 36552768. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:09:59,012][65744] Avg episode reward: [(0, '4176.132')] +[2023-03-11 19:10:03,119][66031] Updated weights for policy 0, policy_version 71520 (0.0005) +[2023-03-11 19:10:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10233.1). Total num frames: 36626432. Throughput: 0: 9930.5. Samples: 36611384. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:10:04,012][65744] Avg episode reward: [(0, '4198.893')] +[2023-03-11 19:10:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000071536_36626432.pth... +[2023-03-11 19:10:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000070944_36323328.pth +[2023-03-11 19:10:07,329][66031] Updated weights for policy 0, policy_version 71600 (0.0005) +[2023-03-11 19:10:09,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9830.4, 300 sec: 10219.2). Total num frames: 36671488. Throughput: 0: 9915.6. Samples: 36670228. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:10:09,012][65744] Avg episode reward: [(0, '4257.979')] +[2023-03-11 19:10:11,580][66031] Updated weights for policy 0, policy_version 71680 (0.0005) +[2023-03-11 19:10:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 10205.3). Total num frames: 36720640. Throughput: 0: 9901.2. Samples: 36699248. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:10:14,012][65744] Avg episode reward: [(0, '4036.603')] +[2023-03-11 19:10:15,808][66031] Updated weights for policy 0, policy_version 71760 (0.0005) +[2023-03-11 19:10:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10205.3). Total num frames: 36769792. Throughput: 0: 9919.7. Samples: 36757500. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:10:19,012][65744] Avg episode reward: [(0, '4078.255')] +[2023-03-11 19:10:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000071816_36769792.pth... +[2023-03-11 19:10:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000071248_36478976.pth +[2023-03-11 19:10:19,889][66031] Updated weights for policy 0, policy_version 71840 (0.0005) +[2023-03-11 19:10:23,811][66031] Updated weights for policy 0, policy_version 71920 (0.0004) +[2023-03-11 19:10:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10205.3). Total num frames: 36823040. Throughput: 0: 9985.1. Samples: 36819072. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:10:24,012][65744] Avg episode reward: [(0, '4137.402')] +[2023-03-11 19:10:27,713][66031] Updated weights for policy 0, policy_version 72000 (0.0004) +[2023-03-11 19:10:29,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 36876288. Throughput: 0: 10018.6. Samples: 36850700. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:10:29,012][65744] Avg episode reward: [(0, '3885.944')] +[2023-03-11 19:10:31,677][66031] Updated weights for policy 0, policy_version 72080 (0.0005) +[2023-03-11 19:10:34,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 36929536. Throughput: 0: 10017.1. Samples: 36913216. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:10:34,012][65744] Avg episode reward: [(0, '4209.191')] +[2023-03-11 19:10:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000072128_36929536.pth... +[2023-03-11 19:10:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000071536_36626432.pth +[2023-03-11 19:10:35,630][66031] Updated weights for policy 0, policy_version 72160 (0.0004) +[2023-03-11 19:10:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 36978688. Throughput: 0: 10014.8. Samples: 36974764. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:10:39,012][65744] Avg episode reward: [(0, '3335.130')] +[2023-03-11 19:10:39,565][66031] Updated weights for policy 0, policy_version 72240 (0.0004) +[2023-03-11 19:10:43,489][66031] Updated weights for policy 0, policy_version 72320 (0.0004) +[2023-03-11 19:10:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 37031936. Throughput: 0: 10085.0. Samples: 37006592. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:10:44,012][65744] Avg episode reward: [(0, '4116.150')] +[2023-03-11 19:10:47,452][66031] Updated weights for policy 0, policy_version 72400 (0.0005) +[2023-03-11 19:10:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 37081088. Throughput: 0: 10166.2. Samples: 37068864. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:10:49,012][65744] Avg episode reward: [(0, '3893.528')] +[2023-03-11 19:10:49,050][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000072432_37085184.pth... +[2023-03-11 19:10:49,052][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000071816_36769792.pth +[2023-03-11 19:10:51,416][66031] Updated weights for policy 0, policy_version 72480 (0.0005) +[2023-03-11 19:10:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 37134336. Throughput: 0: 10224.1. Samples: 37130312. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:10:54,012][65744] Avg episode reward: [(0, '4303.259')] +[2023-03-11 19:10:55,423][66031] Updated weights for policy 0, policy_version 72560 (0.0005) +[2023-03-11 19:10:59,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 37187584. Throughput: 0: 10274.6. Samples: 37161604. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:10:59,012][65744] Avg episode reward: [(0, '3870.400')] +[2023-03-11 19:10:59,373][66031] Updated weights for policy 0, policy_version 72640 (0.0004) +[2023-03-11 19:11:03,361][66031] Updated weights for policy 0, policy_version 72720 (0.0005) +[2023-03-11 19:11:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 37236736. Throughput: 0: 10355.2. Samples: 37223484. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:11:04,012][65744] Avg episode reward: [(0, '4213.150')] +[2023-03-11 19:11:04,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000072728_37236736.pth... +[2023-03-11 19:11:04,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000072128_36929536.pth +[2023-03-11 19:11:07,259][66031] Updated weights for policy 0, policy_version 72800 (0.0004) +[2023-03-11 19:11:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10191.4). Total num frames: 37289984. Throughput: 0: 10375.1. Samples: 37285952. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:11:09,012][65744] Avg episode reward: [(0, '4485.645')] +[2023-03-11 19:11:09,013][65987] Saving new best policy, reward=4485.645! +[2023-03-11 19:11:11,213][66031] Updated weights for policy 0, policy_version 72880 (0.0004) +[2023-03-11 19:11:14,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 37343232. Throughput: 0: 10360.0. Samples: 37316900. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:11:14,012][65744] Avg episode reward: [(0, '4219.818')] +[2023-03-11 19:11:15,144][66031] Updated weights for policy 0, policy_version 72960 (0.0004) +[2023-03-11 19:11:19,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10177.5). Total num frames: 37392384. Throughput: 0: 10374.8. Samples: 37380084. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:11:19,012][65744] Avg episode reward: [(0, '4179.814')] +[2023-03-11 19:11:19,033][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000073040_37396480.pth... +[2023-03-11 19:11:19,033][66031] Updated weights for policy 0, policy_version 73040 (0.0005) +[2023-03-11 19:11:19,034][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000072432_37085184.pth +[2023-03-11 19:11:22,937][66031] Updated weights for policy 0, policy_version 73120 (0.0004) +[2023-03-11 19:11:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10205.3). Total num frames: 37445632. Throughput: 0: 10401.8. Samples: 37442844. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:11:24,012][65744] Avg episode reward: [(0, '4303.869')] +[2023-03-11 19:11:26,845][66031] Updated weights for policy 0, policy_version 73200 (0.0004) +[2023-03-11 19:11:29,012][65744] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10219.2). Total num frames: 37498880. Throughput: 0: 10392.8. Samples: 37474268. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:11:29,012][65744] Avg episode reward: [(0, '3966.545')] +[2023-03-11 19:11:30,892][66031] Updated weights for policy 0, policy_version 73280 (0.0005) +[2023-03-11 19:11:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10219.2). Total num frames: 37548032. Throughput: 0: 10375.1. Samples: 37535744. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:11:34,012][65744] Avg episode reward: [(0, '4256.895')] +[2023-03-11 19:11:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000073336_37548032.pth... +[2023-03-11 19:11:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000072728_37236736.pth +[2023-03-11 19:11:34,840][66031] Updated weights for policy 0, policy_version 73360 (0.0004) +[2023-03-11 19:11:38,807][66031] Updated weights for policy 0, policy_version 73440 (0.0004) +[2023-03-11 19:11:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10233.1). Total num frames: 37601280. Throughput: 0: 10376.6. Samples: 37597260. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:11:39,012][65744] Avg episode reward: [(0, '4093.111')] +[2023-03-11 19:11:42,959][66031] Updated weights for policy 0, policy_version 73520 (0.0004) +[2023-03-11 19:11:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10219.2). Total num frames: 37650432. Throughput: 0: 10354.1. Samples: 37627540. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:11:44,012][65744] Avg episode reward: [(0, '4160.953')] +[2023-03-11 19:11:47,212][66031] Updated weights for policy 0, policy_version 73600 (0.0005) +[2023-03-11 19:11:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10308.3, 300 sec: 10219.2). Total num frames: 37699584. Throughput: 0: 10269.1. Samples: 37685596. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:11:49,012][65744] Avg episode reward: [(0, '4216.045')] +[2023-03-11 19:11:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000073632_37699584.pth... +[2023-03-11 19:11:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000073040_37396480.pth +[2023-03-11 19:11:51,496][66031] Updated weights for policy 0, policy_version 73680 (0.0005) +[2023-03-11 19:11:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 37744640. Throughput: 0: 10170.2. Samples: 37743612. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:11:54,012][65744] Avg episode reward: [(0, '4308.772')] +[2023-03-11 19:11:55,731][66031] Updated weights for policy 0, policy_version 73760 (0.0005) +[2023-03-11 19:11:59,012][65744] Fps is (10 sec: 9420.9, 60 sec: 10103.5, 300 sec: 10177.5). Total num frames: 37793792. Throughput: 0: 10122.0. Samples: 37772388. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:11:59,012][65744] Avg episode reward: [(0, '4052.506')] +[2023-03-11 19:12:00,012][66031] Updated weights for policy 0, policy_version 73840 (0.0005) +[2023-03-11 19:12:03,998][66031] Updated weights for policy 0, policy_version 73920 (0.0004) +[2023-03-11 19:12:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 37847040. Throughput: 0: 10015.6. Samples: 37830788. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:12:04,012][65744] Avg episode reward: [(0, '4152.312')] +[2023-03-11 19:12:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000073920_37847040.pth... +[2023-03-11 19:12:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000073336_37548032.pth +[2023-03-11 19:12:07,917][66031] Updated weights for policy 0, policy_version 74000 (0.0005) +[2023-03-11 19:12:09,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 37896192. Throughput: 0: 10016.4. Samples: 37893580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:12:09,012][65744] Avg episode reward: [(0, '3930.564')] +[2023-03-11 19:12:11,867][66031] Updated weights for policy 0, policy_version 74080 (0.0004) +[2023-03-11 19:12:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10177.5). Total num frames: 37949440. Throughput: 0: 10013.5. Samples: 37924876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:12:14,012][65744] Avg episode reward: [(0, '4245.031')] +[2023-03-11 19:12:16,009][66031] Updated weights for policy 0, policy_version 74160 (0.0005) +[2023-03-11 19:12:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10149.7). Total num frames: 37994496. Throughput: 0: 9946.0. Samples: 37983312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:12:19,012][65744] Avg episode reward: [(0, '3755.533')] +[2023-03-11 19:12:19,022][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000074216_37998592.pth... +[2023-03-11 19:12:19,023][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000073632_37699584.pth +[2023-03-11 19:12:20,320][66031] Updated weights for policy 0, policy_version 74240 (0.0005) +[2023-03-11 19:12:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 10135.9). Total num frames: 38043648. Throughput: 0: 9864.4. Samples: 38041160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:12:24,012][65744] Avg episode reward: [(0, '3627.701')] +[2023-03-11 19:12:24,572][66031] Updated weights for policy 0, policy_version 74320 (0.0005) +[2023-03-11 19:12:28,759][66031] Updated weights for policy 0, policy_version 74400 (0.0005) +[2023-03-11 19:12:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10122.0). Total num frames: 38092800. Throughput: 0: 9851.3. Samples: 38070848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:12:29,012][65744] Avg episode reward: [(0, '4414.269')] +[2023-03-11 19:12:33,015][66031] Updated weights for policy 0, policy_version 74480 (0.0005) +[2023-03-11 19:12:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 10108.1). Total num frames: 38141952. Throughput: 0: 9839.8. Samples: 38128388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:12:34,012][65744] Avg episode reward: [(0, '4257.522')] +[2023-03-11 19:12:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000074496_38141952.pth... +[2023-03-11 19:12:34,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000073920_37847040.pth +[2023-03-11 19:12:37,290][66031] Updated weights for policy 0, policy_version 74560 (0.0005) +[2023-03-11 19:12:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10094.2). Total num frames: 38191104. Throughput: 0: 9838.8. Samples: 38186360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:12:39,012][65744] Avg episode reward: [(0, '4558.147')] +[2023-03-11 19:12:39,013][65987] Saving new best policy, reward=4558.147! +[2023-03-11 19:12:41,475][66031] Updated weights for policy 0, policy_version 74640 (0.0005) +[2023-03-11 19:12:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10080.3). Total num frames: 38240256. Throughput: 0: 9851.3. Samples: 38215696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:12:44,012][65744] Avg episode reward: [(0, '4422.979')] +[2023-03-11 19:12:45,672][66031] Updated weights for policy 0, policy_version 74720 (0.0005) +[2023-03-11 19:12:49,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 10066.4). Total num frames: 38285312. Throughput: 0: 9841.8. Samples: 38273668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:12:49,012][65744] Avg episode reward: [(0, '4394.197')] +[2023-03-11 19:12:49,023][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000074784_38289408.pth... +[2023-03-11 19:12:49,024][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000074216_37998592.pth +[2023-03-11 19:12:49,840][66031] Updated weights for policy 0, policy_version 74800 (0.0005) +[2023-03-11 19:12:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 10052.6). Total num frames: 38334464. Throughput: 0: 9751.7. Samples: 38332408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:12:54,012][65744] Avg episode reward: [(0, '4314.026')] +[2023-03-11 19:12:54,097][66031] Updated weights for policy 0, policy_version 74880 (0.0005) +[2023-03-11 19:12:58,340][66031] Updated weights for policy 0, policy_version 74960 (0.0005) +[2023-03-11 19:12:59,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 10038.7). Total num frames: 38383616. Throughput: 0: 9693.1. Samples: 38361064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:12:59,012][65744] Avg episode reward: [(0, '4213.471')] +[2023-03-11 19:13:02,517][66031] Updated weights for policy 0, policy_version 75040 (0.0005) +[2023-03-11 19:13:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10024.8). Total num frames: 38432768. Throughput: 0: 9714.2. Samples: 38420452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:13:04,012][65744] Avg episode reward: [(0, '4312.687')] +[2023-03-11 19:13:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000075064_38432768.pth... +[2023-03-11 19:13:04,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000074496_38141952.pth +[2023-03-11 19:13:06,684][66031] Updated weights for policy 0, policy_version 75120 (0.0005) +[2023-03-11 19:13:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10024.8). Total num frames: 38481920. Throughput: 0: 9705.2. Samples: 38477896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:13:09,012][65744] Avg episode reward: [(0, '4500.114')] +[2023-03-11 19:13:10,915][66031] Updated weights for policy 0, policy_version 75200 (0.0005) +[2023-03-11 19:13:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 10010.9). Total num frames: 38531072. Throughput: 0: 9708.3. Samples: 38507720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:13:14,012][65744] Avg episode reward: [(0, '4499.574')] +[2023-03-11 19:13:14,901][66031] Updated weights for policy 0, policy_version 75280 (0.0005) +[2023-03-11 19:13:18,870][66031] Updated weights for policy 0, policy_version 75360 (0.0004) +[2023-03-11 19:13:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 10010.9). Total num frames: 38584320. Throughput: 0: 9805.3. Samples: 38569628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:13:19,012][65744] Avg episode reward: [(0, '4535.614')] +[2023-03-11 19:13:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000075360_38584320.pth... +[2023-03-11 19:13:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000074784_38289408.pth +[2023-03-11 19:13:22,952][66031] Updated weights for policy 0, policy_version 75440 (0.0005) +[2023-03-11 19:13:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 10010.9). Total num frames: 38633472. Throughput: 0: 9851.2. Samples: 38629664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:13:24,012][65744] Avg episode reward: [(0, '4463.172')] +[2023-03-11 19:13:27,250][66031] Updated weights for policy 0, policy_version 75520 (0.0005) +[2023-03-11 19:13:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10010.9). Total num frames: 38682624. Throughput: 0: 9835.2. Samples: 38658280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:13:29,012][65744] Avg episode reward: [(0, '4529.469')] +[2023-03-11 19:13:31,472][66031] Updated weights for policy 0, policy_version 75600 (0.0005) +[2023-03-11 19:13:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9997.0). Total num frames: 38727680. Throughput: 0: 9830.6. Samples: 38716044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:13:34,012][65744] Avg episode reward: [(0, '4360.585')] +[2023-03-11 19:13:34,031][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000075648_38731776.pth... +[2023-03-11 19:13:34,032][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000075064_38432768.pth +[2023-03-11 19:13:35,739][66031] Updated weights for policy 0, policy_version 75680 (0.0005) +[2023-03-11 19:13:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9997.0). Total num frames: 38776832. Throughput: 0: 9829.8. Samples: 38774748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:13:39,012][65744] Avg episode reward: [(0, '3666.313')] +[2023-03-11 19:13:39,873][66031] Updated weights for policy 0, policy_version 75760 (0.0005) +[2023-03-11 19:13:43,664][66031] Updated weights for policy 0, policy_version 75840 (0.0004) +[2023-03-11 19:13:44,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9830.4, 300 sec: 10010.9). Total num frames: 38830080. Throughput: 0: 9883.8. Samples: 38805836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:13:44,012][65744] Avg episode reward: [(0, '1070.332')] +[2023-03-11 19:13:47,609][66031] Updated weights for policy 0, policy_version 75920 (0.0004) +[2023-03-11 19:13:49,012][65744] Fps is (10 sec: 10649.6, 60 sec: 9966.9, 300 sec: 10024.8). Total num frames: 38883328. Throughput: 0: 9976.7. Samples: 38869404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:13:49,012][65744] Avg episode reward: [(0, '3569.180')] +[2023-03-11 19:13:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000075944_38883328.pth... +[2023-03-11 19:13:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000075360_38584320.pth +[2023-03-11 19:13:51,530][66031] Updated weights for policy 0, policy_version 76000 (0.0005) +[2023-03-11 19:13:54,012][65744] Fps is (10 sec: 10649.7, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 38936576. Throughput: 0: 10100.9. Samples: 38932436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:13:54,012][65744] Avg episode reward: [(0, '4124.751')] +[2023-03-11 19:13:55,424][66031] Updated weights for policy 0, policy_version 76080 (0.0004) +[2023-03-11 19:13:59,012][65744] Fps is (10 sec: 10649.7, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 38989824. Throughput: 0: 10133.2. Samples: 38963712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:13:59,012][65744] Avg episode reward: [(0, '4392.892')] +[2023-03-11 19:13:59,391][66031] Updated weights for policy 0, policy_version 76160 (0.0004) +[2023-03-11 19:14:03,313][66031] Updated weights for policy 0, policy_version 76240 (0.0004) +[2023-03-11 19:14:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10024.8). Total num frames: 39038976. Throughput: 0: 10134.6. Samples: 39025684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:14:04,012][65744] Avg episode reward: [(0, '4570.488')] +[2023-03-11 19:14:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000076248_39038976.pth... +[2023-03-11 19:14:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000075648_38731776.pth +[2023-03-11 19:14:04,019][65987] Saving new best policy, reward=4570.488! +[2023-03-11 19:14:07,343][66031] Updated weights for policy 0, policy_version 76320 (0.0004) +[2023-03-11 19:14:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 39092224. Throughput: 0: 10180.6. Samples: 39087792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:14:09,012][65744] Avg episode reward: [(0, '4415.096')] +[2023-03-11 19:14:11,318][66031] Updated weights for policy 0, policy_version 76400 (0.0004) +[2023-03-11 19:14:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10038.7). Total num frames: 39141376. Throughput: 0: 10217.0. Samples: 39118044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:14:14,012][65744] Avg episode reward: [(0, '4355.936')] +[2023-03-11 19:14:15,383][66031] Updated weights for policy 0, policy_version 76480 (0.0004) +[2023-03-11 19:14:19,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 39190528. Throughput: 0: 10266.9. Samples: 39178056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:14:19,012][65744] Avg episode reward: [(0, '4206.447')] +[2023-03-11 19:14:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000076544_39190528.pth... +[2023-03-11 19:14:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000075944_38883328.pth +[2023-03-11 19:14:19,685][66031] Updated weights for policy 0, policy_version 76560 (0.0005) +[2023-03-11 19:14:23,932][66031] Updated weights for policy 0, policy_version 76640 (0.0005) +[2023-03-11 19:14:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10052.6). Total num frames: 39239680. Throughput: 0: 10237.8. Samples: 39235448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:14:24,012][65744] Avg episode reward: [(0, '4309.799')] +[2023-03-11 19:14:28,153][66031] Updated weights for policy 0, policy_version 76720 (0.0005) +[2023-03-11 19:14:29,012][65744] Fps is (10 sec: 9420.9, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 39284736. Throughput: 0: 10188.5. Samples: 39264320. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:14:29,012][65744] Avg episode reward: [(0, '4313.124')] +[2023-03-11 19:14:32,456][66031] Updated weights for policy 0, policy_version 76800 (0.0005) +[2023-03-11 19:14:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 39333888. Throughput: 0: 10050.2. Samples: 39321664. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:14:34,012][65744] Avg episode reward: [(0, '4059.904')] +[2023-03-11 19:14:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000076824_39333888.pth... +[2023-03-11 19:14:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000076248_39038976.pth +[2023-03-11 19:14:36,567][66031] Updated weights for policy 0, policy_version 76880 (0.0005) +[2023-03-11 19:14:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10024.8). Total num frames: 39383040. Throughput: 0: 9975.3. Samples: 39381324. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:14:39,012][65744] Avg episode reward: [(0, '4253.937')] +[2023-03-11 19:14:40,773][66031] Updated weights for policy 0, policy_version 76960 (0.0005) +[2023-03-11 19:14:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10024.8). Total num frames: 39432192. Throughput: 0: 9931.7. Samples: 39410636. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:14:44,012][65744] Avg episode reward: [(0, '4392.447')] +[2023-03-11 19:14:44,927][66031] Updated weights for policy 0, policy_version 77040 (0.0004) +[2023-03-11 19:14:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10010.9). Total num frames: 39481344. Throughput: 0: 9855.6. Samples: 39469184. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:14:49,012][65744] Avg episode reward: [(0, '4202.741')] +[2023-03-11 19:14:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000077112_39481344.pth... +[2023-03-11 19:14:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000076544_39190528.pth +[2023-03-11 19:14:49,116][66031] Updated weights for policy 0, policy_version 77120 (0.0005) +[2023-03-11 19:14:53,240][66031] Updated weights for policy 0, policy_version 77200 (0.0005) +[2023-03-11 19:14:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10010.9). Total num frames: 39530496. Throughput: 0: 9791.6. Samples: 39528412. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:14:54,012][65744] Avg episode reward: [(0, '4276.433')] +[2023-03-11 19:14:57,463][66031] Updated weights for policy 0, policy_version 77280 (0.0005) +[2023-03-11 19:14:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10010.9). Total num frames: 39579648. Throughput: 0: 9778.7. Samples: 39558084. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:14:59,012][65744] Avg episode reward: [(0, '4305.770')] +[2023-03-11 19:15:01,712][66031] Updated weights for policy 0, policy_version 77360 (0.0005) +[2023-03-11 19:15:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 10024.8). Total num frames: 39628800. Throughput: 0: 9732.7. Samples: 39616028. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:15:04,012][65744] Avg episode reward: [(0, '4358.184')] +[2023-03-11 19:15:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000077400_39628800.pth... +[2023-03-11 19:15:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000076824_39333888.pth +[2023-03-11 19:15:05,920][66031] Updated weights for policy 0, policy_version 77440 (0.0005) +[2023-03-11 19:15:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10024.8). Total num frames: 39677952. Throughput: 0: 9743.8. Samples: 39673920. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:15:09,012][65744] Avg episode reward: [(0, '4340.100')] +[2023-03-11 19:15:10,100][66031] Updated weights for policy 0, policy_version 77520 (0.0005) +[2023-03-11 19:15:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 10024.8). Total num frames: 39727104. Throughput: 0: 9747.9. Samples: 39702976. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:15:14,012][65744] Avg episode reward: [(0, '4307.070')] +[2023-03-11 19:15:14,293][66031] Updated weights for policy 0, policy_version 77600 (0.0005) +[2023-03-11 19:15:18,525][66031] Updated weights for policy 0, policy_version 77680 (0.0005) +[2023-03-11 19:15:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10010.9). Total num frames: 39776256. Throughput: 0: 9785.6. Samples: 39762016. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:15:19,012][65744] Avg episode reward: [(0, '4341.149')] +[2023-03-11 19:15:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000077688_39776256.pth... +[2023-03-11 19:15:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000077112_39481344.pth +[2023-03-11 19:15:22,645][66031] Updated weights for policy 0, policy_version 77760 (0.0005) +[2023-03-11 19:15:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9997.0). Total num frames: 39825408. Throughput: 0: 9777.7. Samples: 39821320. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:15:24,012][65744] Avg episode reward: [(0, '4235.984')] +[2023-03-11 19:15:26,825][66031] Updated weights for policy 0, policy_version 77840 (0.0005) +[2023-03-11 19:15:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9983.1). Total num frames: 39874560. Throughput: 0: 9769.7. Samples: 39850272. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:15:29,012][65744] Avg episode reward: [(0, '4236.518')] +[2023-03-11 19:15:30,971][66031] Updated weights for policy 0, policy_version 77920 (0.0005) +[2023-03-11 19:15:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9983.1). Total num frames: 39923712. Throughput: 0: 9812.1. Samples: 39910728. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:15:34,012][65744] Avg episode reward: [(0, '4240.887')] +[2023-03-11 19:15:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000077976_39923712.pth... +[2023-03-11 19:15:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000077400_39628800.pth +[2023-03-11 19:15:34,841][66031] Updated weights for policy 0, policy_version 78000 (0.0005) +[2023-03-11 19:15:38,799][66031] Updated weights for policy 0, policy_version 78080 (0.0005) +[2023-03-11 19:15:39,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 9983.1). Total num frames: 39976960. Throughput: 0: 9882.1. Samples: 39973104. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:15:39,012][65744] Avg episode reward: [(0, '4329.189')] +[2023-03-11 19:15:42,725][66031] Updated weights for policy 0, policy_version 78160 (0.0004) +[2023-03-11 19:15:44,012][65744] Fps is (10 sec: 10649.6, 60 sec: 9966.9, 300 sec: 9997.0). Total num frames: 40030208. Throughput: 0: 9930.1. Samples: 40004940. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:15:44,012][65744] Avg episode reward: [(0, '3903.271')] +[2023-03-11 19:15:46,640][66031] Updated weights for policy 0, policy_version 78240 (0.0004) +[2023-03-11 19:15:49,012][65744] Fps is (10 sec: 10649.4, 60 sec: 10035.2, 300 sec: 9997.0). Total num frames: 40083456. Throughput: 0: 10027.5. Samples: 40067264. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:15:49,012][65744] Avg episode reward: [(0, '3561.151')] +[2023-03-11 19:15:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000078288_40083456.pth... +[2023-03-11 19:15:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000077688_39776256.pth +[2023-03-11 19:15:50,572][66031] Updated weights for policy 0, policy_version 78320 (0.0004) +[2023-03-11 19:15:54,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9983.1). Total num frames: 40132608. Throughput: 0: 10141.9. Samples: 40130304. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:15:54,012][65744] Avg episode reward: [(0, '4340.432')] +[2023-03-11 19:15:54,432][66031] Updated weights for policy 0, policy_version 78400 (0.0004) +[2023-03-11 19:15:58,306][66031] Updated weights for policy 0, policy_version 78480 (0.0004) +[2023-03-11 19:15:59,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 9997.0). Total num frames: 40185856. Throughput: 0: 10193.2. Samples: 40161672. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:15:59,012][65744] Avg episode reward: [(0, '4175.151')] +[2023-03-11 19:16:02,176][66031] Updated weights for policy 0, policy_version 78560 (0.0004) +[2023-03-11 19:16:04,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 9997.0). Total num frames: 40239104. Throughput: 0: 10302.0. Samples: 40225604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:16:04,012][65744] Avg episode reward: [(0, '3987.467')] +[2023-03-11 19:16:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000078592_40239104.pth... +[2023-03-11 19:16:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000077976_39923712.pth +[2023-03-11 19:16:06,121][66031] Updated weights for policy 0, policy_version 78640 (0.0004) +[2023-03-11 19:16:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 9997.0). Total num frames: 40292352. Throughput: 0: 10376.3. Samples: 40288256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:16:09,012][65744] Avg episode reward: [(0, '3848.518')] +[2023-03-11 19:16:09,954][66031] Updated weights for policy 0, policy_version 78720 (0.0004) +[2023-03-11 19:16:13,860][66031] Updated weights for policy 0, policy_version 78800 (0.0005) +[2023-03-11 19:16:14,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10010.9). Total num frames: 40345600. Throughput: 0: 10446.1. Samples: 40320348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:16:14,012][65744] Avg episode reward: [(0, '4091.753')] +[2023-03-11 19:16:17,770][66031] Updated weights for policy 0, policy_version 78880 (0.0004) +[2023-03-11 19:16:19,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10010.9). Total num frames: 40398848. Throughput: 0: 10495.3. Samples: 40383016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:16:19,015][65744] Avg episode reward: [(0, '4533.314')] +[2023-03-11 19:16:19,018][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000078904_40398848.pth... +[2023-03-11 19:16:19,020][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000078288_40083456.pth +[2023-03-11 19:16:21,698][66031] Updated weights for policy 0, policy_version 78960 (0.0004) +[2023-03-11 19:16:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10010.9). Total num frames: 40452096. Throughput: 0: 10518.0. Samples: 40446416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:16:24,012][65744] Avg episode reward: [(0, '4286.767')] +[2023-03-11 19:16:25,581][66031] Updated weights for policy 0, policy_version 79040 (0.0005) +[2023-03-11 19:16:29,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10010.9). Total num frames: 40501248. Throughput: 0: 10498.9. Samples: 40477392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:16:29,012][65744] Avg episode reward: [(0, '3781.876')] +[2023-03-11 19:16:29,582][66031] Updated weights for policy 0, policy_version 79120 (0.0004) +[2023-03-11 19:16:33,480][66031] Updated weights for policy 0, policy_version 79200 (0.0004) +[2023-03-11 19:16:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10010.9). Total num frames: 40554496. Throughput: 0: 10490.3. Samples: 40539328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:16:34,012][65744] Avg episode reward: [(0, '4101.022')] +[2023-03-11 19:16:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000079208_40554496.pth... +[2023-03-11 19:16:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000078592_40239104.pth +[2023-03-11 19:16:37,407][66031] Updated weights for policy 0, policy_version 79280 (0.0004) +[2023-03-11 19:16:39,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10513.0, 300 sec: 10024.8). Total num frames: 40607744. Throughput: 0: 10494.0. Samples: 40602536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:16:39,012][65744] Avg episode reward: [(0, '4497.110')] +[2023-03-11 19:16:41,292][66031] Updated weights for policy 0, policy_version 79360 (0.0004) +[2023-03-11 19:16:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10024.8). Total num frames: 40656896. Throughput: 0: 10494.1. Samples: 40633908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:16:44,012][65744] Avg episode reward: [(0, '4325.449')] +[2023-03-11 19:16:45,220][66031] Updated weights for policy 0, policy_version 79440 (0.0004) +[2023-03-11 19:16:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10052.6). Total num frames: 40710144. Throughput: 0: 10436.0. Samples: 40695224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:16:49,012][65744] Avg episode reward: [(0, '4281.023')] +[2023-03-11 19:16:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000079512_40710144.pth... +[2023-03-11 19:16:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000078904_40398848.pth +[2023-03-11 19:16:49,393][66031] Updated weights for policy 0, policy_version 79520 (0.0005) +[2023-03-11 19:16:53,513][66031] Updated weights for policy 0, policy_version 79600 (0.0005) +[2023-03-11 19:16:54,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10052.6). Total num frames: 40759296. Throughput: 0: 10370.2. Samples: 40754912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:16:54,012][65744] Avg episode reward: [(0, '4243.083')] +[2023-03-11 19:16:57,742][66031] Updated weights for policy 0, policy_version 79680 (0.0005) +[2023-03-11 19:16:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10308.3, 300 sec: 10024.8). Total num frames: 40804352. Throughput: 0: 10300.7. Samples: 40783880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:16:59,012][65744] Avg episode reward: [(0, '4392.626')] +[2023-03-11 19:17:01,836][66031] Updated weights for policy 0, policy_version 79760 (0.0005) +[2023-03-11 19:17:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10308.3, 300 sec: 10038.7). Total num frames: 40857600. Throughput: 0: 10231.3. Samples: 40843424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:17:04,012][65744] Avg episode reward: [(0, '4552.621')] +[2023-03-11 19:17:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000079800_40857600.pth... +[2023-03-11 19:17:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000079208_40554496.pth +[2023-03-11 19:17:05,798][66031] Updated weights for policy 0, policy_version 79840 (0.0005) +[2023-03-11 19:17:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10038.7). Total num frames: 40910848. Throughput: 0: 10200.5. Samples: 40905440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:17:09,012][65744] Avg episode reward: [(0, '4535.441')] +[2023-03-11 19:17:09,724][66031] Updated weights for policy 0, policy_version 79920 (0.0004) +[2023-03-11 19:17:13,688][66031] Updated weights for policy 0, policy_version 80000 (0.0005) +[2023-03-11 19:17:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10052.6). Total num frames: 40960000. Throughput: 0: 10200.6. Samples: 40936420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:17:14,012][65744] Avg episode reward: [(0, '4375.446')] +[2023-03-11 19:17:17,631][66031] Updated weights for policy 0, policy_version 80080 (0.0004) +[2023-03-11 19:17:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10066.4). Total num frames: 41013248. Throughput: 0: 10213.9. Samples: 40998956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:17:19,012][65744] Avg episode reward: [(0, '4316.334')] +[2023-03-11 19:17:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000080104_41013248.pth... +[2023-03-11 19:17:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000079512_40710144.pth +[2023-03-11 19:17:21,512][66031] Updated weights for policy 0, policy_version 80160 (0.0004) +[2023-03-11 19:17:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10080.3). Total num frames: 41066496. Throughput: 0: 10220.8. Samples: 41062472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:17:24,012][65744] Avg episode reward: [(0, '4163.814')] +[2023-03-11 19:17:25,425][66031] Updated weights for policy 0, policy_version 80240 (0.0004) +[2023-03-11 19:17:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10080.3). Total num frames: 41115648. Throughput: 0: 10199.5. Samples: 41092884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:17:29,023][65744] Avg episode reward: [(0, '4341.882')] +[2023-03-11 19:17:29,557][66031] Updated weights for policy 0, policy_version 80320 (0.0005) +[2023-03-11 19:17:33,755][66031] Updated weights for policy 0, policy_version 80400 (0.0005) +[2023-03-11 19:17:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10080.3). Total num frames: 41164800. Throughput: 0: 10162.0. Samples: 41152512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:17:34,012][65744] Avg episode reward: [(0, '4044.234')] +[2023-03-11 19:17:34,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000080400_41164800.pth... +[2023-03-11 19:17:34,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000079800_40857600.pth +[2023-03-11 19:17:37,909][66031] Updated weights for policy 0, policy_version 80480 (0.0005) +[2023-03-11 19:17:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10080.3). Total num frames: 41213952. Throughput: 0: 10138.0. Samples: 41211124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:17:39,012][65744] Avg episode reward: [(0, '4315.507')] +[2023-03-11 19:17:41,889][66031] Updated weights for policy 0, policy_version 80560 (0.0004) +[2023-03-11 19:17:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 41267200. Throughput: 0: 10189.4. Samples: 41242404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:17:44,012][65744] Avg episode reward: [(0, '4379.904')] +[2023-03-11 19:17:45,862][66031] Updated weights for policy 0, policy_version 80640 (0.0004) +[2023-03-11 19:17:49,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 41320448. Throughput: 0: 10236.6. Samples: 41304072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:17:49,012][65744] Avg episode reward: [(0, '4453.494')] +[2023-03-11 19:17:49,025][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000080704_41320448.pth... +[2023-03-11 19:17:49,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000080104_41013248.pth +[2023-03-11 19:17:49,795][66031] Updated weights for policy 0, policy_version 80720 (0.0004) +[2023-03-11 19:17:53,655][66031] Updated weights for policy 0, policy_version 80800 (0.0004) +[2023-03-11 19:17:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 41369600. Throughput: 0: 10269.9. Samples: 41367584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:17:54,012][65744] Avg episode reward: [(0, '4236.632')] +[2023-03-11 19:17:57,632][66031] Updated weights for policy 0, policy_version 80880 (0.0004) +[2023-03-11 19:17:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10135.9). Total num frames: 41422848. Throughput: 0: 10265.0. Samples: 41398344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:17:59,012][65744] Avg episode reward: [(0, '4367.671')] +[2023-03-11 19:18:01,562][66031] Updated weights for policy 0, policy_version 80960 (0.0005) +[2023-03-11 19:18:04,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10149.8). Total num frames: 41476096. Throughput: 0: 10263.5. Samples: 41460812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:18:04,012][65744] Avg episode reward: [(0, '4447.184')] +[2023-03-11 19:18:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000081008_41476096.pth... +[2023-03-11 19:18:04,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000080400_41164800.pth +[2023-03-11 19:18:05,476][66031] Updated weights for policy 0, policy_version 81040 (0.0005) +[2023-03-11 19:18:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10163.6). Total num frames: 41529344. Throughput: 0: 10257.8. Samples: 41524072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:18:09,012][65744] Avg episode reward: [(0, '4497.326')] +[2023-03-11 19:18:09,331][66031] Updated weights for policy 0, policy_version 81120 (0.0004) +[2023-03-11 19:18:13,245][66031] Updated weights for policy 0, policy_version 81200 (0.0004) +[2023-03-11 19:18:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10149.8). Total num frames: 41578496. Throughput: 0: 10271.7. Samples: 41555112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:18:14,012][65744] Avg episode reward: [(0, '4298.514')] +[2023-03-11 19:18:17,187][66031] Updated weights for policy 0, policy_version 81280 (0.0004) +[2023-03-11 19:18:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10163.6). Total num frames: 41631744. Throughput: 0: 10346.0. Samples: 41618084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:18:19,012][65744] Avg episode reward: [(0, '4516.243')] +[2023-03-11 19:18:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000081312_41631744.pth... +[2023-03-11 19:18:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000080704_41320448.pth +[2023-03-11 19:18:21,180][66031] Updated weights for policy 0, policy_version 81360 (0.0005) +[2023-03-11 19:18:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10177.5). Total num frames: 41684992. Throughput: 0: 10427.1. Samples: 41680344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:18:24,012][65744] Avg episode reward: [(0, '4519.031')] +[2023-03-11 19:18:25,118][66031] Updated weights for policy 0, policy_version 81440 (0.0004) +[2023-03-11 19:18:29,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10191.4). Total num frames: 41734144. Throughput: 0: 10403.4. Samples: 41710556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:18:29,012][65744] Avg episode reward: [(0, '4459.134')] +[2023-03-11 19:18:29,063][66031] Updated weights for policy 0, policy_version 81520 (0.0004) +[2023-03-11 19:18:33,042][66031] Updated weights for policy 0, policy_version 81600 (0.0003) +[2023-03-11 19:18:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10205.3). Total num frames: 41787392. Throughput: 0: 10424.6. Samples: 41773180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:18:34,012][65744] Avg episode reward: [(0, '4069.827')] +[2023-03-11 19:18:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000081616_41787392.pth... +[2023-03-11 19:18:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000081008_41476096.pth +[2023-03-11 19:18:37,001][66031] Updated weights for policy 0, policy_version 81680 (0.0004) +[2023-03-11 19:18:39,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10205.3). Total num frames: 41840640. Throughput: 0: 10399.6. Samples: 41835564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:18:39,012][65744] Avg episode reward: [(0, '3398.046')] +[2023-03-11 19:18:40,929][66031] Updated weights for policy 0, policy_version 81760 (0.0004) +[2023-03-11 19:18:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 41889792. Throughput: 0: 10397.8. Samples: 41866244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:18:44,012][65744] Avg episode reward: [(0, '4144.682')] +[2023-03-11 19:18:44,846][66031] Updated weights for policy 0, policy_version 81840 (0.0004) +[2023-03-11 19:18:48,771][66031] Updated weights for policy 0, policy_version 81920 (0.0004) +[2023-03-11 19:18:49,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 41943040. Throughput: 0: 10417.2. Samples: 41929588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:18:49,012][65744] Avg episode reward: [(0, '4126.050')] +[2023-03-11 19:18:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000081920_41943040.pth... +[2023-03-11 19:18:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000081312_41631744.pth +[2023-03-11 19:18:52,792][66031] Updated weights for policy 0, policy_version 82000 (0.0005) +[2023-03-11 19:18:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10177.5). Total num frames: 41992192. Throughput: 0: 10370.1. Samples: 41990728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:18:54,012][65744] Avg episode reward: [(0, '4000.890')] +[2023-03-11 19:18:56,847][66031] Updated weights for policy 0, policy_version 82080 (0.0004) +[2023-03-11 19:18:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 42045440. Throughput: 0: 10351.4. Samples: 42020928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:18:59,012][65744] Avg episode reward: [(0, '4147.311')] +[2023-03-11 19:19:00,805][66031] Updated weights for policy 0, policy_version 82160 (0.0004) +[2023-03-11 19:19:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10177.5). Total num frames: 42094592. Throughput: 0: 10317.4. Samples: 42082368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:19:04,012][65744] Avg episode reward: [(0, '4369.986')] +[2023-03-11 19:19:04,034][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000082224_42098688.pth... +[2023-03-11 19:19:04,036][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000081616_41787392.pth +[2023-03-11 19:19:04,845][66031] Updated weights for policy 0, policy_version 82240 (0.0005) +[2023-03-11 19:19:08,807][66031] Updated weights for policy 0, policy_version 82320 (0.0004) +[2023-03-11 19:19:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10191.4). Total num frames: 42147840. Throughput: 0: 10299.9. Samples: 42143840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:19:09,012][65744] Avg episode reward: [(0, '4311.988')] +[2023-03-11 19:19:12,818][66031] Updated weights for policy 0, policy_version 82400 (0.0005) +[2023-03-11 19:19:14,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10205.3). Total num frames: 42201088. Throughput: 0: 10316.5. Samples: 42174800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:19:14,012][65744] Avg episode reward: [(0, '4443.337')] +[2023-03-11 19:19:16,766][66031] Updated weights for policy 0, policy_version 82480 (0.0004) +[2023-03-11 19:19:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10205.3). Total num frames: 42250240. Throughput: 0: 10305.9. Samples: 42236948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:19:19,012][65744] Avg episode reward: [(0, '4396.745')] +[2023-03-11 19:19:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000082520_42250240.pth... +[2023-03-11 19:19:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000081920_41943040.pth +[2023-03-11 19:19:20,674][66031] Updated weights for policy 0, policy_version 82560 (0.0004) +[2023-03-11 19:19:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10233.1). Total num frames: 42303488. Throughput: 0: 10308.7. Samples: 42299456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:19:24,012][65744] Avg episode reward: [(0, '4197.179')] +[2023-03-11 19:19:24,617][66031] Updated weights for policy 0, policy_version 82640 (0.0004) +[2023-03-11 19:19:28,588][66031] Updated weights for policy 0, policy_version 82720 (0.0004) +[2023-03-11 19:19:29,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10246.9). Total num frames: 42356736. Throughput: 0: 10325.5. Samples: 42330892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:19:29,012][65744] Avg episode reward: [(0, '4328.650')] +[2023-03-11 19:19:32,595][66031] Updated weights for policy 0, policy_version 82800 (0.0004) +[2023-03-11 19:19:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10246.9). Total num frames: 42405888. Throughput: 0: 10273.1. Samples: 42391880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:19:34,012][65744] Avg episode reward: [(0, '3871.347')] +[2023-03-11 19:19:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000082824_42405888.pth... +[2023-03-11 19:19:34,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000082224_42098688.pth +[2023-03-11 19:19:36,665][66031] Updated weights for policy 0, policy_version 82880 (0.0004) +[2023-03-11 19:19:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 42455040. Throughput: 0: 10251.0. Samples: 42452024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:19:39,023][65744] Avg episode reward: [(0, '4061.566')] +[2023-03-11 19:19:40,880][66031] Updated weights for policy 0, policy_version 82960 (0.0005) +[2023-03-11 19:19:44,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 42504192. Throughput: 0: 10229.3. Samples: 42481244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:19:44,012][65744] Avg episode reward: [(0, '4051.494')] +[2023-03-11 19:19:45,089][66031] Updated weights for policy 0, policy_version 83040 (0.0005) +[2023-03-11 19:19:49,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10246.9). Total num frames: 42553344. Throughput: 0: 10160.2. Samples: 42539576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:19:49,012][65744] Avg episode reward: [(0, '3824.785')] +[2023-03-11 19:19:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000083112_42553344.pth... +[2023-03-11 19:19:49,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000082520_42250240.pth +[2023-03-11 19:19:49,257][66031] Updated weights for policy 0, policy_version 83120 (0.0005) +[2023-03-11 19:19:53,499][66031] Updated weights for policy 0, policy_version 83200 (0.0005) +[2023-03-11 19:19:54,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10246.9). Total num frames: 42602496. Throughput: 0: 10101.3. Samples: 42598400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:19:54,012][65744] Avg episode reward: [(0, '3909.334')] +[2023-03-11 19:19:57,693][66031] Updated weights for policy 0, policy_version 83280 (0.0005) +[2023-03-11 19:19:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10246.9). Total num frames: 42651648. Throughput: 0: 10050.5. Samples: 42627072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:19:59,012][65744] Avg episode reward: [(0, '3915.781')] +[2023-03-11 19:20:01,597][66031] Updated weights for policy 0, policy_version 83360 (0.0004) +[2023-03-11 19:20:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10246.9). Total num frames: 42700800. Throughput: 0: 10036.5. Samples: 42688592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:20:04,012][65744] Avg episode reward: [(0, '4161.654')] +[2023-03-11 19:20:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000083400_42700800.pth... +[2023-03-11 19:20:04,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000082824_42405888.pth +[2023-03-11 19:20:05,767][66031] Updated weights for policy 0, policy_version 83440 (0.0005) +[2023-03-11 19:20:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10246.9). Total num frames: 42749952. Throughput: 0: 9960.6. Samples: 42747684. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:20:09,012][65744] Avg episode reward: [(0, '4276.856')] +[2023-03-11 19:20:09,939][66031] Updated weights for policy 0, policy_version 83520 (0.0005) +[2023-03-11 19:20:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10246.9). Total num frames: 42799104. Throughput: 0: 9914.8. Samples: 42777056. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:20:14,012][65744] Avg episode reward: [(0, '4374.487')] +[2023-03-11 19:20:14,179][66031] Updated weights for policy 0, policy_version 83600 (0.0005) +[2023-03-11 19:20:18,512][66031] Updated weights for policy 0, policy_version 83680 (0.0005) +[2023-03-11 19:20:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10246.9). Total num frames: 42848256. Throughput: 0: 9843.1. Samples: 42834820. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:20:19,012][65744] Avg episode reward: [(0, '4186.429')] +[2023-03-11 19:20:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000083688_42848256.pth... +[2023-03-11 19:20:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000083112_42553344.pth +[2023-03-11 19:20:22,653][66031] Updated weights for policy 0, policy_version 83760 (0.0004) +[2023-03-11 19:20:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10246.9). Total num frames: 42897408. Throughput: 0: 9807.8. Samples: 42893376. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:20:24,012][65744] Avg episode reward: [(0, '4027.165')] +[2023-03-11 19:20:26,820][66031] Updated weights for policy 0, policy_version 83840 (0.0005) +[2023-03-11 19:20:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10246.9). Total num frames: 42946560. Throughput: 0: 9803.5. Samples: 42922400. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:20:29,012][65744] Avg episode reward: [(0, '3641.185')] +[2023-03-11 19:20:31,036][66031] Updated weights for policy 0, policy_version 83920 (0.0005) +[2023-03-11 19:20:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 10219.2). Total num frames: 42991616. Throughput: 0: 9797.4. Samples: 42980460. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:20:34,012][65744] Avg episode reward: [(0, '4098.817')] +[2023-03-11 19:20:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000083976_42995712.pth... +[2023-03-11 19:20:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000083400_42700800.pth +[2023-03-11 19:20:35,246][66031] Updated weights for policy 0, policy_version 84000 (0.0005) +[2023-03-11 19:20:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 10205.3). Total num frames: 43040768. Throughput: 0: 9800.9. Samples: 43039440. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:20:39,012][65744] Avg episode reward: [(0, '3929.047')] +[2023-03-11 19:20:39,425][66031] Updated weights for policy 0, policy_version 84080 (0.0004) +[2023-03-11 19:20:43,596][66031] Updated weights for policy 0, policy_version 84160 (0.0004) +[2023-03-11 19:20:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 10205.3). Total num frames: 43094016. Throughput: 0: 9825.1. Samples: 43069200. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:20:44,012][65744] Avg episode reward: [(0, '3919.869')] +[2023-03-11 19:20:47,790][66031] Updated weights for policy 0, policy_version 84240 (0.0005) +[2023-03-11 19:20:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10191.4). Total num frames: 43139072. Throughput: 0: 9751.7. Samples: 43127420. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:20:49,012][65744] Avg episode reward: [(0, '3624.708')] +[2023-03-11 19:20:49,064][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000084264_43143168.pth... +[2023-03-11 19:20:49,067][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000083688_42848256.pth +[2023-03-11 19:20:51,992][66031] Updated weights for policy 0, policy_version 84320 (0.0005) +[2023-03-11 19:20:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 10177.5). Total num frames: 43188224. Throughput: 0: 9731.7. Samples: 43185612. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:20:54,012][65744] Avg episode reward: [(0, '4035.887')] +[2023-03-11 19:20:56,189][66031] Updated weights for policy 0, policy_version 84400 (0.0005) +[2023-03-11 19:20:59,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 10163.6). Total num frames: 43237376. Throughput: 0: 9746.8. Samples: 43215660. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:20:59,012][65744] Avg episode reward: [(0, '4146.334')] +[2023-03-11 19:21:00,397][66031] Updated weights for policy 0, policy_version 84480 (0.0005) +[2023-03-11 19:21:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 10163.6). Total num frames: 43290624. Throughput: 0: 9766.3. Samples: 43274304. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:21:04,012][65744] Avg episode reward: [(0, '3404.635')] +[2023-03-11 19:21:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000084552_43290624.pth... +[2023-03-11 19:21:04,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000083976_42995712.pth +[2023-03-11 19:21:04,421][66031] Updated weights for policy 0, policy_version 84560 (0.0004) +[2023-03-11 19:21:08,651][66031] Updated weights for policy 0, policy_version 84640 (0.0005) +[2023-03-11 19:21:09,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 10135.9). Total num frames: 43335680. Throughput: 0: 9794.7. Samples: 43334136. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:21:09,012][65744] Avg episode reward: [(0, '3909.489')] +[2023-03-11 19:21:12,912][66031] Updated weights for policy 0, policy_version 84720 (0.0005) +[2023-03-11 19:21:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 10122.0). Total num frames: 43384832. Throughput: 0: 9790.2. Samples: 43362960. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:21:14,012][65744] Avg episode reward: [(0, '4129.960')] +[2023-03-11 19:21:17,112][66031] Updated weights for policy 0, policy_version 84800 (0.0005) +[2023-03-11 19:21:19,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 10108.1). Total num frames: 43433984. Throughput: 0: 9795.2. Samples: 43421244. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:21:19,012][65744] Avg episode reward: [(0, '3371.070')] +[2023-03-11 19:21:19,032][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000084832_43433984.pth... +[2023-03-11 19:21:19,035][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000084264_43143168.pth +[2023-03-11 19:21:21,320][66031] Updated weights for policy 0, policy_version 84880 (0.0005) +[2023-03-11 19:21:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10108.1). Total num frames: 43483136. Throughput: 0: 9786.5. Samples: 43479832. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:21:24,012][65744] Avg episode reward: [(0, '2653.512')] +[2023-03-11 19:21:25,304][66031] Updated weights for policy 0, policy_version 84960 (0.0004) +[2023-03-11 19:21:29,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 10108.1). Total num frames: 43536384. Throughput: 0: 9835.7. Samples: 43511808. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:21:29,012][65744] Avg episode reward: [(0, '3414.242')] +[2023-03-11 19:21:29,377][66031] Updated weights for policy 0, policy_version 85040 (0.0005) +[2023-03-11 19:21:33,516][66031] Updated weights for policy 0, policy_version 85120 (0.0005) +[2023-03-11 19:21:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10094.2). Total num frames: 43585536. Throughput: 0: 9856.4. Samples: 43570956. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:21:34,012][65744] Avg episode reward: [(0, '4127.192')] +[2023-03-11 19:21:34,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000085128_43585536.pth... +[2023-03-11 19:21:34,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000084552_43290624.pth +[2023-03-11 19:21:37,457][66031] Updated weights for policy 0, policy_version 85200 (0.0004) +[2023-03-11 19:21:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10108.1). Total num frames: 43638784. Throughput: 0: 9933.1. Samples: 43632600. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:21:39,012][65744] Avg episode reward: [(0, '4087.748')] +[2023-03-11 19:21:41,395][66031] Updated weights for policy 0, policy_version 85280 (0.0004) +[2023-03-11 19:21:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10094.2). Total num frames: 43687936. Throughput: 0: 9954.6. Samples: 43663616. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:21:44,012][65744] Avg episode reward: [(0, '4341.029')] +[2023-03-11 19:21:45,299][66031] Updated weights for policy 0, policy_version 85360 (0.0004) +[2023-03-11 19:21:49,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 43741184. Throughput: 0: 10061.7. Samples: 43727080. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:21:49,012][65744] Avg episode reward: [(0, '3563.900')] +[2023-03-11 19:21:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000085432_43741184.pth... +[2023-03-11 19:21:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000084832_43433984.pth +[2023-03-11 19:21:49,177][66031] Updated weights for policy 0, policy_version 85440 (0.0004) +[2023-03-11 19:21:53,022][66031] Updated weights for policy 0, policy_version 85520 (0.0004) +[2023-03-11 19:21:54,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 10135.9). Total num frames: 43794432. Throughput: 0: 10139.4. Samples: 43790408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:21:54,012][65744] Avg episode reward: [(0, '3058.782')] +[2023-03-11 19:21:56,848][66031] Updated weights for policy 0, policy_version 85600 (0.0004) +[2023-03-11 19:21:59,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 43847680. Throughput: 0: 10220.6. Samples: 43822888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:21:59,012][65744] Avg episode reward: [(0, '3631.261')] +[2023-03-11 19:22:00,809][66031] Updated weights for policy 0, policy_version 85680 (0.0005) +[2023-03-11 19:22:04,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 43900928. Throughput: 0: 10307.3. Samples: 43885072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:22:04,012][65744] Avg episode reward: [(0, '3020.198')] +[2023-03-11 19:22:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000085744_43900928.pth... +[2023-03-11 19:22:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000085128_43585536.pth +[2023-03-11 19:22:04,758][66031] Updated weights for policy 0, policy_version 85760 (0.0004) +[2023-03-11 19:22:08,648][66031] Updated weights for policy 0, policy_version 85840 (0.0004) +[2023-03-11 19:22:09,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 43950080. Throughput: 0: 10394.5. Samples: 43947584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:22:09,012][65744] Avg episode reward: [(0, '3646.710')] +[2023-03-11 19:22:12,556][66031] Updated weights for policy 0, policy_version 85920 (0.0004) +[2023-03-11 19:22:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10135.9). Total num frames: 44003328. Throughput: 0: 10381.4. Samples: 43978972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:22:14,012][65744] Avg episode reward: [(0, '3605.034')] +[2023-03-11 19:22:16,409][66031] Updated weights for policy 0, policy_version 86000 (0.0005) +[2023-03-11 19:22:19,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10135.9). Total num frames: 44056576. Throughput: 0: 10497.5. Samples: 44043344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:22:19,012][65744] Avg episode reward: [(0, '3930.713')] +[2023-03-11 19:22:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000086048_44056576.pth... +[2023-03-11 19:22:19,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000085432_43741184.pth +[2023-03-11 19:22:20,288][66031] Updated weights for policy 0, policy_version 86080 (0.0005) +[2023-03-11 19:22:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10149.7). Total num frames: 44109824. Throughput: 0: 10526.7. Samples: 44106300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:22:24,012][65744] Avg episode reward: [(0, '3478.054')] +[2023-03-11 19:22:24,125][66031] Updated weights for policy 0, policy_version 86160 (0.0004) +[2023-03-11 19:22:28,093][66031] Updated weights for policy 0, policy_version 86240 (0.0005) +[2023-03-11 19:22:29,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10163.6). Total num frames: 44163072. Throughput: 0: 10548.5. Samples: 44138300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:22:29,012][65744] Avg episode reward: [(0, '3973.097')] +[2023-03-11 19:22:32,055][66031] Updated weights for policy 0, policy_version 86320 (0.0005) +[2023-03-11 19:22:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10163.6). Total num frames: 44212224. Throughput: 0: 10508.2. Samples: 44199948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:22:34,012][65744] Avg episode reward: [(0, '3595.087')] +[2023-03-11 19:22:34,034][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000086360_44216320.pth... +[2023-03-11 19:22:34,036][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000085744_43900928.pth +[2023-03-11 19:22:36,010][66031] Updated weights for policy 0, policy_version 86400 (0.0005) +[2023-03-11 19:22:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10163.6). Total num frames: 44265472. Throughput: 0: 10479.1. Samples: 44261968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:22:39,012][65744] Avg episode reward: [(0, '4023.931')] +[2023-03-11 19:22:39,919][66031] Updated weights for policy 0, policy_version 86480 (0.0005) +[2023-03-11 19:22:43,811][66031] Updated weights for policy 0, policy_version 86560 (0.0004) +[2023-03-11 19:22:44,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10163.6). Total num frames: 44318720. Throughput: 0: 10466.0. Samples: 44293860. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:22:44,012][65744] Avg episode reward: [(0, '3640.511')] +[2023-03-11 19:22:47,724][66031] Updated weights for policy 0, policy_version 86640 (0.0005) +[2023-03-11 19:22:49,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10177.5). Total num frames: 44371968. Throughput: 0: 10475.8. Samples: 44356484. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:22:49,012][65744] Avg episode reward: [(0, '3890.894')] +[2023-03-11 19:22:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000086664_44371968.pth... +[2023-03-11 19:22:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000086048_44056576.pth +[2023-03-11 19:22:51,681][66031] Updated weights for policy 0, policy_version 86720 (0.0005) +[2023-03-11 19:22:54,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10177.5). Total num frames: 44425216. Throughput: 0: 10484.4. Samples: 44419384. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:22:54,023][65744] Avg episode reward: [(0, '4078.187')] +[2023-03-11 19:22:55,568][66031] Updated weights for policy 0, policy_version 86800 (0.0004) +[2023-03-11 19:22:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10163.6). Total num frames: 44474368. Throughput: 0: 10473.1. Samples: 44450260. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:22:59,012][65744] Avg episode reward: [(0, '3818.532')] +[2023-03-11 19:22:59,494][66031] Updated weights for policy 0, policy_version 86880 (0.0005) +[2023-03-11 19:23:03,376][66031] Updated weights for policy 0, policy_version 86960 (0.0004) +[2023-03-11 19:23:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10163.6). Total num frames: 44527616. Throughput: 0: 10464.7. Samples: 44514256. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:23:04,012][65744] Avg episode reward: [(0, '4047.219')] +[2023-03-11 19:23:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000086968_44527616.pth... +[2023-03-11 19:23:04,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000086360_44216320.pth +[2023-03-11 19:23:07,238][66031] Updated weights for policy 0, policy_version 87040 (0.0005) +[2023-03-11 19:23:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10177.5). Total num frames: 44580864. Throughput: 0: 10456.4. Samples: 44576840. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:23:09,012][65744] Avg episode reward: [(0, '4105.628')] +[2023-03-11 19:23:11,199][66031] Updated weights for policy 0, policy_version 87120 (0.0005) +[2023-03-11 19:23:14,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10177.5). Total num frames: 44634112. Throughput: 0: 10438.3. Samples: 44608024. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:23:14,012][65744] Avg episode reward: [(0, '4266.270')] +[2023-03-11 19:23:15,163][66031] Updated weights for policy 0, policy_version 87200 (0.0004) +[2023-03-11 19:23:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10163.6). Total num frames: 44683264. Throughput: 0: 10467.3. Samples: 44670976. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:23:19,012][65744] Avg episode reward: [(0, '4008.475')] +[2023-03-11 19:23:19,056][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000087280_44687360.pth... +[2023-03-11 19:23:19,056][66031] Updated weights for policy 0, policy_version 87280 (0.0005) +[2023-03-11 19:23:19,057][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000086664_44371968.pth +[2023-03-11 19:23:22,974][66031] Updated weights for policy 0, policy_version 87360 (0.0005) +[2023-03-11 19:23:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10177.5). Total num frames: 44736512. Throughput: 0: 10465.3. Samples: 44732908. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:23:24,012][65744] Avg episode reward: [(0, '3893.883')] +[2023-03-11 19:23:26,851][66031] Updated weights for policy 0, policy_version 87440 (0.0004) +[2023-03-11 19:23:29,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10177.5). Total num frames: 44789760. Throughput: 0: 10475.3. Samples: 44765248. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:23:29,012][65744] Avg episode reward: [(0, '4135.459')] +[2023-03-11 19:23:30,841][66031] Updated weights for policy 0, policy_version 87520 (0.0005) +[2023-03-11 19:23:34,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10177.5). Total num frames: 44843008. Throughput: 0: 10450.8. Samples: 44826768. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:23:34,012][65744] Avg episode reward: [(0, '4050.114')] +[2023-03-11 19:23:34,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000087584_44843008.pth... +[2023-03-11 19:23:34,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000086968_44527616.pth +[2023-03-11 19:23:34,791][66031] Updated weights for policy 0, policy_version 87600 (0.0004) +[2023-03-11 19:23:38,702][66031] Updated weights for policy 0, policy_version 87680 (0.0005) +[2023-03-11 19:23:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10177.5). Total num frames: 44892160. Throughput: 0: 10450.0. Samples: 44889632. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:23:39,012][65744] Avg episode reward: [(0, '4143.695')] +[2023-03-11 19:23:42,569][66031] Updated weights for policy 0, policy_version 87760 (0.0004) +[2023-03-11 19:23:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10177.5). Total num frames: 44945408. Throughput: 0: 10458.6. Samples: 44920896. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:23:44,012][65744] Avg episode reward: [(0, '4479.344')] +[2023-03-11 19:23:46,478][66031] Updated weights for policy 0, policy_version 87840 (0.0005) +[2023-03-11 19:23:49,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10191.4). Total num frames: 44998656. Throughput: 0: 10446.0. Samples: 44984324. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:23:49,012][65744] Avg episode reward: [(0, '4012.430')] +[2023-03-11 19:23:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000087888_44998656.pth... +[2023-03-11 19:23:49,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000087280_44687360.pth +[2023-03-11 19:23:50,350][66031] Updated weights for policy 0, policy_version 87920 (0.0004) +[2023-03-11 19:23:54,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10191.4). Total num frames: 45051904. Throughput: 0: 10458.1. Samples: 45047456. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:23:54,012][65744] Avg episode reward: [(0, '4169.448')] +[2023-03-11 19:23:54,290][66031] Updated weights for policy 0, policy_version 88000 (0.0005) +[2023-03-11 19:23:58,230][66031] Updated weights for policy 0, policy_version 88080 (0.0005) +[2023-03-11 19:23:59,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10191.4). Total num frames: 45101056. Throughput: 0: 10454.0. Samples: 45078452. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:23:59,012][65744] Avg episode reward: [(0, '4166.723')] +[2023-03-11 19:24:02,437][66031] Updated weights for policy 0, policy_version 88160 (0.0006) +[2023-03-11 19:24:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10177.5). Total num frames: 45150208. Throughput: 0: 10377.9. Samples: 45137984. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:24:04,012][65744] Avg episode reward: [(0, '4178.510')] +[2023-03-11 19:24:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000088184_45150208.pth... +[2023-03-11 19:24:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000087584_44843008.pth +[2023-03-11 19:24:06,459][66031] Updated weights for policy 0, policy_version 88240 (0.0005) +[2023-03-11 19:24:09,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10177.5). Total num frames: 45203456. Throughput: 0: 10367.2. Samples: 45199432. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:24:09,023][65744] Avg episode reward: [(0, '4175.271')] +[2023-03-11 19:24:10,353][66031] Updated weights for policy 0, policy_version 88320 (0.0005) +[2023-03-11 19:24:14,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 45256704. Throughput: 0: 10358.6. Samples: 45231384. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:24:14,012][65744] Avg episode reward: [(0, '3962.225')] +[2023-03-11 19:24:14,269][66031] Updated weights for policy 0, policy_version 88400 (0.0005) +[2023-03-11 19:24:18,135][66031] Updated weights for policy 0, policy_version 88480 (0.0004) +[2023-03-11 19:24:19,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10191.4). Total num frames: 45309952. Throughput: 0: 10384.4. Samples: 45294064. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:24:19,012][65744] Avg episode reward: [(0, '3951.973')] +[2023-03-11 19:24:19,027][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000088496_45309952.pth... +[2023-03-11 19:24:19,030][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000087888_44998656.pth +[2023-03-11 19:24:22,032][66031] Updated weights for policy 0, policy_version 88560 (0.0005) +[2023-03-11 19:24:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10191.4). Total num frames: 45363200. Throughput: 0: 10411.6. Samples: 45358156. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:24:24,012][65744] Avg episode reward: [(0, '3615.535')] +[2023-03-11 19:24:25,889][66031] Updated weights for policy 0, policy_version 88640 (0.0005) +[2023-03-11 19:24:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 45412352. Throughput: 0: 10411.5. Samples: 45389412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:24:29,012][65744] Avg episode reward: [(0, '3403.259')] +[2023-03-11 19:24:29,841][66031] Updated weights for policy 0, policy_version 88720 (0.0005) +[2023-03-11 19:24:33,920][66031] Updated weights for policy 0, policy_version 88800 (0.0005) +[2023-03-11 19:24:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10205.3). Total num frames: 45465600. Throughput: 0: 10369.9. Samples: 45450968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:24:34,012][65744] Avg episode reward: [(0, '4305.958')] +[2023-03-11 19:24:34,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000088800_45465600.pth... +[2023-03-11 19:24:34,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000088184_45150208.pth +[2023-03-11 19:24:38,088][66031] Updated weights for policy 0, policy_version 88880 (0.0005) +[2023-03-11 19:24:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10205.3). Total num frames: 45514752. Throughput: 0: 10290.6. Samples: 45510532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:24:39,012][65744] Avg episode reward: [(0, '3776.030')] +[2023-03-11 19:24:42,247][66031] Updated weights for policy 0, policy_version 88960 (0.0005) +[2023-03-11 19:24:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10205.3). Total num frames: 45563904. Throughput: 0: 10243.1. Samples: 45539392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:24:44,012][65744] Avg episode reward: [(0, '3839.314')] +[2023-03-11 19:24:46,405][66031] Updated weights for policy 0, policy_version 89040 (0.0005) +[2023-03-11 19:24:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 10205.3). Total num frames: 45613056. Throughput: 0: 10242.0. Samples: 45598876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:24:49,012][65744] Avg episode reward: [(0, '3897.411')] +[2023-03-11 19:24:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000089088_45613056.pth... +[2023-03-11 19:24:49,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000088496_45309952.pth +[2023-03-11 19:24:50,647][66031] Updated weights for policy 0, policy_version 89120 (0.0005) +[2023-03-11 19:24:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 45662208. Throughput: 0: 10175.4. Samples: 45657324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:24:54,012][65744] Avg episode reward: [(0, '3584.018')] +[2023-03-11 19:24:54,796][66031] Updated weights for policy 0, policy_version 89200 (0.0005) +[2023-03-11 19:24:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 45707264. Throughput: 0: 10120.2. Samples: 45686792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:24:59,012][65744] Avg episode reward: [(0, '2985.299')] +[2023-03-11 19:24:59,036][66031] Updated weights for policy 0, policy_version 89280 (0.0005) +[2023-03-11 19:25:03,320][66031] Updated weights for policy 0, policy_version 89360 (0.0006) +[2023-03-11 19:25:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 45756416. Throughput: 0: 10001.6. Samples: 45744136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:25:04,012][65744] Avg episode reward: [(0, '3066.096')] +[2023-03-11 19:25:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000089368_45756416.pth... +[2023-03-11 19:25:04,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000088800_45465600.pth +[2023-03-11 19:25:07,516][66031] Updated weights for policy 0, policy_version 89440 (0.0005) +[2023-03-11 19:25:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 45805568. Throughput: 0: 9871.5. Samples: 45802372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:25:09,012][65744] Avg episode reward: [(0, '3507.156')] +[2023-03-11 19:25:11,752][66031] Updated weights for policy 0, policy_version 89520 (0.0005) +[2023-03-11 19:25:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 10191.4). Total num frames: 45854720. Throughput: 0: 9815.6. Samples: 45831112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:25:14,012][65744] Avg episode reward: [(0, '2897.380')] +[2023-03-11 19:25:15,795][66031] Updated weights for policy 0, policy_version 89600 (0.0005) +[2023-03-11 19:25:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10205.3). Total num frames: 45907968. Throughput: 0: 9803.2. Samples: 45892112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:25:19,012][65744] Avg episode reward: [(0, '4083.280')] +[2023-03-11 19:25:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000089664_45907968.pth... +[2023-03-11 19:25:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000089088_45613056.pth +[2023-03-11 19:25:19,691][66031] Updated weights for policy 0, policy_version 89680 (0.0005) +[2023-03-11 19:25:23,539][66031] Updated weights for policy 0, policy_version 89760 (0.0004) +[2023-03-11 19:25:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 9966.9, 300 sec: 10219.2). Total num frames: 45961216. Throughput: 0: 9900.9. Samples: 45956072. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:25:24,012][65744] Avg episode reward: [(0, '3891.053')] +[2023-03-11 19:25:27,628][66031] Updated weights for policy 0, policy_version 89840 (0.0005) +[2023-03-11 19:25:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10233.1). Total num frames: 46010368. Throughput: 0: 9924.3. Samples: 45985984. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:25:29,012][65744] Avg episode reward: [(0, '3878.649')] +[2023-03-11 19:25:31,792][66031] Updated weights for policy 0, policy_version 89920 (0.0005) +[2023-03-11 19:25:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 10233.1). Total num frames: 46059520. Throughput: 0: 9927.8. Samples: 46045628. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:25:34,012][65744] Avg episode reward: [(0, '3995.179')] +[2023-03-11 19:25:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000089960_46059520.pth... +[2023-03-11 19:25:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000089368_45756416.pth +[2023-03-11 19:25:35,963][66031] Updated weights for policy 0, policy_version 90000 (0.0005) +[2023-03-11 19:25:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10219.2). Total num frames: 46108672. Throughput: 0: 9940.4. Samples: 46104640. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:25:39,012][65744] Avg episode reward: [(0, '4308.994')] +[2023-03-11 19:25:40,160][66031] Updated weights for policy 0, policy_version 90080 (0.0005) +[2023-03-11 19:25:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10233.1). Total num frames: 46157824. Throughput: 0: 9923.4. Samples: 46133344. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:25:44,012][65744] Avg episode reward: [(0, '4132.479')] +[2023-03-11 19:25:44,198][66031] Updated weights for policy 0, policy_version 90160 (0.0005) +[2023-03-11 19:25:48,188][66031] Updated weights for policy 0, policy_version 90240 (0.0004) +[2023-03-11 19:25:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10246.9). Total num frames: 46211072. Throughput: 0: 10025.9. Samples: 46195300. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:25:49,012][65744] Avg episode reward: [(0, '4099.838')] +[2023-03-11 19:25:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000090256_46211072.pth... +[2023-03-11 19:25:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000089664_45907968.pth +[2023-03-11 19:25:52,209][66031] Updated weights for policy 0, policy_version 90320 (0.0003) +[2023-03-11 19:25:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10246.9). Total num frames: 46260224. Throughput: 0: 10084.9. Samples: 46256192. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:25:54,012][65744] Avg episode reward: [(0, '4469.989')] +[2023-03-11 19:25:56,275][66031] Updated weights for policy 0, policy_version 90400 (0.0003) +[2023-03-11 19:25:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10233.1). Total num frames: 46309376. Throughput: 0: 10122.0. Samples: 46286604. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:25:59,012][65744] Avg episode reward: [(0, '4521.179')] +[2023-03-11 19:26:00,232][66031] Updated weights for policy 0, policy_version 90480 (0.0003) +[2023-03-11 19:26:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10260.8). Total num frames: 46362624. Throughput: 0: 10128.3. Samples: 46347888. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:26:04,012][65744] Avg episode reward: [(0, '4482.238')] +[2023-03-11 19:26:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000090552_46362624.pth... +[2023-03-11 19:26:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000089960_46059520.pth +[2023-03-11 19:26:04,363][66031] Updated weights for policy 0, policy_version 90560 (0.0003) +[2023-03-11 19:26:08,402][66031] Updated weights for policy 0, policy_version 90640 (0.0003) +[2023-03-11 19:26:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10260.8). Total num frames: 46411776. Throughput: 0: 10039.6. Samples: 46407856. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:26:09,012][65744] Avg episode reward: [(0, '4326.591')] +[2023-03-11 19:26:12,394][66031] Updated weights for policy 0, policy_version 90720 (0.0004) +[2023-03-11 19:26:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10274.7). Total num frames: 46465024. Throughput: 0: 10065.0. Samples: 46438908. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:26:14,012][65744] Avg episode reward: [(0, '4141.119')] +[2023-03-11 19:26:16,393][66031] Updated weights for policy 0, policy_version 90800 (0.0004) +[2023-03-11 19:26:19,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10274.7). Total num frames: 46514176. Throughput: 0: 10088.3. Samples: 46499600. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:26:19,012][65744] Avg episode reward: [(0, '3529.972')] +[2023-03-11 19:26:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000090848_46514176.pth... +[2023-03-11 19:26:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000090256_46211072.pth +[2023-03-11 19:26:20,548][66031] Updated weights for policy 0, policy_version 90880 (0.0005) +[2023-03-11 19:26:24,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10260.8). Total num frames: 46563328. Throughput: 0: 10093.5. Samples: 46558848. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:26:24,012][65744] Avg episode reward: [(0, '4380.000')] +[2023-03-11 19:26:24,812][66031] Updated weights for policy 0, policy_version 90960 (0.0005) +[2023-03-11 19:26:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 10246.9). Total num frames: 46608384. Throughput: 0: 10101.6. Samples: 46587916. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:26:29,012][65744] Avg episode reward: [(0, '4451.873')] +[2023-03-11 19:26:29,016][66031] Updated weights for policy 0, policy_version 91040 (0.0005) +[2023-03-11 19:26:33,283][66031] Updated weights for policy 0, policy_version 91120 (0.0005) +[2023-03-11 19:26:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 10233.1). Total num frames: 46657536. Throughput: 0: 10000.3. Samples: 46645312. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:26:34,012][65744] Avg episode reward: [(0, '4304.011')] +[2023-03-11 19:26:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000091128_46657536.pth... +[2023-03-11 19:26:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000090552_46362624.pth +[2023-03-11 19:26:37,283][66031] Updated weights for policy 0, policy_version 91200 (0.0004) +[2023-03-11 19:26:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10246.9). Total num frames: 46710784. Throughput: 0: 10011.2. Samples: 46706696. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:26:39,012][65744] Avg episode reward: [(0, '4472.924')] +[2023-03-11 19:26:41,257][66031] Updated weights for policy 0, policy_version 91280 (0.0004) +[2023-03-11 19:26:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10233.1). Total num frames: 46759936. Throughput: 0: 10012.1. Samples: 46737148. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:26:44,012][65744] Avg episode reward: [(0, '3837.575')] +[2023-03-11 19:26:45,344][66031] Updated weights for policy 0, policy_version 91360 (0.0005) +[2023-03-11 19:26:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 10219.2). Total num frames: 46809088. Throughput: 0: 9977.2. Samples: 46796864. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:26:49,012][65744] Avg episode reward: [(0, '4094.401')] +[2023-03-11 19:26:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000091424_46809088.pth... +[2023-03-11 19:26:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000090848_46514176.pth +[2023-03-11 19:26:49,604][66031] Updated weights for policy 0, policy_version 91440 (0.0005) +[2023-03-11 19:26:53,856][66031] Updated weights for policy 0, policy_version 91520 (0.0005) +[2023-03-11 19:26:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10205.3). Total num frames: 46858240. Throughput: 0: 9918.9. Samples: 46854208. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:26:54,012][65744] Avg episode reward: [(0, '3929.908')] +[2023-03-11 19:26:58,151][66031] Updated weights for policy 0, policy_version 91600 (0.0005) +[2023-03-11 19:26:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10191.4). Total num frames: 46907392. Throughput: 0: 9866.1. Samples: 46882880. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:26:59,012][65744] Avg episode reward: [(0, '4038.654')] +[2023-03-11 19:27:02,074][66031] Updated weights for policy 0, policy_version 91680 (0.0004) +[2023-03-11 19:27:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 10205.3). Total num frames: 46960640. Throughput: 0: 9881.4. Samples: 46944264. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:27:04,012][65744] Avg episode reward: [(0, '4105.596')] +[2023-03-11 19:27:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000091720_46960640.pth... +[2023-03-11 19:27:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000091128_46657536.pth +[2023-03-11 19:27:05,976][66031] Updated weights for policy 0, policy_version 91760 (0.0004) +[2023-03-11 19:27:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10191.4). Total num frames: 47009792. Throughput: 0: 9959.2. Samples: 47007012. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:27:09,012][65744] Avg episode reward: [(0, '3516.786')] +[2023-03-11 19:27:09,893][66031] Updated weights for policy 0, policy_version 91840 (0.0004) +[2023-03-11 19:27:13,829][66031] Updated weights for policy 0, policy_version 91920 (0.0004) +[2023-03-11 19:27:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9967.0, 300 sec: 10191.4). Total num frames: 47063040. Throughput: 0: 10012.4. Samples: 47038472. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:27:14,012][65744] Avg episode reward: [(0, '3486.570')] +[2023-03-11 19:27:17,744][66031] Updated weights for policy 0, policy_version 92000 (0.0005) +[2023-03-11 19:27:19,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 47116288. Throughput: 0: 10119.5. Samples: 47100692. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:27:19,012][65744] Avg episode reward: [(0, '4087.451')] +[2023-03-11 19:27:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000092024_47116288.pth... +[2023-03-11 19:27:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000091424_46809088.pth +[2023-03-11 19:27:21,663][66031] Updated weights for policy 0, policy_version 92080 (0.0004) +[2023-03-11 19:27:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10177.5). Total num frames: 47165440. Throughput: 0: 10146.8. Samples: 47163304. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:27:24,012][65744] Avg episode reward: [(0, '3761.955')] +[2023-03-11 19:27:25,591][66031] Updated weights for policy 0, policy_version 92160 (0.0004) +[2023-03-11 19:27:29,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 47218688. Throughput: 0: 10165.3. Samples: 47194588. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:27:29,012][65744] Avg episode reward: [(0, '4370.434')] +[2023-03-11 19:27:29,607][66031] Updated weights for policy 0, policy_version 92240 (0.0004) +[2023-03-11 19:27:33,919][66031] Updated weights for policy 0, policy_version 92320 (0.0006) +[2023-03-11 19:27:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 47267840. Throughput: 0: 10154.5. Samples: 47253816. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:27:34,012][65744] Avg episode reward: [(0, '3935.414')] +[2023-03-11 19:27:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000092320_47267840.pth... +[2023-03-11 19:27:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000091720_46960640.pth +[2023-03-11 19:27:38,206][66031] Updated weights for policy 0, policy_version 92400 (0.0005) +[2023-03-11 19:27:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 10149.8). Total num frames: 47312896. Throughput: 0: 10160.6. Samples: 47311436. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:27:39,012][65744] Avg episode reward: [(0, '4326.588')] +[2023-03-11 19:27:42,273][66031] Updated weights for policy 0, policy_version 92480 (0.0005) +[2023-03-11 19:27:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10149.8). Total num frames: 47366144. Throughput: 0: 10192.2. Samples: 47341528. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:27:44,012][65744] Avg episode reward: [(0, '4210.693')] +[2023-03-11 19:27:46,241][66031] Updated weights for policy 0, policy_version 92560 (0.0004) +[2023-03-11 19:27:49,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10135.9). Total num frames: 47415296. Throughput: 0: 10195.9. Samples: 47403080. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:27:49,012][65744] Avg episode reward: [(0, '4509.917')] +[2023-03-11 19:27:49,018][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000092616_47419392.pth... +[2023-03-11 19:27:49,020][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000092024_47116288.pth +[2023-03-11 19:27:50,207][66031] Updated weights for policy 0, policy_version 92640 (0.0005) +[2023-03-11 19:27:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 47468544. Throughput: 0: 10183.1. Samples: 47465252. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:27:54,012][65744] Avg episode reward: [(0, '4146.534')] +[2023-03-11 19:27:54,180][66031] Updated weights for policy 0, policy_version 92720 (0.0005) +[2023-03-11 19:27:58,528][66031] Updated weights for policy 0, policy_version 92800 (0.0004) +[2023-03-11 19:27:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 47517696. Throughput: 0: 10111.3. Samples: 47493480. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:27:59,012][65744] Avg episode reward: [(0, '4305.477')] +[2023-03-11 19:28:02,624][66031] Updated weights for policy 0, policy_version 92880 (0.0005) +[2023-03-11 19:28:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10122.0). Total num frames: 47566848. Throughput: 0: 10054.1. Samples: 47553128. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 19:28:04,012][65744] Avg episode reward: [(0, '4295.691')] +[2023-03-11 19:28:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000092904_47566848.pth... +[2023-03-11 19:28:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000092320_47267840.pth +[2023-03-11 19:28:06,941][66031] Updated weights for policy 0, policy_version 92960 (0.0006) +[2023-03-11 19:28:09,012][65744] Fps is (10 sec: 9420.9, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 47611904. Throughput: 0: 9939.9. Samples: 47610600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:28:09,012][65744] Avg episode reward: [(0, '3908.744')] +[2023-03-11 19:28:11,185][66031] Updated weights for policy 0, policy_version 93040 (0.0005) +[2023-03-11 19:28:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 47665152. Throughput: 0: 9884.0. Samples: 47639368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:28:14,012][65744] Avg episode reward: [(0, '4022.744')] +[2023-03-11 19:28:15,208][66031] Updated weights for policy 0, policy_version 93120 (0.0004) +[2023-03-11 19:28:19,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 10094.2). Total num frames: 47714304. Throughput: 0: 9937.1. Samples: 47700984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:28:19,012][65744] Avg episode reward: [(0, '4316.774')] +[2023-03-11 19:28:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000093192_47714304.pth... +[2023-03-11 19:28:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000092616_47419392.pth +[2023-03-11 19:28:19,118][66031] Updated weights for policy 0, policy_version 93200 (0.0004) +[2023-03-11 19:28:23,136][66031] Updated weights for policy 0, policy_version 93280 (0.0005) +[2023-03-11 19:28:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 47767552. Throughput: 0: 10027.7. Samples: 47762684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:28:24,012][65744] Avg episode reward: [(0, '4186.909')] +[2023-03-11 19:28:27,263][66031] Updated weights for policy 0, policy_version 93360 (0.0004) +[2023-03-11 19:28:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10080.3). Total num frames: 47816704. Throughput: 0: 10015.6. Samples: 47792232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:28:29,012][65744] Avg episode reward: [(0, '4354.992')] +[2023-03-11 19:28:31,445][66031] Updated weights for policy 0, policy_version 93440 (0.0005) +[2023-03-11 19:28:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 10080.3). Total num frames: 47865856. Throughput: 0: 9953.3. Samples: 47850980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:28:34,012][65744] Avg episode reward: [(0, '4373.296')] +[2023-03-11 19:28:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000093488_47865856.pth... +[2023-03-11 19:28:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000092904_47566848.pth +[2023-03-11 19:28:35,679][66031] Updated weights for policy 0, policy_version 93520 (0.0005) +[2023-03-11 19:28:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 10052.6). Total num frames: 47910912. Throughput: 0: 9848.5. Samples: 47908436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:28:39,012][65744] Avg episode reward: [(0, '4524.159')] +[2023-03-11 19:28:39,995][66031] Updated weights for policy 0, policy_version 93600 (0.0005) +[2023-03-11 19:28:44,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 47960064. Throughput: 0: 9861.8. Samples: 47937260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:28:44,012][65744] Avg episode reward: [(0, '4521.386')] +[2023-03-11 19:28:44,270][66031] Updated weights for policy 0, policy_version 93680 (0.0005) +[2023-03-11 19:28:48,448][66031] Updated weights for policy 0, policy_version 93760 (0.0005) +[2023-03-11 19:28:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10024.8). Total num frames: 48009216. Throughput: 0: 9833.7. Samples: 47995644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:28:49,012][65744] Avg episode reward: [(0, '4350.907')] +[2023-03-11 19:28:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000093768_48009216.pth... +[2023-03-11 19:28:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000093192_47714304.pth +[2023-03-11 19:28:52,660][66031] Updated weights for policy 0, policy_version 93840 (0.0005) +[2023-03-11 19:28:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10024.8). Total num frames: 48058368. Throughput: 0: 9859.3. Samples: 48054268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:28:54,012][65744] Avg episode reward: [(0, '4418.669')] +[2023-03-11 19:28:56,948][66031] Updated weights for policy 0, policy_version 93920 (0.0005) +[2023-03-11 19:28:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 10010.9). Total num frames: 48103424. Throughput: 0: 9856.6. Samples: 48082916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:28:59,012][65744] Avg episode reward: [(0, '4169.817')] +[2023-03-11 19:29:01,232][66031] Updated weights for policy 0, policy_version 94000 (0.0005) +[2023-03-11 19:29:04,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9997.0). Total num frames: 48152576. Throughput: 0: 9757.1. Samples: 48140052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:29:04,012][65744] Avg episode reward: [(0, '4353.062')] +[2023-03-11 19:29:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000094048_48152576.pth... +[2023-03-11 19:29:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000093488_47865856.pth +[2023-03-11 19:29:05,518][66031] Updated weights for policy 0, policy_version 94080 (0.0005) +[2023-03-11 19:29:09,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9983.1). Total num frames: 48201728. Throughput: 0: 9650.8. Samples: 48196968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:29:09,012][65744] Avg episode reward: [(0, '4080.186')] +[2023-03-11 19:29:09,856][66031] Updated weights for policy 0, policy_version 94160 (0.0005) +[2023-03-11 19:29:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9955.4). Total num frames: 48246784. Throughput: 0: 9625.4. Samples: 48225376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:29:14,012][65744] Avg episode reward: [(0, '3758.198')] +[2023-03-11 19:29:14,173][66031] Updated weights for policy 0, policy_version 94240 (0.0005) +[2023-03-11 19:29:18,533][66031] Updated weights for policy 0, policy_version 94320 (0.0005) +[2023-03-11 19:29:19,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9941.5). Total num frames: 48295936. Throughput: 0: 9571.1. Samples: 48281680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:29:19,012][65744] Avg episode reward: [(0, '3583.262')] +[2023-03-11 19:29:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000094328_48295936.pth... +[2023-03-11 19:29:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000093768_48009216.pth +[2023-03-11 19:29:22,769][66031] Updated weights for policy 0, policy_version 94400 (0.0005) +[2023-03-11 19:29:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9927.6). Total num frames: 48340992. Throughput: 0: 9576.7. Samples: 48339388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:29:24,012][65744] Avg episode reward: [(0, '3866.245')] +[2023-03-11 19:29:27,172][66031] Updated weights for policy 0, policy_version 94480 (0.0005) +[2023-03-11 19:29:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9913.7). Total num frames: 48390144. Throughput: 0: 9548.2. Samples: 48366928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:29:29,012][65744] Avg episode reward: [(0, '4266.256')] +[2023-03-11 19:29:31,400][66031] Updated weights for policy 0, policy_version 94560 (0.0005) +[2023-03-11 19:29:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9913.7). Total num frames: 48439296. Throughput: 0: 9529.3. Samples: 48424464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:29:34,012][65744] Avg episode reward: [(0, '4184.939')] +[2023-03-11 19:29:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000094608_48439296.pth... +[2023-03-11 19:29:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000094048_48152576.pth +[2023-03-11 19:29:35,780][66031] Updated weights for policy 0, policy_version 94640 (0.0005) +[2023-03-11 19:29:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9899.8). Total num frames: 48484352. Throughput: 0: 9477.3. Samples: 48480748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:29:39,012][65744] Avg episode reward: [(0, '4312.316')] +[2023-03-11 19:29:40,151][66031] Updated weights for policy 0, policy_version 94720 (0.0005) +[2023-03-11 19:29:44,012][65744] Fps is (10 sec: 9011.3, 60 sec: 9489.1, 300 sec: 9885.9). Total num frames: 48529408. Throughput: 0: 9467.3. Samples: 48508944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:29:44,012][65744] Avg episode reward: [(0, '3749.455')] +[2023-03-11 19:29:44,507][66031] Updated weights for policy 0, policy_version 94800 (0.0005) +[2023-03-11 19:29:48,791][66031] Updated weights for policy 0, policy_version 94880 (0.0005) +[2023-03-11 19:29:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9885.9). Total num frames: 48578560. Throughput: 0: 9466.9. Samples: 48566060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:29:49,012][65744] Avg episode reward: [(0, '2886.265')] +[2023-03-11 19:29:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000094880_48578560.pth... +[2023-03-11 19:29:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000094328_48295936.pth +[2023-03-11 19:29:53,080][66031] Updated weights for policy 0, policy_version 94960 (0.0005) +[2023-03-11 19:29:54,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9899.8). Total num frames: 48627712. Throughput: 0: 9481.1. Samples: 48623616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:29:54,012][65744] Avg episode reward: [(0, '3796.296')] +[2023-03-11 19:29:57,307][66031] Updated weights for policy 0, policy_version 95040 (0.0005) +[2023-03-11 19:29:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9885.9). Total num frames: 48672768. Throughput: 0: 9488.4. Samples: 48652352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:29:59,012][65744] Avg episode reward: [(0, '3989.823')] +[2023-03-11 19:30:01,659][66031] Updated weights for policy 0, policy_version 95120 (0.0005) +[2023-03-11 19:30:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9885.9). Total num frames: 48721920. Throughput: 0: 9508.9. Samples: 48709580. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:30:04,012][65744] Avg episode reward: [(0, '3965.540')] +[2023-03-11 19:30:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000095160_48721920.pth... +[2023-03-11 19:30:04,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000094608_48439296.pth +[2023-03-11 19:30:05,956][66031] Updated weights for policy 0, policy_version 95200 (0.0005) +[2023-03-11 19:30:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9885.9). Total num frames: 48771072. Throughput: 0: 9501.7. Samples: 48766964. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:30:09,012][65744] Avg episode reward: [(0, '4026.382')] +[2023-03-11 19:30:10,227][66031] Updated weights for policy 0, policy_version 95280 (0.0005) +[2023-03-11 19:30:14,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9858.2). Total num frames: 48816128. Throughput: 0: 9518.2. Samples: 48795248. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:30:14,012][65744] Avg episode reward: [(0, '4122.931')] +[2023-03-11 19:30:14,569][66031] Updated weights for policy 0, policy_version 95360 (0.0005) +[2023-03-11 19:30:18,888][66031] Updated weights for policy 0, policy_version 95440 (0.0005) +[2023-03-11 19:30:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9844.3). Total num frames: 48865280. Throughput: 0: 9503.7. Samples: 48852132. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:30:19,012][65744] Avg episode reward: [(0, '4181.293')] +[2023-03-11 19:30:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000095440_48865280.pth... +[2023-03-11 19:30:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000094880_48578560.pth +[2023-03-11 19:30:23,120][66031] Updated weights for policy 0, policy_version 95520 (0.0005) +[2023-03-11 19:30:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9844.3). Total num frames: 48914432. Throughput: 0: 9532.4. Samples: 48909704. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:30:24,012][65744] Avg episode reward: [(0, '4107.107')] +[2023-03-11 19:30:27,456][66031] Updated weights for policy 0, policy_version 95600 (0.0005) +[2023-03-11 19:30:29,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9830.4). Total num frames: 48959488. Throughput: 0: 9538.5. Samples: 48938176. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:30:29,012][65744] Avg episode reward: [(0, '4336.948')] +[2023-03-11 19:30:31,665][66031] Updated weights for policy 0, policy_version 95680 (0.0005) +[2023-03-11 19:30:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9830.4). Total num frames: 49008640. Throughput: 0: 9558.9. Samples: 48996212. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:30:34,012][65744] Avg episode reward: [(0, '3806.625')] +[2023-03-11 19:30:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000095720_49008640.pth... +[2023-03-11 19:30:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000095160_48721920.pth +[2023-03-11 19:30:35,868][66031] Updated weights for policy 0, policy_version 95760 (0.0005) +[2023-03-11 19:30:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9830.4). Total num frames: 49057792. Throughput: 0: 9558.8. Samples: 49053760. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:30:39,012][65744] Avg episode reward: [(0, '3590.427')] +[2023-03-11 19:30:40,189][66031] Updated weights for policy 0, policy_version 95840 (0.0005) +[2023-03-11 19:30:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9802.6). Total num frames: 49102848. Throughput: 0: 9555.9. Samples: 49082368. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:30:44,012][65744] Avg episode reward: [(0, '4144.977')] +[2023-03-11 19:30:44,447][66031] Updated weights for policy 0, policy_version 95920 (0.0005) +[2023-03-11 19:30:48,636][66031] Updated weights for policy 0, policy_version 96000 (0.0005) +[2023-03-11 19:30:49,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9802.6). Total num frames: 49152000. Throughput: 0: 9567.1. Samples: 49140100. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:30:49,012][65744] Avg episode reward: [(0, '4355.673')] +[2023-03-11 19:30:49,072][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000096008_49156096.pth... +[2023-03-11 19:30:49,073][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000095440_48865280.pth +[2023-03-11 19:30:52,919][66031] Updated weights for policy 0, policy_version 96080 (0.0005) +[2023-03-11 19:30:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9802.6). Total num frames: 49201152. Throughput: 0: 9574.2. Samples: 49197804. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:30:54,023][65744] Avg episode reward: [(0, '3889.192')] +[2023-03-11 19:30:57,248][66031] Updated weights for policy 0, policy_version 96160 (0.0005) +[2023-03-11 19:30:59,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9788.7). Total num frames: 49250304. Throughput: 0: 9572.6. Samples: 49226016. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:30:59,012][65744] Avg episode reward: [(0, '4232.475')] +[2023-03-11 19:31:01,419][66031] Updated weights for policy 0, policy_version 96240 (0.0005) +[2023-03-11 19:31:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9788.7). Total num frames: 49299456. Throughput: 0: 9612.4. Samples: 49284688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:31:04,012][65744] Avg episode reward: [(0, '4117.931')] +[2023-03-11 19:31:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000096288_49299456.pth... +[2023-03-11 19:31:04,027][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000095720_49008640.pth +[2023-03-11 19:31:05,631][66031] Updated weights for policy 0, policy_version 96320 (0.0005) +[2023-03-11 19:31:09,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9761.0). Total num frames: 49344512. Throughput: 0: 9634.7. Samples: 49343264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:31:09,012][65744] Avg episode reward: [(0, '4349.262')] +[2023-03-11 19:31:09,896][66031] Updated weights for policy 0, policy_version 96400 (0.0005) +[2023-03-11 19:31:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 49393664. Throughput: 0: 9642.4. Samples: 49372084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:31:14,012][65744] Avg episode reward: [(0, '4405.066')] +[2023-03-11 19:31:14,138][66031] Updated weights for policy 0, policy_version 96480 (0.0005) +[2023-03-11 19:31:18,429][66031] Updated weights for policy 0, policy_version 96560 (0.0005) +[2023-03-11 19:31:19,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 49442816. Throughput: 0: 9633.1. Samples: 49429700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:31:19,012][65744] Avg episode reward: [(0, '4305.394')] +[2023-03-11 19:31:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000096568_49442816.pth... +[2023-03-11 19:31:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000096008_49156096.pth +[2023-03-11 19:31:22,703][66031] Updated weights for policy 0, policy_version 96640 (0.0005) +[2023-03-11 19:31:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 49491968. Throughput: 0: 9634.2. Samples: 49487300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:31:24,012][65744] Avg episode reward: [(0, '4317.143')] +[2023-03-11 19:31:26,959][66031] Updated weights for policy 0, policy_version 96720 (0.0005) +[2023-03-11 19:31:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 49537024. Throughput: 0: 9632.1. Samples: 49515812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:31:29,023][65744] Avg episode reward: [(0, '4313.666')] +[2023-03-11 19:31:31,192][66031] Updated weights for policy 0, policy_version 96800 (0.0005) +[2023-03-11 19:31:34,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9625.6, 300 sec: 9747.1). Total num frames: 49586176. Throughput: 0: 9640.0. Samples: 49573900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:31:34,012][65744] Avg episode reward: [(0, '4113.971')] +[2023-03-11 19:31:34,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000096848_49586176.pth... +[2023-03-11 19:31:34,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000096288_49299456.pth +[2023-03-11 19:31:35,350][66031] Updated weights for policy 0, policy_version 96880 (0.0005) +[2023-03-11 19:31:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9761.0). Total num frames: 49639424. Throughput: 0: 9693.3. Samples: 49634004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:31:39,012][65744] Avg episode reward: [(0, '3903.800')] +[2023-03-11 19:31:39,396][66031] Updated weights for policy 0, policy_version 96960 (0.0005) +[2023-03-11 19:31:43,418][66031] Updated weights for policy 0, policy_version 97040 (0.0005) +[2023-03-11 19:31:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9762.1, 300 sec: 9761.0). Total num frames: 49688576. Throughput: 0: 9742.4. Samples: 49664424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:31:44,012][65744] Avg episode reward: [(0, '3685.534')] +[2023-03-11 19:31:47,394][66031] Updated weights for policy 0, policy_version 97120 (0.0004) +[2023-03-11 19:31:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9774.9). Total num frames: 49741824. Throughput: 0: 9800.5. Samples: 49725712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:31:49,012][65744] Avg episode reward: [(0, '3797.768')] +[2023-03-11 19:31:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000097152_49741824.pth... +[2023-03-11 19:31:49,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000096568_49442816.pth +[2023-03-11 19:31:51,467][66031] Updated weights for policy 0, policy_version 97200 (0.0004) +[2023-03-11 19:31:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9761.0). Total num frames: 49786880. Throughput: 0: 9822.3. Samples: 49785268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:31:54,012][65744] Avg episode reward: [(0, '3722.299')] +[2023-03-11 19:31:55,704][66031] Updated weights for policy 0, policy_version 97280 (0.0005) +[2023-03-11 19:31:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9761.0). Total num frames: 49840128. Throughput: 0: 9843.7. Samples: 49815052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:31:59,012][65744] Avg episode reward: [(0, '4077.657')] +[2023-03-11 19:31:59,704][66031] Updated weights for policy 0, policy_version 97360 (0.0005) +[2023-03-11 19:32:03,657][66031] Updated weights for policy 0, policy_version 97440 (0.0004) +[2023-03-11 19:32:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9761.0). Total num frames: 49889280. Throughput: 0: 9927.4. Samples: 49876432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:32:04,012][65744] Avg episode reward: [(0, '4156.878')] +[2023-03-11 19:32:04,042][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000097448_49893376.pth... +[2023-03-11 19:32:04,044][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000096848_49586176.pth +[2023-03-11 19:32:07,624][66031] Updated weights for policy 0, policy_version 97520 (0.0004) +[2023-03-11 19:32:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9761.0). Total num frames: 49942528. Throughput: 0: 10030.2. Samples: 49938660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:32:09,012][65744] Avg episode reward: [(0, '4291.466')] +[2023-03-11 19:32:11,607][66031] Updated weights for policy 0, policy_version 97600 (0.0004) +[2023-03-11 19:32:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 9747.1). Total num frames: 49991680. Throughput: 0: 10087.5. Samples: 49969748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:32:14,012][65744] Avg episode reward: [(0, '3847.603')] +[2023-03-11 19:32:15,635][66031] Updated weights for policy 0, policy_version 97680 (0.0004) +[2023-03-11 19:32:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9761.0). Total num frames: 50044928. Throughput: 0: 10164.3. Samples: 50031292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:32:19,012][65744] Avg episode reward: [(0, '3924.831')] +[2023-03-11 19:32:19,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000097744_50044928.pth... +[2023-03-11 19:32:19,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000097152_49741824.pth +[2023-03-11 19:32:19,629][66031] Updated weights for policy 0, policy_version 97760 (0.0005) +[2023-03-11 19:32:23,644][66031] Updated weights for policy 0, policy_version 97840 (0.0004) +[2023-03-11 19:32:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9747.1). Total num frames: 50094080. Throughput: 0: 10177.3. Samples: 50091980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:32:24,012][65744] Avg episode reward: [(0, '4106.983')] +[2023-03-11 19:32:27,584][66031] Updated weights for policy 0, policy_version 97920 (0.0004) +[2023-03-11 19:32:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9761.0). Total num frames: 50147328. Throughput: 0: 10186.7. Samples: 50122824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:32:29,012][65744] Avg episode reward: [(0, '4050.737')] +[2023-03-11 19:32:31,530][66031] Updated weights for policy 0, policy_version 98000 (0.0005) +[2023-03-11 19:32:34,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10240.0, 300 sec: 9788.7). Total num frames: 50200576. Throughput: 0: 10209.1. Samples: 50185120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:32:34,012][65744] Avg episode reward: [(0, '4341.156')] +[2023-03-11 19:32:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000098048_50200576.pth... +[2023-03-11 19:32:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000097448_49893376.pth +[2023-03-11 19:32:35,482][66031] Updated weights for policy 0, policy_version 98080 (0.0005) +[2023-03-11 19:32:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9774.9). Total num frames: 50249728. Throughput: 0: 10266.8. Samples: 50247276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:32:39,023][65744] Avg episode reward: [(0, '3869.009')] +[2023-03-11 19:32:39,486][66031] Updated weights for policy 0, policy_version 98160 (0.0004) +[2023-03-11 19:32:43,441][66031] Updated weights for policy 0, policy_version 98240 (0.0005) +[2023-03-11 19:32:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9788.7). Total num frames: 50302976. Throughput: 0: 10295.1. Samples: 50278332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:32:44,012][65744] Avg episode reward: [(0, '4011.690')] +[2023-03-11 19:32:47,342][66031] Updated weights for policy 0, policy_version 98320 (0.0004) +[2023-03-11 19:32:49,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 9788.7). Total num frames: 50356224. Throughput: 0: 10318.7. Samples: 50340776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:32:49,012][65744] Avg episode reward: [(0, '3784.860')] +[2023-03-11 19:32:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000098352_50356224.pth... +[2023-03-11 19:32:49,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000097744_50044928.pth +[2023-03-11 19:32:51,367][66031] Updated weights for policy 0, policy_version 98400 (0.0005) +[2023-03-11 19:32:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9788.7). Total num frames: 50405376. Throughput: 0: 10303.8. Samples: 50402332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:32:54,012][65744] Avg episode reward: [(0, '4031.696')] +[2023-03-11 19:32:55,321][66031] Updated weights for policy 0, policy_version 98480 (0.0004) +[2023-03-11 19:32:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9802.6). Total num frames: 50458624. Throughput: 0: 10313.8. Samples: 50433872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:32:59,012][65744] Avg episode reward: [(0, '4258.360')] +[2023-03-11 19:32:59,262][66031] Updated weights for policy 0, policy_version 98560 (0.0004) +[2023-03-11 19:33:03,280][66031] Updated weights for policy 0, policy_version 98640 (0.0005) +[2023-03-11 19:33:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9816.5). Total num frames: 50507776. Throughput: 0: 10316.9. Samples: 50495552. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:33:04,012][65744] Avg episode reward: [(0, '4167.737')] +[2023-03-11 19:33:04,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000098648_50507776.pth... +[2023-03-11 19:33:04,016][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000098048_50200576.pth +[2023-03-11 19:33:07,240][66031] Updated weights for policy 0, policy_version 98720 (0.0005) +[2023-03-11 19:33:09,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 9816.5). Total num frames: 50561024. Throughput: 0: 10335.7. Samples: 50557084. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:33:09,012][65744] Avg episode reward: [(0, '4314.228')] +[2023-03-11 19:33:11,144][66031] Updated weights for policy 0, policy_version 98800 (0.0004) +[2023-03-11 19:33:14,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 9830.4). Total num frames: 50614272. Throughput: 0: 10355.2. Samples: 50588808. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:33:14,012][65744] Avg episode reward: [(0, '4105.038')] +[2023-03-11 19:33:15,121][66031] Updated weights for policy 0, policy_version 98880 (0.0004) +[2023-03-11 19:33:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9816.5). Total num frames: 50663424. Throughput: 0: 10352.0. Samples: 50650960. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:33:19,012][65744] Avg episode reward: [(0, '4092.372')] +[2023-03-11 19:33:19,034][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000098960_50667520.pth... +[2023-03-11 19:33:19,034][66031] Updated weights for policy 0, policy_version 98960 (0.0004) +[2023-03-11 19:33:19,036][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000098352_50356224.pth +[2023-03-11 19:33:22,936][66031] Updated weights for policy 0, policy_version 99040 (0.0005) +[2023-03-11 19:33:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 9830.4). Total num frames: 50716672. Throughput: 0: 10367.2. Samples: 50713800. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:33:24,012][65744] Avg episode reward: [(0, '4261.812')] +[2023-03-11 19:33:26,863][66031] Updated weights for policy 0, policy_version 99120 (0.0004) +[2023-03-11 19:33:29,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 9844.3). Total num frames: 50769920. Throughput: 0: 10378.1. Samples: 50745344. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:33:29,012][65744] Avg episode reward: [(0, '3948.529')] +[2023-03-11 19:33:30,817][66031] Updated weights for policy 0, policy_version 99200 (0.0004) +[2023-03-11 19:33:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9858.2). Total num frames: 50819072. Throughput: 0: 10357.2. Samples: 50806848. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:33:34,012][65744] Avg episode reward: [(0, '4032.374')] +[2023-03-11 19:33:34,036][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000099264_50823168.pth... +[2023-03-11 19:33:34,037][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000098648_50507776.pth +[2023-03-11 19:33:34,826][66031] Updated weights for policy 0, policy_version 99280 (0.0005) +[2023-03-11 19:33:38,758][66031] Updated weights for policy 0, policy_version 99360 (0.0004) +[2023-03-11 19:33:39,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 9872.1). Total num frames: 50872320. Throughput: 0: 10371.0. Samples: 50869028. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:33:39,012][65744] Avg episode reward: [(0, '4426.919')] +[2023-03-11 19:33:42,690][66031] Updated weights for policy 0, policy_version 99440 (0.0004) +[2023-03-11 19:33:44,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 9885.9). Total num frames: 50925568. Throughput: 0: 10363.0. Samples: 50900208. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:33:44,012][65744] Avg episode reward: [(0, '4290.580')] +[2023-03-11 19:33:46,597][66031] Updated weights for policy 0, policy_version 99520 (0.0004) +[2023-03-11 19:33:49,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 9899.8). Total num frames: 50978816. Throughput: 0: 10389.8. Samples: 50963092. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:33:49,012][65744] Avg episode reward: [(0, '4164.630')] +[2023-03-11 19:33:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000099568_50978816.pth... +[2023-03-11 19:33:49,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000098960_50667520.pth +[2023-03-11 19:33:50,505][66031] Updated weights for policy 0, policy_version 99600 (0.0004) +[2023-03-11 19:33:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 9913.7). Total num frames: 51027968. Throughput: 0: 10409.9. Samples: 51025528. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:33:54,012][65744] Avg episode reward: [(0, '4212.933')] +[2023-03-11 19:33:54,464][66031] Updated weights for policy 0, policy_version 99680 (0.0004) +[2023-03-11 19:33:58,338][66031] Updated weights for policy 0, policy_version 99760 (0.0004) +[2023-03-11 19:33:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 9927.6). Total num frames: 51081216. Throughput: 0: 10397.9. Samples: 51056712. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 19:33:59,012][65744] Avg episode reward: [(0, '4431.352')] +[2023-03-11 19:34:02,254][66031] Updated weights for policy 0, policy_version 99840 (0.0004) +[2023-03-11 19:34:04,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 9941.5). Total num frames: 51134464. Throughput: 0: 10420.4. Samples: 51119880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:34:04,012][65744] Avg episode reward: [(0, '4367.690')] +[2023-03-11 19:34:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000099872_51134464.pth... +[2023-03-11 19:34:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000099264_50823168.pth +[2023-03-11 19:34:06,383][66031] Updated weights for policy 0, policy_version 99920 (0.0005) +[2023-03-11 19:34:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 9955.4). Total num frames: 51183616. Throughput: 0: 10349.2. Samples: 51179516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:34:09,012][65744] Avg episode reward: [(0, '4327.912')] +[2023-03-11 19:34:10,616][66031] Updated weights for policy 0, policy_version 100000 (0.0005) +[2023-03-11 19:34:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10240.0, 300 sec: 9941.5). Total num frames: 51228672. Throughput: 0: 10285.7. Samples: 51208200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:34:14,012][65744] Avg episode reward: [(0, '4264.802')] +[2023-03-11 19:34:14,857][66031] Updated weights for policy 0, policy_version 100080 (0.0005) +[2023-03-11 19:34:19,011][66031] Updated weights for policy 0, policy_version 100160 (0.0005) +[2023-03-11 19:34:19,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 9969.3). Total num frames: 51281920. Throughput: 0: 10199.7. Samples: 51265836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:34:19,012][65744] Avg episode reward: [(0, '4291.990')] +[2023-03-11 19:34:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000100160_51281920.pth... +[2023-03-11 19:34:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000099568_50978816.pth +[2023-03-11 19:34:22,884][66031] Updated weights for policy 0, policy_version 100240 (0.0003) +[2023-03-11 19:34:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9969.2). Total num frames: 51331072. Throughput: 0: 10213.8. Samples: 51328648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:34:24,012][65744] Avg episode reward: [(0, '4436.319')] +[2023-03-11 19:34:26,818][66031] Updated weights for policy 0, policy_version 100320 (0.0004) +[2023-03-11 19:34:29,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 9983.1). Total num frames: 51384320. Throughput: 0: 10216.4. Samples: 51359944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:34:29,012][65744] Avg episode reward: [(0, '4020.550')] +[2023-03-11 19:34:30,761][66031] Updated weights for policy 0, policy_version 100400 (0.0004) +[2023-03-11 19:34:34,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10010.9). Total num frames: 51437568. Throughput: 0: 10204.2. Samples: 51422280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:34:34,012][65744] Avg episode reward: [(0, '3962.777')] +[2023-03-11 19:34:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000100464_51437568.pth... +[2023-03-11 19:34:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000099872_51134464.pth +[2023-03-11 19:34:34,674][66031] Updated weights for policy 0, policy_version 100480 (0.0004) +[2023-03-11 19:34:38,572][66031] Updated weights for policy 0, policy_version 100560 (0.0004) +[2023-03-11 19:34:39,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10038.7). Total num frames: 51490816. Throughput: 0: 10222.2. Samples: 51485528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:34:39,012][65744] Avg episode reward: [(0, '3774.840')] +[2023-03-11 19:34:42,460][66031] Updated weights for policy 0, policy_version 100640 (0.0004) +[2023-03-11 19:34:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10038.7). Total num frames: 51539968. Throughput: 0: 10233.9. Samples: 51517236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:34:44,012][65744] Avg episode reward: [(0, '3683.088')] +[2023-03-11 19:34:46,382][66031] Updated weights for policy 0, policy_version 100720 (0.0004) +[2023-03-11 19:34:49,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10052.6). Total num frames: 51593216. Throughput: 0: 10228.4. Samples: 51580160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:34:49,012][65744] Avg episode reward: [(0, '3783.873')] +[2023-03-11 19:34:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000100768_51593216.pth... +[2023-03-11 19:34:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000100160_51281920.pth +[2023-03-11 19:34:50,262][66031] Updated weights for policy 0, policy_version 100800 (0.0004) +[2023-03-11 19:34:54,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10080.3). Total num frames: 51646464. Throughput: 0: 10295.7. Samples: 51642824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:34:54,012][65744] Avg episode reward: [(0, '4112.251')] +[2023-03-11 19:34:54,144][66031] Updated weights for policy 0, policy_version 100880 (0.0004) +[2023-03-11 19:34:58,070][66031] Updated weights for policy 0, policy_version 100960 (0.0004) +[2023-03-11 19:34:59,012][65744] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10094.2). Total num frames: 51699712. Throughput: 0: 10363.5. Samples: 51674556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:34:59,012][65744] Avg episode reward: [(0, '3941.355')] +[2023-03-11 19:35:01,983][66031] Updated weights for policy 0, policy_version 101040 (0.0004) +[2023-03-11 19:35:04,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10108.1). Total num frames: 51752960. Throughput: 0: 10476.7. Samples: 51737288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:35:04,012][65744] Avg episode reward: [(0, '3630.731')] +[2023-03-11 19:35:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000101080_51752960.pth... +[2023-03-11 19:35:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000100464_51437568.pth +[2023-03-11 19:35:05,864][66031] Updated weights for policy 0, policy_version 101120 (0.0004) +[2023-03-11 19:35:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10135.9). Total num frames: 51806208. Throughput: 0: 10479.8. Samples: 51800240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:35:09,012][65744] Avg episode reward: [(0, '3942.093')] +[2023-03-11 19:35:09,750][66031] Updated weights for policy 0, policy_version 101200 (0.0004) +[2023-03-11 19:35:13,654][66031] Updated weights for policy 0, policy_version 101280 (0.0004) +[2023-03-11 19:35:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10135.9). Total num frames: 51855360. Throughput: 0: 10488.4. Samples: 51831924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:35:14,012][65744] Avg episode reward: [(0, '3654.594')] +[2023-03-11 19:35:17,598][66031] Updated weights for policy 0, policy_version 101360 (0.0004) +[2023-03-11 19:35:19,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10149.7). Total num frames: 51908608. Throughput: 0: 10505.9. Samples: 51895044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:35:19,012][65744] Avg episode reward: [(0, '4092.427')] +[2023-03-11 19:35:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000101384_51908608.pth... +[2023-03-11 19:35:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000100768_51593216.pth +[2023-03-11 19:35:21,525][66031] Updated weights for policy 0, policy_version 101440 (0.0004) +[2023-03-11 19:35:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10177.5). Total num frames: 51961856. Throughput: 0: 10494.3. Samples: 51957772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:35:24,012][65744] Avg episode reward: [(0, '4024.679')] +[2023-03-11 19:35:25,480][66031] Updated weights for policy 0, policy_version 101520 (0.0004) +[2023-03-11 19:35:29,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10191.4). Total num frames: 52015104. Throughput: 0: 10458.8. Samples: 51987884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:35:29,012][65744] Avg episode reward: [(0, '3358.601')] +[2023-03-11 19:35:29,380][66031] Updated weights for policy 0, policy_version 101600 (0.0004) +[2023-03-11 19:35:33,339][66031] Updated weights for policy 0, policy_version 101680 (0.0005) +[2023-03-11 19:35:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10191.4). Total num frames: 52064256. Throughput: 0: 10465.8. Samples: 52051120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:35:34,012][65744] Avg episode reward: [(0, '3571.370')] +[2023-03-11 19:35:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000101688_52064256.pth... +[2023-03-11 19:35:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000101080_51752960.pth +[2023-03-11 19:35:37,257][66031] Updated weights for policy 0, policy_version 101760 (0.0004) +[2023-03-11 19:35:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10219.2). Total num frames: 52117504. Throughput: 0: 10459.0. Samples: 52113480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:35:39,012][65744] Avg episode reward: [(0, '4157.176')] +[2023-03-11 19:35:41,133][66031] Updated weights for policy 0, policy_version 101840 (0.0003) +[2023-03-11 19:35:44,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10233.1). Total num frames: 52170752. Throughput: 0: 10465.5. Samples: 52145504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:35:44,012][65744] Avg episode reward: [(0, '3853.815')] +[2023-03-11 19:35:45,011][66031] Updated weights for policy 0, policy_version 101920 (0.0004) +[2023-03-11 19:35:48,936][66031] Updated weights for policy 0, policy_version 102000 (0.0005) +[2023-03-11 19:35:49,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10246.9). Total num frames: 52224000. Throughput: 0: 10453.3. Samples: 52207688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:35:49,012][65744] Avg episode reward: [(0, '3167.270')] +[2023-03-11 19:35:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000102000_52224000.pth... +[2023-03-11 19:35:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000101384_51908608.pth +[2023-03-11 19:35:52,852][66031] Updated weights for policy 0, policy_version 102080 (0.0004) +[2023-03-11 19:35:54,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10246.9). Total num frames: 52273152. Throughput: 0: 10464.8. Samples: 52271156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:35:54,022][65744] Avg episode reward: [(0, '3459.809')] +[2023-03-11 19:35:56,745][66031] Updated weights for policy 0, policy_version 102160 (0.0004) +[2023-03-11 19:35:59,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10260.8). Total num frames: 52326400. Throughput: 0: 10456.3. Samples: 52302456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:35:59,022][65744] Avg episode reward: [(0, '3083.956')] +[2023-03-11 19:36:00,627][66031] Updated weights for policy 0, policy_version 102240 (0.0004) +[2023-03-11 19:36:04,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10288.6). Total num frames: 52379648. Throughput: 0: 10459.5. Samples: 52365720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:36:04,023][65744] Avg episode reward: [(0, '3706.169')] +[2023-03-11 19:36:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000102304_52379648.pth... +[2023-03-11 19:36:04,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000101688_52064256.pth +[2023-03-11 19:36:04,557][66031] Updated weights for policy 0, policy_version 102320 (0.0004) +[2023-03-11 19:36:08,488][66031] Updated weights for policy 0, policy_version 102400 (0.0005) +[2023-03-11 19:36:09,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10302.5). Total num frames: 52432896. Throughput: 0: 10459.9. Samples: 52428468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:36:09,012][65744] Avg episode reward: [(0, '2970.228')] +[2023-03-11 19:36:12,264][66031] Updated weights for policy 0, policy_version 102480 (0.0003) +[2023-03-11 19:36:14,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10316.4). Total num frames: 52486144. Throughput: 0: 10515.6. Samples: 52461084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:36:14,012][65744] Avg episode reward: [(0, '3200.945')] +[2023-03-11 19:36:16,097][66031] Updated weights for policy 0, policy_version 102560 (0.0004) +[2023-03-11 19:36:19,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10330.2). Total num frames: 52539392. Throughput: 0: 10529.6. Samples: 52524952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:36:19,012][65744] Avg episode reward: [(0, '3576.793')] +[2023-03-11 19:36:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000102616_52539392.pth... +[2023-03-11 19:36:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000102000_52224000.pth +[2023-03-11 19:36:19,997][66031] Updated weights for policy 0, policy_version 102640 (0.0004) +[2023-03-11 19:36:23,886][66031] Updated weights for policy 0, policy_version 102720 (0.0004) +[2023-03-11 19:36:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10358.0). Total num frames: 52592640. Throughput: 0: 10555.9. Samples: 52588496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:36:24,012][65744] Avg episode reward: [(0, '4253.476')] +[2023-03-11 19:36:27,779][66031] Updated weights for policy 0, policy_version 102800 (0.0004) +[2023-03-11 19:36:29,012][65744] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10371.9). Total num frames: 52645888. Throughput: 0: 10535.0. Samples: 52619580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:36:29,012][65744] Avg episode reward: [(0, '4241.943')] +[2023-03-11 19:36:31,720][66031] Updated weights for policy 0, policy_version 102880 (0.0004) +[2023-03-11 19:36:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10358.0). Total num frames: 52695040. Throughput: 0: 10552.5. Samples: 52682548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:36:34,012][65744] Avg episode reward: [(0, '3345.194')] +[2023-03-11 19:36:34,027][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000102928_52699136.pth... +[2023-03-11 19:36:34,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000102304_52379648.pth +[2023-03-11 19:36:35,612][66031] Updated weights for policy 0, policy_version 102960 (0.0004) +[2023-03-11 19:36:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10371.9). Total num frames: 52748288. Throughput: 0: 10513.3. Samples: 52744256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:36:39,012][65744] Avg episode reward: [(0, '3925.269')] +[2023-03-11 19:36:39,665][66031] Updated weights for policy 0, policy_version 103040 (0.0005) +[2023-03-11 19:36:43,860][66031] Updated weights for policy 0, policy_version 103120 (0.0005) +[2023-03-11 19:36:44,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10358.0). Total num frames: 52797440. Throughput: 0: 10466.6. Samples: 52773452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:36:44,012][65744] Avg episode reward: [(0, '3921.272')] +[2023-03-11 19:36:47,839][66031] Updated weights for policy 0, policy_version 103200 (0.0004) +[2023-03-11 19:36:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10385.8). Total num frames: 52850688. Throughput: 0: 10412.8. Samples: 52834296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:36:49,012][65744] Avg episode reward: [(0, '4015.872')] +[2023-03-11 19:36:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000103224_52850688.pth... +[2023-03-11 19:36:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000102616_52539392.pth +[2023-03-11 19:36:51,689][66031] Updated weights for policy 0, policy_version 103280 (0.0004) +[2023-03-11 19:36:54,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10371.9). Total num frames: 52899840. Throughput: 0: 10431.4. Samples: 52897880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:36:54,012][65744] Avg episode reward: [(0, '3939.141')] +[2023-03-11 19:36:55,600][66031] Updated weights for policy 0, policy_version 103360 (0.0004) +[2023-03-11 19:36:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10385.8). Total num frames: 52953088. Throughput: 0: 10389.8. Samples: 52928624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:36:59,012][65744] Avg episode reward: [(0, '3696.644')] +[2023-03-11 19:36:59,537][66031] Updated weights for policy 0, policy_version 103440 (0.0004) +[2023-03-11 19:37:03,430][66031] Updated weights for policy 0, policy_version 103520 (0.0005) +[2023-03-11 19:37:04,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10385.8). Total num frames: 53006336. Throughput: 0: 10376.9. Samples: 52991912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:37:04,012][65744] Avg episode reward: [(0, '3503.891')] +[2023-03-11 19:37:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000103528_53006336.pth... +[2023-03-11 19:37:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000102928_52699136.pth +[2023-03-11 19:37:07,365][66031] Updated weights for policy 0, policy_version 103600 (0.0005) +[2023-03-11 19:37:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10399.7). Total num frames: 53059584. Throughput: 0: 10363.4. Samples: 53054848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:37:09,012][65744] Avg episode reward: [(0, '3914.112')] +[2023-03-11 19:37:11,335][66031] Updated weights for policy 0, policy_version 103680 (0.0005) +[2023-03-11 19:37:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10385.8). Total num frames: 53108736. Throughput: 0: 10347.3. Samples: 53085208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:37:14,012][65744] Avg episode reward: [(0, '3854.619')] +[2023-03-11 19:37:15,242][66031] Updated weights for policy 0, policy_version 103760 (0.0005) +[2023-03-11 19:37:19,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10399.7). Total num frames: 53161984. Throughput: 0: 10347.7. Samples: 53148196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:37:19,012][65744] Avg episode reward: [(0, '3068.316')] +[2023-03-11 19:37:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000103832_53161984.pth... +[2023-03-11 19:37:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000103224_52850688.pth +[2023-03-11 19:37:19,264][66031] Updated weights for policy 0, policy_version 103840 (0.0005) +[2023-03-11 19:37:23,415][66031] Updated weights for policy 0, policy_version 103920 (0.0005) +[2023-03-11 19:37:24,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10385.8). Total num frames: 53211136. Throughput: 0: 10285.7. Samples: 53207112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:37:24,012][65744] Avg episode reward: [(0, '3903.408')] +[2023-03-11 19:37:27,314][66031] Updated weights for policy 0, policy_version 104000 (0.0005) +[2023-03-11 19:37:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10385.8). Total num frames: 53264384. Throughput: 0: 10343.7. Samples: 53238920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:37:29,012][65744] Avg episode reward: [(0, '3979.446')] +[2023-03-11 19:37:31,163][66031] Updated weights for policy 0, policy_version 104080 (0.0004) +[2023-03-11 19:37:34,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10399.7). Total num frames: 53317632. Throughput: 0: 10398.2. Samples: 53302216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:37:34,012][65744] Avg episode reward: [(0, '3875.593')] +[2023-03-11 19:37:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000104136_53317632.pth... +[2023-03-11 19:37:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000103528_53006336.pth +[2023-03-11 19:37:35,132][66031] Updated weights for policy 0, policy_version 104160 (0.0004) +[2023-03-11 19:37:39,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10385.8). Total num frames: 53366784. Throughput: 0: 10372.2. Samples: 53364628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:37:39,012][65744] Avg episode reward: [(0, '4044.655')] +[2023-03-11 19:37:39,053][66031] Updated weights for policy 0, policy_version 104240 (0.0004) +[2023-03-11 19:37:42,931][66031] Updated weights for policy 0, policy_version 104320 (0.0004) +[2023-03-11 19:37:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10385.8). Total num frames: 53420032. Throughput: 0: 10380.4. Samples: 53395744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:37:44,012][65744] Avg episode reward: [(0, '4081.583')] +[2023-03-11 19:37:46,872][66031] Updated weights for policy 0, policy_version 104400 (0.0004) +[2023-03-11 19:37:49,012][65744] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10399.7). Total num frames: 53473280. Throughput: 0: 10378.8. Samples: 53458956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:37:49,012][65744] Avg episode reward: [(0, '4256.841')] +[2023-03-11 19:37:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000104440_53473280.pth... +[2023-03-11 19:37:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000103832_53161984.pth +[2023-03-11 19:37:50,892][66031] Updated weights for policy 0, policy_version 104480 (0.0005) +[2023-03-11 19:37:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10385.8). Total num frames: 53522432. Throughput: 0: 10302.6. Samples: 53518464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:37:54,012][65744] Avg episode reward: [(0, '4314.123')] +[2023-03-11 19:37:54,977][66031] Updated weights for policy 0, policy_version 104560 (0.0005) +[2023-03-11 19:37:58,898][66031] Updated weights for policy 0, policy_version 104640 (0.0005) +[2023-03-11 19:37:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10399.7). Total num frames: 53575680. Throughput: 0: 10328.6. Samples: 53549996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:37:59,012][65744] Avg episode reward: [(0, '3915.045')] +[2023-03-11 19:38:02,850][66031] Updated weights for policy 0, policy_version 104720 (0.0005) +[2023-03-11 19:38:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10385.8). Total num frames: 53624832. Throughput: 0: 10319.0. Samples: 53612552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:38:04,012][65744] Avg episode reward: [(0, '3885.383')] +[2023-03-11 19:38:04,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000104736_53624832.pth... +[2023-03-11 19:38:04,016][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000104136_53317632.pth +[2023-03-11 19:38:06,852][66031] Updated weights for policy 0, policy_version 104800 (0.0005) +[2023-03-11 19:38:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10385.8). Total num frames: 53678080. Throughput: 0: 10377.1. Samples: 53674080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:38:09,012][65744] Avg episode reward: [(0, '3593.908')] +[2023-03-11 19:38:10,804][66031] Updated weights for policy 0, policy_version 104880 (0.0005) +[2023-03-11 19:38:14,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10399.7). Total num frames: 53731328. Throughput: 0: 10364.4. Samples: 53705320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:38:14,012][65744] Avg episode reward: [(0, '3892.015')] +[2023-03-11 19:38:14,765][66031] Updated weights for policy 0, policy_version 104960 (0.0005) +[2023-03-11 19:38:18,664][66031] Updated weights for policy 0, policy_version 105040 (0.0005) +[2023-03-11 19:38:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10385.8). Total num frames: 53780480. Throughput: 0: 10345.4. Samples: 53767756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:38:19,012][65744] Avg episode reward: [(0, '3770.311')] +[2023-03-11 19:38:19,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000105040_53780480.pth... +[2023-03-11 19:38:19,016][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000104440_53473280.pth +[2023-03-11 19:38:22,572][66031] Updated weights for policy 0, policy_version 105120 (0.0004) +[2023-03-11 19:38:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10385.8). Total num frames: 53833728. Throughput: 0: 10344.2. Samples: 53830116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:38:24,012][65744] Avg episode reward: [(0, '3578.110')] +[2023-03-11 19:38:26,548][66031] Updated weights for policy 0, policy_version 105200 (0.0005) +[2023-03-11 19:38:29,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10399.7). Total num frames: 53886976. Throughput: 0: 10345.6. Samples: 53861296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:38:29,012][65744] Avg episode reward: [(0, '3696.420')] +[2023-03-11 19:38:30,619][66031] Updated weights for policy 0, policy_version 105280 (0.0005) +[2023-03-11 19:38:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10385.8). Total num frames: 53936128. Throughput: 0: 10295.9. Samples: 53922272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:38:34,012][65744] Avg episode reward: [(0, '3683.929')] +[2023-03-11 19:38:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000105344_53936128.pth... +[2023-03-11 19:38:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000104736_53624832.pth +[2023-03-11 19:38:34,592][66031] Updated weights for policy 0, policy_version 105360 (0.0005) +[2023-03-11 19:38:38,638][66031] Updated weights for policy 0, policy_version 105440 (0.0005) +[2023-03-11 19:38:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10371.9). Total num frames: 53985280. Throughput: 0: 10342.7. Samples: 53983884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:38:39,012][65744] Avg episode reward: [(0, '3186.492')] +[2023-03-11 19:38:42,877][66031] Updated weights for policy 0, policy_version 105520 (0.0005) +[2023-03-11 19:38:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10358.0). Total num frames: 54034432. Throughput: 0: 10276.9. Samples: 54012456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:38:44,012][65744] Avg episode reward: [(0, '3399.290')] +[2023-03-11 19:38:47,123][66031] Updated weights for policy 0, policy_version 105600 (0.0005) +[2023-03-11 19:38:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10358.0). Total num frames: 54083584. Throughput: 0: 10180.9. Samples: 54070692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:38:49,012][65744] Avg episode reward: [(0, '3548.438')] +[2023-03-11 19:38:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000105632_54083584.pth... +[2023-03-11 19:38:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000105040_53780480.pth +[2023-03-11 19:38:51,345][66031] Updated weights for policy 0, policy_version 105680 (0.0005) +[2023-03-11 19:38:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10344.1). Total num frames: 54132736. Throughput: 0: 10102.9. Samples: 54128712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:38:54,012][65744] Avg episode reward: [(0, '3262.337')] +[2023-03-11 19:38:55,549][66031] Updated weights for policy 0, policy_version 105760 (0.0005) +[2023-03-11 19:38:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10330.3). Total num frames: 54181888. Throughput: 0: 10050.1. Samples: 54157576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:38:59,012][65744] Avg episode reward: [(0, '3591.400')] +[2023-03-11 19:38:59,792][66031] Updated weights for policy 0, policy_version 105840 (0.0005) +[2023-03-11 19:39:03,885][66031] Updated weights for policy 0, policy_version 105920 (0.0005) +[2023-03-11 19:39:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10330.3). Total num frames: 54231040. Throughput: 0: 9960.5. Samples: 54215980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:39:04,012][65744] Avg episode reward: [(0, '3116.941')] +[2023-03-11 19:39:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000105920_54231040.pth... +[2023-03-11 19:39:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000105344_53936128.pth +[2023-03-11 19:39:07,844][66031] Updated weights for policy 0, policy_version 106000 (0.0005) +[2023-03-11 19:39:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10344.1). Total num frames: 54280192. Throughput: 0: 9962.9. Samples: 54278444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:39:09,012][65744] Avg episode reward: [(0, '3744.629')] +[2023-03-11 19:39:11,862][66031] Updated weights for policy 0, policy_version 106080 (0.0005) +[2023-03-11 19:39:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10344.1). Total num frames: 54333440. Throughput: 0: 9946.1. Samples: 54308872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:39:14,012][65744] Avg episode reward: [(0, '3913.501')] +[2023-03-11 19:39:15,828][66031] Updated weights for policy 0, policy_version 106160 (0.0005) +[2023-03-11 19:39:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10344.1). Total num frames: 54382592. Throughput: 0: 9957.7. Samples: 54370368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:39:19,012][65744] Avg episode reward: [(0, '3982.141')] +[2023-03-11 19:39:19,028][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000106224_54386688.pth... +[2023-03-11 19:39:19,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000105632_54083584.pth +[2023-03-11 19:39:19,845][66031] Updated weights for policy 0, policy_version 106240 (0.0005) +[2023-03-11 19:39:23,829][66031] Updated weights for policy 0, policy_version 106320 (0.0004) +[2023-03-11 19:39:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10344.1). Total num frames: 54435840. Throughput: 0: 9954.0. Samples: 54431816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:39:24,012][65744] Avg episode reward: [(0, '3914.427')] +[2023-03-11 19:39:27,817][66031] Updated weights for policy 0, policy_version 106400 (0.0005) +[2023-03-11 19:39:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10330.3). Total num frames: 54484992. Throughput: 0: 10005.0. Samples: 54462680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:39:29,012][65744] Avg episode reward: [(0, '3398.153')] +[2023-03-11 19:39:31,726][66031] Updated weights for policy 0, policy_version 106480 (0.0004) +[2023-03-11 19:39:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10330.2). Total num frames: 54538240. Throughput: 0: 10107.9. Samples: 54525548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:39:34,012][65744] Avg episode reward: [(0, '3126.423')] +[2023-03-11 19:39:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000106520_54538240.pth... +[2023-03-11 19:39:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000105920_54231040.pth +[2023-03-11 19:39:35,857][66031] Updated weights for policy 0, policy_version 106560 (0.0004) +[2023-03-11 19:39:39,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10330.2). Total num frames: 54587392. Throughput: 0: 10107.4. Samples: 54583544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:39:39,012][65744] Avg episode reward: [(0, '3051.315')] +[2023-03-11 19:39:40,033][66031] Updated weights for policy 0, policy_version 106640 (0.0005) +[2023-03-11 19:39:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10316.4). Total num frames: 54636544. Throughput: 0: 10126.2. Samples: 54613256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:39:44,012][65744] Avg episode reward: [(0, '3811.185')] +[2023-03-11 19:39:44,237][66031] Updated weights for policy 0, policy_version 106720 (0.0005) +[2023-03-11 19:39:48,376][66031] Updated weights for policy 0, policy_version 106800 (0.0005) +[2023-03-11 19:39:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10302.5). Total num frames: 54685696. Throughput: 0: 10145.9. Samples: 54672544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:39:49,012][65744] Avg episode reward: [(0, '3175.696')] +[2023-03-11 19:39:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000106808_54685696.pth... +[2023-03-11 19:39:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000106224_54386688.pth +[2023-03-11 19:39:52,310][66031] Updated weights for policy 0, policy_version 106880 (0.0004) +[2023-03-11 19:39:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10302.5). Total num frames: 54738944. Throughput: 0: 10127.6. Samples: 54734188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:39:54,012][65744] Avg episode reward: [(0, '2764.297')] +[2023-03-11 19:39:56,344][66031] Updated weights for policy 0, policy_version 106960 (0.0005) +[2023-03-11 19:39:59,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10288.6). Total num frames: 54788096. Throughput: 0: 10123.5. Samples: 54764428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:39:59,012][65744] Avg episode reward: [(0, '3544.155')] +[2023-03-11 19:40:00,208][66031] Updated weights for policy 0, policy_version 107040 (0.0004) +[2023-03-11 19:40:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10288.6). Total num frames: 54841344. Throughput: 0: 10157.4. Samples: 54827452. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:40:04,012][65744] Avg episode reward: [(0, '4114.523')] +[2023-03-11 19:40:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000107112_54841344.pth... +[2023-03-11 19:40:04,016][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000106520_54538240.pth +[2023-03-11 19:40:04,240][66031] Updated weights for policy 0, policy_version 107120 (0.0005) +[2023-03-11 19:40:08,463][66031] Updated weights for policy 0, policy_version 107200 (0.0005) +[2023-03-11 19:40:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10288.6). Total num frames: 54890496. Throughput: 0: 10102.2. Samples: 54886416. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:40:09,012][65744] Avg episode reward: [(0, '4179.061')] +[2023-03-11 19:40:12,529][66031] Updated weights for policy 0, policy_version 107280 (0.0004) +[2023-03-11 19:40:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10274.7). Total num frames: 54939648. Throughput: 0: 10071.0. Samples: 54915876. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:40:14,012][65744] Avg episode reward: [(0, '4227.452')] +[2023-03-11 19:40:16,489][66031] Updated weights for policy 0, policy_version 107360 (0.0005) +[2023-03-11 19:40:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10274.7). Total num frames: 54992896. Throughput: 0: 10067.7. Samples: 54978596. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:40:19,012][65744] Avg episode reward: [(0, '4324.623')] +[2023-03-11 19:40:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000107408_54992896.pth... +[2023-03-11 19:40:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000106808_54685696.pth +[2023-03-11 19:40:20,605][66031] Updated weights for policy 0, policy_version 107440 (0.0005) +[2023-03-11 19:40:24,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10246.9). Total num frames: 55037952. Throughput: 0: 10070.1. Samples: 55036700. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:40:24,012][65744] Avg episode reward: [(0, '3991.203')] +[2023-03-11 19:40:24,906][66031] Updated weights for policy 0, policy_version 107520 (0.0005) +[2023-03-11 19:40:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 10246.9). Total num frames: 55087104. Throughput: 0: 10042.5. Samples: 55065168. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:40:29,012][65744] Avg episode reward: [(0, '4054.002')] +[2023-03-11 19:40:29,177][66031] Updated weights for policy 0, policy_version 107600 (0.0005) +[2023-03-11 19:40:33,281][66031] Updated weights for policy 0, policy_version 107680 (0.0005) +[2023-03-11 19:40:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 10233.1). Total num frames: 55136256. Throughput: 0: 10019.7. Samples: 55123432. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:40:34,012][65744] Avg episode reward: [(0, '4185.187')] +[2023-03-11 19:40:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000107688_55136256.pth... +[2023-03-11 19:40:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000107112_54841344.pth +[2023-03-11 19:40:37,501][66031] Updated weights for policy 0, policy_version 107760 (0.0005) +[2023-03-11 19:40:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10219.2). Total num frames: 55185408. Throughput: 0: 9953.0. Samples: 55182072. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:40:39,012][65744] Avg episode reward: [(0, '4202.617')] +[2023-03-11 19:40:41,754][66031] Updated weights for policy 0, policy_version 107840 (0.0005) +[2023-03-11 19:40:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10205.3). Total num frames: 55234560. Throughput: 0: 9928.3. Samples: 55211204. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:40:44,012][65744] Avg episode reward: [(0, '4233.033')] +[2023-03-11 19:40:46,012][66031] Updated weights for policy 0, policy_version 107920 (0.0005) +[2023-03-11 19:40:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10205.3). Total num frames: 55283712. Throughput: 0: 9806.3. Samples: 55268736. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:40:49,012][65744] Avg episode reward: [(0, '4202.856')] +[2023-03-11 19:40:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000107976_55283712.pth... +[2023-03-11 19:40:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000107408_54992896.pth +[2023-03-11 19:40:50,237][66031] Updated weights for policy 0, policy_version 108000 (0.0005) +[2023-03-11 19:40:54,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 10177.5). Total num frames: 55328768. Throughput: 0: 9803.6. Samples: 55327576. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:40:54,012][65744] Avg episode reward: [(0, '4142.235')] +[2023-03-11 19:40:54,456][66031] Updated weights for policy 0, policy_version 108080 (0.0004) +[2023-03-11 19:40:58,649][66031] Updated weights for policy 0, policy_version 108160 (0.0005) +[2023-03-11 19:40:59,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 10163.6). Total num frames: 55377920. Throughput: 0: 9796.8. Samples: 55356732. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 19:40:59,012][65744] Avg episode reward: [(0, '4091.917')] +[2023-03-11 19:41:02,876][66031] Updated weights for policy 0, policy_version 108240 (0.0004) +[2023-03-11 19:41:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10149.8). Total num frames: 55427072. Throughput: 0: 9694.5. Samples: 55414848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:41:04,012][65744] Avg episode reward: [(0, '4296.249')] +[2023-03-11 19:41:04,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000108256_55427072.pth... +[2023-03-11 19:41:04,016][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000107688_55136256.pth +[2023-03-11 19:41:06,795][66031] Updated weights for policy 0, policy_version 108320 (0.0004) +[2023-03-11 19:41:09,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9830.4, 300 sec: 10149.7). Total num frames: 55480320. Throughput: 0: 9783.7. Samples: 55476968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:41:09,012][65744] Avg episode reward: [(0, '4175.382')] +[2023-03-11 19:41:10,631][66031] Updated weights for policy 0, policy_version 108400 (0.0004) +[2023-03-11 19:41:14,012][65744] Fps is (10 sec: 10649.6, 60 sec: 9898.7, 300 sec: 10149.8). Total num frames: 55533568. Throughput: 0: 9864.2. Samples: 55509056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:41:14,012][65744] Avg episode reward: [(0, '3950.293')] +[2023-03-11 19:41:14,542][66031] Updated weights for policy 0, policy_version 108480 (0.0004) +[2023-03-11 19:41:18,428][66031] Updated weights for policy 0, policy_version 108560 (0.0004) +[2023-03-11 19:41:19,012][65744] Fps is (10 sec: 10649.6, 60 sec: 9898.7, 300 sec: 10149.7). Total num frames: 55586816. Throughput: 0: 9976.7. Samples: 55572384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:41:19,012][65744] Avg episode reward: [(0, '3822.208')] +[2023-03-11 19:41:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000108568_55586816.pth... +[2023-03-11 19:41:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000107976_55283712.pth +[2023-03-11 19:41:22,363][66031] Updated weights for policy 0, policy_version 108640 (0.0004) +[2023-03-11 19:41:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10035.2, 300 sec: 10149.7). Total num frames: 55640064. Throughput: 0: 10073.8. Samples: 55635392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:41:24,012][65744] Avg episode reward: [(0, '3661.649')] +[2023-03-11 19:41:26,292][66031] Updated weights for policy 0, policy_version 108720 (0.0004) +[2023-03-11 19:41:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10149.7). Total num frames: 55689216. Throughput: 0: 10111.8. Samples: 55666236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:41:29,012][65744] Avg episode reward: [(0, '4024.085')] +[2023-03-11 19:41:30,580][66031] Updated weights for policy 0, policy_version 108800 (0.0004) +[2023-03-11 19:41:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 55738368. Throughput: 0: 10107.5. Samples: 55723572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:41:34,012][65744] Avg episode reward: [(0, '3319.708')] +[2023-03-11 19:41:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000108864_55738368.pth... +[2023-03-11 19:41:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000108256_55427072.pth +[2023-03-11 19:41:34,814][66031] Updated weights for policy 0, policy_version 108880 (0.0005) +[2023-03-11 19:41:39,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9967.0, 300 sec: 10122.0). Total num frames: 55783424. Throughput: 0: 10095.9. Samples: 55781892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:41:39,012][65744] Avg episode reward: [(0, '4269.699')] +[2023-03-11 19:41:39,052][66031] Updated weights for policy 0, policy_version 108960 (0.0005) +[2023-03-11 19:41:42,941][66031] Updated weights for policy 0, policy_version 109040 (0.0005) +[2023-03-11 19:41:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10122.0). Total num frames: 55836672. Throughput: 0: 10125.6. Samples: 55812384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:41:44,012][65744] Avg episode reward: [(0, '4303.944')] +[2023-03-11 19:41:46,960][66031] Updated weights for policy 0, policy_version 109120 (0.0005) +[2023-03-11 19:41:49,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 10135.9). Total num frames: 55889920. Throughput: 0: 10213.7. Samples: 55874464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:41:49,012][65744] Avg episode reward: [(0, '4300.745')] +[2023-03-11 19:41:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000109160_55889920.pth... +[2023-03-11 19:41:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000108568_55586816.pth +[2023-03-11 19:41:50,792][66031] Updated weights for policy 0, policy_version 109200 (0.0004) +[2023-03-11 19:41:54,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 55943168. Throughput: 0: 10249.2. Samples: 55938180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:41:54,012][65744] Avg episode reward: [(0, '4208.786')] +[2023-03-11 19:41:54,708][66031] Updated weights for policy 0, policy_version 109280 (0.0004) +[2023-03-11 19:41:58,757][66031] Updated weights for policy 0, policy_version 109360 (0.0004) +[2023-03-11 19:41:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 55992320. Throughput: 0: 10217.2. Samples: 55968828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:41:59,012][65744] Avg episode reward: [(0, '4370.645')] +[2023-03-11 19:42:02,628][66031] Updated weights for policy 0, policy_version 109440 (0.0004) +[2023-03-11 19:42:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10122.0). Total num frames: 56045568. Throughput: 0: 10198.0. Samples: 56031296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:42:04,012][65744] Avg episode reward: [(0, '4448.081')] +[2023-03-11 19:42:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000109464_56045568.pth... +[2023-03-11 19:42:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000108864_55738368.pth +[2023-03-11 19:42:06,585][66031] Updated weights for policy 0, policy_version 109520 (0.0004) +[2023-03-11 19:42:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10135.9). Total num frames: 56098816. Throughput: 0: 10193.6. Samples: 56094104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:42:09,012][65744] Avg episode reward: [(0, '4259.808')] +[2023-03-11 19:42:10,455][66031] Updated weights for policy 0, policy_version 109600 (0.0004) +[2023-03-11 19:42:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 56147968. Throughput: 0: 10193.9. Samples: 56124960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:42:14,012][65744] Avg episode reward: [(0, '4011.011')] +[2023-03-11 19:42:14,726][66031] Updated weights for policy 0, policy_version 109680 (0.0005) +[2023-03-11 19:42:18,917][66031] Updated weights for policy 0, policy_version 109760 (0.0005) +[2023-03-11 19:42:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 56197120. Throughput: 0: 10207.1. Samples: 56182892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:42:19,012][65744] Avg episode reward: [(0, '4146.762')] +[2023-03-11 19:42:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000109760_56197120.pth... +[2023-03-11 19:42:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000109160_55889920.pth +[2023-03-11 19:42:23,164][66031] Updated weights for policy 0, policy_version 109840 (0.0005) +[2023-03-11 19:42:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 56242176. Throughput: 0: 10202.8. Samples: 56241020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:42:24,012][65744] Avg episode reward: [(0, '4268.204')] +[2023-03-11 19:42:27,350][66031] Updated weights for policy 0, policy_version 109920 (0.0005) +[2023-03-11 19:42:29,012][65744] Fps is (10 sec: 9420.9, 60 sec: 10035.2, 300 sec: 10080.3). Total num frames: 56291328. Throughput: 0: 10175.8. Samples: 56270292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:42:29,012][65744] Avg episode reward: [(0, '4123.887')] +[2023-03-11 19:42:31,619][66031] Updated weights for policy 0, policy_version 110000 (0.0005) +[2023-03-11 19:42:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10080.3). Total num frames: 56340480. Throughput: 0: 10083.1. Samples: 56328204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:42:34,012][65744] Avg episode reward: [(0, '4170.278')] +[2023-03-11 19:42:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000110040_56340480.pth... +[2023-03-11 19:42:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000109464_56045568.pth +[2023-03-11 19:42:35,884][66031] Updated weights for policy 0, policy_version 110080 (0.0005) +[2023-03-11 19:42:39,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10066.4). Total num frames: 56389632. Throughput: 0: 9942.7. Samples: 56385600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:42:39,012][65744] Avg episode reward: [(0, '3705.750')] +[2023-03-11 19:42:40,157][66031] Updated weights for policy 0, policy_version 110160 (0.0005) +[2023-03-11 19:42:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 10038.7). Total num frames: 56434688. Throughput: 0: 9898.8. Samples: 56414272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:42:44,012][65744] Avg episode reward: [(0, '3903.311')] +[2023-03-11 19:42:44,456][66031] Updated weights for policy 0, policy_version 110240 (0.0005) +[2023-03-11 19:42:48,773][66031] Updated weights for policy 0, policy_version 110320 (0.0005) +[2023-03-11 19:42:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 56483840. Throughput: 0: 9784.9. Samples: 56471616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:42:49,012][65744] Avg episode reward: [(0, '3614.828')] +[2023-03-11 19:42:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000110320_56483840.pth... +[2023-03-11 19:42:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000109760_56197120.pth +[2023-03-11 19:42:53,024][66031] Updated weights for policy 0, policy_version 110400 (0.0005) +[2023-03-11 19:42:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10024.8). Total num frames: 56532992. Throughput: 0: 9662.2. Samples: 56528904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:42:54,012][65744] Avg episode reward: [(0, '3428.346')] +[2023-03-11 19:42:57,239][66031] Updated weights for policy 0, policy_version 110480 (0.0005) +[2023-03-11 19:42:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10024.8). Total num frames: 56582144. Throughput: 0: 9615.1. Samples: 56557640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:42:59,012][65744] Avg episode reward: [(0, '3873.321')] +[2023-03-11 19:43:01,480][66031] Updated weights for policy 0, policy_version 110560 (0.0005) +[2023-03-11 19:43:04,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9997.0). Total num frames: 56627200. Throughput: 0: 9616.3. Samples: 56615628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:43:04,012][65744] Avg episode reward: [(0, '4013.680')] +[2023-03-11 19:43:04,052][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000110608_56631296.pth... +[2023-03-11 19:43:04,053][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000110040_56340480.pth +[2023-03-11 19:43:05,805][66031] Updated weights for policy 0, policy_version 110640 (0.0005) +[2023-03-11 19:43:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9983.1). Total num frames: 56676352. Throughput: 0: 9585.2. Samples: 56672356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:43:09,012][65744] Avg episode reward: [(0, '4095.897')] +[2023-03-11 19:43:10,136][66031] Updated weights for policy 0, policy_version 110720 (0.0005) +[2023-03-11 19:43:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9983.1). Total num frames: 56725504. Throughput: 0: 9571.3. Samples: 56701000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:43:14,012][65744] Avg episode reward: [(0, '4148.528')] +[2023-03-11 19:43:14,437][66031] Updated weights for policy 0, policy_version 110800 (0.0004) +[2023-03-11 19:43:18,422][66031] Updated weights for policy 0, policy_version 110880 (0.0003) +[2023-03-11 19:43:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9969.2). Total num frames: 56774656. Throughput: 0: 9595.5. Samples: 56760004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:43:19,012][65744] Avg episode reward: [(0, '3637.487')] +[2023-03-11 19:43:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000110888_56774656.pth... +[2023-03-11 19:43:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000110320_56483840.pth +[2023-03-11 19:43:22,578][66031] Updated weights for policy 0, policy_version 110960 (0.0003) +[2023-03-11 19:43:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9955.4). Total num frames: 56823808. Throughput: 0: 9648.5. Samples: 56819784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:43:24,012][65744] Avg episode reward: [(0, '3762.450')] +[2023-03-11 19:43:26,679][66031] Updated weights for policy 0, policy_version 111040 (0.0005) +[2023-03-11 19:43:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.8, 300 sec: 9955.4). Total num frames: 56872960. Throughput: 0: 9688.3. Samples: 56850248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:43:29,012][65744] Avg episode reward: [(0, '4013.652')] +[2023-03-11 19:43:30,603][66031] Updated weights for policy 0, policy_version 111120 (0.0004) +[2023-03-11 19:43:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 9969.2). Total num frames: 56926208. Throughput: 0: 9781.9. Samples: 56911804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:43:34,012][65744] Avg episode reward: [(0, '3937.495')] +[2023-03-11 19:43:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000111184_56926208.pth... +[2023-03-11 19:43:34,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000110608_56631296.pth +[2023-03-11 19:43:34,729][66031] Updated weights for policy 0, policy_version 111200 (0.0005) +[2023-03-11 19:43:38,703][66031] Updated weights for policy 0, policy_version 111280 (0.0004) +[2023-03-11 19:43:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9969.2). Total num frames: 56975360. Throughput: 0: 9854.4. Samples: 56972352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:43:39,012][65744] Avg episode reward: [(0, '3540.299')] +[2023-03-11 19:43:42,994][66031] Updated weights for policy 0, policy_version 111360 (0.0005) +[2023-03-11 19:43:44,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9969.2). Total num frames: 57024512. Throughput: 0: 9865.1. Samples: 57001572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:43:44,012][65744] Avg episode reward: [(0, '4387.844')] +[2023-03-11 19:43:47,256][66031] Updated weights for policy 0, policy_version 111440 (0.0005) +[2023-03-11 19:43:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9969.2). Total num frames: 57073664. Throughput: 0: 9858.8. Samples: 57059272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:43:49,012][65744] Avg episode reward: [(0, '4303.013')] +[2023-03-11 19:43:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000111472_57073664.pth... +[2023-03-11 19:43:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000110888_56774656.pth +[2023-03-11 19:43:51,548][66031] Updated weights for policy 0, policy_version 111520 (0.0005) +[2023-03-11 19:43:54,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 9955.4). Total num frames: 57118720. Throughput: 0: 9859.7. Samples: 57116040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:43:54,012][65744] Avg episode reward: [(0, '3563.226')] +[2023-03-11 19:43:55,829][66031] Updated weights for policy 0, policy_version 111600 (0.0005) +[2023-03-11 19:43:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9955.4). Total num frames: 57167872. Throughput: 0: 9864.9. Samples: 57144920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:43:59,012][65744] Avg episode reward: [(0, '3519.008')] +[2023-03-11 19:44:00,169][66031] Updated weights for policy 0, policy_version 111680 (0.0005) +[2023-03-11 19:44:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9955.4). Total num frames: 57217024. Throughput: 0: 9818.4. Samples: 57201832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:44:04,012][65744] Avg episode reward: [(0, '3623.383')] +[2023-03-11 19:44:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000111752_57217024.pth... +[2023-03-11 19:44:04,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000111184_56926208.pth +[2023-03-11 19:44:04,435][66031] Updated weights for policy 0, policy_version 111760 (0.0005) +[2023-03-11 19:44:08,776][66031] Updated weights for policy 0, policy_version 111840 (0.0005) +[2023-03-11 19:44:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 57262080. Throughput: 0: 9753.5. Samples: 57258692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:44:09,012][65744] Avg episode reward: [(0, '3035.267')] +[2023-03-11 19:44:13,121][66031] Updated weights for policy 0, policy_version 111920 (0.0005) +[2023-03-11 19:44:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 57311232. Throughput: 0: 9707.0. Samples: 57287064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:44:14,012][65744] Avg episode reward: [(0, '3283.511')] +[2023-03-11 19:44:17,432][66031] Updated weights for policy 0, policy_version 112000 (0.0005) +[2023-03-11 19:44:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9899.8). Total num frames: 57356288. Throughput: 0: 9605.8. Samples: 57344064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:44:19,012][65744] Avg episode reward: [(0, '3795.365')] +[2023-03-11 19:44:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000112024_57356288.pth... +[2023-03-11 19:44:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000111472_57073664.pth +[2023-03-11 19:44:21,730][66031] Updated weights for policy 0, policy_version 112080 (0.0005) +[2023-03-11 19:44:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9899.8). Total num frames: 57405440. Throughput: 0: 9534.6. Samples: 57401408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:44:24,012][65744] Avg episode reward: [(0, '3996.925')] +[2023-03-11 19:44:26,029][66031] Updated weights for policy 0, policy_version 112160 (0.0005) +[2023-03-11 19:44:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9885.9). Total num frames: 57454592. Throughput: 0: 9521.2. Samples: 57430024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:44:29,012][65744] Avg episode reward: [(0, '3511.367')] +[2023-03-11 19:44:30,291][66031] Updated weights for policy 0, policy_version 112240 (0.0005) +[2023-03-11 19:44:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.4, 300 sec: 9872.1). Total num frames: 57499648. Throughput: 0: 9515.7. Samples: 57487480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:44:34,012][65744] Avg episode reward: [(0, '3564.540')] +[2023-03-11 19:44:34,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000112304_57499648.pth... +[2023-03-11 19:44:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000111752_57217024.pth +[2023-03-11 19:44:34,532][66031] Updated weights for policy 0, policy_version 112320 (0.0005) +[2023-03-11 19:44:38,825][66031] Updated weights for policy 0, policy_version 112400 (0.0005) +[2023-03-11 19:44:39,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9872.1). Total num frames: 57548800. Throughput: 0: 9527.5. Samples: 57544780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:44:39,012][65744] Avg episode reward: [(0, '4172.101')] +[2023-03-11 19:44:43,070][66031] Updated weights for policy 0, policy_version 112480 (0.0004) +[2023-03-11 19:44:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9872.1). Total num frames: 57597952. Throughput: 0: 9534.2. Samples: 57573960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:44:44,012][65744] Avg episode reward: [(0, '4423.507')] +[2023-03-11 19:44:47,321][66031] Updated weights for policy 0, policy_version 112560 (0.0004) +[2023-03-11 19:44:49,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9844.3). Total num frames: 57643008. Throughput: 0: 9556.2. Samples: 57631860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:44:49,012][65744] Avg episode reward: [(0, '4325.772')] +[2023-03-11 19:44:49,053][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000112592_57647104.pth... +[2023-03-11 19:44:49,054][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000112024_57356288.pth +[2023-03-11 19:44:51,548][66031] Updated weights for policy 0, policy_version 112640 (0.0005) +[2023-03-11 19:44:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9844.3). Total num frames: 57692160. Throughput: 0: 9588.8. Samples: 57690188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:44:54,012][65744] Avg episode reward: [(0, '4202.017')] +[2023-03-11 19:44:55,824][66031] Updated weights for policy 0, policy_version 112720 (0.0005) +[2023-03-11 19:44:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9830.4). Total num frames: 57741312. Throughput: 0: 9591.9. Samples: 57718700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:44:59,012][65744] Avg episode reward: [(0, '4176.353')] +[2023-03-11 19:45:00,109][66031] Updated weights for policy 0, policy_version 112800 (0.0005) +[2023-03-11 19:45:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9830.4). Total num frames: 57790464. Throughput: 0: 9628.4. Samples: 57777340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:45:04,012][65744] Avg episode reward: [(0, '4374.459')] +[2023-03-11 19:45:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000112872_57790464.pth... +[2023-03-11 19:45:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000112304_57499648.pth +[2023-03-11 19:45:04,267][66031] Updated weights for policy 0, policy_version 112880 (0.0005) +[2023-03-11 19:45:08,512][66031] Updated weights for policy 0, policy_version 112960 (0.0005) +[2023-03-11 19:45:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9830.4). Total num frames: 57839616. Throughput: 0: 9645.6. Samples: 57835460. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:45:09,012][65744] Avg episode reward: [(0, '4357.932')] +[2023-03-11 19:45:12,736][66031] Updated weights for policy 0, policy_version 113040 (0.0005) +[2023-03-11 19:45:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9802.6). Total num frames: 57884672. Throughput: 0: 9648.4. Samples: 57864200. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:45:14,012][65744] Avg episode reward: [(0, '4337.279')] +[2023-03-11 19:45:16,948][66031] Updated weights for policy 0, policy_version 113120 (0.0005) +[2023-03-11 19:45:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9816.5). Total num frames: 57933824. Throughput: 0: 9659.9. Samples: 57922176. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:45:19,012][65744] Avg episode reward: [(0, '4385.505')] +[2023-03-11 19:45:19,041][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000113160_57937920.pth... +[2023-03-11 19:45:19,043][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000112592_57647104.pth +[2023-03-11 19:45:21,176][66031] Updated weights for policy 0, policy_version 113200 (0.0005) +[2023-03-11 19:45:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9816.5). Total num frames: 57982976. Throughput: 0: 9678.3. Samples: 57980304. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:45:24,012][65744] Avg episode reward: [(0, '4221.567')] +[2023-03-11 19:45:25,373][66031] Updated weights for policy 0, policy_version 113280 (0.0005) +[2023-03-11 19:45:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9816.5). Total num frames: 58032128. Throughput: 0: 9686.6. Samples: 58009856. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:45:29,012][65744] Avg episode reward: [(0, '4500.796')] +[2023-03-11 19:45:29,632][66031] Updated weights for policy 0, policy_version 113360 (0.0004) +[2023-03-11 19:45:33,573][66031] Updated weights for policy 0, policy_version 113440 (0.0004) +[2023-03-11 19:45:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 58085376. Throughput: 0: 9734.1. Samples: 58069896. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:45:34,012][65744] Avg episode reward: [(0, '4081.346')] +[2023-03-11 19:45:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000113448_58085376.pth... +[2023-03-11 19:45:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000112872_57790464.pth +[2023-03-11 19:45:37,801][66031] Updated weights for policy 0, policy_version 113520 (0.0005) +[2023-03-11 19:45:39,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9816.5). Total num frames: 58130432. Throughput: 0: 9738.0. Samples: 58128400. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:45:39,012][65744] Avg episode reward: [(0, '4437.961')] +[2023-03-11 19:45:41,952][66031] Updated weights for policy 0, policy_version 113600 (0.0005) +[2023-03-11 19:45:44,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9816.5). Total num frames: 58179584. Throughput: 0: 9772.5. Samples: 58158464. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:45:44,012][65744] Avg episode reward: [(0, '4410.746')] +[2023-03-11 19:45:46,208][66031] Updated weights for policy 0, policy_version 113680 (0.0005) +[2023-03-11 19:45:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 58228736. Throughput: 0: 9758.2. Samples: 58216460. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:45:49,012][65744] Avg episode reward: [(0, '4434.451')] +[2023-03-11 19:45:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000113728_58228736.pth... +[2023-03-11 19:45:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000113160_57937920.pth +[2023-03-11 19:45:50,426][66031] Updated weights for policy 0, policy_version 113760 (0.0005) +[2023-03-11 19:45:54,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 58277888. Throughput: 0: 9749.2. Samples: 58274176. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:45:54,012][65744] Avg episode reward: [(0, '4252.611')] +[2023-03-11 19:45:54,670][66031] Updated weights for policy 0, policy_version 113840 (0.0005) +[2023-03-11 19:45:58,896][66031] Updated weights for policy 0, policy_version 113920 (0.0005) +[2023-03-11 19:45:59,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 58327040. Throughput: 0: 9757.3. Samples: 58303280. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:45:59,012][65744] Avg episode reward: [(0, '4412.301')] +[2023-03-11 19:46:03,151][66031] Updated weights for policy 0, policy_version 114000 (0.0005) +[2023-03-11 19:46:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9802.6). Total num frames: 58372096. Throughput: 0: 9757.2. Samples: 58361248. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 19:46:04,012][65744] Avg episode reward: [(0, '4337.444')] +[2023-03-11 19:46:04,037][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000114016_58376192.pth... +[2023-03-11 19:46:04,038][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000113448_58085376.pth +[2023-03-11 19:46:07,498][66031] Updated weights for policy 0, policy_version 114080 (0.0005) +[2023-03-11 19:46:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 58421248. Throughput: 0: 9718.8. Samples: 58417652. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:46:09,012][65744] Avg episode reward: [(0, '4488.279')] +[2023-03-11 19:46:11,632][66031] Updated weights for policy 0, policy_version 114160 (0.0005) +[2023-03-11 19:46:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9788.7). Total num frames: 58474496. Throughput: 0: 9736.3. Samples: 58447988. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:46:14,012][65744] Avg episode reward: [(0, '4488.418')] +[2023-03-11 19:46:15,684][66031] Updated weights for policy 0, policy_version 114240 (0.0004) +[2023-03-11 19:46:19,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9761.0). Total num frames: 58519552. Throughput: 0: 9726.6. Samples: 58507592. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:46:19,012][65744] Avg episode reward: [(0, '4394.726')] +[2023-03-11 19:46:19,048][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000114304_58523648.pth... +[2023-03-11 19:46:19,050][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000113728_58228736.pth +[2023-03-11 19:46:19,941][66031] Updated weights for policy 0, policy_version 114320 (0.0005) +[2023-03-11 19:46:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9761.0). Total num frames: 58568704. Throughput: 0: 9722.0. Samples: 58565888. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:46:24,023][65744] Avg episode reward: [(0, '4327.302')] +[2023-03-11 19:46:24,085][66031] Updated weights for policy 0, policy_version 114400 (0.0005) +[2023-03-11 19:46:28,004][66031] Updated weights for policy 0, policy_version 114480 (0.0004) +[2023-03-11 19:46:29,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 9774.9). Total num frames: 58621952. Throughput: 0: 9755.0. Samples: 58597440. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:46:29,012][65744] Avg episode reward: [(0, '4161.514')] +[2023-03-11 19:46:32,002][66031] Updated weights for policy 0, policy_version 114560 (0.0004) +[2023-03-11 19:46:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 58671104. Throughput: 0: 9831.9. Samples: 58658896. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:46:34,012][65744] Avg episode reward: [(0, '3961.505')] +[2023-03-11 19:46:34,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000114600_58675200.pth... +[2023-03-11 19:46:34,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000114016_58376192.pth +[2023-03-11 19:46:36,129][66031] Updated weights for policy 0, policy_version 114640 (0.0004) +[2023-03-11 19:46:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9774.9). Total num frames: 58720256. Throughput: 0: 9848.3. Samples: 58717352. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:46:39,012][65744] Avg episode reward: [(0, '4196.189')] +[2023-03-11 19:46:40,464][66031] Updated weights for policy 0, policy_version 114720 (0.0005) +[2023-03-11 19:46:44,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9761.0). Total num frames: 58769408. Throughput: 0: 9848.2. Samples: 58746448. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:46:44,012][65744] Avg episode reward: [(0, '4346.680')] +[2023-03-11 19:46:44,718][66031] Updated weights for policy 0, policy_version 114800 (0.0005) +[2023-03-11 19:46:48,599][66031] Updated weights for policy 0, policy_version 114880 (0.0004) +[2023-03-11 19:46:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9761.0). Total num frames: 58822656. Throughput: 0: 9891.0. Samples: 58806344. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:46:49,012][65744] Avg episode reward: [(0, '4331.145')] +[2023-03-11 19:46:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000114888_58822656.pth... +[2023-03-11 19:46:49,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000114304_58523648.pth +[2023-03-11 19:46:52,642][66031] Updated weights for policy 0, policy_version 114960 (0.0005) +[2023-03-11 19:46:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9761.0). Total num frames: 58871808. Throughput: 0: 10002.8. Samples: 58867776. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:46:54,012][65744] Avg episode reward: [(0, '4334.606')] +[2023-03-11 19:46:56,561][66031] Updated weights for policy 0, policy_version 115040 (0.0005) +[2023-03-11 19:46:59,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 9761.0). Total num frames: 58925056. Throughput: 0: 10030.5. Samples: 58899360. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:46:59,012][65744] Avg episode reward: [(0, '4306.079')] +[2023-03-11 19:47:00,422][66031] Updated weights for policy 0, policy_version 115120 (0.0003) +[2023-03-11 19:47:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9747.1). Total num frames: 58974208. Throughput: 0: 10099.9. Samples: 58962088. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:47:04,012][65744] Avg episode reward: [(0, '4298.459')] +[2023-03-11 19:47:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000115192_58978304.pth... +[2023-03-11 19:47:04,027][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000114600_58675200.pth +[2023-03-11 19:47:04,409][66031] Updated weights for policy 0, policy_version 115200 (0.0004) +[2023-03-11 19:47:08,393][66031] Updated weights for policy 0, policy_version 115280 (0.0004) +[2023-03-11 19:47:09,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9761.0). Total num frames: 59027456. Throughput: 0: 10172.0. Samples: 59023628. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:47:09,012][65744] Avg episode reward: [(0, '4296.128')] +[2023-03-11 19:47:12,289][66031] Updated weights for policy 0, policy_version 115360 (0.0004) +[2023-03-11 19:47:14,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 9774.9). Total num frames: 59080704. Throughput: 0: 10176.7. Samples: 59055392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:47:14,012][65744] Avg episode reward: [(0, '4346.339')] +[2023-03-11 19:47:16,448][66031] Updated weights for policy 0, policy_version 115440 (0.0005) +[2023-03-11 19:47:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9788.7). Total num frames: 59129856. Throughput: 0: 10138.4. Samples: 59115124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:47:19,012][65744] Avg episode reward: [(0, '4095.968')] +[2023-03-11 19:47:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000115488_59129856.pth... +[2023-03-11 19:47:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000114888_58822656.pth +[2023-03-11 19:47:20,578][66031] Updated weights for policy 0, policy_version 115520 (0.0005) +[2023-03-11 19:47:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9788.7). Total num frames: 59179008. Throughput: 0: 10168.3. Samples: 59174924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:47:24,012][65744] Avg episode reward: [(0, '4244.127')] +[2023-03-11 19:47:24,758][66031] Updated weights for policy 0, policy_version 115600 (0.0005) +[2023-03-11 19:47:28,937][66031] Updated weights for policy 0, policy_version 115680 (0.0005) +[2023-03-11 19:47:29,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 9788.7). Total num frames: 59228160. Throughput: 0: 10160.0. Samples: 59203648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:47:29,012][65744] Avg episode reward: [(0, '4283.348')] +[2023-03-11 19:47:33,195][66031] Updated weights for policy 0, policy_version 115760 (0.0005) +[2023-03-11 19:47:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9788.7). Total num frames: 59277312. Throughput: 0: 10127.4. Samples: 59262076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:47:34,012][65744] Avg episode reward: [(0, '4340.089')] +[2023-03-11 19:47:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000115776_59277312.pth... +[2023-03-11 19:47:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000115192_58978304.pth +[2023-03-11 19:47:37,372][66031] Updated weights for policy 0, policy_version 115840 (0.0004) +[2023-03-11 19:47:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 9788.7). Total num frames: 59322368. Throughput: 0: 10067.6. Samples: 59320816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:47:39,012][65744] Avg episode reward: [(0, '4293.317')] +[2023-03-11 19:47:41,595][66031] Updated weights for policy 0, policy_version 115920 (0.0005) +[2023-03-11 19:47:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 9788.7). Total num frames: 59371520. Throughput: 0: 10007.6. Samples: 59349700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:47:44,012][65744] Avg episode reward: [(0, '3977.802')] +[2023-03-11 19:47:45,828][66031] Updated weights for policy 0, policy_version 116000 (0.0005) +[2023-03-11 19:47:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9788.7). Total num frames: 59420672. Throughput: 0: 9917.7. Samples: 59408384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:47:49,012][65744] Avg episode reward: [(0, '4350.399')] +[2023-03-11 19:47:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000116056_59420672.pth... +[2023-03-11 19:47:49,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000115488_59129856.pth +[2023-03-11 19:47:50,048][66031] Updated weights for policy 0, policy_version 116080 (0.0005) +[2023-03-11 19:47:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9788.7). Total num frames: 59469824. Throughput: 0: 9829.0. Samples: 59465932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:47:54,012][65744] Avg episode reward: [(0, '4377.278')] +[2023-03-11 19:47:54,220][66031] Updated weights for policy 0, policy_version 116160 (0.0005) +[2023-03-11 19:47:58,420][66031] Updated weights for policy 0, policy_version 116240 (0.0005) +[2023-03-11 19:47:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9802.6). Total num frames: 59518976. Throughput: 0: 9777.4. Samples: 59495376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:47:59,012][65744] Avg episode reward: [(0, '4440.776')] +[2023-03-11 19:48:02,564][66031] Updated weights for policy 0, policy_version 116320 (0.0005) +[2023-03-11 19:48:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9802.6). Total num frames: 59568128. Throughput: 0: 9783.2. Samples: 59555368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:48:04,012][65744] Avg episode reward: [(0, '4287.793')] +[2023-03-11 19:48:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000116344_59568128.pth... +[2023-03-11 19:48:04,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000115776_59277312.pth +[2023-03-11 19:48:06,784][66031] Updated weights for policy 0, policy_version 116400 (0.0005) +[2023-03-11 19:48:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9802.6). Total num frames: 59617280. Throughput: 0: 9740.5. Samples: 59613248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:48:09,012][65744] Avg episode reward: [(0, '4379.971')] +[2023-03-11 19:48:10,936][66031] Updated weights for policy 0, policy_version 116480 (0.0005) +[2023-03-11 19:48:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9802.6). Total num frames: 59666432. Throughput: 0: 9755.9. Samples: 59642664. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:48:14,012][65744] Avg episode reward: [(0, '3291.702')] +[2023-03-11 19:48:15,129][66031] Updated weights for policy 0, policy_version 116560 (0.0005) +[2023-03-11 19:48:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9802.6). Total num frames: 59715584. Throughput: 0: 9772.4. Samples: 59701832. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:48:19,012][65744] Avg episode reward: [(0, '3973.013')] +[2023-03-11 19:48:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000116632_59715584.pth... +[2023-03-11 19:48:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000116056_59420672.pth +[2023-03-11 19:48:19,353][66031] Updated weights for policy 0, policy_version 116640 (0.0005) +[2023-03-11 19:48:23,551][66031] Updated weights for policy 0, policy_version 116720 (0.0005) +[2023-03-11 19:48:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9802.6). Total num frames: 59764736. Throughput: 0: 9762.0. Samples: 59760108. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:48:24,012][65744] Avg episode reward: [(0, '4523.622')] +[2023-03-11 19:48:27,753][66031] Updated weights for policy 0, policy_version 116800 (0.0005) +[2023-03-11 19:48:29,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 59813888. Throughput: 0: 9765.6. Samples: 59789152. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:48:29,012][65744] Avg episode reward: [(0, '4105.662')] +[2023-03-11 19:48:31,983][66031] Updated weights for policy 0, policy_version 116880 (0.0005) +[2023-03-11 19:48:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 59858944. Throughput: 0: 9741.0. Samples: 59846728. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:48:34,012][65744] Avg episode reward: [(0, '3921.938')] +[2023-03-11 19:48:34,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000116912_59858944.pth... +[2023-03-11 19:48:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000116344_59568128.pth +[2023-03-11 19:48:36,225][66031] Updated weights for policy 0, policy_version 116960 (0.0005) +[2023-03-11 19:48:39,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 59908096. Throughput: 0: 9773.2. Samples: 59905724. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:48:39,012][65744] Avg episode reward: [(0, '4396.523')] +[2023-03-11 19:48:40,353][66031] Updated weights for policy 0, policy_version 117040 (0.0005) +[2023-03-11 19:48:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 59957248. Throughput: 0: 9774.5. Samples: 59935228. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:48:44,012][65744] Avg episode reward: [(0, '4559.102')] +[2023-03-11 19:48:44,604][66031] Updated weights for policy 0, policy_version 117120 (0.0005) +[2023-03-11 19:48:48,794][66031] Updated weights for policy 0, policy_version 117200 (0.0005) +[2023-03-11 19:48:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 60006400. Throughput: 0: 9748.7. Samples: 59994060. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:48:49,012][65744] Avg episode reward: [(0, '4248.239')] +[2023-03-11 19:48:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000117200_60006400.pth... +[2023-03-11 19:48:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000116632_59715584.pth +[2023-03-11 19:48:53,028][66031] Updated weights for policy 0, policy_version 117280 (0.0005) +[2023-03-11 19:48:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 60055552. Throughput: 0: 9739.4. Samples: 60051520. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:48:54,012][65744] Avg episode reward: [(0, '4581.381')] +[2023-03-11 19:48:54,013][65987] Saving new best policy, reward=4581.381! +[2023-03-11 19:48:57,204][66031] Updated weights for policy 0, policy_version 117360 (0.0005) +[2023-03-11 19:48:59,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 60104704. Throughput: 0: 9728.3. Samples: 60080436. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:48:59,012][65744] Avg episode reward: [(0, '4522.882')] +[2023-03-11 19:49:01,287][66031] Updated weights for policy 0, policy_version 117440 (0.0005) +[2023-03-11 19:49:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9802.6). Total num frames: 60153856. Throughput: 0: 9749.1. Samples: 60140540. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:49:04,012][65744] Avg episode reward: [(0, '4352.242')] +[2023-03-11 19:49:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000117488_60153856.pth... +[2023-03-11 19:49:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000116912_59858944.pth +[2023-03-11 19:49:05,529][66031] Updated weights for policy 0, policy_version 117520 (0.0005) +[2023-03-11 19:49:09,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9802.6). Total num frames: 60203008. Throughput: 0: 9751.2. Samples: 60198912. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 19:49:09,012][65744] Avg episode reward: [(0, '4373.841')] +[2023-03-11 19:49:09,774][66031] Updated weights for policy 0, policy_version 117600 (0.0005) +[2023-03-11 19:49:14,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9802.6). Total num frames: 60248064. Throughput: 0: 9743.0. Samples: 60227584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:49:14,012][65744] Avg episode reward: [(0, '4385.144')] +[2023-03-11 19:49:14,041][66031] Updated weights for policy 0, policy_version 117680 (0.0005) +[2023-03-11 19:49:18,211][66031] Updated weights for policy 0, policy_version 117760 (0.0005) +[2023-03-11 19:49:19,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9802.6). Total num frames: 60297216. Throughput: 0: 9746.3. Samples: 60285312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:49:19,012][65744] Avg episode reward: [(0, '4379.587')] +[2023-03-11 19:49:19,032][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000117776_60301312.pth... +[2023-03-11 19:49:19,033][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000117200_60006400.pth +[2023-03-11 19:49:22,309][66031] Updated weights for policy 0, policy_version 117840 (0.0005) +[2023-03-11 19:49:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9816.5). Total num frames: 60350464. Throughput: 0: 9775.6. Samples: 60345624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:49:24,012][65744] Avg episode reward: [(0, '4119.042')] +[2023-03-11 19:49:26,491][66031] Updated weights for policy 0, policy_version 117920 (0.0004) +[2023-03-11 19:49:29,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9762.2, 300 sec: 9830.4). Total num frames: 60399616. Throughput: 0: 9773.6. Samples: 60375040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:49:29,012][65744] Avg episode reward: [(0, '4461.020')] +[2023-03-11 19:49:30,644][66031] Updated weights for policy 0, policy_version 118000 (0.0005) +[2023-03-11 19:49:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 60448768. Throughput: 0: 9764.7. Samples: 60433472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:49:34,012][65744] Avg episode reward: [(0, '4524.318')] +[2023-03-11 19:49:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000118064_60448768.pth... +[2023-03-11 19:49:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000117488_60153856.pth +[2023-03-11 19:49:34,766][66031] Updated weights for policy 0, policy_version 118080 (0.0005) +[2023-03-11 19:49:38,979][66031] Updated weights for policy 0, policy_version 118160 (0.0005) +[2023-03-11 19:49:39,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 60497920. Throughput: 0: 9812.8. Samples: 60493096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:49:39,012][65744] Avg episode reward: [(0, '4168.349')] +[2023-03-11 19:49:43,208][66031] Updated weights for policy 0, policy_version 118240 (0.0005) +[2023-03-11 19:49:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 60542976. Throughput: 0: 9820.8. Samples: 60522372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:49:44,012][65744] Avg episode reward: [(0, '3987.220')] +[2023-03-11 19:49:47,348][66031] Updated weights for policy 0, policy_version 118320 (0.0005) +[2023-03-11 19:49:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 60592128. Throughput: 0: 9781.6. Samples: 60580712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:49:49,012][65744] Avg episode reward: [(0, '3883.006')] +[2023-03-11 19:49:49,028][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000118352_60596224.pth... +[2023-03-11 19:49:49,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000117776_60301312.pth +[2023-03-11 19:49:51,584][66031] Updated weights for policy 0, policy_version 118400 (0.0005) +[2023-03-11 19:49:54,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 60641280. Throughput: 0: 9787.4. Samples: 60639344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:49:54,012][65744] Avg episode reward: [(0, '3878.697')] +[2023-03-11 19:49:55,740][66031] Updated weights for policy 0, policy_version 118480 (0.0005) +[2023-03-11 19:49:59,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 60690432. Throughput: 0: 9804.5. Samples: 60668784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:49:59,012][65744] Avg episode reward: [(0, '3862.429')] +[2023-03-11 19:49:59,905][66031] Updated weights for policy 0, policy_version 118560 (0.0005) +[2023-03-11 19:50:04,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 60739584. Throughput: 0: 9821.9. Samples: 60727296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:50:04,012][65744] Avg episode reward: [(0, '4455.893')] +[2023-03-11 19:50:04,017][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000118632_60739584.pth... +[2023-03-11 19:50:04,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000118064_60448768.pth +[2023-03-11 19:50:04,244][66031] Updated weights for policy 0, policy_version 118640 (0.0005) +[2023-03-11 19:50:08,497][66031] Updated weights for policy 0, policy_version 118720 (0.0005) +[2023-03-11 19:50:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9844.3). Total num frames: 60788736. Throughput: 0: 9755.9. Samples: 60784640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:50:09,012][65744] Avg episode reward: [(0, '4413.934')] +[2023-03-11 19:50:12,697][66031] Updated weights for policy 0, policy_version 118800 (0.0005) +[2023-03-11 19:50:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9844.3). Total num frames: 60837888. Throughput: 0: 9740.8. Samples: 60813376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:50:14,012][65744] Avg episode reward: [(0, '4474.529')] +[2023-03-11 19:50:16,907][66031] Updated weights for policy 0, policy_version 118880 (0.0005) +[2023-03-11 19:50:19,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 60882944. Throughput: 0: 9737.8. Samples: 60871672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:50:19,012][65744] Avg episode reward: [(0, '4429.030')] +[2023-03-11 19:50:19,023][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000118920_60887040.pth... +[2023-03-11 19:50:19,025][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000118352_60596224.pth +[2023-03-11 19:50:21,113][66031] Updated weights for policy 0, policy_version 118960 (0.0005) +[2023-03-11 19:50:24,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9844.3). Total num frames: 60936192. Throughput: 0: 9755.7. Samples: 60932104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:50:24,012][65744] Avg episode reward: [(0, '4424.173')] +[2023-03-11 19:50:25,046][66031] Updated weights for policy 0, policy_version 119040 (0.0004) +[2023-03-11 19:50:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 60985344. Throughput: 0: 9798.0. Samples: 60963284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:50:29,012][65744] Avg episode reward: [(0, '4227.786')] +[2023-03-11 19:50:29,044][66031] Updated weights for policy 0, policy_version 119120 (0.0004) +[2023-03-11 19:50:32,933][66031] Updated weights for policy 0, policy_version 119200 (0.0003) +[2023-03-11 19:50:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9858.2). Total num frames: 61038592. Throughput: 0: 9875.5. Samples: 61025112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:50:34,012][65744] Avg episode reward: [(0, '4329.208')] +[2023-03-11 19:50:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000119216_61038592.pth... +[2023-03-11 19:50:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000118632_60739584.pth +[2023-03-11 19:50:36,929][66031] Updated weights for policy 0, policy_version 119280 (0.0004) +[2023-03-11 19:50:39,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 9858.2). Total num frames: 61087744. Throughput: 0: 9920.1. Samples: 61085748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:50:39,012][65744] Avg episode reward: [(0, '4041.717')] +[2023-03-11 19:50:41,229][66031] Updated weights for policy 0, policy_version 119360 (0.0005) +[2023-03-11 19:50:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9858.2). Total num frames: 61136896. Throughput: 0: 9909.3. Samples: 61114704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:50:44,012][65744] Avg episode reward: [(0, '4415.470')] +[2023-03-11 19:50:45,241][66031] Updated weights for policy 0, policy_version 119440 (0.0004) +[2023-03-11 19:50:49,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9872.1). Total num frames: 61190144. Throughput: 0: 9979.0. Samples: 61176352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:50:49,012][65744] Avg episode reward: [(0, '4521.914')] +[2023-03-11 19:50:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000119512_61190144.pth... +[2023-03-11 19:50:49,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000118920_60887040.pth +[2023-03-11 19:50:49,220][66031] Updated weights for policy 0, policy_version 119520 (0.0004) +[2023-03-11 19:50:53,414][66031] Updated weights for policy 0, policy_version 119600 (0.0005) +[2023-03-11 19:50:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9967.0, 300 sec: 9872.1). Total num frames: 61239296. Throughput: 0: 10014.0. Samples: 61235272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:50:54,012][65744] Avg episode reward: [(0, '4484.564')] +[2023-03-11 19:50:57,693][66031] Updated weights for policy 0, policy_version 119680 (0.0005) +[2023-03-11 19:50:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 61288448. Throughput: 0: 10018.2. Samples: 61264196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:50:59,012][65744] Avg episode reward: [(0, '4330.958')] +[2023-03-11 19:51:01,952][66031] Updated weights for policy 0, policy_version 119760 (0.0005) +[2023-03-11 19:51:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 61333504. Throughput: 0: 9997.9. Samples: 61321576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:51:04,012][65744] Avg episode reward: [(0, '4045.143')] +[2023-03-11 19:51:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000119792_61333504.pth... +[2023-03-11 19:51:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000119216_61038592.pth +[2023-03-11 19:51:06,241][66031] Updated weights for policy 0, policy_version 119840 (0.0005) +[2023-03-11 19:51:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9858.2). Total num frames: 61382656. Throughput: 0: 9922.9. Samples: 61378632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:51:09,012][65744] Avg episode reward: [(0, '4157.368')] +[2023-03-11 19:51:10,509][66031] Updated weights for policy 0, policy_version 119920 (0.0005) +[2023-03-11 19:51:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 61431808. Throughput: 0: 9880.6. Samples: 61407912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:51:14,012][65744] Avg episode reward: [(0, '4211.285')] +[2023-03-11 19:51:14,737][66031] Updated weights for policy 0, policy_version 120000 (0.0005) +[2023-03-11 19:51:18,950][66031] Updated weights for policy 0, policy_version 120080 (0.0005) +[2023-03-11 19:51:19,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9872.1). Total num frames: 61480960. Throughput: 0: 9824.2. Samples: 61467200. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:51:19,012][65744] Avg episode reward: [(0, '3720.834')] +[2023-03-11 19:51:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000120080_61480960.pth... +[2023-03-11 19:51:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000119512_61190144.pth +[2023-03-11 19:51:23,114][66031] Updated weights for policy 0, policy_version 120160 (0.0005) +[2023-03-11 19:51:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9858.2). Total num frames: 61530112. Throughput: 0: 9772.3. Samples: 61525500. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:51:24,012][65744] Avg episode reward: [(0, '3735.713')] +[2023-03-11 19:51:27,077][66031] Updated weights for policy 0, policy_version 120240 (0.0004) +[2023-03-11 19:51:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9858.2). Total num frames: 61579264. Throughput: 0: 9809.8. Samples: 61556144. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:51:29,012][65744] Avg episode reward: [(0, '3710.627')] +[2023-03-11 19:51:31,020][66031] Updated weights for policy 0, policy_version 120320 (0.0004) +[2023-03-11 19:51:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 61632512. Throughput: 0: 9827.8. Samples: 61618604. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:51:34,012][65744] Avg episode reward: [(0, '4216.296')] +[2023-03-11 19:51:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000120376_61632512.pth... +[2023-03-11 19:51:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000119792_61333504.pth +[2023-03-11 19:51:34,988][66031] Updated weights for policy 0, policy_version 120400 (0.0004) +[2023-03-11 19:51:38,993][66031] Updated weights for policy 0, policy_version 120480 (0.0005) +[2023-03-11 19:51:39,012][65744] Fps is (10 sec: 10649.7, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 61685760. Throughput: 0: 9890.8. Samples: 61680360. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:51:39,012][65744] Avg episode reward: [(0, '4038.775')] +[2023-03-11 19:51:42,996][66031] Updated weights for policy 0, policy_version 120560 (0.0004) +[2023-03-11 19:51:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9872.1). Total num frames: 61734912. Throughput: 0: 9927.1. Samples: 61710916. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:51:44,012][65744] Avg episode reward: [(0, '4101.634')] +[2023-03-11 19:51:47,154][66031] Updated weights for policy 0, policy_version 120640 (0.0004) +[2023-03-11 19:51:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 61784064. Throughput: 0: 9986.0. Samples: 61770944. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:51:49,012][65744] Avg episode reward: [(0, '4395.109')] +[2023-03-11 19:51:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000120672_61784064.pth... +[2023-03-11 19:51:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000120080_61480960.pth +[2023-03-11 19:51:51,457][66031] Updated weights for policy 0, policy_version 120720 (0.0005) +[2023-03-11 19:51:54,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9830.4, 300 sec: 9844.3). Total num frames: 61829120. Throughput: 0: 9986.4. Samples: 61828020. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:51:54,012][65744] Avg episode reward: [(0, '4406.629')] +[2023-03-11 19:51:55,793][66031] Updated weights for policy 0, policy_version 120800 (0.0005) +[2023-03-11 19:51:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9844.3). Total num frames: 61878272. Throughput: 0: 9953.1. Samples: 61855800. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:51:59,012][65744] Avg episode reward: [(0, '4479.894')] +[2023-03-11 19:52:00,119][66031] Updated weights for policy 0, policy_version 120880 (0.0005) +[2023-03-11 19:52:04,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9830.4). Total num frames: 61927424. Throughput: 0: 9890.2. Samples: 61912260. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:52:04,012][65744] Avg episode reward: [(0, '4375.589')] +[2023-03-11 19:52:04,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000120952_61927424.pth... +[2023-03-11 19:52:04,016][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000120376_61632512.pth +[2023-03-11 19:52:04,420][66031] Updated weights for policy 0, policy_version 120960 (0.0005) +[2023-03-11 19:52:08,728][66031] Updated weights for policy 0, policy_version 121040 (0.0005) +[2023-03-11 19:52:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9802.6). Total num frames: 61972480. Throughput: 0: 9878.3. Samples: 61970024. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:52:09,012][65744] Avg episode reward: [(0, '4262.334')] +[2023-03-11 19:52:13,018][66031] Updated weights for policy 0, policy_version 121120 (0.0005) +[2023-03-11 19:52:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9802.6). Total num frames: 62021632. Throughput: 0: 9831.8. Samples: 61998576. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:52:14,012][65744] Avg episode reward: [(0, '4441.869')] +[2023-03-11 19:52:17,313][66031] Updated weights for policy 0, policy_version 121200 (0.0004) +[2023-03-11 19:52:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 62066688. Throughput: 0: 9708.3. Samples: 62055476. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:52:19,022][65744] Avg episode reward: [(0, '4395.292')] +[2023-03-11 19:52:19,025][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000121232_62070784.pth... +[2023-03-11 19:52:19,026][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000120672_61784064.pth +[2023-03-11 19:52:21,618][66031] Updated weights for policy 0, policy_version 121280 (0.0004) +[2023-03-11 19:52:24,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 62115840. Throughput: 0: 9619.2. Samples: 62113224. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:52:24,012][65744] Avg episode reward: [(0, '4278.770')] +[2023-03-11 19:52:25,844][66031] Updated weights for policy 0, policy_version 121360 (0.0005) +[2023-03-11 19:52:29,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 62164992. Throughput: 0: 9589.3. Samples: 62142436. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:52:29,012][65744] Avg episode reward: [(0, '4177.720')] +[2023-03-11 19:52:30,087][66031] Updated weights for policy 0, policy_version 121440 (0.0005) +[2023-03-11 19:52:34,012][65744] Fps is (10 sec: 9830.2, 60 sec: 9693.9, 300 sec: 9802.6). Total num frames: 62214144. Throughput: 0: 9553.2. Samples: 62200840. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:52:34,012][65744] Avg episode reward: [(0, '3738.560')] +[2023-03-11 19:52:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000121512_62214144.pth... +[2023-03-11 19:52:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000120952_61927424.pth +[2023-03-11 19:52:34,269][66031] Updated weights for policy 0, policy_version 121520 (0.0005) +[2023-03-11 19:52:38,525][66031] Updated weights for policy 0, policy_version 121600 (0.0005) +[2023-03-11 19:52:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9802.6). Total num frames: 62263296. Throughput: 0: 9577.7. Samples: 62259016. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:52:39,023][65744] Avg episode reward: [(0, '3038.408')] +[2023-03-11 19:52:42,776][66031] Updated weights for policy 0, policy_version 121680 (0.0005) +[2023-03-11 19:52:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9788.7). Total num frames: 62308352. Throughput: 0: 9596.5. Samples: 62287644. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:52:44,012][65744] Avg episode reward: [(0, '4109.899')] +[2023-03-11 19:52:47,069][66031] Updated weights for policy 0, policy_version 121760 (0.0005) +[2023-03-11 19:52:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9788.7). Total num frames: 62357504. Throughput: 0: 9620.8. Samples: 62345196. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:52:49,012][65744] Avg episode reward: [(0, '4281.626')] +[2023-03-11 19:52:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000121792_62357504.pth... +[2023-03-11 19:52:49,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000121232_62070784.pth +[2023-03-11 19:52:51,196][66031] Updated weights for policy 0, policy_version 121840 (0.0004) +[2023-03-11 19:52:54,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9788.7). Total num frames: 62406656. Throughput: 0: 9640.1. Samples: 62403828. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:52:54,012][65744] Avg episode reward: [(0, '4401.730')] +[2023-03-11 19:52:55,436][66031] Updated weights for policy 0, policy_version 121920 (0.0005) +[2023-03-11 19:52:59,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9788.7). Total num frames: 62455808. Throughput: 0: 9638.4. Samples: 62432304. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:52:59,012][65744] Avg episode reward: [(0, '4392.826')] +[2023-03-11 19:52:59,701][66031] Updated weights for policy 0, policy_version 122000 (0.0005) +[2023-03-11 19:53:04,007][66031] Updated weights for policy 0, policy_version 122080 (0.0005) +[2023-03-11 19:53:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9788.7). Total num frames: 62504960. Throughput: 0: 9660.4. Samples: 62490192. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:53:04,012][65744] Avg episode reward: [(0, '4276.898')] +[2023-03-11 19:53:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000122080_62504960.pth... +[2023-03-11 19:53:04,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000121512_62214144.pth +[2023-03-11 19:53:08,326][66031] Updated weights for policy 0, policy_version 122160 (0.0005) +[2023-03-11 19:53:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 62550016. Throughput: 0: 9639.8. Samples: 62547016. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:53:09,012][65744] Avg episode reward: [(0, '4423.764')] +[2023-03-11 19:53:12,660][66031] Updated weights for policy 0, policy_version 122240 (0.0005) +[2023-03-11 19:53:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 62599168. Throughput: 0: 9623.9. Samples: 62575512. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:53:14,012][65744] Avg episode reward: [(0, '4358.886')] +[2023-03-11 19:53:16,945][66031] Updated weights for policy 0, policy_version 122320 (0.0005) +[2023-03-11 19:53:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 62644224. Throughput: 0: 9587.9. Samples: 62632296. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 19:53:19,012][65744] Avg episode reward: [(0, '4327.703')] +[2023-03-11 19:53:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000122352_62644224.pth... +[2023-03-11 19:53:19,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000121792_62357504.pth +[2023-03-11 19:53:21,293][66031] Updated weights for policy 0, policy_version 122400 (0.0005) +[2023-03-11 19:53:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 62693376. Throughput: 0: 9563.0. Samples: 62689352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:53:24,012][65744] Avg episode reward: [(0, '4452.920')] +[2023-03-11 19:53:25,576][66031] Updated weights for policy 0, policy_version 122480 (0.0005) +[2023-03-11 19:53:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 62742528. Throughput: 0: 9563.8. Samples: 62718016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:53:29,012][65744] Avg episode reward: [(0, '4235.253')] +[2023-03-11 19:53:29,731][66031] Updated weights for policy 0, policy_version 122560 (0.0005) +[2023-03-11 19:53:33,845][66031] Updated weights for policy 0, policy_version 122640 (0.0005) +[2023-03-11 19:53:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 62791680. Throughput: 0: 9611.4. Samples: 62777708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:53:34,012][65744] Avg episode reward: [(0, '4154.434')] +[2023-03-11 19:53:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000122640_62791680.pth... +[2023-03-11 19:53:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000122080_62504960.pth +[2023-03-11 19:53:37,945][66031] Updated weights for policy 0, policy_version 122720 (0.0005) +[2023-03-11 19:53:39,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 62840832. Throughput: 0: 9634.3. Samples: 62837372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:53:39,012][65744] Avg episode reward: [(0, '4224.727')] +[2023-03-11 19:53:42,103][66031] Updated weights for policy 0, policy_version 122800 (0.0005) +[2023-03-11 19:53:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 62889984. Throughput: 0: 9660.5. Samples: 62867028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:53:44,012][65744] Avg episode reward: [(0, '4332.452')] +[2023-03-11 19:53:46,356][66031] Updated weights for policy 0, policy_version 122880 (0.0005) +[2023-03-11 19:53:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 62939136. Throughput: 0: 9675.3. Samples: 62925580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:53:49,012][65744] Avg episode reward: [(0, '4448.826')] +[2023-03-11 19:53:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000122928_62939136.pth... +[2023-03-11 19:53:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000122352_62644224.pth +[2023-03-11 19:53:50,432][66031] Updated weights for policy 0, policy_version 122960 (0.0005) +[2023-03-11 19:53:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 62992384. Throughput: 0: 9779.7. Samples: 62987104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:53:54,012][65744] Avg episode reward: [(0, '4460.730')] +[2023-03-11 19:53:54,437][66031] Updated weights for policy 0, policy_version 123040 (0.0005) +[2023-03-11 19:53:58,442][66031] Updated weights for policy 0, policy_version 123120 (0.0004) +[2023-03-11 19:53:59,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 63041536. Throughput: 0: 9811.4. Samples: 63017024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:53:59,012][65744] Avg episode reward: [(0, '3539.581')] +[2023-03-11 19:54:02,672][66031] Updated weights for policy 0, policy_version 123200 (0.0005) +[2023-03-11 19:54:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 63090688. Throughput: 0: 9860.5. Samples: 63076020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:54:04,012][65744] Avg episode reward: [(0, '4396.727')] +[2023-03-11 19:54:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000123224_63090688.pth... +[2023-03-11 19:54:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000122640_62791680.pth +[2023-03-11 19:54:06,940][66031] Updated weights for policy 0, policy_version 123280 (0.0005) +[2023-03-11 19:54:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 63135744. Throughput: 0: 9891.2. Samples: 63134456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:54:09,012][65744] Avg episode reward: [(0, '4593.875')] +[2023-03-11 19:54:09,070][65987] Saving new best policy, reward=4593.875! +[2023-03-11 19:54:11,237][66031] Updated weights for policy 0, policy_version 123360 (0.0005) +[2023-03-11 19:54:14,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 63184896. Throughput: 0: 9879.2. Samples: 63162580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:54:14,012][65744] Avg episode reward: [(0, '4331.773')] +[2023-03-11 19:54:15,472][66031] Updated weights for policy 0, policy_version 123440 (0.0005) +[2023-03-11 19:54:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9774.9). Total num frames: 63234048. Throughput: 0: 9839.1. Samples: 63220468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:54:19,012][65744] Avg episode reward: [(0, '4499.056')] +[2023-03-11 19:54:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000123504_63234048.pth... +[2023-03-11 19:54:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000122928_62939136.pth +[2023-03-11 19:54:19,713][66031] Updated weights for policy 0, policy_version 123520 (0.0005) +[2023-03-11 19:54:23,945][66031] Updated weights for policy 0, policy_version 123600 (0.0005) +[2023-03-11 19:54:24,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9774.9). Total num frames: 63283200. Throughput: 0: 9807.4. Samples: 63278704. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:54:24,012][65744] Avg episode reward: [(0, '4143.493')] +[2023-03-11 19:54:28,294][66031] Updated weights for policy 0, policy_version 123680 (0.0005) +[2023-03-11 19:54:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9761.0). Total num frames: 63328256. Throughput: 0: 9786.2. Samples: 63307408. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:54:29,012][65744] Avg episode reward: [(0, '4376.387')] +[2023-03-11 19:54:32,568][66031] Updated weights for policy 0, policy_version 123760 (0.0005) +[2023-03-11 19:54:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9761.0). Total num frames: 63377408. Throughput: 0: 9744.5. Samples: 63364080. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:54:34,012][65744] Avg episode reward: [(0, '4313.606')] +[2023-03-11 19:54:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000123784_63377408.pth... +[2023-03-11 19:54:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000123224_63090688.pth +[2023-03-11 19:54:36,819][66031] Updated weights for policy 0, policy_version 123840 (0.0005) +[2023-03-11 19:54:39,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 63426560. Throughput: 0: 9657.7. Samples: 63421700. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:54:39,012][65744] Avg episode reward: [(0, '4357.587')] +[2023-03-11 19:54:41,093][66031] Updated weights for policy 0, policy_version 123920 (0.0005) +[2023-03-11 19:54:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9761.0). Total num frames: 63471616. Throughput: 0: 9641.3. Samples: 63450884. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:54:44,012][65744] Avg episode reward: [(0, '4471.721')] +[2023-03-11 19:54:45,359][66031] Updated weights for policy 0, policy_version 124000 (0.0005) +[2023-03-11 19:54:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 63524864. Throughput: 0: 9632.5. Samples: 63509484. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:54:49,012][65744] Avg episode reward: [(0, '4409.304')] +[2023-03-11 19:54:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000124072_63524864.pth... +[2023-03-11 19:54:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000123504_63234048.pth +[2023-03-11 19:54:49,376][66031] Updated weights for policy 0, policy_version 124080 (0.0004) +[2023-03-11 19:54:53,603][66031] Updated weights for policy 0, policy_version 124160 (0.0005) +[2023-03-11 19:54:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 63569920. Throughput: 0: 9647.3. Samples: 63568584. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:54:54,012][65744] Avg episode reward: [(0, '4400.245')] +[2023-03-11 19:54:57,784][66031] Updated weights for policy 0, policy_version 124240 (0.0005) +[2023-03-11 19:54:59,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 63619072. Throughput: 0: 9684.9. Samples: 63598400. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:54:59,012][65744] Avg episode reward: [(0, '4463.394')] +[2023-03-11 19:55:02,118][66031] Updated weights for policy 0, policy_version 124320 (0.0005) +[2023-03-11 19:55:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 63668224. Throughput: 0: 9666.8. Samples: 63655476. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:55:04,012][65744] Avg episode reward: [(0, '4533.008')] +[2023-03-11 19:55:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000124352_63668224.pth... +[2023-03-11 19:55:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000123784_63377408.pth +[2023-03-11 19:55:06,278][66031] Updated weights for policy 0, policy_version 124400 (0.0005) +[2023-03-11 19:55:09,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9761.0). Total num frames: 63717376. Throughput: 0: 9693.9. Samples: 63714928. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:55:09,023][65744] Avg episode reward: [(0, '4309.500')] +[2023-03-11 19:55:10,267][66031] Updated weights for policy 0, policy_version 124480 (0.0004) +[2023-03-11 19:55:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 63770624. Throughput: 0: 9747.7. Samples: 63746056. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:55:14,012][65744] Avg episode reward: [(0, '4231.834')] +[2023-03-11 19:55:14,271][66031] Updated weights for policy 0, policy_version 124560 (0.0004) +[2023-03-11 19:55:18,269][66031] Updated weights for policy 0, policy_version 124640 (0.0004) +[2023-03-11 19:55:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 63819776. Throughput: 0: 9853.7. Samples: 63807496. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:55:19,012][65744] Avg episode reward: [(0, '4061.698')] +[2023-03-11 19:55:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000124648_63819776.pth... +[2023-03-11 19:55:19,016][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000124072_63524864.pth +[2023-03-11 19:55:22,345][66031] Updated weights for policy 0, policy_version 124720 (0.0005) +[2023-03-11 19:55:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9788.7). Total num frames: 63873024. Throughput: 0: 9918.3. Samples: 63868024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:55:24,023][65744] Avg episode reward: [(0, '4024.444')] +[2023-03-11 19:55:26,371][66031] Updated weights for policy 0, policy_version 124800 (0.0005) +[2023-03-11 19:55:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9774.9). Total num frames: 63922176. Throughput: 0: 9939.2. Samples: 63898148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:55:29,012][65744] Avg episode reward: [(0, '4289.896')] +[2023-03-11 19:55:30,413][66031] Updated weights for policy 0, policy_version 124880 (0.0004) +[2023-03-11 19:55:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9774.9). Total num frames: 63971328. Throughput: 0: 9982.3. Samples: 63958688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:55:34,012][65744] Avg episode reward: [(0, '4383.442')] +[2023-03-11 19:55:34,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000124944_63971328.pth... +[2023-03-11 19:55:34,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000124352_63668224.pth +[2023-03-11 19:55:34,578][66031] Updated weights for policy 0, policy_version 124960 (0.0005) +[2023-03-11 19:55:38,663][66031] Updated weights for policy 0, policy_version 125040 (0.0005) +[2023-03-11 19:55:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9774.9). Total num frames: 64020480. Throughput: 0: 9992.7. Samples: 64018256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:55:39,012][65744] Avg episode reward: [(0, '4217.323')] +[2023-03-11 19:55:42,808][66031] Updated weights for policy 0, policy_version 125120 (0.0004) +[2023-03-11 19:55:44,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9761.0). Total num frames: 64069632. Throughput: 0: 9983.5. Samples: 64047656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:55:44,012][65744] Avg episode reward: [(0, '4255.336')] +[2023-03-11 19:55:47,112][66031] Updated weights for policy 0, policy_version 125200 (0.0005) +[2023-03-11 19:55:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9761.0). Total num frames: 64118784. Throughput: 0: 10012.0. Samples: 64106016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:55:49,012][65744] Avg episode reward: [(0, '4361.992')] +[2023-03-11 19:55:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000125232_64118784.pth... +[2023-03-11 19:55:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000124648_63819776.pth +[2023-03-11 19:55:51,244][66031] Updated weights for policy 0, policy_version 125280 (0.0005) +[2023-03-11 19:55:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9774.9). Total num frames: 64172032. Throughput: 0: 10028.3. Samples: 64166200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:55:54,012][65744] Avg episode reward: [(0, '4167.832')] +[2023-03-11 19:55:55,199][66031] Updated weights for policy 0, policy_version 125360 (0.0004) +[2023-03-11 19:55:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9788.7). Total num frames: 64221184. Throughput: 0: 10014.1. Samples: 64196692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:55:59,012][65744] Avg episode reward: [(0, '4528.248')] +[2023-03-11 19:55:59,189][66031] Updated weights for policy 0, policy_version 125440 (0.0004) +[2023-03-11 19:56:03,159][66031] Updated weights for policy 0, policy_version 125520 (0.0004) +[2023-03-11 19:56:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9802.6). Total num frames: 64274432. Throughput: 0: 10029.3. Samples: 64258816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:56:04,012][65744] Avg episode reward: [(0, '4429.360')] +[2023-03-11 19:56:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000125536_64274432.pth... +[2023-03-11 19:56:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000124944_63971328.pth +[2023-03-11 19:56:07,294][66031] Updated weights for policy 0, policy_version 125600 (0.0004) +[2023-03-11 19:56:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9802.6). Total num frames: 64323584. Throughput: 0: 10008.7. Samples: 64318416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:56:09,023][65744] Avg episode reward: [(0, '4415.510')] +[2023-03-11 19:56:11,625][66031] Updated weights for policy 0, policy_version 125680 (0.0005) +[2023-03-11 19:56:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9788.7). Total num frames: 64368640. Throughput: 0: 9974.0. Samples: 64346980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:56:14,023][65744] Avg episode reward: [(0, '4486.164')] +[2023-03-11 19:56:15,912][66031] Updated weights for policy 0, policy_version 125760 (0.0005) +[2023-03-11 19:56:19,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9966.9, 300 sec: 9788.7). Total num frames: 64417792. Throughput: 0: 9890.6. Samples: 64403764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:56:19,012][65744] Avg episode reward: [(0, '4168.068')] +[2023-03-11 19:56:19,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000125816_64417792.pth... +[2023-03-11 19:56:19,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000125232_64118784.pth +[2023-03-11 19:56:20,131][66031] Updated weights for policy 0, policy_version 125840 (0.0005) +[2023-03-11 19:56:24,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9788.7). Total num frames: 64466944. Throughput: 0: 9865.2. Samples: 64462188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:56:24,012][65744] Avg episode reward: [(0, '4191.926')] +[2023-03-11 19:56:24,412][66031] Updated weights for policy 0, policy_version 125920 (0.0005) +[2023-03-11 19:56:28,748][66031] Updated weights for policy 0, policy_version 126000 (0.0005) +[2023-03-11 19:56:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9761.0). Total num frames: 64512000. Throughput: 0: 9831.2. Samples: 64490060. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:56:29,012][65744] Avg episode reward: [(0, '3751.562')] +[2023-03-11 19:56:33,104][66031] Updated weights for policy 0, policy_version 126080 (0.0005) +[2023-03-11 19:56:34,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9830.4, 300 sec: 9747.1). Total num frames: 64561152. Throughput: 0: 9807.5. Samples: 64547352. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:56:34,012][65744] Avg episode reward: [(0, '4085.126')] +[2023-03-11 19:56:34,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000126096_64561152.pth... +[2023-03-11 19:56:34,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000125536_64274432.pth +[2023-03-11 19:56:37,490][66031] Updated weights for policy 0, policy_version 126160 (0.0005) +[2023-03-11 19:56:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9733.2). Total num frames: 64606208. Throughput: 0: 9701.4. Samples: 64602764. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:56:39,012][65744] Avg episode reward: [(0, '4346.613')] +[2023-03-11 19:56:41,788][66031] Updated weights for policy 0, policy_version 126240 (0.0005) +[2023-03-11 19:56:44,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 9733.2). Total num frames: 64655360. Throughput: 0: 9665.5. Samples: 64631640. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:56:44,012][65744] Avg episode reward: [(0, '4341.087')] +[2023-03-11 19:56:46,158][66031] Updated weights for policy 0, policy_version 126320 (0.0005) +[2023-03-11 19:56:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9733.2). Total num frames: 64700416. Throughput: 0: 9540.3. Samples: 64688128. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:56:49,012][65744] Avg episode reward: [(0, '4254.328')] +[2023-03-11 19:56:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000126368_64700416.pth... +[2023-03-11 19:56:49,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000125816_64417792.pth +[2023-03-11 19:56:50,516][66031] Updated weights for policy 0, policy_version 126400 (0.0005) +[2023-03-11 19:56:54,012][65744] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9719.3). Total num frames: 64745472. Throughput: 0: 9452.9. Samples: 64743796. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:56:54,012][65744] Avg episode reward: [(0, '4322.800')] +[2023-03-11 19:56:54,940][66031] Updated weights for policy 0, policy_version 126480 (0.0005) +[2023-03-11 19:56:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9719.3). Total num frames: 64794624. Throughput: 0: 9460.0. Samples: 64772680. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:56:59,012][65744] Avg episode reward: [(0, '4525.032')] +[2023-03-11 19:56:59,098][66031] Updated weights for policy 0, policy_version 126560 (0.0004) +[2023-03-11 19:57:03,360][66031] Updated weights for policy 0, policy_version 126640 (0.0005) +[2023-03-11 19:57:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9733.2). Total num frames: 64843776. Throughput: 0: 9506.4. Samples: 64831552. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:57:04,012][65744] Avg episode reward: [(0, '4511.263')] +[2023-03-11 19:57:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000126648_64843776.pth... +[2023-03-11 19:57:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000126096_64561152.pth +[2023-03-11 19:57:07,717][66031] Updated weights for policy 0, policy_version 126720 (0.0005) +[2023-03-11 19:57:09,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9733.2). Total num frames: 64892928. Throughput: 0: 9464.1. Samples: 64888072. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:57:09,012][65744] Avg episode reward: [(0, '4425.337')] +[2023-03-11 19:57:11,916][66031] Updated weights for policy 0, policy_version 126800 (0.0005) +[2023-03-11 19:57:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9733.2). Total num frames: 64937984. Throughput: 0: 9498.9. Samples: 64917512. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:57:14,023][65744] Avg episode reward: [(0, '4529.800')] +[2023-03-11 19:57:16,008][66031] Updated weights for policy 0, policy_version 126880 (0.0005) +[2023-03-11 19:57:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9747.1). Total num frames: 64991232. Throughput: 0: 9563.5. Samples: 64977708. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:57:19,012][65744] Avg episode reward: [(0, '4459.267')] +[2023-03-11 19:57:19,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000126936_64991232.pth... +[2023-03-11 19:57:19,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000126368_64700416.pth +[2023-03-11 19:57:19,965][66031] Updated weights for policy 0, policy_version 126960 (0.0004) +[2023-03-11 19:57:23,926][66031] Updated weights for policy 0, policy_version 127040 (0.0005) +[2023-03-11 19:57:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 65044480. Throughput: 0: 9709.0. Samples: 65039668. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 19:57:24,012][65744] Avg episode reward: [(0, '4385.551')] +[2023-03-11 19:57:27,853][66031] Updated weights for policy 0, policy_version 127120 (0.0004) +[2023-03-11 19:57:29,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9693.9, 300 sec: 9761.0). Total num frames: 65093632. Throughput: 0: 9757.1. Samples: 65070708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:57:29,012][65744] Avg episode reward: [(0, '4387.357')] +[2023-03-11 19:57:31,821][66031] Updated weights for policy 0, policy_version 127200 (0.0004) +[2023-03-11 19:57:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 65146880. Throughput: 0: 9885.8. Samples: 65132988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:57:34,012][65744] Avg episode reward: [(0, '4422.424')] +[2023-03-11 19:57:34,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000127240_65146880.pth... +[2023-03-11 19:57:34,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000126648_64843776.pth +[2023-03-11 19:57:35,721][66031] Updated weights for policy 0, policy_version 127280 (0.0004) +[2023-03-11 19:57:39,012][65744] Fps is (10 sec: 10649.5, 60 sec: 9898.7, 300 sec: 9802.6). Total num frames: 65200128. Throughput: 0: 10039.5. Samples: 65195576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:57:39,012][65744] Avg episode reward: [(0, '4277.412')] +[2023-03-11 19:57:39,722][66031] Updated weights for policy 0, policy_version 127360 (0.0004) +[2023-03-11 19:57:43,630][66031] Updated weights for policy 0, policy_version 127440 (0.0004) +[2023-03-11 19:57:44,012][65744] Fps is (10 sec: 10649.7, 60 sec: 9966.9, 300 sec: 9816.5). Total num frames: 65253376. Throughput: 0: 10077.7. Samples: 65226176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:57:44,012][65744] Avg episode reward: [(0, '4465.917')] +[2023-03-11 19:57:47,554][66031] Updated weights for policy 0, policy_version 127520 (0.0004) +[2023-03-11 19:57:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9816.5). Total num frames: 65302528. Throughput: 0: 10172.5. Samples: 65289316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:57:49,012][65744] Avg episode reward: [(0, '4504.292')] +[2023-03-11 19:57:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000127544_65302528.pth... +[2023-03-11 19:57:49,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000126936_64991232.pth +[2023-03-11 19:57:51,502][66031] Updated weights for policy 0, policy_version 127600 (0.0004) +[2023-03-11 19:57:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9816.5). Total num frames: 65351680. Throughput: 0: 10277.1. Samples: 65350540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:57:54,012][65744] Avg episode reward: [(0, '4570.769')] +[2023-03-11 19:57:55,727][66031] Updated weights for policy 0, policy_version 127680 (0.0005) +[2023-03-11 19:57:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9816.5). Total num frames: 65400832. Throughput: 0: 10268.0. Samples: 65379572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:57:59,012][65744] Avg episode reward: [(0, '4461.884')] +[2023-03-11 19:57:59,969][66031] Updated weights for policy 0, policy_version 127760 (0.0005) +[2023-03-11 19:58:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9830.4). Total num frames: 65449984. Throughput: 0: 10212.9. Samples: 65437288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:58:04,012][65744] Avg episode reward: [(0, '4544.807')] +[2023-03-11 19:58:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000127832_65449984.pth... +[2023-03-11 19:58:04,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000127240_65146880.pth +[2023-03-11 19:58:04,297][66031] Updated weights for policy 0, policy_version 127840 (0.0005) +[2023-03-11 19:58:08,584][66031] Updated weights for policy 0, policy_version 127920 (0.0005) +[2023-03-11 19:58:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 9816.5). Total num frames: 65495040. Throughput: 0: 10099.1. Samples: 65494128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:58:09,012][65744] Avg episode reward: [(0, '4471.521')] +[2023-03-11 19:58:12,830][66031] Updated weights for policy 0, policy_version 128000 (0.0005) +[2023-03-11 19:58:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10103.5, 300 sec: 9830.4). Total num frames: 65544192. Throughput: 0: 10040.4. Samples: 65522528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:58:14,012][65744] Avg episode reward: [(0, '4480.359')] +[2023-03-11 19:58:17,028][66031] Updated weights for policy 0, policy_version 128080 (0.0005) +[2023-03-11 19:58:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9830.4). Total num frames: 65593344. Throughput: 0: 9959.2. Samples: 65581152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:58:19,012][65744] Avg episode reward: [(0, '4540.569')] +[2023-03-11 19:58:19,063][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000128120_65597440.pth... +[2023-03-11 19:58:19,064][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000127544_65302528.pth +[2023-03-11 19:58:21,140][66031] Updated weights for policy 0, policy_version 128160 (0.0005) +[2023-03-11 19:58:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9844.3). Total num frames: 65646592. Throughput: 0: 9897.2. Samples: 65640952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:58:24,012][65744] Avg episode reward: [(0, '4466.112')] +[2023-03-11 19:58:25,260][66031] Updated weights for policy 0, policy_version 128240 (0.0004) +[2023-03-11 19:58:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9844.3). Total num frames: 65695744. Throughput: 0: 9888.9. Samples: 65671176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:58:29,012][65744] Avg episode reward: [(0, '4462.962')] +[2023-03-11 19:58:29,216][66031] Updated weights for policy 0, policy_version 128320 (0.0004) +[2023-03-11 19:58:33,182][66031] Updated weights for policy 0, policy_version 128400 (0.0004) +[2023-03-11 19:58:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9858.2). Total num frames: 65748992. Throughput: 0: 9860.7. Samples: 65733048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:58:34,012][65744] Avg episode reward: [(0, '4578.575')] +[2023-03-11 19:58:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000128416_65748992.pth... +[2023-03-11 19:58:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000127832_65449984.pth +[2023-03-11 19:58:37,084][66031] Updated weights for policy 0, policy_version 128480 (0.0004) +[2023-03-11 19:58:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9858.2). Total num frames: 65798144. Throughput: 0: 9888.5. Samples: 65795524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:58:39,012][65744] Avg episode reward: [(0, '4562.949')] +[2023-03-11 19:58:41,028][66031] Updated weights for policy 0, policy_version 128560 (0.0004) +[2023-03-11 19:58:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9872.1). Total num frames: 65851392. Throughput: 0: 9940.2. Samples: 65826880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:58:44,012][65744] Avg episode reward: [(0, '4524.780')] +[2023-03-11 19:58:44,918][66031] Updated weights for policy 0, policy_version 128640 (0.0004) +[2023-03-11 19:58:48,835][66031] Updated weights for policy 0, policy_version 128720 (0.0004) +[2023-03-11 19:58:49,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10035.2, 300 sec: 9872.1). Total num frames: 65904640. Throughput: 0: 10066.2. Samples: 65890268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:58:49,012][65744] Avg episode reward: [(0, '4414.062')] +[2023-03-11 19:58:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000128720_65904640.pth... +[2023-03-11 19:58:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000128120_65597440.pth +[2023-03-11 19:58:52,774][66031] Updated weights for policy 0, policy_version 128800 (0.0005) +[2023-03-11 19:58:54,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10103.4, 300 sec: 9885.9). Total num frames: 65957888. Throughput: 0: 10184.9. Samples: 65952452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:58:54,013][65744] Avg episode reward: [(0, '4548.890')] +[2023-03-11 19:58:56,704][66031] Updated weights for policy 0, policy_version 128880 (0.0004) +[2023-03-11 19:58:59,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 9885.9). Total num frames: 66007040. Throughput: 0: 10246.1. Samples: 65983600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:58:59,012][65744] Avg episode reward: [(0, '3898.267')] +[2023-03-11 19:59:00,595][66031] Updated weights for policy 0, policy_version 128960 (0.0004) +[2023-03-11 19:59:04,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 9913.7). Total num frames: 66060288. Throughput: 0: 10356.1. Samples: 66047176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:59:04,012][65744] Avg episode reward: [(0, '3575.367')] +[2023-03-11 19:59:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000129024_66060288.pth... +[2023-03-11 19:59:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000128416_65748992.pth +[2023-03-11 19:59:04,684][66031] Updated weights for policy 0, policy_version 129040 (0.0005) +[2023-03-11 19:59:08,733][66031] Updated weights for policy 0, policy_version 129120 (0.0005) +[2023-03-11 19:59:09,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 9913.7). Total num frames: 66109440. Throughput: 0: 10337.3. Samples: 66106132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:59:09,012][65744] Avg episode reward: [(0, '3603.357')] +[2023-03-11 19:59:12,663][66031] Updated weights for policy 0, policy_version 129200 (0.0004) +[2023-03-11 19:59:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 9927.6). Total num frames: 66162688. Throughput: 0: 10376.7. Samples: 66138128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:59:14,012][65744] Avg episode reward: [(0, '3992.186')] +[2023-03-11 19:59:16,554][66031] Updated weights for policy 0, policy_version 129280 (0.0004) +[2023-03-11 19:59:19,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 9941.5). Total num frames: 66215936. Throughput: 0: 10377.3. Samples: 66200024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:59:19,012][65744] Avg episode reward: [(0, '4493.223')] +[2023-03-11 19:59:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000129328_66215936.pth... +[2023-03-11 19:59:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000128720_65904640.pth +[2023-03-11 19:59:20,552][66031] Updated weights for policy 0, policy_version 129360 (0.0004) +[2023-03-11 19:59:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9955.4). Total num frames: 66265088. Throughput: 0: 10343.7. Samples: 66260992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:59:24,012][65744] Avg episode reward: [(0, '4462.766')] +[2023-03-11 19:59:24,796][66031] Updated weights for policy 0, policy_version 129440 (0.0005) +[2023-03-11 19:59:28,925][66031] Updated weights for policy 0, policy_version 129520 (0.0004) +[2023-03-11 19:59:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 9955.4). Total num frames: 66314240. Throughput: 0: 10283.2. Samples: 66289624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:59:29,012][65744] Avg episode reward: [(0, '4330.415')] +[2023-03-11 19:59:32,867][66031] Updated weights for policy 0, policy_version 129600 (0.0004) +[2023-03-11 19:59:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 9955.4). Total num frames: 66363392. Throughput: 0: 10239.1. Samples: 66351028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:59:34,012][65744] Avg episode reward: [(0, '4257.419')] +[2023-03-11 19:59:34,050][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000129624_66367488.pth... +[2023-03-11 19:59:34,052][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000129024_66060288.pth +[2023-03-11 19:59:36,787][66031] Updated weights for policy 0, policy_version 129680 (0.0004) +[2023-03-11 19:59:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9983.1). Total num frames: 66416640. Throughput: 0: 10249.0. Samples: 66413656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:59:39,012][65744] Avg episode reward: [(0, '4514.465')] +[2023-03-11 19:59:40,726][66031] Updated weights for policy 0, policy_version 129760 (0.0004) +[2023-03-11 19:59:44,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 9983.1). Total num frames: 66469888. Throughput: 0: 10245.3. Samples: 66444640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:59:44,012][65744] Avg episode reward: [(0, '4372.448')] +[2023-03-11 19:59:44,633][66031] Updated weights for policy 0, policy_version 129840 (0.0004) +[2023-03-11 19:59:48,638][66031] Updated weights for policy 0, policy_version 129920 (0.0004) +[2023-03-11 19:59:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9997.0). Total num frames: 66519040. Throughput: 0: 10228.0. Samples: 66507436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:59:49,012][65744] Avg episode reward: [(0, '4527.427')] +[2023-03-11 19:59:49,049][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000129928_66523136.pth... +[2023-03-11 19:59:49,051][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000129328_66215936.pth +[2023-03-11 19:59:52,826][66031] Updated weights for policy 0, policy_version 130000 (0.0005) +[2023-03-11 19:59:54,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10171.8, 300 sec: 9997.0). Total num frames: 66568192. Throughput: 0: 10229.2. Samples: 66566444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:59:54,012][65744] Avg episode reward: [(0, '4108.270')] +[2023-03-11 19:59:56,975][66031] Updated weights for policy 0, policy_version 130080 (0.0005) +[2023-03-11 19:59:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9997.0). Total num frames: 66617344. Throughput: 0: 10177.7. Samples: 66596124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 19:59:59,012][65744] Avg episode reward: [(0, '4152.760')] +[2023-03-11 20:00:01,019][66031] Updated weights for policy 0, policy_version 130160 (0.0005) +[2023-03-11 20:00:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10010.9). Total num frames: 66670592. Throughput: 0: 10144.3. Samples: 66656516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:00:04,012][65744] Avg episode reward: [(0, '4339.820')] +[2023-03-11 20:00:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000130216_66670592.pth... +[2023-03-11 20:00:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000129624_66367488.pth +[2023-03-11 20:00:05,033][66031] Updated weights for policy 0, policy_version 130240 (0.0005) +[2023-03-11 20:00:09,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 9997.0). Total num frames: 66719744. Throughput: 0: 10166.8. Samples: 66718500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:00:09,012][65744] Avg episode reward: [(0, '4403.594')] +[2023-03-11 20:00:09,028][66031] Updated weights for policy 0, policy_version 130320 (0.0004) +[2023-03-11 20:00:12,954][66031] Updated weights for policy 0, policy_version 130400 (0.0005) +[2023-03-11 20:00:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10010.9). Total num frames: 66772992. Throughput: 0: 10207.2. Samples: 66748948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:00:14,012][65744] Avg episode reward: [(0, '4465.749')] +[2023-03-11 20:00:16,949][66031] Updated weights for policy 0, policy_version 130480 (0.0005) +[2023-03-11 20:00:19,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10010.9). Total num frames: 66826240. Throughput: 0: 10220.7. Samples: 66810960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:00:19,012][65744] Avg episode reward: [(0, '4358.762')] +[2023-03-11 20:00:19,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000130520_66826240.pth... +[2023-03-11 20:00:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000129928_66523136.pth +[2023-03-11 20:00:20,880][66031] Updated weights for policy 0, policy_version 130560 (0.0005) +[2023-03-11 20:00:24,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10010.9). Total num frames: 66875392. Throughput: 0: 10212.6. Samples: 66873224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:00:24,012][65744] Avg episode reward: [(0, '3993.795')] +[2023-03-11 20:00:24,830][66031] Updated weights for policy 0, policy_version 130640 (0.0005) +[2023-03-11 20:00:28,710][66031] Updated weights for policy 0, policy_version 130720 (0.0004) +[2023-03-11 20:00:29,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10024.8). Total num frames: 66928640. Throughput: 0: 10216.5. Samples: 66904384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:00:29,012][65744] Avg episode reward: [(0, '4371.917')] +[2023-03-11 20:00:32,563][66031] Updated weights for policy 0, policy_version 130800 (0.0004) +[2023-03-11 20:00:34,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10308.2, 300 sec: 10038.7). Total num frames: 66981888. Throughput: 0: 10246.6. Samples: 66968532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:00:34,012][65744] Avg episode reward: [(0, '4092.228')] +[2023-03-11 20:00:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000130824_66981888.pth... +[2023-03-11 20:00:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000130216_66670592.pth +[2023-03-11 20:00:36,505][66031] Updated weights for policy 0, policy_version 130880 (0.0005) +[2023-03-11 20:00:39,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10052.6). Total num frames: 67035136. Throughput: 0: 10324.1. Samples: 67031028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:00:39,012][65744] Avg episode reward: [(0, '4209.052')] +[2023-03-11 20:00:40,442][66031] Updated weights for policy 0, policy_version 130960 (0.0005) +[2023-03-11 20:00:44,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10066.4). Total num frames: 67088384. Throughput: 0: 10367.3. Samples: 67062652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:00:44,012][65744] Avg episode reward: [(0, '4106.413')] +[2023-03-11 20:00:44,272][66031] Updated weights for policy 0, policy_version 131040 (0.0005) +[2023-03-11 20:00:48,228][66031] Updated weights for policy 0, policy_version 131120 (0.0005) +[2023-03-11 20:00:49,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10066.4). Total num frames: 67141632. Throughput: 0: 10416.5. Samples: 67125260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:00:49,012][65744] Avg episode reward: [(0, '4446.944')] +[2023-03-11 20:00:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000131136_67141632.pth... +[2023-03-11 20:00:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000130520_66826240.pth +[2023-03-11 20:00:52,074][66031] Updated weights for policy 0, policy_version 131200 (0.0004) +[2023-03-11 20:00:54,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10066.4). Total num frames: 67190784. Throughput: 0: 10454.9. Samples: 67188972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:00:54,012][65744] Avg episode reward: [(0, '4488.266')] +[2023-03-11 20:00:55,958][66031] Updated weights for policy 0, policy_version 131280 (0.0005) +[2023-03-11 20:00:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10066.4). Total num frames: 67244032. Throughput: 0: 10474.1. Samples: 67220280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:00:59,012][65744] Avg episode reward: [(0, '4358.005')] +[2023-03-11 20:00:59,856][66031] Updated weights for policy 0, policy_version 131360 (0.0005) +[2023-03-11 20:01:03,819][66031] Updated weights for policy 0, policy_version 131440 (0.0005) +[2023-03-11 20:01:04,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10080.3). Total num frames: 67297280. Throughput: 0: 10492.5. Samples: 67283124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:01:04,012][65744] Avg episode reward: [(0, '4169.565')] +[2023-03-11 20:01:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000131440_67297280.pth... +[2023-03-11 20:01:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000130824_66981888.pth +[2023-03-11 20:01:07,727][66031] Updated weights for policy 0, policy_version 131520 (0.0005) +[2023-03-11 20:01:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10108.1). Total num frames: 67350528. Throughput: 0: 10503.5. Samples: 67345884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:01:09,023][65744] Avg episode reward: [(0, '4431.532')] +[2023-03-11 20:01:11,603][66031] Updated weights for policy 0, policy_version 131600 (0.0004) +[2023-03-11 20:01:14,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10122.0). Total num frames: 67403776. Throughput: 0: 10517.3. Samples: 67377660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:01:14,012][65744] Avg episode reward: [(0, '4203.308')] +[2023-03-11 20:01:15,572][66031] Updated weights for policy 0, policy_version 131680 (0.0005) +[2023-03-11 20:01:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10122.0). Total num frames: 67452928. Throughput: 0: 10471.6. Samples: 67439752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:01:19,012][65744] Avg episode reward: [(0, '4307.347')] +[2023-03-11 20:01:19,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000131744_67452928.pth... +[2023-03-11 20:01:19,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000131136_67141632.pth +[2023-03-11 20:01:19,501][66031] Updated weights for policy 0, policy_version 131760 (0.0005) +[2023-03-11 20:01:23,421][66031] Updated weights for policy 0, policy_version 131840 (0.0005) +[2023-03-11 20:01:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10149.7). Total num frames: 67506176. Throughput: 0: 10469.5. Samples: 67502156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:01:24,012][65744] Avg episode reward: [(0, '4515.655')] +[2023-03-11 20:01:27,585][66031] Updated weights for policy 0, policy_version 131920 (0.0005) +[2023-03-11 20:01:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10149.8). Total num frames: 67555328. Throughput: 0: 10426.3. Samples: 67531836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:01:29,012][65744] Avg episode reward: [(0, '4550.857')] +[2023-03-11 20:01:31,673][66031] Updated weights for policy 0, policy_version 132000 (0.0005) +[2023-03-11 20:01:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10177.5). Total num frames: 67608576. Throughput: 0: 10377.7. Samples: 67592256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:01:34,012][65744] Avg episode reward: [(0, '4483.053')] +[2023-03-11 20:01:34,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000132048_67608576.pth... +[2023-03-11 20:01:34,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000131440_67297280.pth +[2023-03-11 20:01:35,566][66031] Updated weights for policy 0, policy_version 132080 (0.0005) +[2023-03-11 20:01:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10177.5). Total num frames: 67657728. Throughput: 0: 10349.6. Samples: 67654704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:01:39,012][65744] Avg episode reward: [(0, '4467.727')] +[2023-03-11 20:01:39,493][66031] Updated weights for policy 0, policy_version 132160 (0.0005) +[2023-03-11 20:01:43,408][66031] Updated weights for policy 0, policy_version 132240 (0.0004) +[2023-03-11 20:01:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10205.3). Total num frames: 67710976. Throughput: 0: 10358.7. Samples: 67686420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:01:44,012][65744] Avg episode reward: [(0, '4582.567')] +[2023-03-11 20:01:47,390][66031] Updated weights for policy 0, policy_version 132320 (0.0004) +[2023-03-11 20:01:49,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10233.1). Total num frames: 67764224. Throughput: 0: 10334.1. Samples: 67748160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:01:49,012][65744] Avg episode reward: [(0, '4260.385')] +[2023-03-11 20:01:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000132352_67764224.pth... +[2023-03-11 20:01:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000131744_67452928.pth +[2023-03-11 20:01:51,255][66031] Updated weights for policy 0, policy_version 132400 (0.0004) +[2023-03-11 20:01:54,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10246.9). Total num frames: 67817472. Throughput: 0: 10358.5. Samples: 67812016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:01:54,012][65744] Avg episode reward: [(0, '4383.610')] +[2023-03-11 20:01:55,154][66031] Updated weights for policy 0, policy_version 132480 (0.0005) +[2023-03-11 20:01:59,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10246.9). Total num frames: 67866624. Throughput: 0: 10341.0. Samples: 67843004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:01:59,012][65744] Avg episode reward: [(0, '4296.309')] +[2023-03-11 20:01:59,108][66031] Updated weights for policy 0, policy_version 132560 (0.0005) +[2023-03-11 20:02:03,036][66031] Updated weights for policy 0, policy_version 132640 (0.0004) +[2023-03-11 20:02:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10260.8). Total num frames: 67919872. Throughput: 0: 10351.0. Samples: 67905548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:02:04,012][65744] Avg episode reward: [(0, '4494.149')] +[2023-03-11 20:02:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000132656_67919872.pth... +[2023-03-11 20:02:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000132048_67608576.pth +[2023-03-11 20:02:06,906][66031] Updated weights for policy 0, policy_version 132720 (0.0005) +[2023-03-11 20:02:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10288.6). Total num frames: 67973120. Throughput: 0: 10372.8. Samples: 67968932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:02:09,012][65744] Avg episode reward: [(0, '4304.784')] +[2023-03-11 20:02:10,790][66031] Updated weights for policy 0, policy_version 132800 (0.0005) +[2023-03-11 20:02:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10274.7). Total num frames: 68022272. Throughput: 0: 10417.8. Samples: 68000636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:02:14,012][65744] Avg episode reward: [(0, '4508.830')] +[2023-03-11 20:02:14,855][66031] Updated weights for policy 0, policy_version 132880 (0.0005) +[2023-03-11 20:02:18,934][66031] Updated weights for policy 0, policy_version 132960 (0.0005) +[2023-03-11 20:02:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10274.7). Total num frames: 68075520. Throughput: 0: 10385.3. Samples: 68059596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:02:19,012][65744] Avg episode reward: [(0, '4305.942')] +[2023-03-11 20:02:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000132960_68075520.pth... +[2023-03-11 20:02:19,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000132352_67764224.pth +[2023-03-11 20:02:22,787][66031] Updated weights for policy 0, policy_version 133040 (0.0005) +[2023-03-11 20:02:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10288.6). Total num frames: 68128768. Throughput: 0: 10424.3. Samples: 68123796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:02:24,012][65744] Avg episode reward: [(0, '4388.997')] +[2023-03-11 20:02:26,638][66031] Updated weights for policy 0, policy_version 133120 (0.0005) +[2023-03-11 20:02:29,012][65744] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10288.6). Total num frames: 68182016. Throughput: 0: 10419.2. Samples: 68155284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:02:29,012][65744] Avg episode reward: [(0, '4245.321')] +[2023-03-11 20:02:30,584][66031] Updated weights for policy 0, policy_version 133200 (0.0005) +[2023-03-11 20:02:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10274.7). Total num frames: 68231168. Throughput: 0: 10420.4. Samples: 68217076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:02:34,012][65744] Avg episode reward: [(0, '3998.822')] +[2023-03-11 20:02:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000133264_68231168.pth... +[2023-03-11 20:02:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000132656_67919872.pth +[2023-03-11 20:02:34,612][66031] Updated weights for policy 0, policy_version 133280 (0.0004) +[2023-03-11 20:02:38,573][66031] Updated weights for policy 0, policy_version 133360 (0.0003) +[2023-03-11 20:02:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10274.7). Total num frames: 68284416. Throughput: 0: 10379.3. Samples: 68279084. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:02:39,012][65744] Avg episode reward: [(0, '4195.034')] +[2023-03-11 20:02:42,804][66031] Updated weights for policy 0, policy_version 133440 (0.0005) +[2023-03-11 20:02:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10260.8). Total num frames: 68329472. Throughput: 0: 10355.3. Samples: 68308992. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:02:44,012][65744] Avg episode reward: [(0, '3981.305')] +[2023-03-11 20:02:46,739][66031] Updated weights for policy 0, policy_version 133520 (0.0004) +[2023-03-11 20:02:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10274.7). Total num frames: 68382720. Throughput: 0: 10313.2. Samples: 68369640. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:02:49,012][65744] Avg episode reward: [(0, '4283.295')] +[2023-03-11 20:02:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000133560_68382720.pth... +[2023-03-11 20:02:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000132960_68075520.pth +[2023-03-11 20:02:50,705][66031] Updated weights for policy 0, policy_version 133600 (0.0005) +[2023-03-11 20:02:54,012][65744] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10288.6). Total num frames: 68435968. Throughput: 0: 10287.8. Samples: 68431884. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:02:54,012][65744] Avg episode reward: [(0, '3982.745')] +[2023-03-11 20:02:54,634][66031] Updated weights for policy 0, policy_version 133680 (0.0005) +[2023-03-11 20:02:58,518][66031] Updated weights for policy 0, policy_version 133760 (0.0005) +[2023-03-11 20:02:59,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10302.5). Total num frames: 68489216. Throughput: 0: 10288.4. Samples: 68463612. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:02:59,012][65744] Avg episode reward: [(0, '4403.355')] +[2023-03-11 20:03:02,380][66031] Updated weights for policy 0, policy_version 133840 (0.0004) +[2023-03-11 20:03:04,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10330.2). Total num frames: 68542464. Throughput: 0: 10376.1. Samples: 68526520. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:03:04,012][65744] Avg episode reward: [(0, '4564.538')] +[2023-03-11 20:03:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000133872_68542464.pth... +[2023-03-11 20:03:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000133264_68231168.pth +[2023-03-11 20:03:06,240][66031] Updated weights for policy 0, policy_version 133920 (0.0005) +[2023-03-11 20:03:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10344.1). Total num frames: 68595712. Throughput: 0: 10357.3. Samples: 68589876. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:03:09,023][65744] Avg episode reward: [(0, '4500.124')] +[2023-03-11 20:03:10,143][66031] Updated weights for policy 0, policy_version 134000 (0.0005) +[2023-03-11 20:03:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10344.1). Total num frames: 68644864. Throughput: 0: 10359.1. Samples: 68621444. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:03:14,012][65744] Avg episode reward: [(0, '4510.539')] +[2023-03-11 20:03:14,070][66031] Updated weights for policy 0, policy_version 134080 (0.0005) +[2023-03-11 20:03:18,025][66031] Updated weights for policy 0, policy_version 134160 (0.0005) +[2023-03-11 20:03:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10344.1). Total num frames: 68698112. Throughput: 0: 10375.7. Samples: 68683984. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:03:19,012][65744] Avg episode reward: [(0, '4473.094')] +[2023-03-11 20:03:19,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000134176_68698112.pth... +[2023-03-11 20:03:19,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000133560_68382720.pth +[2023-03-11 20:03:21,884][66031] Updated weights for policy 0, policy_version 134240 (0.0004) +[2023-03-11 20:03:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 68751360. Throughput: 0: 10405.4. Samples: 68747328. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:03:24,012][65744] Avg episode reward: [(0, '4555.890')] +[2023-03-11 20:03:25,811][66031] Updated weights for policy 0, policy_version 134320 (0.0005) +[2023-03-11 20:03:29,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 68804608. Throughput: 0: 10440.3. Samples: 68778804. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:03:29,012][65744] Avg episode reward: [(0, '4441.341')] +[2023-03-11 20:03:29,812][66031] Updated weights for policy 0, policy_version 134400 (0.0005) +[2023-03-11 20:03:33,845][66031] Updated weights for policy 0, policy_version 134480 (0.0005) +[2023-03-11 20:03:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 68853760. Throughput: 0: 10431.6. Samples: 68839064. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:03:34,012][65744] Avg episode reward: [(0, '4551.835')] +[2023-03-11 20:03:34,025][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000134480_68853760.pth... +[2023-03-11 20:03:34,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000133872_68542464.pth +[2023-03-11 20:03:37,797][66031] Updated weights for policy 0, policy_version 134560 (0.0005) +[2023-03-11 20:03:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10344.1). Total num frames: 68902912. Throughput: 0: 10432.7. Samples: 68901356. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:03:39,012][65744] Avg episode reward: [(0, '4440.502')] +[2023-03-11 20:03:41,731][66031] Updated weights for policy 0, policy_version 134640 (0.0005) +[2023-03-11 20:03:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10344.1). Total num frames: 68956160. Throughput: 0: 10418.4. Samples: 68932440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:03:44,023][65744] Avg episode reward: [(0, '4374.172')] +[2023-03-11 20:03:45,664][66031] Updated weights for policy 0, policy_version 134720 (0.0005) +[2023-03-11 20:03:49,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10344.1). Total num frames: 69009408. Throughput: 0: 10406.7. Samples: 68994820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:03:49,012][65744] Avg episode reward: [(0, '4461.732')] +[2023-03-11 20:03:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000134784_69009408.pth... +[2023-03-11 20:03:49,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000134176_68698112.pth +[2023-03-11 20:03:49,592][66031] Updated weights for policy 0, policy_version 134800 (0.0005) +[2023-03-11 20:03:53,507][66031] Updated weights for policy 0, policy_version 134880 (0.0004) +[2023-03-11 20:03:54,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10358.0). Total num frames: 69062656. Throughput: 0: 10400.9. Samples: 69057916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:03:54,012][65744] Avg episode reward: [(0, '4221.410')] +[2023-03-11 20:03:57,573][66031] Updated weights for policy 0, policy_version 134960 (0.0005) +[2023-03-11 20:03:59,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10344.1). Total num frames: 69111808. Throughput: 0: 10367.1. Samples: 69087964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:03:59,012][65744] Avg episode reward: [(0, '4379.328')] +[2023-03-11 20:04:01,419][66031] Updated weights for policy 0, policy_version 135040 (0.0005) +[2023-03-11 20:04:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 69165056. Throughput: 0: 10387.6. Samples: 69151428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:04:04,012][65744] Avg episode reward: [(0, '4481.264')] +[2023-03-11 20:04:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000135088_69165056.pth... +[2023-03-11 20:04:04,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000134480_68853760.pth +[2023-03-11 20:04:05,281][66031] Updated weights for policy 0, policy_version 135120 (0.0004) +[2023-03-11 20:04:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 69218304. Throughput: 0: 10376.5. Samples: 69214272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:04:09,012][65744] Avg episode reward: [(0, '4533.885')] +[2023-03-11 20:04:09,235][66031] Updated weights for policy 0, policy_version 135200 (0.0005) +[2023-03-11 20:04:13,104][66031] Updated weights for policy 0, policy_version 135280 (0.0004) +[2023-03-11 20:04:14,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10358.0). Total num frames: 69271552. Throughput: 0: 10381.3. Samples: 69245964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:04:14,012][65744] Avg episode reward: [(0, '4544.786')] +[2023-03-11 20:04:17,052][66031] Updated weights for policy 0, policy_version 135360 (0.0005) +[2023-03-11 20:04:19,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10371.9). Total num frames: 69324800. Throughput: 0: 10430.4. Samples: 69308432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:04:19,012][65744] Avg episode reward: [(0, '4426.940')] +[2023-03-11 20:04:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000135400_69324800.pth... +[2023-03-11 20:04:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000134784_69009408.pth +[2023-03-11 20:04:20,960][66031] Updated weights for policy 0, policy_version 135440 (0.0005) +[2023-03-11 20:04:24,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10371.9). Total num frames: 69373952. Throughput: 0: 10455.0. Samples: 69371832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:04:24,012][65744] Avg episode reward: [(0, '4446.558')] +[2023-03-11 20:04:24,800][66031] Updated weights for policy 0, policy_version 135520 (0.0004) +[2023-03-11 20:04:28,753][66031] Updated weights for policy 0, policy_version 135600 (0.0005) +[2023-03-11 20:04:29,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10385.8). Total num frames: 69427200. Throughput: 0: 10450.2. Samples: 69402700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:04:29,012][65744] Avg episode reward: [(0, '4372.509')] +[2023-03-11 20:04:32,686][66031] Updated weights for policy 0, policy_version 135680 (0.0005) +[2023-03-11 20:04:34,012][65744] Fps is (10 sec: 10649.4, 60 sec: 10444.8, 300 sec: 10385.8). Total num frames: 69480448. Throughput: 0: 10467.8. Samples: 69465872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:04:34,012][65744] Avg episode reward: [(0, '4424.994')] +[2023-03-11 20:04:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000135704_69480448.pth... +[2023-03-11 20:04:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000135088_69165056.pth +[2023-03-11 20:04:36,760][66031] Updated weights for policy 0, policy_version 135760 (0.0005) +[2023-03-11 20:04:39,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10371.9). Total num frames: 69529600. Throughput: 0: 10391.0. Samples: 69525512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:04:39,012][65744] Avg episode reward: [(0, '4339.092')] +[2023-03-11 20:04:41,018][66031] Updated weights for policy 0, policy_version 135840 (0.0005) +[2023-03-11 20:04:44,012][65744] Fps is (10 sec: 9421.0, 60 sec: 10308.3, 300 sec: 10358.0). Total num frames: 69574656. Throughput: 0: 10361.7. Samples: 69554240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:04:44,012][65744] Avg episode reward: [(0, '4321.240')] +[2023-03-11 20:04:45,278][66031] Updated weights for policy 0, policy_version 135920 (0.0005) +[2023-03-11 20:04:49,012][65744] Fps is (10 sec: 9420.9, 60 sec: 10240.0, 300 sec: 10358.0). Total num frames: 69623808. Throughput: 0: 10227.1. Samples: 69611648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:04:49,012][65744] Avg episode reward: [(0, '4424.181')] +[2023-03-11 20:04:49,046][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000135992_69627904.pth... +[2023-03-11 20:04:49,048][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000135400_69324800.pth +[2023-03-11 20:04:49,456][66031] Updated weights for policy 0, policy_version 136000 (0.0005) +[2023-03-11 20:04:53,624][66031] Updated weights for policy 0, policy_version 136080 (0.0005) +[2023-03-11 20:04:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10358.0). Total num frames: 69672960. Throughput: 0: 10150.3. Samples: 69671036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:04:54,012][65744] Avg episode reward: [(0, '4477.559')] +[2023-03-11 20:04:57,923][66031] Updated weights for policy 0, policy_version 136160 (0.0005) +[2023-03-11 20:04:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10344.1). Total num frames: 69722112. Throughput: 0: 10086.6. Samples: 69699860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:04:59,012][65744] Avg episode reward: [(0, '4547.281')] +[2023-03-11 20:05:02,051][66031] Updated weights for policy 0, policy_version 136240 (0.0005) +[2023-03-11 20:05:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10358.0). Total num frames: 69775360. Throughput: 0: 10013.5. Samples: 69759040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:05:04,012][65744] Avg episode reward: [(0, '4198.081')] +[2023-03-11 20:05:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000136280_69775360.pth... +[2023-03-11 20:05:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000135704_69480448.pth +[2023-03-11 20:05:05,948][66031] Updated weights for policy 0, policy_version 136320 (0.0004) +[2023-03-11 20:05:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10344.1). Total num frames: 69824512. Throughput: 0: 10005.4. Samples: 69822076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:05:09,012][65744] Avg episode reward: [(0, '4254.147')] +[2023-03-11 20:05:09,866][66031] Updated weights for policy 0, policy_version 136400 (0.0005) +[2023-03-11 20:05:13,762][66031] Updated weights for policy 0, policy_version 136480 (0.0004) +[2023-03-11 20:05:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10344.1). Total num frames: 69877760. Throughput: 0: 10013.4. Samples: 69853304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:05:14,012][65744] Avg episode reward: [(0, '4216.416')] +[2023-03-11 20:05:17,644][66031] Updated weights for policy 0, policy_version 136560 (0.0004) +[2023-03-11 20:05:19,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 10358.0). Total num frames: 69931008. Throughput: 0: 10016.2. Samples: 69916600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:05:19,012][65744] Avg episode reward: [(0, '4260.192')] +[2023-03-11 20:05:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000136584_69931008.pth... +[2023-03-11 20:05:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000135992_69627904.pth +[2023-03-11 20:05:21,537][66031] Updated weights for policy 0, policy_version 136640 (0.0004) +[2023-03-11 20:05:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10358.0). Total num frames: 69984256. Throughput: 0: 10091.6. Samples: 69979636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:05:24,012][65744] Avg episode reward: [(0, '4131.509')] +[2023-03-11 20:05:25,449][66031] Updated weights for policy 0, policy_version 136720 (0.0005) +[2023-03-11 20:05:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10344.1). Total num frames: 70033408. Throughput: 0: 10155.6. Samples: 70011244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:05:29,012][65744] Avg episode reward: [(0, '4321.349')] +[2023-03-11 20:05:29,465][66031] Updated weights for policy 0, policy_version 136800 (0.0005) +[2023-03-11 20:05:33,374][66031] Updated weights for policy 0, policy_version 136880 (0.0004) +[2023-03-11 20:05:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10344.1). Total num frames: 70086656. Throughput: 0: 10244.3. Samples: 70072644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:05:34,012][65744] Avg episode reward: [(0, '4266.212')] +[2023-03-11 20:05:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000136888_70086656.pth... +[2023-03-11 20:05:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000136280_69775360.pth +[2023-03-11 20:05:37,343][66031] Updated weights for policy 0, policy_version 136960 (0.0005) +[2023-03-11 20:05:39,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10344.1). Total num frames: 70139904. Throughput: 0: 10322.9. Samples: 70135568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:05:39,012][65744] Avg episode reward: [(0, '4485.281')] +[2023-03-11 20:05:41,172][66031] Updated weights for policy 0, policy_version 137040 (0.0004) +[2023-03-11 20:05:44,012][65744] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10344.1). Total num frames: 70193152. Throughput: 0: 10393.7. Samples: 70167576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:05:44,012][65744] Avg episode reward: [(0, '4536.323')] +[2023-03-11 20:05:45,070][66031] Updated weights for policy 0, policy_version 137120 (0.0004) +[2023-03-11 20:05:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10344.1). Total num frames: 70242304. Throughput: 0: 10465.9. Samples: 70230008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:05:49,012][65744] Avg episode reward: [(0, '4478.012')] +[2023-03-11 20:05:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000137192_70242304.pth... +[2023-03-11 20:05:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000136584_69931008.pth +[2023-03-11 20:05:49,183][66031] Updated weights for policy 0, policy_version 137200 (0.0005) +[2023-03-11 20:05:53,406][66031] Updated weights for policy 0, policy_version 137280 (0.0005) +[2023-03-11 20:05:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10330.3). Total num frames: 70291456. Throughput: 0: 10342.0. Samples: 70287468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:05:54,012][65744] Avg episode reward: [(0, '4320.638')] +[2023-03-11 20:05:57,624][66031] Updated weights for policy 0, policy_version 137360 (0.0005) +[2023-03-11 20:05:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10316.4). Total num frames: 70340608. Throughput: 0: 10295.5. Samples: 70316600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:05:59,012][65744] Avg episode reward: [(0, '4516.219')] +[2023-03-11 20:06:01,875][66031] Updated weights for policy 0, policy_version 137440 (0.0005) +[2023-03-11 20:06:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 10302.5). Total num frames: 70389760. Throughput: 0: 10186.5. Samples: 70374992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:06:04,012][65744] Avg episode reward: [(0, '4427.798')] +[2023-03-11 20:06:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000137480_70389760.pth... +[2023-03-11 20:06:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000136888_70086656.pth +[2023-03-11 20:06:06,116][66031] Updated weights for policy 0, policy_version 137520 (0.0005) +[2023-03-11 20:06:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10171.7, 300 sec: 10274.7). Total num frames: 70434816. Throughput: 0: 10074.0. Samples: 70432964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:06:09,012][65744] Avg episode reward: [(0, '4436.227')] +[2023-03-11 20:06:10,353][66031] Updated weights for policy 0, policy_version 137600 (0.0005) +[2023-03-11 20:06:14,012][65744] Fps is (10 sec: 9420.9, 60 sec: 10103.5, 300 sec: 10274.7). Total num frames: 70483968. Throughput: 0: 10015.6. Samples: 70461944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:06:14,012][65744] Avg episode reward: [(0, '4426.229')] +[2023-03-11 20:06:14,564][66031] Updated weights for policy 0, policy_version 137680 (0.0005) +[2023-03-11 20:06:18,819][66031] Updated weights for policy 0, policy_version 137760 (0.0005) +[2023-03-11 20:06:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10260.8). Total num frames: 70533120. Throughput: 0: 9951.8. Samples: 70520476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:06:19,012][65744] Avg episode reward: [(0, '4341.622')] +[2023-03-11 20:06:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000137760_70533120.pth... +[2023-03-11 20:06:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000137192_70242304.pth +[2023-03-11 20:06:23,064][66031] Updated weights for policy 0, policy_version 137840 (0.0005) +[2023-03-11 20:06:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10260.8). Total num frames: 70582272. Throughput: 0: 9835.7. Samples: 70578176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:06:24,012][65744] Avg episode reward: [(0, '4536.420')] +[2023-03-11 20:06:27,248][66031] Updated weights for policy 0, policy_version 137920 (0.0005) +[2023-03-11 20:06:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10246.9). Total num frames: 70631424. Throughput: 0: 9763.2. Samples: 70606920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:06:29,012][65744] Avg episode reward: [(0, '4452.407')] +[2023-03-11 20:06:31,491][66031] Updated weights for policy 0, policy_version 138000 (0.0005) +[2023-03-11 20:06:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 10233.1). Total num frames: 70676480. Throughput: 0: 9672.5. Samples: 70665268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:06:34,012][65744] Avg episode reward: [(0, '4219.491')] +[2023-03-11 20:06:34,048][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000138048_70680576.pth... +[2023-03-11 20:06:34,049][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000137480_70389760.pth +[2023-03-11 20:06:35,716][66031] Updated weights for policy 0, policy_version 138080 (0.0005) +[2023-03-11 20:06:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 10219.2). Total num frames: 70725632. Throughput: 0: 9693.0. Samples: 70723652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:06:39,012][65744] Avg episode reward: [(0, '4091.425')] +[2023-03-11 20:06:39,917][66031] Updated weights for policy 0, policy_version 138160 (0.0005) +[2023-03-11 20:06:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 10205.3). Total num frames: 70774784. Throughput: 0: 9700.5. Samples: 70753124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:06:44,012][65744] Avg episode reward: [(0, '4446.828')] +[2023-03-11 20:06:44,121][66031] Updated weights for policy 0, policy_version 138240 (0.0005) +[2023-03-11 20:06:48,404][66031] Updated weights for policy 0, policy_version 138320 (0.0005) +[2023-03-11 20:06:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 10191.4). Total num frames: 70823936. Throughput: 0: 9686.4. Samples: 70810880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:06:49,012][65744] Avg episode reward: [(0, '4439.248')] +[2023-03-11 20:06:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000138328_70823936.pth... +[2023-03-11 20:06:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000137760_70533120.pth +[2023-03-11 20:06:52,405][66031] Updated weights for policy 0, policy_version 138400 (0.0004) +[2023-03-11 20:06:54,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 10205.3). Total num frames: 70877184. Throughput: 0: 9752.6. Samples: 70871832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:06:54,012][65744] Avg episode reward: [(0, '4276.572')] +[2023-03-11 20:06:56,315][66031] Updated weights for policy 0, policy_version 138480 (0.0004) +[2023-03-11 20:06:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 10191.4). Total num frames: 70926336. Throughput: 0: 9801.4. Samples: 70903008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:06:59,012][65744] Avg episode reward: [(0, '4385.320')] +[2023-03-11 20:07:00,345][66031] Updated weights for policy 0, policy_version 138560 (0.0004) +[2023-03-11 20:07:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10177.5). Total num frames: 70975488. Throughput: 0: 9839.7. Samples: 70963264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:07:04,012][65744] Avg episode reward: [(0, '4315.860')] +[2023-03-11 20:07:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000138624_70975488.pth... +[2023-03-11 20:07:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000138048_70680576.pth +[2023-03-11 20:07:04,571][66031] Updated weights for policy 0, policy_version 138640 (0.0005) +[2023-03-11 20:07:08,821][66031] Updated weights for policy 0, policy_version 138720 (0.0005) +[2023-03-11 20:07:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10177.5). Total num frames: 71024640. Throughput: 0: 9832.0. Samples: 71020616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:07:09,012][65744] Avg episode reward: [(0, '4205.488')] +[2023-03-11 20:07:12,670][66031] Updated weights for policy 0, policy_version 138800 (0.0004) +[2023-03-11 20:07:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10177.5). Total num frames: 71077888. Throughput: 0: 9906.5. Samples: 71052712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:07:14,012][65744] Avg episode reward: [(0, '4092.322')] +[2023-03-11 20:07:16,589][66031] Updated weights for policy 0, policy_version 138880 (0.0004) +[2023-03-11 20:07:19,012][65744] Fps is (10 sec: 10649.6, 60 sec: 9966.9, 300 sec: 10177.5). Total num frames: 71131136. Throughput: 0: 10006.2. Samples: 71115548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:07:19,012][65744] Avg episode reward: [(0, '4244.350')] +[2023-03-11 20:07:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000138928_71131136.pth... +[2023-03-11 20:07:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000138328_70823936.pth +[2023-03-11 20:07:20,564][66031] Updated weights for policy 0, policy_version 138960 (0.0005) +[2023-03-11 20:07:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10163.6). Total num frames: 71180288. Throughput: 0: 10054.2. Samples: 71176092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:07:24,012][65744] Avg episode reward: [(0, '3988.642')] +[2023-03-11 20:07:24,764][66031] Updated weights for policy 0, policy_version 139040 (0.0005) +[2023-03-11 20:07:28,913][66031] Updated weights for policy 0, policy_version 139120 (0.0005) +[2023-03-11 20:07:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10163.6). Total num frames: 71229440. Throughput: 0: 10040.1. Samples: 71204928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:07:29,012][65744] Avg episode reward: [(0, '4295.243')] +[2023-03-11 20:07:33,097][66031] Updated weights for policy 0, policy_version 139200 (0.0004) +[2023-03-11 20:07:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10149.7). Total num frames: 71278592. Throughput: 0: 10059.1. Samples: 71263540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:07:34,020][65744] Avg episode reward: [(0, '4146.567')] +[2023-03-11 20:07:34,023][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000139216_71278592.pth... +[2023-03-11 20:07:34,025][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000138624_70975488.pth +[2023-03-11 20:07:37,272][66031] Updated weights for policy 0, policy_version 139280 (0.0005) +[2023-03-11 20:07:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10163.6). Total num frames: 71327744. Throughput: 0: 10041.8. Samples: 71323712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:07:39,023][65744] Avg episode reward: [(0, '4263.183')] +[2023-03-11 20:07:41,160][66031] Updated weights for policy 0, policy_version 139360 (0.0004) +[2023-03-11 20:07:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 71380992. Throughput: 0: 10057.1. Samples: 71355576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:07:44,012][65744] Avg episode reward: [(0, '4432.580')] +[2023-03-11 20:07:45,182][66031] Updated weights for policy 0, policy_version 139440 (0.0005) +[2023-03-11 20:07:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10149.7). Total num frames: 71430144. Throughput: 0: 10096.8. Samples: 71417620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:07:49,012][65744] Avg episode reward: [(0, '4550.011')] +[2023-03-11 20:07:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000139512_71430144.pth... +[2023-03-11 20:07:49,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000138928_71131136.pth +[2023-03-11 20:07:49,094][66031] Updated weights for policy 0, policy_version 139520 (0.0004) +[2023-03-11 20:07:53,001][66031] Updated weights for policy 0, policy_version 139600 (0.0004) +[2023-03-11 20:07:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10149.8). Total num frames: 71483392. Throughput: 0: 10202.3. Samples: 71479720. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:07:54,012][65744] Avg episode reward: [(0, '4567.797')] +[2023-03-11 20:07:56,922][66031] Updated weights for policy 0, policy_version 139680 (0.0004) +[2023-03-11 20:07:59,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10149.8). Total num frames: 71536640. Throughput: 0: 10202.1. Samples: 71511808. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:07:59,012][65744] Avg episode reward: [(0, '4453.085')] +[2023-03-11 20:08:00,946][66031] Updated weights for policy 0, policy_version 139760 (0.0004) +[2023-03-11 20:08:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 71585792. Throughput: 0: 10136.2. Samples: 71571676. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:08:04,012][65744] Avg episode reward: [(0, '4477.119')] +[2023-03-11 20:08:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000139816_71585792.pth... +[2023-03-11 20:08:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000139216_71278592.pth +[2023-03-11 20:08:05,110][66031] Updated weights for policy 0, policy_version 139840 (0.0005) +[2023-03-11 20:08:09,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 71634944. Throughput: 0: 10142.3. Samples: 71632496. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:08:09,012][65744] Avg episode reward: [(0, '4472.181')] +[2023-03-11 20:08:09,030][66031] Updated weights for policy 0, policy_version 139920 (0.0005) +[2023-03-11 20:08:12,953][66031] Updated weights for policy 0, policy_version 140000 (0.0005) +[2023-03-11 20:08:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 71688192. Throughput: 0: 10198.0. Samples: 71663840. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:08:14,012][65744] Avg episode reward: [(0, '4426.704')] +[2023-03-11 20:08:16,889][66031] Updated weights for policy 0, policy_version 140080 (0.0004) +[2023-03-11 20:08:19,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 71741440. Throughput: 0: 10296.3. Samples: 71726872. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:08:19,012][65744] Avg episode reward: [(0, '4308.330')] +[2023-03-11 20:08:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000140120_71741440.pth... +[2023-03-11 20:08:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000139512_71430144.pth +[2023-03-11 20:08:20,877][66031] Updated weights for policy 0, policy_version 140160 (0.0003) +[2023-03-11 20:08:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 71790592. Throughput: 0: 10328.0. Samples: 71788472. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:08:24,012][65744] Avg episode reward: [(0, '4201.641')] +[2023-03-11 20:08:24,836][66031] Updated weights for policy 0, policy_version 140240 (0.0004) +[2023-03-11 20:08:28,776][66031] Updated weights for policy 0, policy_version 140320 (0.0005) +[2023-03-11 20:08:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 71843840. Throughput: 0: 10309.3. Samples: 71819496. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:08:29,023][65744] Avg episode reward: [(0, '3879.691')] +[2023-03-11 20:08:32,783][66031] Updated weights for policy 0, policy_version 140400 (0.0004) +[2023-03-11 20:08:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 71892992. Throughput: 0: 10303.9. Samples: 71881296. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:08:34,012][65744] Avg episode reward: [(0, '4408.591')] +[2023-03-11 20:08:34,043][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000140424_71897088.pth... +[2023-03-11 20:08:34,044][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000139816_71585792.pth +[2023-03-11 20:08:36,957][66031] Updated weights for policy 0, policy_version 140480 (0.0005) +[2023-03-11 20:08:39,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 71942144. Throughput: 0: 10229.2. Samples: 71940032. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:08:39,012][65744] Avg episode reward: [(0, '4330.227')] +[2023-03-11 20:08:41,164][66031] Updated weights for policy 0, policy_version 140560 (0.0005) +[2023-03-11 20:08:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 71991296. Throughput: 0: 10175.5. Samples: 71969704. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:08:44,012][65744] Avg episode reward: [(0, '4209.051')] +[2023-03-11 20:08:45,429][66031] Updated weights for policy 0, policy_version 140640 (0.0004) +[2023-03-11 20:08:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10094.2). Total num frames: 72040448. Throughput: 0: 10141.9. Samples: 72028060. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:08:49,012][65744] Avg episode reward: [(0, '4252.328')] +[2023-03-11 20:08:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000140704_72040448.pth... +[2023-03-11 20:08:49,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000140120_71741440.pth +[2023-03-11 20:08:49,624][66031] Updated weights for policy 0, policy_version 140720 (0.0005) +[2023-03-11 20:08:53,832][66031] Updated weights for policy 0, policy_version 140800 (0.0004) +[2023-03-11 20:08:54,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 72089600. Throughput: 0: 10068.4. Samples: 72085576. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:08:54,012][65744] Avg episode reward: [(0, '4348.114')] +[2023-03-11 20:08:57,971][66031] Updated weights for policy 0, policy_version 140880 (0.0005) +[2023-03-11 20:08:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10080.3). Total num frames: 72138752. Throughput: 0: 10040.0. Samples: 72115640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:08:59,012][65744] Avg episode reward: [(0, '4374.754')] +[2023-03-11 20:09:02,120][66031] Updated weights for policy 0, policy_version 140960 (0.0005) +[2023-03-11 20:09:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 72187904. Throughput: 0: 9965.2. Samples: 72175308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:09:04,012][65744] Avg episode reward: [(0, '4112.587')] +[2023-03-11 20:09:04,027][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000140992_72187904.pth... +[2023-03-11 20:09:04,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000140424_71897088.pth +[2023-03-11 20:09:06,282][66031] Updated weights for policy 0, policy_version 141040 (0.0005) +[2023-03-11 20:09:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10052.6). Total num frames: 72237056. Throughput: 0: 9895.6. Samples: 72233776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:09:09,012][65744] Avg episode reward: [(0, '4129.231')] +[2023-03-11 20:09:10,474][66031] Updated weights for policy 0, policy_version 141120 (0.0005) +[2023-03-11 20:09:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10038.7). Total num frames: 72286208. Throughput: 0: 9855.3. Samples: 72262984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:09:14,012][65744] Avg episode reward: [(0, '3744.950')] +[2023-03-11 20:09:14,714][66031] Updated weights for policy 0, policy_version 141200 (0.0005) +[2023-03-11 20:09:18,877][66031] Updated weights for policy 0, policy_version 141280 (0.0005) +[2023-03-11 20:09:19,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 72335360. Throughput: 0: 9777.7. Samples: 72321292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:09:19,012][65744] Avg episode reward: [(0, '4024.381')] +[2023-03-11 20:09:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000141280_72335360.pth... +[2023-03-11 20:09:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000140704_72040448.pth +[2023-03-11 20:09:22,906][66031] Updated weights for policy 0, policy_version 141360 (0.0005) +[2023-03-11 20:09:24,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10024.8). Total num frames: 72384512. Throughput: 0: 9808.5. Samples: 72381416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:09:24,012][65744] Avg episode reward: [(0, '4034.837')] +[2023-03-11 20:09:27,118][66031] Updated weights for policy 0, policy_version 141440 (0.0004) +[2023-03-11 20:09:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10010.9). Total num frames: 72433664. Throughput: 0: 9797.1. Samples: 72410576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:09:29,012][65744] Avg episode reward: [(0, '4116.933')] +[2023-03-11 20:09:31,348][66031] Updated weights for policy 0, policy_version 141520 (0.0004) +[2023-03-11 20:09:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10010.9). Total num frames: 72482816. Throughput: 0: 9823.0. Samples: 72470096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:09:34,012][65744] Avg episode reward: [(0, '3438.735')] +[2023-03-11 20:09:34,039][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000141576_72486912.pth... +[2023-03-11 20:09:34,040][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000140992_72187904.pth +[2023-03-11 20:09:35,286][66031] Updated weights for policy 0, policy_version 141600 (0.0004) +[2023-03-11 20:09:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10024.8). Total num frames: 72531968. Throughput: 0: 9861.8. Samples: 72529356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:09:39,012][65744] Avg episode reward: [(0, '3495.638')] +[2023-03-11 20:09:39,575][66031] Updated weights for policy 0, policy_version 141680 (0.0005) +[2023-03-11 20:09:43,806][66031] Updated weights for policy 0, policy_version 141760 (0.0005) +[2023-03-11 20:09:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10024.8). Total num frames: 72581120. Throughput: 0: 9848.6. Samples: 72558828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:09:44,012][65744] Avg episode reward: [(0, '4065.330')] +[2023-03-11 20:09:47,938][66031] Updated weights for policy 0, policy_version 141840 (0.0005) +[2023-03-11 20:09:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10024.8). Total num frames: 72630272. Throughput: 0: 9831.8. Samples: 72617740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:09:49,012][65744] Avg episode reward: [(0, '4261.198')] +[2023-03-11 20:09:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000141856_72630272.pth... +[2023-03-11 20:09:49,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000141280_72335360.pth +[2023-03-11 20:09:52,185][66031] Updated weights for policy 0, policy_version 141920 (0.0003) +[2023-03-11 20:09:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10024.8). Total num frames: 72679424. Throughput: 0: 9812.4. Samples: 72675336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:09:54,012][65744] Avg episode reward: [(0, '4357.938')] +[2023-03-11 20:09:56,417][66031] Updated weights for policy 0, policy_version 142000 (0.0005) +[2023-03-11 20:09:59,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 10010.9). Total num frames: 72728576. Throughput: 0: 9802.0. Samples: 72704072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:09:59,012][65744] Avg episode reward: [(0, '4117.716')] +[2023-03-11 20:10:00,572][66031] Updated weights for policy 0, policy_version 142080 (0.0005) +[2023-03-11 20:10:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10010.9). Total num frames: 72777728. Throughput: 0: 9816.1. Samples: 72763016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:10:04,012][65744] Avg episode reward: [(0, '3655.352')] +[2023-03-11 20:10:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000142144_72777728.pth... +[2023-03-11 20:10:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000141576_72486912.pth +[2023-03-11 20:10:04,868][66031] Updated weights for policy 0, policy_version 142160 (0.0005) +[2023-03-11 20:10:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9983.1). Total num frames: 72822784. Throughput: 0: 9743.7. Samples: 72819884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:10:09,012][65744] Avg episode reward: [(0, '2840.189')] +[2023-03-11 20:10:09,175][66031] Updated weights for policy 0, policy_version 142240 (0.0004) +[2023-03-11 20:10:13,453][66031] Updated weights for policy 0, policy_version 142320 (0.0005) +[2023-03-11 20:10:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9969.2). Total num frames: 72871936. Throughput: 0: 9720.9. Samples: 72848016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:10:14,012][65744] Avg episode reward: [(0, '3551.062')] +[2023-03-11 20:10:17,711][66031] Updated weights for policy 0, policy_version 142400 (0.0005) +[2023-03-11 20:10:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9955.4). Total num frames: 72921088. Throughput: 0: 9697.4. Samples: 72906480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:10:19,012][65744] Avg episode reward: [(0, '3737.306')] +[2023-03-11 20:10:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000142424_72921088.pth... +[2023-03-11 20:10:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000141856_72630272.pth +[2023-03-11 20:10:21,950][66031] Updated weights for policy 0, policy_version 142480 (0.0005) +[2023-03-11 20:10:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9941.5). Total num frames: 72966144. Throughput: 0: 9668.5. Samples: 72964436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:10:24,012][65744] Avg episode reward: [(0, '3446.398')] +[2023-03-11 20:10:26,213][66031] Updated weights for policy 0, policy_version 142560 (0.0005) +[2023-03-11 20:10:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9927.6). Total num frames: 73015296. Throughput: 0: 9649.1. Samples: 72993040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:10:29,012][65744] Avg episode reward: [(0, '3128.430')] +[2023-03-11 20:10:30,440][66031] Updated weights for policy 0, policy_version 142640 (0.0005) +[2023-03-11 20:10:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9913.7). Total num frames: 73064448. Throughput: 0: 9637.4. Samples: 73051424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:10:34,012][65744] Avg episode reward: [(0, '3397.792')] +[2023-03-11 20:10:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000142704_73064448.pth... +[2023-03-11 20:10:34,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000142144_72777728.pth +[2023-03-11 20:10:34,680][66031] Updated weights for policy 0, policy_version 142720 (0.0005) +[2023-03-11 20:10:38,973][66031] Updated weights for policy 0, policy_version 142800 (0.0005) +[2023-03-11 20:10:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9899.8). Total num frames: 73113600. Throughput: 0: 9634.5. Samples: 73108888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:10:39,012][65744] Avg episode reward: [(0, '3347.991')] +[2023-03-11 20:10:43,188][66031] Updated weights for policy 0, policy_version 142880 (0.0005) +[2023-03-11 20:10:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9885.9). Total num frames: 73158656. Throughput: 0: 9644.3. Samples: 73138068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:10:44,012][65744] Avg episode reward: [(0, '2608.081')] +[2023-03-11 20:10:47,425][66031] Updated weights for policy 0, policy_version 142960 (0.0005) +[2023-03-11 20:10:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9885.9). Total num frames: 73207808. Throughput: 0: 9612.6. Samples: 73195584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:10:49,012][65744] Avg episode reward: [(0, '3332.209')] +[2023-03-11 20:10:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000142984_73207808.pth... +[2023-03-11 20:10:49,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000142424_72921088.pth +[2023-03-11 20:10:51,643][66031] Updated weights for policy 0, policy_version 143040 (0.0005) +[2023-03-11 20:10:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9885.9). Total num frames: 73256960. Throughput: 0: 9635.0. Samples: 73253460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:10:54,012][65744] Avg episode reward: [(0, '3451.590')] +[2023-03-11 20:10:55,848][66031] Updated weights for policy 0, policy_version 143120 (0.0005) +[2023-03-11 20:10:59,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9885.9). Total num frames: 73306112. Throughput: 0: 9674.8. Samples: 73283380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:10:59,012][65744] Avg episode reward: [(0, '3347.420')] +[2023-03-11 20:10:59,931][66031] Updated weights for policy 0, policy_version 143200 (0.0004) +[2023-03-11 20:11:03,893][66031] Updated weights for policy 0, policy_version 143280 (0.0004) +[2023-03-11 20:11:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9693.9, 300 sec: 9913.7). Total num frames: 73359360. Throughput: 0: 9727.2. Samples: 73344204. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:11:04,012][65744] Avg episode reward: [(0, '3385.390')] +[2023-03-11 20:11:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000143280_73359360.pth... +[2023-03-11 20:11:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000142704_73064448.pth +[2023-03-11 20:11:08,222][66031] Updated weights for policy 0, policy_version 143360 (0.0005) +[2023-03-11 20:11:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9899.8). Total num frames: 73404416. Throughput: 0: 9741.1. Samples: 73402784. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:11:09,012][65744] Avg episode reward: [(0, '4216.020')] +[2023-03-11 20:11:12,443][66031] Updated weights for policy 0, policy_version 143440 (0.0005) +[2023-03-11 20:11:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9899.8). Total num frames: 73453568. Throughput: 0: 9748.3. Samples: 73431712. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:11:14,012][65744] Avg episode reward: [(0, '4342.559')] +[2023-03-11 20:11:16,568][66031] Updated weights for policy 0, policy_version 143520 (0.0004) +[2023-03-11 20:11:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9899.8). Total num frames: 73502720. Throughput: 0: 9757.2. Samples: 73490496. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:11:19,012][65744] Avg episode reward: [(0, '4292.432')] +[2023-03-11 20:11:19,023][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000143568_73506816.pth... +[2023-03-11 20:11:19,026][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000142984_73207808.pth +[2023-03-11 20:11:20,637][66031] Updated weights for policy 0, policy_version 143600 (0.0003) +[2023-03-11 20:11:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9913.7). Total num frames: 73555968. Throughput: 0: 9831.6. Samples: 73551312. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:11:24,012][65744] Avg episode reward: [(0, '4354.349')] +[2023-03-11 20:11:24,768][66031] Updated weights for policy 0, policy_version 143680 (0.0004) +[2023-03-11 20:11:28,992][66031] Updated weights for policy 0, policy_version 143760 (0.0005) +[2023-03-11 20:11:29,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 9927.6). Total num frames: 73605120. Throughput: 0: 9833.0. Samples: 73580552. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:11:29,012][65744] Avg episode reward: [(0, '4331.167')] +[2023-03-11 20:11:33,103][66031] Updated weights for policy 0, policy_version 143840 (0.0005) +[2023-03-11 20:11:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9927.6). Total num frames: 73654272. Throughput: 0: 9858.9. Samples: 73639236. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:11:34,012][65744] Avg episode reward: [(0, '4142.457')] +[2023-03-11 20:11:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000143856_73654272.pth... +[2023-03-11 20:11:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000143280_73359360.pth +[2023-03-11 20:11:37,111][66031] Updated weights for policy 0, policy_version 143920 (0.0004) +[2023-03-11 20:11:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9927.6). Total num frames: 73703424. Throughput: 0: 9936.0. Samples: 73700580. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:11:39,012][65744] Avg episode reward: [(0, '4410.161')] +[2023-03-11 20:11:41,089][66031] Updated weights for policy 0, policy_version 144000 (0.0005) +[2023-03-11 20:11:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 73756672. Throughput: 0: 9966.4. Samples: 73731868. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:11:44,012][65744] Avg episode reward: [(0, '4064.012')] +[2023-03-11 20:11:45,072][66031] Updated weights for policy 0, policy_version 144080 (0.0004) +[2023-03-11 20:11:49,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 73805824. Throughput: 0: 9975.2. Samples: 73793088. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:11:49,012][65744] Avg episode reward: [(0, '3998.196')] +[2023-03-11 20:11:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000144152_73805824.pth... +[2023-03-11 20:11:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000143568_73506816.pth +[2023-03-11 20:11:49,158][66031] Updated weights for policy 0, policy_version 144160 (0.0005) +[2023-03-11 20:11:53,089][66031] Updated weights for policy 0, policy_version 144240 (0.0004) +[2023-03-11 20:11:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9941.5). Total num frames: 73859072. Throughput: 0: 10044.5. Samples: 73854784. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:11:54,012][65744] Avg episode reward: [(0, '4396.448')] +[2023-03-11 20:11:57,087][66031] Updated weights for policy 0, policy_version 144320 (0.0004) +[2023-03-11 20:11:59,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9941.5). Total num frames: 73908224. Throughput: 0: 10078.0. Samples: 73885220. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:11:59,012][65744] Avg episode reward: [(0, '4472.959')] +[2023-03-11 20:12:01,138][66031] Updated weights for policy 0, policy_version 144400 (0.0005) +[2023-03-11 20:12:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 73961472. Throughput: 0: 10135.5. Samples: 73946592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:12:04,012][65744] Avg episode reward: [(0, '4492.043')] +[2023-03-11 20:12:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000144456_73961472.pth... +[2023-03-11 20:12:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000143856_73654272.pth +[2023-03-11 20:12:05,177][66031] Updated weights for policy 0, policy_version 144480 (0.0004) +[2023-03-11 20:12:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 74010624. Throughput: 0: 10146.5. Samples: 74007904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:12:09,012][65744] Avg episode reward: [(0, '4474.521')] +[2023-03-11 20:12:09,105][66031] Updated weights for policy 0, policy_version 144560 (0.0004) +[2023-03-11 20:12:13,344][66031] Updated weights for policy 0, policy_version 144640 (0.0005) +[2023-03-11 20:12:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9927.6). Total num frames: 74059776. Throughput: 0: 10169.8. Samples: 74038192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:12:14,012][65744] Avg episode reward: [(0, '4600.319')] +[2023-03-11 20:12:14,013][65987] Saving new best policy, reward=4600.319! +[2023-03-11 20:12:17,611][66031] Updated weights for policy 0, policy_version 144720 (0.0005) +[2023-03-11 20:12:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9927.6). Total num frames: 74108928. Throughput: 0: 10141.1. Samples: 74095588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:12:19,012][65744] Avg episode reward: [(0, '4592.540')] +[2023-03-11 20:12:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000144744_74108928.pth... +[2023-03-11 20:12:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000144152_73805824.pth +[2023-03-11 20:12:21,875][66031] Updated weights for policy 0, policy_version 144800 (0.0005) +[2023-03-11 20:12:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 74153984. Throughput: 0: 10054.1. Samples: 74153016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:12:24,012][65744] Avg episode reward: [(0, '4549.806')] +[2023-03-11 20:12:26,121][66031] Updated weights for policy 0, policy_version 144880 (0.0005) +[2023-03-11 20:12:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 74203136. Throughput: 0: 10004.3. Samples: 74182060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:12:29,012][65744] Avg episode reward: [(0, '4328.033')] +[2023-03-11 20:12:30,364][66031] Updated weights for policy 0, policy_version 144960 (0.0005) +[2023-03-11 20:12:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 74252288. Throughput: 0: 9931.7. Samples: 74240012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:12:34,012][65744] Avg episode reward: [(0, '4414.687')] +[2023-03-11 20:12:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000145024_74252288.pth... +[2023-03-11 20:12:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000144456_73961472.pth +[2023-03-11 20:12:34,555][66031] Updated weights for policy 0, policy_version 145040 (0.0005) +[2023-03-11 20:12:38,825][66031] Updated weights for policy 0, policy_version 145120 (0.0005) +[2023-03-11 20:12:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 74301440. Throughput: 0: 9843.5. Samples: 74297744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:12:39,012][65744] Avg episode reward: [(0, '4318.276')] +[2023-03-11 20:12:43,117][66031] Updated weights for policy 0, policy_version 145200 (0.0005) +[2023-03-11 20:12:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9899.8). Total num frames: 74350592. Throughput: 0: 9796.9. Samples: 74326080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:12:44,012][65744] Avg episode reward: [(0, '4153.729')] +[2023-03-11 20:12:47,249][66031] Updated weights for policy 0, policy_version 145280 (0.0005) +[2023-03-11 20:12:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 74399744. Throughput: 0: 9748.2. Samples: 74385260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:12:49,012][65744] Avg episode reward: [(0, '4018.257')] +[2023-03-11 20:12:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000145312_74399744.pth... +[2023-03-11 20:12:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000144744_74108928.pth +[2023-03-11 20:12:51,450][66031] Updated weights for policy 0, policy_version 145360 (0.0005) +[2023-03-11 20:12:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9872.1). Total num frames: 74448896. Throughput: 0: 9713.4. Samples: 74445008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:12:54,012][65744] Avg episode reward: [(0, '4422.312')] +[2023-03-11 20:12:55,325][66031] Updated weights for policy 0, policy_version 145440 (0.0004) +[2023-03-11 20:12:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 74502144. Throughput: 0: 9743.8. Samples: 74476664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:12:59,012][65744] Avg episode reward: [(0, '4352.782')] +[2023-03-11 20:12:59,274][66031] Updated weights for policy 0, policy_version 145520 (0.0004) +[2023-03-11 20:13:03,260][66031] Updated weights for policy 0, policy_version 145600 (0.0004) +[2023-03-11 20:13:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9885.9). Total num frames: 74551296. Throughput: 0: 9854.0. Samples: 74539016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:13:04,012][65744] Avg episode reward: [(0, '4492.895')] +[2023-03-11 20:13:04,050][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000145616_74555392.pth... +[2023-03-11 20:13:04,052][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000145024_74252288.pth +[2023-03-11 20:13:07,227][66031] Updated weights for policy 0, policy_version 145680 (0.0004) +[2023-03-11 20:13:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 74604544. Throughput: 0: 9945.3. Samples: 74600556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:13:09,012][65744] Avg episode reward: [(0, '4419.422')] +[2023-03-11 20:13:11,288][66031] Updated weights for policy 0, policy_version 145760 (0.0005) +[2023-03-11 20:13:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 74653696. Throughput: 0: 9969.7. Samples: 74630696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:13:14,012][65744] Avg episode reward: [(0, '4168.399')] +[2023-03-11 20:13:15,527][66031] Updated weights for policy 0, policy_version 145840 (0.0005) +[2023-03-11 20:13:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 74702848. Throughput: 0: 9969.7. Samples: 74688648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:13:19,012][65744] Avg episode reward: [(0, '4071.314')] +[2023-03-11 20:13:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000145904_74702848.pth... +[2023-03-11 20:13:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000145312_74399744.pth +[2023-03-11 20:13:19,688][66031] Updated weights for policy 0, policy_version 145920 (0.0005) +[2023-03-11 20:13:23,649][66031] Updated weights for policy 0, policy_version 146000 (0.0005) +[2023-03-11 20:13:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9858.2). Total num frames: 74752000. Throughput: 0: 10048.4. Samples: 74749924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:13:24,012][65744] Avg episode reward: [(0, '4230.054')] +[2023-03-11 20:13:27,576][66031] Updated weights for policy 0, policy_version 146080 (0.0004) +[2023-03-11 20:13:29,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9872.1). Total num frames: 74805248. Throughput: 0: 10107.4. Samples: 74780912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:13:29,012][65744] Avg episode reward: [(0, '4450.680')] +[2023-03-11 20:13:31,507][66031] Updated weights for policy 0, policy_version 146160 (0.0004) +[2023-03-11 20:13:34,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 9885.9). Total num frames: 74858496. Throughput: 0: 10177.5. Samples: 74843248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:13:34,012][65744] Avg episode reward: [(0, '4488.169')] +[2023-03-11 20:13:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000146208_74858496.pth... +[2023-03-11 20:13:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000145616_74555392.pth +[2023-03-11 20:13:35,572][66031] Updated weights for policy 0, policy_version 146240 (0.0005) +[2023-03-11 20:13:39,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9885.9). Total num frames: 74907648. Throughput: 0: 10197.8. Samples: 74903908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:13:39,012][65744] Avg episode reward: [(0, '4522.659')] +[2023-03-11 20:13:39,595][66031] Updated weights for policy 0, policy_version 146320 (0.0004) +[2023-03-11 20:13:43,578][66031] Updated weights for policy 0, policy_version 146400 (0.0004) +[2023-03-11 20:13:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9899.8). Total num frames: 74960896. Throughput: 0: 10186.7. Samples: 74935064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:13:44,012][65744] Avg episode reward: [(0, '4435.053')] +[2023-03-11 20:13:47,580][66031] Updated weights for policy 0, policy_version 146480 (0.0004) +[2023-03-11 20:13:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9899.8). Total num frames: 75010048. Throughput: 0: 10167.3. Samples: 74996544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:13:49,012][65744] Avg episode reward: [(0, '4396.762')] +[2023-03-11 20:13:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000146504_75010048.pth... +[2023-03-11 20:13:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000145904_74702848.pth +[2023-03-11 20:13:51,607][66031] Updated weights for policy 0, policy_version 146560 (0.0004) +[2023-03-11 20:13:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9899.8). Total num frames: 75059200. Throughput: 0: 10148.0. Samples: 75057216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:13:54,012][65744] Avg episode reward: [(0, '4390.375')] +[2023-03-11 20:13:55,667][66031] Updated weights for policy 0, policy_version 146640 (0.0004) +[2023-03-11 20:13:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9913.7). Total num frames: 75112448. Throughput: 0: 10159.7. Samples: 75087884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:13:59,012][65744] Avg episode reward: [(0, '4382.547')] +[2023-03-11 20:13:59,665][66031] Updated weights for policy 0, policy_version 146720 (0.0004) +[2023-03-11 20:14:03,702][66031] Updated weights for policy 0, policy_version 146800 (0.0005) +[2023-03-11 20:14:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9913.7). Total num frames: 75161600. Throughput: 0: 10218.8. Samples: 75148496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:14:04,012][65744] Avg episode reward: [(0, '4556.909')] +[2023-03-11 20:14:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000146800_75161600.pth... +[2023-03-11 20:14:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000146208_74858496.pth +[2023-03-11 20:14:07,650][66031] Updated weights for policy 0, policy_version 146880 (0.0004) +[2023-03-11 20:14:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9927.6). Total num frames: 75214848. Throughput: 0: 10242.0. Samples: 75210816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:14:09,012][65744] Avg episode reward: [(0, '4481.255')] +[2023-03-11 20:14:11,700][66031] Updated weights for policy 0, policy_version 146960 (0.0005) +[2023-03-11 20:14:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9927.6). Total num frames: 75264000. Throughput: 0: 10223.5. Samples: 75240968. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:14:14,012][65744] Avg episode reward: [(0, '4337.907')] +[2023-03-11 20:14:15,679][66031] Updated weights for policy 0, policy_version 147040 (0.0005) +[2023-03-11 20:14:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9941.5). Total num frames: 75317248. Throughput: 0: 10210.0. Samples: 75302696. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:14:19,012][65744] Avg episode reward: [(0, '4157.691')] +[2023-03-11 20:14:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000147104_75317248.pth... +[2023-03-11 20:14:19,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000146504_75010048.pth +[2023-03-11 20:14:19,660][66031] Updated weights for policy 0, policy_version 147120 (0.0004) +[2023-03-11 20:14:23,621][66031] Updated weights for policy 0, policy_version 147200 (0.0004) +[2023-03-11 20:14:24,012][65744] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 9955.4). Total num frames: 75370496. Throughput: 0: 10238.1. Samples: 75364624. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:14:24,012][65744] Avg episode reward: [(0, '4485.253')] +[2023-03-11 20:14:27,495][66031] Updated weights for policy 0, policy_version 147280 (0.0004) +[2023-03-11 20:14:29,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 9955.4). Total num frames: 75419648. Throughput: 0: 10246.6. Samples: 75396160. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:14:29,012][65744] Avg episode reward: [(0, '4354.066')] +[2023-03-11 20:14:31,491][66031] Updated weights for policy 0, policy_version 147360 (0.0004) +[2023-03-11 20:14:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 9969.2). Total num frames: 75472896. Throughput: 0: 10264.0. Samples: 75458424. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:14:34,012][65744] Avg episode reward: [(0, '4599.476')] +[2023-03-11 20:14:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000147408_75472896.pth... +[2023-03-11 20:14:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000146800_75161600.pth +[2023-03-11 20:14:35,423][66031] Updated weights for policy 0, policy_version 147440 (0.0004) +[2023-03-11 20:14:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9969.2). Total num frames: 75522048. Throughput: 0: 10288.5. Samples: 75520196. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:14:39,012][65744] Avg episode reward: [(0, '4559.154')] +[2023-03-11 20:14:39,403][66031] Updated weights for policy 0, policy_version 147520 (0.0004) +[2023-03-11 20:14:43,294][66031] Updated weights for policy 0, policy_version 147600 (0.0004) +[2023-03-11 20:14:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9983.1). Total num frames: 75575296. Throughput: 0: 10306.6. Samples: 75551680. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:14:44,012][65744] Avg episode reward: [(0, '4464.305')] +[2023-03-11 20:14:47,306][66031] Updated weights for policy 0, policy_version 147680 (0.0004) +[2023-03-11 20:14:49,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 9997.0). Total num frames: 75628544. Throughput: 0: 10334.3. Samples: 75613540. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:14:49,012][65744] Avg episode reward: [(0, '4457.497')] +[2023-03-11 20:14:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000147712_75628544.pth... +[2023-03-11 20:14:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000147104_75317248.pth +[2023-03-11 20:14:51,292][66031] Updated weights for policy 0, policy_version 147760 (0.0004) +[2023-03-11 20:14:54,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 9997.0). Total num frames: 75677696. Throughput: 0: 10335.8. Samples: 75675928. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:14:54,012][65744] Avg episode reward: [(0, '4569.123')] +[2023-03-11 20:14:55,192][66031] Updated weights for policy 0, policy_version 147840 (0.0004) +[2023-03-11 20:14:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10010.9). Total num frames: 75730944. Throughput: 0: 10349.5. Samples: 75706696. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:14:59,012][65744] Avg episode reward: [(0, '4366.593')] +[2023-03-11 20:14:59,166][66031] Updated weights for policy 0, policy_version 147920 (0.0004) +[2023-03-11 20:15:03,114][66031] Updated weights for policy 0, policy_version 148000 (0.0005) +[2023-03-11 20:15:04,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10038.7). Total num frames: 75784192. Throughput: 0: 10357.2. Samples: 75768772. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:15:04,012][65744] Avg episode reward: [(0, '4178.357')] +[2023-03-11 20:15:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000148016_75784192.pth... +[2023-03-11 20:15:04,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000147408_75472896.pth +[2023-03-11 20:15:06,997][66031] Updated weights for policy 0, policy_version 148080 (0.0004) +[2023-03-11 20:15:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10052.6). Total num frames: 75837440. Throughput: 0: 10385.1. Samples: 75831952. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:15:09,012][65744] Avg episode reward: [(0, '4162.165')] +[2023-03-11 20:15:11,036][66031] Updated weights for policy 0, policy_version 148160 (0.0005) +[2023-03-11 20:15:14,017][65744] Fps is (10 sec: 10234.4, 60 sec: 10375.6, 300 sec: 10052.4). Total num frames: 75886592. Throughput: 0: 10351.3. Samples: 75862024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:15:14,018][65744] Avg episode reward: [(0, '4222.109')] +[2023-03-11 20:15:15,301][66031] Updated weights for policy 0, policy_version 148240 (0.0004) +[2023-03-11 20:15:19,012][65744] Fps is (10 sec: 9420.7, 60 sec: 10240.0, 300 sec: 10052.6). Total num frames: 75931648. Throughput: 0: 10244.4. Samples: 75919424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:15:19,012][65744] Avg episode reward: [(0, '3975.049')] +[2023-03-11 20:15:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000148304_75931648.pth... +[2023-03-11 20:15:19,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000147712_75628544.pth +[2023-03-11 20:15:19,568][66031] Updated weights for policy 0, policy_version 148320 (0.0005) +[2023-03-11 20:15:23,801][66031] Updated weights for policy 0, policy_version 148400 (0.0005) +[2023-03-11 20:15:24,012][65744] Fps is (10 sec: 9426.0, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 75980800. Throughput: 0: 10151.9. Samples: 75977032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:15:24,012][65744] Avg episode reward: [(0, '4397.375')] +[2023-03-11 20:15:27,759][66031] Updated weights for policy 0, policy_version 148480 (0.0004) +[2023-03-11 20:15:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10066.4). Total num frames: 76034048. Throughput: 0: 10138.8. Samples: 76007928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:15:29,012][65744] Avg episode reward: [(0, '4275.362')] +[2023-03-11 20:15:31,732][66031] Updated weights for policy 0, policy_version 148560 (0.0004) +[2023-03-11 20:15:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10066.4). Total num frames: 76083200. Throughput: 0: 10142.6. Samples: 76069956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:15:34,012][65744] Avg episode reward: [(0, '3532.269')] +[2023-03-11 20:15:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000148600_76083200.pth... +[2023-03-11 20:15:34,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000148016_75784192.pth +[2023-03-11 20:15:36,053][66031] Updated weights for policy 0, policy_version 148640 (0.0005) +[2023-03-11 20:15:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10080.3). Total num frames: 76132352. Throughput: 0: 10037.9. Samples: 76127636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:15:39,012][65744] Avg episode reward: [(0, '4307.180')] +[2023-03-11 20:15:40,202][66031] Updated weights for policy 0, policy_version 148720 (0.0005) +[2023-03-11 20:15:44,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10080.3). Total num frames: 76181504. Throughput: 0: 10012.4. Samples: 76157256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:15:44,012][65744] Avg episode reward: [(0, '4504.492')] +[2023-03-11 20:15:44,136][66031] Updated weights for policy 0, policy_version 148800 (0.0005) +[2023-03-11 20:15:48,104][66031] Updated weights for policy 0, policy_version 148880 (0.0004) +[2023-03-11 20:15:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 76234752. Throughput: 0: 10021.2. Samples: 76219728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:15:49,012][65744] Avg episode reward: [(0, '2952.325')] +[2023-03-11 20:15:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000148896_76234752.pth... +[2023-03-11 20:15:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000148304_75931648.pth +[2023-03-11 20:15:52,171][66031] Updated weights for policy 0, policy_version 148960 (0.0005) +[2023-03-11 20:15:54,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.4, 300 sec: 10094.2). Total num frames: 76283904. Throughput: 0: 9977.1. Samples: 76280920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:15:54,012][65744] Avg episode reward: [(0, '4277.467')] +[2023-03-11 20:15:56,103][66031] Updated weights for policy 0, policy_version 149040 (0.0005) +[2023-03-11 20:15:59,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10080.3). Total num frames: 76333056. Throughput: 0: 10005.2. Samples: 76312204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:15:59,012][65744] Avg episode reward: [(0, '4534.982')] +[2023-03-11 20:16:00,378][66031] Updated weights for policy 0, policy_version 149120 (0.0005) +[2023-03-11 20:16:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10094.2). Total num frames: 76382208. Throughput: 0: 10001.4. Samples: 76369488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:16:04,023][65744] Avg episode reward: [(0, '4584.079')] +[2023-03-11 20:16:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000149184_76382208.pth... +[2023-03-11 20:16:04,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000148600_76083200.pth +[2023-03-11 20:16:04,636][66031] Updated weights for policy 0, policy_version 149200 (0.0005) +[2023-03-11 20:16:08,876][66031] Updated weights for policy 0, policy_version 149280 (0.0005) +[2023-03-11 20:16:09,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 10094.2). Total num frames: 76431360. Throughput: 0: 10006.6. Samples: 76427328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:16:09,012][65744] Avg episode reward: [(0, '4568.046')] +[2023-03-11 20:16:13,177][66031] Updated weights for policy 0, policy_version 149360 (0.0005) +[2023-03-11 20:16:14,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9831.3, 300 sec: 10080.3). Total num frames: 76476416. Throughput: 0: 9956.0. Samples: 76455948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:16:14,012][65744] Avg episode reward: [(0, '4521.475')] +[2023-03-11 20:16:17,417][66031] Updated weights for policy 0, policy_version 149440 (0.0005) +[2023-03-11 20:16:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 10066.4). Total num frames: 76525568. Throughput: 0: 9853.3. Samples: 76513352. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:16:19,012][65744] Avg episode reward: [(0, '4465.690')] +[2023-03-11 20:16:19,089][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000149472_76529664.pth... +[2023-03-11 20:16:19,091][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000148896_76234752.pth +[2023-03-11 20:16:21,585][66031] Updated weights for policy 0, policy_version 149520 (0.0005) +[2023-03-11 20:16:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10066.4). Total num frames: 76574720. Throughput: 0: 9882.6. Samples: 76572352. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:16:24,012][65744] Avg episode reward: [(0, '4492.290')] +[2023-03-11 20:16:25,772][66031] Updated weights for policy 0, policy_version 149600 (0.0005) +[2023-03-11 20:16:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10066.4). Total num frames: 76623872. Throughput: 0: 9884.8. Samples: 76602072. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:16:29,012][65744] Avg episode reward: [(0, '4574.095')] +[2023-03-11 20:16:29,977][66031] Updated weights for policy 0, policy_version 149680 (0.0005) +[2023-03-11 20:16:33,942][66031] Updated weights for policy 0, policy_version 149760 (0.0005) +[2023-03-11 20:16:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 10080.3). Total num frames: 76677120. Throughput: 0: 9820.7. Samples: 76661660. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:16:34,012][65744] Avg episode reward: [(0, '4552.344')] +[2023-03-11 20:16:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000149760_76677120.pth... +[2023-03-11 20:16:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000149184_76382208.pth +[2023-03-11 20:16:37,844][66031] Updated weights for policy 0, policy_version 149840 (0.0005) +[2023-03-11 20:16:39,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 10066.4). Total num frames: 76726272. Throughput: 0: 9847.4. Samples: 76724052. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:16:39,012][65744] Avg episode reward: [(0, '4408.664')] +[2023-03-11 20:16:42,039][66031] Updated weights for policy 0, policy_version 149920 (0.0005) +[2023-03-11 20:16:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10066.4). Total num frames: 76775424. Throughput: 0: 9806.4. Samples: 76753492. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:16:44,012][65744] Avg episode reward: [(0, '4440.526')] +[2023-03-11 20:16:46,246][66031] Updated weights for policy 0, policy_version 150000 (0.0005) +[2023-03-11 20:16:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 10052.6). Total num frames: 76824576. Throughput: 0: 9840.2. Samples: 76812296. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:16:49,012][65744] Avg episode reward: [(0, '4431.910')] +[2023-03-11 20:16:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000150048_76824576.pth... +[2023-03-11 20:16:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000149472_76529664.pth +[2023-03-11 20:16:50,321][66031] Updated weights for policy 0, policy_version 150080 (0.0005) +[2023-03-11 20:16:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10052.6). Total num frames: 76873728. Throughput: 0: 9866.7. Samples: 76871328. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:16:54,012][65744] Avg episode reward: [(0, '4366.611')] +[2023-03-11 20:16:54,586][66031] Updated weights for policy 0, policy_version 150160 (0.0005) +[2023-03-11 20:16:58,693][66031] Updated weights for policy 0, policy_version 150240 (0.0005) +[2023-03-11 20:16:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10038.7). Total num frames: 76922880. Throughput: 0: 9859.2. Samples: 76899612. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:16:59,012][65744] Avg episode reward: [(0, '4297.900')] +[2023-03-11 20:17:02,625][66031] Updated weights for policy 0, policy_version 150320 (0.0005) +[2023-03-11 20:17:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10052.6). Total num frames: 76976128. Throughput: 0: 9969.2. Samples: 76961968. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:17:04,012][65744] Avg episode reward: [(0, '4476.364')] +[2023-03-11 20:17:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000150344_76976128.pth... +[2023-03-11 20:17:04,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000149760_76677120.pth +[2023-03-11 20:17:06,567][66031] Updated weights for policy 0, policy_version 150400 (0.0004) +[2023-03-11 20:17:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 9966.9, 300 sec: 10066.4). Total num frames: 77029376. Throughput: 0: 10047.6. Samples: 77024496. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:17:09,012][65744] Avg episode reward: [(0, '4521.312')] +[2023-03-11 20:17:10,457][66031] Updated weights for policy 0, policy_version 150480 (0.0004) +[2023-03-11 20:17:14,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 10080.3). Total num frames: 77082624. Throughput: 0: 10087.8. Samples: 77056024. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:17:14,012][65744] Avg episode reward: [(0, '4544.675')] +[2023-03-11 20:17:14,364][66031] Updated weights for policy 0, policy_version 150560 (0.0004) +[2023-03-11 20:17:18,319][66031] Updated weights for policy 0, policy_version 150640 (0.0004) +[2023-03-11 20:17:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 77131776. Throughput: 0: 10164.7. Samples: 77119072. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:17:19,012][65744] Avg episode reward: [(0, '4548.310')] +[2023-03-11 20:17:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000150648_77131776.pth... +[2023-03-11 20:17:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000150048_76824576.pth +[2023-03-11 20:17:22,324][66031] Updated weights for policy 0, policy_version 150720 (0.0005) +[2023-03-11 20:17:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 77185024. Throughput: 0: 10132.1. Samples: 77179996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:17:24,012][65744] Avg episode reward: [(0, '4509.895')] +[2023-03-11 20:17:26,523][66031] Updated weights for policy 0, policy_version 150800 (0.0005) +[2023-03-11 20:17:29,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 77230080. Throughput: 0: 10125.4. Samples: 77209136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:17:29,012][65744] Avg episode reward: [(0, '4277.546')] +[2023-03-11 20:17:30,783][66031] Updated weights for policy 0, policy_version 150880 (0.0005) +[2023-03-11 20:17:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 77279232. Throughput: 0: 10101.7. Samples: 77266872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:17:34,012][65744] Avg episode reward: [(0, '4582.576')] +[2023-03-11 20:17:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000150936_77279232.pth... +[2023-03-11 20:17:34,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000150344_76976128.pth +[2023-03-11 20:17:35,040][66031] Updated weights for policy 0, policy_version 150960 (0.0005) +[2023-03-11 20:17:38,922][66031] Updated weights for policy 0, policy_version 151040 (0.0004) +[2023-03-11 20:17:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10108.1). Total num frames: 77332480. Throughput: 0: 10137.9. Samples: 77327532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:17:39,012][65744] Avg episode reward: [(0, '4579.631')] +[2023-03-11 20:17:43,109][66031] Updated weights for policy 0, policy_version 151120 (0.0005) +[2023-03-11 20:17:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10108.1). Total num frames: 77381632. Throughput: 0: 10166.8. Samples: 77357120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:17:44,012][65744] Avg episode reward: [(0, '4529.925')] +[2023-03-11 20:17:47,324][66031] Updated weights for policy 0, policy_version 151200 (0.0005) +[2023-03-11 20:17:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 77426688. Throughput: 0: 10078.1. Samples: 77415480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:17:49,012][65744] Avg episode reward: [(0, '4489.317')] +[2023-03-11 20:17:49,024][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000151232_77430784.pth... +[2023-03-11 20:17:49,025][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000150648_77131776.pth +[2023-03-11 20:17:51,509][66031] Updated weights for policy 0, policy_version 151280 (0.0005) +[2023-03-11 20:17:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 77479936. Throughput: 0: 10019.7. Samples: 77475380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:17:54,012][65744] Avg episode reward: [(0, '4384.776')] +[2023-03-11 20:17:55,593][66031] Updated weights for policy 0, policy_version 151360 (0.0005) +[2023-03-11 20:17:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 77529088. Throughput: 0: 9967.8. Samples: 77504576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:17:59,012][65744] Avg episode reward: [(0, '4303.736')] +[2023-03-11 20:17:59,726][66031] Updated weights for policy 0, policy_version 151440 (0.0005) +[2023-03-11 20:18:03,649][66031] Updated weights for policy 0, policy_version 151520 (0.0004) +[2023-03-11 20:18:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10080.3). Total num frames: 77578240. Throughput: 0: 9924.7. Samples: 77565684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:18:04,012][65744] Avg episode reward: [(0, '4361.284')] +[2023-03-11 20:18:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000151520_77578240.pth... +[2023-03-11 20:18:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000150936_77279232.pth +[2023-03-11 20:18:07,940][66031] Updated weights for policy 0, policy_version 151600 (0.0005) +[2023-03-11 20:18:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10080.3). Total num frames: 77627392. Throughput: 0: 9861.1. Samples: 77623744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:18:09,012][65744] Avg episode reward: [(0, '4331.933')] +[2023-03-11 20:18:12,196][66031] Updated weights for policy 0, policy_version 151680 (0.0005) +[2023-03-11 20:18:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10080.3). Total num frames: 77676544. Throughput: 0: 9860.5. Samples: 77652860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:18:14,012][65744] Avg episode reward: [(0, '4175.675')] +[2023-03-11 20:18:16,427][66031] Updated weights for policy 0, policy_version 151760 (0.0005) +[2023-03-11 20:18:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10080.3). Total num frames: 77725696. Throughput: 0: 9875.6. Samples: 77711272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:18:19,012][65744] Avg episode reward: [(0, '4443.270')] +[2023-03-11 20:18:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000151808_77725696.pth... +[2023-03-11 20:18:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000151232_77430784.pth +[2023-03-11 20:18:20,619][66031] Updated weights for policy 0, policy_version 151840 (0.0005) +[2023-03-11 20:18:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 10052.6). Total num frames: 77770752. Throughput: 0: 9823.9. Samples: 77769608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:18:24,012][65744] Avg episode reward: [(0, '4162.750')] +[2023-03-11 20:18:24,868][66031] Updated weights for policy 0, policy_version 151920 (0.0005) +[2023-03-11 20:18:28,966][66031] Updated weights for policy 0, policy_version 152000 (0.0005) +[2023-03-11 20:18:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10052.6). Total num frames: 77824000. Throughput: 0: 9817.0. Samples: 77798884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:18:29,012][65744] Avg episode reward: [(0, '4359.648')] +[2023-03-11 20:18:32,999][66031] Updated weights for policy 0, policy_version 152080 (0.0005) +[2023-03-11 20:18:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10052.6). Total num frames: 77873152. Throughput: 0: 9867.4. Samples: 77859516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:18:34,012][65744] Avg episode reward: [(0, '4355.610')] +[2023-03-11 20:18:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000152096_77873152.pth... +[2023-03-11 20:18:34,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000151520_77578240.pth +[2023-03-11 20:18:37,266][66031] Updated weights for policy 0, policy_version 152160 (0.0005) +[2023-03-11 20:18:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10038.7). Total num frames: 77922304. Throughput: 0: 9830.2. Samples: 77917740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:18:39,012][65744] Avg episode reward: [(0, '4424.622')] +[2023-03-11 20:18:41,373][66031] Updated weights for policy 0, policy_version 152240 (0.0005) +[2023-03-11 20:18:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10038.7). Total num frames: 77971456. Throughput: 0: 9840.6. Samples: 77947404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:18:44,012][65744] Avg episode reward: [(0, '4508.185')] +[2023-03-11 20:18:45,540][66031] Updated weights for policy 0, policy_version 152320 (0.0005) +[2023-03-11 20:18:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 78020608. Throughput: 0: 9803.2. Samples: 78006828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:18:49,012][65744] Avg episode reward: [(0, '4409.695')] +[2023-03-11 20:18:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000152384_78020608.pth... +[2023-03-11 20:18:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000151808_77725696.pth +[2023-03-11 20:18:49,543][66031] Updated weights for policy 0, policy_version 152400 (0.0005) +[2023-03-11 20:18:53,494][66031] Updated weights for policy 0, policy_version 152480 (0.0004) +[2023-03-11 20:18:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 78073856. Throughput: 0: 9900.5. Samples: 78069268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:18:54,012][65744] Avg episode reward: [(0, '4360.870')] +[2023-03-11 20:18:57,440][66031] Updated weights for policy 0, policy_version 152560 (0.0004) +[2023-03-11 20:18:59,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 78123008. Throughput: 0: 9934.7. Samples: 78099920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:18:59,012][65744] Avg episode reward: [(0, '4478.865')] +[2023-03-11 20:19:01,643][66031] Updated weights for policy 0, policy_version 152640 (0.0005) +[2023-03-11 20:19:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10024.8). Total num frames: 78172160. Throughput: 0: 9968.9. Samples: 78159872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:19:04,012][65744] Avg episode reward: [(0, '4502.879')] +[2023-03-11 20:19:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000152680_78172160.pth... +[2023-03-11 20:19:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000152096_77873152.pth +[2023-03-11 20:19:05,830][66031] Updated weights for policy 0, policy_version 152720 (0.0005) +[2023-03-11 20:19:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10038.7). Total num frames: 78225408. Throughput: 0: 10022.2. Samples: 78220608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:19:09,012][65744] Avg episode reward: [(0, '4552.808')] +[2023-03-11 20:19:09,670][66031] Updated weights for policy 0, policy_version 152800 (0.0004) +[2023-03-11 20:19:13,585][66031] Updated weights for policy 0, policy_version 152880 (0.0004) +[2023-03-11 20:19:14,012][65744] Fps is (10 sec: 10649.7, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 78278656. Throughput: 0: 10070.3. Samples: 78252048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:19:14,012][65744] Avg episode reward: [(0, '4415.549')] +[2023-03-11 20:19:17,589][66031] Updated weights for policy 0, policy_version 152960 (0.0004) +[2023-03-11 20:19:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10024.8). Total num frames: 78327808. Throughput: 0: 10116.8. Samples: 78314772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:19:19,012][65744] Avg episode reward: [(0, '4134.832')] +[2023-03-11 20:19:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000152984_78327808.pth... +[2023-03-11 20:19:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000152384_78020608.pth +[2023-03-11 20:19:21,749][66031] Updated weights for policy 0, policy_version 153040 (0.0005) +[2023-03-11 20:19:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10024.8). Total num frames: 78376960. Throughput: 0: 10114.0. Samples: 78372872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:19:24,012][65744] Avg episode reward: [(0, '4350.042')] +[2023-03-11 20:19:25,953][66031] Updated weights for policy 0, policy_version 153120 (0.0005) +[2023-03-11 20:19:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10010.9). Total num frames: 78426112. Throughput: 0: 10108.5. Samples: 78402284. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:19:29,012][65744] Avg episode reward: [(0, '4371.540')] +[2023-03-11 20:19:30,091][66031] Updated weights for policy 0, policy_version 153200 (0.0005) +[2023-03-11 20:19:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10010.9). Total num frames: 78475264. Throughput: 0: 10092.3. Samples: 78460980. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:19:34,012][65744] Avg episode reward: [(0, '4462.895')] +[2023-03-11 20:19:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000153272_78475264.pth... +[2023-03-11 20:19:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000152680_78172160.pth +[2023-03-11 20:19:34,369][66031] Updated weights for policy 0, policy_version 153280 (0.0005) +[2023-03-11 20:19:38,502][66031] Updated weights for policy 0, policy_version 153360 (0.0005) +[2023-03-11 20:19:39,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9997.0). Total num frames: 78524416. Throughput: 0: 10017.4. Samples: 78520052. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:19:39,012][65744] Avg episode reward: [(0, '4581.959')] +[2023-03-11 20:19:42,418][66031] Updated weights for policy 0, policy_version 153440 (0.0005) +[2023-03-11 20:19:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9997.0). Total num frames: 78577664. Throughput: 0: 10031.5. Samples: 78551336. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:19:44,012][65744] Avg episode reward: [(0, '4537.951')] +[2023-03-11 20:19:46,344][66031] Updated weights for policy 0, policy_version 153520 (0.0005) +[2023-03-11 20:19:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9997.0). Total num frames: 78626816. Throughput: 0: 10089.2. Samples: 78613884. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:19:49,012][65744] Avg episode reward: [(0, '4547.093')] +[2023-03-11 20:19:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000153568_78626816.pth... +[2023-03-11 20:19:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000152984_78327808.pth +[2023-03-11 20:19:50,282][66031] Updated weights for policy 0, policy_version 153600 (0.0005) +[2023-03-11 20:19:54,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9997.0). Total num frames: 78680064. Throughput: 0: 10124.8. Samples: 78676224. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:19:54,012][65744] Avg episode reward: [(0, '4330.269')] +[2023-03-11 20:19:54,176][66031] Updated weights for policy 0, policy_version 153680 (0.0005) +[2023-03-11 20:19:58,016][66031] Updated weights for policy 0, policy_version 153760 (0.0004) +[2023-03-11 20:19:59,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 9997.0). Total num frames: 78733312. Throughput: 0: 10142.2. Samples: 78708448. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:19:59,012][65744] Avg episode reward: [(0, '4545.672')] +[2023-03-11 20:20:01,972][66031] Updated weights for policy 0, policy_version 153840 (0.0005) +[2023-03-11 20:20:04,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 9997.0). Total num frames: 78786560. Throughput: 0: 10137.1. Samples: 78770944. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:20:04,012][65744] Avg episode reward: [(0, '4218.608')] +[2023-03-11 20:20:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000153880_78786560.pth... +[2023-03-11 20:20:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000153272_78475264.pth +[2023-03-11 20:20:05,896][66031] Updated weights for policy 0, policy_version 153920 (0.0004) +[2023-03-11 20:20:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9997.2). Total num frames: 78835712. Throughput: 0: 10239.6. Samples: 78833656. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:20:09,012][65744] Avg episode reward: [(0, '4455.229')] +[2023-03-11 20:20:09,799][66031] Updated weights for policy 0, policy_version 154000 (0.0005) +[2023-03-11 20:20:13,954][66031] Updated weights for policy 0, policy_version 154080 (0.0005) +[2023-03-11 20:20:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10024.8). Total num frames: 78888960. Throughput: 0: 10270.3. Samples: 78864448. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:20:14,012][65744] Avg episode reward: [(0, '4550.212')] +[2023-03-11 20:20:17,869][66031] Updated weights for policy 0, policy_version 154160 (0.0003) +[2023-03-11 20:20:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10024.8). Total num frames: 78938112. Throughput: 0: 10329.1. Samples: 78925788. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:20:19,012][65744] Avg episode reward: [(0, '4602.423')] +[2023-03-11 20:20:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000154176_78938112.pth... +[2023-03-11 20:20:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000153568_78626816.pth +[2023-03-11 20:20:19,018][65987] Saving new best policy, reward=4602.423! +[2023-03-11 20:20:21,831][66031] Updated weights for policy 0, policy_version 154240 (0.0004) +[2023-03-11 20:20:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10024.8). Total num frames: 78991360. Throughput: 0: 10374.3. Samples: 78986896. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:20:24,012][65744] Avg episode reward: [(0, '4588.688')] +[2023-03-11 20:20:25,886][66031] Updated weights for policy 0, policy_version 154320 (0.0004) +[2023-03-11 20:20:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10024.8). Total num frames: 79040512. Throughput: 0: 10367.4. Samples: 79017868. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:20:29,012][65744] Avg episode reward: [(0, '4570.448')] +[2023-03-11 20:20:29,806][66031] Updated weights for policy 0, policy_version 154400 (0.0003) +[2023-03-11 20:20:33,736][66031] Updated weights for policy 0, policy_version 154480 (0.0004) +[2023-03-11 20:20:34,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10038.7). Total num frames: 79093760. Throughput: 0: 10360.5. Samples: 79080104. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:20:34,012][65744] Avg episode reward: [(0, '4600.245')] +[2023-03-11 20:20:34,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000154480_79093760.pth... +[2023-03-11 20:20:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000153880_78786560.pth +[2023-03-11 20:20:37,610][66031] Updated weights for policy 0, policy_version 154560 (0.0004) +[2023-03-11 20:20:39,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10052.6). Total num frames: 79147008. Throughput: 0: 10372.4. Samples: 79142984. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:20:39,012][65744] Avg episode reward: [(0, '4600.621')] +[2023-03-11 20:20:41,583][66031] Updated weights for policy 0, policy_version 154640 (0.0005) +[2023-03-11 20:20:44,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10052.6). Total num frames: 79200256. Throughput: 0: 10359.2. Samples: 79174612. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:20:44,012][65744] Avg episode reward: [(0, '4564.461')] +[2023-03-11 20:20:45,520][66031] Updated weights for policy 0, policy_version 154720 (0.0005) +[2023-03-11 20:20:49,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10066.4). Total num frames: 79253504. Throughput: 0: 10360.9. Samples: 79237184. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:20:49,012][65744] Avg episode reward: [(0, '3906.175')] +[2023-03-11 20:20:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000154792_79253504.pth... +[2023-03-11 20:20:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000154176_78938112.pth +[2023-03-11 20:20:49,408][66031] Updated weights for policy 0, policy_version 154800 (0.0005) +[2023-03-11 20:20:53,406][66031] Updated weights for policy 0, policy_version 154880 (0.0005) +[2023-03-11 20:20:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10066.4). Total num frames: 79302656. Throughput: 0: 10333.6. Samples: 79298668. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:20:54,012][65744] Avg episode reward: [(0, '3947.867')] +[2023-03-11 20:20:57,299][66031] Updated weights for policy 0, policy_version 154960 (0.0004) +[2023-03-11 20:20:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10080.3). Total num frames: 79355904. Throughput: 0: 10365.0. Samples: 79330872. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:20:59,012][65744] Avg episode reward: [(0, '4423.349')] +[2023-03-11 20:21:01,176][66031] Updated weights for policy 0, policy_version 155040 (0.0004) +[2023-03-11 20:21:04,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10094.2). Total num frames: 79409152. Throughput: 0: 10381.9. Samples: 79392972. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:21:04,012][65744] Avg episode reward: [(0, '4563.839')] +[2023-03-11 20:21:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000155096_79409152.pth... +[2023-03-11 20:21:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000154480_79093760.pth +[2023-03-11 20:21:05,080][66031] Updated weights for policy 0, policy_version 155120 (0.0004) +[2023-03-11 20:21:08,976][66031] Updated weights for policy 0, policy_version 155200 (0.0004) +[2023-03-11 20:21:09,012][65744] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10122.0). Total num frames: 79462400. Throughput: 0: 10443.7. Samples: 79456860. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:21:09,012][65744] Avg episode reward: [(0, '4239.772')] +[2023-03-11 20:21:12,873][66031] Updated weights for policy 0, policy_version 155280 (0.0004) +[2023-03-11 20:21:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10122.0). Total num frames: 79511552. Throughput: 0: 10443.2. Samples: 79487812. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:21:14,012][65744] Avg episode reward: [(0, '4458.449')] +[2023-03-11 20:21:16,774][66031] Updated weights for policy 0, policy_version 155360 (0.0005) +[2023-03-11 20:21:19,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10135.9). Total num frames: 79564800. Throughput: 0: 10480.3. Samples: 79551720. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:21:19,012][65744] Avg episode reward: [(0, '4556.259')] +[2023-03-11 20:21:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000155400_79564800.pth... +[2023-03-11 20:21:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000154792_79253504.pth +[2023-03-11 20:21:20,639][66031] Updated weights for policy 0, policy_version 155440 (0.0004) +[2023-03-11 20:21:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10149.7). Total num frames: 79618048. Throughput: 0: 10464.0. Samples: 79613864. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:21:24,012][65744] Avg episode reward: [(0, '4536.144')] +[2023-03-11 20:21:24,658][66031] Updated weights for policy 0, policy_version 155520 (0.0005) +[2023-03-11 20:21:28,622][66031] Updated weights for policy 0, policy_version 155600 (0.0005) +[2023-03-11 20:21:29,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10135.9). Total num frames: 79667200. Throughput: 0: 10439.8. Samples: 79644400. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:21:29,012][65744] Avg episode reward: [(0, '4552.010')] +[2023-03-11 20:21:32,749][66031] Updated weights for policy 0, policy_version 155680 (0.0005) +[2023-03-11 20:21:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10149.7). Total num frames: 79720448. Throughput: 0: 10404.5. Samples: 79705388. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:21:34,012][65744] Avg episode reward: [(0, '4583.244')] +[2023-03-11 20:21:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000155704_79720448.pth... +[2023-03-11 20:21:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000155096_79409152.pth +[2023-03-11 20:21:36,940][66031] Updated weights for policy 0, policy_version 155760 (0.0005) +[2023-03-11 20:21:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10135.9). Total num frames: 79765504. Throughput: 0: 10335.1. Samples: 79763748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:21:39,012][65744] Avg episode reward: [(0, '4588.409')] +[2023-03-11 20:21:41,117][66031] Updated weights for policy 0, policy_version 155840 (0.0005) +[2023-03-11 20:21:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 79814656. Throughput: 0: 10284.0. Samples: 79793652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:21:44,012][65744] Avg episode reward: [(0, '4597.257')] +[2023-03-11 20:21:45,320][66031] Updated weights for policy 0, policy_version 155920 (0.0005) +[2023-03-11 20:21:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 79863808. Throughput: 0: 10191.4. Samples: 79851584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:21:49,012][65744] Avg episode reward: [(0, '4514.313')] +[2023-03-11 20:21:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000155984_79863808.pth... +[2023-03-11 20:21:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000155400_79564800.pth +[2023-03-11 20:21:49,501][66031] Updated weights for policy 0, policy_version 156000 (0.0005) +[2023-03-11 20:21:53,659][66031] Updated weights for policy 0, policy_version 156080 (0.0005) +[2023-03-11 20:21:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.8, 300 sec: 10135.9). Total num frames: 79912960. Throughput: 0: 10098.1. Samples: 79911272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:21:54,012][65744] Avg episode reward: [(0, '4522.042')] +[2023-03-11 20:21:57,860][66031] Updated weights for policy 0, policy_version 156160 (0.0005) +[2023-03-11 20:21:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10122.0). Total num frames: 79962112. Throughput: 0: 10054.6. Samples: 79940272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:21:59,012][65744] Avg episode reward: [(0, '4317.163')] +[2023-03-11 20:22:02,075][66031] Updated weights for policy 0, policy_version 156240 (0.0005) +[2023-03-11 20:22:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 80011264. Throughput: 0: 9939.0. Samples: 79998976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:22:04,012][65744] Avg episode reward: [(0, '4472.454')] +[2023-03-11 20:22:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000156272_80011264.pth... +[2023-03-11 20:22:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000155704_79720448.pth +[2023-03-11 20:22:06,300][66031] Updated weights for policy 0, policy_version 156320 (0.0005) +[2023-03-11 20:22:09,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 10094.2). Total num frames: 80060416. Throughput: 0: 9844.6. Samples: 80056872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:22:09,012][65744] Avg episode reward: [(0, '4309.909')] +[2023-03-11 20:22:10,463][66031] Updated weights for policy 0, policy_version 156400 (0.0005) +[2023-03-11 20:22:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10094.2). Total num frames: 80109568. Throughput: 0: 9830.0. Samples: 80086752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:22:14,012][65744] Avg episode reward: [(0, '4150.208')] +[2023-03-11 20:22:14,704][66031] Updated weights for policy 0, policy_version 156480 (0.0005) +[2023-03-11 20:22:18,896][66031] Updated weights for policy 0, policy_version 156560 (0.0005) +[2023-03-11 20:22:19,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 10080.3). Total num frames: 80158720. Throughput: 0: 9769.8. Samples: 80145028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:22:19,012][65744] Avg episode reward: [(0, '3936.543')] +[2023-03-11 20:22:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000156560_80158720.pth... +[2023-03-11 20:22:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000155984_79863808.pth +[2023-03-11 20:22:23,170][66031] Updated weights for policy 0, policy_version 156640 (0.0005) +[2023-03-11 20:22:24,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 10080.3). Total num frames: 80203776. Throughput: 0: 9757.2. Samples: 80202820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:22:24,012][65744] Avg episode reward: [(0, '3991.339')] +[2023-03-11 20:22:27,393][66031] Updated weights for policy 0, policy_version 156720 (0.0005) +[2023-03-11 20:22:29,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 10080.3). Total num frames: 80252928. Throughput: 0: 9739.5. Samples: 80231928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:22:29,012][65744] Avg episode reward: [(0, '4347.980')] +[2023-03-11 20:22:31,642][66031] Updated weights for policy 0, policy_version 156800 (0.0005) +[2023-03-11 20:22:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 10066.4). Total num frames: 80302080. Throughput: 0: 9738.0. Samples: 80289792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:22:34,012][65744] Avg episode reward: [(0, '4348.351')] +[2023-03-11 20:22:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000156840_80302080.pth... +[2023-03-11 20:22:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000156272_80011264.pth +[2023-03-11 20:22:35,859][66031] Updated weights for policy 0, policy_version 156880 (0.0005) +[2023-03-11 20:22:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10066.4). Total num frames: 80351232. Throughput: 0: 9688.3. Samples: 80347244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:22:39,012][65744] Avg episode reward: [(0, '4093.140')] +[2023-03-11 20:22:40,105][66031] Updated weights for policy 0, policy_version 156960 (0.0005) +[2023-03-11 20:22:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10080.3). Total num frames: 80400384. Throughput: 0: 9711.7. Samples: 80377300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:22:44,012][65744] Avg episode reward: [(0, '4176.147')] +[2023-03-11 20:22:44,231][66031] Updated weights for policy 0, policy_version 157040 (0.0005) +[2023-03-11 20:22:48,385][66031] Updated weights for policy 0, policy_version 157120 (0.0005) +[2023-03-11 20:22:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 10066.4). Total num frames: 80449536. Throughput: 0: 9722.2. Samples: 80436476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:22:49,012][65744] Avg episode reward: [(0, '4217.532')] +[2023-03-11 20:22:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000157128_80449536.pth... +[2023-03-11 20:22:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000156560_80158720.pth +[2023-03-11 20:22:52,706][66031] Updated weights for policy 0, policy_version 157200 (0.0005) +[2023-03-11 20:22:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10066.4). Total num frames: 80498688. Throughput: 0: 9706.0. Samples: 80493644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:22:54,012][65744] Avg episode reward: [(0, '4215.040')] +[2023-03-11 20:22:56,941][66031] Updated weights for policy 0, policy_version 157280 (0.0005) +[2023-03-11 20:22:59,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 10052.6). Total num frames: 80543744. Throughput: 0: 9695.8. Samples: 80523064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:22:59,012][65744] Avg episode reward: [(0, '4150.224')] +[2023-03-11 20:23:01,236][66031] Updated weights for policy 0, policy_version 157360 (0.0005) +[2023-03-11 20:23:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 10052.6). Total num frames: 80592896. Throughput: 0: 9668.7. Samples: 80580120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:23:04,012][65744] Avg episode reward: [(0, '4232.382')] +[2023-03-11 20:23:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000157408_80592896.pth... +[2023-03-11 20:23:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000156840_80302080.pth +[2023-03-11 20:23:05,524][66031] Updated weights for policy 0, policy_version 157440 (0.0005) +[2023-03-11 20:23:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 10052.6). Total num frames: 80642048. Throughput: 0: 9665.1. Samples: 80637748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:23:09,012][65744] Avg episode reward: [(0, '4111.523')] +[2023-03-11 20:23:09,806][66031] Updated weights for policy 0, policy_version 157520 (0.0005) +[2023-03-11 20:23:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 10038.7). Total num frames: 80687104. Throughput: 0: 9656.4. Samples: 80666468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:23:14,012][65744] Avg episode reward: [(0, '3702.888')] +[2023-03-11 20:23:14,127][66031] Updated weights for policy 0, policy_version 157600 (0.0005) +[2023-03-11 20:23:18,456][66031] Updated weights for policy 0, policy_version 157680 (0.0005) +[2023-03-11 20:23:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 10052.6). Total num frames: 80736256. Throughput: 0: 9636.4. Samples: 80723428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:23:19,012][65744] Avg episode reward: [(0, '4117.305')] +[2023-03-11 20:23:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000157688_80736256.pth... +[2023-03-11 20:23:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000157128_80449536.pth +[2023-03-11 20:23:22,741][66031] Updated weights for policy 0, policy_version 157760 (0.0005) +[2023-03-11 20:23:24,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 10024.8). Total num frames: 80781312. Throughput: 0: 9616.3. Samples: 80779976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:23:24,012][65744] Avg episode reward: [(0, '4117.270')] +[2023-03-11 20:23:26,971][66031] Updated weights for policy 0, policy_version 157840 (0.0005) +[2023-03-11 20:23:29,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 10024.8). Total num frames: 80830464. Throughput: 0: 9593.6. Samples: 80809012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:23:29,012][65744] Avg episode reward: [(0, '3935.683')] +[2023-03-11 20:23:31,261][66031] Updated weights for policy 0, policy_version 157920 (0.0005) +[2023-03-11 20:23:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 10024.8). Total num frames: 80879616. Throughput: 0: 9564.5. Samples: 80866880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:23:34,012][65744] Avg episode reward: [(0, '4159.427')] +[2023-03-11 20:23:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000157968_80879616.pth... +[2023-03-11 20:23:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000157408_80592896.pth +[2023-03-11 20:23:35,545][66031] Updated weights for policy 0, policy_version 158000 (0.0005) +[2023-03-11 20:23:39,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 10024.8). Total num frames: 80928768. Throughput: 0: 9568.0. Samples: 80924204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:23:39,012][65744] Avg episode reward: [(0, '3730.886')] +[2023-03-11 20:23:39,805][66031] Updated weights for policy 0, policy_version 158080 (0.0005) +[2023-03-11 20:23:43,952][66031] Updated weights for policy 0, policy_version 158160 (0.0004) +[2023-03-11 20:23:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 10024.8). Total num frames: 80977920. Throughput: 0: 9563.2. Samples: 80953408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:23:44,012][65744] Avg episode reward: [(0, '3378.083')] +[2023-03-11 20:23:48,032][66031] Updated weights for policy 0, policy_version 158240 (0.0004) +[2023-03-11 20:23:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 10010.9). Total num frames: 81027072. Throughput: 0: 9626.5. Samples: 81013312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:23:49,012][65744] Avg episode reward: [(0, '3575.860')] +[2023-03-11 20:23:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000158256_81027072.pth... +[2023-03-11 20:23:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000157688_80736256.pth +[2023-03-11 20:23:52,029][66031] Updated weights for policy 0, policy_version 158320 (0.0003) +[2023-03-11 20:23:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 10010.9). Total num frames: 81076224. Throughput: 0: 9698.9. Samples: 81074196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:23:54,012][65744] Avg episode reward: [(0, '3705.739')] +[2023-03-11 20:23:55,992][66031] Updated weights for policy 0, policy_version 158400 (0.0004) +[2023-03-11 20:23:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 10024.8). Total num frames: 81129472. Throughput: 0: 9747.6. Samples: 81105108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:23:59,012][65744] Avg episode reward: [(0, '3937.356')] +[2023-03-11 20:23:59,926][66031] Updated weights for policy 0, policy_version 158480 (0.0005) +[2023-03-11 20:24:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 10010.9). Total num frames: 81178624. Throughput: 0: 9834.9. Samples: 81166000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:24:04,012][65744] Avg episode reward: [(0, '3899.238')] +[2023-03-11 20:24:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000158552_81178624.pth... +[2023-03-11 20:24:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000157968_80879616.pth +[2023-03-11 20:24:04,219][66031] Updated weights for policy 0, policy_version 158560 (0.0005) +[2023-03-11 20:24:08,489][66031] Updated weights for policy 0, policy_version 158640 (0.0005) +[2023-03-11 20:24:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9997.0). Total num frames: 81227776. Throughput: 0: 9860.1. Samples: 81223680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:24:09,023][65744] Avg episode reward: [(0, '4020.900')] +[2023-03-11 20:24:12,570][66031] Updated weights for policy 0, policy_version 158720 (0.0005) +[2023-03-11 20:24:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9997.0). Total num frames: 81276928. Throughput: 0: 9857.0. Samples: 81252576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:24:14,012][65744] Avg episode reward: [(0, '4274.511')] +[2023-03-11 20:24:16,712][66031] Updated weights for policy 0, policy_version 158800 (0.0005) +[2023-03-11 20:24:19,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9997.0). Total num frames: 81326080. Throughput: 0: 9917.4. Samples: 81313164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:24:19,012][65744] Avg episode reward: [(0, '4421.242')] +[2023-03-11 20:24:19,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000158840_81326080.pth... +[2023-03-11 20:24:19,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000158256_81027072.pth +[2023-03-11 20:24:20,923][66031] Updated weights for policy 0, policy_version 158880 (0.0005) +[2023-03-11 20:24:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.6, 300 sec: 9997.0). Total num frames: 81375232. Throughput: 0: 9933.2. Samples: 81371200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:24:24,012][65744] Avg episode reward: [(0, '4402.580')] +[2023-03-11 20:24:25,170][66031] Updated weights for policy 0, policy_version 158960 (0.0005) +[2023-03-11 20:24:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9997.0). Total num frames: 81424384. Throughput: 0: 9921.6. Samples: 81399880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:24:29,012][65744] Avg episode reward: [(0, '4417.566')] +[2023-03-11 20:24:29,383][66031] Updated weights for policy 0, policy_version 159040 (0.0005) +[2023-03-11 20:24:33,393][66031] Updated weights for policy 0, policy_version 159120 (0.0004) +[2023-03-11 20:24:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9997.0). Total num frames: 81473536. Throughput: 0: 9920.5. Samples: 81459736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:24:34,012][65744] Avg episode reward: [(0, '4256.073')] +[2023-03-11 20:24:34,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000159128_81473536.pth... +[2023-03-11 20:24:34,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000158552_81178624.pth +[2023-03-11 20:24:37,413][66031] Updated weights for policy 0, policy_version 159200 (0.0004) +[2023-03-11 20:24:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9983.1). Total num frames: 81522688. Throughput: 0: 9929.6. Samples: 81521028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:24:39,012][65744] Avg episode reward: [(0, '4009.702')] +[2023-03-11 20:24:41,443][66031] Updated weights for policy 0, policy_version 159280 (0.0003) +[2023-03-11 20:24:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9997.0). Total num frames: 81575936. Throughput: 0: 9918.1. Samples: 81551424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:24:44,012][65744] Avg episode reward: [(0, '4358.074')] +[2023-03-11 20:24:45,462][66031] Updated weights for policy 0, policy_version 159360 (0.0005) +[2023-03-11 20:24:49,014][65744] Fps is (10 sec: 10647.4, 60 sec: 10034.8, 300 sec: 9996.9). Total num frames: 81629184. Throughput: 0: 9930.0. Samples: 81612872. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:24:49,025][65744] Avg episode reward: [(0, '4031.372')] +[2023-03-11 20:24:49,028][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000159432_81629184.pth... +[2023-03-11 20:24:49,030][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000158840_81326080.pth +[2023-03-11 20:24:49,418][66031] Updated weights for policy 0, policy_version 159440 (0.0005) +[2023-03-11 20:24:53,460][66031] Updated weights for policy 0, policy_version 159520 (0.0005) +[2023-03-11 20:24:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9983.1). Total num frames: 81678336. Throughput: 0: 10012.8. Samples: 81674256. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:24:54,012][65744] Avg episode reward: [(0, '4382.826')] +[2023-03-11 20:24:57,525][66031] Updated weights for policy 0, policy_version 159600 (0.0005) +[2023-03-11 20:24:59,012][65744] Fps is (10 sec: 9832.5, 60 sec: 9966.9, 300 sec: 9969.3). Total num frames: 81727488. Throughput: 0: 10042.0. Samples: 81704468. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:24:59,012][65744] Avg episode reward: [(0, '4158.851')] +[2023-03-11 20:25:01,813][66031] Updated weights for policy 0, policy_version 159680 (0.0005) +[2023-03-11 20:25:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9969.2). Total num frames: 81776640. Throughput: 0: 9987.5. Samples: 81762600. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:25:04,012][65744] Avg episode reward: [(0, '4242.642')] +[2023-03-11 20:25:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000159720_81776640.pth... +[2023-03-11 20:25:04,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000159128_81473536.pth +[2023-03-11 20:25:05,823][66031] Updated weights for policy 0, policy_version 159760 (0.0005) +[2023-03-11 20:25:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9955.4). Total num frames: 81825792. Throughput: 0: 10058.7. Samples: 81823840. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:25:09,012][65744] Avg episode reward: [(0, '4108.867')] +[2023-03-11 20:25:09,822][66031] Updated weights for policy 0, policy_version 159840 (0.0004) +[2023-03-11 20:25:13,763][66031] Updated weights for policy 0, policy_version 159920 (0.0004) +[2023-03-11 20:25:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9969.3). Total num frames: 81879040. Throughput: 0: 10110.0. Samples: 81854828. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:25:14,012][65744] Avg episode reward: [(0, '4342.263')] +[2023-03-11 20:25:17,720][66031] Updated weights for policy 0, policy_version 160000 (0.0005) +[2023-03-11 20:25:19,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 9969.2). Total num frames: 81932288. Throughput: 0: 10161.0. Samples: 81916980. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:25:19,012][65744] Avg episode reward: [(0, '4176.334')] +[2023-03-11 20:25:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000160024_81932288.pth... +[2023-03-11 20:25:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000159432_81629184.pth +[2023-03-11 20:25:21,629][66031] Updated weights for policy 0, policy_version 160080 (0.0004) +[2023-03-11 20:25:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 9983.1). Total num frames: 81985536. Throughput: 0: 10206.7. Samples: 81980328. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:25:24,012][65744] Avg episode reward: [(0, '4222.939')] +[2023-03-11 20:25:25,517][66031] Updated weights for policy 0, policy_version 160160 (0.0004) +[2023-03-11 20:25:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9969.2). Total num frames: 82034688. Throughput: 0: 10217.2. Samples: 82011196. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:25:29,012][65744] Avg episode reward: [(0, '4293.626')] +[2023-03-11 20:25:29,523][66031] Updated weights for policy 0, policy_version 160240 (0.0005) +[2023-03-11 20:25:33,457][66031] Updated weights for policy 0, policy_version 160320 (0.0004) +[2023-03-11 20:25:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 9969.2). Total num frames: 82087936. Throughput: 0: 10236.6. Samples: 82073500. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:25:34,012][65744] Avg episode reward: [(0, '4131.308')] +[2023-03-11 20:25:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000160328_82087936.pth... +[2023-03-11 20:25:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000159720_81776640.pth +[2023-03-11 20:25:37,532][66031] Updated weights for policy 0, policy_version 160400 (0.0005) +[2023-03-11 20:25:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9955.4). Total num frames: 82137088. Throughput: 0: 10214.0. Samples: 82133884. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:25:39,012][65744] Avg episode reward: [(0, '4177.803')] +[2023-03-11 20:25:41,530][66031] Updated weights for policy 0, policy_version 160480 (0.0004) +[2023-03-11 20:25:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 9955.4). Total num frames: 82190336. Throughput: 0: 10236.6. Samples: 82165116. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:25:44,012][65744] Avg episode reward: [(0, '4075.606')] +[2023-03-11 20:25:45,417][66031] Updated weights for policy 0, policy_version 160560 (0.0004) +[2023-03-11 20:25:49,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10240.3, 300 sec: 9969.2). Total num frames: 82243584. Throughput: 0: 10327.5. Samples: 82227336. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:25:49,012][65744] Avg episode reward: [(0, '4057.379')] +[2023-03-11 20:25:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000160632_82243584.pth... +[2023-03-11 20:25:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000160024_81932288.pth +[2023-03-11 20:25:49,369][66031] Updated weights for policy 0, policy_version 160640 (0.0004) +[2023-03-11 20:25:53,404][66031] Updated weights for policy 0, policy_version 160720 (0.0005) +[2023-03-11 20:25:54,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 9955.4). Total num frames: 82292736. Throughput: 0: 10331.9. Samples: 82288776. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:25:54,023][65744] Avg episode reward: [(0, '3933.342')] +[2023-03-11 20:25:57,409][66031] Updated weights for policy 0, policy_version 160800 (0.0005) +[2023-03-11 20:25:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 9941.5). Total num frames: 82341888. Throughput: 0: 10337.8. Samples: 82320028. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:25:59,012][65744] Avg episode reward: [(0, '3882.112')] +[2023-03-11 20:26:01,629][66031] Updated weights for policy 0, policy_version 160880 (0.0005) +[2023-03-11 20:26:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 9927.6). Total num frames: 82391040. Throughput: 0: 10261.8. Samples: 82378760. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:26:04,012][65744] Avg episode reward: [(0, '4033.881')] +[2023-03-11 20:26:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000160920_82391040.pth... +[2023-03-11 20:26:04,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000160328_82087936.pth +[2023-03-11 20:26:05,905][66031] Updated weights for policy 0, policy_version 160960 (0.0005) +[2023-03-11 20:26:09,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 9927.6). Total num frames: 82440192. Throughput: 0: 10129.6. Samples: 82436160. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:26:09,012][65744] Avg episode reward: [(0, '3829.230')] +[2023-03-11 20:26:10,155][66031] Updated weights for policy 0, policy_version 161040 (0.0005) +[2023-03-11 20:26:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 9913.7). Total num frames: 82489344. Throughput: 0: 10081.0. Samples: 82464840. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:26:14,012][65744] Avg episode reward: [(0, '3731.488')] +[2023-03-11 20:26:14,432][66031] Updated weights for policy 0, policy_version 161120 (0.0005) +[2023-03-11 20:26:18,583][66031] Updated weights for policy 0, policy_version 161200 (0.0005) +[2023-03-11 20:26:19,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 9899.8). Total num frames: 82538496. Throughput: 0: 9981.6. Samples: 82522672. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:26:19,012][65744] Avg episode reward: [(0, '3797.545')] +[2023-03-11 20:26:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000161208_82538496.pth... +[2023-03-11 20:26:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000160632_82243584.pth +[2023-03-11 20:26:22,607][66031] Updated weights for policy 0, policy_version 161280 (0.0005) +[2023-03-11 20:26:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 82587648. Throughput: 0: 9994.0. Samples: 82583616. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:26:24,012][65744] Avg episode reward: [(0, '3818.966')] +[2023-03-11 20:26:26,625][66031] Updated weights for policy 0, policy_version 161360 (0.0004) +[2023-03-11 20:26:29,012][65744] Fps is (10 sec: 10240.2, 60 sec: 10103.5, 300 sec: 9899.8). Total num frames: 82640896. Throughput: 0: 9991.5. Samples: 82614732. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:26:29,022][65744] Avg episode reward: [(0, '3827.863')] +[2023-03-11 20:26:30,604][66031] Updated weights for policy 0, policy_version 161440 (0.0004) +[2023-03-11 20:26:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 82690048. Throughput: 0: 9979.3. Samples: 82676404. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:26:34,023][65744] Avg episode reward: [(0, '3943.079')] +[2023-03-11 20:26:34,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000161504_82690048.pth... +[2023-03-11 20:26:34,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000160920_82391040.pth +[2023-03-11 20:26:34,588][66031] Updated weights for policy 0, policy_version 161520 (0.0004) +[2023-03-11 20:26:38,605][66031] Updated weights for policy 0, policy_version 161600 (0.0004) +[2023-03-11 20:26:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 82739200. Throughput: 0: 9975.2. Samples: 82737660. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:26:39,012][65744] Avg episode reward: [(0, '3706.334')] +[2023-03-11 20:26:42,616][66031] Updated weights for policy 0, policy_version 161680 (0.0005) +[2023-03-11 20:26:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 82792448. Throughput: 0: 9952.3. Samples: 82767880. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:26:44,012][65744] Avg episode reward: [(0, '3777.231')] +[2023-03-11 20:26:46,567][66031] Updated weights for policy 0, policy_version 161760 (0.0004) +[2023-03-11 20:26:49,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10035.2, 300 sec: 9941.5). Total num frames: 82845696. Throughput: 0: 10020.6. Samples: 82829688. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:26:49,012][65744] Avg episode reward: [(0, '4092.019')] +[2023-03-11 20:26:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000161808_82845696.pth... +[2023-03-11 20:26:49,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000161208_82538496.pth +[2023-03-11 20:26:50,577][66031] Updated weights for policy 0, policy_version 161840 (0.0004) +[2023-03-11 20:26:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9941.5). Total num frames: 82894848. Throughput: 0: 10082.8. Samples: 82889888. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:26:54,012][65744] Avg episode reward: [(0, '4277.365')] +[2023-03-11 20:26:54,840][66031] Updated weights for policy 0, policy_version 161920 (0.0005) +[2023-03-11 20:26:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 82939904. Throughput: 0: 10087.4. Samples: 82918772. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:26:59,012][65744] Avg episode reward: [(0, '4360.816')] +[2023-03-11 20:26:59,089][66031] Updated weights for policy 0, policy_version 162000 (0.0005) +[2023-03-11 20:27:03,341][66031] Updated weights for policy 0, policy_version 162080 (0.0005) +[2023-03-11 20:27:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 82989056. Throughput: 0: 10091.2. Samples: 82976776. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:27:04,012][65744] Avg episode reward: [(0, '4486.548')] +[2023-03-11 20:27:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000162088_82989056.pth... +[2023-03-11 20:27:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000161504_82690048.pth +[2023-03-11 20:27:07,681][66031] Updated weights for policy 0, policy_version 162160 (0.0005) +[2023-03-11 20:27:09,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 83038208. Throughput: 0: 9997.6. Samples: 83033508. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:27:09,012][65744] Avg episode reward: [(0, '4391.287')] +[2023-03-11 20:27:11,687][66031] Updated weights for policy 0, policy_version 162240 (0.0004) +[2023-03-11 20:27:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 83087360. Throughput: 0: 9982.9. Samples: 83063964. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:27:14,012][65744] Avg episode reward: [(0, '4401.649')] +[2023-03-11 20:27:15,722][66031] Updated weights for policy 0, policy_version 162320 (0.0004) +[2023-03-11 20:27:19,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 83136512. Throughput: 0: 9968.7. Samples: 83124996. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:27:19,012][65744] Avg episode reward: [(0, '4356.358')] +[2023-03-11 20:27:19,076][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000162384_83140608.pth... +[2023-03-11 20:27:19,078][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000161808_82845696.pth +[2023-03-11 20:27:19,904][66031] Updated weights for policy 0, policy_version 162400 (0.0005) +[2023-03-11 20:27:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 83185664. Throughput: 0: 9914.7. Samples: 83183824. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:27:24,012][65744] Avg episode reward: [(0, '4449.789')] +[2023-03-11 20:27:24,093][66031] Updated weights for policy 0, policy_version 162480 (0.0005) +[2023-03-11 20:27:28,400][66031] Updated weights for policy 0, policy_version 162560 (0.0004) +[2023-03-11 20:27:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.6, 300 sec: 9941.5). Total num frames: 83234816. Throughput: 0: 9879.7. Samples: 83212468. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:27:29,012][65744] Avg episode reward: [(0, '4276.705')] +[2023-03-11 20:27:32,699][66031] Updated weights for policy 0, policy_version 162640 (0.0005) +[2023-03-11 20:27:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 83283968. Throughput: 0: 9772.7. Samples: 83269460. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:27:34,012][65744] Avg episode reward: [(0, '4287.783')] +[2023-03-11 20:27:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000162664_83283968.pth... +[2023-03-11 20:27:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000162088_82989056.pth +[2023-03-11 20:27:37,016][66031] Updated weights for policy 0, policy_version 162720 (0.0005) +[2023-03-11 20:27:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9927.6). Total num frames: 83329024. Throughput: 0: 9690.3. Samples: 83325952. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:27:39,023][65744] Avg episode reward: [(0, '4265.242')] +[2023-03-11 20:27:41,332][66031] Updated weights for policy 0, policy_version 162800 (0.0005) +[2023-03-11 20:27:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 83378176. Throughput: 0: 9686.0. Samples: 83354644. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:27:44,012][65744] Avg episode reward: [(0, '4281.336')] +[2023-03-11 20:27:45,537][66031] Updated weights for policy 0, policy_version 162880 (0.0005) +[2023-03-11 20:27:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9927.6). Total num frames: 83427328. Throughput: 0: 9693.4. Samples: 83412980. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:27:49,012][65744] Avg episode reward: [(0, '4389.022')] +[2023-03-11 20:27:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000162944_83427328.pth... +[2023-03-11 20:27:49,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000162384_83140608.pth +[2023-03-11 20:27:49,785][66031] Updated weights for policy 0, policy_version 162960 (0.0005) +[2023-03-11 20:27:53,952][66031] Updated weights for policy 0, policy_version 163040 (0.0005) +[2023-03-11 20:27:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9941.5). Total num frames: 83476480. Throughput: 0: 9745.5. Samples: 83472056. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:27:54,012][65744] Avg episode reward: [(0, '4556.606')] +[2023-03-11 20:27:58,197][66031] Updated weights for policy 0, policy_version 163120 (0.0005) +[2023-03-11 20:27:59,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9927.6). Total num frames: 83521536. Throughput: 0: 9702.5. Samples: 83500576. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:27:59,012][65744] Avg episode reward: [(0, '4480.457')] +[2023-03-11 20:28:02,492][66031] Updated weights for policy 0, policy_version 163200 (0.0005) +[2023-03-11 20:28:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9927.6). Total num frames: 83570688. Throughput: 0: 9631.2. Samples: 83558400. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:28:04,012][65744] Avg episode reward: [(0, '4475.953')] +[2023-03-11 20:28:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000163224_83570688.pth... +[2023-03-11 20:28:04,029][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000162664_83283968.pth +[2023-03-11 20:28:06,743][66031] Updated weights for policy 0, policy_version 163280 (0.0005) +[2023-03-11 20:28:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9941.5). Total num frames: 83619840. Throughput: 0: 9604.1. Samples: 83616008. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:28:09,012][65744] Avg episode reward: [(0, '4362.855')] +[2023-03-11 20:28:10,963][66031] Updated weights for policy 0, policy_version 163360 (0.0005) +[2023-03-11 20:28:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9941.5). Total num frames: 83668992. Throughput: 0: 9602.8. Samples: 83644592. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:28:14,012][65744] Avg episode reward: [(0, '4382.574')] +[2023-03-11 20:28:15,240][66031] Updated weights for policy 0, policy_version 163440 (0.0005) +[2023-03-11 20:28:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9941.5). Total num frames: 83714048. Throughput: 0: 9617.5. Samples: 83702248. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:28:19,012][65744] Avg episode reward: [(0, '4126.557')] +[2023-03-11 20:28:19,055][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000163512_83718144.pth... +[2023-03-11 20:28:19,056][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000162944_83427328.pth +[2023-03-11 20:28:19,469][66031] Updated weights for policy 0, policy_version 163520 (0.0004) +[2023-03-11 20:28:23,700][66031] Updated weights for policy 0, policy_version 163600 (0.0005) +[2023-03-11 20:28:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9941.5). Total num frames: 83763200. Throughput: 0: 9657.0. Samples: 83760516. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:28:24,012][65744] Avg episode reward: [(0, '3995.853')] +[2023-03-11 20:28:28,004][66031] Updated weights for policy 0, policy_version 163680 (0.0005) +[2023-03-11 20:28:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9941.5). Total num frames: 83812352. Throughput: 0: 9667.3. Samples: 83789672. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:28:29,012][65744] Avg episode reward: [(0, '4348.545')] +[2023-03-11 20:28:32,320][66031] Updated weights for policy 0, policy_version 163760 (0.0005) +[2023-03-11 20:28:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9927.6). Total num frames: 83857408. Throughput: 0: 9629.8. Samples: 83846320. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:28:34,012][65744] Avg episode reward: [(0, '4279.131')] +[2023-03-11 20:28:34,047][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000163792_83861504.pth... +[2023-03-11 20:28:34,049][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000163224_83570688.pth +[2023-03-11 20:28:36,649][66031] Updated weights for policy 0, policy_version 163840 (0.0005) +[2023-03-11 20:28:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9927.6). Total num frames: 83906560. Throughput: 0: 9585.1. Samples: 83903384. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:28:39,012][65744] Avg episode reward: [(0, '4313.550')] +[2023-03-11 20:28:40,966][66031] Updated weights for policy 0, policy_version 163920 (0.0005) +[2023-03-11 20:28:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9927.6). Total num frames: 83955712. Throughput: 0: 9577.2. Samples: 83931552. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:28:44,012][65744] Avg episode reward: [(0, '4323.460')] +[2023-03-11 20:28:45,008][66031] Updated weights for policy 0, policy_version 164000 (0.0005) +[2023-03-11 20:28:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9927.6). Total num frames: 84004864. Throughput: 0: 9650.0. Samples: 83992648. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:28:49,012][65744] Avg episode reward: [(0, '4431.655')] +[2023-03-11 20:28:49,025][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000164080_84008960.pth... +[2023-03-11 20:28:49,025][66031] Updated weights for policy 0, policy_version 164080 (0.0005) +[2023-03-11 20:28:49,026][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000163512_83718144.pth +[2023-03-11 20:28:53,012][66031] Updated weights for policy 0, policy_version 164160 (0.0004) +[2023-03-11 20:28:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9927.6). Total num frames: 84058112. Throughput: 0: 9738.3. Samples: 84054232. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:28:54,012][65744] Avg episode reward: [(0, '4393.075')] +[2023-03-11 20:28:57,003][66031] Updated weights for policy 0, policy_version 164240 (0.0003) +[2023-03-11 20:28:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 84107264. Throughput: 0: 9804.9. Samples: 84085812. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:28:59,012][65744] Avg episode reward: [(0, '4317.146')] +[2023-03-11 20:29:01,253][66031] Updated weights for policy 0, policy_version 164320 (0.0005) +[2023-03-11 20:29:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 84156416. Throughput: 0: 9809.4. Samples: 84143672. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:29:04,012][65744] Avg episode reward: [(0, '4156.861')] +[2023-03-11 20:29:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000164368_84156416.pth... +[2023-03-11 20:29:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000163792_83861504.pth +[2023-03-11 20:29:05,511][66031] Updated weights for policy 0, policy_version 164400 (0.0005) +[2023-03-11 20:29:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 84205568. Throughput: 0: 9809.0. Samples: 84201920. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:29:09,012][65744] Avg episode reward: [(0, '4257.479')] +[2023-03-11 20:29:09,659][66031] Updated weights for policy 0, policy_version 164480 (0.0005) +[2023-03-11 20:29:13,779][66031] Updated weights for policy 0, policy_version 164560 (0.0004) +[2023-03-11 20:29:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 84254720. Throughput: 0: 9789.9. Samples: 84230216. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:29:14,012][65744] Avg episode reward: [(0, '4188.479')] +[2023-03-11 20:29:17,775][66031] Updated weights for policy 0, policy_version 164640 (0.0004) +[2023-03-11 20:29:19,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 84307968. Throughput: 0: 9902.7. Samples: 84291944. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:29:19,012][65744] Avg episode reward: [(0, '4373.734')] +[2023-03-11 20:29:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000164664_84307968.pth... +[2023-03-11 20:29:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000164080_84008960.pth +[2023-03-11 20:29:21,748][66031] Updated weights for policy 0, policy_version 164720 (0.0005) +[2023-03-11 20:29:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 84357120. Throughput: 0: 10006.7. Samples: 84353684. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:29:24,012][65744] Avg episode reward: [(0, '4373.466')] +[2023-03-11 20:29:25,751][66031] Updated weights for policy 0, policy_version 164800 (0.0004) +[2023-03-11 20:29:29,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 84406272. Throughput: 0: 10068.4. Samples: 84384628. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:29:29,012][65744] Avg episode reward: [(0, '4486.742')] +[2023-03-11 20:29:29,951][66031] Updated weights for policy 0, policy_version 164880 (0.0005) +[2023-03-11 20:29:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 84455424. Throughput: 0: 10012.4. Samples: 84443208. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:29:34,012][65744] Avg episode reward: [(0, '4344.155')] +[2023-03-11 20:29:34,018][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000164960_84459520.pth... +[2023-03-11 20:29:34,018][66031] Updated weights for policy 0, policy_version 164960 (0.0005) +[2023-03-11 20:29:34,020][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000164368_84156416.pth +[2023-03-11 20:29:37,951][66031] Updated weights for policy 0, policy_version 165040 (0.0005) +[2023-03-11 20:29:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9941.5). Total num frames: 84508672. Throughput: 0: 10029.0. Samples: 84505536. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:29:39,012][65744] Avg episode reward: [(0, '4025.392')] +[2023-03-11 20:29:41,978][66031] Updated weights for policy 0, policy_version 165120 (0.0004) +[2023-03-11 20:29:44,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 84561920. Throughput: 0: 10011.4. Samples: 84536324. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:29:44,012][65744] Avg episode reward: [(0, '4236.195')] +[2023-03-11 20:29:46,039][66031] Updated weights for policy 0, policy_version 165200 (0.0004) +[2023-03-11 20:29:49,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 84611072. Throughput: 0: 10069.5. Samples: 84596800. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:29:49,012][65744] Avg episode reward: [(0, '4099.542')] +[2023-03-11 20:29:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000165256_84611072.pth... +[2023-03-11 20:29:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000164664_84307968.pth +[2023-03-11 20:29:50,015][66031] Updated weights for policy 0, policy_version 165280 (0.0004) +[2023-03-11 20:29:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9941.5). Total num frames: 84660224. Throughput: 0: 10128.0. Samples: 84657680. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:29:54,012][65744] Avg episode reward: [(0, '4236.002')] +[2023-03-11 20:29:54,111][66031] Updated weights for policy 0, policy_version 165360 (0.0005) +[2023-03-11 20:29:58,063][66031] Updated weights for policy 0, policy_version 165440 (0.0004) +[2023-03-11 20:29:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9955.4). Total num frames: 84713472. Throughput: 0: 10183.9. Samples: 84688492. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:29:59,012][65744] Avg episode reward: [(0, '4313.235')] +[2023-03-11 20:30:02,257][66031] Updated weights for policy 0, policy_version 165520 (0.0005) +[2023-03-11 20:30:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9955.4). Total num frames: 84762624. Throughput: 0: 10133.6. Samples: 84747956. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:30:04,012][65744] Avg episode reward: [(0, '4340.243')] +[2023-03-11 20:30:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000165552_84762624.pth... +[2023-03-11 20:30:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000164960_84459520.pth +[2023-03-11 20:30:06,520][66031] Updated weights for policy 0, policy_version 165600 (0.0005) +[2023-03-11 20:30:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 84807680. Throughput: 0: 10040.1. Samples: 84805488. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 20:30:09,012][65744] Avg episode reward: [(0, '4332.183')] +[2023-03-11 20:30:10,857][66031] Updated weights for policy 0, policy_version 165680 (0.0005) +[2023-03-11 20:30:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 84856832. Throughput: 0: 9987.9. Samples: 84834084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:30:14,012][65744] Avg episode reward: [(0, '4316.073')] +[2023-03-11 20:30:14,989][66031] Updated weights for policy 0, policy_version 165760 (0.0005) +[2023-03-11 20:30:19,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 84905984. Throughput: 0: 10012.2. Samples: 84893760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:30:19,012][65744] Avg episode reward: [(0, '4452.252')] +[2023-03-11 20:30:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000165832_84905984.pth... +[2023-03-11 20:30:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000165256_84611072.pth +[2023-03-11 20:30:19,197][66031] Updated weights for policy 0, policy_version 165840 (0.0005) +[2023-03-11 20:30:23,509][66031] Updated weights for policy 0, policy_version 165920 (0.0005) +[2023-03-11 20:30:24,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 84955136. Throughput: 0: 9900.1. Samples: 84951040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:30:24,012][65744] Avg episode reward: [(0, '4359.476')] +[2023-03-11 20:30:27,785][66031] Updated weights for policy 0, policy_version 166000 (0.0005) +[2023-03-11 20:30:29,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 85000192. Throughput: 0: 9844.9. Samples: 84979344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:30:29,012][65744] Avg episode reward: [(0, '4235.675')] +[2023-03-11 20:30:32,080][66031] Updated weights for policy 0, policy_version 166080 (0.0004) +[2023-03-11 20:30:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 85049344. Throughput: 0: 9781.2. Samples: 85036952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:30:34,012][65744] Avg episode reward: [(0, '4278.230')] +[2023-03-11 20:30:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000166112_85049344.pth... +[2023-03-11 20:30:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000165552_84762624.pth +[2023-03-11 20:30:36,435][66031] Updated weights for policy 0, policy_version 166160 (0.0005) +[2023-03-11 20:30:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9858.2). Total num frames: 85098496. Throughput: 0: 9688.5. Samples: 85093664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:30:39,012][65744] Avg episode reward: [(0, '3980.804')] +[2023-03-11 20:30:40,701][66031] Updated weights for policy 0, policy_version 166240 (0.0005) +[2023-03-11 20:30:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9830.4). Total num frames: 85143552. Throughput: 0: 9644.3. Samples: 85122484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:30:44,012][65744] Avg episode reward: [(0, '4319.028')] +[2023-03-11 20:30:45,005][66031] Updated weights for policy 0, policy_version 166320 (0.0005) +[2023-03-11 20:30:49,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9830.4). Total num frames: 85192704. Throughput: 0: 9590.9. Samples: 85179548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:30:49,012][65744] Avg episode reward: [(0, '4392.520')] +[2023-03-11 20:30:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000166392_85192704.pth... +[2023-03-11 20:30:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000165832_84905984.pth +[2023-03-11 20:30:49,284][66031] Updated weights for policy 0, policy_version 166400 (0.0005) +[2023-03-11 20:30:53,595][66031] Updated weights for policy 0, policy_version 166480 (0.0005) +[2023-03-11 20:30:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9830.4). Total num frames: 85241856. Throughput: 0: 9583.2. Samples: 85236732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:30:54,012][65744] Avg episode reward: [(0, '4260.254')] +[2023-03-11 20:30:57,845][66031] Updated weights for policy 0, policy_version 166560 (0.0005) +[2023-03-11 20:30:59,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9816.5). Total num frames: 85286912. Throughput: 0: 9588.7. Samples: 85265576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:30:59,012][65744] Avg episode reward: [(0, '4326.787')] +[2023-03-11 20:31:02,131][66031] Updated weights for policy 0, policy_version 166640 (0.0005) +[2023-03-11 20:31:04,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9816.5). Total num frames: 85336064. Throughput: 0: 9544.5. Samples: 85323264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:31:04,012][65744] Avg episode reward: [(0, '4412.820')] +[2023-03-11 20:31:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000166672_85336064.pth... +[2023-03-11 20:31:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000166112_85049344.pth +[2023-03-11 20:31:06,374][66031] Updated weights for policy 0, policy_version 166720 (0.0005) +[2023-03-11 20:31:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9816.5). Total num frames: 85385216. Throughput: 0: 9556.6. Samples: 85381088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:31:09,012][65744] Avg episode reward: [(0, '4406.191')] +[2023-03-11 20:31:10,605][66031] Updated weights for policy 0, policy_version 166800 (0.0005) +[2023-03-11 20:31:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9802.6). Total num frames: 85430272. Throughput: 0: 9565.7. Samples: 85409800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:31:14,012][65744] Avg episode reward: [(0, '4274.441')] +[2023-03-11 20:31:14,968][66031] Updated weights for policy 0, policy_version 166880 (0.0005) +[2023-03-11 20:31:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9802.6). Total num frames: 85479424. Throughput: 0: 9559.6. Samples: 85467136. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:31:19,012][65744] Avg episode reward: [(0, '4192.316')] +[2023-03-11 20:31:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000166952_85479424.pth... +[2023-03-11 20:31:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000166392_85192704.pth +[2023-03-11 20:31:19,212][66031] Updated weights for policy 0, policy_version 166960 (0.0005) +[2023-03-11 20:31:23,353][66031] Updated weights for policy 0, policy_version 167040 (0.0004) +[2023-03-11 20:31:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9788.7). Total num frames: 85528576. Throughput: 0: 9591.1. Samples: 85525264. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:31:24,012][65744] Avg episode reward: [(0, '4515.928')] +[2023-03-11 20:31:27,548][66031] Updated weights for policy 0, policy_version 167120 (0.0004) +[2023-03-11 20:31:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9788.7). Total num frames: 85577728. Throughput: 0: 9613.4. Samples: 85555088. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:31:29,012][65744] Avg episode reward: [(0, '4140.583')] +[2023-03-11 20:31:31,592][66031] Updated weights for policy 0, policy_version 167200 (0.0004) +[2023-03-11 20:31:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9788.7). Total num frames: 85626880. Throughput: 0: 9669.2. Samples: 85614664. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:31:34,012][65744] Avg episode reward: [(0, '4145.770')] +[2023-03-11 20:31:34,032][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000167248_85630976.pth... +[2023-03-11 20:31:34,034][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000166672_85336064.pth +[2023-03-11 20:31:35,725][66031] Updated weights for policy 0, policy_version 167280 (0.0004) +[2023-03-11 20:31:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 85676032. Throughput: 0: 9706.6. Samples: 85673528. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:31:39,012][65744] Avg episode reward: [(0, '4295.717')] +[2023-03-11 20:31:39,907][66031] Updated weights for policy 0, policy_version 167360 (0.0004) +[2023-03-11 20:31:43,889][66031] Updated weights for policy 0, policy_version 167440 (0.0005) +[2023-03-11 20:31:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 85729280. Throughput: 0: 9740.1. Samples: 85703880. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:31:44,012][65744] Avg episode reward: [(0, '4234.938')] +[2023-03-11 20:31:47,899][66031] Updated weights for policy 0, policy_version 167520 (0.0005) +[2023-03-11 20:31:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 85778432. Throughput: 0: 9833.2. Samples: 85765756. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:31:49,012][65744] Avg episode reward: [(0, '3431.698')] +[2023-03-11 20:31:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000167536_85778432.pth... +[2023-03-11 20:31:49,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000166952_85479424.pth +[2023-03-11 20:31:52,065][66031] Updated weights for policy 0, policy_version 167600 (0.0005) +[2023-03-11 20:31:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 85827584. Throughput: 0: 9880.3. Samples: 85825700. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:31:54,012][65744] Avg episode reward: [(0, '3784.875')] +[2023-03-11 20:31:56,090][66031] Updated weights for policy 0, policy_version 167680 (0.0004) +[2023-03-11 20:31:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9788.7). Total num frames: 85876736. Throughput: 0: 9916.6. Samples: 85856048. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:31:59,012][65744] Avg episode reward: [(0, '3920.287')] +[2023-03-11 20:32:00,375][66031] Updated weights for policy 0, policy_version 167760 (0.0005) +[2023-03-11 20:32:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9788.7). Total num frames: 85925888. Throughput: 0: 9921.4. Samples: 85913600. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:32:04,012][65744] Avg episode reward: [(0, '3452.718')] +[2023-03-11 20:32:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000167824_85925888.pth... +[2023-03-11 20:32:04,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000167248_85630976.pth +[2023-03-11 20:32:04,512][66031] Updated weights for policy 0, policy_version 167840 (0.0004) +[2023-03-11 20:32:08,736][66031] Updated weights for policy 0, policy_version 167920 (0.0004) +[2023-03-11 20:32:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9788.7). Total num frames: 85975040. Throughput: 0: 9925.2. Samples: 85971896. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:32:09,012][65744] Avg episode reward: [(0, '3424.109')] +[2023-03-11 20:32:12,932][66031] Updated weights for policy 0, policy_version 168000 (0.0005) +[2023-03-11 20:32:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9788.7). Total num frames: 86024192. Throughput: 0: 9920.3. Samples: 86001500. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:32:14,012][65744] Avg episode reward: [(0, '4156.461')] +[2023-03-11 20:32:17,205][66031] Updated weights for policy 0, policy_version 168080 (0.0005) +[2023-03-11 20:32:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9788.7). Total num frames: 86073344. Throughput: 0: 9886.8. Samples: 86059568. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:32:19,012][65744] Avg episode reward: [(0, '4372.404')] +[2023-03-11 20:32:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000168112_86073344.pth... +[2023-03-11 20:32:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000167536_85778432.pth +[2023-03-11 20:32:21,424][66031] Updated weights for policy 0, policy_version 168160 (0.0005) +[2023-03-11 20:32:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9788.7). Total num frames: 86122496. Throughput: 0: 9869.7. Samples: 86117664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:32:24,012][65744] Avg episode reward: [(0, '4292.128')] +[2023-03-11 20:32:25,648][66031] Updated weights for policy 0, policy_version 168240 (0.0005) +[2023-03-11 20:32:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9774.9). Total num frames: 86167552. Throughput: 0: 9848.7. Samples: 86147072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:32:29,012][65744] Avg episode reward: [(0, '4284.016')] +[2023-03-11 20:32:29,959][66031] Updated weights for policy 0, policy_version 168320 (0.0005) +[2023-03-11 20:32:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9788.7). Total num frames: 86216704. Throughput: 0: 9748.0. Samples: 86204416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:32:34,012][65744] Avg episode reward: [(0, '4435.191')] +[2023-03-11 20:32:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000168392_86216704.pth... +[2023-03-11 20:32:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000167824_85925888.pth +[2023-03-11 20:32:34,174][66031] Updated weights for policy 0, policy_version 168400 (0.0005) +[2023-03-11 20:32:38,413][66031] Updated weights for policy 0, policy_version 168480 (0.0005) +[2023-03-11 20:32:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9788.7). Total num frames: 86265856. Throughput: 0: 9691.8. Samples: 86261832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:32:39,012][65744] Avg episode reward: [(0, '4368.008')] +[2023-03-11 20:32:42,613][66031] Updated weights for policy 0, policy_version 168560 (0.0005) +[2023-03-11 20:32:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 86315008. Throughput: 0: 9671.7. Samples: 86291276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:32:44,012][65744] Avg episode reward: [(0, '4193.766')] +[2023-03-11 20:32:46,926][66031] Updated weights for policy 0, policy_version 168640 (0.0005) +[2023-03-11 20:32:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 86360064. Throughput: 0: 9669.5. Samples: 86348728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:32:49,012][65744] Avg episode reward: [(0, '4407.625')] +[2023-03-11 20:32:49,044][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000168680_86364160.pth... +[2023-03-11 20:32:49,046][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000168112_86073344.pth +[2023-03-11 20:32:51,138][66031] Updated weights for policy 0, policy_version 168720 (0.0004) +[2023-03-11 20:32:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 86409216. Throughput: 0: 9666.0. Samples: 86406864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:32:54,023][65744] Avg episode reward: [(0, '4556.220')] +[2023-03-11 20:32:55,435][66031] Updated weights for policy 0, policy_version 168800 (0.0005) +[2023-03-11 20:32:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 86458368. Throughput: 0: 9641.0. Samples: 86435344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:32:59,012][65744] Avg episode reward: [(0, '4405.559')] +[2023-03-11 20:32:59,738][66031] Updated weights for policy 0, policy_version 168880 (0.0005) +[2023-03-11 20:33:03,990][66031] Updated weights for policy 0, policy_version 168960 (0.0005) +[2023-03-11 20:33:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 86507520. Throughput: 0: 9619.6. Samples: 86492448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:33:04,012][65744] Avg episode reward: [(0, '4461.673')] +[2023-03-11 20:33:04,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000168960_86507520.pth... +[2023-03-11 20:33:04,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000168392_86216704.pth +[2023-03-11 20:33:08,292][66031] Updated weights for policy 0, policy_version 169040 (0.0005) +[2023-03-11 20:33:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 86552576. Throughput: 0: 9608.5. Samples: 86550048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:33:09,012][65744] Avg episode reward: [(0, '4541.236')] +[2023-03-11 20:33:12,493][66031] Updated weights for policy 0, policy_version 169120 (0.0004) +[2023-03-11 20:33:14,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9788.7). Total num frames: 86601728. Throughput: 0: 9603.6. Samples: 86579232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:33:14,012][65744] Avg episode reward: [(0, '4452.329')] +[2023-03-11 20:33:16,816][66031] Updated weights for policy 0, policy_version 169200 (0.0005) +[2023-03-11 20:33:19,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9788.7). Total num frames: 86650880. Throughput: 0: 9595.2. Samples: 86636200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:33:19,012][65744] Avg episode reward: [(0, '4486.977')] +[2023-03-11 20:33:19,027][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000169240_86650880.pth... +[2023-03-11 20:33:19,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000168680_86364160.pth +[2023-03-11 20:33:21,113][66031] Updated weights for policy 0, policy_version 169280 (0.0005) +[2023-03-11 20:33:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9774.9). Total num frames: 86695936. Throughput: 0: 9592.4. Samples: 86693488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:33:24,012][65744] Avg episode reward: [(0, '4543.153')] +[2023-03-11 20:33:25,452][66031] Updated weights for policy 0, policy_version 169360 (0.0005) +[2023-03-11 20:33:29,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9788.7). Total num frames: 86745088. Throughput: 0: 9569.2. Samples: 86721892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:33:29,012][65744] Avg episode reward: [(0, '4568.495')] +[2023-03-11 20:33:29,581][66031] Updated weights for policy 0, policy_version 169440 (0.0005) +[2023-03-11 20:33:33,560][66031] Updated weights for policy 0, policy_version 169520 (0.0004) +[2023-03-11 20:33:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9802.6). Total num frames: 86798336. Throughput: 0: 9644.5. Samples: 86782728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:33:34,012][65744] Avg episode reward: [(0, '4516.566')] +[2023-03-11 20:33:34,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000169528_86798336.pth... +[2023-03-11 20:33:34,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000168960_86507520.pth +[2023-03-11 20:33:37,595][66031] Updated weights for policy 0, policy_version 169600 (0.0005) +[2023-03-11 20:33:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9802.6). Total num frames: 86847488. Throughput: 0: 9706.7. Samples: 86843664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:33:39,012][65744] Avg episode reward: [(0, '4480.758')] +[2023-03-11 20:33:41,586][66031] Updated weights for policy 0, policy_version 169680 (0.0005) +[2023-03-11 20:33:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9816.5). Total num frames: 86900736. Throughput: 0: 9772.3. Samples: 86875096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:33:44,012][65744] Avg episode reward: [(0, '4463.246')] +[2023-03-11 20:33:45,503][66031] Updated weights for policy 0, policy_version 169760 (0.0004) +[2023-03-11 20:33:49,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9830.4, 300 sec: 9802.6). Total num frames: 86949888. Throughput: 0: 9871.3. Samples: 86936656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:33:49,012][65744] Avg episode reward: [(0, '4539.937')] +[2023-03-11 20:33:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000169824_86949888.pth... +[2023-03-11 20:33:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000169240_86650880.pth +[2023-03-11 20:33:49,736][66031] Updated weights for policy 0, policy_version 169840 (0.0005) +[2023-03-11 20:33:53,913][66031] Updated weights for policy 0, policy_version 169920 (0.0005) +[2023-03-11 20:33:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9802.6). Total num frames: 86999040. Throughput: 0: 9886.8. Samples: 86994952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:33:54,012][65744] Avg episode reward: [(0, '4347.685')] +[2023-03-11 20:33:58,128][66031] Updated weights for policy 0, policy_version 170000 (0.0005) +[2023-03-11 20:33:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9802.6). Total num frames: 87048192. Throughput: 0: 9875.4. Samples: 87023624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:33:59,012][65744] Avg episode reward: [(0, '3784.618')] +[2023-03-11 20:34:02,412][66031] Updated weights for policy 0, policy_version 170080 (0.0005) +[2023-03-11 20:34:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 87093248. Throughput: 0: 9885.2. Samples: 87081032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:34:04,012][65744] Avg episode reward: [(0, '4337.304')] +[2023-03-11 20:34:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000170104_87093248.pth... +[2023-03-11 20:34:04,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000169528_86798336.pth +[2023-03-11 20:34:06,452][66031] Updated weights for policy 0, policy_version 170160 (0.0004) +[2023-03-11 20:34:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9802.6). Total num frames: 87146496. Throughput: 0: 9976.0. Samples: 87142408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:34:09,012][65744] Avg episode reward: [(0, '3959.327')] +[2023-03-11 20:34:10,587][66031] Updated weights for policy 0, policy_version 170240 (0.0005) +[2023-03-11 20:34:14,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 9788.7). Total num frames: 87195648. Throughput: 0: 9983.4. Samples: 87171144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:34:14,012][65744] Avg episode reward: [(0, '4494.565')] +[2023-03-11 20:34:14,869][66031] Updated weights for policy 0, policy_version 170320 (0.0005) +[2023-03-11 20:34:19,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9830.4, 300 sec: 9774.9). Total num frames: 87240704. Throughput: 0: 9908.5. Samples: 87228612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:34:19,012][65744] Avg episode reward: [(0, '4592.962')] +[2023-03-11 20:34:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000170392_87240704.pth... +[2023-03-11 20:34:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000169824_86949888.pth +[2023-03-11 20:34:19,145][66031] Updated weights for policy 0, policy_version 170400 (0.0005) +[2023-03-11 20:34:23,453][66031] Updated weights for policy 0, policy_version 170480 (0.0005) +[2023-03-11 20:34:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9774.9). Total num frames: 87289856. Throughput: 0: 9825.8. Samples: 87285824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:34:24,012][65744] Avg episode reward: [(0, '4593.114')] +[2023-03-11 20:34:27,627][66031] Updated weights for policy 0, policy_version 170560 (0.0005) +[2023-03-11 20:34:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9774.9). Total num frames: 87339008. Throughput: 0: 9774.6. Samples: 87314952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:34:29,012][65744] Avg episode reward: [(0, '4450.390')] +[2023-03-11 20:34:31,939][66031] Updated weights for policy 0, policy_version 170640 (0.0005) +[2023-03-11 20:34:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9747.1). Total num frames: 87384064. Throughput: 0: 9681.6. Samples: 87372328. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:34:34,012][65744] Avg episode reward: [(0, '4590.925')] +[2023-03-11 20:34:34,051][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000170680_87388160.pth... +[2023-03-11 20:34:34,053][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000170104_87093248.pth +[2023-03-11 20:34:36,131][66031] Updated weights for policy 0, policy_version 170720 (0.0005) +[2023-03-11 20:34:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9733.2). Total num frames: 87433216. Throughput: 0: 9691.1. Samples: 87431052. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:34:39,012][65744] Avg episode reward: [(0, '4547.787')] +[2023-03-11 20:34:40,357][66031] Updated weights for policy 0, policy_version 170800 (0.0005) +[2023-03-11 20:34:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9733.2). Total num frames: 87482368. Throughput: 0: 9706.1. Samples: 87460396. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:34:44,012][65744] Avg episode reward: [(0, '4572.548')] +[2023-03-11 20:34:44,492][66031] Updated weights for policy 0, policy_version 170880 (0.0005) +[2023-03-11 20:34:48,777][66031] Updated weights for policy 0, policy_version 170960 (0.0005) +[2023-03-11 20:34:49,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9733.2). Total num frames: 87531520. Throughput: 0: 9738.0. Samples: 87519240. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:34:49,012][65744] Avg episode reward: [(0, '4583.248')] +[2023-03-11 20:34:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000170960_87531520.pth... +[2023-03-11 20:34:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000170392_87240704.pth +[2023-03-11 20:34:53,071][66031] Updated weights for policy 0, policy_version 171040 (0.0005) +[2023-03-11 20:34:54,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9719.3). Total num frames: 87580672. Throughput: 0: 9648.2. Samples: 87576576. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:34:54,012][65744] Avg episode reward: [(0, '4587.386')] +[2023-03-11 20:34:57,331][66031] Updated weights for policy 0, policy_version 171120 (0.0005) +[2023-03-11 20:34:59,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9705.4). Total num frames: 87625728. Throughput: 0: 9646.9. Samples: 87605256. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:34:59,012][65744] Avg episode reward: [(0, '4488.518')] +[2023-03-11 20:35:01,588][66031] Updated weights for policy 0, policy_version 171200 (0.0005) +[2023-03-11 20:35:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9719.3). Total num frames: 87674880. Throughput: 0: 9645.4. Samples: 87662656. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:35:04,012][65744] Avg episode reward: [(0, '4552.352')] +[2023-03-11 20:35:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000171240_87674880.pth... +[2023-03-11 20:35:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000170680_87388160.pth +[2023-03-11 20:35:05,812][66031] Updated weights for policy 0, policy_version 171280 (0.0005) +[2023-03-11 20:35:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9733.2). Total num frames: 87728128. Throughput: 0: 9716.5. Samples: 87723064. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:35:09,012][65744] Avg episode reward: [(0, '4572.887')] +[2023-03-11 20:35:09,763][66031] Updated weights for policy 0, policy_version 171360 (0.0004) +[2023-03-11 20:35:13,712][66031] Updated weights for policy 0, policy_version 171440 (0.0004) +[2023-03-11 20:35:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9733.2). Total num frames: 87777280. Throughput: 0: 9753.5. Samples: 87753860. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:35:14,012][65744] Avg episode reward: [(0, '4448.871')] +[2023-03-11 20:35:17,738][66031] Updated weights for policy 0, policy_version 171520 (0.0004) +[2023-03-11 20:35:19,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9830.4, 300 sec: 9747.1). Total num frames: 87830528. Throughput: 0: 9836.6. Samples: 87814976. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:35:19,012][65744] Avg episode reward: [(0, '3909.798')] +[2023-03-11 20:35:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000171544_87830528.pth... +[2023-03-11 20:35:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000170960_87531520.pth +[2023-03-11 20:35:21,927][66031] Updated weights for policy 0, policy_version 171600 (0.0004) +[2023-03-11 20:35:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9747.1). Total num frames: 87875584. Throughput: 0: 9826.3. Samples: 87873236. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:35:24,012][65744] Avg episode reward: [(0, '3835.580')] +[2023-03-11 20:35:26,234][66031] Updated weights for policy 0, policy_version 171680 (0.0005) +[2023-03-11 20:35:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9747.1). Total num frames: 87924736. Throughput: 0: 9823.6. Samples: 87902460. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:35:29,012][65744] Avg episode reward: [(0, '4252.042')] +[2023-03-11 20:35:30,337][66031] Updated weights for policy 0, policy_version 171760 (0.0005) +[2023-03-11 20:35:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9761.0). Total num frames: 87977984. Throughput: 0: 9841.3. Samples: 87962100. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:35:34,012][65744] Avg episode reward: [(0, '4328.652')] +[2023-03-11 20:35:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000171832_87977984.pth... +[2023-03-11 20:35:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000171240_87674880.pth +[2023-03-11 20:35:34,358][66031] Updated weights for policy 0, policy_version 171840 (0.0004) +[2023-03-11 20:35:38,328][66031] Updated weights for policy 0, policy_version 171920 (0.0004) +[2023-03-11 20:35:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9774.9). Total num frames: 88027136. Throughput: 0: 9940.4. Samples: 88023896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:35:39,012][65744] Avg episode reward: [(0, '4383.544')] +[2023-03-11 20:35:42,273][66031] Updated weights for policy 0, policy_version 172000 (0.0005) +[2023-03-11 20:35:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9788.7). Total num frames: 88080384. Throughput: 0: 10011.1. Samples: 88055756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:35:44,012][65744] Avg episode reward: [(0, '4217.550')] +[2023-03-11 20:35:46,333][66031] Updated weights for policy 0, policy_version 172080 (0.0005) +[2023-03-11 20:35:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9788.7). Total num frames: 88129536. Throughput: 0: 10068.4. Samples: 88115736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:35:49,012][65744] Avg episode reward: [(0, '4051.171')] +[2023-03-11 20:35:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000172128_88129536.pth... +[2023-03-11 20:35:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000171544_87830528.pth +[2023-03-11 20:35:50,652][66031] Updated weights for policy 0, policy_version 172160 (0.0005) +[2023-03-11 20:35:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9788.7). Total num frames: 88174592. Throughput: 0: 9994.7. Samples: 88172824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:35:54,012][65744] Avg episode reward: [(0, '3886.952')] +[2023-03-11 20:35:54,944][66031] Updated weights for policy 0, policy_version 172240 (0.0005) +[2023-03-11 20:35:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9788.7). Total num frames: 88223744. Throughput: 0: 9937.3. Samples: 88201040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:35:59,012][65744] Avg episode reward: [(0, '4216.688')] +[2023-03-11 20:35:59,219][66031] Updated weights for policy 0, policy_version 172320 (0.0005) +[2023-03-11 20:36:03,496][66031] Updated weights for policy 0, policy_version 172400 (0.0005) +[2023-03-11 20:36:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9788.7). Total num frames: 88272896. Throughput: 0: 9866.7. Samples: 88258976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:36:04,012][65744] Avg episode reward: [(0, '4445.838')] +[2023-03-11 20:36:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000172408_88272896.pth... +[2023-03-11 20:36:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000171832_87977984.pth +[2023-03-11 20:36:07,689][66031] Updated weights for policy 0, policy_version 172480 (0.0005) +[2023-03-11 20:36:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9802.6). Total num frames: 88322048. Throughput: 0: 9871.7. Samples: 88317464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:36:09,012][65744] Avg episode reward: [(0, '4095.745')] +[2023-03-11 20:36:11,845][66031] Updated weights for policy 0, policy_version 172560 (0.0005) +[2023-03-11 20:36:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9802.6). Total num frames: 88371200. Throughput: 0: 9873.2. Samples: 88346752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:36:14,012][65744] Avg episode reward: [(0, '3635.841')] +[2023-03-11 20:36:16,090][66031] Updated weights for policy 0, policy_version 172640 (0.0005) +[2023-03-11 20:36:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9802.6). Total num frames: 88420352. Throughput: 0: 9849.2. Samples: 88405312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:36:19,012][65744] Avg episode reward: [(0, '4025.132')] +[2023-03-11 20:36:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000172696_88420352.pth... +[2023-03-11 20:36:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000172128_88129536.pth +[2023-03-11 20:36:20,273][66031] Updated weights for policy 0, policy_version 172720 (0.0005) +[2023-03-11 20:36:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9802.6). Total num frames: 88469504. Throughput: 0: 9783.8. Samples: 88464168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:36:24,012][65744] Avg episode reward: [(0, '4324.459')] +[2023-03-11 20:36:24,329][66031] Updated weights for policy 0, policy_version 172800 (0.0004) +[2023-03-11 20:36:28,476][66031] Updated weights for policy 0, policy_version 172880 (0.0004) +[2023-03-11 20:36:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9802.6). Total num frames: 88518656. Throughput: 0: 9744.8. Samples: 88494272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:36:29,012][65744] Avg episode reward: [(0, '4008.922')] +[2023-03-11 20:36:32,844][66031] Updated weights for policy 0, policy_version 172960 (0.0005) +[2023-03-11 20:36:34,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 88563712. Throughput: 0: 9689.0. Samples: 88551740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:36:34,012][65744] Avg episode reward: [(0, '3688.002')] +[2023-03-11 20:36:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000172976_88563712.pth... +[2023-03-11 20:36:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000172408_88272896.pth +[2023-03-11 20:36:36,922][66031] Updated weights for policy 0, policy_version 173040 (0.0004) +[2023-03-11 20:36:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9788.7). Total num frames: 88616960. Throughput: 0: 9767.8. Samples: 88612376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:36:39,012][65744] Avg episode reward: [(0, '4033.843')] +[2023-03-11 20:36:40,950][66031] Updated weights for policy 0, policy_version 173120 (0.0004) +[2023-03-11 20:36:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 88666112. Throughput: 0: 9810.2. Samples: 88642500. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:36:44,012][65744] Avg episode reward: [(0, '3849.290')] +[2023-03-11 20:36:45,233][66031] Updated weights for policy 0, policy_version 173200 (0.0005) +[2023-03-11 20:36:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 88711168. Throughput: 0: 9795.0. Samples: 88699752. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:36:49,012][65744] Avg episode reward: [(0, '3967.774')] +[2023-03-11 20:36:49,060][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000173272_88715264.pth... +[2023-03-11 20:36:49,062][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000172696_88420352.pth +[2023-03-11 20:36:49,516][66031] Updated weights for policy 0, policy_version 173280 (0.0005) +[2023-03-11 20:36:53,885][66031] Updated weights for policy 0, policy_version 173360 (0.0005) +[2023-03-11 20:36:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 88760320. Throughput: 0: 9751.6. Samples: 88756288. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:36:54,012][65744] Avg episode reward: [(0, '3852.048')] +[2023-03-11 20:36:58,117][66031] Updated weights for policy 0, policy_version 173440 (0.0005) +[2023-03-11 20:36:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 88809472. Throughput: 0: 9739.5. Samples: 88785032. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:36:59,012][65744] Avg episode reward: [(0, '3950.048')] +[2023-03-11 20:37:02,185][66031] Updated weights for policy 0, policy_version 173520 (0.0004) +[2023-03-11 20:37:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 88858624. Throughput: 0: 9768.6. Samples: 88844900. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:37:04,012][65744] Avg episode reward: [(0, '3769.674')] +[2023-03-11 20:37:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000173552_88858624.pth... +[2023-03-11 20:37:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000172976_88563712.pth +[2023-03-11 20:37:06,350][66031] Updated weights for policy 0, policy_version 173600 (0.0005) +[2023-03-11 20:37:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 88907776. Throughput: 0: 9761.6. Samples: 88903440. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:37:09,012][65744] Avg episode reward: [(0, '3985.089')] +[2023-03-11 20:37:10,616][66031] Updated weights for policy 0, policy_version 173680 (0.0005) +[2023-03-11 20:37:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 88956928. Throughput: 0: 9737.1. Samples: 88932440. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:37:14,012][65744] Avg episode reward: [(0, '3980.552')] +[2023-03-11 20:37:14,726][66031] Updated weights for policy 0, policy_version 173760 (0.0005) +[2023-03-11 20:37:19,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9761.0). Total num frames: 89001984. Throughput: 0: 9748.1. Samples: 88990404. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:37:19,012][65744] Avg episode reward: [(0, '4224.753')] +[2023-03-11 20:37:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000173832_89001984.pth... +[2023-03-11 20:37:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000173272_88715264.pth +[2023-03-11 20:37:19,119][66031] Updated weights for policy 0, policy_version 173840 (0.0005) +[2023-03-11 20:37:23,310][66031] Updated weights for policy 0, policy_version 173920 (0.0005) +[2023-03-11 20:37:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 89051136. Throughput: 0: 9683.6. Samples: 89048136. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:37:24,012][65744] Avg episode reward: [(0, '4018.860')] +[2023-03-11 20:37:27,587][66031] Updated weights for policy 0, policy_version 174000 (0.0005) +[2023-03-11 20:37:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 89100288. Throughput: 0: 9651.0. Samples: 89076796. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:37:29,012][65744] Avg episode reward: [(0, '4341.154')] +[2023-03-11 20:37:31,867][66031] Updated weights for policy 0, policy_version 174080 (0.0005) +[2023-03-11 20:37:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 89149440. Throughput: 0: 9666.7. Samples: 89134756. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:37:34,012][65744] Avg episode reward: [(0, '4369.520')] +[2023-03-11 20:37:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000174120_89149440.pth... +[2023-03-11 20:37:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000173552_88858624.pth +[2023-03-11 20:37:36,022][66031] Updated weights for policy 0, policy_version 174160 (0.0005) +[2023-03-11 20:37:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 89198592. Throughput: 0: 9738.1. Samples: 89194504. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:37:39,012][65744] Avg episode reward: [(0, '4390.443')] +[2023-03-11 20:37:40,209][66031] Updated weights for policy 0, policy_version 174240 (0.0005) +[2023-03-11 20:37:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 89243648. Throughput: 0: 9736.4. Samples: 89223168. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:37:44,012][65744] Avg episode reward: [(0, '4401.511')] +[2023-03-11 20:37:44,538][66031] Updated weights for policy 0, policy_version 174320 (0.0005) +[2023-03-11 20:37:48,833][66031] Updated weights for policy 0, policy_version 174400 (0.0005) +[2023-03-11 20:37:49,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 89292800. Throughput: 0: 9667.1. Samples: 89279920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:37:49,012][65744] Avg episode reward: [(0, '3727.546')] +[2023-03-11 20:37:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000174400_89292800.pth... +[2023-03-11 20:37:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000173832_89001984.pth +[2023-03-11 20:37:53,101][66031] Updated weights for policy 0, policy_version 174480 (0.0005) +[2023-03-11 20:37:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 89341952. Throughput: 0: 9644.2. Samples: 89337428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:37:54,012][65744] Avg episode reward: [(0, '3430.801')] +[2023-03-11 20:37:57,330][66031] Updated weights for policy 0, policy_version 174560 (0.0005) +[2023-03-11 20:37:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 89391104. Throughput: 0: 9635.2. Samples: 89366024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:37:59,012][65744] Avg episode reward: [(0, '3304.368')] +[2023-03-11 20:38:01,392][66031] Updated weights for policy 0, policy_version 174640 (0.0004) +[2023-03-11 20:38:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 89440256. Throughput: 0: 9688.7. Samples: 89426396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:38:04,012][65744] Avg episode reward: [(0, '4019.356')] +[2023-03-11 20:38:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000174688_89440256.pth... +[2023-03-11 20:38:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000174120_89149440.pth +[2023-03-11 20:38:05,519][66031] Updated weights for policy 0, policy_version 174720 (0.0005) +[2023-03-11 20:38:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 89489408. Throughput: 0: 9716.4. Samples: 89485376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:38:09,012][65744] Avg episode reward: [(0, '4298.443')] +[2023-03-11 20:38:09,728][66031] Updated weights for policy 0, policy_version 174800 (0.0005) +[2023-03-11 20:38:13,976][66031] Updated weights for policy 0, policy_version 174880 (0.0005) +[2023-03-11 20:38:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 89538560. Throughput: 0: 9715.5. Samples: 89513992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:38:14,012][65744] Avg episode reward: [(0, '4359.922')] +[2023-03-11 20:38:18,308][66031] Updated weights for policy 0, policy_version 174960 (0.0005) +[2023-03-11 20:38:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 89583616. Throughput: 0: 9703.0. Samples: 89571392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:38:19,012][65744] Avg episode reward: [(0, '4555.907')] +[2023-03-11 20:38:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000174968_89583616.pth... +[2023-03-11 20:38:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000174400_89292800.pth +[2023-03-11 20:38:22,497][66031] Updated weights for policy 0, policy_version 175040 (0.0005) +[2023-03-11 20:38:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 89632768. Throughput: 0: 9670.4. Samples: 89629672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:38:24,012][65744] Avg episode reward: [(0, '4529.240')] +[2023-03-11 20:38:26,582][66031] Updated weights for policy 0, policy_version 175120 (0.0005) +[2023-03-11 20:38:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 89686016. Throughput: 0: 9705.7. Samples: 89659924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:38:29,012][65744] Avg episode reward: [(0, '4497.111')] +[2023-03-11 20:38:30,586][66031] Updated weights for policy 0, policy_version 175200 (0.0003) +[2023-03-11 20:38:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 89735168. Throughput: 0: 9820.4. Samples: 89721836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:38:34,012][65744] Avg episode reward: [(0, '4502.049')] +[2023-03-11 20:38:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000175264_89735168.pth... +[2023-03-11 20:38:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000174688_89440256.pth +[2023-03-11 20:38:34,570][66031] Updated weights for policy 0, policy_version 175280 (0.0004) +[2023-03-11 20:38:38,643][66031] Updated weights for policy 0, policy_version 175360 (0.0005) +[2023-03-11 20:38:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 89784320. Throughput: 0: 9886.0. Samples: 89782296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:38:39,012][65744] Avg episode reward: [(0, '4347.021')] +[2023-03-11 20:38:42,619][66031] Updated weights for policy 0, policy_version 175440 (0.0004) +[2023-03-11 20:38:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9788.7). Total num frames: 89837568. Throughput: 0: 9932.8. Samples: 89813000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:38:44,012][65744] Avg episode reward: [(0, '4436.708')] +[2023-03-11 20:38:46,619][66031] Updated weights for policy 0, policy_version 175520 (0.0004) +[2023-03-11 20:38:49,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9788.7). Total num frames: 89886720. Throughput: 0: 9957.9. Samples: 89874504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:38:49,012][65744] Avg episode reward: [(0, '4148.519')] +[2023-03-11 20:38:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000175560_89886720.pth... +[2023-03-11 20:38:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000174968_89583616.pth +[2023-03-11 20:38:50,851][66031] Updated weights for policy 0, policy_version 175600 (0.0005) +[2023-03-11 20:38:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9788.7). Total num frames: 89935872. Throughput: 0: 9921.4. Samples: 89931840. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:38:54,012][65744] Avg episode reward: [(0, '4084.105')] +[2023-03-11 20:38:55,204][66031] Updated weights for policy 0, policy_version 175680 (0.0005) +[2023-03-11 20:38:59,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9802.6). Total num frames: 89985024. Throughput: 0: 9917.2. Samples: 89960264. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:38:59,012][65744] Avg episode reward: [(0, '3811.123')] +[2023-03-11 20:38:59,389][66031] Updated weights for policy 0, policy_version 175760 (0.0004) +[2023-03-11 20:39:03,363][66031] Updated weights for policy 0, policy_version 175840 (0.0004) +[2023-03-11 20:39:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9788.7). Total num frames: 90034176. Throughput: 0: 9981.6. Samples: 90020564. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:39:04,012][65744] Avg episode reward: [(0, '3599.998')] +[2023-03-11 20:39:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000175848_90034176.pth... +[2023-03-11 20:39:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000175264_89735168.pth +[2023-03-11 20:39:07,654][66031] Updated weights for policy 0, policy_version 175920 (0.0005) +[2023-03-11 20:39:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9788.7). Total num frames: 90083328. Throughput: 0: 9983.4. Samples: 90078924. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:39:09,012][65744] Avg episode reward: [(0, '4099.610')] +[2023-03-11 20:39:11,916][66031] Updated weights for policy 0, policy_version 176000 (0.0005) +[2023-03-11 20:39:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9802.6). Total num frames: 90132480. Throughput: 0: 9955.1. Samples: 90107904. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:39:14,012][65744] Avg episode reward: [(0, '4372.578')] +[2023-03-11 20:39:16,015][66031] Updated weights for policy 0, policy_version 176080 (0.0004) +[2023-03-11 20:39:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9802.6). Total num frames: 90181632. Throughput: 0: 9906.6. Samples: 90167632. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:39:19,012][65744] Avg episode reward: [(0, '4309.461')] +[2023-03-11 20:39:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000176136_90181632.pth... +[2023-03-11 20:39:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000175560_89886720.pth +[2023-03-11 20:39:20,219][66031] Updated weights for policy 0, policy_version 176160 (0.0005) +[2023-03-11 20:39:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9788.7). Total num frames: 90226688. Throughput: 0: 9841.8. Samples: 90225176. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:39:24,012][65744] Avg episode reward: [(0, '4266.503')] +[2023-03-11 20:39:24,461][66031] Updated weights for policy 0, policy_version 176240 (0.0005) +[2023-03-11 20:39:28,761][66031] Updated weights for policy 0, policy_version 176320 (0.0005) +[2023-03-11 20:39:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9802.6). Total num frames: 90275840. Throughput: 0: 9803.0. Samples: 90254136. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:39:29,012][65744] Avg episode reward: [(0, '4242.117')] +[2023-03-11 20:39:32,935][66031] Updated weights for policy 0, policy_version 176400 (0.0005) +[2023-03-11 20:39:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9802.6). Total num frames: 90324992. Throughput: 0: 9725.3. Samples: 90312144. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:39:34,012][65744] Avg episode reward: [(0, '4110.052')] +[2023-03-11 20:39:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000176416_90324992.pth... +[2023-03-11 20:39:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000175848_90034176.pth +[2023-03-11 20:39:37,019][66031] Updated weights for policy 0, policy_version 176480 (0.0005) +[2023-03-11 20:39:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9802.6). Total num frames: 90374144. Throughput: 0: 9781.1. Samples: 90371988. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:39:39,012][65744] Avg episode reward: [(0, '4097.664')] +[2023-03-11 20:39:41,080][66031] Updated weights for policy 0, policy_version 176560 (0.0004) +[2023-03-11 20:39:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9816.5). Total num frames: 90427392. Throughput: 0: 9831.3. Samples: 90402672. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:39:44,012][65744] Avg episode reward: [(0, '4149.335')] +[2023-03-11 20:39:45,116][66031] Updated weights for policy 0, policy_version 176640 (0.0004) +[2023-03-11 20:39:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9816.5). Total num frames: 90476544. Throughput: 0: 9854.7. Samples: 90464024. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:39:49,012][65744] Avg episode reward: [(0, '3832.243')] +[2023-03-11 20:39:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000176712_90476544.pth... +[2023-03-11 20:39:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000176136_90181632.pth +[2023-03-11 20:39:49,062][66031] Updated weights for policy 0, policy_version 176720 (0.0004) +[2023-03-11 20:39:53,257][66031] Updated weights for policy 0, policy_version 176800 (0.0005) +[2023-03-11 20:39:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 90525696. Throughput: 0: 9880.6. Samples: 90523552. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:39:54,012][65744] Avg episode reward: [(0, '3876.623')] +[2023-03-11 20:39:57,426][66031] Updated weights for policy 0, policy_version 176880 (0.0004) +[2023-03-11 20:39:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 90574848. Throughput: 0: 9866.3. Samples: 90551888. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:39:59,012][65744] Avg episode reward: [(0, '4098.671')] +[2023-03-11 20:40:01,468][66031] Updated weights for policy 0, policy_version 176960 (0.0005) +[2023-03-11 20:40:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9830.4). Total num frames: 90628096. Throughput: 0: 9917.0. Samples: 90613896. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:40:04,012][65744] Avg episode reward: [(0, '3975.766')] +[2023-03-11 20:40:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000177008_90628096.pth... +[2023-03-11 20:40:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000176416_90324992.pth +[2023-03-11 20:40:05,423][66031] Updated weights for policy 0, policy_version 177040 (0.0005) +[2023-03-11 20:40:09,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 9830.4). Total num frames: 90677248. Throughput: 0: 10003.7. Samples: 90675340. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:40:09,012][65744] Avg episode reward: [(0, '4228.794')] +[2023-03-11 20:40:09,424][66031] Updated weights for policy 0, policy_version 177120 (0.0005) +[2023-03-11 20:40:13,426][66031] Updated weights for policy 0, policy_version 177200 (0.0004) +[2023-03-11 20:40:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9830.4). Total num frames: 90730496. Throughput: 0: 10041.2. Samples: 90705992. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:40:14,012][65744] Avg episode reward: [(0, '4516.812')] +[2023-03-11 20:40:17,355][66031] Updated weights for policy 0, policy_version 177280 (0.0004) +[2023-03-11 20:40:19,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10035.2, 300 sec: 9858.2). Total num frames: 90783744. Throughput: 0: 10127.7. Samples: 90767892. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:40:19,012][65744] Avg episode reward: [(0, '4338.681')] +[2023-03-11 20:40:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000177312_90783744.pth... +[2023-03-11 20:40:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000176712_90476544.pth +[2023-03-11 20:40:21,323][66031] Updated weights for policy 0, policy_version 177360 (0.0004) +[2023-03-11 20:40:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9858.2). Total num frames: 90832896. Throughput: 0: 10169.0. Samples: 90829592. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:40:24,012][65744] Avg episode reward: [(0, '4450.443')] +[2023-03-11 20:40:25,313][66031] Updated weights for policy 0, policy_version 177440 (0.0004) +[2023-03-11 20:40:29,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 9858.2). Total num frames: 90886144. Throughput: 0: 10188.0. Samples: 90861132. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:40:29,012][65744] Avg episode reward: [(0, '4549.806')] +[2023-03-11 20:40:29,354][66031] Updated weights for policy 0, policy_version 177520 (0.0004) +[2023-03-11 20:40:33,309][66031] Updated weights for policy 0, policy_version 177600 (0.0004) +[2023-03-11 20:40:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 9858.2). Total num frames: 90935296. Throughput: 0: 10196.5. Samples: 90922868. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:40:34,012][65744] Avg episode reward: [(0, '4477.742')] +[2023-03-11 20:40:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000177608_90935296.pth... +[2023-03-11 20:40:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000177008_90628096.pth +[2023-03-11 20:40:37,461][66031] Updated weights for policy 0, policy_version 177680 (0.0005) +[2023-03-11 20:40:39,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 9844.3). Total num frames: 90984448. Throughput: 0: 10172.7. Samples: 90981324. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:40:39,012][65744] Avg episode reward: [(0, '4429.722')] +[2023-03-11 20:40:41,562][66031] Updated weights for policy 0, policy_version 177760 (0.0004) +[2023-03-11 20:40:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 9858.2). Total num frames: 91037696. Throughput: 0: 10226.7. Samples: 91012088. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:40:44,012][65744] Avg episode reward: [(0, '4327.353')] +[2023-03-11 20:40:45,496][66031] Updated weights for policy 0, policy_version 177840 (0.0004) +[2023-03-11 20:40:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9872.1). Total num frames: 91086848. Throughput: 0: 10235.8. Samples: 91074508. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:40:49,012][65744] Avg episode reward: [(0, '4438.129')] +[2023-03-11 20:40:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000177904_91086848.pth... +[2023-03-11 20:40:49,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000177312_90783744.pth +[2023-03-11 20:40:49,455][66031] Updated weights for policy 0, policy_version 177920 (0.0004) +[2023-03-11 20:40:53,503][66031] Updated weights for policy 0, policy_version 178000 (0.0005) +[2023-03-11 20:40:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9885.9). Total num frames: 91140096. Throughput: 0: 10236.9. Samples: 91136000. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:40:54,012][65744] Avg episode reward: [(0, '4399.287')] +[2023-03-11 20:40:57,739][66031] Updated weights for policy 0, policy_version 178080 (0.0005) +[2023-03-11 20:40:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9872.1). Total num frames: 91185152. Throughput: 0: 10191.9. Samples: 91164628. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:40:59,012][65744] Avg episode reward: [(0, '4478.073')] +[2023-03-11 20:41:01,822][66031] Updated weights for policy 0, policy_version 178160 (0.0004) +[2023-03-11 20:41:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9885.9). Total num frames: 91238400. Throughput: 0: 10137.0. Samples: 91224056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:41:04,012][65744] Avg episode reward: [(0, '4511.195')] +[2023-03-11 20:41:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000178200_91238400.pth... +[2023-03-11 20:41:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000177608_90935296.pth +[2023-03-11 20:41:05,814][66031] Updated weights for policy 0, policy_version 178240 (0.0004) +[2023-03-11 20:41:09,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 9899.8). Total num frames: 91291648. Throughput: 0: 10136.2. Samples: 91285724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:41:09,012][65744] Avg episode reward: [(0, '4406.867')] +[2023-03-11 20:41:09,802][66031] Updated weights for policy 0, policy_version 178320 (0.0004) +[2023-03-11 20:41:13,784][66031] Updated weights for policy 0, policy_version 178400 (0.0004) +[2023-03-11 20:41:14,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 9899.8). Total num frames: 91340800. Throughput: 0: 10117.0. Samples: 91316396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:41:14,012][65744] Avg episode reward: [(0, '4456.388')] +[2023-03-11 20:41:17,735][66031] Updated weights for policy 0, policy_version 178480 (0.0004) +[2023-03-11 20:41:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9913.7). Total num frames: 91394048. Throughput: 0: 10124.6. Samples: 91378476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:41:19,012][65744] Avg episode reward: [(0, '4371.787')] +[2023-03-11 20:41:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000178504_91394048.pth... +[2023-03-11 20:41:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000177904_91086848.pth +[2023-03-11 20:41:21,741][66031] Updated weights for policy 0, policy_version 178560 (0.0004) +[2023-03-11 20:41:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9913.7). Total num frames: 91443200. Throughput: 0: 10188.8. Samples: 91439820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:41:24,012][65744] Avg episode reward: [(0, '4287.725')] +[2023-03-11 20:41:25,739][66031] Updated weights for policy 0, policy_version 178640 (0.0004) +[2023-03-11 20:41:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9941.5). Total num frames: 91496448. Throughput: 0: 10207.2. Samples: 91471412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:41:29,012][65744] Avg episode reward: [(0, '4365.596')] +[2023-03-11 20:41:29,760][66031] Updated weights for policy 0, policy_version 178720 (0.0005) +[2023-03-11 20:41:33,803][66031] Updated weights for policy 0, policy_version 178800 (0.0005) +[2023-03-11 20:41:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9927.6). Total num frames: 91545600. Throughput: 0: 10157.0. Samples: 91531572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:41:34,012][65744] Avg episode reward: [(0, '4404.322')] +[2023-03-11 20:41:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000178800_91545600.pth... +[2023-03-11 20:41:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000178200_91238400.pth +[2023-03-11 20:41:37,856][66031] Updated weights for policy 0, policy_version 178880 (0.0004) +[2023-03-11 20:41:39,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 9927.6). Total num frames: 91594752. Throughput: 0: 10140.1. Samples: 91592304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:41:39,012][65744] Avg episode reward: [(0, '4477.250')] +[2023-03-11 20:41:41,864][66031] Updated weights for policy 0, policy_version 178960 (0.0004) +[2023-03-11 20:41:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9955.4). Total num frames: 91648000. Throughput: 0: 10195.6. Samples: 91623432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:41:44,012][65744] Avg episode reward: [(0, '4404.567')] +[2023-03-11 20:41:45,766][66031] Updated weights for policy 0, policy_version 179040 (0.0005) +[2023-03-11 20:41:49,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 9969.2). Total num frames: 91701248. Throughput: 0: 10266.5. Samples: 91686048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:41:49,012][65744] Avg episode reward: [(0, '4549.951')] +[2023-03-11 20:41:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000179104_91701248.pth... +[2023-03-11 20:41:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000178504_91394048.pth +[2023-03-11 20:41:49,692][66031] Updated weights for policy 0, policy_version 179120 (0.0004) +[2023-03-11 20:41:53,598][66031] Updated weights for policy 0, policy_version 179200 (0.0004) +[2023-03-11 20:41:54,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 9983.1). Total num frames: 91754496. Throughput: 0: 10292.4. Samples: 91748880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:41:54,012][65744] Avg episode reward: [(0, '4415.083')] +[2023-03-11 20:41:57,568][66031] Updated weights for policy 0, policy_version 179280 (0.0004) +[2023-03-11 20:41:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9983.1). Total num frames: 91803648. Throughput: 0: 10295.9. Samples: 91779712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:41:59,012][65744] Avg episode reward: [(0, '4517.362')] +[2023-03-11 20:42:01,576][66031] Updated weights for policy 0, policy_version 179360 (0.0004) +[2023-03-11 20:42:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 9983.1). Total num frames: 91852800. Throughput: 0: 10271.3. Samples: 91840684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:42:04,012][65744] Avg episode reward: [(0, '4357.455')] +[2023-03-11 20:42:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000179400_91852800.pth... +[2023-03-11 20:42:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000178800_91545600.pth +[2023-03-11 20:42:05,873][66031] Updated weights for policy 0, policy_version 179440 (0.0005) +[2023-03-11 20:42:09,012][65744] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 9983.1). Total num frames: 91901952. Throughput: 0: 10178.9. Samples: 91897872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:42:09,012][65744] Avg episode reward: [(0, '4301.897')] +[2023-03-11 20:42:10,051][66031] Updated weights for policy 0, policy_version 179520 (0.0004) +[2023-03-11 20:42:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9997.0). Total num frames: 91951104. Throughput: 0: 10139.5. Samples: 91927688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:42:14,012][65744] Avg episode reward: [(0, '4170.961')] +[2023-03-11 20:42:14,313][66031] Updated weights for policy 0, policy_version 179600 (0.0005) +[2023-03-11 20:42:18,582][66031] Updated weights for policy 0, policy_version 179680 (0.0005) +[2023-03-11 20:42:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 9983.1). Total num frames: 91996160. Throughput: 0: 10071.3. Samples: 91984780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:42:19,012][65744] Avg episode reward: [(0, '4287.181')] +[2023-03-11 20:42:19,019][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000179688_92000256.pth... +[2023-03-11 20:42:19,021][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000179104_91701248.pth +[2023-03-11 20:42:22,896][66031] Updated weights for policy 0, policy_version 179760 (0.0005) +[2023-03-11 20:42:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 9983.1). Total num frames: 92045312. Throughput: 0: 10005.8. Samples: 92042564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:42:24,012][65744] Avg episode reward: [(0, '4275.298')] +[2023-03-11 20:42:27,119][66031] Updated weights for policy 0, policy_version 179840 (0.0005) +[2023-03-11 20:42:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9983.1). Total num frames: 92094464. Throughput: 0: 9956.6. Samples: 92071480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:42:29,012][65744] Avg episode reward: [(0, '3867.235')] +[2023-03-11 20:42:31,419][66031] Updated weights for policy 0, policy_version 179920 (0.0005) +[2023-03-11 20:42:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9983.1). Total num frames: 92143616. Throughput: 0: 9841.1. Samples: 92128896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:42:34,012][65744] Avg episode reward: [(0, '4100.522')] +[2023-03-11 20:42:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000179968_92143616.pth... +[2023-03-11 20:42:34,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000179400_91852800.pth +[2023-03-11 20:42:35,652][66031] Updated weights for policy 0, policy_version 180000 (0.0005) +[2023-03-11 20:42:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9997.0). Total num frames: 92192768. Throughput: 0: 9756.8. Samples: 92187936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:42:39,012][65744] Avg episode reward: [(0, '4283.904')] +[2023-03-11 20:42:39,732][66031] Updated weights for policy 0, policy_version 180080 (0.0005) +[2023-03-11 20:42:43,654][66031] Updated weights for policy 0, policy_version 180160 (0.0004) +[2023-03-11 20:42:44,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9997.0). Total num frames: 92241920. Throughput: 0: 9751.2. Samples: 92218516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:42:44,012][65744] Avg episode reward: [(0, '4070.516')] +[2023-03-11 20:42:47,573][66031] Updated weights for policy 0, policy_version 180240 (0.0004) +[2023-03-11 20:42:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10010.9). Total num frames: 92295168. Throughput: 0: 9794.1. Samples: 92281420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:42:49,012][65744] Avg episode reward: [(0, '4241.155')] +[2023-03-11 20:42:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000180264_92295168.pth... +[2023-03-11 20:42:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000179688_92000256.pth +[2023-03-11 20:42:51,520][66031] Updated weights for policy 0, policy_version 180320 (0.0004) +[2023-03-11 20:42:54,012][65744] Fps is (10 sec: 10649.6, 60 sec: 9898.7, 300 sec: 10024.8). Total num frames: 92348416. Throughput: 0: 9918.6. Samples: 92344208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:42:54,012][65744] Avg episode reward: [(0, '3972.798')] +[2023-03-11 20:42:55,478][66031] Updated weights for policy 0, policy_version 180400 (0.0004) +[2023-03-11 20:42:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10024.8). Total num frames: 92397568. Throughput: 0: 9935.9. Samples: 92374804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:42:59,012][65744] Avg episode reward: [(0, '3954.519')] +[2023-03-11 20:42:59,489][66031] Updated weights for policy 0, policy_version 180480 (0.0005) +[2023-03-11 20:43:03,717][66031] Updated weights for policy 0, policy_version 180560 (0.0005) +[2023-03-11 20:43:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 10024.8). Total num frames: 92446720. Throughput: 0: 9991.8. Samples: 92434412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:43:04,012][65744] Avg episode reward: [(0, '4051.893')] +[2023-03-11 20:43:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000180560_92446720.pth... +[2023-03-11 20:43:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000179968_92143616.pth +[2023-03-11 20:43:07,714][66031] Updated weights for policy 0, policy_version 180640 (0.0004) +[2023-03-11 20:43:09,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 10038.7). Total num frames: 92499968. Throughput: 0: 10065.3. Samples: 92495504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:43:09,012][65744] Avg episode reward: [(0, '4121.149')] +[2023-03-11 20:43:11,702][66031] Updated weights for policy 0, policy_version 180720 (0.0005) +[2023-03-11 20:43:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10052.6). Total num frames: 92549120. Throughput: 0: 10095.2. Samples: 92525764. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:43:14,012][65744] Avg episode reward: [(0, '4295.764')] +[2023-03-11 20:43:15,968][66031] Updated weights for policy 0, policy_version 180800 (0.0005) +[2023-03-11 20:43:19,012][65744] Fps is (10 sec: 9830.2, 60 sec: 10035.2, 300 sec: 10052.5). Total num frames: 92598272. Throughput: 0: 10101.8. Samples: 92583476. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:43:19,013][65744] Avg episode reward: [(0, '4167.495')] +[2023-03-11 20:43:19,017][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000180856_92598272.pth... +[2023-03-11 20:43:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000180264_92295168.pth +[2023-03-11 20:43:20,269][66031] Updated weights for policy 0, policy_version 180880 (0.0005) +[2023-03-11 20:43:24,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9966.9, 300 sec: 10024.8). Total num frames: 92643328. Throughput: 0: 10084.7. Samples: 92641748. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:43:24,012][65744] Avg episode reward: [(0, '4245.106')] +[2023-03-11 20:43:24,488][66031] Updated weights for policy 0, policy_version 180960 (0.0005) +[2023-03-11 20:43:28,737][66031] Updated weights for policy 0, policy_version 181040 (0.0005) +[2023-03-11 20:43:29,012][65744] Fps is (10 sec: 9421.0, 60 sec: 9966.9, 300 sec: 10024.8). Total num frames: 92692480. Throughput: 0: 10055.6. Samples: 92671020. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:43:29,012][65744] Avg episode reward: [(0, '4239.495')] +[2023-03-11 20:43:32,900][66031] Updated weights for policy 0, policy_version 181120 (0.0005) +[2023-03-11 20:43:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 10024.8). Total num frames: 92741632. Throughput: 0: 9945.9. Samples: 92728984. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:43:34,012][65744] Avg episode reward: [(0, '4465.811')] +[2023-03-11 20:43:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000181136_92741632.pth... +[2023-03-11 20:43:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000180560_92446720.pth +[2023-03-11 20:43:37,111][66031] Updated weights for policy 0, policy_version 181200 (0.0005) +[2023-03-11 20:43:39,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 10010.9). Total num frames: 92790784. Throughput: 0: 9834.5. Samples: 92786760. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:43:39,012][65744] Avg episode reward: [(0, '4011.965')] +[2023-03-11 20:43:41,465][66031] Updated weights for policy 0, policy_version 181280 (0.0005) +[2023-03-11 20:43:44,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 9997.0). Total num frames: 92835840. Throughput: 0: 9790.5. Samples: 92815376. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:43:44,012][65744] Avg episode reward: [(0, '4090.723')] +[2023-03-11 20:43:45,742][66031] Updated weights for policy 0, policy_version 181360 (0.0005) +[2023-03-11 20:43:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9997.0). Total num frames: 92884992. Throughput: 0: 9741.2. Samples: 92872768. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:43:49,012][65744] Avg episode reward: [(0, '4349.097')] +[2023-03-11 20:43:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000181416_92884992.pth... +[2023-03-11 20:43:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000180856_92598272.pth +[2023-03-11 20:43:50,016][66031] Updated weights for policy 0, policy_version 181440 (0.0005) +[2023-03-11 20:43:54,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9997.0). Total num frames: 92934144. Throughput: 0: 9657.9. Samples: 92930112. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:43:54,012][65744] Avg episode reward: [(0, '4263.342')] +[2023-03-11 20:43:54,276][66031] Updated weights for policy 0, policy_version 181520 (0.0005) +[2023-03-11 20:43:58,505][66031] Updated weights for policy 0, policy_version 181600 (0.0005) +[2023-03-11 20:43:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9997.0). Total num frames: 92983296. Throughput: 0: 9635.6. Samples: 92959368. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:43:59,012][65744] Avg episode reward: [(0, '4355.648')] +[2023-03-11 20:44:02,780][66031] Updated weights for policy 0, policy_version 181680 (0.0005) +[2023-03-11 20:44:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9983.1). Total num frames: 93028352. Throughput: 0: 9634.2. Samples: 93017012. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:44:04,012][65744] Avg episode reward: [(0, '4296.282')] +[2023-03-11 20:44:04,052][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000181704_93032448.pth... +[2023-03-11 20:44:04,054][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000181136_92741632.pth +[2023-03-11 20:44:06,887][66031] Updated weights for policy 0, policy_version 181760 (0.0005) +[2023-03-11 20:44:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9997.0). Total num frames: 93081600. Throughput: 0: 9683.7. Samples: 93077516. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:44:09,012][65744] Avg episode reward: [(0, '4236.552')] +[2023-03-11 20:44:10,831][66031] Updated weights for policy 0, policy_version 181840 (0.0004) +[2023-03-11 20:44:14,012][65744] Fps is (10 sec: 10649.6, 60 sec: 9762.1, 300 sec: 10010.9). Total num frames: 93134848. Throughput: 0: 9720.7. Samples: 93108452. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:44:14,012][65744] Avg episode reward: [(0, '4363.723')] +[2023-03-11 20:44:14,821][66031] Updated weights for policy 0, policy_version 181920 (0.0004) +[2023-03-11 20:44:18,850][66031] Updated weights for policy 0, policy_version 182000 (0.0005) +[2023-03-11 20:44:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.2, 300 sec: 10024.8). Total num frames: 93184000. Throughput: 0: 9781.7. Samples: 93169160. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:44:19,012][65744] Avg episode reward: [(0, '4468.698')] +[2023-03-11 20:44:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000182000_93184000.pth... +[2023-03-11 20:44:19,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000181416_92884992.pth +[2023-03-11 20:44:22,852][66031] Updated weights for policy 0, policy_version 182080 (0.0004) +[2023-03-11 20:44:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10024.8). Total num frames: 93233152. Throughput: 0: 9878.2. Samples: 93231280. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:44:24,012][65744] Avg episode reward: [(0, '4459.612')] +[2023-03-11 20:44:26,938][66031] Updated weights for policy 0, policy_version 182160 (0.0003) +[2023-03-11 20:44:29,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 93286400. Throughput: 0: 9911.8. Samples: 93261408. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:44:29,012][65744] Avg episode reward: [(0, '4342.008')] +[2023-03-11 20:44:31,000][66031] Updated weights for policy 0, policy_version 182240 (0.0003) +[2023-03-11 20:44:34,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 93335552. Throughput: 0: 9964.6. Samples: 93321176. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:44:34,012][65744] Avg episode reward: [(0, '4387.312')] +[2023-03-11 20:44:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000182296_93335552.pth... +[2023-03-11 20:44:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000181704_93032448.pth +[2023-03-11 20:44:35,135][66031] Updated weights for policy 0, policy_version 182320 (0.0004) +[2023-03-11 20:44:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10024.8). Total num frames: 93384704. Throughput: 0: 9996.5. Samples: 93379956. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:44:39,012][65744] Avg episode reward: [(0, '4364.370')] +[2023-03-11 20:44:39,376][66031] Updated weights for policy 0, policy_version 182400 (0.0005) +[2023-03-11 20:44:43,624][66031] Updated weights for policy 0, policy_version 182480 (0.0005) +[2023-03-11 20:44:44,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 10010.9). Total num frames: 93429760. Throughput: 0: 9998.4. Samples: 93409296. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:44:44,012][65744] Avg episode reward: [(0, '4302.033')] +[2023-03-11 20:44:47,892][66031] Updated weights for policy 0, policy_version 182560 (0.0005) +[2023-03-11 20:44:49,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9898.7, 300 sec: 10010.9). Total num frames: 93478912. Throughput: 0: 9991.6. Samples: 93466636. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:44:49,012][65744] Avg episode reward: [(0, '4150.140')] +[2023-03-11 20:44:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000182576_93478912.pth... +[2023-03-11 20:44:49,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000182000_93184000.pth +[2023-03-11 20:44:52,181][66031] Updated weights for policy 0, policy_version 182640 (0.0005) +[2023-03-11 20:44:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10010.9). Total num frames: 93528064. Throughput: 0: 9921.4. Samples: 93523980. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:44:54,012][65744] Avg episode reward: [(0, '4289.713')] +[2023-03-11 20:44:56,456][66031] Updated weights for policy 0, policy_version 182720 (0.0005) +[2023-03-11 20:44:59,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 9983.1). Total num frames: 93573120. Throughput: 0: 9872.3. Samples: 93552704. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:44:59,012][65744] Avg episode reward: [(0, '4017.873')] +[2023-03-11 20:45:00,756][66031] Updated weights for policy 0, policy_version 182800 (0.0005) +[2023-03-11 20:45:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9983.1). Total num frames: 93622272. Throughput: 0: 9796.3. Samples: 93609992. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:45:04,012][65744] Avg episode reward: [(0, '4283.265')] +[2023-03-11 20:45:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000182856_93622272.pth... +[2023-03-11 20:45:04,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000182296_93335552.pth +[2023-03-11 20:45:04,889][66031] Updated weights for policy 0, policy_version 182880 (0.0005) +[2023-03-11 20:45:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9969.3). Total num frames: 93671424. Throughput: 0: 9747.9. Samples: 93669936. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:45:09,012][65744] Avg episode reward: [(0, '4339.469')] +[2023-03-11 20:45:09,047][66031] Updated weights for policy 0, policy_version 182960 (0.0005) +[2023-03-11 20:45:13,295][66031] Updated weights for policy 0, policy_version 183040 (0.0005) +[2023-03-11 20:45:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9955.4). Total num frames: 93720576. Throughput: 0: 9732.8. Samples: 93699384. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:45:14,012][65744] Avg episode reward: [(0, '4305.102')] +[2023-03-11 20:45:17,585][66031] Updated weights for policy 0, policy_version 183120 (0.0005) +[2023-03-11 20:45:19,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9955.4). Total num frames: 93769728. Throughput: 0: 9667.3. Samples: 93756204. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:45:19,012][65744] Avg episode reward: [(0, '4420.952')] +[2023-03-11 20:45:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000183144_93769728.pth... +[2023-03-11 20:45:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000182576_93478912.pth +[2023-03-11 20:45:21,717][66031] Updated weights for policy 0, policy_version 183200 (0.0005) +[2023-03-11 20:45:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9941.5). Total num frames: 93818880. Throughput: 0: 9667.0. Samples: 93814972. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 20:45:24,012][65744] Avg episode reward: [(0, '4365.501')] +[2023-03-11 20:45:25,802][66031] Updated weights for policy 0, policy_version 183280 (0.0005) +[2023-03-11 20:45:29,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9941.5). Total num frames: 93868032. Throughput: 0: 9706.0. Samples: 93846064. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:45:29,012][65744] Avg episode reward: [(0, '4394.755')] +[2023-03-11 20:45:29,872][66031] Updated weights for policy 0, policy_version 183360 (0.0005) +[2023-03-11 20:45:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9941.5). Total num frames: 93917184. Throughput: 0: 9740.7. Samples: 93904968. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:45:34,012][65744] Avg episode reward: [(0, '4293.399')] +[2023-03-11 20:45:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000183432_93917184.pth... +[2023-03-11 20:45:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000182856_93622272.pth +[2023-03-11 20:45:34,166][66031] Updated weights for policy 0, policy_version 183440 (0.0005) +[2023-03-11 20:45:38,559][66031] Updated weights for policy 0, policy_version 183520 (0.0005) +[2023-03-11 20:45:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9927.6). Total num frames: 93966336. Throughput: 0: 9719.8. Samples: 93961372. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:45:39,012][65744] Avg episode reward: [(0, '4237.232')] +[2023-03-11 20:45:42,791][66031] Updated weights for policy 0, policy_version 183600 (0.0005) +[2023-03-11 20:45:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.8, 300 sec: 9913.7). Total num frames: 94011392. Throughput: 0: 9737.9. Samples: 93990912. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:45:44,012][65744] Avg episode reward: [(0, '4095.680')] +[2023-03-11 20:45:47,125][66031] Updated weights for policy 0, policy_version 183680 (0.0005) +[2023-03-11 20:45:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9899.8). Total num frames: 94060544. Throughput: 0: 9729.1. Samples: 94047800. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:45:49,012][65744] Avg episode reward: [(0, '4100.831')] +[2023-03-11 20:45:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000183712_94060544.pth... +[2023-03-11 20:45:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000183144_93769728.pth +[2023-03-11 20:45:51,416][66031] Updated weights for policy 0, policy_version 183760 (0.0005) +[2023-03-11 20:45:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9913.7). Total num frames: 94109696. Throughput: 0: 9676.6. Samples: 94105384. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:45:54,012][65744] Avg episode reward: [(0, '4089.838')] +[2023-03-11 20:45:55,684][66031] Updated weights for policy 0, policy_version 183840 (0.0005) +[2023-03-11 20:45:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9885.9). Total num frames: 94154752. Throughput: 0: 9649.7. Samples: 94133620. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:45:59,012][65744] Avg episode reward: [(0, '4160.986')] +[2023-03-11 20:45:59,981][66031] Updated weights for policy 0, policy_version 183920 (0.0005) +[2023-03-11 20:46:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9872.1). Total num frames: 94203904. Throughput: 0: 9670.7. Samples: 94191384. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:46:04,012][65744] Avg episode reward: [(0, '4204.898')] +[2023-03-11 20:46:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000183992_94203904.pth... +[2023-03-11 20:46:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000183432_93917184.pth +[2023-03-11 20:46:04,225][66031] Updated weights for policy 0, policy_version 184000 (0.0005) +[2023-03-11 20:46:08,431][66031] Updated weights for policy 0, policy_version 184080 (0.0005) +[2023-03-11 20:46:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.8, 300 sec: 9872.1). Total num frames: 94253056. Throughput: 0: 9645.8. Samples: 94249032. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:46:09,012][65744] Avg episode reward: [(0, '4238.179')] +[2023-03-11 20:46:12,700][66031] Updated weights for policy 0, policy_version 184160 (0.0005) +[2023-03-11 20:46:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9858.2). Total num frames: 94302208. Throughput: 0: 9591.8. Samples: 94277696. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:46:14,012][65744] Avg episode reward: [(0, '4300.148')] +[2023-03-11 20:46:16,943][66031] Updated weights for policy 0, policy_version 184240 (0.0005) +[2023-03-11 20:46:19,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9625.6, 300 sec: 9844.3). Total num frames: 94347264. Throughput: 0: 9565.1. Samples: 94335400. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:46:19,012][65744] Avg episode reward: [(0, '4354.231')] +[2023-03-11 20:46:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000184272_94347264.pth... +[2023-03-11 20:46:19,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000183712_94060544.pth +[2023-03-11 20:46:21,228][66031] Updated weights for policy 0, policy_version 184320 (0.0005) +[2023-03-11 20:46:24,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9830.4). Total num frames: 94396416. Throughput: 0: 9584.6. Samples: 94392680. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:46:24,012][65744] Avg episode reward: [(0, '4080.877')] +[2023-03-11 20:46:25,472][66031] Updated weights for policy 0, policy_version 184400 (0.0005) +[2023-03-11 20:46:29,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9830.4). Total num frames: 94445568. Throughput: 0: 9576.2. Samples: 94421840. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:46:29,012][65744] Avg episode reward: [(0, '4212.645')] +[2023-03-11 20:46:29,755][66031] Updated weights for policy 0, policy_version 184480 (0.0005) +[2023-03-11 20:46:34,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9816.5). Total num frames: 94490624. Throughput: 0: 9598.1. Samples: 94479716. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:46:34,012][65744] Avg episode reward: [(0, '3966.807')] +[2023-03-11 20:46:34,039][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000184560_94494720.pth... +[2023-03-11 20:46:34,040][66031] Updated weights for policy 0, policy_version 184560 (0.0005) +[2023-03-11 20:46:34,041][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000183992_94203904.pth +[2023-03-11 20:46:38,177][66031] Updated weights for policy 0, policy_version 184640 (0.0005) +[2023-03-11 20:46:39,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9816.5). Total num frames: 94543872. Throughput: 0: 9629.2. Samples: 94538700. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:46:39,012][65744] Avg episode reward: [(0, '4241.033')] +[2023-03-11 20:46:42,206][66031] Updated weights for policy 0, policy_version 184720 (0.0004) +[2023-03-11 20:46:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9693.9, 300 sec: 9802.6). Total num frames: 94593024. Throughput: 0: 9665.2. Samples: 94568556. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:46:44,012][65744] Avg episode reward: [(0, '4175.365')] +[2023-03-11 20:46:46,396][66031] Updated weights for policy 0, policy_version 184800 (0.0005) +[2023-03-11 20:46:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 94642176. Throughput: 0: 9710.6. Samples: 94628360. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:46:49,012][65744] Avg episode reward: [(0, '4454.931')] +[2023-03-11 20:46:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000184848_94642176.pth... +[2023-03-11 20:46:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000184272_94347264.pth +[2023-03-11 20:46:50,317][66031] Updated weights for policy 0, policy_version 184880 (0.0003) +[2023-03-11 20:46:54,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 94691328. Throughput: 0: 9788.6. Samples: 94689520. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:46:54,012][65744] Avg episode reward: [(0, '4305.678')] +[2023-03-11 20:46:54,434][66031] Updated weights for policy 0, policy_version 184960 (0.0004) +[2023-03-11 20:46:58,413][66031] Updated weights for policy 0, policy_version 185040 (0.0003) +[2023-03-11 20:46:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9802.6). Total num frames: 94744576. Throughput: 0: 9830.6. Samples: 94720072. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:46:59,012][65744] Avg episode reward: [(0, '4164.722')] +[2023-03-11 20:47:02,516][66031] Updated weights for policy 0, policy_version 185120 (0.0004) +[2023-03-11 20:47:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9830.4, 300 sec: 9802.6). Total num frames: 94793728. Throughput: 0: 9911.4. Samples: 94781412. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:47:04,012][65744] Avg episode reward: [(0, '4432.850')] +[2023-03-11 20:47:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000185144_94793728.pth... +[2023-03-11 20:47:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000184560_94494720.pth +[2023-03-11 20:47:06,830][66031] Updated weights for policy 0, policy_version 185200 (0.0006) +[2023-03-11 20:47:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9802.6). Total num frames: 94842880. Throughput: 0: 9906.6. Samples: 94838476. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:47:09,012][65744] Avg episode reward: [(0, '4382.266')] +[2023-03-11 20:47:11,077][66031] Updated weights for policy 0, policy_version 185280 (0.0005) +[2023-03-11 20:47:14,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 9802.6). Total num frames: 94887936. Throughput: 0: 9901.6. Samples: 94867412. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:47:14,012][65744] Avg episode reward: [(0, '4460.281')] +[2023-03-11 20:47:15,373][66031] Updated weights for policy 0, policy_version 185360 (0.0006) +[2023-03-11 20:47:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9802.6). Total num frames: 94937088. Throughput: 0: 9890.8. Samples: 94924800. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:47:19,012][65744] Avg episode reward: [(0, '4480.585')] +[2023-03-11 20:47:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000185424_94937088.pth... +[2023-03-11 20:47:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000184848_94642176.pth +[2023-03-11 20:47:19,665][66031] Updated weights for policy 0, policy_version 185440 (0.0005) +[2023-03-11 20:47:23,913][66031] Updated weights for policy 0, policy_version 185520 (0.0005) +[2023-03-11 20:47:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9802.6). Total num frames: 94986240. Throughput: 0: 9849.8. Samples: 94981940. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:47:24,012][65744] Avg episode reward: [(0, '4421.886')] +[2023-03-11 20:47:28,194][66031] Updated weights for policy 0, policy_version 185600 (0.0005) +[2023-03-11 20:47:29,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 95031296. Throughput: 0: 9828.3. Samples: 95010828. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:47:29,012][65744] Avg episode reward: [(0, '4369.790')] +[2023-03-11 20:47:32,512][66031] Updated weights for policy 0, policy_version 185680 (0.0005) +[2023-03-11 20:47:34,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9830.4, 300 sec: 9788.7). Total num frames: 95080448. Throughput: 0: 9773.2. Samples: 95068156. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 20:47:34,012][65744] Avg episode reward: [(0, '4335.991')] +[2023-03-11 20:47:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000185704_95080448.pth... +[2023-03-11 20:47:34,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000185144_94793728.pth +[2023-03-11 20:47:36,716][66031] Updated weights for policy 0, policy_version 185760 (0.0005) +[2023-03-11 20:47:39,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9788.7). Total num frames: 95129600. Throughput: 0: 9689.9. Samples: 95125568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:47:39,012][65744] Avg episode reward: [(0, '4349.833')] +[2023-03-11 20:47:41,030][66031] Updated weights for policy 0, policy_version 185840 (0.0005) +[2023-03-11 20:47:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 95178752. Throughput: 0: 9647.1. Samples: 95154192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:47:44,012][65744] Avg episode reward: [(0, '4492.212')] +[2023-03-11 20:47:45,252][66031] Updated weights for policy 0, policy_version 185920 (0.0005) +[2023-03-11 20:47:49,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9747.1). Total num frames: 95223808. Throughput: 0: 9565.6. Samples: 95211864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:47:49,012][65744] Avg episode reward: [(0, '4349.464')] +[2023-03-11 20:47:49,080][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000185992_95227904.pth... +[2023-03-11 20:47:49,081][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000185424_94937088.pth +[2023-03-11 20:47:49,500][66031] Updated weights for policy 0, policy_version 186000 (0.0005) +[2023-03-11 20:47:53,734][66031] Updated weights for policy 0, policy_version 186080 (0.0005) +[2023-03-11 20:47:54,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.8, 300 sec: 9747.1). Total num frames: 95272960. Throughput: 0: 9597.9. Samples: 95270384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:47:54,012][65744] Avg episode reward: [(0, '4400.891')] +[2023-03-11 20:47:58,062][66031] Updated weights for policy 0, policy_version 186160 (0.0005) +[2023-03-11 20:47:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9747.1). Total num frames: 95322112. Throughput: 0: 9582.7. Samples: 95298632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:47:59,012][65744] Avg episode reward: [(0, '4380.179')] +[2023-03-11 20:48:02,280][66031] Updated weights for policy 0, policy_version 186240 (0.0005) +[2023-03-11 20:48:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9733.2). Total num frames: 95371264. Throughput: 0: 9589.8. Samples: 95356340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:48:04,012][65744] Avg episode reward: [(0, '4373.376')] +[2023-03-11 20:48:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000186272_95371264.pth... +[2023-03-11 20:48:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000185704_95080448.pth +[2023-03-11 20:48:06,241][66031] Updated weights for policy 0, policy_version 186320 (0.0004) +[2023-03-11 20:48:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9733.2). Total num frames: 95420416. Throughput: 0: 9712.5. Samples: 95419004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:48:09,012][65744] Avg episode reward: [(0, '4448.168')] +[2023-03-11 20:48:10,203][66031] Updated weights for policy 0, policy_version 186400 (0.0004) +[2023-03-11 20:48:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9747.1). Total num frames: 95473664. Throughput: 0: 9742.5. Samples: 95449240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:48:14,012][65744] Avg episode reward: [(0, '4267.758')] +[2023-03-11 20:48:14,128][66031] Updated weights for policy 0, policy_version 186480 (0.0004) +[2023-03-11 20:48:18,084][66031] Updated weights for policy 0, policy_version 186560 (0.0004) +[2023-03-11 20:48:19,012][65744] Fps is (10 sec: 10649.5, 60 sec: 9830.4, 300 sec: 9774.9). Total num frames: 95526912. Throughput: 0: 9860.4. Samples: 95511872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:48:19,012][65744] Avg episode reward: [(0, '4237.962')] +[2023-03-11 20:48:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000186576_95526912.pth... +[2023-03-11 20:48:19,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000185992_95227904.pth +[2023-03-11 20:48:22,073][66031] Updated weights for policy 0, policy_version 186640 (0.0004) +[2023-03-11 20:48:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9774.9). Total num frames: 95576064. Throughput: 0: 9952.3. Samples: 95573420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:48:24,012][65744] Avg episode reward: [(0, '4480.796')] +[2023-03-11 20:48:26,070][66031] Updated weights for policy 0, policy_version 186720 (0.0005) +[2023-03-11 20:48:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9788.7). Total num frames: 95629312. Throughput: 0: 10012.1. Samples: 95604736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:48:29,012][65744] Avg episode reward: [(0, '4414.637')] +[2023-03-11 20:48:30,084][66031] Updated weights for policy 0, policy_version 186800 (0.0004) +[2023-03-11 20:48:34,002][66031] Updated weights for policy 0, policy_version 186880 (0.0004) +[2023-03-11 20:48:34,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10035.2, 300 sec: 9802.6). Total num frames: 95682560. Throughput: 0: 10097.2. Samples: 95666240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:48:34,012][65744] Avg episode reward: [(0, '4446.133')] +[2023-03-11 20:48:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000186880_95682560.pth... +[2023-03-11 20:48:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000186272_95371264.pth +[2023-03-11 20:48:37,972][66031] Updated weights for policy 0, policy_version 186960 (0.0004) +[2023-03-11 20:48:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9816.5). Total num frames: 95731712. Throughput: 0: 10170.5. Samples: 95728056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:48:39,012][65744] Avg episode reward: [(0, '4520.226')] +[2023-03-11 20:48:41,949][66031] Updated weights for policy 0, policy_version 187040 (0.0005) +[2023-03-11 20:48:44,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9830.4). Total num frames: 95784960. Throughput: 0: 10241.9. Samples: 95759520. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:48:44,012][65744] Avg episode reward: [(0, '4466.042')] +[2023-03-11 20:48:45,938][66031] Updated weights for policy 0, policy_version 187120 (0.0004) +[2023-03-11 20:48:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9830.4). Total num frames: 95834112. Throughput: 0: 10318.7. Samples: 95820680. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:48:49,012][65744] Avg episode reward: [(0, '4573.712')] +[2023-03-11 20:48:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000187176_95834112.pth... +[2023-03-11 20:48:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000186576_95526912.pth +[2023-03-11 20:48:49,946][66031] Updated weights for policy 0, policy_version 187200 (0.0004) +[2023-03-11 20:48:53,921][66031] Updated weights for policy 0, policy_version 187280 (0.0004) +[2023-03-11 20:48:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9844.3). Total num frames: 95887360. Throughput: 0: 10301.3. Samples: 95882564. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:48:54,012][65744] Avg episode reward: [(0, '4406.101')] +[2023-03-11 20:48:57,974][66031] Updated weights for policy 0, policy_version 187360 (0.0005) +[2023-03-11 20:48:59,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 9858.2). Total num frames: 95936512. Throughput: 0: 10302.6. Samples: 95912856. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:48:59,012][65744] Avg episode reward: [(0, '4551.575')] +[2023-03-11 20:49:01,929][66031] Updated weights for policy 0, policy_version 187440 (0.0004) +[2023-03-11 20:49:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9858.2). Total num frames: 95989760. Throughput: 0: 10284.3. Samples: 95974664. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:49:04,012][65744] Avg episode reward: [(0, '4569.132')] +[2023-03-11 20:49:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000187480_95989760.pth... +[2023-03-11 20:49:04,017][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000186880_95682560.pth +[2023-03-11 20:49:05,992][66031] Updated weights for policy 0, policy_version 187520 (0.0005) +[2023-03-11 20:49:09,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 9844.3). Total num frames: 96038912. Throughput: 0: 10245.4. Samples: 96034464. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:49:09,012][65744] Avg episode reward: [(0, '4579.002')] +[2023-03-11 20:49:10,104][66031] Updated weights for policy 0, policy_version 187600 (0.0005) +[2023-03-11 20:49:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 9844.3). Total num frames: 96088064. Throughput: 0: 10220.9. Samples: 96064676. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:49:14,012][65744] Avg episode reward: [(0, '4537.482')] +[2023-03-11 20:49:14,123][66031] Updated weights for policy 0, policy_version 187680 (0.0005) +[2023-03-11 20:49:18,052][66031] Updated weights for policy 0, policy_version 187760 (0.0005) +[2023-03-11 20:49:19,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9858.2). Total num frames: 96141312. Throughput: 0: 10230.0. Samples: 96126588. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:49:19,012][65744] Avg episode reward: [(0, '4503.933')] +[2023-03-11 20:49:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000187776_96141312.pth... +[2023-03-11 20:49:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000187176_95834112.pth +[2023-03-11 20:49:22,017][66031] Updated weights for policy 0, policy_version 187840 (0.0005) +[2023-03-11 20:49:24,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 9858.2). Total num frames: 96194560. Throughput: 0: 10249.9. Samples: 96189300. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:49:24,012][65744] Avg episode reward: [(0, '4438.213')] +[2023-03-11 20:49:25,922][66031] Updated weights for policy 0, policy_version 187920 (0.0004) +[2023-03-11 20:49:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9858.2). Total num frames: 96243712. Throughput: 0: 10238.8. Samples: 96220268. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:49:29,012][65744] Avg episode reward: [(0, '4419.889')] +[2023-03-11 20:49:29,879][66031] Updated weights for policy 0, policy_version 188000 (0.0005) +[2023-03-11 20:49:33,899][66031] Updated weights for policy 0, policy_version 188080 (0.0005) +[2023-03-11 20:49:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9872.1). Total num frames: 96296960. Throughput: 0: 10250.7. Samples: 96281960. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:49:34,012][65744] Avg episode reward: [(0, '4301.710')] +[2023-03-11 20:49:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000188080_96296960.pth... +[2023-03-11 20:49:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000187480_95989760.pth +[2023-03-11 20:49:37,852][66031] Updated weights for policy 0, policy_version 188160 (0.0004) +[2023-03-11 20:49:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9885.9). Total num frames: 96346112. Throughput: 0: 10247.1. Samples: 96343684. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:49:39,012][65744] Avg episode reward: [(0, '4269.774')] +[2023-03-11 20:49:41,960][66031] Updated weights for policy 0, policy_version 188240 (0.0003) +[2023-03-11 20:49:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 9899.8). Total num frames: 96399360. Throughput: 0: 10242.4. Samples: 96373764. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:49:44,012][65744] Avg episode reward: [(0, '4473.187')] +[2023-03-11 20:49:45,896][66031] Updated weights for policy 0, policy_version 188320 (0.0004) +[2023-03-11 20:49:49,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9899.8). Total num frames: 96448512. Throughput: 0: 10242.9. Samples: 96435596. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 20:49:49,012][65744] Avg episode reward: [(0, '4506.039')] +[2023-03-11 20:49:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000188376_96448512.pth... +[2023-03-11 20:49:49,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000187776_96141312.pth +[2023-03-11 20:49:50,180][66031] Updated weights for policy 0, policy_version 188400 (0.0004) +[2023-03-11 20:49:54,012][65744] Fps is (10 sec: 9420.9, 60 sec: 10103.5, 300 sec: 9899.8). Total num frames: 96493568. Throughput: 0: 10181.7. Samples: 96492640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:49:54,012][65744] Avg episode reward: [(0, '4456.940')] +[2023-03-11 20:49:54,453][66031] Updated weights for policy 0, policy_version 188480 (0.0005) +[2023-03-11 20:49:58,715][66031] Updated weights for policy 0, policy_version 188560 (0.0005) +[2023-03-11 20:49:59,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10103.5, 300 sec: 9899.8). Total num frames: 96542720. Throughput: 0: 10145.0. Samples: 96521200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:49:59,012][65744] Avg episode reward: [(0, '4376.794')] +[2023-03-11 20:50:02,955][66031] Updated weights for policy 0, policy_version 188640 (0.0006) +[2023-03-11 20:50:04,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 96591872. Throughput: 0: 10062.2. Samples: 96579388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:50:04,012][65744] Avg episode reward: [(0, '4431.285')] +[2023-03-11 20:50:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000188656_96591872.pth... +[2023-03-11 20:50:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000188080_96296960.pth +[2023-03-11 20:50:07,253][66031] Updated weights for policy 0, policy_version 188720 (0.0005) +[2023-03-11 20:50:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 96641024. Throughput: 0: 9932.8. Samples: 96636276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:50:09,012][65744] Avg episode reward: [(0, '4457.956')] +[2023-03-11 20:50:11,460][66031] Updated weights for policy 0, policy_version 188800 (0.0005) +[2023-03-11 20:50:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 96690176. Throughput: 0: 9895.7. Samples: 96665576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:50:14,012][65744] Avg episode reward: [(0, '4498.002')] +[2023-03-11 20:50:15,416][66031] Updated weights for policy 0, policy_version 188880 (0.0005) +[2023-03-11 20:50:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 96739328. Throughput: 0: 9892.1. Samples: 96727104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:50:19,012][65744] Avg episode reward: [(0, '4518.525')] +[2023-03-11 20:50:19,039][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000188952_96743424.pth... +[2023-03-11 20:50:19,041][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000188376_96448512.pth +[2023-03-11 20:50:19,425][66031] Updated weights for policy 0, policy_version 188960 (0.0005) +[2023-03-11 20:50:23,390][66031] Updated weights for policy 0, policy_version 189040 (0.0005) +[2023-03-11 20:50:24,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 96792576. Throughput: 0: 9891.5. Samples: 96788800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:50:24,012][65744] Avg episode reward: [(0, '4551.639')] +[2023-03-11 20:50:27,355][66031] Updated weights for policy 0, policy_version 189120 (0.0004) +[2023-03-11 20:50:29,012][65744] Fps is (10 sec: 10649.7, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 96845824. Throughput: 0: 9909.5. Samples: 96819692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:50:29,012][65744] Avg episode reward: [(0, '4426.794')] +[2023-03-11 20:50:31,357][66031] Updated weights for policy 0, policy_version 189200 (0.0003) +[2023-03-11 20:50:34,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 96894976. Throughput: 0: 9910.3. Samples: 96881560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:50:34,012][65744] Avg episode reward: [(0, '4185.444')] +[2023-03-11 20:50:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000189248_96894976.pth... +[2023-03-11 20:50:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000188656_96591872.pth +[2023-03-11 20:50:35,384][66031] Updated weights for policy 0, policy_version 189280 (0.0003) +[2023-03-11 20:50:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 96948224. Throughput: 0: 10018.0. Samples: 96943452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:50:39,012][65744] Avg episode reward: [(0, '4596.745')] +[2023-03-11 20:50:39,380][66031] Updated weights for policy 0, policy_version 189360 (0.0003) +[2023-03-11 20:50:43,492][66031] Updated weights for policy 0, policy_version 189440 (0.0003) +[2023-03-11 20:50:44,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 9955.4). Total num frames: 96997376. Throughput: 0: 10032.2. Samples: 96972648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:50:44,012][65744] Avg episode reward: [(0, '4567.527')] +[2023-03-11 20:50:47,507][66031] Updated weights for policy 0, policy_version 189520 (0.0003) +[2023-03-11 20:50:49,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9955.4). Total num frames: 97046528. Throughput: 0: 10094.0. Samples: 97033616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:50:49,012][65744] Avg episode reward: [(0, '4566.539')] +[2023-03-11 20:50:49,014][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000189544_97046528.pth... +[2023-03-11 20:50:49,016][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000188952_96743424.pth +[2023-03-11 20:50:51,495][66031] Updated weights for policy 0, policy_version 189600 (0.0003) +[2023-03-11 20:50:54,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9969.2). Total num frames: 97095680. Throughput: 0: 10190.0. Samples: 97094824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:50:54,012][65744] Avg episode reward: [(0, '4563.398')] +[2023-03-11 20:50:55,745][66031] Updated weights for policy 0, policy_version 189680 (0.0004) +[2023-03-11 20:50:59,012][65744] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9983.1). Total num frames: 97148928. Throughput: 0: 10169.7. Samples: 97123212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:50:59,012][65744] Avg episode reward: [(0, '4493.431')] +[2023-03-11 20:50:59,742][66031] Updated weights for policy 0, policy_version 189760 (0.0004) +[2023-03-11 20:51:03,816][66031] Updated weights for policy 0, policy_version 189840 (0.0003) +[2023-03-11 20:51:04,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9983.1). Total num frames: 97198080. Throughput: 0: 10161.5. Samples: 97184372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:51:04,012][65744] Avg episode reward: [(0, '4560.117')] +[2023-03-11 20:51:04,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000189840_97198080.pth... +[2023-03-11 20:51:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000189248_96894976.pth +[2023-03-11 20:51:07,816][66031] Updated weights for policy 0, policy_version 189920 (0.0005) +[2023-03-11 20:51:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9983.1). Total num frames: 97247232. Throughput: 0: 10145.3. Samples: 97245340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:51:09,012][65744] Avg episode reward: [(0, '4581.394')] +[2023-03-11 20:51:11,771][66031] Updated weights for policy 0, policy_version 190000 (0.0005) +[2023-03-11 20:51:14,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10010.9). Total num frames: 97300480. Throughput: 0: 10147.8. Samples: 97276344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:51:14,012][65744] Avg episode reward: [(0, '4391.485')] +[2023-03-11 20:51:15,735][66031] Updated weights for policy 0, policy_version 190080 (0.0005) +[2023-03-11 20:51:19,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10024.8). Total num frames: 97353728. Throughput: 0: 10161.7. Samples: 97338836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:51:19,012][65744] Avg episode reward: [(0, '4583.281')] +[2023-03-11 20:51:19,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000190144_97353728.pth... +[2023-03-11 20:51:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000189544_97046528.pth +[2023-03-11 20:51:19,773][66031] Updated weights for policy 0, policy_version 190160 (0.0005) +[2023-03-11 20:51:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10010.9). Total num frames: 97398784. Throughput: 0: 10085.5. Samples: 97397300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:51:24,023][65744] Avg episode reward: [(0, '4391.159')] +[2023-03-11 20:51:24,041][66031] Updated weights for policy 0, policy_version 190240 (0.0005) +[2023-03-11 20:51:28,339][66031] Updated weights for policy 0, policy_version 190320 (0.0005) +[2023-03-11 20:51:29,012][65744] Fps is (10 sec: 9420.9, 60 sec: 10035.2, 300 sec: 10024.8). Total num frames: 97447936. Throughput: 0: 10084.4. Samples: 97426448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:51:29,023][65744] Avg episode reward: [(0, '4393.399')] +[2023-03-11 20:51:32,705][66031] Updated weights for policy 0, policy_version 190400 (0.0005) +[2023-03-11 20:51:34,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10010.9). Total num frames: 97497088. Throughput: 0: 9979.6. Samples: 97482700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:51:34,012][65744] Avg episode reward: [(0, '4504.314')] +[2023-03-11 20:51:34,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000190424_97497088.pth... +[2023-03-11 20:51:34,028][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000189840_97198080.pth +[2023-03-11 20:51:37,008][66031] Updated weights for policy 0, policy_version 190480 (0.0005) +[2023-03-11 20:51:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9997.0). Total num frames: 97542144. Throughput: 0: 9879.2. Samples: 97539388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:51:39,012][65744] Avg episode reward: [(0, '4526.212')] +[2023-03-11 20:51:41,224][66031] Updated weights for policy 0, policy_version 190560 (0.0005) +[2023-03-11 20:51:44,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 9997.0). Total num frames: 97591296. Throughput: 0: 9904.3. Samples: 97568904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:51:44,012][65744] Avg episode reward: [(0, '4527.784')] +[2023-03-11 20:51:45,466][66031] Updated weights for policy 0, policy_version 190640 (0.0005) +[2023-03-11 20:51:49,012][65744] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 10010.9). Total num frames: 97644544. Throughput: 0: 9863.6. Samples: 97628232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:51:49,012][65744] Avg episode reward: [(0, '4449.862')] +[2023-03-11 20:51:49,026][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000190712_97644544.pth... +[2023-03-11 20:51:49,030][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000190144_97353728.pth +[2023-03-11 20:51:49,397][66031] Updated weights for policy 0, policy_version 190720 (0.0004) +[2023-03-11 20:51:53,425][66031] Updated weights for policy 0, policy_version 190800 (0.0005) +[2023-03-11 20:51:54,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9997.0). Total num frames: 97693696. Throughput: 0: 9873.9. Samples: 97689664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:51:54,012][65744] Avg episode reward: [(0, '4465.238')] +[2023-03-11 20:51:57,360][66031] Updated weights for policy 0, policy_version 190880 (0.0004) +[2023-03-11 20:51:59,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10010.9). Total num frames: 97746944. Throughput: 0: 9879.3. Samples: 97720912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:51:59,012][65744] Avg episode reward: [(0, '4397.686')] +[2023-03-11 20:52:01,296][66031] Updated weights for policy 0, policy_version 190960 (0.0004) +[2023-03-11 20:52:04,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 10010.9). Total num frames: 97796096. Throughput: 0: 9888.5. Samples: 97783820. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:52:04,012][65744] Avg episode reward: [(0, '4432.175')] +[2023-03-11 20:52:04,035][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000191016_97800192.pth... +[2023-03-11 20:52:04,037][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000190424_97497088.pth +[2023-03-11 20:52:05,234][66031] Updated weights for policy 0, policy_version 191040 (0.0005) +[2023-03-11 20:52:09,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 97849344. Throughput: 0: 9958.7. Samples: 97845440. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:52:09,012][65744] Avg episode reward: [(0, '4574.242')] +[2023-03-11 20:52:09,167][66031] Updated weights for policy 0, policy_version 191120 (0.0005) +[2023-03-11 20:52:13,108][66031] Updated weights for policy 0, policy_version 191200 (0.0004) +[2023-03-11 20:52:14,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10035.2, 300 sec: 10052.6). Total num frames: 97902592. Throughput: 0: 10023.0. Samples: 97877484. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:52:14,012][65744] Avg episode reward: [(0, '4264.916')] +[2023-03-11 20:52:17,033][66031] Updated weights for policy 0, policy_version 191280 (0.0004) +[2023-03-11 20:52:19,012][65744] Fps is (10 sec: 10649.6, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 97955840. Throughput: 0: 10151.7. Samples: 97939528. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:52:19,012][65744] Avg episode reward: [(0, '4435.052')] +[2023-03-11 20:52:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000191320_97955840.pth... +[2023-03-11 20:52:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000190712_97644544.pth +[2023-03-11 20:52:20,952][66031] Updated weights for policy 0, policy_version 191360 (0.0004) +[2023-03-11 20:52:24,012][65744] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10080.3). Total num frames: 98004992. Throughput: 0: 10289.0. Samples: 98002392. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:52:24,012][65744] Avg episode reward: [(0, '4495.665')] +[2023-03-11 20:52:24,862][66031] Updated weights for policy 0, policy_version 191440 (0.0003) +[2023-03-11 20:52:28,808][66031] Updated weights for policy 0, policy_version 191520 (0.0004) +[2023-03-11 20:52:29,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10094.2). Total num frames: 98058240. Throughput: 0: 10332.3. Samples: 98033856. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:52:29,012][65744] Avg episode reward: [(0, '4283.635')] +[2023-03-11 20:52:32,817][66031] Updated weights for policy 0, policy_version 191600 (0.0005) +[2023-03-11 20:52:34,012][65744] Fps is (10 sec: 10649.5, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 98111488. Throughput: 0: 10376.9. Samples: 98095192. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:52:34,012][65744] Avg episode reward: [(0, '4354.533')] +[2023-03-11 20:52:34,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000191624_98111488.pth... +[2023-03-11 20:52:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000191016_97800192.pth +[2023-03-11 20:52:36,834][66031] Updated weights for policy 0, policy_version 191680 (0.0005) +[2023-03-11 20:52:39,012][65744] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10108.1). Total num frames: 98160640. Throughput: 0: 10376.5. Samples: 98156608. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:52:39,012][65744] Avg episode reward: [(0, '4502.600')] +[2023-03-11 20:52:40,849][66031] Updated weights for policy 0, policy_version 191760 (0.0005) +[2023-03-11 20:52:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10122.0). Total num frames: 98209792. Throughput: 0: 10367.9. Samples: 98187468. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:52:44,012][65744] Avg episode reward: [(0, '4494.143')] +[2023-03-11 20:52:44,922][66031] Updated weights for policy 0, policy_version 191840 (0.0004) +[2023-03-11 20:52:49,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 98258944. Throughput: 0: 10286.7. Samples: 98246720. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:52:49,012][65744] Avg episode reward: [(0, '4524.382')] +[2023-03-11 20:52:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000191912_98258944.pth... +[2023-03-11 20:52:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000191320_97955840.pth +[2023-03-11 20:52:49,161][66031] Updated weights for policy 0, policy_version 191920 (0.0005) +[2023-03-11 20:52:53,390][66031] Updated weights for policy 0, policy_version 192000 (0.0005) +[2023-03-11 20:52:54,012][65744] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 98308096. Throughput: 0: 10197.3. Samples: 98304320. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:52:54,012][65744] Avg episode reward: [(0, '4517.087')] +[2023-03-11 20:52:57,671][66031] Updated weights for policy 0, policy_version 192080 (0.0005) +[2023-03-11 20:52:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 98357248. Throughput: 0: 10132.0. Samples: 98333424. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:52:59,012][65744] Avg episode reward: [(0, '4535.153')] +[2023-03-11 20:53:01,887][66031] Updated weights for policy 0, policy_version 192160 (0.0005) +[2023-03-11 20:53:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10103.5, 300 sec: 10108.1). Total num frames: 98402304. Throughput: 0: 10036.2. Samples: 98391156. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 20:53:04,012][65744] Avg episode reward: [(0, '4487.795')] +[2023-03-11 20:53:04,025][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000192200_98406400.pth... +[2023-03-11 20:53:04,026][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000191624_98111488.pth +[2023-03-11 20:53:06,171][66031] Updated weights for policy 0, policy_version 192240 (0.0005) +[2023-03-11 20:53:09,012][65744] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 98451456. Throughput: 0: 9928.3. Samples: 98449168. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 20:53:09,012][65744] Avg episode reward: [(0, '4373.301')] +[2023-03-11 20:53:10,387][66031] Updated weights for policy 0, policy_version 192320 (0.0005) +[2023-03-11 20:53:14,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 10080.3). Total num frames: 98500608. Throughput: 0: 9875.8. Samples: 98478268. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 20:53:14,012][65744] Avg episode reward: [(0, '4521.541')] +[2023-03-11 20:53:14,680][66031] Updated weights for policy 0, policy_version 192400 (0.0005) +[2023-03-11 20:53:18,891][66031] Updated weights for policy 0, policy_version 192480 (0.0005) +[2023-03-11 20:53:19,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10080.3). Total num frames: 98549760. Throughput: 0: 9796.3. Samples: 98536028. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 20:53:19,012][65744] Avg episode reward: [(0, '4559.862')] +[2023-03-11 20:53:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000192480_98549760.pth... +[2023-03-11 20:53:19,019][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000191912_98258944.pth +[2023-03-11 20:53:23,076][66031] Updated weights for policy 0, policy_version 192560 (0.0005) +[2023-03-11 20:53:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10066.4). Total num frames: 98598912. Throughput: 0: 9737.0. Samples: 98594772. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 20:53:24,012][65744] Avg episode reward: [(0, '4459.062')] +[2023-03-11 20:53:27,309][66031] Updated weights for policy 0, policy_version 192640 (0.0005) +[2023-03-11 20:53:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10052.6). Total num frames: 98648064. Throughput: 0: 9689.6. Samples: 98623500. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 20:53:29,012][65744] Avg episode reward: [(0, '4464.672')] +[2023-03-11 20:53:31,608][66031] Updated weights for policy 0, policy_version 192720 (0.0005) +[2023-03-11 20:53:34,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 10038.7). Total num frames: 98693120. Throughput: 0: 9646.9. Samples: 98680832. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 20:53:34,012][65744] Avg episode reward: [(0, '4453.491')] +[2023-03-11 20:53:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000192760_98693120.pth... +[2023-03-11 20:53:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000192200_98406400.pth +[2023-03-11 20:53:35,905][66031] Updated weights for policy 0, policy_version 192800 (0.0005) +[2023-03-11 20:53:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 10024.8). Total num frames: 98742272. Throughput: 0: 9641.4. Samples: 98738184. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 20:53:39,012][65744] Avg episode reward: [(0, '4417.742')] +[2023-03-11 20:53:40,153][66031] Updated weights for policy 0, policy_version 192880 (0.0005) +[2023-03-11 20:53:44,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 10010.9). Total num frames: 98787328. Throughput: 0: 9632.0. Samples: 98766864. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 20:53:44,012][65744] Avg episode reward: [(0, '4393.972')] +[2023-03-11 20:53:44,473][66031] Updated weights for policy 0, policy_version 192960 (0.0005) +[2023-03-11 20:53:48,683][66031] Updated weights for policy 0, policy_version 193040 (0.0005) +[2023-03-11 20:53:49,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9997.0). Total num frames: 98836480. Throughput: 0: 9626.1. Samples: 98824332. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 20:53:49,012][65744] Avg episode reward: [(0, '4300.047')] +[2023-03-11 20:53:49,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000193040_98836480.pth... +[2023-03-11 20:53:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000192480_98549760.pth +[2023-03-11 20:53:52,883][66031] Updated weights for policy 0, policy_version 193120 (0.0005) +[2023-03-11 20:53:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9997.0). Total num frames: 98885632. Throughput: 0: 9642.7. Samples: 98883088. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 20:53:54,012][65744] Avg episode reward: [(0, '4423.711')] +[2023-03-11 20:53:57,131][66031] Updated weights for policy 0, policy_version 193200 (0.0005) +[2023-03-11 20:53:59,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9983.1). Total num frames: 98934784. Throughput: 0: 9638.9. Samples: 98912020. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 20:53:59,012][65744] Avg episode reward: [(0, '4495.020')] +[2023-03-11 20:54:01,411][66031] Updated weights for policy 0, policy_version 193280 (0.0005) +[2023-03-11 20:54:04,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9983.1). Total num frames: 98983936. Throughput: 0: 9630.0. Samples: 98969376. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 20:54:04,012][65744] Avg episode reward: [(0, '4361.831')] +[2023-03-11 20:54:04,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000193328_98983936.pth... +[2023-03-11 20:54:04,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000192760_98693120.pth +[2023-03-11 20:54:05,661][66031] Updated weights for policy 0, policy_version 193360 (0.0005) +[2023-03-11 20:54:09,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9983.1). Total num frames: 99033088. Throughput: 0: 9621.4. Samples: 99027736. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 20:54:09,012][65744] Avg episode reward: [(0, '4233.127')] +[2023-03-11 20:54:09,861][66031] Updated weights for policy 0, policy_version 193440 (0.0005) +[2023-03-11 20:54:14,012][65744] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9955.4). Total num frames: 99078144. Throughput: 0: 9630.0. Samples: 99056848. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 20:54:14,012][65744] Avg episode reward: [(0, '4313.183')] +[2023-03-11 20:54:14,099][66031] Updated weights for policy 0, policy_version 193520 (0.0005) +[2023-03-11 20:54:18,353][66031] Updated weights for policy 0, policy_version 193600 (0.0005) +[2023-03-11 20:54:19,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9941.5). Total num frames: 99127296. Throughput: 0: 9646.5. Samples: 99114924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:54:19,012][65744] Avg episode reward: [(0, '4195.285')] +[2023-03-11 20:54:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000193608_99127296.pth... +[2023-03-11 20:54:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000193040_98836480.pth +[2023-03-11 20:54:22,624][66031] Updated weights for policy 0, policy_version 193680 (0.0004) +[2023-03-11 20:54:24,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9941.5). Total num frames: 99176448. Throughput: 0: 9649.6. Samples: 99172416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:54:24,012][65744] Avg episode reward: [(0, '4305.187')] +[2023-03-11 20:54:26,859][66031] Updated weights for policy 0, policy_version 193760 (0.0005) +[2023-03-11 20:54:29,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9927.6). Total num frames: 99225600. Throughput: 0: 9649.6. Samples: 99201096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:54:29,012][65744] Avg episode reward: [(0, '4410.371')] +[2023-03-11 20:54:31,058][66031] Updated weights for policy 0, policy_version 193840 (0.0005) +[2023-03-11 20:54:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9927.6). Total num frames: 99274752. Throughput: 0: 9660.5. Samples: 99259056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:54:34,012][65744] Avg episode reward: [(0, '4430.448')] +[2023-03-11 20:54:34,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000193896_99274752.pth... +[2023-03-11 20:54:34,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000193328_98983936.pth +[2023-03-11 20:54:35,276][66031] Updated weights for policy 0, policy_version 193920 (0.0005) +[2023-03-11 20:54:39,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9899.8). Total num frames: 99319808. Throughput: 0: 9663.0. Samples: 99317924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:54:39,012][65744] Avg episode reward: [(0, '4207.502')] +[2023-03-11 20:54:39,501][66031] Updated weights for policy 0, policy_version 194000 (0.0005) +[2023-03-11 20:54:43,591][66031] Updated weights for policy 0, policy_version 194080 (0.0004) +[2023-03-11 20:54:44,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9913.7). Total num frames: 99373056. Throughput: 0: 9680.0. Samples: 99347620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:54:44,012][65744] Avg episode reward: [(0, '4382.576')] +[2023-03-11 20:54:47,623][66031] Updated weights for policy 0, policy_version 194160 (0.0004) +[2023-03-11 20:54:49,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 99422208. Throughput: 0: 9749.8. Samples: 99408116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:54:49,012][65744] Avg episode reward: [(0, '4248.301')] +[2023-03-11 20:54:49,015][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000194184_99422208.pth... +[2023-03-11 20:54:49,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000193608_99127296.pth +[2023-03-11 20:54:51,844][66031] Updated weights for policy 0, policy_version 194240 (0.0005) +[2023-03-11 20:54:54,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 99471360. Throughput: 0: 9760.0. Samples: 99466936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:54:54,012][65744] Avg episode reward: [(0, '4406.887')] +[2023-03-11 20:54:56,120][66031] Updated weights for policy 0, policy_version 194320 (0.0005) +[2023-03-11 20:54:59,012][65744] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9913.7). Total num frames: 99516416. Throughput: 0: 9747.4. Samples: 99495480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:54:59,012][65744] Avg episode reward: [(0, '4275.280')] +[2023-03-11 20:55:00,300][66031] Updated weights for policy 0, policy_version 194400 (0.0005) +[2023-03-11 20:55:04,012][65744] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9913.7). Total num frames: 99565568. Throughput: 0: 9746.4. Samples: 99553512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:55:04,012][65744] Avg episode reward: [(0, '4227.216')] +[2023-03-11 20:55:04,064][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000194472_99569664.pth... +[2023-03-11 20:55:04,066][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000193896_99274752.pth +[2023-03-11 20:55:04,476][66031] Updated weights for policy 0, policy_version 194480 (0.0005) +[2023-03-11 20:55:08,719][66031] Updated weights for policy 0, policy_version 194560 (0.0005) +[2023-03-11 20:55:09,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9913.7). Total num frames: 99614720. Throughput: 0: 9766.8. Samples: 99611924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:55:09,012][65744] Avg episode reward: [(0, '4328.847')] +[2023-03-11 20:55:12,941][66031] Updated weights for policy 0, policy_version 194640 (0.0005) +[2023-03-11 20:55:14,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9913.7). Total num frames: 99663872. Throughput: 0: 9772.9. Samples: 99640876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:55:14,012][65744] Avg episode reward: [(0, '4301.127')] +[2023-03-11 20:55:16,876][66031] Updated weights for policy 0, policy_version 194720 (0.0005) +[2023-03-11 20:55:19,012][65744] Fps is (10 sec: 10239.9, 60 sec: 9830.4, 300 sec: 9913.7). Total num frames: 99717120. Throughput: 0: 9859.8. Samples: 99702748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:55:19,012][65744] Avg episode reward: [(0, '4392.037')] +[2023-03-11 20:55:19,016][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000194760_99717120.pth... +[2023-03-11 20:55:19,018][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000194184_99422208.pth +[2023-03-11 20:55:20,830][66031] Updated weights for policy 0, policy_version 194800 (0.0005) +[2023-03-11 20:55:24,012][65744] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9899.8). Total num frames: 99766272. Throughput: 0: 9917.5. Samples: 99764212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:55:24,012][65744] Avg episode reward: [(0, '4301.510')] +[2023-03-11 20:55:24,785][66031] Updated weights for policy 0, policy_version 194880 (0.0005) +[2023-03-11 20:55:29,012][65744] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9899.8). Total num frames: 99815424. Throughput: 0: 9940.5. Samples: 99794944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:55:29,012][65744] Avg episode reward: [(0, '4283.835')] +[2023-03-11 20:55:29,025][66031] Updated weights for policy 0, policy_version 194960 (0.0005) +[2023-03-11 20:55:33,196][66031] Updated weights for policy 0, policy_version 195040 (0.0005) +[2023-03-11 20:55:34,012][65744] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9885.9). Total num frames: 99864576. Throughput: 0: 9883.0. Samples: 99852852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:55:34,013][65744] Avg episode reward: [(0, '4327.110')] +[2023-03-11 20:55:34,066][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000195056_99868672.pth... +[2023-03-11 20:55:34,067][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000194472_99569664.pth +[2023-03-11 20:55:37,500][66031] Updated weights for policy 0, policy_version 195120 (0.0005) +[2023-03-11 20:55:39,012][65744] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 99913728. Throughput: 0: 9863.7. Samples: 99910804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:55:39,012][65744] Avg episode reward: [(0, '4232.073')] +[2023-03-11 20:55:41,400][66031] Updated weights for policy 0, policy_version 195200 (0.0004) +[2023-03-11 20:55:44,012][65744] Fps is (10 sec: 10240.2, 60 sec: 9898.7, 300 sec: 9899.8). Total num frames: 99966976. Throughput: 0: 9933.2. Samples: 99942472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 20:55:44,012][65744] Avg episode reward: [(0, '4396.909')] +[2023-03-11 20:55:45,351][66031] Updated weights for policy 0, policy_version 195280 (0.0004) +[2023-03-11 20:55:46,971][65987] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000010 +[2023-03-11 20:55:47,394][65987] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 +[2023-03-11 20:55:47,807][65987] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 +[2023-03-11 20:55:47,808][66035] Stopping RolloutWorker_w3... +[2023-03-11 20:55:47,808][66038] Stopping RolloutWorker_w6... +[2023-03-11 20:55:47,808][66037] Stopping RolloutWorker_w4... +[2023-03-11 20:55:47,808][66101] Stopping RolloutWorker_w7... +[2023-03-11 20:55:47,808][66033] Stopping RolloutWorker_w0... +[2023-03-11 20:55:47,809][66035] Loop rollout_proc3_evt_loop terminating... +[2023-03-11 20:55:47,809][66038] Loop rollout_proc6_evt_loop terminating... +[2023-03-11 20:55:47,808][66032] Stopping RolloutWorker_w1... +[2023-03-11 20:55:47,809][66037] Loop rollout_proc4_evt_loop terminating... +[2023-03-11 20:55:47,808][66036] Stopping RolloutWorker_w5... +[2023-03-11 20:55:47,809][66101] Loop rollout_proc7_evt_loop terminating... +[2023-03-11 20:55:47,809][66033] Loop rollout_proc0_evt_loop terminating... +[2023-03-11 20:55:47,809][66036] Loop rollout_proc5_evt_loop terminating... +[2023-03-11 20:55:47,809][66032] Loop rollout_proc1_evt_loop terminating... +[2023-03-11 20:55:47,809][66034] Stopping RolloutWorker_w2... +[2023-03-11 20:55:47,808][65744] Component RolloutWorker_w3 stopped! +[2023-03-11 20:55:47,809][66034] Loop rollout_proc2_evt_loop terminating... +[2023-03-11 20:55:47,809][65744] Component RolloutWorker_w6 stopped! +[2023-03-11 20:55:47,809][65744] Component RolloutWorker_w4 stopped! +[2023-03-11 20:55:47,809][65987] Stopping Batcher_0... +[2023-03-11 20:55:47,810][65744] Component RolloutWorker_w0 stopped! +[2023-03-11 20:55:47,810][65987] Loop batcher_evt_loop terminating... +[2023-03-11 20:55:47,810][65744] Component RolloutWorker_w7 stopped! +[2023-03-11 20:55:47,810][65744] Component RolloutWorker_w1 stopped! +[2023-03-11 20:55:47,810][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000195328_100007936.pth... +[2023-03-11 20:55:47,810][65744] Component RolloutWorker_w5 stopped! +[2023-03-11 20:55:47,811][65744] Component RolloutWorker_w2 stopped! +[2023-03-11 20:55:47,811][65744] Component Batcher_0 stopped! +[2023-03-11 20:55:47,813][65987] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000194760_99717120.pth +[2023-03-11 20:55:47,813][65987] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-push-v2/checkpoint_p0/checkpoint_000195328_100007936.pth... +[2023-03-11 20:55:47,815][65987] Stopping LearnerWorker_p0... +[2023-03-11 20:55:47,816][65987] Loop learner_proc0_evt_loop terminating... +[2023-03-11 20:55:47,815][65744] Component LearnerWorker_p0 stopped! +[2023-03-11 20:55:47,867][66031] Weights refcount: 2 0 +[2023-03-11 20:55:47,868][66031] Stopping InferenceWorker_p0-w0... +[2023-03-11 20:55:47,868][66031] Loop inference_proc0-0_evt_loop terminating... +[2023-03-11 20:55:47,868][65744] Component InferenceWorker_p0-w0 stopped! +[2023-03-11 20:55:47,869][65744] Waiting for process learner_proc0 to stop... +[2023-03-11 20:55:48,251][65744] Waiting for process inference_proc0-0 to join... +[2023-03-11 20:55:48,266][65744] Waiting for process rollout_proc0 to join... +[2023-03-11 20:55:48,267][65744] Waiting for process rollout_proc1 to join... +[2023-03-11 20:55:48,267][65744] Waiting for process rollout_proc2 to join... +[2023-03-11 20:55:48,267][65744] Waiting for process rollout_proc3 to join... +[2023-03-11 20:55:48,267][65744] Waiting for process rollout_proc4 to join... +[2023-03-11 20:55:48,267][65744] Waiting for process rollout_proc5 to join... +[2023-03-11 20:55:48,268][65744] Waiting for process rollout_proc6 to join... +[2023-03-11 20:55:48,268][65744] Waiting for process rollout_proc7 to join... +[2023-03-11 20:55:48,268][65744] Batcher 0 profile tree view: +batching: 17.8949, releasing_batches: 15.2731 +[2023-03-11 20:55:48,268][65744] InferenceWorker_p0-w0 profile tree view: +wait_policy: 0.0051 + wait_policy_total: 3860.5169 +update_model: 106.9851 + weight_update: 0.0005 +one_step: 0.0006 + handle_policy_step: 5445.4968 + deserialize: 230.4512, stack: 55.5785, obs_to_device_normalize: 981.0445, forward: 2716.6669, send_messages: 383.5050 + prepare_outputs: 605.7320 + to_cpu: 95.5188 +[2023-03-11 20:55:48,268][65744] Learner 0 profile tree view: +misc: 0.0943, prepare_batch: 89.4837 +train: 1146.7785 + epoch_init: 0.3386, minibatch_init: 11.5948, losses_postprocess: 12.0797, kl_divergence: 4.3234, after_optimizer: 4.9242 + calculate_losses: 470.7139 + losses_init: 0.3445, forward_head: 231.7023, bptt_initial: 1.2325, bptt: 1.1512, tail: 111.1089, advantages_returns: 8.5813, losses: 102.8222 + update: 627.3810 + clip: 55.6283 +[2023-03-11 20:55:48,269][65744] RolloutWorker_w0 profile tree view: +wait_for_trajectories: 2.6713, enqueue_policy_requests: 125.6221, env_step: 7273.8233, overhead: 297.6027, complete_rollouts: 3.2492 +save_policy_outputs: 321.4222 + split_output_tensors: 161.0361 +[2023-03-11 20:55:48,269][65744] RolloutWorker_w7 profile tree view: +wait_for_trajectories: 2.7167, enqueue_policy_requests: 126.3232, env_step: 7323.0226, overhead: 302.4292, complete_rollouts: 3.2360 +save_policy_outputs: 323.9917 + split_output_tensors: 160.5113 +[2023-03-11 20:55:48,269][65744] Loop Runner_EvtLoop terminating... +[2023-03-11 20:55:48,269][65744] Runner profile tree view: +main_loop: 10091.8588 +[2023-03-11 20:55:48,269][65744] Collected {0: 100007936}, FPS: 9909.8