diff --git "a/sf_log.txt" "b/sf_log.txt" new file mode 100644--- /dev/null +++ "b/sf_log.txt" @@ -0,0 +1,1036 @@ +[2023-07-08 02:03:13,346][800281] Saving configuration to /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/config.json... +[2023-07-08 02:03:13,368][800281] Rollout worker 0 uses device cpu +[2023-07-08 02:03:13,369][800281] Rollout worker 1 uses device cpu +[2023-07-08 02:03:13,369][800281] Rollout worker 2 uses device cpu +[2023-07-08 02:03:13,369][800281] Rollout worker 3 uses device cpu +[2023-07-08 02:03:13,369][800281] Rollout worker 4 uses device cpu +[2023-07-08 02:03:13,370][800281] Rollout worker 5 uses device cpu +[2023-07-08 02:03:13,370][800281] Rollout worker 6 uses device cpu +[2023-07-08 02:03:13,370][800281] Rollout worker 7 uses device cpu +[2023-07-08 02:03:13,370][800281] In synchronous mode, we only accumulate one batch. Setting num_batches_to_accumulate to 1 +[2023-07-08 02:03:13,384][800281] InferenceWorker_p0-w0: min num requests: 2 +[2023-07-08 02:03:13,403][800281] Starting all processes... +[2023-07-08 02:03:13,403][800281] Starting process learner_proc0 +[2023-07-08 02:03:13,452][800281] Starting all processes... +[2023-07-08 02:03:13,494][800281] Starting process inference_proc0-0 +[2023-07-08 02:03:13,494][800281] Starting process rollout_proc0 +[2023-07-08 02:03:13,494][800281] Starting process rollout_proc1 +[2023-07-08 02:03:13,494][800281] Starting process rollout_proc2 +[2023-07-08 02:03:13,494][800281] Starting process rollout_proc3 +[2023-07-08 02:03:13,494][800281] Starting process rollout_proc4 +[2023-07-08 02:03:13,494][800281] Starting process rollout_proc5 +[2023-07-08 02:03:13,495][800281] Starting process rollout_proc6 +[2023-07-08 02:03:13,495][800281] Starting process rollout_proc7 +[2023-07-08 02:03:15,398][800572] Worker 3 uses CPU cores [12, 13, 14, 15] +[2023-07-08 02:03:15,526][800669] Worker 7 uses CPU cores [28, 29, 30, 31] +[2023-07-08 02:03:15,568][800571] Worker 1 uses CPU cores [4, 5, 6, 7] +[2023-07-08 02:03:15,623][800524] Starting seed is not provided +[2023-07-08 02:03:15,623][800524] Initializing actor-critic model on device cpu +[2023-07-08 02:03:15,623][800524] RunningMeanStd input shape: (39,) +[2023-07-08 02:03:15,623][800524] RunningMeanStd input shape: (1,) +[2023-07-08 02:03:15,678][800524] Created Actor Critic model with architecture: +[2023-07-08 02:03:15,678][800524] ActorCriticSharedWeights( + (obs_normalizer): ObservationNormalizer( + (running_mean_std): RunningMeanStdDictInPlace( + (running_mean_std): ModuleDict( + (obs): RunningMeanStdInPlace() + ) + ) + ) + (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) + (encoder): MultiInputEncoder( + (encoders): ModuleDict( + (obs): MlpEncoder( + (mlp_head): RecursiveScriptModule( + original_name=Sequential + (0): RecursiveScriptModule(original_name=Linear) + (1): RecursiveScriptModule(original_name=Tanh) + (2): RecursiveScriptModule(original_name=Linear) + (3): RecursiveScriptModule(original_name=Tanh) + ) + ) + ) + ) + (core): ModelCoreIdentity() + (decoder): MlpDecoder( + (mlp): Identity() + ) + (critic_linear): Linear(in_features=64, out_features=1, bias=True) + (action_parameterization): ActionParameterizationContinuousNonAdaptiveStddev( + (distribution_linear): Linear(in_features=64, out_features=4, bias=True) + ) +) +[2023-07-08 02:03:15,797][800605] Worker 5 uses CPU cores [20, 21, 22, 23] +[2023-07-08 02:03:15,845][800637] Worker 6 uses CPU cores [24, 25, 26, 27] +[2023-07-08 02:03:15,942][800569] Worker 0 uses CPU cores [0, 1, 2, 3] +[2023-07-08 02:03:15,974][800524] Using optimizer +[2023-07-08 02:03:15,975][800524] No checkpoints found +[2023-07-08 02:03:15,975][800524] Did not load from checkpoint, starting from scratch! +[2023-07-08 02:03:15,975][800524] Initialized policy 0 weights for model version 0 +[2023-07-08 02:03:15,976][800524] LearnerWorker_p0 finished initialization! +[2023-07-08 02:03:15,977][800568] RunningMeanStd input shape: (39,) +[2023-07-08 02:03:15,978][800568] RunningMeanStd input shape: (1,) +[2023-07-08 02:03:16,033][800281] Inference worker 0-0 is ready! +[2023-07-08 02:03:16,034][800281] All inference workers are ready! Signal rollout workers to start! +[2023-07-08 02:03:16,043][800573] Worker 4 uses CPU cores [16, 17, 18, 19] +[2023-07-08 02:03:16,148][800570] Worker 2 uses CPU cores [8, 9, 10, 11] +[2023-07-08 02:03:19,707][800571] Decorrelating experience for 0 frames... +[2023-07-08 02:03:19,708][800605] Decorrelating experience for 0 frames... +[2023-07-08 02:03:19,720][800571] Decorrelating experience for 64 frames... +[2023-07-08 02:03:19,720][800605] Decorrelating experience for 64 frames... +[2023-07-08 02:03:19,725][800569] Decorrelating experience for 0 frames... +[2023-07-08 02:03:19,728][800669] Decorrelating experience for 0 frames... +[2023-07-08 02:03:19,729][800572] Decorrelating experience for 0 frames... +[2023-07-08 02:03:19,738][800569] Decorrelating experience for 64 frames... +[2023-07-08 02:03:19,740][800669] Decorrelating experience for 64 frames... +[2023-07-08 02:03:19,742][800572] Decorrelating experience for 64 frames... +[2023-07-08 02:03:19,747][800573] Decorrelating experience for 0 frames... +[2023-07-08 02:03:19,748][800637] Decorrelating experience for 0 frames... +[2023-07-08 02:03:19,751][800571] Decorrelating experience for 128 frames... +[2023-07-08 02:03:19,751][800605] Decorrelating experience for 128 frames... +[2023-07-08 02:03:19,759][800573] Decorrelating experience for 64 frames... +[2023-07-08 02:03:19,760][800637] Decorrelating experience for 64 frames... +[2023-07-08 02:03:19,769][800569] Decorrelating experience for 128 frames... +[2023-07-08 02:03:19,772][800669] Decorrelating experience for 128 frames... +[2023-07-08 02:03:19,773][800572] Decorrelating experience for 128 frames... +[2023-07-08 02:03:19,790][800573] Decorrelating experience for 128 frames... +[2023-07-08 02:03:19,792][800637] Decorrelating experience for 128 frames... +[2023-07-08 02:03:19,813][800605] Decorrelating experience for 192 frames... +[2023-07-08 02:03:19,813][800571] Decorrelating experience for 192 frames... +[2023-07-08 02:03:19,831][800569] Decorrelating experience for 192 frames... +[2023-07-08 02:03:19,834][800669] Decorrelating experience for 192 frames... +[2023-07-08 02:03:19,835][800572] Decorrelating experience for 192 frames... +[2023-07-08 02:03:19,851][800573] Decorrelating experience for 192 frames... +[2023-07-08 02:03:19,854][800637] Decorrelating experience for 192 frames... +[2023-07-08 02:03:19,895][800570] Decorrelating experience for 0 frames... +[2023-07-08 02:03:19,907][800570] Decorrelating experience for 64 frames... +[2023-07-08 02:03:19,938][800570] Decorrelating experience for 128 frames... +[2023-07-08 02:03:20,001][800570] Decorrelating experience for 192 frames... +[2023-07-08 02:03:20,740][800281] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) +[2023-07-08 02:03:23,436][800605] Decorrelating experience for 256 frames... +[2023-07-08 02:03:23,439][800571] Decorrelating experience for 256 frames... +[2023-07-08 02:03:23,471][800569] Decorrelating experience for 256 frames... +[2023-07-08 02:03:23,481][800573] Decorrelating experience for 256 frames... +[2023-07-08 02:03:23,482][800572] Decorrelating experience for 256 frames... +[2023-07-08 02:03:23,483][800669] Decorrelating experience for 256 frames... +[2023-07-08 02:03:23,522][800637] Decorrelating experience for 256 frames... +[2023-07-08 02:03:23,546][800605] Decorrelating experience for 320 frames... +[2023-07-08 02:03:23,551][800571] Decorrelating experience for 320 frames... +[2023-07-08 02:03:23,582][800569] Decorrelating experience for 320 frames... +[2023-07-08 02:03:23,592][800573] Decorrelating experience for 320 frames... +[2023-07-08 02:03:23,593][800669] Decorrelating experience for 320 frames... +[2023-07-08 02:03:23,593][800572] Decorrelating experience for 320 frames... +[2023-07-08 02:03:23,632][800637] Decorrelating experience for 320 frames... +[2023-07-08 02:03:23,645][800570] Decorrelating experience for 256 frames... +[2023-07-08 02:03:23,686][800605] Decorrelating experience for 384 frames... +[2023-07-08 02:03:23,691][800571] Decorrelating experience for 384 frames... +[2023-07-08 02:03:23,722][800569] Decorrelating experience for 384 frames... +[2023-07-08 02:03:23,733][800669] Decorrelating experience for 384 frames... +[2023-07-08 02:03:23,733][800573] Decorrelating experience for 384 frames... +[2023-07-08 02:03:23,734][800572] Decorrelating experience for 384 frames... +[2023-07-08 02:03:23,755][800570] Decorrelating experience for 320 frames... +[2023-07-08 02:03:23,773][800637] Decorrelating experience for 384 frames... +[2023-07-08 02:03:23,846][800605] Decorrelating experience for 448 frames... +[2023-07-08 02:03:23,849][800571] Decorrelating experience for 448 frames... +[2023-07-08 02:03:23,879][800569] Decorrelating experience for 448 frames... +[2023-07-08 02:03:23,891][800669] Decorrelating experience for 448 frames... +[2023-07-08 02:03:23,893][800573] Decorrelating experience for 448 frames... +[2023-07-08 02:03:23,895][800570] Decorrelating experience for 384 frames... +[2023-07-08 02:03:23,896][800572] Decorrelating experience for 448 frames... +[2023-07-08 02:03:23,933][800637] Decorrelating experience for 448 frames... +[2023-07-08 02:03:24,059][800570] Decorrelating experience for 448 frames... +[2023-07-08 02:03:25,740][800281] Fps is (10 sec: 2457.6, 60 sec: 2457.6, 300 sec: 2457.6). Total num frames: 12288. Throughput: 0: 1397.6. Samples: 6988. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 02:03:25,740][800281] Avg episode reward: [(0, '255.409')] +[2023-07-08 02:03:25,743][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000000024_12288.pth... +[2023-07-08 02:03:27,922][800568] Updated weights for policy 0, policy_version 80 (0.0005) +[2023-07-08 02:03:30,740][800281] Fps is (10 sec: 7372.9, 60 sec: 7372.9, 300 sec: 7372.9). Total num frames: 73728. Throughput: 0: 4296.4. Samples: 42964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:03:30,740][800281] Avg episode reward: [(0, '403.784')] +[2023-07-08 02:03:31,381][800568] Updated weights for policy 0, policy_version 160 (0.0005) +[2023-07-08 02:03:33,379][800281] Heartbeat connected on Batcher_0 +[2023-07-08 02:03:33,381][800281] Heartbeat connected on LearnerWorker_p0 +[2023-07-08 02:03:33,385][800281] Heartbeat connected on InferenceWorker_p0-w0 +[2023-07-08 02:03:33,389][800281] Heartbeat connected on RolloutWorker_w0 +[2023-07-08 02:03:33,395][800281] Heartbeat connected on RolloutWorker_w3 +[2023-07-08 02:03:33,399][800281] Heartbeat connected on RolloutWorker_w1 +[2023-07-08 02:03:33,400][800281] Heartbeat connected on RolloutWorker_w5 +[2023-07-08 02:03:33,401][800281] Heartbeat connected on RolloutWorker_w6 +[2023-07-08 02:03:33,401][800281] Heartbeat connected on RolloutWorker_w2 +[2023-07-08 02:03:33,401][800281] Heartbeat connected on RolloutWorker_w7 +[2023-07-08 02:03:33,409][800281] Heartbeat connected on RolloutWorker_w4 +[2023-07-08 02:03:35,033][800568] Updated weights for policy 0, policy_version 240 (0.0005) +[2023-07-08 02:03:35,740][800281] Fps is (10 sec: 11468.8, 60 sec: 8465.1, 300 sec: 8465.1). Total num frames: 126976. Throughput: 0: 7509.6. Samples: 112644. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 02:03:35,741][800281] Avg episode reward: [(0, '490.489')] +[2023-07-08 02:03:35,782][800524] Saving new best policy, reward=490.489! +[2023-07-08 02:03:38,426][800568] Updated weights for policy 0, policy_version 320 (0.0004) +[2023-07-08 02:03:40,740][800281] Fps is (10 sec: 11468.7, 60 sec: 9420.8, 300 sec: 9420.8). Total num frames: 188416. Throughput: 0: 9125.6. Samples: 182512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:03:40,741][800281] Avg episode reward: [(0, '494.483')] +[2023-07-08 02:03:40,745][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000000368_188416.pth... +[2023-07-08 02:03:40,749][800524] Saving new best policy, reward=494.483! +[2023-07-08 02:03:42,023][800568] Updated weights for policy 0, policy_version 400 (0.0005) +[2023-07-08 02:03:45,740][800281] Fps is (10 sec: 11468.8, 60 sec: 9666.6, 300 sec: 9666.6). Total num frames: 241664. Throughput: 0: 8669.3. Samples: 216732. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 02:03:45,741][800281] Avg episode reward: [(0, '295.394')] +[2023-07-08 02:03:45,790][800568] Updated weights for policy 0, policy_version 480 (0.0005) +[2023-07-08 02:03:49,760][800568] Updated weights for policy 0, policy_version 560 (0.0005) +[2023-07-08 02:03:50,740][800281] Fps is (10 sec: 10649.6, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 294912. Throughput: 0: 9313.6. Samples: 279408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:03:50,741][800281] Avg episode reward: [(0, '254.941')] +[2023-07-08 02:03:54,224][800568] Updated weights for policy 0, policy_version 640 (0.0005) +[2023-07-08 02:03:55,740][800281] Fps is (10 sec: 9830.3, 60 sec: 9713.4, 300 sec: 9713.4). Total num frames: 339968. Throughput: 0: 9596.6. Samples: 335880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:03:55,741][800281] Avg episode reward: [(0, '253.653')] +[2023-07-08 02:03:55,743][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000000664_339968.pth... +[2023-07-08 02:03:55,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000000024_12288.pth +[2023-07-08 02:03:58,486][800568] Updated weights for policy 0, policy_version 720 (0.0005) +[2023-07-08 02:04:00,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9728.0, 300 sec: 9728.0). Total num frames: 389120. Throughput: 0: 9115.2. Samples: 364608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:04:00,741][800281] Avg episode reward: [(0, '213.157')] +[2023-07-08 02:04:02,781][800568] Updated weights for policy 0, policy_version 800 (0.0005) +[2023-07-08 02:04:05,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9648.4, 300 sec: 9648.4). Total num frames: 434176. Throughput: 0: 9374.5. Samples: 421852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:04:05,741][800281] Avg episode reward: [(0, '188.266')] +[2023-07-08 02:04:07,226][800568] Updated weights for policy 0, policy_version 880 (0.0005) +[2023-07-08 02:04:10,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9666.6, 300 sec: 9666.6). Total num frames: 483328. Throughput: 0: 10465.3. Samples: 477928. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:04:10,740][800281] Avg episode reward: [(0, '191.301')] +[2023-07-08 02:04:10,743][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000000944_483328.pth... +[2023-07-08 02:04:10,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000000368_188416.pth +[2023-07-08 02:04:11,345][800568] Updated weights for policy 0, policy_version 960 (0.0005) +[2023-07-08 02:04:15,413][800568] Updated weights for policy 0, policy_version 1040 (0.0005) +[2023-07-08 02:04:15,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9681.5, 300 sec: 9681.5). Total num frames: 532480. Throughput: 0: 10338.2. Samples: 508184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:04:15,741][800281] Avg episode reward: [(0, '185.356')] +[2023-07-08 02:04:19,829][800568] Updated weights for policy 0, policy_version 1120 (0.0006) +[2023-07-08 02:04:20,740][800281] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9693.9). Total num frames: 581632. Throughput: 0: 10092.8. Samples: 566820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:04:20,741][800281] Avg episode reward: [(0, '189.077')] +[2023-07-08 02:04:24,240][800568] Updated weights for policy 0, policy_version 1200 (0.0006) +[2023-07-08 02:04:25,740][800281] Fps is (10 sec: 9420.8, 60 sec: 10240.0, 300 sec: 9641.4). Total num frames: 626688. Throughput: 0: 9779.6. Samples: 622592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:04:25,741][800281] Avg episode reward: [(0, '189.836')] +[2023-07-08 02:04:25,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000001224_626688.pth... +[2023-07-08 02:04:25,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000000664_339968.pth +[2023-07-08 02:04:28,621][800568] Updated weights for policy 0, policy_version 1280 (0.0006) +[2023-07-08 02:04:30,740][800281] Fps is (10 sec: 9011.3, 60 sec: 9966.9, 300 sec: 9596.4). Total num frames: 671744. Throughput: 0: 9651.0. Samples: 651028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:04:30,741][800281] Avg episode reward: [(0, '188.180')] +[2023-07-08 02:04:32,892][800568] Updated weights for policy 0, policy_version 1360 (0.0006) +[2023-07-08 02:04:35,740][800281] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 9612.0). Total num frames: 720896. Throughput: 0: 9510.0. Samples: 707356. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 02:04:35,741][800281] Avg episode reward: [(0, '184.690')] +[2023-07-08 02:04:37,244][800568] Updated weights for policy 0, policy_version 1440 (0.0006) +[2023-07-08 02:04:40,740][800281] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9625.6). Total num frames: 770048. Throughput: 0: 9545.8. Samples: 765440. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:04:40,741][800281] Avg episode reward: [(0, '185.849')] +[2023-07-08 02:04:40,743][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000001504_770048.pth... +[2023-07-08 02:04:40,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000000944_483328.pth +[2023-07-08 02:04:41,443][800568] Updated weights for policy 0, policy_version 1520 (0.0005) +[2023-07-08 02:04:45,555][800568] Updated weights for policy 0, policy_version 1600 (0.0006) +[2023-07-08 02:04:45,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9637.7). Total num frames: 819200. Throughput: 0: 9557.3. Samples: 794688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:04:45,740][800281] Avg episode reward: [(0, '183.519')] +[2023-07-08 02:04:49,676][800568] Updated weights for policy 0, policy_version 1680 (0.0005) +[2023-07-08 02:04:50,740][800281] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9648.4). Total num frames: 868352. Throughput: 0: 9599.5. Samples: 853828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:04:50,740][800281] Avg episode reward: [(0, '180.911')] +[2023-07-08 02:04:53,654][800568] Updated weights for policy 0, policy_version 1760 (0.0005) +[2023-07-08 02:04:55,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9657.9). Total num frames: 917504. Throughput: 0: 9692.9. Samples: 914108. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:04:55,740][800281] Avg episode reward: [(0, '180.071')] +[2023-07-08 02:04:55,743][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000001792_917504.pth... +[2023-07-08 02:04:55,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000001224_626688.pth +[2023-07-08 02:04:57,810][800568] Updated weights for policy 0, policy_version 1840 (0.0005) +[2023-07-08 02:05:00,740][800281] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9707.5). Total num frames: 970752. Throughput: 0: 9701.3. Samples: 944744. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:05:00,741][800281] Avg episode reward: [(0, '182.439')] +[2023-07-08 02:05:01,973][800568] Updated weights for policy 0, policy_version 1920 (0.0005) +[2023-07-08 02:05:05,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9674.4). Total num frames: 1015808. Throughput: 0: 9703.0. Samples: 1003452. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:05:05,741][800281] Avg episode reward: [(0, '183.962')] +[2023-07-08 02:05:06,355][800568] Updated weights for policy 0, policy_version 2000 (0.0005) +[2023-07-08 02:05:10,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9625.6, 300 sec: 9644.2). Total num frames: 1060864. Throughput: 0: 9683.7. Samples: 1058360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:05:10,740][800281] Avg episode reward: [(0, '183.798')] +[2023-07-08 02:05:10,794][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000002080_1064960.pth... +[2023-07-08 02:05:10,794][800568] Updated weights for policy 0, policy_version 2080 (0.0005) +[2023-07-08 02:05:10,796][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000001504_770048.pth +[2023-07-08 02:05:15,188][800568] Updated weights for policy 0, policy_version 2160 (0.0006) +[2023-07-08 02:05:15,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9652.3). Total num frames: 1110016. Throughput: 0: 9669.1. Samples: 1086136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:05:15,741][800281] Avg episode reward: [(0, '181.960')] +[2023-07-08 02:05:19,677][800568] Updated weights for policy 0, policy_version 2240 (0.0005) +[2023-07-08 02:05:20,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9557.4, 300 sec: 9625.6). Total num frames: 1155072. Throughput: 0: 9658.0. Samples: 1141964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:05:20,741][800281] Avg episode reward: [(0, '179.147')] +[2023-07-08 02:05:24,081][800568] Updated weights for policy 0, policy_version 2320 (0.0005) +[2023-07-08 02:05:25,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9601.0). Total num frames: 1200128. Throughput: 0: 9597.2. Samples: 1197312. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:05:25,740][800281] Avg episode reward: [(0, '181.934')] +[2023-07-08 02:05:25,743][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000002344_1200128.pth... +[2023-07-08 02:05:25,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000001792_917504.pth +[2023-07-08 02:05:28,444][800568] Updated weights for policy 0, policy_version 2400 (0.0005) +[2023-07-08 02:05:30,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9609.9). Total num frames: 1249280. Throughput: 0: 9558.8. Samples: 1224832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:05:30,740][800281] Avg episode reward: [(0, '182.058')] +[2023-07-08 02:05:32,754][800568] Updated weights for policy 0, policy_version 2480 (0.0005) +[2023-07-08 02:05:35,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9587.7). Total num frames: 1294336. Throughput: 0: 9517.6. Samples: 1282120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:05:35,740][800281] Avg episode reward: [(0, '183.479')] +[2023-07-08 02:05:37,067][800568] Updated weights for policy 0, policy_version 2560 (0.0005) +[2023-07-08 02:05:40,740][800281] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9596.3). Total num frames: 1343488. Throughput: 0: 9434.8. Samples: 1338676. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 02:05:40,741][800281] Avg episode reward: [(0, '182.075')] +[2023-07-08 02:05:40,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000002624_1343488.pth... +[2023-07-08 02:05:40,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000002080_1064960.pth +[2023-07-08 02:05:41,503][800568] Updated weights for policy 0, policy_version 2640 (0.0005) +[2023-07-08 02:05:45,740][800281] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9576.2). Total num frames: 1388544. Throughput: 0: 9391.6. Samples: 1367364. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 02:05:45,740][800281] Avg episode reward: [(0, '184.190')] +[2023-07-08 02:05:45,927][800568] Updated weights for policy 0, policy_version 2720 (0.0005) +[2023-07-08 02:05:50,357][800568] Updated weights for policy 0, policy_version 2800 (0.0005) +[2023-07-08 02:05:50,740][800281] Fps is (10 sec: 9011.3, 60 sec: 9420.8, 300 sec: 9557.3). Total num frames: 1433600. Throughput: 0: 9293.7. Samples: 1421668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:05:50,741][800281] Avg episode reward: [(0, '185.207')] +[2023-07-08 02:05:54,757][800568] Updated weights for policy 0, policy_version 2880 (0.0005) +[2023-07-08 02:05:55,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9566.1). Total num frames: 1482752. Throughput: 0: 9322.8. Samples: 1477888. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:05:55,740][800281] Avg episode reward: [(0, '202.509')] +[2023-07-08 02:05:55,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000002896_1482752.pth... +[2023-07-08 02:05:55,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000002344_1200128.pth +[2023-07-08 02:05:59,166][800568] Updated weights for policy 0, policy_version 2960 (0.0005) +[2023-07-08 02:06:00,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9548.8). Total num frames: 1527808. Throughput: 0: 9329.0. Samples: 1505940. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:06:00,741][800281] Avg episode reward: [(0, '181.255')] +[2023-07-08 02:06:03,645][800568] Updated weights for policy 0, policy_version 3040 (0.0005) +[2023-07-08 02:06:05,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9532.5). Total num frames: 1572864. Throughput: 0: 9303.9. Samples: 1560640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:06:05,740][800281] Avg episode reward: [(0, '181.657')] +[2023-07-08 02:06:08,049][800568] Updated weights for policy 0, policy_version 3120 (0.0005) +[2023-07-08 02:06:10,740][800281] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9541.3). Total num frames: 1622016. Throughput: 0: 9346.0. Samples: 1617884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:06:10,741][800281] Avg episode reward: [(0, '180.372')] +[2023-07-08 02:06:10,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000003168_1622016.pth... +[2023-07-08 02:06:10,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000002624_1343488.pth +[2023-07-08 02:06:12,255][800568] Updated weights for policy 0, policy_version 3200 (0.0005) +[2023-07-08 02:06:15,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9549.5). Total num frames: 1671168. Throughput: 0: 9372.6. Samples: 1646600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:06:15,741][800281] Avg episode reward: [(0, '183.959')] +[2023-07-08 02:06:16,362][800568] Updated weights for policy 0, policy_version 3280 (0.0005) +[2023-07-08 02:06:20,572][800568] Updated weights for policy 0, policy_version 3360 (0.0005) +[2023-07-08 02:06:20,740][800281] Fps is (10 sec: 9830.5, 60 sec: 9420.8, 300 sec: 9557.3). Total num frames: 1720320. Throughput: 0: 9411.3. Samples: 1705628. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 02:06:20,741][800281] Avg episode reward: [(0, '180.563')] +[2023-07-08 02:06:24,783][800568] Updated weights for policy 0, policy_version 3440 (0.0005) +[2023-07-08 02:06:25,740][800281] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9564.7). Total num frames: 1769472. Throughput: 0: 9459.7. Samples: 1764360. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:06:25,741][800281] Avg episode reward: [(0, '181.849')] +[2023-07-08 02:06:25,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000003456_1769472.pth... +[2023-07-08 02:06:25,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000002896_1482752.pth +[2023-07-08 02:06:29,155][800568] Updated weights for policy 0, policy_version 3520 (0.0005) +[2023-07-08 02:06:30,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9550.1). Total num frames: 1814528. Throughput: 0: 9446.2. Samples: 1792444. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:06:30,741][800281] Avg episode reward: [(0, '180.054')] +[2023-07-08 02:06:33,396][800568] Updated weights for policy 0, policy_version 3600 (0.0004) +[2023-07-08 02:06:35,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9489.0, 300 sec: 9557.3). Total num frames: 1863680. Throughput: 0: 9524.1. Samples: 1850252. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:06:35,741][800281] Avg episode reward: [(0, '186.738')] +[2023-07-08 02:06:37,578][800568] Updated weights for policy 0, policy_version 3680 (0.0005) +[2023-07-08 02:06:40,740][800281] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9564.2). Total num frames: 1912832. Throughput: 0: 9564.6. Samples: 1908296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:06:40,741][800281] Avg episode reward: [(0, '183.351')] +[2023-07-08 02:06:40,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000003736_1912832.pth... +[2023-07-08 02:06:40,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000003168_1622016.pth +[2023-07-08 02:06:41,812][800568] Updated weights for policy 0, policy_version 3760 (0.0004) +[2023-07-08 02:06:45,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9570.7). Total num frames: 1961984. Throughput: 0: 9588.2. Samples: 1937408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:06:45,741][800281] Avg episode reward: [(0, '177.952')] +[2023-07-08 02:06:46,047][800568] Updated weights for policy 0, policy_version 3840 (0.0005) +[2023-07-08 02:06:50,052][800568] Updated weights for policy 0, policy_version 3920 (0.0004) +[2023-07-08 02:06:50,740][800281] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9576.8). Total num frames: 2011136. Throughput: 0: 9697.6. Samples: 1997032. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 02:06:50,740][800281] Avg episode reward: [(0, '182.998')] +[2023-07-08 02:06:54,301][800568] Updated weights for policy 0, policy_version 4000 (0.0005) +[2023-07-08 02:06:55,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9582.7). Total num frames: 2060288. Throughput: 0: 9730.1. Samples: 2055740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:06:55,741][800281] Avg episode reward: [(0, '178.321')] +[2023-07-08 02:06:55,743][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000004024_2060288.pth... +[2023-07-08 02:06:55,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000003456_1769472.pth +[2023-07-08 02:06:58,419][800568] Updated weights for policy 0, policy_version 4080 (0.0005) +[2023-07-08 02:07:00,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9588.4). Total num frames: 2109440. Throughput: 0: 9743.0. Samples: 2085036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:07:00,741][800281] Avg episode reward: [(0, '179.506')] +[2023-07-08 02:07:02,909][800568] Updated weights for policy 0, policy_version 4160 (0.0005) +[2023-07-08 02:07:05,740][800281] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9575.5). Total num frames: 2154496. Throughput: 0: 9687.7. Samples: 2141576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:07:05,741][800281] Avg episode reward: [(0, '179.681')] +[2023-07-08 02:07:07,266][800568] Updated weights for policy 0, policy_version 4240 (0.0005) +[2023-07-08 02:07:10,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9625.6, 300 sec: 9563.3). Total num frames: 2199552. Throughput: 0: 9611.5. Samples: 2196876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:07:10,740][800281] Avg episode reward: [(0, '181.153')] +[2023-07-08 02:07:10,742][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000004296_2199552.pth... +[2023-07-08 02:07:10,745][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000003736_1912832.pth +[2023-07-08 02:07:11,764][800568] Updated weights for policy 0, policy_version 4320 (0.0005) +[2023-07-08 02:07:15,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9569.0). Total num frames: 2248704. Throughput: 0: 9599.8. Samples: 2224436. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 02:07:15,741][800281] Avg episode reward: [(0, '179.450')] +[2023-07-08 02:07:16,068][800568] Updated weights for policy 0, policy_version 4400 (0.0005) +[2023-07-08 02:07:20,478][800568] Updated weights for policy 0, policy_version 4480 (0.0005) +[2023-07-08 02:07:20,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9557.3). Total num frames: 2293760. Throughput: 0: 9571.2. Samples: 2280956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:07:20,740][800281] Avg episode reward: [(0, '179.851')] +[2023-07-08 02:07:24,688][800568] Updated weights for policy 0, policy_version 4560 (0.0006) +[2023-07-08 02:07:25,740][800281] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9562.9). Total num frames: 2342912. Throughput: 0: 9565.7. Samples: 2338752. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:07:25,740][800281] Avg episode reward: [(0, '190.250')] +[2023-07-08 02:07:25,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000004576_2342912.pth... +[2023-07-08 02:07:25,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000004024_2060288.pth +[2023-07-08 02:07:28,781][800568] Updated weights for policy 0, policy_version 4640 (0.0006) +[2023-07-08 02:07:30,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9568.3). Total num frames: 2392064. Throughput: 0: 9583.2. Samples: 2368652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:07:30,740][800281] Avg episode reward: [(0, '187.128')] +[2023-07-08 02:07:33,103][800568] Updated weights for policy 0, policy_version 4720 (0.0005) +[2023-07-08 02:07:35,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9557.3). Total num frames: 2437120. Throughput: 0: 9518.5. Samples: 2425364. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:07:35,741][800281] Avg episode reward: [(0, '182.792')] +[2023-07-08 02:07:37,647][800568] Updated weights for policy 0, policy_version 4800 (0.0005) +[2023-07-08 02:07:40,740][800281] Fps is (10 sec: 9011.1, 60 sec: 9489.1, 300 sec: 9546.8). Total num frames: 2482176. Throughput: 0: 9416.9. Samples: 2479500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:07:40,740][800281] Avg episode reward: [(0, '186.667')] +[2023-07-08 02:07:40,772][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000004856_2486272.pth... +[2023-07-08 02:07:40,775][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000004296_2199552.pth +[2023-07-08 02:07:42,155][800568] Updated weights for policy 0, policy_version 4880 (0.0005) +[2023-07-08 02:07:45,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9552.2). Total num frames: 2531328. Throughput: 0: 9373.1. Samples: 2506824. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 02:07:45,740][800281] Avg episode reward: [(0, '183.576')] +[2023-07-08 02:07:46,663][800568] Updated weights for policy 0, policy_version 4960 (0.0005) +[2023-07-08 02:07:50,740][800281] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9542.2). Total num frames: 2576384. Throughput: 0: 9347.5. Samples: 2562212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:07:50,741][800281] Avg episode reward: [(0, '190.650')] +[2023-07-08 02:07:51,023][800568] Updated weights for policy 0, policy_version 5040 (0.0005) +[2023-07-08 02:07:55,351][800568] Updated weights for policy 0, policy_version 5120 (0.0005) +[2023-07-08 02:07:55,740][800281] Fps is (10 sec: 9011.1, 60 sec: 9352.5, 300 sec: 9532.5). Total num frames: 2621440. Throughput: 0: 9372.7. Samples: 2618648. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:07:55,741][800281] Avg episode reward: [(0, '182.745')] +[2023-07-08 02:07:55,795][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000005128_2625536.pth... +[2023-07-08 02:07:55,797][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000004576_2342912.pth +[2023-07-08 02:07:59,819][800568] Updated weights for policy 0, policy_version 5200 (0.0005) +[2023-07-08 02:08:00,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9523.2). Total num frames: 2666496. Throughput: 0: 9376.4. Samples: 2646372. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:08:00,741][800281] Avg episode reward: [(0, '180.829')] +[2023-07-08 02:08:04,095][800568] Updated weights for policy 0, policy_version 5280 (0.0005) +[2023-07-08 02:08:05,740][800281] Fps is (10 sec: 9830.6, 60 sec: 9420.8, 300 sec: 9543.0). Total num frames: 2719744. Throughput: 0: 9386.7. Samples: 2703356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:08:05,741][800281] Avg episode reward: [(0, '177.754')] +[2023-07-08 02:08:08,367][800568] Updated weights for policy 0, policy_version 5360 (0.0005) +[2023-07-08 02:08:10,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9533.8). Total num frames: 2764800. Throughput: 0: 9375.0. Samples: 2760628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:08:10,741][800281] Avg episode reward: [(0, '178.035')] +[2023-07-08 02:08:10,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000005400_2764800.pth... +[2023-07-08 02:08:10,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000004856_2486272.pth +[2023-07-08 02:08:12,825][800568] Updated weights for policy 0, policy_version 5440 (0.0005) +[2023-07-08 02:08:15,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9524.9). Total num frames: 2809856. Throughput: 0: 9312.0. Samples: 2787692. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 02:08:15,740][800281] Avg episode reward: [(0, '177.686')] +[2023-07-08 02:08:16,965][800568] Updated weights for policy 0, policy_version 5520 (0.0005) +[2023-07-08 02:08:20,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9649.9). Total num frames: 2859008. Throughput: 0: 9356.1. Samples: 2846388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:08:20,740][800281] Avg episode reward: [(0, '177.748')] +[2023-07-08 02:08:21,197][800568] Updated weights for policy 0, policy_version 5600 (0.0005) +[2023-07-08 02:08:25,288][800568] Updated weights for policy 0, policy_version 5680 (0.0005) +[2023-07-08 02:08:25,740][800281] Fps is (10 sec: 10239.9, 60 sec: 9489.1, 300 sec: 9622.1). Total num frames: 2912256. Throughput: 0: 9474.6. Samples: 2905856. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 02:08:25,741][800281] Avg episode reward: [(0, '180.044')] +[2023-07-08 02:08:25,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000005688_2912256.pth... +[2023-07-08 02:08:25,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000005128_2625536.pth +[2023-07-08 02:08:29,550][800568] Updated weights for policy 0, policy_version 5760 (0.0005) +[2023-07-08 02:08:30,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9594.4). Total num frames: 2957312. Throughput: 0: 9508.1. Samples: 2934688. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:08:30,741][800281] Avg episode reward: [(0, '180.134')] +[2023-07-08 02:08:33,714][800568] Updated weights for policy 0, policy_version 5840 (0.0005) +[2023-07-08 02:08:35,740][800281] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9552.7). Total num frames: 3006464. Throughput: 0: 9593.2. Samples: 2993908. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 02:08:35,741][800281] Avg episode reward: [(0, '182.447')] +[2023-07-08 02:08:38,035][800568] Updated weights for policy 0, policy_version 5920 (0.0005) +[2023-07-08 02:08:40,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9524.9). Total num frames: 3051520. Throughput: 0: 9583.2. Samples: 3049892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:08:40,741][800281] Avg episode reward: [(0, '182.213')] +[2023-07-08 02:08:40,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000005968_3055616.pth... +[2023-07-08 02:08:40,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000005400_2764800.pth +[2023-07-08 02:08:42,506][800568] Updated weights for policy 0, policy_version 6000 (0.0005) +[2023-07-08 02:08:45,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9511.1). Total num frames: 3100672. Throughput: 0: 9569.5. Samples: 3077000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:08:45,740][800281] Avg episode reward: [(0, '178.846')] +[2023-07-08 02:08:46,919][800568] Updated weights for policy 0, policy_version 6080 (0.0005) +[2023-07-08 02:08:50,740][800281] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9511.1). Total num frames: 3145728. Throughput: 0: 9555.1. Samples: 3133336. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 02:08:50,740][800281] Avg episode reward: [(0, '179.179')] +[2023-07-08 02:08:51,375][800568] Updated weights for policy 0, policy_version 6160 (0.0005) +[2023-07-08 02:08:55,740][800281] Fps is (10 sec: 9011.1, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 3190784. Throughput: 0: 9492.0. Samples: 3187768. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:08:55,741][800281] Avg episode reward: [(0, '179.740')] +[2023-07-08 02:08:55,772][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000006240_3194880.pth... +[2023-07-08 02:08:55,772][800568] Updated weights for policy 0, policy_version 6240 (0.0005) +[2023-07-08 02:08:55,774][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000005688_2912256.pth +[2023-07-08 02:09:00,063][800568] Updated weights for policy 0, policy_version 6320 (0.0005) +[2023-07-08 02:09:00,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9511.1). Total num frames: 3239936. Throughput: 0: 9551.6. Samples: 3217516. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 02:09:00,740][800281] Avg episode reward: [(0, '178.086')] +[2023-07-08 02:09:04,237][800568] Updated weights for policy 0, policy_version 6400 (0.0005) +[2023-07-08 02:09:05,740][800281] Fps is (10 sec: 9830.6, 60 sec: 9489.1, 300 sec: 9511.1). Total num frames: 3289088. Throughput: 0: 9528.1. Samples: 3275152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:09:05,740][800281] Avg episode reward: [(0, '176.702')] +[2023-07-08 02:09:08,362][800568] Updated weights for policy 0, policy_version 6480 (0.0005) +[2023-07-08 02:09:10,740][800281] Fps is (10 sec: 9830.3, 60 sec: 9557.3, 300 sec: 9511.1). Total num frames: 3338240. Throughput: 0: 9518.9. Samples: 3334208. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:09:10,741][800281] Avg episode reward: [(0, '178.733')] +[2023-07-08 02:09:10,743][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000006520_3338240.pth... +[2023-07-08 02:09:10,745][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000005968_3055616.pth +[2023-07-08 02:09:12,673][800568] Updated weights for policy 0, policy_version 6560 (0.0005) +[2023-07-08 02:09:15,740][800281] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9497.2). Total num frames: 3383296. Throughput: 0: 9513.9. Samples: 3362812. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:09:15,741][800281] Avg episode reward: [(0, '178.055')] +[2023-07-08 02:09:17,137][800568] Updated weights for policy 0, policy_version 6640 (0.0006) +[2023-07-08 02:09:20,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9511.1). Total num frames: 3432448. Throughput: 0: 9421.9. Samples: 3417892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:09:20,741][800281] Avg episode reward: [(0, '177.503')] +[2023-07-08 02:09:21,423][800568] Updated weights for policy 0, policy_version 6720 (0.0005) +[2023-07-08 02:09:25,552][800568] Updated weights for policy 0, policy_version 6800 (0.0005) +[2023-07-08 02:09:25,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9524.9). Total num frames: 3481600. Throughput: 0: 9497.2. Samples: 3477268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:09:25,741][800281] Avg episode reward: [(0, '177.435')] +[2023-07-08 02:09:25,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000006800_3481600.pth... +[2023-07-08 02:09:25,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000006240_3194880.pth +[2023-07-08 02:09:29,563][800568] Updated weights for policy 0, policy_version 6880 (0.0005) +[2023-07-08 02:09:30,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9524.9). Total num frames: 3530752. Throughput: 0: 9572.9. Samples: 3507780. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:09:30,741][800281] Avg episode reward: [(0, '178.237')] +[2023-07-08 02:09:33,691][800568] Updated weights for policy 0, policy_version 6960 (0.0005) +[2023-07-08 02:09:35,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9524.9). Total num frames: 3579904. Throughput: 0: 9648.9. Samples: 3567536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:09:35,741][800281] Avg episode reward: [(0, '176.214')] +[2023-07-08 02:09:37,947][800568] Updated weights for policy 0, policy_version 7040 (0.0005) +[2023-07-08 02:09:40,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9524.9). Total num frames: 3629056. Throughput: 0: 9715.6. Samples: 3624968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:09:40,740][800281] Avg episode reward: [(0, '177.275')] +[2023-07-08 02:09:40,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000007088_3629056.pth... +[2023-07-08 02:09:40,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000006520_3338240.pth +[2023-07-08 02:09:42,178][800568] Updated weights for policy 0, policy_version 7120 (0.0005) +[2023-07-08 02:09:45,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9524.9). Total num frames: 3678208. Throughput: 0: 9697.4. Samples: 3653900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:09:45,741][800281] Avg episode reward: [(0, '176.869')] +[2023-07-08 02:09:46,479][800568] Updated weights for policy 0, policy_version 7200 (0.0005) +[2023-07-08 02:09:50,662][800568] Updated weights for policy 0, policy_version 7280 (0.0005) +[2023-07-08 02:09:50,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9524.9). Total num frames: 3727360. Throughput: 0: 9703.0. Samples: 3711788. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 02:09:50,740][800281] Avg episode reward: [(0, '177.296')] +[2023-07-08 02:09:54,968][800568] Updated weights for policy 0, policy_version 7360 (0.0005) +[2023-07-08 02:09:55,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9497.2). Total num frames: 3772416. Throughput: 0: 9662.8. Samples: 3769036. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 02:09:55,741][800281] Avg episode reward: [(0, '177.874')] +[2023-07-08 02:09:55,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000007368_3772416.pth... +[2023-07-08 02:09:55,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000006800_3481600.pth +[2023-07-08 02:09:59,251][800568] Updated weights for policy 0, policy_version 7440 (0.0005) +[2023-07-08 02:10:00,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9511.1). Total num frames: 3821568. Throughput: 0: 9664.1. Samples: 3797696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:10:00,741][800281] Avg episode reward: [(0, '176.777')] +[2023-07-08 02:10:03,635][800568] Updated weights for policy 0, policy_version 7520 (0.0005) +[2023-07-08 02:10:05,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9511.1). Total num frames: 3866624. Throughput: 0: 9698.9. Samples: 3854344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:10:05,741][800281] Avg episode reward: [(0, '175.213')] +[2023-07-08 02:10:08,195][800568] Updated weights for policy 0, policy_version 7600 (0.0005) +[2023-07-08 02:10:10,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9497.2). Total num frames: 3911680. Throughput: 0: 9570.2. Samples: 3907928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:10:10,740][800281] Avg episode reward: [(0, '175.274')] +[2023-07-08 02:10:10,743][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000007640_3911680.pth... +[2023-07-08 02:10:10,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000007088_3629056.pth +[2023-07-08 02:10:12,774][800568] Updated weights for policy 0, policy_version 7680 (0.0005) +[2023-07-08 02:10:15,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9497.2). Total num frames: 3956736. Throughput: 0: 9497.9. Samples: 3935184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:10:15,741][800281] Avg episode reward: [(0, '177.103')] +[2023-07-08 02:10:17,156][800568] Updated weights for policy 0, policy_version 7760 (0.0005) +[2023-07-08 02:10:20,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9511.1). Total num frames: 4005888. Throughput: 0: 9438.0. Samples: 3992244. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 02:10:20,741][800281] Avg episode reward: [(0, '176.203')] +[2023-07-08 02:10:21,441][800568] Updated weights for policy 0, policy_version 7840 (0.0005) +[2023-07-08 02:10:25,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 4050944. Throughput: 0: 9375.1. Samples: 4046848. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 02:10:25,741][800281] Avg episode reward: [(0, '174.836')] +[2023-07-08 02:10:25,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000007912_4050944.pth... +[2023-07-08 02:10:25,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000007368_3772416.pth +[2023-07-08 02:10:26,035][800568] Updated weights for policy 0, policy_version 7920 (0.0005) +[2023-07-08 02:10:30,637][800568] Updated weights for policy 0, policy_version 8000 (0.0005) +[2023-07-08 02:10:30,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 4096000. Throughput: 0: 9323.5. Samples: 4073456. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 02:10:30,741][800281] Avg episode reward: [(0, '177.060')] +[2023-07-08 02:10:35,082][800568] Updated weights for policy 0, policy_version 8080 (0.0005) +[2023-07-08 02:10:35,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9483.3). Total num frames: 4141056. Throughput: 0: 9247.6. Samples: 4127928. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:10:35,741][800281] Avg episode reward: [(0, '176.045')] +[2023-07-08 02:10:39,564][800568] Updated weights for policy 0, policy_version 8160 (0.0005) +[2023-07-08 02:10:40,740][800281] Fps is (10 sec: 9011.1, 60 sec: 9284.3, 300 sec: 9483.3). Total num frames: 4186112. Throughput: 0: 9183.5. Samples: 4182296. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:10:40,741][800281] Avg episode reward: [(0, '176.898')] +[2023-07-08 02:10:40,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000008176_4186112.pth... +[2023-07-08 02:10:40,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000007640_3911680.pth +[2023-07-08 02:10:43,948][800568] Updated weights for policy 0, policy_version 8240 (0.0005) +[2023-07-08 02:10:45,740][800281] Fps is (10 sec: 9420.6, 60 sec: 9284.2, 300 sec: 9497.2). Total num frames: 4235264. Throughput: 0: 9177.7. Samples: 4210696. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 02:10:45,741][800281] Avg episode reward: [(0, '176.974')] +[2023-07-08 02:10:48,388][800568] Updated weights for policy 0, policy_version 8320 (0.0005) +[2023-07-08 02:10:50,740][800281] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9483.3). Total num frames: 4280320. Throughput: 0: 9150.0. Samples: 4266096. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:10:50,741][800281] Avg episode reward: [(0, '176.201')] +[2023-07-08 02:10:52,903][800568] Updated weights for policy 0, policy_version 8400 (0.0005) +[2023-07-08 02:10:55,740][800281] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9483.3). Total num frames: 4325376. Throughput: 0: 9185.6. Samples: 4321280. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 02:10:55,740][800281] Avg episode reward: [(0, '174.919')] +[2023-07-08 02:10:55,743][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000008448_4325376.pth... +[2023-07-08 02:10:55,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000007912_4050944.pth +[2023-07-08 02:10:57,365][800568] Updated weights for policy 0, policy_version 8480 (0.0006) +[2023-07-08 02:11:00,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9483.3). Total num frames: 4370432. Throughput: 0: 9180.9. Samples: 4348324. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 02:11:00,740][800281] Avg episode reward: [(0, '175.559')] +[2023-07-08 02:11:01,857][800568] Updated weights for policy 0, policy_version 8560 (0.0005) +[2023-07-08 02:11:05,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9469.4). Total num frames: 4415488. Throughput: 0: 9132.3. Samples: 4403196. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 02:11:05,740][800281] Avg episode reward: [(0, '174.692')] +[2023-07-08 02:11:06,240][800568] Updated weights for policy 0, policy_version 8640 (0.0005) +[2023-07-08 02:11:10,552][800568] Updated weights for policy 0, policy_version 8720 (0.0005) +[2023-07-08 02:11:10,740][800281] Fps is (10 sec: 9420.7, 60 sec: 9216.0, 300 sec: 9469.4). Total num frames: 4464640. Throughput: 0: 9189.7. Samples: 4460384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:11:10,741][800281] Avg episode reward: [(0, '177.009')] +[2023-07-08 02:11:10,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000008720_4464640.pth... +[2023-07-08 02:11:10,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000008176_4186112.pth +[2023-07-08 02:11:14,773][800568] Updated weights for policy 0, policy_version 8800 (0.0005) +[2023-07-08 02:11:15,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9284.3, 300 sec: 9469.4). Total num frames: 4513792. Throughput: 0: 9238.4. Samples: 4489184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:11:15,741][800281] Avg episode reward: [(0, '176.023')] +[2023-07-08 02:11:19,070][800568] Updated weights for policy 0, policy_version 8880 (0.0005) +[2023-07-08 02:11:20,740][800281] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9455.5). Total num frames: 4558848. Throughput: 0: 9303.1. Samples: 4546568. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 02:11:20,741][800281] Avg episode reward: [(0, '176.579')] +[2023-07-08 02:11:23,421][800568] Updated weights for policy 0, policy_version 8960 (0.0005) +[2023-07-08 02:11:25,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9469.4). Total num frames: 4608000. Throughput: 0: 9361.2. Samples: 4603548. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:11:25,740][800281] Avg episode reward: [(0, '176.483')] +[2023-07-08 02:11:25,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000009000_4608000.pth... +[2023-07-08 02:11:25,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000008448_4325376.pth +[2023-07-08 02:11:27,904][800568] Updated weights for policy 0, policy_version 9040 (0.0005) +[2023-07-08 02:11:30,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9455.5). Total num frames: 4653056. Throughput: 0: 9316.4. Samples: 4629932. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:11:30,741][800281] Avg episode reward: [(0, '177.510')] +[2023-07-08 02:11:32,165][800568] Updated weights for policy 0, policy_version 9120 (0.0005) +[2023-07-08 02:11:35,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9455.5). Total num frames: 4702208. Throughput: 0: 9358.1. Samples: 4687212. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 02:11:35,741][800281] Avg episode reward: [(0, '177.074')] +[2023-07-08 02:11:36,463][800568] Updated weights for policy 0, policy_version 9200 (0.0005) +[2023-07-08 02:11:40,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 4747264. Throughput: 0: 9397.3. Samples: 4744156. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:11:40,740][800281] Avg episode reward: [(0, '176.397')] +[2023-07-08 02:11:40,742][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000009272_4747264.pth... +[2023-07-08 02:11:40,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000008720_4464640.pth +[2023-07-08 02:11:40,845][800568] Updated weights for policy 0, policy_version 9280 (0.0005) +[2023-07-08 02:11:45,268][800568] Updated weights for policy 0, policy_version 9360 (0.0005) +[2023-07-08 02:11:45,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9352.6, 300 sec: 9441.6). Total num frames: 4796416. Throughput: 0: 9411.5. Samples: 4771840. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:11:45,740][800281] Avg episode reward: [(0, '178.030')] +[2023-07-08 02:11:49,648][800568] Updated weights for policy 0, policy_version 9440 (0.0005) +[2023-07-08 02:11:50,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9427.7). Total num frames: 4841472. Throughput: 0: 9432.4. Samples: 4827652. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:11:50,740][800281] Avg episode reward: [(0, '175.791')] +[2023-07-08 02:11:54,150][800568] Updated weights for policy 0, policy_version 9520 (0.0005) +[2023-07-08 02:11:55,740][800281] Fps is (10 sec: 9011.1, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 4886528. Throughput: 0: 9379.0. Samples: 4882440. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:11:55,741][800281] Avg episode reward: [(0, '176.204')] +[2023-07-08 02:11:55,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000009544_4886528.pth... +[2023-07-08 02:11:55,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000009000_4608000.pth +[2023-07-08 02:11:58,603][800568] Updated weights for policy 0, policy_version 9600 (0.0005) +[2023-07-08 02:12:00,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 4931584. Throughput: 0: 9371.0. Samples: 4910880. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:12:00,740][800281] Avg episode reward: [(0, '177.512')] +[2023-07-08 02:12:02,963][800568] Updated weights for policy 0, policy_version 9680 (0.0005) +[2023-07-08 02:12:05,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9427.7). Total num frames: 4980736. Throughput: 0: 9316.3. Samples: 4965804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:12:05,741][800281] Avg episode reward: [(0, '177.874')] +[2023-07-08 02:12:07,511][800568] Updated weights for policy 0, policy_version 9760 (0.0005) +[2023-07-08 02:12:10,740][800281] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 5025792. Throughput: 0: 9276.8. Samples: 5021004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:12:10,741][800281] Avg episode reward: [(0, '176.378')] +[2023-07-08 02:12:10,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000009816_5025792.pth... +[2023-07-08 02:12:10,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000009272_4747264.pth +[2023-07-08 02:12:11,885][800568] Updated weights for policy 0, policy_version 9840 (0.0005) +[2023-07-08 02:12:15,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9413.9). Total num frames: 5070848. Throughput: 0: 9320.0. Samples: 5049332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:12:15,741][800281] Avg episode reward: [(0, '177.827')] +[2023-07-08 02:12:16,444][800568] Updated weights for policy 0, policy_version 9920 (0.0005) +[2023-07-08 02:12:20,740][800281] Fps is (10 sec: 9011.3, 60 sec: 9284.3, 300 sec: 9400.0). Total num frames: 5115904. Throughput: 0: 9241.7. Samples: 5103088. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:12:20,741][800281] Avg episode reward: [(0, '179.008')] +[2023-07-08 02:12:21,007][800568] Updated weights for policy 0, policy_version 10000 (0.0005) +[2023-07-08 02:12:25,483][800568] Updated weights for policy 0, policy_version 10080 (0.0005) +[2023-07-08 02:12:25,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9386.1). Total num frames: 5160960. Throughput: 0: 9172.7. Samples: 5156928. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:12:25,741][800281] Avg episode reward: [(0, '178.946')] +[2023-07-08 02:12:25,743][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000010080_5160960.pth... +[2023-07-08 02:12:25,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000009544_4886528.pth +[2023-07-08 02:12:30,019][800568] Updated weights for policy 0, policy_version 10160 (0.0005) +[2023-07-08 02:12:30,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9386.1). Total num frames: 5206016. Throughput: 0: 9188.2. Samples: 5185308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:12:30,741][800281] Avg episode reward: [(0, '177.865')] +[2023-07-08 02:12:34,479][800568] Updated weights for policy 0, policy_version 10240 (0.0005) +[2023-07-08 02:12:35,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9386.1). Total num frames: 5251072. Throughput: 0: 9138.0. Samples: 5238864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:12:35,740][800281] Avg episode reward: [(0, '179.531')] +[2023-07-08 02:12:38,997][800568] Updated weights for policy 0, policy_version 10320 (0.0005) +[2023-07-08 02:12:40,740][800281] Fps is (10 sec: 9011.1, 60 sec: 9147.7, 300 sec: 9372.2). Total num frames: 5296128. Throughput: 0: 9133.6. Samples: 5293452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:12:40,741][800281] Avg episode reward: [(0, '179.001')] +[2023-07-08 02:12:40,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000010344_5296128.pth... +[2023-07-08 02:12:40,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000009816_5025792.pth +[2023-07-08 02:12:43,370][800568] Updated weights for policy 0, policy_version 10400 (0.0005) +[2023-07-08 02:12:45,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9386.1). Total num frames: 5345280. Throughput: 0: 9130.0. Samples: 5321732. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:12:45,741][800281] Avg episode reward: [(0, '177.779')] +[2023-07-08 02:12:47,484][800568] Updated weights for policy 0, policy_version 10480 (0.0005) +[2023-07-08 02:12:50,740][800281] Fps is (10 sec: 9830.5, 60 sec: 9216.0, 300 sec: 9400.0). Total num frames: 5394432. Throughput: 0: 9225.0. Samples: 5380928. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:12:50,740][800281] Avg episode reward: [(0, '178.827')] +[2023-07-08 02:12:51,908][800568] Updated weights for policy 0, policy_version 10560 (0.0005) +[2023-07-08 02:12:55,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9400.0). Total num frames: 5439488. Throughput: 0: 9210.7. Samples: 5435484. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 02:12:55,740][800281] Avg episode reward: [(0, '182.131')] +[2023-07-08 02:12:55,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000010624_5439488.pth... +[2023-07-08 02:12:55,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000010080_5160960.pth +[2023-07-08 02:12:56,351][800568] Updated weights for policy 0, policy_version 10640 (0.0005) +[2023-07-08 02:13:00,740][800281] Fps is (10 sec: 9011.1, 60 sec: 9216.0, 300 sec: 9372.2). Total num frames: 5484544. Throughput: 0: 9215.8. Samples: 5464044. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 02:13:00,741][800281] Avg episode reward: [(0, '179.989')] +[2023-07-08 02:13:00,847][800568] Updated weights for policy 0, policy_version 10720 (0.0005) +[2023-07-08 02:13:05,366][800568] Updated weights for policy 0, policy_version 10800 (0.0005) +[2023-07-08 02:13:05,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9372.2). Total num frames: 5529600. Throughput: 0: 9213.8. Samples: 5517708. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 02:13:05,741][800281] Avg episode reward: [(0, '180.591')] +[2023-07-08 02:13:09,966][800568] Updated weights for policy 0, policy_version 10880 (0.0006) +[2023-07-08 02:13:10,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9372.2). Total num frames: 5574656. Throughput: 0: 9207.1. Samples: 5571248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:13:10,741][800281] Avg episode reward: [(0, '178.417')] +[2023-07-08 02:13:10,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000010888_5574656.pth... +[2023-07-08 02:13:10,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000010344_5296128.pth +[2023-07-08 02:13:14,442][800568] Updated weights for policy 0, policy_version 10960 (0.0005) +[2023-07-08 02:13:15,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9358.3). Total num frames: 5619712. Throughput: 0: 9198.5. Samples: 5599240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:13:15,740][800281] Avg episode reward: [(0, '179.311')] +[2023-07-08 02:13:18,948][800568] Updated weights for policy 0, policy_version 11040 (0.0005) +[2023-07-08 02:13:20,740][800281] Fps is (10 sec: 9011.3, 60 sec: 9147.7, 300 sec: 9330.6). Total num frames: 5664768. Throughput: 0: 9213.1. Samples: 5653452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:13:20,740][800281] Avg episode reward: [(0, '181.553')] +[2023-07-08 02:13:23,391][800568] Updated weights for policy 0, policy_version 11120 (0.0005) +[2023-07-08 02:13:25,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9344.4). Total num frames: 5713920. Throughput: 0: 9231.1. Samples: 5708852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:13:25,740][800281] Avg episode reward: [(0, '180.691')] +[2023-07-08 02:13:25,743][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000011160_5713920.pth... +[2023-07-08 02:13:25,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000010624_5439488.pth +[2023-07-08 02:13:27,847][800568] Updated weights for policy 0, policy_version 11200 (0.0005) +[2023-07-08 02:13:30,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9330.5). Total num frames: 5758976. Throughput: 0: 9218.8. Samples: 5736576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:13:30,740][800281] Avg episode reward: [(0, '179.285')] +[2023-07-08 02:13:32,398][800568] Updated weights for policy 0, policy_version 11280 (0.0005) +[2023-07-08 02:13:35,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9330.6). Total num frames: 5804032. Throughput: 0: 9107.7. Samples: 5790776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:13:35,741][800281] Avg episode reward: [(0, '180.077')] +[2023-07-08 02:13:36,832][800568] Updated weights for policy 0, policy_version 11360 (0.0005) +[2023-07-08 02:13:40,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9316.7). Total num frames: 5849088. Throughput: 0: 9124.2. Samples: 5846072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:13:40,741][800281] Avg episode reward: [(0, '183.105')] +[2023-07-08 02:13:40,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000011424_5849088.pth... +[2023-07-08 02:13:40,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000010888_5574656.pth +[2023-07-08 02:13:41,249][800568] Updated weights for policy 0, policy_version 11440 (0.0005) +[2023-07-08 02:13:45,616][800568] Updated weights for policy 0, policy_version 11520 (0.0005) +[2023-07-08 02:13:45,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9330.5). Total num frames: 5898240. Throughput: 0: 9113.3. Samples: 5874140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:13:45,741][800281] Avg episode reward: [(0, '184.265')] +[2023-07-08 02:13:49,993][800568] Updated weights for policy 0, policy_version 11600 (0.0005) +[2023-07-08 02:13:50,740][800281] Fps is (10 sec: 9420.9, 60 sec: 9147.7, 300 sec: 9330.6). Total num frames: 5943296. Throughput: 0: 9184.6. Samples: 5931016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:13:50,741][800281] Avg episode reward: [(0, '181.150')] +[2023-07-08 02:13:54,503][800568] Updated weights for policy 0, policy_version 11680 (0.0005) +[2023-07-08 02:13:55,740][800281] Fps is (10 sec: 9011.1, 60 sec: 9147.7, 300 sec: 9316.7). Total num frames: 5988352. Throughput: 0: 9196.1. Samples: 5985072. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:13:55,741][800281] Avg episode reward: [(0, '182.214')] +[2023-07-08 02:13:55,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000011696_5988352.pth... +[2023-07-08 02:13:55,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000011160_5713920.pth +[2023-07-08 02:13:58,987][800568] Updated weights for policy 0, policy_version 11760 (0.0005) +[2023-07-08 02:14:00,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9302.8). Total num frames: 6033408. Throughput: 0: 9192.0. Samples: 6012880. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:14:00,740][800281] Avg episode reward: [(0, '185.150')] +[2023-07-08 02:14:03,667][800568] Updated weights for policy 0, policy_version 11840 (0.0005) +[2023-07-08 02:14:05,740][800281] Fps is (10 sec: 9011.3, 60 sec: 9147.7, 300 sec: 9288.9). Total num frames: 6078464. Throughput: 0: 9171.6. Samples: 6066176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:14:05,741][800281] Avg episode reward: [(0, '184.663')] +[2023-07-08 02:14:08,010][800568] Updated weights for policy 0, policy_version 11920 (0.0005) +[2023-07-08 02:14:10,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9302.8). Total num frames: 6127616. Throughput: 0: 9202.6. Samples: 6122968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:14:10,741][800281] Avg episode reward: [(0, '185.526')] +[2023-07-08 02:14:10,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000011968_6127616.pth... +[2023-07-08 02:14:10,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000011424_5849088.pth +[2023-07-08 02:14:12,407][800568] Updated weights for policy 0, policy_version 12000 (0.0005) +[2023-07-08 02:14:15,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9288.9). Total num frames: 6172672. Throughput: 0: 9196.5. Samples: 6150420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:14:15,740][800281] Avg episode reward: [(0, '187.542')] +[2023-07-08 02:14:16,658][800568] Updated weights for policy 0, policy_version 12080 (0.0005) +[2023-07-08 02:14:20,700][800568] Updated weights for policy 0, policy_version 12160 (0.0004) +[2023-07-08 02:14:20,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9302.8). Total num frames: 6225920. Throughput: 0: 9305.8. Samples: 6209536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:14:20,741][800281] Avg episode reward: [(0, '187.725')] +[2023-07-08 02:14:25,257][800568] Updated weights for policy 0, policy_version 12240 (0.0005) +[2023-07-08 02:14:25,740][800281] Fps is (10 sec: 9830.3, 60 sec: 9284.3, 300 sec: 9288.9). Total num frames: 6270976. Throughput: 0: 9326.1. Samples: 6265748. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:14:25,741][800281] Avg episode reward: [(0, '189.659')] +[2023-07-08 02:14:25,745][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000012248_6270976.pth... +[2023-07-08 02:14:25,748][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000011696_5988352.pth +[2023-07-08 02:14:29,598][800568] Updated weights for policy 0, policy_version 12320 (0.0005) +[2023-07-08 02:14:30,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9275.0). Total num frames: 6316032. Throughput: 0: 9303.7. Samples: 6292808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:14:30,740][800281] Avg episode reward: [(0, '188.783')] +[2023-07-08 02:14:33,986][800568] Updated weights for policy 0, policy_version 12400 (0.0005) +[2023-07-08 02:14:35,740][800281] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9275.0). Total num frames: 6365184. Throughput: 0: 9296.8. Samples: 6349372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:14:35,741][800281] Avg episode reward: [(0, '190.585')] +[2023-07-08 02:14:38,180][800568] Updated weights for policy 0, policy_version 12480 (0.0005) +[2023-07-08 02:14:40,740][800281] Fps is (10 sec: 9830.3, 60 sec: 9420.8, 300 sec: 9275.0). Total num frames: 6414336. Throughput: 0: 9420.1. Samples: 6408976. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 02:14:40,741][800281] Avg episode reward: [(0, '189.468')] +[2023-07-08 02:14:40,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000012528_6414336.pth... +[2023-07-08 02:14:40,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000011968_6127616.pth +[2023-07-08 02:14:42,301][800568] Updated weights for policy 0, policy_version 12560 (0.0005) +[2023-07-08 02:14:45,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9275.0). Total num frames: 6463488. Throughput: 0: 9457.6. Samples: 6438472. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 02:14:45,741][800281] Avg episode reward: [(0, '191.613')] +[2023-07-08 02:14:46,399][800568] Updated weights for policy 0, policy_version 12640 (0.0005) +[2023-07-08 02:14:50,737][800568] Updated weights for policy 0, policy_version 12720 (0.0005) +[2023-07-08 02:14:50,740][800281] Fps is (10 sec: 9830.5, 60 sec: 9489.1, 300 sec: 9288.9). Total num frames: 6512640. Throughput: 0: 9572.8. Samples: 6496952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:14:50,740][800281] Avg episode reward: [(0, '190.485')] +[2023-07-08 02:14:55,014][800568] Updated weights for policy 0, policy_version 12800 (0.0005) +[2023-07-08 02:14:55,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9275.0). Total num frames: 6557696. Throughput: 0: 9575.1. Samples: 6553848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:14:55,741][800281] Avg episode reward: [(0, '191.544')] +[2023-07-08 02:14:55,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000012808_6557696.pth... +[2023-07-08 02:14:55,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000012248_6270976.pth +[2023-07-08 02:14:59,448][800568] Updated weights for policy 0, policy_version 12880 (0.0005) +[2023-07-08 02:15:00,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9288.9). Total num frames: 6606848. Throughput: 0: 9594.9. Samples: 6582188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:15:00,740][800281] Avg episode reward: [(0, '190.130')] +[2023-07-08 02:15:03,842][800568] Updated weights for policy 0, policy_version 12960 (0.0005) +[2023-07-08 02:15:05,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9288.9). Total num frames: 6651904. Throughput: 0: 9514.8. Samples: 6637700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:15:05,741][800281] Avg episode reward: [(0, '190.928')] +[2023-07-08 02:15:08,027][800568] Updated weights for policy 0, policy_version 13040 (0.0005) +[2023-07-08 02:15:10,740][800281] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9302.8). Total num frames: 6701056. Throughput: 0: 9580.7. Samples: 6696880. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:15:10,741][800281] Avg episode reward: [(0, '190.120')] +[2023-07-08 02:15:10,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000013088_6701056.pth... +[2023-07-08 02:15:10,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000012528_6414336.pth +[2023-07-08 02:15:12,191][800568] Updated weights for policy 0, policy_version 13120 (0.0005) +[2023-07-08 02:15:15,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9302.8). Total num frames: 6750208. Throughput: 0: 9625.9. Samples: 6725972. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:15:15,741][800281] Avg episode reward: [(0, '191.486')] +[2023-07-08 02:15:16,323][800568] Updated weights for policy 0, policy_version 13200 (0.0005) +[2023-07-08 02:15:20,537][800568] Updated weights for policy 0, policy_version 13280 (0.0005) +[2023-07-08 02:15:20,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9316.7). Total num frames: 6799360. Throughput: 0: 9668.2. Samples: 6784440. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:15:20,741][800281] Avg episode reward: [(0, '192.031')] +[2023-07-08 02:15:24,735][800568] Updated weights for policy 0, policy_version 13360 (0.0005) +[2023-07-08 02:15:25,740][800281] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9330.5). Total num frames: 6848512. Throughput: 0: 9665.8. Samples: 6843936. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:15:25,741][800281] Avg episode reward: [(0, '191.244')] +[2023-07-08 02:15:25,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000013376_6848512.pth... +[2023-07-08 02:15:25,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000012808_6557696.pth +[2023-07-08 02:15:29,140][800568] Updated weights for policy 0, policy_version 13440 (0.0005) +[2023-07-08 02:15:30,740][800281] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9330.6). Total num frames: 6893568. Throughput: 0: 9615.2. Samples: 6871156. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:15:30,740][800281] Avg episode reward: [(0, '190.069')] +[2023-07-08 02:15:33,417][800568] Updated weights for policy 0, policy_version 13520 (0.0005) +[2023-07-08 02:15:35,740][800281] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9344.4). Total num frames: 6942720. Throughput: 0: 9593.9. Samples: 6928676. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:15:35,741][800281] Avg episode reward: [(0, '191.815')] +[2023-07-08 02:15:37,848][800568] Updated weights for policy 0, policy_version 13600 (0.0006) +[2023-07-08 02:15:40,740][800281] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9330.6). Total num frames: 6987776. Throughput: 0: 9560.2. Samples: 6984056. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 02:15:40,740][800281] Avg episode reward: [(0, '189.937')] +[2023-07-08 02:15:40,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000013648_6987776.pth... +[2023-07-08 02:15:40,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000013088_6701056.pth +[2023-07-08 02:15:42,035][800568] Updated weights for policy 0, policy_version 13680 (0.0005) +[2023-07-08 02:15:45,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9344.4). Total num frames: 7036928. Throughput: 0: 9602.8. Samples: 7014316. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 02:15:45,740][800281] Avg episode reward: [(0, '190.644')] +[2023-07-08 02:15:46,327][800568] Updated weights for policy 0, policy_version 13760 (0.0005) +[2023-07-08 02:15:50,532][800568] Updated weights for policy 0, policy_version 13840 (0.0005) +[2023-07-08 02:15:50,740][800281] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9358.3). Total num frames: 7086080. Throughput: 0: 9653.3. Samples: 7072100. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 02:15:50,741][800281] Avg episode reward: [(0, '189.106')] +[2023-07-08 02:15:55,040][800568] Updated weights for policy 0, policy_version 13920 (0.0006) +[2023-07-08 02:15:55,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9358.3). Total num frames: 7131136. Throughput: 0: 9560.7. Samples: 7127112. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 02:15:55,741][800281] Avg episode reward: [(0, '188.964')] +[2023-07-08 02:15:55,743][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000013928_7131136.pth... +[2023-07-08 02:15:55,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000013376_6848512.pth +[2023-07-08 02:15:59,276][800568] Updated weights for policy 0, policy_version 14000 (0.0005) +[2023-07-08 02:16:00,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9372.2). Total num frames: 7180288. Throughput: 0: 9550.3. Samples: 7155736. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:16:00,741][800281] Avg episode reward: [(0, '188.325')] +[2023-07-08 02:16:03,542][800568] Updated weights for policy 0, policy_version 14080 (0.0005) +[2023-07-08 02:16:05,740][800281] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9358.3). Total num frames: 7225344. Throughput: 0: 9535.7. Samples: 7213544. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:16:05,740][800281] Avg episode reward: [(0, '188.276')] +[2023-07-08 02:16:07,900][800568] Updated weights for policy 0, policy_version 14160 (0.0006) +[2023-07-08 02:16:10,740][800281] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9372.2). Total num frames: 7278592. Throughput: 0: 9528.8. Samples: 7272732. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 02:16:10,741][800281] Avg episode reward: [(0, '190.096')] +[2023-07-08 02:16:10,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000014216_7278592.pth... +[2023-07-08 02:16:10,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000013648_6987776.pth +[2023-07-08 02:16:11,791][800568] Updated weights for policy 0, policy_version 14240 (0.0005) +[2023-07-08 02:16:15,740][800281] Fps is (10 sec: 10240.0, 60 sec: 9625.6, 300 sec: 9386.1). Total num frames: 7327744. Throughput: 0: 9615.2. Samples: 7303840. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 02:16:15,741][800281] Avg episode reward: [(0, '188.999')] +[2023-07-08 02:16:15,883][800568] Updated weights for policy 0, policy_version 14320 (0.0005) +[2023-07-08 02:16:20,025][800568] Updated weights for policy 0, policy_version 14400 (0.0005) +[2023-07-08 02:16:20,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9386.1). Total num frames: 7376896. Throughput: 0: 9664.1. Samples: 7363560. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 02:16:20,741][800281] Avg episode reward: [(0, '188.424')] +[2023-07-08 02:16:24,067][800568] Updated weights for policy 0, policy_version 14480 (0.0005) +[2023-07-08 02:16:25,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 7426048. Throughput: 0: 9755.1. Samples: 7423036. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 02:16:25,740][800281] Avg episode reward: [(0, '188.872')] +[2023-07-08 02:16:25,760][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000014512_7430144.pth... +[2023-07-08 02:16:25,763][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000013928_7131136.pth +[2023-07-08 02:16:28,391][800568] Updated weights for policy 0, policy_version 14560 (0.0005) +[2023-07-08 02:16:30,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9400.0). Total num frames: 7475200. Throughput: 0: 9718.0. Samples: 7451628. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:16:30,740][800281] Avg episode reward: [(0, '187.159')] +[2023-07-08 02:16:32,752][800568] Updated weights for policy 0, policy_version 14640 (0.0005) +[2023-07-08 02:16:35,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9413.9). Total num frames: 7524352. Throughput: 0: 9709.7. Samples: 7509036. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:16:35,741][800281] Avg episode reward: [(0, '187.200')] +[2023-07-08 02:16:36,894][800568] Updated weights for policy 0, policy_version 14720 (0.0004) +[2023-07-08 02:16:40,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9400.0). Total num frames: 7569408. Throughput: 0: 9753.4. Samples: 7566016. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:16:40,741][800281] Avg episode reward: [(0, '187.231')] +[2023-07-08 02:16:40,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000014784_7569408.pth... +[2023-07-08 02:16:40,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000014216_7278592.pth +[2023-07-08 02:16:41,213][800568] Updated weights for policy 0, policy_version 14800 (0.0005) +[2023-07-08 02:16:45,338][800568] Updated weights for policy 0, policy_version 14880 (0.0006) +[2023-07-08 02:16:45,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9413.9). Total num frames: 7618560. Throughput: 0: 9793.6. Samples: 7596448. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:16:45,741][800281] Avg episode reward: [(0, '186.509')] +[2023-07-08 02:16:49,454][800568] Updated weights for policy 0, policy_version 14960 (0.0005) +[2023-07-08 02:16:50,740][800281] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9441.6). Total num frames: 7671808. Throughput: 0: 9821.0. Samples: 7655488. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:16:50,741][800281] Avg episode reward: [(0, '187.256')] +[2023-07-08 02:16:53,498][800568] Updated weights for policy 0, policy_version 15040 (0.0005) +[2023-07-08 02:16:55,740][800281] Fps is (10 sec: 10239.9, 60 sec: 9830.4, 300 sec: 9455.5). Total num frames: 7720960. Throughput: 0: 9864.9. Samples: 7716652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:16:55,741][800281] Avg episode reward: [(0, '188.166')] +[2023-07-08 02:16:55,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000015080_7720960.pth... +[2023-07-08 02:16:55,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000014512_7430144.pth +[2023-07-08 02:16:57,573][800568] Updated weights for policy 0, policy_version 15120 (0.0005) +[2023-07-08 02:17:00,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9455.5). Total num frames: 7770112. Throughput: 0: 9834.9. Samples: 7746412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:17:00,741][800281] Avg episode reward: [(0, '188.638')] +[2023-07-08 02:17:01,665][800568] Updated weights for policy 0, policy_version 15200 (0.0004) +[2023-07-08 02:17:05,740][800281] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9469.4). Total num frames: 7819264. Throughput: 0: 9828.8. Samples: 7805856. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:17:05,740][800281] Avg episode reward: [(0, '190.108')] +[2023-07-08 02:17:05,854][800568] Updated weights for policy 0, policy_version 15280 (0.0005) +[2023-07-08 02:17:09,963][800568] Updated weights for policy 0, policy_version 15360 (0.0005) +[2023-07-08 02:17:10,740][800281] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9483.3). Total num frames: 7868416. Throughput: 0: 9826.2. Samples: 7865216. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:17:10,741][800281] Avg episode reward: [(0, '189.622')] +[2023-07-08 02:17:10,748][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000015376_7872512.pth... +[2023-07-08 02:17:10,750][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000014784_7569408.pth +[2023-07-08 02:17:14,196][800568] Updated weights for policy 0, policy_version 15440 (0.0005) +[2023-07-08 02:17:15,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9497.2). Total num frames: 7917568. Throughput: 0: 9848.9. Samples: 7894828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:17:15,741][800281] Avg episode reward: [(0, '189.542')] +[2023-07-08 02:17:18,560][800568] Updated weights for policy 0, policy_version 15520 (0.0006) +[2023-07-08 02:17:20,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9511.0). Total num frames: 7966720. Throughput: 0: 9817.8. Samples: 7950836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:17:20,741][800281] Avg episode reward: [(0, '188.026')] +[2023-07-08 02:17:22,877][800568] Updated weights for policy 0, policy_version 15600 (0.0006) +[2023-07-08 02:17:25,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9524.9). Total num frames: 8015872. Throughput: 0: 9855.6. Samples: 8009520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:17:25,741][800281] Avg episode reward: [(0, '189.572')] +[2023-07-08 02:17:25,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000015656_8015872.pth... +[2023-07-08 02:17:25,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000015080_7720960.pth +[2023-07-08 02:17:26,982][800568] Updated weights for policy 0, policy_version 15680 (0.0005) +[2023-07-08 02:17:30,740][800281] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9538.8). Total num frames: 8065024. Throughput: 0: 9838.5. Samples: 8039180. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 02:17:30,741][800281] Avg episode reward: [(0, '189.158')] +[2023-07-08 02:17:31,073][800568] Updated weights for policy 0, policy_version 15760 (0.0005) +[2023-07-08 02:17:35,165][800568] Updated weights for policy 0, policy_version 15840 (0.0005) +[2023-07-08 02:17:35,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9552.7). Total num frames: 8114176. Throughput: 0: 9857.2. Samples: 8099064. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 02:17:35,741][800281] Avg episode reward: [(0, '186.942')] +[2023-07-08 02:17:39,453][800568] Updated weights for policy 0, policy_version 15920 (0.0006) +[2023-07-08 02:17:40,740][800281] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9552.7). Total num frames: 8163328. Throughput: 0: 9778.7. Samples: 8156692. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 02:17:40,741][800281] Avg episode reward: [(0, '188.391')] +[2023-07-08 02:17:40,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000015944_8163328.pth... +[2023-07-08 02:17:40,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000015376_7872512.pth +[2023-07-08 02:17:43,865][800568] Updated weights for policy 0, policy_version 16000 (0.0006) +[2023-07-08 02:17:45,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9538.8). Total num frames: 8208384. Throughput: 0: 9731.3. Samples: 8184320. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 02:17:45,740][800281] Avg episode reward: [(0, '188.586')] +[2023-07-08 02:17:48,055][800568] Updated weights for policy 0, policy_version 16080 (0.0005) +[2023-07-08 02:17:50,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9552.7). Total num frames: 8257536. Throughput: 0: 9705.8. Samples: 8242616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:17:50,741][800281] Avg episode reward: [(0, '188.517')] +[2023-07-08 02:17:52,411][800568] Updated weights for policy 0, policy_version 16160 (0.0006) +[2023-07-08 02:17:55,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9552.7). Total num frames: 8302592. Throughput: 0: 9628.5. Samples: 8298496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:17:55,740][800281] Avg episode reward: [(0, '188.236')] +[2023-07-08 02:17:55,742][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000016216_8302592.pth... +[2023-07-08 02:17:55,745][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000015656_8015872.pth +[2023-07-08 02:17:56,856][800568] Updated weights for policy 0, policy_version 16240 (0.0006) +[2023-07-08 02:18:00,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9566.6). Total num frames: 8351744. Throughput: 0: 9590.4. Samples: 8326396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:18:00,741][800281] Avg episode reward: [(0, '187.599')] +[2023-07-08 02:18:01,120][800568] Updated weights for policy 0, policy_version 16320 (0.0006) +[2023-07-08 02:18:05,294][800568] Updated weights for policy 0, policy_version 16400 (0.0006) +[2023-07-08 02:18:05,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9580.5). Total num frames: 8400896. Throughput: 0: 9638.9. Samples: 8384584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:18:05,741][800281] Avg episode reward: [(0, '186.985')] +[2023-07-08 02:18:09,385][800568] Updated weights for policy 0, policy_version 16480 (0.0005) +[2023-07-08 02:18:10,740][800281] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9594.4). Total num frames: 8450048. Throughput: 0: 9671.1. Samples: 8444720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:18:10,741][800281] Avg episode reward: [(0, '187.709')] +[2023-07-08 02:18:10,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000016504_8450048.pth... +[2023-07-08 02:18:10,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000015944_8163328.pth +[2023-07-08 02:18:13,767][800568] Updated weights for policy 0, policy_version 16560 (0.0005) +[2023-07-08 02:18:15,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9594.4). Total num frames: 8495104. Throughput: 0: 9608.8. Samples: 8471576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:18:15,741][800281] Avg episode reward: [(0, '186.506')] +[2023-07-08 02:18:18,044][800568] Updated weights for policy 0, policy_version 16640 (0.0005) +[2023-07-08 02:18:20,740][800281] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9594.4). Total num frames: 8544256. Throughput: 0: 9548.0. Samples: 8528724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:18:20,740][800281] Avg episode reward: [(0, '187.568')] +[2023-07-08 02:18:22,483][800568] Updated weights for policy 0, policy_version 16720 (0.0006) +[2023-07-08 02:18:25,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9594.4). Total num frames: 8589312. Throughput: 0: 9520.2. Samples: 8585100. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:18:25,740][800281] Avg episode reward: [(0, '187.002')] +[2023-07-08 02:18:25,743][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000016776_8589312.pth... +[2023-07-08 02:18:25,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000016216_8302592.pth +[2023-07-08 02:18:26,836][800568] Updated weights for policy 0, policy_version 16800 (0.0006) +[2023-07-08 02:18:30,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9608.2). Total num frames: 8638464. Throughput: 0: 9546.1. Samples: 8613896. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:18:30,741][800281] Avg episode reward: [(0, '186.221')] +[2023-07-08 02:18:30,883][800568] Updated weights for policy 0, policy_version 16880 (0.0005) +[2023-07-08 02:18:35,130][800568] Updated weights for policy 0, policy_version 16960 (0.0005) +[2023-07-08 02:18:35,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9622.1). Total num frames: 8687616. Throughput: 0: 9582.3. Samples: 8673820. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:18:35,740][800281] Avg episode reward: [(0, '186.823')] +[2023-07-08 02:18:39,288][800568] Updated weights for policy 0, policy_version 17040 (0.0005) +[2023-07-08 02:18:40,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9622.1). Total num frames: 8736768. Throughput: 0: 9642.4. Samples: 8732404. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:18:40,741][800281] Avg episode reward: [(0, '187.348')] +[2023-07-08 02:18:40,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000017064_8736768.pth... +[2023-07-08 02:18:40,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000016504_8450048.pth +[2023-07-08 02:18:43,379][800568] Updated weights for policy 0, policy_version 17120 (0.0005) +[2023-07-08 02:18:45,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9636.0). Total num frames: 8785920. Throughput: 0: 9684.5. Samples: 8762200. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:18:45,740][800281] Avg episode reward: [(0, '186.309')] +[2023-07-08 02:18:47,366][800568] Updated weights for policy 0, policy_version 17200 (0.0005) +[2023-07-08 02:18:50,740][800281] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9663.8). Total num frames: 8839168. Throughput: 0: 9749.6. Samples: 8823316. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 02:18:50,740][800281] Avg episode reward: [(0, '186.211')] +[2023-07-08 02:18:51,424][800568] Updated weights for policy 0, policy_version 17280 (0.0005) +[2023-07-08 02:18:55,471][800568] Updated weights for policy 0, policy_version 17360 (0.0005) +[2023-07-08 02:18:55,740][800281] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9677.7). Total num frames: 8888320. Throughput: 0: 9768.2. Samples: 8884288. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 02:18:55,740][800281] Avg episode reward: [(0, '187.470')] +[2023-07-08 02:18:55,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000017360_8888320.pth... +[2023-07-08 02:18:55,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000016776_8589312.pth +[2023-07-08 02:18:59,559][800568] Updated weights for policy 0, policy_version 17440 (0.0005) +[2023-07-08 02:19:00,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9691.6). Total num frames: 8937472. Throughput: 0: 9833.8. Samples: 8914096. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 02:19:00,740][800281] Avg episode reward: [(0, '186.390')] +[2023-07-08 02:19:03,647][800568] Updated weights for policy 0, policy_version 17520 (0.0005) +[2023-07-08 02:19:05,740][800281] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9705.4). Total num frames: 8990720. Throughput: 0: 9903.9. Samples: 8974400. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:19:05,740][800281] Avg episode reward: [(0, '188.403')] +[2023-07-08 02:19:07,790][800568] Updated weights for policy 0, policy_version 17600 (0.0005) +[2023-07-08 02:19:10,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9705.4). Total num frames: 9035776. Throughput: 0: 9954.2. Samples: 9033040. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 02:19:10,740][800281] Avg episode reward: [(0, '188.584')] +[2023-07-08 02:19:10,787][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000017656_9039872.pth... +[2023-07-08 02:19:10,790][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000017064_8736768.pth +[2023-07-08 02:19:12,129][800568] Updated weights for policy 0, policy_version 17680 (0.0005) +[2023-07-08 02:19:15,740][800281] Fps is (10 sec: 9011.2, 60 sec: 9762.1, 300 sec: 9677.7). Total num frames: 9080832. Throughput: 0: 9921.4. Samples: 9060360. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 02:19:15,740][800281] Avg episode reward: [(0, '189.415')] +[2023-07-08 02:19:16,620][800568] Updated weights for policy 0, policy_version 17760 (0.0005) +[2023-07-08 02:19:20,740][800281] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9691.6). Total num frames: 9129984. Throughput: 0: 9841.1. Samples: 9116668. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 02:19:20,741][800281] Avg episode reward: [(0, '186.717')] +[2023-07-08 02:19:20,851][800568] Updated weights for policy 0, policy_version 17840 (0.0005) +[2023-07-08 02:19:25,257][800568] Updated weights for policy 0, policy_version 17920 (0.0005) +[2023-07-08 02:19:25,740][800281] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9705.4). Total num frames: 9179136. Throughput: 0: 9797.7. Samples: 9173300. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 02:19:25,741][800281] Avg episode reward: [(0, '188.796')] +[2023-07-08 02:19:25,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000017928_9179136.pth... +[2023-07-08 02:19:25,748][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000017360_8888320.pth +[2023-07-08 02:19:29,570][800568] Updated weights for policy 0, policy_version 18000 (0.0005) +[2023-07-08 02:19:30,740][800281] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9691.6). Total num frames: 9224192. Throughput: 0: 9778.8. Samples: 9202248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:19:30,741][800281] Avg episode reward: [(0, '187.879')] +[2023-07-08 02:19:34,064][800568] Updated weights for policy 0, policy_version 18080 (0.0005) +[2023-07-08 02:19:35,740][800281] Fps is (10 sec: 9011.3, 60 sec: 9693.9, 300 sec: 9677.7). Total num frames: 9269248. Throughput: 0: 9638.0. Samples: 9257024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:19:35,740][800281] Avg episode reward: [(0, '189.367')] +[2023-07-08 02:19:38,522][800568] Updated weights for policy 0, policy_version 18160 (0.0005) +[2023-07-08 02:19:40,740][800281] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9677.7). Total num frames: 9318400. Throughput: 0: 9523.6. Samples: 9312852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:19:40,740][800281] Avg episode reward: [(0, '188.341')] +[2023-07-08 02:19:40,743][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000018200_9318400.pth... +[2023-07-08 02:19:40,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000017656_9039872.pth +[2023-07-08 02:19:42,854][800568] Updated weights for policy 0, policy_version 18240 (0.0005) +[2023-07-08 02:19:45,740][800281] Fps is (10 sec: 9420.7, 60 sec: 9625.6, 300 sec: 9663.8). Total num frames: 9363456. Throughput: 0: 9486.2. Samples: 9340976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:19:45,741][800281] Avg episode reward: [(0, '192.356')] +[2023-07-08 02:19:47,204][800568] Updated weights for policy 0, policy_version 18320 (0.0005) +[2023-07-08 02:19:50,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9677.7). Total num frames: 9412608. Throughput: 0: 9434.2. Samples: 9398940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:19:50,741][800281] Avg episode reward: [(0, '189.321')] +[2023-07-08 02:19:51,297][800568] Updated weights for policy 0, policy_version 18400 (0.0005) +[2023-07-08 02:19:55,551][800568] Updated weights for policy 0, policy_version 18480 (0.0005) +[2023-07-08 02:19:55,740][800281] Fps is (10 sec: 9830.3, 60 sec: 9557.3, 300 sec: 9677.7). Total num frames: 9461760. Throughput: 0: 9436.1. Samples: 9457664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:19:55,741][800281] Avg episode reward: [(0, '187.876')] +[2023-07-08 02:19:55,743][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000018480_9461760.pth... +[2023-07-08 02:19:55,746][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000017928_9179136.pth +[2023-07-08 02:19:59,780][800568] Updated weights for policy 0, policy_version 18560 (0.0005) +[2023-07-08 02:20:00,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9691.6). Total num frames: 9510912. Throughput: 0: 9473.0. Samples: 9486644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:20:00,741][800281] Avg episode reward: [(0, '187.081')] +[2023-07-08 02:20:04,087][800568] Updated weights for policy 0, policy_version 18640 (0.0005) +[2023-07-08 02:20:05,740][800281] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9677.7). Total num frames: 9555968. Throughput: 0: 9489.4. Samples: 9543692. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:20:05,740][800281] Avg episode reward: [(0, '187.030')] +[2023-07-08 02:20:08,322][800568] Updated weights for policy 0, policy_version 18720 (0.0005) +[2023-07-08 02:20:10,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9677.7). Total num frames: 9605120. Throughput: 0: 9506.8. Samples: 9601104. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:20:10,740][800281] Avg episode reward: [(0, '187.265')] +[2023-07-08 02:20:10,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000018760_9605120.pth... +[2023-07-08 02:20:10,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000018200_9318400.pth +[2023-07-08 02:20:12,665][800568] Updated weights for policy 0, policy_version 18800 (0.0005) +[2023-07-08 02:20:15,740][800281] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9677.7). Total num frames: 9654272. Throughput: 0: 9498.9. Samples: 9629696. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:20:15,741][800281] Avg episode reward: [(0, '187.666')] +[2023-07-08 02:20:16,985][800568] Updated weights for policy 0, policy_version 18880 (0.0005) +[2023-07-08 02:20:20,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9663.8). Total num frames: 9699328. Throughput: 0: 9552.8. Samples: 9686900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:20:20,741][800281] Avg episode reward: [(0, '186.948')] +[2023-07-08 02:20:21,322][800568] Updated weights for policy 0, policy_version 18960 (0.0005) +[2023-07-08 02:20:25,540][800568] Updated weights for policy 0, policy_version 19040 (0.0005) +[2023-07-08 02:20:25,740][800281] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9677.7). Total num frames: 9748480. Throughput: 0: 9589.4. Samples: 9744376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:20:25,741][800281] Avg episode reward: [(0, '187.930')] +[2023-07-08 02:20:25,744][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000019040_9748480.pth... +[2023-07-08 02:20:25,747][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000018480_9461760.pth +[2023-07-08 02:20:29,872][800568] Updated weights for policy 0, policy_version 19120 (0.0005) +[2023-07-08 02:20:30,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9663.8). Total num frames: 9793536. Throughput: 0: 9601.8. Samples: 9773056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 02:20:30,740][800281] Avg episode reward: [(0, '188.392')] +[2023-07-08 02:20:34,164][800568] Updated weights for policy 0, policy_version 19200 (0.0005) +[2023-07-08 02:20:35,740][800281] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9677.7). Total num frames: 9842688. Throughput: 0: 9573.8. Samples: 9829760. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:20:35,740][800281] Avg episode reward: [(0, '187.381')] +[2023-07-08 02:20:38,498][800568] Updated weights for policy 0, policy_version 19280 (0.0005) +[2023-07-08 02:20:40,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9663.8). Total num frames: 9887744. Throughput: 0: 9499.1. Samples: 9885120. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 02:20:40,740][800281] Avg episode reward: [(0, '192.691')] +[2023-07-08 02:20:40,786][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000019320_9891840.pth... +[2023-07-08 02:20:40,789][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000018760_9605120.pth +[2023-07-08 02:20:43,060][800568] Updated weights for policy 0, policy_version 19360 (0.0005) +[2023-07-08 02:20:45,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9663.8). Total num frames: 9936896. Throughput: 0: 9460.9. Samples: 9912384. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:20:45,740][800281] Avg episode reward: [(0, '193.466')] +[2023-07-08 02:20:47,491][800568] Updated weights for policy 0, policy_version 19440 (0.0005) +[2023-07-08 02:20:50,740][800281] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9663.8). Total num frames: 9981952. Throughput: 0: 9449.4. Samples: 9968916. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 02:20:50,741][800281] Avg episode reward: [(0, '187.705')] +[2023-07-08 02:20:51,690][800568] Updated weights for policy 0, policy_version 19520 (0.0005) +[2023-07-08 02:20:52,946][800524] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 +[2023-07-08 02:20:52,947][800605] Stopping RolloutWorker_w5... +[2023-07-08 02:20:52,947][800637] Stopping RolloutWorker_w6... +[2023-07-08 02:20:52,947][800570] Stopping RolloutWorker_w2... +[2023-07-08 02:20:52,947][800571] Stopping RolloutWorker_w1... +[2023-07-08 02:20:52,947][800669] Stopping RolloutWorker_w7... +[2023-07-08 02:20:52,947][800572] Stopping RolloutWorker_w3... +[2023-07-08 02:20:52,947][800605] Loop rollout_proc5_evt_loop terminating... +[2023-07-08 02:20:52,947][800637] Loop rollout_proc6_evt_loop terminating... +[2023-07-08 02:20:52,947][800573] Stopping RolloutWorker_w4... +[2023-07-08 02:20:52,947][800571] Loop rollout_proc1_evt_loop terminating... +[2023-07-08 02:20:52,947][800570] Loop rollout_proc2_evt_loop terminating... +[2023-07-08 02:20:52,947][800524] Stopping Batcher_0... +[2023-07-08 02:20:52,947][800669] Loop rollout_proc7_evt_loop terminating... +[2023-07-08 02:20:52,947][800569] Stopping RolloutWorker_w0... +[2023-07-08 02:20:52,947][800572] Loop rollout_proc3_evt_loop terminating... +[2023-07-08 02:20:52,947][800573] Loop rollout_proc4_evt_loop terminating... +[2023-07-08 02:20:52,947][800281] Component RolloutWorker_w5 stopped! +[2023-07-08 02:20:52,947][800569] Loop rollout_proc0_evt_loop terminating... +[2023-07-08 02:20:52,948][800524] Loop batcher_evt_loop terminating... +[2023-07-08 02:20:52,948][800281] Component RolloutWorker_w6 stopped! +[2023-07-08 02:20:52,948][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000019544_10006528.pth... +[2023-07-08 02:20:52,948][800281] Component RolloutWorker_w2 stopped! +[2023-07-08 02:20:52,948][800281] Component RolloutWorker_w1 stopped! +[2023-07-08 02:20:52,949][800281] Component RolloutWorker_w3 stopped! +[2023-07-08 02:20:52,949][800281] Component RolloutWorker_w7 stopped! +[2023-07-08 02:20:52,949][800281] Component Batcher_0 stopped! +[2023-07-08 02:20:52,949][800281] Component RolloutWorker_w4 stopped! +[2023-07-08 02:20:52,950][800281] Component RolloutWorker_w0 stopped! +[2023-07-08 02:20:52,951][800524] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000019040_9748480.pth +[2023-07-08 02:20:52,952][800524] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/faucet-close-v2/checkpoint_p0/checkpoint_000019544_10006528.pth... +[2023-07-08 02:20:52,955][800524] Stopping LearnerWorker_p0... +[2023-07-08 02:20:52,956][800524] Loop learner_proc0_evt_loop terminating... +[2023-07-08 02:20:52,956][800281] Component LearnerWorker_p0 stopped! +[2023-07-08 02:20:52,980][800568] Weights refcount: 2 0 +[2023-07-08 02:20:52,981][800568] Stopping InferenceWorker_p0-w0... +[2023-07-08 02:20:52,981][800568] Loop inference_proc0-0_evt_loop terminating... +[2023-07-08 02:20:52,981][800281] Component InferenceWorker_p0-w0 stopped! +[2023-07-08 02:20:52,982][800281] Waiting for process learner_proc0 to stop... +[2023-07-08 02:20:53,549][800281] Waiting for process inference_proc0-0 to join... +[2023-07-08 02:20:53,555][800281] Waiting for process rollout_proc0 to join... +[2023-07-08 02:20:53,556][800281] Waiting for process rollout_proc1 to join... +[2023-07-08 02:20:53,556][800281] Waiting for process rollout_proc2 to join... +[2023-07-08 02:20:53,556][800281] Waiting for process rollout_proc3 to join... +[2023-07-08 02:20:53,556][800281] Waiting for process rollout_proc4 to join... +[2023-07-08 02:20:53,557][800281] Waiting for process rollout_proc5 to join... +[2023-07-08 02:20:53,557][800281] Waiting for process rollout_proc6 to join... +[2023-07-08 02:20:53,557][800281] Waiting for process rollout_proc7 to join... +[2023-07-08 02:20:53,557][800281] Batcher 0 profile tree view: +batching: 1.8551, releasing_batches: 1.6148 +[2023-07-08 02:20:53,558][800281] InferenceWorker_p0-w0 profile tree view: +wait_policy: 0.0001 + wait_policy_total: 401.0439 +update_model: 12.7489 + weight_update: 0.0005 +one_step: 0.0016 + handle_policy_step: 574.0846 + deserialize: 24.5699, stack: 6.0550, obs_to_device_normalize: 104.1797, forward: 284.1576, send_messages: 39.8615 + prepare_outputs: 65.7276 + to_cpu: 9.9614 +[2023-07-08 02:20:53,558][800281] Learner 0 profile tree view: +misc: 0.0102, prepare_batch: 10.0643 +train: 104.0486 + epoch_init: 0.0378, minibatch_init: 1.4761, losses_postprocess: 1.3899, kl_divergence: 0.4814, after_optimizer: 0.6851 + calculate_losses: 44.3108 + losses_init: 0.0399, forward_head: 17.3835, bptt_initial: 0.1521, bptt: 0.1359, tail: 12.4791, advantages_returns: 0.9536, losses: 11.6161 + update: 53.9244 + clip: 6.3724 +[2023-07-08 02:20:53,558][800281] RolloutWorker_w0 profile tree view: +wait_for_trajectories: 0.3243, enqueue_policy_requests: 12.7100, env_step: 721.1357, overhead: 20.0911, complete_rollouts: 0.3206 +save_policy_outputs: 38.7571 + split_output_tensors: 13.2802 +[2023-07-08 02:20:53,558][800281] RolloutWorker_w7 profile tree view: +wait_for_trajectories: 0.2931, enqueue_policy_requests: 12.8124, env_step: 722.0888, overhead: 20.2066, complete_rollouts: 0.3302 +save_policy_outputs: 38.6444 + split_output_tensors: 13.2779 +[2023-07-08 02:20:53,558][800281] Loop Runner_EvtLoop terminating... +[2023-07-08 02:20:53,558][800281] Runner profile tree view: +main_loop: 1060.1568 +[2023-07-08 02:20:53,559][800281] Collected {0: 10006528}, FPS: 9438.7