diff --git "a/sf_log.txt" "b/sf_log.txt" --- "a/sf_log.txt" +++ "b/sf_log.txt" @@ -1,33 +1,39 @@ -[2023-07-07 22:25:51,954][754029] Saving configuration to /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/config.json... -[2023-07-07 22:25:51,972][754029] Rollout worker 0 uses device cpu -[2023-07-07 22:25:51,972][754029] Rollout worker 1 uses device cpu -[2023-07-07 22:25:51,972][754029] Rollout worker 2 uses device cpu -[2023-07-07 22:25:51,972][754029] Rollout worker 3 uses device cpu -[2023-07-07 22:25:51,973][754029] Rollout worker 4 uses device cpu -[2023-07-07 22:25:51,973][754029] Rollout worker 5 uses device cpu -[2023-07-07 22:25:51,973][754029] Rollout worker 6 uses device cpu -[2023-07-07 22:25:51,973][754029] Rollout worker 7 uses device cpu -[2023-07-07 22:25:51,973][754029] In synchronous mode, we only accumulate one batch. Setting num_batches_to_accumulate to 1 -[2023-07-07 22:25:51,984][754029] InferenceWorker_p0-w0: min num requests: 2 -[2023-07-07 22:25:52,002][754029] Starting all processes... -[2023-07-07 22:25:52,003][754029] Starting process learner_proc0 -[2023-07-07 22:25:52,052][754029] Starting all processes... -[2023-07-07 22:25:52,095][754029] Starting process inference_proc0-0 -[2023-07-07 22:25:52,106][754029] Starting process rollout_proc0 -[2023-07-07 22:25:52,106][754029] Starting process rollout_proc1 -[2023-07-07 22:25:52,107][754029] Starting process rollout_proc2 -[2023-07-07 22:25:52,107][754029] Starting process rollout_proc3 -[2023-07-07 22:25:52,107][754029] Starting process rollout_proc4 -[2023-07-07 22:25:52,107][754029] Starting process rollout_proc5 -[2023-07-07 22:25:52,107][754029] Starting process rollout_proc6 -[2023-07-07 22:25:52,108][754029] Starting process rollout_proc7 -[2023-07-07 22:25:53,921][754270] Starting seed is not provided -[2023-07-07 22:25:53,921][754270] Initializing actor-critic model on device cpu -[2023-07-07 22:25:53,921][754270] RunningMeanStd input shape: (39,) -[2023-07-07 22:25:53,922][754270] RunningMeanStd input shape: (1,) -[2023-07-07 22:25:53,950][754317] Worker 2 uses CPU cores [8, 9, 10, 11] -[2023-07-07 22:25:53,979][754270] Created Actor Critic model with architecture: -[2023-07-07 22:25:53,979][754270] ActorCriticSharedWeights( +[2023-07-08 15:05:49,418][994321] Saving configuration to /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/config.json... +[2023-07-08 15:05:49,436][994321] Rollout worker 0 uses device cpu +[2023-07-08 15:05:49,437][994321] Rollout worker 1 uses device cpu +[2023-07-08 15:05:49,437][994321] Rollout worker 2 uses device cpu +[2023-07-08 15:05:49,437][994321] Rollout worker 3 uses device cpu +[2023-07-08 15:05:49,437][994321] Rollout worker 4 uses device cpu +[2023-07-08 15:05:49,437][994321] Rollout worker 5 uses device cpu +[2023-07-08 15:05:49,437][994321] Rollout worker 6 uses device cpu +[2023-07-08 15:05:49,437][994321] Rollout worker 7 uses device cpu +[2023-07-08 15:05:49,437][994321] In synchronous mode, we only accumulate one batch. Setting num_batches_to_accumulate to 1 +[2023-07-08 15:05:49,450][994321] InferenceWorker_p0-w0: min num requests: 2 +[2023-07-08 15:05:49,470][994321] Starting all processes... +[2023-07-08 15:05:49,470][994321] Starting process learner_proc0 +[2023-07-08 15:05:49,519][994321] Starting all processes... +[2023-07-08 15:05:49,556][994321] Starting process inference_proc0-0 +[2023-07-08 15:05:49,556][994321] Starting process rollout_proc0 +[2023-07-08 15:05:49,556][994321] Starting process rollout_proc1 +[2023-07-08 15:05:49,556][994321] Starting process rollout_proc2 +[2023-07-08 15:05:49,556][994321] Starting process rollout_proc3 +[2023-07-08 15:05:49,556][994321] Starting process rollout_proc4 +[2023-07-08 15:05:49,556][994321] Starting process rollout_proc5 +[2023-07-08 15:05:49,556][994321] Starting process rollout_proc6 +[2023-07-08 15:05:49,556][994321] Starting process rollout_proc7 +[2023-07-08 15:05:51,561][994610] Worker 3 uses CPU cores [12, 13, 14, 15] +[2023-07-08 15:05:51,668][994662] Worker 4 uses CPU cores [16, 17, 18, 19] +[2023-07-08 15:05:51,762][994738] Worker 7 uses CPU cores [28, 29, 30, 31] +[2023-07-08 15:05:51,882][994607] Worker 1 uses CPU cores [4, 5, 6, 7] +[2023-07-08 15:05:51,993][994724] Worker 6 uses CPU cores [24, 25, 26, 27] +[2023-07-08 15:05:52,113][994608] Worker 0 uses CPU cores [0, 1, 2, 3] +[2023-07-08 15:05:52,219][994611] Worker 5 uses CPU cores [20, 21, 22, 23] +[2023-07-08 15:05:52,258][994562] Starting seed is not provided +[2023-07-08 15:05:52,258][994562] Initializing actor-critic model on device cpu +[2023-07-08 15:05:52,258][994562] RunningMeanStd input shape: (39,) +[2023-07-08 15:05:52,259][994562] RunningMeanStd input shape: (1,) +[2023-07-08 15:05:52,314][994562] Created Actor Critic model with architecture: +[2023-07-08 15:05:52,314][994562] ActorCriticSharedWeights( (obs_normalizer): ObservationNormalizer( (running_mean_std): RunningMeanStdDictInPlace( (running_mean_std): ModuleDict( @@ -58,1012 +64,1116 @@ (distribution_linear): Linear(in_features=64, out_features=4, bias=True) ) ) -[2023-07-07 22:25:54,096][754316] Worker 1 uses CPU cores [4, 5, 6, 7] -[2023-07-07 22:25:54,205][754446] Worker 6 uses CPU cores [24, 25, 26, 27] -[2023-07-07 22:25:54,294][754270] Using optimizer -[2023-07-07 22:25:54,295][754270] No checkpoints found -[2023-07-07 22:25:54,295][754270] Did not load from checkpoint, starting from scratch! -[2023-07-07 22:25:54,295][754270] Initialized policy 0 weights for model version 0 -[2023-07-07 22:25:54,296][754270] LearnerWorker_p0 finished initialization! -[2023-07-07 22:25:54,298][754314] RunningMeanStd input shape: (39,) -[2023-07-07 22:25:54,298][754314] RunningMeanStd input shape: (1,) -[2023-07-07 22:25:54,352][754318] Worker 3 uses CPU cores [12, 13, 14, 15] -[2023-07-07 22:25:54,355][754029] Inference worker 0-0 is ready! -[2023-07-07 22:25:54,356][754029] All inference workers are ready! Signal rollout workers to start! -[2023-07-07 22:25:54,416][754319] Worker 4 uses CPU cores [16, 17, 18, 19] -[2023-07-07 22:25:54,461][754351] Worker 5 uses CPU cores [20, 21, 22, 23] -[2023-07-07 22:25:54,614][754414] Worker 7 uses CPU cores [28, 29, 30, 31] -[2023-07-07 22:25:54,718][754315] Worker 0 uses CPU cores [0, 1, 2, 3] -[2023-07-07 22:25:58,544][754317] Decorrelating experience for 0 frames... -[2023-07-07 22:25:58,560][754317] Decorrelating experience for 64 frames... -[2023-07-07 22:25:58,583][754316] Decorrelating experience for 0 frames... -[2023-07-07 22:25:58,595][754446] Decorrelating experience for 0 frames... -[2023-07-07 22:25:58,598][754316] Decorrelating experience for 64 frames... -[2023-07-07 22:25:58,602][754317] Decorrelating experience for 128 frames... -[2023-07-07 22:25:58,611][754446] Decorrelating experience for 64 frames... -[2023-07-07 22:25:58,624][754318] Decorrelating experience for 0 frames... -[2023-07-07 22:25:58,639][754318] Decorrelating experience for 64 frames... -[2023-07-07 22:25:58,640][754316] Decorrelating experience for 128 frames... -[2023-07-07 22:25:58,654][754446] Decorrelating experience for 128 frames... -[2023-07-07 22:25:58,682][754318] Decorrelating experience for 128 frames... -[2023-07-07 22:25:58,686][754317] Decorrelating experience for 192 frames... -[2023-07-07 22:25:58,696][754319] Decorrelating experience for 0 frames... -[2023-07-07 22:25:58,711][754319] Decorrelating experience for 64 frames... -[2023-07-07 22:25:58,724][754316] Decorrelating experience for 192 frames... -[2023-07-07 22:25:58,739][754446] Decorrelating experience for 192 frames... -[2023-07-07 22:25:58,753][754351] Decorrelating experience for 0 frames... -[2023-07-07 22:25:58,754][754319] Decorrelating experience for 128 frames... -[2023-07-07 22:25:58,765][754318] Decorrelating experience for 192 frames... -[2023-07-07 22:25:58,769][754351] Decorrelating experience for 64 frames... -[2023-07-07 22:25:58,811][754351] Decorrelating experience for 128 frames... -[2023-07-07 22:25:58,837][754319] Decorrelating experience for 192 frames... -[2023-07-07 22:25:58,894][754351] Decorrelating experience for 192 frames... -[2023-07-07 22:25:58,918][754414] Decorrelating experience for 0 frames... -[2023-07-07 22:25:58,934][754414] Decorrelating experience for 64 frames... -[2023-07-07 22:25:58,976][754414] Decorrelating experience for 128 frames... -[2023-07-07 22:25:58,981][754315] Decorrelating experience for 0 frames... -[2023-07-07 22:25:58,997][754315] Decorrelating experience for 64 frames... -[2023-07-07 22:25:59,039][754315] Decorrelating experience for 128 frames... -[2023-07-07 22:25:59,060][754414] Decorrelating experience for 192 frames... -[2023-07-07 22:25:59,123][754315] Decorrelating experience for 192 frames... -[2023-07-07 22:25:59,262][754029] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) -[2023-07-07 22:26:02,881][754317] Decorrelating experience for 256 frames... -[2023-07-07 22:26:02,929][754316] Decorrelating experience for 256 frames... -[2023-07-07 22:26:02,970][754318] Decorrelating experience for 256 frames... -[2023-07-07 22:26:03,021][754446] Decorrelating experience for 256 frames... -[2023-07-07 22:26:03,033][754317] Decorrelating experience for 320 frames... -[2023-07-07 22:26:03,043][754319] Decorrelating experience for 256 frames... -[2023-07-07 22:26:03,083][754316] Decorrelating experience for 320 frames... -[2023-07-07 22:26:03,094][754351] Decorrelating experience for 256 frames... -[2023-07-07 22:26:03,122][754318] Decorrelating experience for 320 frames... -[2023-07-07 22:26:03,176][754446] Decorrelating experience for 320 frames... -[2023-07-07 22:26:03,195][754319] Decorrelating experience for 320 frames... -[2023-07-07 22:26:03,226][754317] Decorrelating experience for 384 frames... -[2023-07-07 22:26:03,246][754351] Decorrelating experience for 320 frames... -[2023-07-07 22:26:03,266][754414] Decorrelating experience for 256 frames... -[2023-07-07 22:26:03,277][754316] Decorrelating experience for 384 frames... -[2023-07-07 22:26:03,315][754318] Decorrelating experience for 384 frames... -[2023-07-07 22:26:03,331][754315] Decorrelating experience for 256 frames... -[2023-07-07 22:26:03,372][754446] Decorrelating experience for 384 frames... -[2023-07-07 22:26:03,389][754319] Decorrelating experience for 384 frames... -[2023-07-07 22:26:03,418][754414] Decorrelating experience for 320 frames... -[2023-07-07 22:26:03,439][754351] Decorrelating experience for 384 frames... -[2023-07-07 22:26:03,445][754317] Decorrelating experience for 448 frames... -[2023-07-07 22:26:03,484][754315] Decorrelating experience for 320 frames... -[2023-07-07 22:26:03,497][754316] Decorrelating experience for 448 frames... -[2023-07-07 22:26:03,535][754318] Decorrelating experience for 448 frames... -[2023-07-07 22:26:03,595][754446] Decorrelating experience for 448 frames... -[2023-07-07 22:26:03,609][754319] Decorrelating experience for 448 frames... -[2023-07-07 22:26:03,611][754414] Decorrelating experience for 384 frames... -[2023-07-07 22:26:03,658][754351] Decorrelating experience for 448 frames... -[2023-07-07 22:26:03,677][754315] Decorrelating experience for 384 frames... -[2023-07-07 22:26:03,831][754414] Decorrelating experience for 448 frames... -[2023-07-07 22:26:03,897][754315] Decorrelating experience for 448 frames... -[2023-07-07 22:26:04,262][754029] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) -[2023-07-07 22:26:04,262][754029] Avg episode reward: [(0, '0.648')] -[2023-07-07 22:26:04,264][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000000_0.pth... -[2023-07-07 22:26:08,331][754314] Updated weights for policy 0, policy_version 80 (0.0005) -[2023-07-07 22:26:09,262][754029] Fps is (10 sec: 4915.2, 60 sec: 4915.2, 300 sec: 4915.2). Total num frames: 49152. Throughput: 0: 2037.6. Samples: 20376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:26:09,262][754029] Avg episode reward: [(0, '8.628')] -[2023-07-07 22:26:11,979][754029] Heartbeat connected on Batcher_0 -[2023-07-07 22:26:11,982][754029] Heartbeat connected on LearnerWorker_p0 -[2023-07-07 22:26:11,985][754029] Heartbeat connected on InferenceWorker_p0-w0 -[2023-07-07 22:26:11,989][754029] Heartbeat connected on RolloutWorker_w0 -[2023-07-07 22:26:11,991][754029] Heartbeat connected on RolloutWorker_w1 -[2023-07-07 22:26:11,993][754029] Heartbeat connected on RolloutWorker_w2 -[2023-07-07 22:26:11,995][754029] Heartbeat connected on RolloutWorker_w3 -[2023-07-07 22:26:11,999][754029] Heartbeat connected on RolloutWorker_w4 -[2023-07-07 22:26:12,000][754029] Heartbeat connected on RolloutWorker_w5 -[2023-07-07 22:26:12,016][754029] Heartbeat connected on RolloutWorker_w6 -[2023-07-07 22:26:12,019][754029] Heartbeat connected on RolloutWorker_w7 -[2023-07-07 22:26:12,202][754314] Updated weights for policy 0, policy_version 160 (0.0005) -[2023-07-07 22:26:14,262][754029] Fps is (10 sec: 10240.1, 60 sec: 6826.7, 300 sec: 6826.7). Total num frames: 102400. Throughput: 0: 5590.2. Samples: 83852. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:26:14,262][754029] Avg episode reward: [(0, '16.764')] -[2023-07-07 22:26:14,263][754270] Saving new best policy, reward=16.764! -[2023-07-07 22:26:16,084][754314] Updated weights for policy 0, policy_version 240 (0.0004) -[2023-07-07 22:26:19,262][754029] Fps is (10 sec: 10239.8, 60 sec: 7577.5, 300 sec: 7577.5). Total num frames: 151552. Throughput: 0: 7314.5. Samples: 146292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:26:19,263][754029] Avg episode reward: [(0, '52.126')] -[2023-07-07 22:26:19,298][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000304_155648.pth... -[2023-07-07 22:26:19,301][754270] Saving new best policy, reward=52.126! -[2023-07-07 22:26:20,150][754314] Updated weights for policy 0, policy_version 320 (0.0005) -[2023-07-07 22:26:24,209][754314] Updated weights for policy 0, policy_version 400 (0.0005) -[2023-07-07 22:26:24,262][754029] Fps is (10 sec: 10240.0, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 204800. Throughput: 0: 7045.1. Samples: 176128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:26:24,262][754029] Avg episode reward: [(0, '69.462')] -[2023-07-07 22:26:24,263][754270] Saving new best policy, reward=69.462! -[2023-07-07 22:26:28,291][754314] Updated weights for policy 0, policy_version 480 (0.0005) -[2023-07-07 22:26:29,262][754029] Fps is (10 sec: 10240.3, 60 sec: 8465.1, 300 sec: 8465.1). Total num frames: 253952. Throughput: 0: 7921.1. Samples: 237632. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-07-07 22:26:29,262][754029] Avg episode reward: [(0, '81.216')] -[2023-07-07 22:26:29,263][754270] Saving new best policy, reward=81.216! -[2023-07-07 22:26:32,551][754314] Updated weights for policy 0, policy_version 560 (0.0006) -[2023-07-07 22:26:34,262][754029] Fps is (10 sec: 9830.3, 60 sec: 8660.1, 300 sec: 8660.1). Total num frames: 303104. Throughput: 0: 8426.3. Samples: 294920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:26:34,262][754029] Avg episode reward: [(0, '90.173')] -[2023-07-07 22:26:34,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000592_303104.pth... -[2023-07-07 22:26:34,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000000_0.pth -[2023-07-07 22:26:34,269][754270] Saving new best policy, reward=90.173! -[2023-07-07 22:26:36,585][754314] Updated weights for policy 0, policy_version 640 (0.0005) -[2023-07-07 22:26:39,262][754029] Fps is (10 sec: 9830.4, 60 sec: 8806.4, 300 sec: 8806.4). Total num frames: 352256. Throughput: 0: 8149.8. Samples: 325992. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-07-07 22:26:39,262][754029] Avg episode reward: [(0, '101.170')] -[2023-07-07 22:26:39,263][754270] Saving new best policy, reward=101.170! -[2023-07-07 22:26:40,565][754314] Updated weights for policy 0, policy_version 720 (0.0005) -[2023-07-07 22:26:44,262][754029] Fps is (10 sec: 10240.1, 60 sec: 9011.2, 300 sec: 9011.2). Total num frames: 405504. Throughput: 0: 8611.8. Samples: 387532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:26:44,262][754029] Avg episode reward: [(0, '101.672')] -[2023-07-07 22:26:44,263][754270] Saving new best policy, reward=101.672! -[2023-07-07 22:26:44,632][754314] Updated weights for policy 0, policy_version 800 (0.0005) -[2023-07-07 22:26:48,652][754314] Updated weights for policy 0, policy_version 880 (0.0003) -[2023-07-07 22:26:49,262][754029] Fps is (10 sec: 10239.9, 60 sec: 9093.1, 300 sec: 9093.1). Total num frames: 454656. Throughput: 0: 9954.5. Samples: 447952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:26:49,262][754029] Avg episode reward: [(0, '99.898')] -[2023-07-07 22:26:49,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000888_454656.pth... -[2023-07-07 22:26:49,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000304_155648.pth -[2023-07-07 22:26:52,849][754314] Updated weights for policy 0, policy_version 960 (0.0004) -[2023-07-07 22:26:54,262][754029] Fps is (10 sec: 9830.4, 60 sec: 9160.2, 300 sec: 9160.2). Total num frames: 503808. Throughput: 0: 10154.1. Samples: 477312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:26:54,262][754029] Avg episode reward: [(0, '99.803')] -[2023-07-07 22:26:57,039][754314] Updated weights for policy 0, policy_version 1040 (0.0005) -[2023-07-07 22:26:59,262][754029] Fps is (10 sec: 9830.5, 60 sec: 9216.0, 300 sec: 9216.0). Total num frames: 552960. Throughput: 0: 10056.1. Samples: 536376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:26:59,262][754029] Avg episode reward: [(0, '103.946')] -[2023-07-07 22:26:59,263][754270] Saving new best policy, reward=103.946! -[2023-07-07 22:27:01,232][754314] Updated weights for policy 0, policy_version 1120 (0.0005) -[2023-07-07 22:27:04,262][754029] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9263.3). Total num frames: 602112. Throughput: 0: 9985.6. Samples: 595644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:27:04,262][754029] Avg episode reward: [(0, '107.064')] -[2023-07-07 22:27:04,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001176_602112.pth... -[2023-07-07 22:27:04,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000592_303104.pth -[2023-07-07 22:27:04,269][754270] Saving new best policy, reward=107.064! -[2023-07-07 22:27:05,195][754314] Updated weights for policy 0, policy_version 1200 (0.0005) -[2023-07-07 22:27:09,262][754029] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9303.8). Total num frames: 651264. Throughput: 0: 10008.1. Samples: 626492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:27:09,262][754029] Avg episode reward: [(0, '106.427')] -[2023-07-07 22:27:09,271][754314] Updated weights for policy 0, policy_version 1280 (0.0004) -[2023-07-07 22:27:13,305][754314] Updated weights for policy 0, policy_version 1360 (0.0004) -[2023-07-07 22:27:14,262][754029] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9393.5). Total num frames: 704512. Throughput: 0: 9987.7. Samples: 687080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:27:14,262][754029] Avg episode reward: [(0, '105.901')] -[2023-07-07 22:27:17,601][754314] Updated weights for policy 0, policy_version 1440 (0.0005) -[2023-07-07 22:27:19,262][754029] Fps is (10 sec: 9830.3, 60 sec: 9967.0, 300 sec: 9369.6). Total num frames: 749568. Throughput: 0: 10009.5. Samples: 745348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:27:19,262][754029] Avg episode reward: [(0, '111.297')] -[2023-07-07 22:27:19,273][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001472_753664.pth... -[2023-07-07 22:27:19,275][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000888_454656.pth -[2023-07-07 22:27:19,275][754270] Saving new best policy, reward=111.297! -[2023-07-07 22:27:21,869][754314] Updated weights for policy 0, policy_version 1520 (0.0005) -[2023-07-07 22:27:24,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9396.7). Total num frames: 798720. Throughput: 0: 9958.9. Samples: 774144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:27:24,262][754029] Avg episode reward: [(0, '110.870')] -[2023-07-07 22:27:26,125][754314] Updated weights for policy 0, policy_version 1600 (0.0005) -[2023-07-07 22:27:29,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 9375.3). Total num frames: 843776. Throughput: 0: 9858.3. Samples: 831156. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:27:29,262][754029] Avg episode reward: [(0, '108.803')] -[2023-07-07 22:27:30,591][754314] Updated weights for policy 0, policy_version 1680 (0.0005) -[2023-07-07 22:27:34,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9399.2). Total num frames: 892928. Throughput: 0: 9787.0. Samples: 888364. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-07-07 22:27:34,262][754029] Avg episode reward: [(0, '110.098')] -[2023-07-07 22:27:34,289][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001752_897024.pth... -[2023-07-07 22:27:34,290][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001176_602112.pth -[2023-07-07 22:27:34,714][754314] Updated weights for policy 0, policy_version 1760 (0.0004) -[2023-07-07 22:27:38,833][754314] Updated weights for policy 0, policy_version 1840 (0.0005) -[2023-07-07 22:27:39,262][754029] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9461.8). Total num frames: 946176. Throughput: 0: 9797.4. Samples: 918196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:27:39,262][754029] Avg episode reward: [(0, '110.695')] -[2023-07-07 22:27:40,469][754270] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000004 -[2023-07-07 22:27:42,916][754314] Updated weights for policy 0, policy_version 1920 (0.0005) -[2023-07-07 22:27:44,262][754029] Fps is (10 sec: 10239.9, 60 sec: 9830.4, 300 sec: 9479.3). Total num frames: 995328. Throughput: 0: 9811.4. Samples: 977888. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:27:44,262][754029] Avg episode reward: [(0, '111.502')] -[2023-07-07 22:27:44,263][754270] Saving new best policy, reward=111.502! -[2023-07-07 22:27:47,028][754314] Updated weights for policy 0, policy_version 2000 (0.0005) -[2023-07-07 22:27:49,262][754029] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9495.3). Total num frames: 1044480. Throughput: 0: 9815.4. Samples: 1037336. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-07-07 22:27:49,262][754029] Avg episode reward: [(0, '108.511')] -[2023-07-07 22:27:49,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002040_1044480.pth... -[2023-07-07 22:27:49,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001472_753664.pth -[2023-07-07 22:27:51,235][754314] Updated weights for policy 0, policy_version 2080 (0.0005) -[2023-07-07 22:27:54,262][754029] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9509.8). Total num frames: 1093632. Throughput: 0: 9777.1. Samples: 1066460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:27:54,262][754029] Avg episode reward: [(0, '110.679')] -[2023-07-07 22:27:55,502][754314] Updated weights for policy 0, policy_version 2160 (0.0005) -[2023-07-07 22:27:59,262][754029] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9523.2). Total num frames: 1142784. Throughput: 0: 9740.8. Samples: 1125416. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:27:59,262][754029] Avg episode reward: [(0, '109.765')] -[2023-07-07 22:27:59,607][754314] Updated weights for policy 0, policy_version 2240 (0.0005) -[2023-07-07 22:28:03,748][754314] Updated weights for policy 0, policy_version 2320 (0.0004) -[2023-07-07 22:28:04,262][754029] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9535.5). Total num frames: 1191936. Throughput: 0: 9743.9. Samples: 1183824. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:28:04,262][754029] Avg episode reward: [(0, '109.910')] -[2023-07-07 22:28:04,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002328_1191936.pth... -[2023-07-07 22:28:04,267][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001752_897024.pth -[2023-07-07 22:28:07,987][754314] Updated weights for policy 0, policy_version 2400 (0.0005) -[2023-07-07 22:28:09,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9515.3). Total num frames: 1236992. Throughput: 0: 9746.4. Samples: 1212732. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:28:09,262][754029] Avg episode reward: [(0, '113.376')] -[2023-07-07 22:28:09,270][754270] Saving new best policy, reward=113.376! -[2023-07-07 22:28:12,303][754314] Updated weights for policy 0, policy_version 2480 (0.0005) -[2023-07-07 22:28:14,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9527.0). Total num frames: 1286144. Throughput: 0: 9761.4. Samples: 1270420. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-07-07 22:28:14,262][754029] Avg episode reward: [(0, '116.302')] -[2023-07-07 22:28:14,263][754270] Saving new best policy, reward=116.302! -[2023-07-07 22:28:16,780][754314] Updated weights for policy 0, policy_version 2560 (0.0006) -[2023-07-07 22:28:19,262][754029] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9508.6). Total num frames: 1331200. Throughput: 0: 9714.0. Samples: 1325496. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:28:19,262][754029] Avg episode reward: [(0, '118.207')] -[2023-07-07 22:28:19,264][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002600_1331200.pth... -[2023-07-07 22:28:19,267][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002040_1044480.pth -[2023-07-07 22:28:19,268][754270] Saving new best policy, reward=118.207! -[2023-07-07 22:28:21,268][754314] Updated weights for policy 0, policy_version 2640 (0.0005) -[2023-07-07 22:28:24,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9625.6, 300 sec: 9491.4). Total num frames: 1376256. Throughput: 0: 9658.0. Samples: 1352804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:28:24,262][754029] Avg episode reward: [(0, '110.803')] -[2023-07-07 22:28:25,605][754314] Updated weights for policy 0, policy_version 2720 (0.0005) -[2023-07-07 22:28:29,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9502.7). Total num frames: 1425408. Throughput: 0: 9580.8. Samples: 1409024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:28:29,262][754029] Avg episode reward: [(0, '116.350')] -[2023-07-07 22:28:30,069][754314] Updated weights for policy 0, policy_version 2800 (0.0005) -[2023-07-07 22:28:34,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9486.9). Total num frames: 1470464. Throughput: 0: 9475.0. Samples: 1463712. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:28:34,262][754029] Avg episode reward: [(0, '118.446')] -[2023-07-07 22:28:34,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002872_1470464.pth... -[2023-07-07 22:28:34,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002328_1191936.pth -[2023-07-07 22:28:34,268][754270] Saving new best policy, reward=118.446! -[2023-07-07 22:28:34,569][754314] Updated weights for policy 0, policy_version 2880 (0.0005) -[2023-07-07 22:28:39,089][754314] Updated weights for policy 0, policy_version 2960 (0.0005) -[2023-07-07 22:28:39,262][754029] Fps is (10 sec: 9011.1, 60 sec: 9489.1, 300 sec: 9472.0). Total num frames: 1515520. Throughput: 0: 9433.1. Samples: 1490952. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:28:39,262][754029] Avg episode reward: [(0, '116.825')] -[2023-07-07 22:28:43,415][754314] Updated weights for policy 0, policy_version 3040 (0.0003) -[2023-07-07 22:28:44,262][754029] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9482.9). Total num frames: 1564672. Throughput: 0: 9356.5. Samples: 1546460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:28:44,262][754029] Avg episode reward: [(0, '113.057')] -[2023-07-07 22:28:47,509][754314] Updated weights for policy 0, policy_version 3120 (0.0003) -[2023-07-07 22:28:49,262][754029] Fps is (10 sec: 9830.3, 60 sec: 9489.0, 300 sec: 9493.1). Total num frames: 1613824. Throughput: 0: 9392.1. Samples: 1606468. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-07-07 22:28:49,262][754029] Avg episode reward: [(0, '115.746')] -[2023-07-07 22:28:49,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003152_1613824.pth... -[2023-07-07 22:28:49,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002600_1331200.pth -[2023-07-07 22:28:51,609][754314] Updated weights for policy 0, policy_version 3200 (0.0004) -[2023-07-07 22:28:54,262][754029] Fps is (10 sec: 9830.5, 60 sec: 9489.1, 300 sec: 9502.7). Total num frames: 1662976. Throughput: 0: 9406.6. Samples: 1636028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:28:54,262][754029] Avg episode reward: [(0, '107.035')] -[2023-07-07 22:28:55,848][754314] Updated weights for policy 0, policy_version 3280 (0.0005) -[2023-07-07 22:28:59,262][754029] Fps is (10 sec: 9830.5, 60 sec: 9489.1, 300 sec: 9511.8). Total num frames: 1712128. Throughput: 0: 9428.4. Samples: 1694700. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-07-07 22:28:59,262][754029] Avg episode reward: [(0, '113.922')] -[2023-07-07 22:29:00,100][754314] Updated weights for policy 0, policy_version 3360 (0.0005) -[2023-07-07 22:29:04,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9498.3). Total num frames: 1757184. Throughput: 0: 9474.0. Samples: 1751824. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:29:04,262][754029] Avg episode reward: [(0, '115.351')] -[2023-07-07 22:29:04,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003432_1757184.pth... -[2023-07-07 22:29:04,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002872_1470464.pth -[2023-07-07 22:29:04,372][754314] Updated weights for policy 0, policy_version 3440 (0.0004) -[2023-07-07 22:29:08,615][754314] Updated weights for policy 0, policy_version 3520 (0.0005) -[2023-07-07 22:29:09,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9507.0). Total num frames: 1806336. Throughput: 0: 9515.6. Samples: 1781008. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-07-07 22:29:09,262][754029] Avg episode reward: [(0, '114.660')] -[2023-07-07 22:29:13,006][754314] Updated weights for policy 0, policy_version 3600 (0.0005) -[2023-07-07 22:29:14,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9494.3). Total num frames: 1851392. Throughput: 0: 9517.9. Samples: 1837332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:29:14,262][754029] Avg episode reward: [(0, '117.114')] -[2023-07-07 22:29:17,520][754314] Updated weights for policy 0, policy_version 3680 (0.0005) -[2023-07-07 22:29:19,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9502.7). Total num frames: 1900544. Throughput: 0: 9525.8. Samples: 1892372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:29:19,262][754029] Avg episode reward: [(0, '119.702')] -[2023-07-07 22:29:19,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003712_1900544.pth... -[2023-07-07 22:29:19,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003152_1613824.pth -[2023-07-07 22:29:19,269][754270] Saving new best policy, reward=119.702! -[2023-07-07 22:29:21,987][754314] Updated weights for policy 0, policy_version 3760 (0.0006) -[2023-07-07 22:29:24,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9490.7). Total num frames: 1945600. Throughput: 0: 9550.1. Samples: 1920708. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-07-07 22:29:24,262][754029] Avg episode reward: [(0, '110.155')] -[2023-07-07 22:29:26,366][754314] Updated weights for policy 0, policy_version 3840 (0.0005) -[2023-07-07 22:29:29,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9479.3). Total num frames: 1990656. Throughput: 0: 9539.2. Samples: 1975724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:29:29,262][754029] Avg episode reward: [(0, '111.410')] -[2023-07-07 22:29:30,804][754314] Updated weights for policy 0, policy_version 3920 (0.0005) -[2023-07-07 22:29:34,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9468.4). Total num frames: 2035712. Throughput: 0: 9442.4. Samples: 2031372. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-07-07 22:29:34,262][754029] Avg episode reward: [(0, '114.237')] -[2023-07-07 22:29:34,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003976_2035712.pth... -[2023-07-07 22:29:34,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003432_1757184.pth -[2023-07-07 22:29:35,279][754314] Updated weights for policy 0, policy_version 4000 (0.0005) -[2023-07-07 22:29:39,262][754029] Fps is (10 sec: 9011.1, 60 sec: 9420.8, 300 sec: 9458.0). Total num frames: 2080768. Throughput: 0: 9371.1. Samples: 2057728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:29:39,262][754029] Avg episode reward: [(0, '111.634')] -[2023-07-07 22:29:39,902][754314] Updated weights for policy 0, policy_version 4080 (0.0005) -[2023-07-07 22:29:44,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9448.1). Total num frames: 2125824. Throughput: 0: 9270.5. Samples: 2111872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:29:44,262][754029] Avg episode reward: [(0, '116.356')] -[2023-07-07 22:29:44,461][754314] Updated weights for policy 0, policy_version 4160 (0.0005) -[2023-07-07 22:29:48,802][754314] Updated weights for policy 0, policy_version 4240 (0.0005) -[2023-07-07 22:29:49,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9456.4). Total num frames: 2174976. Throughput: 0: 9217.1. Samples: 2166592. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-07-07 22:29:49,262][754029] Avg episode reward: [(0, '117.621')] -[2023-07-07 22:29:49,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004248_2174976.pth... -[2023-07-07 22:29:49,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003712_1900544.pth -[2023-07-07 22:29:53,180][754314] Updated weights for policy 0, policy_version 4320 (0.0005) -[2023-07-07 22:29:54,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9446.9). Total num frames: 2220032. Throughput: 0: 9209.7. Samples: 2195444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:29:54,262][754029] Avg episode reward: [(0, '118.187')] -[2023-07-07 22:29:57,677][754314] Updated weights for policy 0, policy_version 4400 (0.0005) -[2023-07-07 22:29:59,262][754029] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9437.9). Total num frames: 2265088. Throughput: 0: 9171.8. Samples: 2250064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:29:59,262][754029] Avg episode reward: [(0, '123.582')] -[2023-07-07 22:29:59,263][754270] Saving new best policy, reward=123.582! -[2023-07-07 22:30:02,169][754314] Updated weights for policy 0, policy_version 4480 (0.0006) -[2023-07-07 22:30:04,262][754029] Fps is (10 sec: 9011.1, 60 sec: 9216.0, 300 sec: 9429.2). Total num frames: 2310144. Throughput: 0: 9191.7. Samples: 2306000. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-07-07 22:30:04,262][754029] Avg episode reward: [(0, '118.739')] -[2023-07-07 22:30:04,320][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004520_2314240.pth... -[2023-07-07 22:30:04,322][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003976_2035712.pth -[2023-07-07 22:30:06,567][754314] Updated weights for policy 0, policy_version 4560 (0.0005) -[2023-07-07 22:30:09,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9437.2). Total num frames: 2359296. Throughput: 0: 9169.5. Samples: 2333336. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-07-07 22:30:09,262][754029] Avg episode reward: [(0, '108.093')] -[2023-07-07 22:30:11,035][754314] Updated weights for policy 0, policy_version 4640 (0.0005) -[2023-07-07 22:30:14,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9428.8). Total num frames: 2404352. Throughput: 0: 9162.4. Samples: 2388032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:30:14,262][754029] Avg episode reward: [(0, '118.379')] -[2023-07-07 22:30:15,578][754314] Updated weights for policy 0, policy_version 4720 (0.0005) -[2023-07-07 22:30:19,262][754029] Fps is (10 sec: 9011.1, 60 sec: 9147.7, 300 sec: 9420.8). Total num frames: 2449408. Throughput: 0: 9119.5. Samples: 2441752. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:30:19,262][754029] Avg episode reward: [(0, '119.222')] -[2023-07-07 22:30:19,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004784_2449408.pth... -[2023-07-07 22:30:19,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004248_2174976.pth -[2023-07-07 22:30:20,060][754314] Updated weights for policy 0, policy_version 4800 (0.0005) -[2023-07-07 22:30:24,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9413.1). Total num frames: 2494464. Throughput: 0: 9159.1. Samples: 2469888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:30:24,262][754029] Avg episode reward: [(0, '119.718')] -[2023-07-07 22:30:24,599][754314] Updated weights for policy 0, policy_version 4880 (0.0005) -[2023-07-07 22:30:29,103][754314] Updated weights for policy 0, policy_version 4960 (0.0005) -[2023-07-07 22:30:29,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9405.6). Total num frames: 2539520. Throughput: 0: 9152.0. Samples: 2523712. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-07-07 22:30:29,262][754029] Avg episode reward: [(0, '107.524')] -[2023-07-07 22:30:33,604][754314] Updated weights for policy 0, policy_version 5040 (0.0005) -[2023-07-07 22:30:34,262][754029] Fps is (10 sec: 9011.1, 60 sec: 9147.7, 300 sec: 9398.5). Total num frames: 2584576. Throughput: 0: 9165.8. Samples: 2579052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:30:34,262][754029] Avg episode reward: [(0, '108.045')] -[2023-07-07 22:30:34,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005048_2584576.pth... -[2023-07-07 22:30:34,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004520_2314240.pth -[2023-07-07 22:30:37,931][754314] Updated weights for policy 0, policy_version 5120 (0.0005) -[2023-07-07 22:30:39,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9406.2). Total num frames: 2633728. Throughput: 0: 9141.8. Samples: 2606824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:30:39,262][754029] Avg episode reward: [(0, '112.433')] -[2023-07-07 22:30:42,167][754314] Updated weights for policy 0, policy_version 5200 (0.0005) -[2023-07-07 22:30:44,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9399.2). Total num frames: 2678784. Throughput: 0: 9212.6. Samples: 2664632. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:30:44,262][754029] Avg episode reward: [(0, '107.492')] -[2023-07-07 22:30:46,525][754314] Updated weights for policy 0, policy_version 5280 (0.0005) -[2023-07-07 22:30:49,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9406.7). Total num frames: 2727936. Throughput: 0: 9244.3. Samples: 2721992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:30:49,262][754029] Avg episode reward: [(0, '106.292')] -[2023-07-07 22:30:49,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005328_2727936.pth... -[2023-07-07 22:30:49,267][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004784_2449408.pth -[2023-07-07 22:30:50,744][754314] Updated weights for policy 0, policy_version 5360 (0.0005) -[2023-07-07 22:30:54,262][754029] Fps is (10 sec: 9830.4, 60 sec: 9284.3, 300 sec: 9413.9). Total num frames: 2777088. Throughput: 0: 9280.1. Samples: 2750940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:30:54,262][754029] Avg episode reward: [(0, '102.846')] -[2023-07-07 22:30:55,059][754314] Updated weights for policy 0, policy_version 5440 (0.0005) -[2023-07-07 22:30:59,262][754029] Fps is (10 sec: 9420.7, 60 sec: 9284.3, 300 sec: 9566.6). Total num frames: 2822144. Throughput: 0: 9294.5. Samples: 2806284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:30:59,262][754029] Avg episode reward: [(0, '109.054')] -[2023-07-07 22:30:59,650][754314] Updated weights for policy 0, policy_version 5520 (0.0005) -[2023-07-07 22:31:04,027][754314] Updated weights for policy 0, policy_version 5600 (0.0005) -[2023-07-07 22:31:04,262][754029] Fps is (10 sec: 9011.1, 60 sec: 9284.3, 300 sec: 9552.7). Total num frames: 2867200. Throughput: 0: 9312.6. Samples: 2860820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:31:04,262][754029] Avg episode reward: [(0, '98.465')] -[2023-07-07 22:31:04,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005600_2867200.pth... -[2023-07-07 22:31:04,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005048_2584576.pth -[2023-07-07 22:31:08,219][754314] Updated weights for policy 0, policy_version 5680 (0.0005) -[2023-07-07 22:31:09,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9284.3, 300 sec: 9538.8). Total num frames: 2916352. Throughput: 0: 9340.2. Samples: 2890196. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:31:09,262][754029] Avg episode reward: [(0, '115.359')] -[2023-07-07 22:31:12,522][754314] Updated weights for policy 0, policy_version 5760 (0.0005) -[2023-07-07 22:31:14,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9284.3, 300 sec: 9524.9). Total num frames: 2961408. Throughput: 0: 9431.7. Samples: 2948140. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-07-07 22:31:14,262][754029] Avg episode reward: [(0, '107.956')] -[2023-07-07 22:31:16,965][754314] Updated weights for policy 0, policy_version 5840 (0.0005) -[2023-07-07 22:31:19,262][754029] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9511.0). Total num frames: 3010560. Throughput: 0: 9409.2. Samples: 3002468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:31:19,263][754029] Avg episode reward: [(0, '107.921')] -[2023-07-07 22:31:19,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005880_3010560.pth... -[2023-07-07 22:31:19,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005328_2727936.pth -[2023-07-07 22:31:21,492][754314] Updated weights for policy 0, policy_version 5920 (0.0005) -[2023-07-07 22:31:24,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9497.2). Total num frames: 3055616. Throughput: 0: 9421.1. Samples: 3030772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:31:24,262][754029] Avg episode reward: [(0, '114.827')] -[2023-07-07 22:31:26,011][754314] Updated weights for policy 0, policy_version 6000 (0.0005) -[2023-07-07 22:31:29,262][754029] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9483.3). Total num frames: 3100672. Throughput: 0: 9327.1. Samples: 3084352. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:31:29,262][754029] Avg episode reward: [(0, '123.142')] -[2023-07-07 22:31:30,561][754314] Updated weights for policy 0, policy_version 6080 (0.0005) -[2023-07-07 22:31:34,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9469.4). Total num frames: 3145728. Throughput: 0: 9256.6. Samples: 3138540. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:31:34,262][754029] Avg episode reward: [(0, '116.090')] -[2023-07-07 22:31:34,312][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006144_3145728.pth... -[2023-07-07 22:31:34,315][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005600_2867200.pth -[2023-07-07 22:31:35,016][754314] Updated weights for policy 0, policy_version 6160 (0.0005) -[2023-07-07 22:31:39,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9441.6). Total num frames: 3190784. Throughput: 0: 9233.4. Samples: 3166444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:31:39,262][754029] Avg episode reward: [(0, '111.675')] -[2023-07-07 22:31:39,440][754314] Updated weights for policy 0, policy_version 6240 (0.0006) -[2023-07-07 22:31:43,591][754314] Updated weights for policy 0, policy_version 6320 (0.0005) -[2023-07-07 22:31:44,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 3239936. Throughput: 0: 9285.2. Samples: 3224116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:31:44,262][754029] Avg episode reward: [(0, '113.152')] -[2023-07-07 22:31:47,846][754314] Updated weights for policy 0, policy_version 6400 (0.0005) -[2023-07-07 22:31:49,262][754029] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 3289088. Throughput: 0: 9360.4. Samples: 3282036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:31:49,262][754029] Avg episode reward: [(0, '126.241')] -[2023-07-07 22:31:49,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006424_3289088.pth... -[2023-07-07 22:31:49,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005880_3010560.pth -[2023-07-07 22:31:49,270][754270] Saving new best policy, reward=126.241! -[2023-07-07 22:31:52,068][754314] Updated weights for policy 0, policy_version 6480 (0.0005) -[2023-07-07 22:31:54,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9284.3, 300 sec: 9427.7). Total num frames: 3334144. Throughput: 0: 9367.6. Samples: 3311736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:31:54,262][754029] Avg episode reward: [(0, '107.024')] -[2023-07-07 22:31:56,557][754314] Updated weights for policy 0, policy_version 6560 (0.0005) -[2023-07-07 22:31:59,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9427.7). Total num frames: 3383296. Throughput: 0: 9301.7. Samples: 3366716. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-07-07 22:31:59,262][754029] Avg episode reward: [(0, '114.227')] -[2023-07-07 22:32:00,953][754314] Updated weights for policy 0, policy_version 6640 (0.0005) -[2023-07-07 22:32:04,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9352.6, 300 sec: 9413.9). Total num frames: 3428352. Throughput: 0: 9350.8. Samples: 3423252. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-07-07 22:32:04,262][754029] Avg episode reward: [(0, '123.146')] -[2023-07-07 22:32:04,318][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006704_3432448.pth... -[2023-07-07 22:32:04,321][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006144_3145728.pth -[2023-07-07 22:32:05,217][754314] Updated weights for policy 0, policy_version 6720 (0.0005) -[2023-07-07 22:32:09,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9400.0). Total num frames: 3477504. Throughput: 0: 9358.1. Samples: 3451884. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-07-07 22:32:09,262][754029] Avg episode reward: [(0, '109.156')] -[2023-07-07 22:32:09,720][754314] Updated weights for policy 0, policy_version 6800 (0.0006) -[2023-07-07 22:32:14,074][754314] Updated weights for policy 0, policy_version 6880 (0.0005) -[2023-07-07 22:32:14,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9400.0). Total num frames: 3522560. Throughput: 0: 9374.0. Samples: 3506184. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:32:14,262][754029] Avg episode reward: [(0, '116.583')] -[2023-07-07 22:32:18,552][754314] Updated weights for policy 0, policy_version 6960 (0.0005) -[2023-07-07 22:32:19,262][754029] Fps is (10 sec: 9011.1, 60 sec: 9284.3, 300 sec: 9386.1). Total num frames: 3567616. Throughput: 0: 9418.5. Samples: 3562372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:32:19,262][754029] Avg episode reward: [(0, '116.806')] -[2023-07-07 22:32:19,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006968_3567616.pth... -[2023-07-07 22:32:19,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006424_3289088.pth -[2023-07-07 22:32:23,103][754314] Updated weights for policy 0, policy_version 7040 (0.0005) -[2023-07-07 22:32:24,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9386.1). Total num frames: 3612672. Throughput: 0: 9387.3. Samples: 3588872. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:32:24,262][754029] Avg episode reward: [(0, '118.081')] -[2023-07-07 22:32:27,646][754314] Updated weights for policy 0, policy_version 7120 (0.0005) -[2023-07-07 22:32:29,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9372.2). Total num frames: 3657728. Throughput: 0: 9316.1. Samples: 3643340. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:32:29,262][754029] Avg episode reward: [(0, '109.213')] -[2023-07-07 22:32:32,072][754314] Updated weights for policy 0, policy_version 7200 (0.0005) -[2023-07-07 22:32:34,262][754029] Fps is (10 sec: 9011.1, 60 sec: 9284.3, 300 sec: 9344.4). Total num frames: 3702784. Throughput: 0: 9259.1. Samples: 3698696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:32:34,262][754029] Avg episode reward: [(0, '117.304')] -[2023-07-07 22:32:34,279][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007240_3706880.pth... -[2023-07-07 22:32:34,282][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006704_3432448.pth -[2023-07-07 22:32:36,432][754314] Updated weights for policy 0, policy_version 7280 (0.0005) -[2023-07-07 22:32:39,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9344.4). Total num frames: 3751936. Throughput: 0: 9233.8. Samples: 3727256. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:32:39,262][754029] Avg episode reward: [(0, '103.956')] -[2023-07-07 22:32:40,607][754314] Updated weights for policy 0, policy_version 7360 (0.0005) -[2023-07-07 22:32:44,262][754029] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9344.4). Total num frames: 3801088. Throughput: 0: 9303.8. Samples: 3785388. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-07-07 22:32:44,262][754029] Avg episode reward: [(0, '116.529')] -[2023-07-07 22:32:44,864][754314] Updated weights for policy 0, policy_version 7440 (0.0005) -[2023-07-07 22:32:49,107][754314] Updated weights for policy 0, policy_version 7520 (0.0005) -[2023-07-07 22:32:49,262][754029] Fps is (10 sec: 9830.3, 60 sec: 9352.5, 300 sec: 9344.4). Total num frames: 3850240. Throughput: 0: 9347.6. Samples: 3843896. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-07-07 22:32:49,262][754029] Avg episode reward: [(0, '120.204')] -[2023-07-07 22:32:49,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007520_3850240.pth... -[2023-07-07 22:32:49,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006968_3567616.pth -[2023-07-07 22:32:53,596][754314] Updated weights for policy 0, policy_version 7600 (0.0005) -[2023-07-07 22:32:54,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9330.5). Total num frames: 3895296. Throughput: 0: 9309.1. Samples: 3870792. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-07-07 22:32:54,262][754029] Avg episode reward: [(0, '109.893')] -[2023-07-07 22:32:57,875][754314] Updated weights for policy 0, policy_version 7680 (0.0005) -[2023-07-07 22:32:59,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9330.5). Total num frames: 3944448. Throughput: 0: 9371.5. Samples: 3927904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:32:59,262][754029] Avg episode reward: [(0, '100.962')] -[2023-07-07 22:33:02,113][754314] Updated weights for policy 0, policy_version 7760 (0.0005) -[2023-07-07 22:33:04,262][754029] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9344.4). Total num frames: 3993600. Throughput: 0: 9400.8. Samples: 3985408. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:33:04,262][754029] Avg episode reward: [(0, '108.193')] -[2023-07-07 22:33:04,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007800_3993600.pth... -[2023-07-07 22:33:04,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007240_3706880.pth -[2023-07-07 22:33:06,451][754314] Updated weights for policy 0, policy_version 7840 (0.0005) -[2023-07-07 22:33:09,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9330.5). Total num frames: 4038656. Throughput: 0: 9438.5. Samples: 4013604. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:33:09,262][754029] Avg episode reward: [(0, '107.596')] -[2023-07-07 22:33:10,685][754314] Updated weights for policy 0, policy_version 7920 (0.0005) -[2023-07-07 22:33:14,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9344.4). Total num frames: 4087808. Throughput: 0: 9519.5. Samples: 4071716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:33:14,262][754029] Avg episode reward: [(0, '102.893')] -[2023-07-07 22:33:14,904][754314] Updated weights for policy 0, policy_version 8000 (0.0005) -[2023-07-07 22:33:19,154][754314] Updated weights for policy 0, policy_version 8080 (0.0005) -[2023-07-07 22:33:19,262][754029] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9358.3). Total num frames: 4136960. Throughput: 0: 9568.8. Samples: 4129292. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-07-07 22:33:19,262][754029] Avg episode reward: [(0, '111.076')] -[2023-07-07 22:33:19,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008080_4136960.pth... -[2023-07-07 22:33:19,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007520_3850240.pth -[2023-07-07 22:33:23,372][754314] Updated weights for policy 0, policy_version 8160 (0.0005) -[2023-07-07 22:33:24,262][754029] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9358.3). Total num frames: 4186112. Throughput: 0: 9585.4. Samples: 4158600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:33:24,262][754029] Avg episode reward: [(0, '105.567')] -[2023-07-07 22:33:27,587][754314] Updated weights for policy 0, policy_version 8240 (0.0004) -[2023-07-07 22:33:29,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9358.3). Total num frames: 4231168. Throughput: 0: 9593.8. Samples: 4217108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:33:29,262][754029] Avg episode reward: [(0, '106.966')] -[2023-07-07 22:33:31,854][754314] Updated weights for policy 0, policy_version 8320 (0.0005) -[2023-07-07 22:33:34,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9372.2). Total num frames: 4280320. Throughput: 0: 9571.6. Samples: 4274616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:33:34,262][754029] Avg episode reward: [(0, '108.304')] -[2023-07-07 22:33:34,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008360_4280320.pth... -[2023-07-07 22:33:34,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007800_3993600.pth -[2023-07-07 22:33:36,130][754314] Updated weights for policy 0, policy_version 8400 (0.0005) -[2023-07-07 22:33:39,262][754029] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9372.2). Total num frames: 4329472. Throughput: 0: 9614.8. Samples: 4303460. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:33:39,262][754029] Avg episode reward: [(0, '101.998')] -[2023-07-07 22:33:40,339][754314] Updated weights for policy 0, policy_version 8480 (0.0004) -[2023-07-07 22:33:44,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9358.3). Total num frames: 4374528. Throughput: 0: 9643.4. Samples: 4361856. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:33:44,262][754029] Avg episode reward: [(0, '102.829')] -[2023-07-07 22:33:44,758][754314] Updated weights for policy 0, policy_version 8560 (0.0005) -[2023-07-07 22:33:49,240][754314] Updated weights for policy 0, policy_version 8640 (0.0005) -[2023-07-07 22:33:49,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9557.4, 300 sec: 9358.3). Total num frames: 4423680. Throughput: 0: 9558.9. Samples: 4415560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:33:49,262][754029] Avg episode reward: [(0, '103.574')] -[2023-07-07 22:33:49,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008640_4423680.pth... -[2023-07-07 22:33:49,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008080_4136960.pth -[2023-07-07 22:33:53,811][754314] Updated weights for policy 0, policy_version 8720 (0.0006) -[2023-07-07 22:33:54,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9344.4). Total num frames: 4468736. Throughput: 0: 9553.1. Samples: 4443492. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-07-07 22:33:54,262][754029] Avg episode reward: [(0, '107.843')] -[2023-07-07 22:33:58,212][754314] Updated weights for policy 0, policy_version 8800 (0.0005) -[2023-07-07 22:33:59,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9489.1, 300 sec: 9344.4). Total num frames: 4513792. Throughput: 0: 9461.3. Samples: 4497472. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-07-07 22:33:59,262][754029] Avg episode reward: [(0, '102.976')] -[2023-07-07 22:34:02,607][754314] Updated weights for policy 0, policy_version 8880 (0.0005) -[2023-07-07 22:34:04,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9330.5). Total num frames: 4558848. Throughput: 0: 9434.8. Samples: 4553860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:34:04,262][754029] Avg episode reward: [(0, '104.786')] -[2023-07-07 22:34:04,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008904_4558848.pth... -[2023-07-07 22:34:04,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008360_4280320.pth -[2023-07-07 22:34:06,932][754314] Updated weights for policy 0, policy_version 8960 (0.0005) -[2023-07-07 22:34:09,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9344.4). Total num frames: 4608000. Throughput: 0: 9418.3. Samples: 4582424. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-07-07 22:34:09,262][754029] Avg episode reward: [(0, '92.024')] -[2023-07-07 22:34:11,275][754314] Updated weights for policy 0, policy_version 9040 (0.0005) -[2023-07-07 22:34:14,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9330.6). Total num frames: 4653056. Throughput: 0: 9364.6. Samples: 4638516. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-07-07 22:34:14,262][754029] Avg episode reward: [(0, '106.521')] -[2023-07-07 22:34:15,627][754314] Updated weights for policy 0, policy_version 9120 (0.0005) -[2023-07-07 22:34:19,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9344.4). Total num frames: 4702208. Throughput: 0: 9369.2. Samples: 4696228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:34:19,262][754029] Avg episode reward: [(0, '90.143')] -[2023-07-07 22:34:19,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009184_4702208.pth... -[2023-07-07 22:34:19,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008640_4423680.pth -[2023-07-07 22:34:19,869][754314] Updated weights for policy 0, policy_version 9200 (0.0005) -[2023-07-07 22:34:24,095][754314] Updated weights for policy 0, policy_version 9280 (0.0005) -[2023-07-07 22:34:24,262][754029] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9358.3). Total num frames: 4751360. Throughput: 0: 9373.0. Samples: 4725244. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:34:24,262][754029] Avg episode reward: [(0, '85.435')] -[2023-07-07 22:34:28,356][754314] Updated weights for policy 0, policy_version 9360 (0.0005) -[2023-07-07 22:34:29,262][754029] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9372.2). Total num frames: 4800512. Throughput: 0: 9353.6. Samples: 4782768. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:34:29,262][754029] Avg episode reward: [(0, '84.615')] -[2023-07-07 22:34:32,704][754314] Updated weights for policy 0, policy_version 9440 (0.0005) -[2023-07-07 22:34:34,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9372.2). Total num frames: 4845568. Throughput: 0: 9425.6. Samples: 4839712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:34:34,262][754029] Avg episode reward: [(0, '84.447')] -[2023-07-07 22:34:34,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009464_4845568.pth... -[2023-07-07 22:34:34,267][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008904_4558848.pth -[2023-07-07 22:34:37,341][754314] Updated weights for policy 0, policy_version 9520 (0.0005) -[2023-07-07 22:34:39,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 4890624. Throughput: 0: 9391.7. Samples: 4866120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:34:39,262][754029] Avg episode reward: [(0, '87.666')] -[2023-07-07 22:34:41,724][754314] Updated weights for policy 0, policy_version 9600 (0.0005) -[2023-07-07 22:34:44,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9358.3). Total num frames: 4935680. Throughput: 0: 9420.3. Samples: 4921384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:34:44,262][754029] Avg episode reward: [(0, '85.406')] -[2023-07-07 22:34:46,065][754314] Updated weights for policy 0, policy_version 9680 (0.0006) -[2023-07-07 22:34:49,262][754029] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 4984832. Throughput: 0: 9430.6. Samples: 4978236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:34:49,262][754029] Avg episode reward: [(0, '86.214')] -[2023-07-07 22:34:49,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009736_4984832.pth... -[2023-07-07 22:34:49,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009184_4702208.pth -[2023-07-07 22:34:50,313][754314] Updated weights for policy 0, policy_version 9760 (0.0005) -[2023-07-07 22:34:54,262][754029] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 5033984. Throughput: 0: 9445.3. Samples: 5007464. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:34:54,262][754029] Avg episode reward: [(0, '87.133')] -[2023-07-07 22:34:54,562][754314] Updated weights for policy 0, policy_version 9840 (0.0005) -[2023-07-07 22:34:58,902][754314] Updated weights for policy 0, policy_version 9920 (0.0005) -[2023-07-07 22:34:59,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 5079040. Throughput: 0: 9462.7. Samples: 5064336. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:34:59,262][754029] Avg episode reward: [(0, '88.538')] -[2023-07-07 22:35:03,134][754314] Updated weights for policy 0, policy_version 10000 (0.0005) -[2023-07-07 22:35:04,262][754029] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9386.1). Total num frames: 5128192. Throughput: 0: 9472.2. Samples: 5122476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:35:04,262][754029] Avg episode reward: [(0, '81.297')] -[2023-07-07 22:35:04,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010016_5128192.pth... -[2023-07-07 22:35:04,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009464_4845568.pth -[2023-07-07 22:35:07,626][754314] Updated weights for policy 0, policy_version 10080 (0.0005) -[2023-07-07 22:35:09,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 5173248. Throughput: 0: 9436.3. Samples: 5149876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:35:09,262][754029] Avg episode reward: [(0, '82.755')] -[2023-07-07 22:35:12,082][754314] Updated weights for policy 0, policy_version 10160 (0.0005) -[2023-07-07 22:35:14,262][754029] Fps is (10 sec: 9011.3, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 5218304. Throughput: 0: 9384.6. Samples: 5205076. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-07-07 22:35:14,262][754029] Avg episode reward: [(0, '85.651')] -[2023-07-07 22:35:16,534][754314] Updated weights for policy 0, policy_version 10240 (0.0005) -[2023-07-07 22:35:19,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9400.0). Total num frames: 5267456. Throughput: 0: 9324.8. Samples: 5259328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:35:19,262][754029] Avg episode reward: [(0, '74.846')] -[2023-07-07 22:35:19,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010288_5267456.pth... -[2023-07-07 22:35:19,267][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009736_4984832.pth -[2023-07-07 22:35:21,114][754314] Updated weights for policy 0, policy_version 10320 (0.0005) -[2023-07-07 22:35:24,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9386.1). Total num frames: 5308416. Throughput: 0: 9342.0. Samples: 5286512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:35:24,262][754029] Avg episode reward: [(0, '82.147')] -[2023-07-07 22:35:25,786][754314] Updated weights for policy 0, policy_version 10400 (0.0005) -[2023-07-07 22:35:29,262][754029] Fps is (10 sec: 8601.6, 60 sec: 9216.0, 300 sec: 9386.1). Total num frames: 5353472. Throughput: 0: 9283.4. Samples: 5339136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:35:29,262][754029] Avg episode reward: [(0, '81.159')] -[2023-07-07 22:35:30,427][754314] Updated weights for policy 0, policy_version 10480 (0.0005) -[2023-07-07 22:35:34,262][754029] Fps is (10 sec: 9011.1, 60 sec: 9216.0, 300 sec: 9372.2). Total num frames: 5398528. Throughput: 0: 9199.7. Samples: 5392224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:35:34,262][754029] Avg episode reward: [(0, '91.402')] -[2023-07-07 22:35:34,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010544_5398528.pth... -[2023-07-07 22:35:34,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010016_5128192.pth -[2023-07-07 22:35:34,973][754314] Updated weights for policy 0, policy_version 10560 (0.0005) -[2023-07-07 22:35:39,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9372.2). Total num frames: 5443584. Throughput: 0: 9151.4. Samples: 5419276. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-07-07 22:35:39,262][754029] Avg episode reward: [(0, '84.397')] -[2023-07-07 22:35:39,419][754314] Updated weights for policy 0, policy_version 10640 (0.0005) -[2023-07-07 22:35:43,650][754314] Updated weights for policy 0, policy_version 10720 (0.0005) -[2023-07-07 22:35:44,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9284.3, 300 sec: 9372.2). Total num frames: 5492736. Throughput: 0: 9157.3. Samples: 5476416. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:35:44,262][754029] Avg episode reward: [(0, '95.218')] -[2023-07-07 22:35:47,949][754314] Updated weights for policy 0, policy_version 10800 (0.0005) -[2023-07-07 22:35:49,262][754029] Fps is (10 sec: 9830.4, 60 sec: 9284.3, 300 sec: 9372.2). Total num frames: 5541888. Throughput: 0: 9139.6. Samples: 5533760. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:35:49,262][754029] Avg episode reward: [(0, '85.168')] -[2023-07-07 22:35:49,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010824_5541888.pth... -[2023-07-07 22:35:49,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010288_5267456.pth -[2023-07-07 22:35:52,418][754314] Updated weights for policy 0, policy_version 10880 (0.0005) -[2023-07-07 22:35:54,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9372.2). Total num frames: 5586944. Throughput: 0: 9157.5. Samples: 5561964. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:35:54,262][754029] Avg episode reward: [(0, '88.257')] -[2023-07-07 22:35:56,877][754314] Updated weights for policy 0, policy_version 10960 (0.0005) -[2023-07-07 22:35:59,262][754029] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9372.2). Total num frames: 5632000. Throughput: 0: 9136.4. Samples: 5616216. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:35:59,262][754029] Avg episode reward: [(0, '72.427')] -[2023-07-07 22:36:01,214][754314] Updated weights for policy 0, policy_version 11040 (0.0005) -[2023-07-07 22:36:04,262][754029] Fps is (10 sec: 9420.7, 60 sec: 9216.0, 300 sec: 9372.2). Total num frames: 5681152. Throughput: 0: 9193.2. Samples: 5673024. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:36:04,263][754029] Avg episode reward: [(0, '87.641')] -[2023-07-07 22:36:04,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011096_5681152.pth... -[2023-07-07 22:36:04,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010544_5398528.pth -[2023-07-07 22:36:05,536][754314] Updated weights for policy 0, policy_version 11120 (0.0005) -[2023-07-07 22:36:09,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9372.2). Total num frames: 5726208. Throughput: 0: 9226.5. Samples: 5701704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:36:09,262][754029] Avg episode reward: [(0, '87.431')] -[2023-07-07 22:36:09,865][754314] Updated weights for policy 0, policy_version 11200 (0.0005) -[2023-07-07 22:36:14,262][754029] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9358.3). Total num frames: 5771264. Throughput: 0: 9284.1. Samples: 5756920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:36:14,262][754029] Avg episode reward: [(0, '77.907')] -[2023-07-07 22:36:14,512][754314] Updated weights for policy 0, policy_version 11280 (0.0005) -[2023-07-07 22:36:19,057][754314] Updated weights for policy 0, policy_version 11360 (0.0005) -[2023-07-07 22:36:19,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9358.3). Total num frames: 5816320. Throughput: 0: 9302.3. Samples: 5810828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:36:19,262][754029] Avg episode reward: [(0, '79.695')] -[2023-07-07 22:36:19,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011360_5816320.pth... -[2023-07-07 22:36:19,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010824_5541888.pth -[2023-07-07 22:36:23,589][754314] Updated weights for policy 0, policy_version 11440 (0.0005) -[2023-07-07 22:36:24,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9358.3). Total num frames: 5861376. Throughput: 0: 9278.5. Samples: 5836808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:36:24,262][754029] Avg episode reward: [(0, '79.727')] -[2023-07-07 22:36:27,915][754314] Updated weights for policy 0, policy_version 11520 (0.0005) -[2023-07-07 22:36:29,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9284.3, 300 sec: 9372.2). Total num frames: 5910528. Throughput: 0: 9276.4. Samples: 5893852. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:36:29,262][754029] Avg episode reward: [(0, '91.626')] -[2023-07-07 22:36:32,428][754314] Updated weights for policy 0, policy_version 11600 (0.0005) -[2023-07-07 22:36:34,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9372.2). Total num frames: 5955584. Throughput: 0: 9200.7. Samples: 5947792. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:36:34,262][754029] Avg episode reward: [(0, '92.119')] -[2023-07-07 22:36:34,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011632_5955584.pth... -[2023-07-07 22:36:34,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011096_5681152.pth -[2023-07-07 22:36:36,866][754314] Updated weights for policy 0, policy_version 11680 (0.0005) -[2023-07-07 22:36:39,262][754029] Fps is (10 sec: 9011.1, 60 sec: 9284.3, 300 sec: 9358.3). Total num frames: 6000640. Throughput: 0: 9199.6. Samples: 5975948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:36:39,262][754029] Avg episode reward: [(0, '92.503')] -[2023-07-07 22:36:41,126][754314] Updated weights for policy 0, policy_version 11760 (0.0005) -[2023-07-07 22:36:44,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9358.3). Total num frames: 6049792. Throughput: 0: 9270.2. Samples: 6033376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:36:44,262][754029] Avg episode reward: [(0, '92.847')] -[2023-07-07 22:36:45,463][754314] Updated weights for policy 0, policy_version 11840 (0.0005) -[2023-07-07 22:36:49,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9358.3). Total num frames: 6094848. Throughput: 0: 9227.8. Samples: 6088276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:36:49,262][754029] Avg episode reward: [(0, '100.610')] -[2023-07-07 22:36:49,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011904_6094848.pth... -[2023-07-07 22:36:49,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011360_5816320.pth -[2023-07-07 22:36:50,116][754314] Updated weights for policy 0, policy_version 11920 (0.0005) -[2023-07-07 22:36:54,262][754029] Fps is (10 sec: 8601.6, 60 sec: 9147.7, 300 sec: 9330.6). Total num frames: 6135808. Throughput: 0: 9181.4. Samples: 6114864. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-07-07 22:36:54,262][754029] Avg episode reward: [(0, '93.563')] -[2023-07-07 22:36:54,728][754314] Updated weights for policy 0, policy_version 12000 (0.0005) -[2023-07-07 22:36:59,249][754314] Updated weights for policy 0, policy_version 12080 (0.0005) -[2023-07-07 22:36:59,262][754029] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9344.4). Total num frames: 6184960. Throughput: 0: 9147.9. Samples: 6168576. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-07-07 22:36:59,262][754029] Avg episode reward: [(0, '92.983')] -[2023-07-07 22:37:03,788][754314] Updated weights for policy 0, policy_version 12160 (0.0005) -[2023-07-07 22:37:04,262][754029] Fps is (10 sec: 9420.7, 60 sec: 9147.7, 300 sec: 9330.5). Total num frames: 6230016. Throughput: 0: 9140.8. Samples: 6222164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:37:04,262][754029] Avg episode reward: [(0, '88.390')] -[2023-07-07 22:37:04,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012168_6230016.pth... -[2023-07-07 22:37:04,267][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011632_5955584.pth -[2023-07-07 22:37:08,438][754314] Updated weights for policy 0, policy_version 12240 (0.0005) -[2023-07-07 22:37:09,262][754029] Fps is (10 sec: 8601.6, 60 sec: 9079.5, 300 sec: 9316.7). Total num frames: 6270976. Throughput: 0: 9165.1. Samples: 6249236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:37:09,262][754029] Avg episode reward: [(0, '84.390')] -[2023-07-07 22:37:13,066][754314] Updated weights for policy 0, policy_version 12320 (0.0005) -[2023-07-07 22:37:14,262][754029] Fps is (10 sec: 8601.6, 60 sec: 9079.5, 300 sec: 9316.7). Total num frames: 6316032. Throughput: 0: 9072.3. Samples: 6302104. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-07-07 22:37:14,262][754029] Avg episode reward: [(0, '83.675')] -[2023-07-07 22:37:17,641][754314] Updated weights for policy 0, policy_version 12400 (0.0004) -[2023-07-07 22:37:19,262][754029] Fps is (10 sec: 9011.1, 60 sec: 9079.5, 300 sec: 9316.7). Total num frames: 6361088. Throughput: 0: 9069.5. Samples: 6355920. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-07-07 22:37:19,262][754029] Avg episode reward: [(0, '85.983')] -[2023-07-07 22:37:19,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012424_6361088.pth... -[2023-07-07 22:37:19,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011904_6094848.pth -[2023-07-07 22:37:22,222][754314] Updated weights for policy 0, policy_version 12480 (0.0005) -[2023-07-07 22:37:24,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9316.7). Total num frames: 6406144. Throughput: 0: 9034.6. Samples: 6382504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:37:24,262][754029] Avg episode reward: [(0, '90.201')] -[2023-07-07 22:37:26,822][754314] Updated weights for policy 0, policy_version 12560 (0.0005) -[2023-07-07 22:37:29,262][754029] Fps is (10 sec: 9011.3, 60 sec: 9011.2, 300 sec: 9316.7). Total num frames: 6451200. Throughput: 0: 8944.8. Samples: 6435892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:37:29,262][754029] Avg episode reward: [(0, '82.498')] -[2023-07-07 22:37:31,342][754314] Updated weights for policy 0, policy_version 12640 (0.0005) -[2023-07-07 22:37:34,262][754029] Fps is (10 sec: 9011.1, 60 sec: 9011.2, 300 sec: 9302.8). Total num frames: 6496256. Throughput: 0: 8931.4. Samples: 6490188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:37:34,262][754029] Avg episode reward: [(0, '85.960')] -[2023-07-07 22:37:34,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012688_6496256.pth... -[2023-07-07 22:37:34,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012168_6230016.pth -[2023-07-07 22:37:35,958][754314] Updated weights for policy 0, policy_version 12720 (0.0005) -[2023-07-07 22:37:39,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9288.9). Total num frames: 6541312. Throughput: 0: 8930.7. Samples: 6516744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:37:39,262][754029] Avg episode reward: [(0, '98.031')] -[2023-07-07 22:37:40,379][754314] Updated weights for policy 0, policy_version 12800 (0.0005) -[2023-07-07 22:37:44,262][754029] Fps is (10 sec: 9011.3, 60 sec: 8942.9, 300 sec: 9275.0). Total num frames: 6586368. Throughput: 0: 8970.8. Samples: 6572260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:37:44,262][754029] Avg episode reward: [(0, '81.751')] -[2023-07-07 22:37:44,746][754314] Updated weights for policy 0, policy_version 12880 (0.0005) -[2023-07-07 22:37:49,262][754029] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9275.0). Total num frames: 6631424. Throughput: 0: 8999.5. Samples: 6627140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:37:49,262][754029] Avg episode reward: [(0, '90.977')] -[2023-07-07 22:37:49,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012952_6631424.pth... -[2023-07-07 22:37:49,267][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012424_6361088.pth -[2023-07-07 22:37:49,456][754314] Updated weights for policy 0, policy_version 12960 (0.0006) -[2023-07-07 22:37:53,838][754314] Updated weights for policy 0, policy_version 13040 (0.0005) -[2023-07-07 22:37:54,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9261.1). Total num frames: 6676480. Throughput: 0: 8973.6. Samples: 6653048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:37:54,262][754029] Avg episode reward: [(0, '74.546')] -[2023-07-07 22:37:58,427][754314] Updated weights for policy 0, policy_version 13120 (0.0005) -[2023-07-07 22:37:59,262][754029] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9247.2). Total num frames: 6721536. Throughput: 0: 9027.9. Samples: 6708360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:37:59,262][754029] Avg episode reward: [(0, '82.658')] -[2023-07-07 22:38:02,942][754314] Updated weights for policy 0, policy_version 13200 (0.0005) -[2023-07-07 22:38:04,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9011.2, 300 sec: 9261.1). Total num frames: 6770688. Throughput: 0: 9036.5. Samples: 6762560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:38:04,262][754029] Avg episode reward: [(0, '84.955')] -[2023-07-07 22:38:04,264][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013224_6770688.pth... -[2023-07-07 22:38:04,266][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012688_6496256.pth -[2023-07-07 22:38:07,373][754314] Updated weights for policy 0, policy_version 13280 (0.0005) -[2023-07-07 22:38:09,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9079.5, 300 sec: 9247.2). Total num frames: 6815744. Throughput: 0: 9067.1. Samples: 6790524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:38:09,262][754029] Avg episode reward: [(0, '76.038')] -[2023-07-07 22:38:11,691][754314] Updated weights for policy 0, policy_version 13360 (0.0004) -[2023-07-07 22:38:14,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9233.4). Total num frames: 6860800. Throughput: 0: 9128.3. Samples: 6846664. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:38:14,262][754029] Avg episode reward: [(0, '69.738')] -[2023-07-07 22:38:16,192][754314] Updated weights for policy 0, policy_version 13440 (0.0005) -[2023-07-07 22:38:19,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9219.5). Total num frames: 6905856. Throughput: 0: 9143.8. Samples: 6901660. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:38:19,262][754029] Avg episode reward: [(0, '78.358')] -[2023-07-07 22:38:19,264][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013488_6905856.pth... -[2023-07-07 22:38:19,266][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012952_6631424.pth -[2023-07-07 22:38:20,684][754314] Updated weights for policy 0, policy_version 13520 (0.0005) -[2023-07-07 22:38:24,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9219.5). Total num frames: 6950912. Throughput: 0: 9150.8. Samples: 6928528. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-07-07 22:38:24,262][754029] Avg episode reward: [(0, '81.993')] -[2023-07-07 22:38:25,253][754314] Updated weights for policy 0, policy_version 13600 (0.0005) -[2023-07-07 22:38:29,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9205.6). Total num frames: 6995968. Throughput: 0: 9116.0. Samples: 6982480. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-07-07 22:38:29,262][754029] Avg episode reward: [(0, '81.446')] -[2023-07-07 22:38:29,893][754314] Updated weights for policy 0, policy_version 13680 (0.0005) -[2023-07-07 22:38:34,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9191.7). Total num frames: 7041024. Throughput: 0: 9070.1. Samples: 7035296. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:38:34,262][754029] Avg episode reward: [(0, '82.243')] -[2023-07-07 22:38:34,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013752_7041024.pth... -[2023-07-07 22:38:34,267][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013224_6770688.pth -[2023-07-07 22:38:34,504][754314] Updated weights for policy 0, policy_version 13760 (0.0004) -[2023-07-07 22:38:39,118][754314] Updated weights for policy 0, policy_version 13840 (0.0003) -[2023-07-07 22:38:39,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9191.7). Total num frames: 7086080. Throughput: 0: 9083.0. Samples: 7061784. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:38:39,262][754029] Avg episode reward: [(0, '80.403')] -[2023-07-07 22:38:43,679][754314] Updated weights for policy 0, policy_version 13920 (0.0005) -[2023-07-07 22:38:44,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9177.8). Total num frames: 7131136. Throughput: 0: 9057.3. Samples: 7115940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:38:44,262][754029] Avg episode reward: [(0, '73.591')] -[2023-07-07 22:38:48,053][754314] Updated weights for policy 0, policy_version 14000 (0.0005) -[2023-07-07 22:38:49,262][754029] Fps is (10 sec: 9011.1, 60 sec: 9079.5, 300 sec: 9177.8). Total num frames: 7176192. Throughput: 0: 9086.0. Samples: 7171432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:38:49,262][754029] Avg episode reward: [(0, '73.267')] -[2023-07-07 22:38:49,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014016_7176192.pth... -[2023-07-07 22:38:49,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013488_6905856.pth -[2023-07-07 22:38:52,540][754314] Updated weights for policy 0, policy_version 14080 (0.0005) -[2023-07-07 22:38:54,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9177.8). Total num frames: 7221248. Throughput: 0: 9078.7. Samples: 7199064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:38:54,262][754029] Avg episode reward: [(0, '81.384')] -[2023-07-07 22:38:56,949][754314] Updated weights for policy 0, policy_version 14160 (0.0005) -[2023-07-07 22:38:59,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9191.7). Total num frames: 7270400. Throughput: 0: 9052.4. Samples: 7254024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:38:59,262][754029] Avg episode reward: [(0, '72.246')] -[2023-07-07 22:39:01,474][754314] Updated weights for policy 0, policy_version 14240 (0.0005) -[2023-07-07 22:39:04,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9079.5, 300 sec: 9177.8). Total num frames: 7315456. Throughput: 0: 9053.9. Samples: 7309084. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-07-07 22:39:04,262][754029] Avg episode reward: [(0, '76.806')] -[2023-07-07 22:39:04,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014288_7315456.pth... -[2023-07-07 22:39:04,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013752_7041024.pth -[2023-07-07 22:39:05,737][754314] Updated weights for policy 0, policy_version 14320 (0.0004) -[2023-07-07 22:39:09,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9177.8). Total num frames: 7360512. Throughput: 0: 9087.1. Samples: 7337448. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-07-07 22:39:09,262][754029] Avg episode reward: [(0, '85.851')] -[2023-07-07 22:39:10,219][754314] Updated weights for policy 0, policy_version 14400 (0.0005) -[2023-07-07 22:39:14,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9177.8). Total num frames: 7409664. Throughput: 0: 9131.1. Samples: 7393380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:39:14,262][754029] Avg episode reward: [(0, '76.720')] -[2023-07-07 22:39:14,543][754314] Updated weights for policy 0, policy_version 14480 (0.0005) -[2023-07-07 22:39:18,924][754314] Updated weights for policy 0, policy_version 14560 (0.0005) -[2023-07-07 22:39:19,262][754029] Fps is (10 sec: 9420.7, 60 sec: 9147.7, 300 sec: 9163.9). Total num frames: 7454720. Throughput: 0: 9219.4. Samples: 7450168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:39:19,262][754029] Avg episode reward: [(0, '76.713')] -[2023-07-07 22:39:19,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014560_7454720.pth... -[2023-07-07 22:39:19,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014016_7176192.pth -[2023-07-07 22:39:23,280][754314] Updated weights for policy 0, policy_version 14640 (0.0004) -[2023-07-07 22:39:24,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9163.9). Total num frames: 7503872. Throughput: 0: 9235.4. Samples: 7477376. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:39:24,262][754029] Avg episode reward: [(0, '80.886')] -[2023-07-07 22:39:27,663][754314] Updated weights for policy 0, policy_version 14720 (0.0005) -[2023-07-07 22:39:29,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9163.9). Total num frames: 7548928. Throughput: 0: 9291.9. Samples: 7534076. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:39:29,262][754029] Avg episode reward: [(0, '82.953')] -[2023-07-07 22:39:31,985][754314] Updated weights for policy 0, policy_version 14800 (0.0005) -[2023-07-07 22:39:34,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9177.8). Total num frames: 7598080. Throughput: 0: 9312.6. Samples: 7590500. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-07-07 22:39:34,262][754029] Avg episode reward: [(0, '79.146')] -[2023-07-07 22:39:34,272][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014840_7598080.pth... -[2023-07-07 22:39:34,275][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014288_7315456.pth -[2023-07-07 22:39:36,352][754314] Updated weights for policy 0, policy_version 14880 (0.0004) -[2023-07-07 22:39:39,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9177.8). Total num frames: 7643136. Throughput: 0: 9324.3. Samples: 7618656. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-07-07 22:39:39,262][754029] Avg episode reward: [(0, '86.483')] -[2023-07-07 22:39:40,809][754314] Updated weights for policy 0, policy_version 14960 (0.0005) -[2023-07-07 22:39:44,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9163.9). Total num frames: 7688192. Throughput: 0: 9326.8. Samples: 7673732. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-07-07 22:39:44,262][754029] Avg episode reward: [(0, '82.211')] -[2023-07-07 22:39:45,237][754314] Updated weights for policy 0, policy_version 15040 (0.0005) -[2023-07-07 22:39:49,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9163.9). Total num frames: 7737344. Throughput: 0: 9343.6. Samples: 7729544. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:39:49,262][754029] Avg episode reward: [(0, '71.536')] -[2023-07-07 22:39:49,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015112_7737344.pth... -[2023-07-07 22:39:49,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014560_7454720.pth -[2023-07-07 22:39:49,673][754314] Updated weights for policy 0, policy_version 15120 (0.0005) -[2023-07-07 22:39:54,024][754314] Updated weights for policy 0, policy_version 15200 (0.0005) -[2023-07-07 22:39:54,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9163.9). Total num frames: 7782400. Throughput: 0: 9343.1. Samples: 7757888. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:39:54,262][754029] Avg episode reward: [(0, '67.934')] -[2023-07-07 22:39:58,367][754314] Updated weights for policy 0, policy_version 15280 (0.0005) -[2023-07-07 22:39:59,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9163.9). Total num frames: 7831552. Throughput: 0: 9362.8. Samples: 7814704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:39:59,262][754029] Avg episode reward: [(0, '77.839')] -[2023-07-07 22:40:02,750][754314] Updated weights for policy 0, policy_version 15360 (0.0005) -[2023-07-07 22:40:04,262][754029] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9163.9). Total num frames: 7876608. Throughput: 0: 9342.6. Samples: 7870584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:40:04,262][754029] Avg episode reward: [(0, '79.383')] -[2023-07-07 22:40:04,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015384_7876608.pth... -[2023-07-07 22:40:04,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014840_7598080.pth -[2023-07-07 22:40:07,090][754314] Updated weights for policy 0, policy_version 15440 (0.0004) -[2023-07-07 22:40:09,262][754029] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9163.9). Total num frames: 7921664. Throughput: 0: 9359.5. Samples: 7898552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:40:09,262][754029] Avg episode reward: [(0, '77.873')] -[2023-07-07 22:40:11,639][754314] Updated weights for policy 0, policy_version 15520 (0.0005) -[2023-07-07 22:40:14,262][754029] Fps is (10 sec: 9011.3, 60 sec: 9284.3, 300 sec: 9150.0). Total num frames: 7966720. Throughput: 0: 9310.8. Samples: 7953064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:40:14,262][754029] Avg episode reward: [(0, '73.652')] -[2023-07-07 22:40:16,255][754314] Updated weights for policy 0, policy_version 15600 (0.0005) -[2023-07-07 22:40:19,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9163.9). Total num frames: 8011776. Throughput: 0: 9224.8. Samples: 8005616. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-07-07 22:40:19,262][754029] Avg episode reward: [(0, '69.737')] -[2023-07-07 22:40:19,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015648_8011776.pth... -[2023-07-07 22:40:19,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015112_7737344.pth -[2023-07-07 22:40:21,000][754314] Updated weights for policy 0, policy_version 15680 (0.0005) -[2023-07-07 22:40:24,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9163.9). Total num frames: 8056832. Throughput: 0: 9188.5. Samples: 8032140. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-07-07 22:40:24,262][754029] Avg episode reward: [(0, '77.247')] -[2023-07-07 22:40:25,639][754314] Updated weights for policy 0, policy_version 15760 (0.0005) -[2023-07-07 22:40:29,262][754029] Fps is (10 sec: 8601.6, 60 sec: 9147.7, 300 sec: 9150.0). Total num frames: 8097792. Throughput: 0: 9136.3. Samples: 8084864. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-07-07 22:40:29,262][754029] Avg episode reward: [(0, '78.113')] -[2023-07-07 22:40:30,325][754314] Updated weights for policy 0, policy_version 15840 (0.0005) -[2023-07-07 22:40:34,262][754029] Fps is (10 sec: 8601.6, 60 sec: 9079.5, 300 sec: 9150.0). Total num frames: 8142848. Throughput: 0: 9076.7. Samples: 8137996. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-07-07 22:40:34,262][754029] Avg episode reward: [(0, '73.365')] -[2023-07-07 22:40:34,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015904_8142848.pth... -[2023-07-07 22:40:34,267][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015384_7876608.pth -[2023-07-07 22:40:34,920][754314] Updated weights for policy 0, policy_version 15920 (0.0005) -[2023-07-07 22:40:39,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9136.2). Total num frames: 8187904. Throughput: 0: 9025.9. Samples: 8164052. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-07-07 22:40:39,262][754029] Avg episode reward: [(0, '76.082')] -[2023-07-07 22:40:39,486][754314] Updated weights for policy 0, policy_version 16000 (0.0005) -[2023-07-07 22:40:44,138][754314] Updated weights for policy 0, policy_version 16080 (0.0005) -[2023-07-07 22:40:44,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9122.3). Total num frames: 8232960. Throughput: 0: 8953.3. Samples: 8217600. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-07-07 22:40:44,262][754029] Avg episode reward: [(0, '74.544')] -[2023-07-07 22:40:48,682][754314] Updated weights for policy 0, policy_version 16160 (0.0004) -[2023-07-07 22:40:49,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9122.3). Total num frames: 8278016. Throughput: 0: 8905.3. Samples: 8271324. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-07-07 22:40:49,262][754029] Avg episode reward: [(0, '76.793')] -[2023-07-07 22:40:49,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016168_8278016.pth... -[2023-07-07 22:40:49,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015648_8011776.pth -[2023-07-07 22:40:53,312][754314] Updated weights for policy 0, policy_version 16240 (0.0005) -[2023-07-07 22:40:54,262][754029] Fps is (10 sec: 9011.1, 60 sec: 9011.2, 300 sec: 9122.3). Total num frames: 8323072. Throughput: 0: 8887.6. Samples: 8298496. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-07-07 22:40:54,262][754029] Avg episode reward: [(0, '75.869')] -[2023-07-07 22:40:57,942][754314] Updated weights for policy 0, policy_version 16320 (0.0004) -[2023-07-07 22:40:59,262][754029] Fps is (10 sec: 8601.6, 60 sec: 8874.7, 300 sec: 9094.5). Total num frames: 8364032. Throughput: 0: 8859.0. Samples: 8351720. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-07-07 22:40:59,262][754029] Avg episode reward: [(0, '69.599')] -[2023-07-07 22:41:02,589][754314] Updated weights for policy 0, policy_version 16400 (0.0004) -[2023-07-07 22:41:04,262][754029] Fps is (10 sec: 8601.6, 60 sec: 8874.7, 300 sec: 9094.5). Total num frames: 8409088. Throughput: 0: 8860.7. Samples: 8404348. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-07-07 22:41:04,262][754029] Avg episode reward: [(0, '72.830')] -[2023-07-07 22:41:04,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016424_8409088.pth... -[2023-07-07 22:41:04,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015904_8142848.pth -[2023-07-07 22:41:07,080][754314] Updated weights for policy 0, policy_version 16480 (0.0004) -[2023-07-07 22:41:09,262][754029] Fps is (10 sec: 9011.2, 60 sec: 8874.7, 300 sec: 9094.5). Total num frames: 8454144. Throughput: 0: 8871.9. Samples: 8431376. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:41:09,262][754029] Avg episode reward: [(0, '76.282')] -[2023-07-07 22:41:11,413][754314] Updated weights for policy 0, policy_version 16560 (0.0004) -[2023-07-07 22:41:14,262][754029] Fps is (10 sec: 9420.8, 60 sec: 8942.9, 300 sec: 9108.4). Total num frames: 8503296. Throughput: 0: 8954.4. Samples: 8487812. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-07-07 22:41:14,262][754029] Avg episode reward: [(0, '69.618')] -[2023-07-07 22:41:15,836][754314] Updated weights for policy 0, policy_version 16640 (0.0005) -[2023-07-07 22:41:19,262][754029] Fps is (10 sec: 9420.8, 60 sec: 8942.9, 300 sec: 9108.4). Total num frames: 8548352. Throughput: 0: 8989.4. Samples: 8542520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:41:19,262][754029] Avg episode reward: [(0, '66.598')] -[2023-07-07 22:41:19,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016696_8548352.pth... -[2023-07-07 22:41:19,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016168_8278016.pth -[2023-07-07 22:41:20,526][754314] Updated weights for policy 0, policy_version 16720 (0.0005) -[2023-07-07 22:41:24,262][754029] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9094.5). Total num frames: 8593408. Throughput: 0: 8993.1. Samples: 8568740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:41:24,262][754029] Avg episode reward: [(0, '72.301')] -[2023-07-07 22:41:25,182][754314] Updated weights for policy 0, policy_version 16800 (0.0005) -[2023-07-07 22:41:29,262][754029] Fps is (10 sec: 8601.6, 60 sec: 8942.9, 300 sec: 9080.6). Total num frames: 8634368. Throughput: 0: 8977.5. Samples: 8621588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:41:29,262][754029] Avg episode reward: [(0, '69.761')] -[2023-07-07 22:41:29,813][754314] Updated weights for policy 0, policy_version 16880 (0.0005) -[2023-07-07 22:41:34,262][754029] Fps is (10 sec: 8601.5, 60 sec: 8942.9, 300 sec: 9080.6). Total num frames: 8679424. Throughput: 0: 8956.7. Samples: 8674376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:41:34,262][754029] Avg episode reward: [(0, '78.368')] -[2023-07-07 22:41:34,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016952_8679424.pth... -[2023-07-07 22:41:34,269][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016424_8409088.pth -[2023-07-07 22:41:34,454][754314] Updated weights for policy 0, policy_version 16960 (0.0005) -[2023-07-07 22:41:39,089][754314] Updated weights for policy 0, policy_version 17040 (0.0005) -[2023-07-07 22:41:39,262][754029] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9066.7). Total num frames: 8724480. Throughput: 0: 8921.8. Samples: 8699976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:41:39,262][754029] Avg episode reward: [(0, '71.872')] -[2023-07-07 22:41:43,732][754314] Updated weights for policy 0, policy_version 17120 (0.0004) -[2023-07-07 22:41:44,262][754029] Fps is (10 sec: 9011.3, 60 sec: 8942.9, 300 sec: 9066.7). Total num frames: 8769536. Throughput: 0: 8924.5. Samples: 8753320. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:41:44,262][754029] Avg episode reward: [(0, '76.996')] -[2023-07-07 22:41:48,414][754314] Updated weights for policy 0, policy_version 17200 (0.0004) -[2023-07-07 22:41:49,262][754029] Fps is (10 sec: 8601.7, 60 sec: 8874.7, 300 sec: 9066.7). Total num frames: 8810496. Throughput: 0: 8934.9. Samples: 8806416. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:41:49,262][754029] Avg episode reward: [(0, '86.343')] -[2023-07-07 22:41:49,316][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017216_8814592.pth... -[2023-07-07 22:41:49,318][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016696_8548352.pth -[2023-07-07 22:41:53,026][754314] Updated weights for policy 0, policy_version 17280 (0.0004) -[2023-07-07 22:41:54,262][754029] Fps is (10 sec: 8601.6, 60 sec: 8874.7, 300 sec: 9052.9). Total num frames: 8855552. Throughput: 0: 8927.7. Samples: 8833124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:41:54,262][754029] Avg episode reward: [(0, '74.968')] -[2023-07-07 22:41:57,581][754314] Updated weights for policy 0, policy_version 17360 (0.0003) -[2023-07-07 22:41:59,262][754029] Fps is (10 sec: 9011.1, 60 sec: 8942.9, 300 sec: 9052.9). Total num frames: 8900608. Throughput: 0: 8872.3. Samples: 8887064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:41:59,262][754029] Avg episode reward: [(0, '75.000')] -[2023-07-07 22:42:02,208][754314] Updated weights for policy 0, policy_version 17440 (0.0005) -[2023-07-07 22:42:04,262][754029] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9066.7). Total num frames: 8945664. Throughput: 0: 8845.4. Samples: 8940564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:42:04,262][754029] Avg episode reward: [(0, '72.847')] -[2023-07-07 22:42:04,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017472_8945664.pth... -[2023-07-07 22:42:04,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016952_8679424.pth -[2023-07-07 22:42:06,676][754314] Updated weights for policy 0, policy_version 17520 (0.0005) -[2023-07-07 22:42:09,262][754029] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9066.7). Total num frames: 8990720. Throughput: 0: 8863.6. Samples: 8967600. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-07-07 22:42:09,262][754029] Avg episode reward: [(0, '79.792')] -[2023-07-07 22:42:11,087][754314] Updated weights for policy 0, policy_version 17600 (0.0005) -[2023-07-07 22:42:14,262][754029] Fps is (10 sec: 9420.8, 60 sec: 8942.9, 300 sec: 9080.6). Total num frames: 9039872. Throughput: 0: 8932.7. Samples: 9023560. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-07-07 22:42:14,262][754029] Avg episode reward: [(0, '74.309')] -[2023-07-07 22:42:15,488][754314] Updated weights for policy 0, policy_version 17680 (0.0005) -[2023-07-07 22:42:19,262][754029] Fps is (10 sec: 9420.7, 60 sec: 8942.9, 300 sec: 9080.6). Total num frames: 9084928. Throughput: 0: 8958.7. Samples: 9077516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:42:19,262][754029] Avg episode reward: [(0, '87.906')] -[2023-07-07 22:42:19,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017744_9084928.pth... -[2023-07-07 22:42:19,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017216_8814592.pth -[2023-07-07 22:42:20,122][754314] Updated weights for policy 0, policy_version 17760 (0.0005) -[2023-07-07 22:42:24,262][754029] Fps is (10 sec: 8601.7, 60 sec: 8874.7, 300 sec: 9066.7). Total num frames: 9125888. Throughput: 0: 9000.8. Samples: 9105012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:42:24,262][754029] Avg episode reward: [(0, '76.921')] -[2023-07-07 22:42:24,770][754314] Updated weights for policy 0, policy_version 17840 (0.0005) -[2023-07-07 22:42:29,262][754029] Fps is (10 sec: 8601.7, 60 sec: 8942.9, 300 sec: 9066.7). Total num frames: 9170944. Throughput: 0: 8975.6. Samples: 9157224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:42:29,262][754029] Avg episode reward: [(0, '79.499')] -[2023-07-07 22:42:29,472][754314] Updated weights for policy 0, policy_version 17920 (0.0005) -[2023-07-07 22:42:33,946][754314] Updated weights for policy 0, policy_version 18000 (0.0005) -[2023-07-07 22:42:34,262][754029] Fps is (10 sec: 9011.1, 60 sec: 8942.9, 300 sec: 9066.7). Total num frames: 9216000. Throughput: 0: 8981.0. Samples: 9210560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:42:34,262][754029] Avg episode reward: [(0, '75.600')] -[2023-07-07 22:42:34,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018000_9216000.pth... -[2023-07-07 22:42:34,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017472_8945664.pth -[2023-07-07 22:42:38,307][754314] Updated weights for policy 0, policy_version 18080 (0.0005) -[2023-07-07 22:42:39,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9011.2, 300 sec: 9080.6). Total num frames: 9265152. Throughput: 0: 9015.3. Samples: 9238812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:42:39,262][754029] Avg episode reward: [(0, '79.340')] -[2023-07-07 22:42:42,764][754314] Updated weights for policy 0, policy_version 18160 (0.0005) -[2023-07-07 22:42:44,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9011.2, 300 sec: 9080.6). Total num frames: 9310208. Throughput: 0: 9044.9. Samples: 9294084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:42:44,262][754029] Avg episode reward: [(0, '74.653')] -[2023-07-07 22:42:47,370][754314] Updated weights for policy 0, policy_version 18240 (0.0005) -[2023-07-07 22:42:49,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9080.6). Total num frames: 9355264. Throughput: 0: 9059.0. Samples: 9348220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:42:49,262][754029] Avg episode reward: [(0, '81.672')] -[2023-07-07 22:42:49,266][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018272_9355264.pth... -[2023-07-07 22:42:49,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017744_9084928.pth -[2023-07-07 22:42:51,687][754314] Updated weights for policy 0, policy_version 18320 (0.0005) -[2023-07-07 22:42:54,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9094.5). Total num frames: 9404416. Throughput: 0: 9098.0. Samples: 9377008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:42:54,262][754029] Avg episode reward: [(0, '78.773')] -[2023-07-07 22:42:55,983][754314] Updated weights for policy 0, policy_version 18400 (0.0004) -[2023-07-07 22:42:59,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9080.6). Total num frames: 9449472. Throughput: 0: 9117.6. Samples: 9433852. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:42:59,262][754029] Avg episode reward: [(0, '72.406')] -[2023-07-07 22:43:00,305][754314] Updated weights for policy 0, policy_version 18480 (0.0005) -[2023-07-07 22:43:04,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9094.5). Total num frames: 9498624. Throughput: 0: 9194.0. Samples: 9491244. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:43:04,262][754029] Avg episode reward: [(0, '82.494')] -[2023-07-07 22:43:04,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018552_9498624.pth... -[2023-07-07 22:43:04,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018000_9216000.pth -[2023-07-07 22:43:04,584][754314] Updated weights for policy 0, policy_version 18560 (0.0005) -[2023-07-07 22:43:08,937][754314] Updated weights for policy 0, policy_version 18640 (0.0005) -[2023-07-07 22:43:09,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9094.5). Total num frames: 9543680. Throughput: 0: 9217.7. Samples: 9519808. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:43:09,262][754029] Avg episode reward: [(0, '85.980')] -[2023-07-07 22:43:13,495][754314] Updated weights for policy 0, policy_version 18720 (0.0005) -[2023-07-07 22:43:14,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9094.5). Total num frames: 9588736. Throughput: 0: 9272.2. Samples: 9574472. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:43:14,262][754029] Avg episode reward: [(0, '76.244')] -[2023-07-07 22:43:18,032][754314] Updated weights for policy 0, policy_version 18800 (0.0005) -[2023-07-07 22:43:19,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9094.5). Total num frames: 9633792. Throughput: 0: 9302.3. Samples: 9629164. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-07-07 22:43:19,262][754029] Avg episode reward: [(0, '73.196')] -[2023-07-07 22:43:19,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018816_9633792.pth... -[2023-07-07 22:43:19,268][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018272_9355264.pth -[2023-07-07 22:43:22,444][754314] Updated weights for policy 0, policy_version 18880 (0.0005) -[2023-07-07 22:43:24,262][754029] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9108.4). Total num frames: 9682944. Throughput: 0: 9302.6. Samples: 9657428. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:43:24,262][754029] Avg episode reward: [(0, '69.183')] -[2023-07-07 22:43:26,944][754314] Updated weights for policy 0, policy_version 18960 (0.0005) -[2023-07-07 22:43:29,262][754029] Fps is (10 sec: 9420.9, 60 sec: 9284.3, 300 sec: 9108.4). Total num frames: 9728000. Throughput: 0: 9278.8. Samples: 9711628. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:43:29,262][754029] Avg episode reward: [(0, '76.872')] -[2023-07-07 22:43:31,516][754314] Updated weights for policy 0, policy_version 19040 (0.0005) -[2023-07-07 22:43:34,262][754029] Fps is (10 sec: 8601.6, 60 sec: 9216.0, 300 sec: 9094.5). Total num frames: 9768960. Throughput: 0: 9260.4. Samples: 9764940. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-07-07 22:43:34,262][754029] Avg episode reward: [(0, '80.427')] -[2023-07-07 22:43:34,308][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019088_9773056.pth... -[2023-07-07 22:43:34,310][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018552_9498624.pth -[2023-07-07 22:43:36,232][754314] Updated weights for policy 0, policy_version 19120 (0.0006) -[2023-07-07 22:43:39,262][754029] Fps is (10 sec: 8601.6, 60 sec: 9147.7, 300 sec: 9094.5). Total num frames: 9814016. Throughput: 0: 9195.5. Samples: 9790804. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-07-07 22:43:39,262][754029] Avg episode reward: [(0, '78.822')] -[2023-07-07 22:43:40,802][754314] Updated weights for policy 0, policy_version 19200 (0.0006) -[2023-07-07 22:43:44,262][754029] Fps is (10 sec: 9011.3, 60 sec: 9147.7, 300 sec: 9094.5). Total num frames: 9859072. Throughput: 0: 9122.1. Samples: 9844344. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-07-07 22:43:44,262][754029] Avg episode reward: [(0, '76.677')] -[2023-07-07 22:43:45,404][754314] Updated weights for policy 0, policy_version 19280 (0.0005) -[2023-07-07 22:43:49,262][754029] Fps is (10 sec: 9011.1, 60 sec: 9147.7, 300 sec: 9094.5). Total num frames: 9904128. Throughput: 0: 9015.2. Samples: 9896928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:43:49,262][754029] Avg episode reward: [(0, '63.074')] -[2023-07-07 22:43:49,265][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019344_9904128.pth... -[2023-07-07 22:43:49,267][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018816_9633792.pth -[2023-07-07 22:43:50,072][754314] Updated weights for policy 0, policy_version 19360 (0.0005) -[2023-07-07 22:43:54,262][754029] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9080.6). Total num frames: 9949184. Throughput: 0: 8996.0. Samples: 9924628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:43:54,262][754029] Avg episode reward: [(0, '75.492')] -[2023-07-07 22:43:54,505][754314] Updated weights for policy 0, policy_version 19440 (0.0005) -[2023-07-07 22:43:58,961][754314] Updated weights for policy 0, policy_version 19520 (0.0005) -[2023-07-07 22:43:59,262][754029] Fps is (10 sec: 9011.3, 60 sec: 9079.5, 300 sec: 9080.6). Total num frames: 9994240. Throughput: 0: 8997.7. Samples: 9979368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-07-07 22:43:59,262][754029] Avg episode reward: [(0, '71.321')] -[2023-07-07 22:44:00,328][754270] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 -[2023-07-07 22:44:00,329][754317] Stopping RolloutWorker_w2... -[2023-07-07 22:44:00,329][754446] Stopping RolloutWorker_w6... -[2023-07-07 22:44:00,329][754319] Stopping RolloutWorker_w4... -[2023-07-07 22:44:00,329][754318] Stopping RolloutWorker_w3... -[2023-07-07 22:44:00,329][754317] Loop rollout_proc2_evt_loop terminating... -[2023-07-07 22:44:00,329][754446] Loop rollout_proc6_evt_loop terminating... -[2023-07-07 22:44:00,329][754315] Stopping RolloutWorker_w0... -[2023-07-07 22:44:00,329][754414] Stopping RolloutWorker_w7... -[2023-07-07 22:44:00,329][754351] Stopping RolloutWorker_w5... -[2023-07-07 22:44:00,329][754316] Stopping RolloutWorker_w1... -[2023-07-07 22:44:00,329][754319] Loop rollout_proc4_evt_loop terminating... -[2023-07-07 22:44:00,329][754315] Loop rollout_proc0_evt_loop terminating... -[2023-07-07 22:44:00,329][754414] Loop rollout_proc7_evt_loop terminating... -[2023-07-07 22:44:00,329][754318] Loop rollout_proc3_evt_loop terminating... -[2023-07-07 22:44:00,329][754351] Loop rollout_proc5_evt_loop terminating... -[2023-07-07 22:44:00,329][754316] Loop rollout_proc1_evt_loop terminating... -[2023-07-07 22:44:00,329][754029] Component RolloutWorker_w2 stopped! -[2023-07-07 22:44:00,329][754270] Stopping Batcher_0... -[2023-07-07 22:44:00,329][754029] Component RolloutWorker_w6 stopped! -[2023-07-07 22:44:00,329][754270] Loop batcher_evt_loop terminating... -[2023-07-07 22:44:00,330][754029] Component RolloutWorker_w3 stopped! -[2023-07-07 22:44:00,330][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019544_10006528.pth... -[2023-07-07 22:44:00,330][754029] Component RolloutWorker_w4 stopped! -[2023-07-07 22:44:00,330][754029] Component RolloutWorker_w0 stopped! -[2023-07-07 22:44:00,330][754029] Component RolloutWorker_w7 stopped! -[2023-07-07 22:44:00,331][754029] Component RolloutWorker_w5 stopped! -[2023-07-07 22:44:00,331][754029] Component RolloutWorker_w1 stopped! -[2023-07-07 22:44:00,331][754029] Component Batcher_0 stopped! -[2023-07-07 22:44:00,332][754270] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019088_9773056.pth -[2023-07-07 22:44:00,333][754270] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019544_10006528.pth... -[2023-07-07 22:44:00,335][754270] Stopping LearnerWorker_p0... -[2023-07-07 22:44:00,335][754270] Loop learner_proc0_evt_loop terminating... -[2023-07-07 22:44:00,335][754029] Component LearnerWorker_p0 stopped! -[2023-07-07 22:44:00,395][754314] Weights refcount: 2 0 -[2023-07-07 22:44:00,396][754314] Stopping InferenceWorker_p0-w0... -[2023-07-07 22:44:00,396][754314] Loop inference_proc0-0_evt_loop terminating... -[2023-07-07 22:44:00,396][754029] Component InferenceWorker_p0-w0 stopped! -[2023-07-07 22:44:00,397][754029] Waiting for process learner_proc0 to stop... -[2023-07-07 22:44:00,938][754029] Waiting for process inference_proc0-0 to join... -[2023-07-07 22:44:00,964][754029] Waiting for process rollout_proc0 to join... -[2023-07-07 22:44:00,964][754029] Waiting for process rollout_proc1 to join... -[2023-07-07 22:44:00,965][754029] Waiting for process rollout_proc2 to join... -[2023-07-07 22:44:00,965][754029] Waiting for process rollout_proc3 to join... -[2023-07-07 22:44:00,965][754029] Waiting for process rollout_proc4 to join... -[2023-07-07 22:44:00,965][754029] Waiting for process rollout_proc5 to join... -[2023-07-07 22:44:00,965][754029] Waiting for process rollout_proc6 to join... -[2023-07-07 22:44:00,965][754029] Waiting for process rollout_proc7 to join... -[2023-07-07 22:44:00,966][754029] Batcher 0 profile tree view: -batching: 1.8589, releasing_batches: 1.6651 -[2023-07-07 22:44:00,966][754029] InferenceWorker_p0-w0 profile tree view: -wait_policy: 0.0051 - wait_policy_total: 420.7678 -update_model: 13.0370 +[2023-07-08 15:05:52,566][994609] Worker 2 uses CPU cores [8, 9, 10, 11] +[2023-07-08 15:05:52,611][994562] Using optimizer +[2023-07-08 15:05:52,612][994562] No checkpoints found +[2023-07-08 15:05:52,612][994562] Did not load from checkpoint, starting from scratch! +[2023-07-08 15:05:52,612][994562] Initialized policy 0 weights for model version 0 +[2023-07-08 15:05:52,613][994562] LearnerWorker_p0 finished initialization! +[2023-07-08 15:05:52,614][994606] RunningMeanStd input shape: (39,) +[2023-07-08 15:05:52,615][994606] RunningMeanStd input shape: (1,) +[2023-07-08 15:05:52,671][994321] Inference worker 0-0 is ready! +[2023-07-08 15:05:52,672][994321] All inference workers are ready! Signal rollout workers to start! +[2023-07-08 15:05:56,534][994321] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) +[2023-07-08 15:05:57,063][994738] Decorrelating experience for 0 frames... +[2023-07-08 15:05:57,080][994738] Decorrelating experience for 64 frames... +[2023-07-08 15:05:57,082][994607] Decorrelating experience for 0 frames... +[2023-07-08 15:05:57,092][994610] Decorrelating experience for 0 frames... +[2023-07-08 15:05:57,094][994611] Decorrelating experience for 0 frames... +[2023-07-08 15:05:57,095][994662] Decorrelating experience for 0 frames... +[2023-07-08 15:05:57,098][994607] Decorrelating experience for 64 frames... +[2023-07-08 15:05:57,108][994610] Decorrelating experience for 64 frames... +[2023-07-08 15:05:57,109][994611] Decorrelating experience for 64 frames... +[2023-07-08 15:05:57,111][994662] Decorrelating experience for 64 frames... +[2023-07-08 15:05:57,113][994608] Decorrelating experience for 0 frames... +[2023-07-08 15:05:57,123][994738] Decorrelating experience for 128 frames... +[2023-07-08 15:05:57,125][994724] Decorrelating experience for 0 frames... +[2023-07-08 15:05:57,129][994608] Decorrelating experience for 64 frames... +[2023-07-08 15:05:57,140][994724] Decorrelating experience for 64 frames... +[2023-07-08 15:05:57,141][994607] Decorrelating experience for 128 frames... +[2023-07-08 15:05:57,150][994610] Decorrelating experience for 128 frames... +[2023-07-08 15:05:57,152][994611] Decorrelating experience for 128 frames... +[2023-07-08 15:05:57,154][994662] Decorrelating experience for 128 frames... +[2023-07-08 15:05:57,171][994608] Decorrelating experience for 128 frames... +[2023-07-08 15:05:57,184][994724] Decorrelating experience for 128 frames... +[2023-07-08 15:05:57,207][994738] Decorrelating experience for 192 frames... +[2023-07-08 15:05:57,226][994607] Decorrelating experience for 192 frames... +[2023-07-08 15:05:57,235][994610] Decorrelating experience for 192 frames... +[2023-07-08 15:05:57,237][994611] Decorrelating experience for 192 frames... +[2023-07-08 15:05:57,238][994662] Decorrelating experience for 192 frames... +[2023-07-08 15:05:57,256][994608] Decorrelating experience for 192 frames... +[2023-07-08 15:05:57,270][994724] Decorrelating experience for 192 frames... +[2023-07-08 15:05:57,612][994609] Decorrelating experience for 0 frames... +[2023-07-08 15:05:57,628][994609] Decorrelating experience for 64 frames... +[2023-07-08 15:05:57,671][994609] Decorrelating experience for 128 frames... +[2023-07-08 15:05:57,758][994609] Decorrelating experience for 192 frames... +[2023-07-08 15:06:01,533][994738] Decorrelating experience for 256 frames... +[2023-07-08 15:06:01,534][994321] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) +[2023-07-08 15:06:01,535][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000000_0.pth... +[2023-07-08 15:06:01,552][994610] Decorrelating experience for 256 frames... +[2023-07-08 15:06:01,604][994662] Decorrelating experience for 256 frames... +[2023-07-08 15:06:01,613][994611] Decorrelating experience for 256 frames... +[2023-07-08 15:06:01,638][994608] Decorrelating experience for 256 frames... +[2023-07-08 15:06:01,641][994724] Decorrelating experience for 256 frames... +[2023-07-08 15:06:01,688][994738] Decorrelating experience for 320 frames... +[2023-07-08 15:06:01,706][994610] Decorrelating experience for 320 frames... +[2023-07-08 15:06:01,759][994662] Decorrelating experience for 320 frames... +[2023-07-08 15:06:01,769][994611] Decorrelating experience for 320 frames... +[2023-07-08 15:06:01,794][994608] Decorrelating experience for 320 frames... +[2023-07-08 15:06:01,797][994724] Decorrelating experience for 320 frames... +[2023-07-08 15:06:01,886][994738] Decorrelating experience for 384 frames... +[2023-07-08 15:06:01,904][994610] Decorrelating experience for 384 frames... +[2023-07-08 15:06:01,958][994662] Decorrelating experience for 384 frames... +[2023-07-08 15:06:01,967][994611] Decorrelating experience for 384 frames... +[2023-07-08 15:06:01,993][994608] Decorrelating experience for 384 frames... +[2023-07-08 15:06:01,995][994724] Decorrelating experience for 384 frames... +[2023-07-08 15:06:02,083][994609] Decorrelating experience for 256 frames... +[2023-07-08 15:06:02,109][994738] Decorrelating experience for 448 frames... +[2023-07-08 15:06:02,126][994610] Decorrelating experience for 448 frames... +[2023-07-08 15:06:02,181][994662] Decorrelating experience for 448 frames... +[2023-07-08 15:06:02,190][994611] Decorrelating experience for 448 frames... +[2023-07-08 15:06:02,218][994608] Decorrelating experience for 448 frames... +[2023-07-08 15:06:02,218][994724] Decorrelating experience for 448 frames... +[2023-07-08 15:06:02,237][994609] Decorrelating experience for 320 frames... +[2023-07-08 15:06:02,311][994607] Decorrelating experience for 256 frames... +[2023-07-08 15:06:02,433][994609] Decorrelating experience for 384 frames... +[2023-07-08 15:06:02,526][994607] Decorrelating experience for 320 frames... +[2023-07-08 15:06:02,656][994609] Decorrelating experience for 448 frames... +[2023-07-08 15:06:02,760][994607] Decorrelating experience for 384 frames... +[2023-07-08 15:06:03,042][994607] Decorrelating experience for 448 frames... +[2023-07-08 15:06:06,534][994321] Fps is (10 sec: 2457.6, 60 sec: 2457.6, 300 sec: 2457.6). Total num frames: 24576. Throughput: 0: 875.6. Samples: 8756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:06:06,534][994321] Avg episode reward: [(0, '5.130')] +[2023-07-08 15:06:08,435][994606] Updated weights for policy 0, policy_version 80 (0.0005) +[2023-07-08 15:06:09,445][994321] Heartbeat connected on Batcher_0 +[2023-07-08 15:06:09,453][994321] Heartbeat connected on RolloutWorker_w0 +[2023-07-08 15:06:09,455][994321] Heartbeat connected on RolloutWorker_w1 +[2023-07-08 15:06:09,456][994321] Heartbeat connected on LearnerWorker_p0 +[2023-07-08 15:06:09,457][994321] Heartbeat connected on RolloutWorker_w2 +[2023-07-08 15:06:09,459][994321] Heartbeat connected on RolloutWorker_w3 +[2023-07-08 15:06:09,460][994321] Heartbeat connected on InferenceWorker_p0-w0 +[2023-07-08 15:06:09,469][994321] Heartbeat connected on RolloutWorker_w6 +[2023-07-08 15:06:09,470][994321] Heartbeat connected on RolloutWorker_w5 +[2023-07-08 15:06:09,472][994321] Heartbeat connected on RolloutWorker_w7 +[2023-07-08 15:06:09,473][994321] Heartbeat connected on RolloutWorker_w4 +[2023-07-08 15:06:11,534][994321] Fps is (10 sec: 6144.1, 60 sec: 4096.0, 300 sec: 4096.0). Total num frames: 61440. Throughput: 0: 3924.0. Samples: 58860. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 15:06:11,534][994321] Avg episode reward: [(0, '8.171')] +[2023-07-08 15:06:13,486][994606] Updated weights for policy 0, policy_version 160 (0.0005) +[2023-07-08 15:06:16,534][994321] Fps is (10 sec: 7782.4, 60 sec: 5120.0, 300 sec: 5120.0). Total num frames: 102400. Throughput: 0: 4203.4. Samples: 84068. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:06:16,534][994321] Avg episode reward: [(0, '9.267')] +[2023-07-08 15:06:16,569][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000208_106496.pth... +[2023-07-08 15:06:16,572][994562] Saving new best policy, reward=9.267! +[2023-07-08 15:06:18,600][994606] Updated weights for policy 0, policy_version 240 (0.0005) +[2023-07-08 15:06:21,534][994321] Fps is (10 sec: 8192.0, 60 sec: 5734.4, 300 sec: 5734.4). Total num frames: 143360. Throughput: 0: 5321.2. Samples: 133028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:06:21,534][994321] Avg episode reward: [(0, '9.860')] +[2023-07-08 15:06:21,544][994562] Saving new best policy, reward=9.860! +[2023-07-08 15:06:23,490][994606] Updated weights for policy 0, policy_version 320 (0.0006) +[2023-07-08 15:06:26,534][994321] Fps is (10 sec: 8601.6, 60 sec: 6280.6, 300 sec: 6280.6). Total num frames: 188416. Throughput: 0: 6120.6. Samples: 183616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:06:26,534][994321] Avg episode reward: [(0, '9.882')] +[2023-07-08 15:06:26,535][994562] Saving new best policy, reward=9.882! +[2023-07-08 15:06:28,309][994606] Updated weights for policy 0, policy_version 400 (0.0006) +[2023-07-08 15:06:31,534][994321] Fps is (10 sec: 8601.5, 60 sec: 6553.6, 300 sec: 6553.6). Total num frames: 229376. Throughput: 0: 5966.3. Samples: 208820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:06:31,534][994321] Avg episode reward: [(0, '9.997')] +[2023-07-08 15:06:31,536][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000448_229376.pth... +[2023-07-08 15:06:31,616][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000000_0.pth +[2023-07-08 15:06:31,616][994562] Saving new best policy, reward=9.997! +[2023-07-08 15:06:33,289][994606] Updated weights for policy 0, policy_version 480 (0.0005) +[2023-07-08 15:06:36,534][994321] Fps is (10 sec: 8601.6, 60 sec: 6860.8, 300 sec: 6860.8). Total num frames: 274432. Throughput: 0: 6451.5. Samples: 258060. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:06:36,534][994321] Avg episode reward: [(0, '10.055')] +[2023-07-08 15:06:36,535][994562] Saving new best policy, reward=10.055! +[2023-07-08 15:06:37,864][994606] Updated weights for policy 0, policy_version 560 (0.0005) +[2023-07-08 15:06:41,534][994321] Fps is (10 sec: 9011.3, 60 sec: 7099.8, 300 sec: 7099.8). Total num frames: 319488. Throughput: 0: 6940.6. Samples: 312328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:06:41,534][994321] Avg episode reward: [(0, '10.067')] +[2023-07-08 15:06:41,535][994562] Saving new best policy, reward=10.067! +[2023-07-08 15:06:42,451][994606] Updated weights for policy 0, policy_version 640 (0.0006) +[2023-07-08 15:06:46,534][994321] Fps is (10 sec: 8601.4, 60 sec: 7208.9, 300 sec: 7208.9). Total num frames: 360448. Throughput: 0: 7537.6. Samples: 339192. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 15:06:46,535][994321] Avg episode reward: [(0, '9.978')] +[2023-07-08 15:06:46,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000704_360448.pth... +[2023-07-08 15:06:46,541][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000208_106496.pth +[2023-07-08 15:06:47,381][994606] Updated weights for policy 0, policy_version 720 (0.0005) +[2023-07-08 15:06:51,534][994321] Fps is (10 sec: 8191.9, 60 sec: 7298.3, 300 sec: 7298.3). Total num frames: 401408. Throughput: 0: 8378.1. Samples: 385772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:06:51,534][994321] Avg episode reward: [(0, '9.989')] +[2023-07-08 15:06:52,436][994606] Updated weights for policy 0, policy_version 800 (0.0005) +[2023-07-08 15:06:56,534][994321] Fps is (10 sec: 8192.3, 60 sec: 7372.8, 300 sec: 7372.8). Total num frames: 442368. Throughput: 0: 8405.1. Samples: 437088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:06:56,534][994321] Avg episode reward: [(0, '10.029')] +[2023-07-08 15:06:57,450][994606] Updated weights for policy 0, policy_version 880 (0.0005) +[2023-07-08 15:07:01,534][994321] Fps is (10 sec: 8191.8, 60 sec: 8055.5, 300 sec: 7435.8). Total num frames: 483328. Throughput: 0: 8391.3. Samples: 461680. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:07:01,535][994321] Avg episode reward: [(0, '9.911')] +[2023-07-08 15:07:01,539][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000944_483328.pth... +[2023-07-08 15:07:01,542][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000448_229376.pth +[2023-07-08 15:07:02,302][994606] Updated weights for policy 0, policy_version 960 (0.0005) +[2023-07-08 15:07:06,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8396.8, 300 sec: 7548.4). Total num frames: 528384. Throughput: 0: 8439.0. Samples: 512784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:07:06,534][994321] Avg episode reward: [(0, '10.099')] +[2023-07-08 15:07:06,535][994562] Saving new best policy, reward=10.099! +[2023-07-08 15:07:06,954][994606] Updated weights for policy 0, policy_version 1040 (0.0005) +[2023-07-08 15:07:11,534][994321] Fps is (10 sec: 8601.8, 60 sec: 8465.1, 300 sec: 7591.3). Total num frames: 569344. Throughput: 0: 8465.0. Samples: 564540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:07:11,534][994321] Avg episode reward: [(0, '9.939')] +[2023-07-08 15:07:11,949][994606] Updated weights for policy 0, policy_version 1120 (0.0005) +[2023-07-08 15:07:16,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8465.1, 300 sec: 7628.8). Total num frames: 610304. Throughput: 0: 8455.0. Samples: 589296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:07:16,534][994321] Avg episode reward: [(0, '10.004')] +[2023-07-08 15:07:16,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001192_610304.pth... +[2023-07-08 15:07:16,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000704_360448.pth +[2023-07-08 15:07:16,635][994606] Updated weights for policy 0, policy_version 1200 (0.0005) +[2023-07-08 15:07:21,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8465.1, 300 sec: 7661.9). Total num frames: 651264. Throughput: 0: 8465.0. Samples: 638984. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 15:07:21,534][994321] Avg episode reward: [(0, '10.219')] +[2023-07-08 15:07:21,535][994562] Saving new best policy, reward=10.219! +[2023-07-08 15:07:21,775][994606] Updated weights for policy 0, policy_version 1280 (0.0005) +[2023-07-08 15:07:26,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8396.8, 300 sec: 7691.4). Total num frames: 692224. Throughput: 0: 8350.4. Samples: 688096. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 15:07:26,534][994321] Avg episode reward: [(0, '10.019')] +[2023-07-08 15:07:26,812][994606] Updated weights for policy 0, policy_version 1360 (0.0005) +[2023-07-08 15:07:31,502][994606] Updated weights for policy 0, policy_version 1440 (0.0005) +[2023-07-08 15:07:31,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8465.1, 300 sec: 7760.8). Total num frames: 737280. Throughput: 0: 8363.6. Samples: 715552. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 15:07:31,534][994321] Avg episode reward: [(0, '10.142')] +[2023-07-08 15:07:31,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001440_737280.pth... +[2023-07-08 15:07:31,541][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000944_483328.pth +[2023-07-08 15:07:36,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8328.5, 300 sec: 7741.5). Total num frames: 774144. Throughput: 0: 8381.6. Samples: 762944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:07:36,534][994321] Avg episode reward: [(0, '10.113')] +[2023-07-08 15:07:36,638][994606] Updated weights for policy 0, policy_version 1520 (0.0005) +[2023-07-08 15:07:41,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8260.3, 300 sec: 7762.9). Total num frames: 815104. Throughput: 0: 8310.7. Samples: 811072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:07:41,534][994321] Avg episode reward: [(0, '10.448')] +[2023-07-08 15:07:41,535][994562] Saving new best policy, reward=10.448! +[2023-07-08 15:07:41,731][994606] Updated weights for policy 0, policy_version 1600 (0.0005) +[2023-07-08 15:07:46,499][994606] Updated weights for policy 0, policy_version 1680 (0.0005) +[2023-07-08 15:07:46,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8328.6, 300 sec: 7819.6). Total num frames: 860160. Throughput: 0: 8308.7. Samples: 835568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:07:46,534][994321] Avg episode reward: [(0, '10.582')] +[2023-07-08 15:07:46,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001680_860160.pth... +[2023-07-08 15:07:46,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001192_610304.pth +[2023-07-08 15:07:46,539][994562] Saving new best policy, reward=10.582! +[2023-07-08 15:07:51,408][994606] Updated weights for policy 0, policy_version 1760 (0.0005) +[2023-07-08 15:07:51,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8328.5, 300 sec: 7835.8). Total num frames: 901120. Throughput: 0: 8333.9. Samples: 887812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:07:51,534][994321] Avg episode reward: [(0, '12.173')] +[2023-07-08 15:07:51,535][994562] Saving new best policy, reward=12.173! +[2023-07-08 15:07:56,269][994606] Updated weights for policy 0, policy_version 1840 (0.0005) +[2023-07-08 15:07:56,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8328.5, 300 sec: 7850.7). Total num frames: 942080. Throughput: 0: 8298.6. Samples: 937976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:07:56,534][994321] Avg episode reward: [(0, '14.257')] +[2023-07-08 15:07:56,535][994562] Saving new best policy, reward=14.257! +[2023-07-08 15:08:01,327][994606] Updated weights for policy 0, policy_version 1920 (0.0005) +[2023-07-08 15:08:01,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8328.6, 300 sec: 7864.3). Total num frames: 983040. Throughput: 0: 8267.7. Samples: 961344. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:08:01,534][994321] Avg episode reward: [(0, '17.000')] +[2023-07-08 15:08:01,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001920_983040.pth... +[2023-07-08 15:08:01,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001440_737280.pth +[2023-07-08 15:08:01,539][994562] Saving new best policy, reward=17.000! +[2023-07-08 15:08:06,487][994606] Updated weights for policy 0, policy_version 2000 (0.0005) +[2023-07-08 15:08:06,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8260.3, 300 sec: 7876.9). Total num frames: 1024000. Throughput: 0: 8242.6. Samples: 1009904. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 15:08:06,535][994321] Avg episode reward: [(0, '17.088')] +[2023-07-08 15:08:06,535][994562] Saving new best policy, reward=17.097! +[2023-07-08 15:08:11,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8192.0, 300 sec: 7858.3). Total num frames: 1060864. Throughput: 0: 8194.7. Samples: 1056856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:08:11,535][994321] Avg episode reward: [(0, '22.397')] +[2023-07-08 15:08:11,535][994562] Saving new best policy, reward=22.397! +[2023-07-08 15:08:11,731][994606] Updated weights for policy 0, policy_version 2080 (0.0005) +[2023-07-08 15:08:16,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8192.0, 300 sec: 7870.2). Total num frames: 1101824. Throughput: 0: 8109.9. Samples: 1080496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:08:16,535][994321] Avg episode reward: [(0, '23.875')] +[2023-07-08 15:08:16,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002152_1101824.pth... +[2023-07-08 15:08:16,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001680_860160.pth +[2023-07-08 15:08:16,540][994562] Saving new best policy, reward=23.875! +[2023-07-08 15:08:16,928][994606] Updated weights for policy 0, policy_version 2160 (0.0005) +[2023-07-08 15:08:21,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 7881.3). Total num frames: 1142784. Throughput: 0: 8133.9. Samples: 1128972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:08:21,553][994321] Avg episode reward: [(0, '23.112')] +[2023-07-08 15:08:21,940][994606] Updated weights for policy 0, policy_version 2240 (0.0005) +[2023-07-08 15:08:26,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 7891.6). Total num frames: 1183744. Throughput: 0: 8189.2. Samples: 1179588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:08:26,535][994321] Avg episode reward: [(0, '28.205')] +[2023-07-08 15:08:26,535][994562] Saving new best policy, reward=28.205! +[2023-07-08 15:08:26,833][994606] Updated weights for policy 0, policy_version 2320 (0.0005) +[2023-07-08 15:08:31,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8123.7, 300 sec: 7901.3). Total num frames: 1224704. Throughput: 0: 8174.7. Samples: 1203428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:08:31,535][994321] Avg episode reward: [(0, '35.303')] +[2023-07-08 15:08:31,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002392_1224704.pth... +[2023-07-08 15:08:31,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001920_983040.pth +[2023-07-08 15:08:31,540][994562] Saving new best policy, reward=35.303! +[2023-07-08 15:08:32,075][994606] Updated weights for policy 0, policy_version 2400 (0.0005) +[2023-07-08 15:08:36,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8123.7, 300 sec: 7884.8). Total num frames: 1261568. Throughput: 0: 8032.6. Samples: 1249280. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 15:08:36,534][994321] Avg episode reward: [(0, '41.874')] +[2023-07-08 15:08:36,535][994562] Saving new best policy, reward=41.874! +[2023-07-08 15:08:37,279][994606] Updated weights for policy 0, policy_version 2480 (0.0005) +[2023-07-08 15:08:41,534][994321] Fps is (10 sec: 7782.3, 60 sec: 8123.7, 300 sec: 7894.1). Total num frames: 1302528. Throughput: 0: 8014.6. Samples: 1298632. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:08:41,534][994321] Avg episode reward: [(0, '51.208')] +[2023-07-08 15:08:41,535][994562] Saving new best policy, reward=51.208! +[2023-07-08 15:08:42,308][994606] Updated weights for policy 0, policy_version 2560 (0.0005) +[2023-07-08 15:08:46,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8055.5, 300 sec: 7902.9). Total num frames: 1343488. Throughput: 0: 8010.9. Samples: 1321836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:08:46,534][994321] Avg episode reward: [(0, '62.375')] +[2023-07-08 15:08:46,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002624_1343488.pth... +[2023-07-08 15:08:46,541][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002152_1101824.pth +[2023-07-08 15:08:46,541][994562] Saving new best policy, reward=62.375! +[2023-07-08 15:08:47,530][994606] Updated weights for policy 0, policy_version 2640 (0.0005) +[2023-07-08 15:08:51,534][994321] Fps is (10 sec: 7782.4, 60 sec: 7987.2, 300 sec: 7887.7). Total num frames: 1380352. Throughput: 0: 8023.7. Samples: 1370972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:08:51,534][994321] Avg episode reward: [(0, '61.014')] +[2023-07-08 15:08:52,629][994606] Updated weights for policy 0, policy_version 2720 (0.0005) +[2023-07-08 15:08:56,534][994321] Fps is (10 sec: 7782.4, 60 sec: 7987.2, 300 sec: 7896.2). Total num frames: 1421312. Throughput: 0: 7996.0. Samples: 1416676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:08:56,534][994321] Avg episode reward: [(0, '68.954')] +[2023-07-08 15:08:56,535][994562] Saving new best policy, reward=68.954! +[2023-07-08 15:08:57,830][994606] Updated weights for policy 0, policy_version 2800 (0.0005) +[2023-07-08 15:09:01,534][994321] Fps is (10 sec: 8192.0, 60 sec: 7987.2, 300 sec: 7904.2). Total num frames: 1462272. Throughput: 0: 8003.5. Samples: 1440652. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 15:09:01,534][994321] Avg episode reward: [(0, '71.820')] +[2023-07-08 15:09:01,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002856_1462272.pth... +[2023-07-08 15:09:01,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002392_1224704.pth +[2023-07-08 15:09:01,540][994562] Saving new best policy, reward=71.820! +[2023-07-08 15:09:02,882][994606] Updated weights for policy 0, policy_version 2880 (0.0005) +[2023-07-08 15:09:06,534][994321] Fps is (10 sec: 8192.0, 60 sec: 7987.2, 300 sec: 7911.8). Total num frames: 1503232. Throughput: 0: 8029.2. Samples: 1490284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:09:06,534][994321] Avg episode reward: [(0, '73.901')] +[2023-07-08 15:09:06,535][994562] Saving new best policy, reward=73.901! +[2023-07-08 15:09:07,884][994606] Updated weights for policy 0, policy_version 2960 (0.0005) +[2023-07-08 15:09:11,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8123.7, 300 sec: 7939.9). Total num frames: 1548288. Throughput: 0: 8015.9. Samples: 1540304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:09:11,534][994321] Avg episode reward: [(0, '79.602')] +[2023-07-08 15:09:11,535][994562] Saving new best policy, reward=79.602! +[2023-07-08 15:09:12,590][994606] Updated weights for policy 0, policy_version 3040 (0.0005) +[2023-07-08 15:09:16,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8055.5, 300 sec: 7925.8). Total num frames: 1585152. Throughput: 0: 8029.7. Samples: 1564764. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 15:09:16,534][994321] Avg episode reward: [(0, '83.208')] +[2023-07-08 15:09:16,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003096_1585152.pth... +[2023-07-08 15:09:16,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002624_1343488.pth +[2023-07-08 15:09:16,540][994562] Saving new best policy, reward=83.208! +[2023-07-08 15:09:17,878][994606] Updated weights for policy 0, policy_version 3120 (0.0007) +[2023-07-08 15:09:21,534][994321] Fps is (10 sec: 7372.9, 60 sec: 7987.2, 300 sec: 7912.3). Total num frames: 1622016. Throughput: 0: 8065.3. Samples: 1612216. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:09:21,534][994321] Avg episode reward: [(0, '77.360')] +[2023-07-08 15:09:23,036][994606] Updated weights for policy 0, policy_version 3200 (0.0005) +[2023-07-08 15:09:26,534][994321] Fps is (10 sec: 7782.6, 60 sec: 7987.2, 300 sec: 7918.9). Total num frames: 1662976. Throughput: 0: 8004.7. Samples: 1658844. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:09:26,534][994321] Avg episode reward: [(0, '84.830')] +[2023-07-08 15:09:26,535][994562] Saving new best policy, reward=84.830! +[2023-07-08 15:09:28,204][994606] Updated weights for policy 0, policy_version 3280 (0.0005) +[2023-07-08 15:09:31,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8055.5, 300 sec: 7944.3). Total num frames: 1708032. Throughput: 0: 8035.8. Samples: 1683448. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:09:31,535][994321] Avg episode reward: [(0, '84.503')] +[2023-07-08 15:09:31,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003336_1708032.pth... +[2023-07-08 15:09:31,541][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002856_1462272.pth +[2023-07-08 15:09:32,785][994606] Updated weights for policy 0, policy_version 3360 (0.0005) +[2023-07-08 15:09:36,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8123.7, 300 sec: 7950.0). Total num frames: 1748992. Throughput: 0: 8125.2. Samples: 1736604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:09:36,535][994321] Avg episode reward: [(0, '87.266')] +[2023-07-08 15:09:36,535][994562] Saving new best policy, reward=87.266! +[2023-07-08 15:09:37,810][994606] Updated weights for policy 0, policy_version 3440 (0.0005) +[2023-07-08 15:09:41,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.7, 300 sec: 7955.3). Total num frames: 1789952. Throughput: 0: 8195.4. Samples: 1785468. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:09:41,535][994321] Avg episode reward: [(0, '88.916')] +[2023-07-08 15:09:41,535][994562] Saving new best policy, reward=88.916! +[2023-07-08 15:09:43,019][994606] Updated weights for policy 0, policy_version 3520 (0.0005) +[2023-07-08 15:09:46,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.7, 300 sec: 7960.5). Total num frames: 1830912. Throughput: 0: 8154.0. Samples: 1807580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:09:46,535][994321] Avg episode reward: [(0, '88.868')] +[2023-07-08 15:09:46,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003576_1830912.pth... +[2023-07-08 15:09:46,541][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003096_1585152.pth +[2023-07-08 15:09:47,918][994606] Updated weights for policy 0, policy_version 3600 (0.0005) +[2023-07-08 15:09:51,534][994321] Fps is (10 sec: 7782.5, 60 sec: 8123.8, 300 sec: 7948.0). Total num frames: 1867776. Throughput: 0: 8160.4. Samples: 1857500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:09:51,534][994321] Avg episode reward: [(0, '89.926')] +[2023-07-08 15:09:51,535][994562] Saving new best policy, reward=89.926! +[2023-07-08 15:09:53,286][994606] Updated weights for policy 0, policy_version 3680 (0.0005) +[2023-07-08 15:09:56,534][994321] Fps is (10 sec: 7782.5, 60 sec: 8123.7, 300 sec: 7953.1). Total num frames: 1908736. Throughput: 0: 8084.7. Samples: 1904116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:09:56,534][994321] Avg episode reward: [(0, '92.301')] +[2023-07-08 15:09:56,535][994562] Saving new best policy, reward=92.301! +[2023-07-08 15:09:58,176][994606] Updated weights for policy 0, policy_version 3760 (0.0006) +[2023-07-08 15:10:01,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.8, 300 sec: 7957.9). Total num frames: 1949696. Throughput: 0: 8109.8. Samples: 1929704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:10:01,534][994321] Avg episode reward: [(0, '99.632')] +[2023-07-08 15:10:01,536][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003808_1949696.pth... +[2023-07-08 15:10:01,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003336_1708032.pth +[2023-07-08 15:10:01,540][994562] Saving new best policy, reward=99.632! +[2023-07-08 15:10:03,121][994606] Updated weights for policy 0, policy_version 3840 (0.0005) +[2023-07-08 15:10:06,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8123.7, 300 sec: 7962.6). Total num frames: 1990656. Throughput: 0: 8136.7. Samples: 1978368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:10:06,534][994321] Avg episode reward: [(0, '103.616')] +[2023-07-08 15:10:06,535][994562] Saving new best policy, reward=103.616! +[2023-07-08 15:10:08,596][994606] Updated weights for policy 0, policy_version 3920 (0.0004) +[2023-07-08 15:10:11,534][994321] Fps is (10 sec: 7782.4, 60 sec: 7987.2, 300 sec: 7951.1). Total num frames: 2027520. Throughput: 0: 8101.8. Samples: 2023424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:10:11,534][994321] Avg episode reward: [(0, '105.396')] +[2023-07-08 15:10:11,535][994562] Saving new best policy, reward=105.396! +[2023-07-08 15:10:13,793][994606] Updated weights for policy 0, policy_version 4000 (0.0005) +[2023-07-08 15:10:16,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8055.5, 300 sec: 7955.7). Total num frames: 2068480. Throughput: 0: 8098.7. Samples: 2047888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:10:16,534][994321] Avg episode reward: [(0, '111.828')] +[2023-07-08 15:10:16,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004040_2068480.pth... +[2023-07-08 15:10:16,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003576_1830912.pth +[2023-07-08 15:10:16,541][994562] Saving new best policy, reward=111.828! +[2023-07-08 15:10:18,682][994606] Updated weights for policy 0, policy_version 4080 (0.0005) +[2023-07-08 15:10:21,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.7, 300 sec: 7960.2). Total num frames: 2109440. Throughput: 0: 8011.6. Samples: 2097124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:10:21,534][994321] Avg episode reward: [(0, '106.952')] +[2023-07-08 15:10:23,707][994606] Updated weights for policy 0, policy_version 4160 (0.0005) +[2023-07-08 15:10:26,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8123.7, 300 sec: 7964.5). Total num frames: 2150400. Throughput: 0: 8031.9. Samples: 2146900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:10:26,534][994321] Avg episode reward: [(0, '109.616')] +[2023-07-08 15:10:28,534][994606] Updated weights for policy 0, policy_version 4240 (0.0005) +[2023-07-08 15:10:31,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8055.5, 300 sec: 7968.6). Total num frames: 2191360. Throughput: 0: 8105.5. Samples: 2172328. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 15:10:31,534][994321] Avg episode reward: [(0, '111.255')] +[2023-07-08 15:10:31,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004280_2191360.pth... +[2023-07-08 15:10:31,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003808_1949696.pth +[2023-07-08 15:10:33,824][994606] Updated weights for policy 0, policy_version 4320 (0.0005) +[2023-07-08 15:10:36,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8055.5, 300 sec: 7972.6). Total num frames: 2232320. Throughput: 0: 8050.7. Samples: 2219780. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:10:36,534][994321] Avg episode reward: [(0, '109.469')] +[2023-07-08 15:10:38,747][994606] Updated weights for policy 0, policy_version 4400 (0.0005) +[2023-07-08 15:10:41,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8055.5, 300 sec: 7976.4). Total num frames: 2273280. Throughput: 0: 8086.4. Samples: 2268004. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 15:10:41,534][994321] Avg episode reward: [(0, '109.352')] +[2023-07-08 15:10:44,003][994606] Updated weights for policy 0, policy_version 4480 (0.0005) +[2023-07-08 15:10:46,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8055.5, 300 sec: 7980.1). Total num frames: 2314240. Throughput: 0: 8049.9. Samples: 2291952. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 15:10:46,534][994321] Avg episode reward: [(0, '120.078')] +[2023-07-08 15:10:46,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004520_2314240.pth... +[2023-07-08 15:10:46,541][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004040_2068480.pth +[2023-07-08 15:10:46,542][994562] Saving new best policy, reward=120.078! +[2023-07-08 15:10:49,039][994606] Updated weights for policy 0, policy_version 4560 (0.0005) +[2023-07-08 15:10:51,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8123.7, 300 sec: 7983.7). Total num frames: 2355200. Throughput: 0: 8061.4. Samples: 2341132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:10:51,534][994321] Avg episode reward: [(0, '132.698')] +[2023-07-08 15:10:51,535][994562] Saving new best policy, reward=132.698! +[2023-07-08 15:10:53,994][994606] Updated weights for policy 0, policy_version 4640 (0.0005) +[2023-07-08 15:10:56,534][994321] Fps is (10 sec: 7782.5, 60 sec: 8055.5, 300 sec: 8108.7). Total num frames: 2392064. Throughput: 0: 8130.0. Samples: 2389276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:10:56,534][994321] Avg episode reward: [(0, '147.518')] +[2023-07-08 15:10:56,535][994562] Saving new best policy, reward=147.518! +[2023-07-08 15:10:58,936][994606] Updated weights for policy 0, policy_version 4720 (0.0005) +[2023-07-08 15:11:01,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.7, 300 sec: 8178.1). Total num frames: 2437120. Throughput: 0: 8154.1. Samples: 2414820. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:11:01,534][994321] Avg episode reward: [(0, '133.063')] +[2023-07-08 15:11:01,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004760_2437120.pth... +[2023-07-08 15:11:01,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004280_2191360.pth +[2023-07-08 15:11:03,516][994606] Updated weights for policy 0, policy_version 4800 (0.0004) +[2023-07-08 15:11:06,534][994321] Fps is (10 sec: 9011.2, 60 sec: 8192.0, 300 sec: 8205.9). Total num frames: 2482176. Throughput: 0: 8266.5. Samples: 2469116. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:11:06,534][994321] Avg episode reward: [(0, '149.882')] +[2023-07-08 15:11:06,535][994562] Saving new best policy, reward=149.882! +[2023-07-08 15:11:08,352][994606] Updated weights for policy 0, policy_version 4880 (0.0005) +[2023-07-08 15:11:11,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 2519040. Throughput: 0: 8225.5. Samples: 2517048. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:11:11,534][994321] Avg episode reward: [(0, '154.544')] +[2023-07-08 15:11:11,535][994562] Saving new best policy, reward=154.544! +[2023-07-08 15:11:13,712][994606] Updated weights for policy 0, policy_version 4960 (0.0005) +[2023-07-08 15:11:16,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 2560000. Throughput: 0: 8160.1. Samples: 2539532. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:11:16,534][994321] Avg episode reward: [(0, '176.801')] +[2023-07-08 15:11:16,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005000_2560000.pth... +[2023-07-08 15:11:16,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004520_2314240.pth +[2023-07-08 15:11:16,540][994562] Saving new best policy, reward=176.801! +[2023-07-08 15:11:18,920][994606] Updated weights for policy 0, policy_version 5040 (0.0005) +[2023-07-08 15:11:21,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8178.1). Total num frames: 2600960. Throughput: 0: 8147.5. Samples: 2586420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:11:21,534][994321] Avg episode reward: [(0, '178.606')] +[2023-07-08 15:11:21,535][994562] Saving new best policy, reward=178.606! +[2023-07-08 15:11:23,929][994606] Updated weights for policy 0, policy_version 5120 (0.0005) +[2023-07-08 15:11:26,534][994321] Fps is (10 sec: 7782.5, 60 sec: 8123.7, 300 sec: 8164.2). Total num frames: 2637824. Throughput: 0: 8164.7. Samples: 2635416. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 15:11:26,534][994321] Avg episode reward: [(0, '169.822')] +[2023-07-08 15:11:29,138][994606] Updated weights for policy 0, policy_version 5200 (0.0005) +[2023-07-08 15:11:31,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8164.2). Total num frames: 2682880. Throughput: 0: 8148.2. Samples: 2658620. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:11:31,534][994321] Avg episode reward: [(0, '161.820')] +[2023-07-08 15:11:31,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005240_2682880.pth... +[2023-07-08 15:11:31,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004760_2437120.pth +[2023-07-08 15:11:33,938][994606] Updated weights for policy 0, policy_version 5280 (0.0005) +[2023-07-08 15:11:36,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8123.7, 300 sec: 8136.5). Total num frames: 2719744. Throughput: 0: 8192.8. Samples: 2709808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:11:36,534][994321] Avg episode reward: [(0, '163.099')] +[2023-07-08 15:11:39,179][994606] Updated weights for policy 0, policy_version 5360 (0.0006) +[2023-07-08 15:11:41,534][994321] Fps is (10 sec: 7782.5, 60 sec: 8123.7, 300 sec: 8136.5). Total num frames: 2760704. Throughput: 0: 8170.6. Samples: 2756952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:11:41,535][994321] Avg episode reward: [(0, '162.765')] +[2023-07-08 15:11:43,926][994606] Updated weights for policy 0, policy_version 5440 (0.0005) +[2023-07-08 15:11:46,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8192.0, 300 sec: 8150.3). Total num frames: 2805760. Throughput: 0: 8178.7. Samples: 2782864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:11:46,535][994321] Avg episode reward: [(0, '192.689')] +[2023-07-08 15:11:46,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005480_2805760.pth... +[2023-07-08 15:11:46,541][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005000_2560000.pth +[2023-07-08 15:11:46,541][994562] Saving new best policy, reward=192.689! +[2023-07-08 15:11:48,781][994606] Updated weights for policy 0, policy_version 5520 (0.0005) +[2023-07-08 15:11:51,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8192.0, 300 sec: 8150.3). Total num frames: 2846720. Throughput: 0: 8115.7. Samples: 2834324. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:11:51,534][994321] Avg episode reward: [(0, '194.531')] +[2023-07-08 15:11:51,535][994562] Saving new best policy, reward=194.531! +[2023-07-08 15:11:53,568][994606] Updated weights for policy 0, policy_version 5600 (0.0005) +[2023-07-08 15:11:56,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8328.5, 300 sec: 8164.2). Total num frames: 2891776. Throughput: 0: 8217.1. Samples: 2886816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:11:56,534][994321] Avg episode reward: [(0, '189.768')] +[2023-07-08 15:11:58,062][994606] Updated weights for policy 0, policy_version 5680 (0.0004) +[2023-07-08 15:12:01,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8260.3, 300 sec: 8150.3). Total num frames: 2932736. Throughput: 0: 8292.8. Samples: 2912708. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:12:01,534][994321] Avg episode reward: [(0, '164.839')] +[2023-07-08 15:12:01,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005728_2932736.pth... +[2023-07-08 15:12:01,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005240_2682880.pth +[2023-07-08 15:12:03,234][994606] Updated weights for policy 0, policy_version 5760 (0.0005) +[2023-07-08 15:12:06,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8150.3). Total num frames: 2973696. Throughput: 0: 8333.1. Samples: 2961408. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:12:06,534][994321] Avg episode reward: [(0, '166.898')] +[2023-07-08 15:12:08,393][994606] Updated weights for policy 0, policy_version 5840 (0.0005) +[2023-07-08 15:12:11,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8260.3, 300 sec: 8150.3). Total num frames: 3014656. Throughput: 0: 8336.2. Samples: 3010544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:12:11,534][994321] Avg episode reward: [(0, '174.953')] +[2023-07-08 15:12:13,168][994606] Updated weights for policy 0, policy_version 5920 (0.0005) +[2023-07-08 15:12:16,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8260.3, 300 sec: 8150.3). Total num frames: 3055616. Throughput: 0: 8368.8. Samples: 3035216. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 15:12:16,534][994321] Avg episode reward: [(0, '184.621')] +[2023-07-08 15:12:16,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005968_3055616.pth... +[2023-07-08 15:12:16,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005480_2805760.pth +[2023-07-08 15:12:17,997][994606] Updated weights for policy 0, policy_version 6000 (0.0005) +[2023-07-08 15:12:21,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8328.5, 300 sec: 8164.2). Total num frames: 3100672. Throughput: 0: 8384.8. Samples: 3087124. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:12:21,534][994321] Avg episode reward: [(0, '169.403')] +[2023-07-08 15:12:22,843][994606] Updated weights for policy 0, policy_version 6080 (0.0005) +[2023-07-08 15:12:26,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8396.8, 300 sec: 8150.3). Total num frames: 3141632. Throughput: 0: 8420.7. Samples: 3135884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:12:26,535][994321] Avg episode reward: [(0, '171.572')] +[2023-07-08 15:12:27,854][994606] Updated weights for policy 0, policy_version 6160 (0.0005) +[2023-07-08 15:12:31,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8328.5, 300 sec: 8164.2). Total num frames: 3182592. Throughput: 0: 8408.8. Samples: 3161260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:12:31,535][994321] Avg episode reward: [(0, '179.980')] +[2023-07-08 15:12:31,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006216_3182592.pth... +[2023-07-08 15:12:31,541][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005728_2932736.pth +[2023-07-08 15:12:32,688][994606] Updated weights for policy 0, policy_version 6240 (0.0005) +[2023-07-08 15:12:36,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8396.8, 300 sec: 8164.2). Total num frames: 3223552. Throughput: 0: 8405.4. Samples: 3212568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:12:36,534][994321] Avg episode reward: [(0, '177.582')] +[2023-07-08 15:12:37,797][994606] Updated weights for policy 0, policy_version 6320 (0.0005) +[2023-07-08 15:12:41,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8396.8, 300 sec: 8150.3). Total num frames: 3264512. Throughput: 0: 8280.8. Samples: 3259452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:12:41,534][994321] Avg episode reward: [(0, '177.601')] +[2023-07-08 15:12:42,962][994606] Updated weights for policy 0, policy_version 6400 (0.0005) +[2023-07-08 15:12:46,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8260.3, 300 sec: 8136.5). Total num frames: 3301376. Throughput: 0: 8204.1. Samples: 3281892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:12:46,534][994321] Avg episode reward: [(0, '173.298')] +[2023-07-08 15:12:46,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006448_3301376.pth... +[2023-07-08 15:12:46,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005968_3055616.pth +[2023-07-08 15:12:48,340][994606] Updated weights for policy 0, policy_version 6480 (0.0004) +[2023-07-08 15:12:51,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8260.3, 300 sec: 8136.5). Total num frames: 3342336. Throughput: 0: 8163.9. Samples: 3328784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:12:51,534][994321] Avg episode reward: [(0, '207.819')] +[2023-07-08 15:12:51,535][994562] Saving new best policy, reward=207.819! +[2023-07-08 15:12:53,426][994606] Updated weights for policy 0, policy_version 6560 (0.0005) +[2023-07-08 15:12:56,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8136.5). Total num frames: 3383296. Throughput: 0: 8193.8. Samples: 3379264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:12:56,534][994321] Avg episode reward: [(0, '192.564')] +[2023-07-08 15:12:58,097][994606] Updated weights for policy 0, policy_version 6640 (0.0006) +[2023-07-08 15:13:01,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8260.3, 300 sec: 8150.3). Total num frames: 3428352. Throughput: 0: 8207.7. Samples: 3404564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:13:01,534][994321] Avg episode reward: [(0, '166.998')] +[2023-07-08 15:13:01,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006696_3428352.pth... +[2023-07-08 15:13:01,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006216_3182592.pth +[2023-07-08 15:13:02,593][994606] Updated weights for policy 0, policy_version 6720 (0.0005) +[2023-07-08 15:13:06,534][994321] Fps is (10 sec: 8601.7, 60 sec: 8260.3, 300 sec: 8164.2). Total num frames: 3469312. Throughput: 0: 8240.0. Samples: 3457924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:13:06,534][994321] Avg episode reward: [(0, '198.961')] +[2023-07-08 15:13:07,597][994606] Updated weights for policy 0, policy_version 6800 (0.0005) +[2023-07-08 15:13:11,534][994321] Fps is (10 sec: 8601.7, 60 sec: 8328.5, 300 sec: 8178.1). Total num frames: 3514368. Throughput: 0: 8276.0. Samples: 3508304. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 15:13:11,534][994321] Avg episode reward: [(0, '197.879')] +[2023-07-08 15:13:12,429][994606] Updated weights for policy 0, policy_version 6880 (0.0005) +[2023-07-08 15:13:16,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8328.5, 300 sec: 8178.1). Total num frames: 3555328. Throughput: 0: 8267.0. Samples: 3533276. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 15:13:16,534][994321] Avg episode reward: [(0, '192.328')] +[2023-07-08 15:13:16,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006944_3555328.pth... +[2023-07-08 15:13:16,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006448_3301376.pth +[2023-07-08 15:13:17,243][994606] Updated weights for policy 0, policy_version 6960 (0.0005) +[2023-07-08 15:13:21,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8260.3, 300 sec: 8178.1). Total num frames: 3596288. Throughput: 0: 8276.5. Samples: 3585012. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:13:21,534][994321] Avg episode reward: [(0, '200.407')] +[2023-07-08 15:13:22,065][994606] Updated weights for policy 0, policy_version 7040 (0.0004) +[2023-07-08 15:13:26,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8260.3, 300 sec: 8178.1). Total num frames: 3637248. Throughput: 0: 8310.7. Samples: 3633432. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:13:26,534][994321] Avg episode reward: [(0, '165.300')] +[2023-07-08 15:13:27,162][994606] Updated weights for policy 0, policy_version 7120 (0.0005) +[2023-07-08 15:13:31,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8260.3, 300 sec: 8192.0). Total num frames: 3678208. Throughput: 0: 8352.2. Samples: 3657740. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:13:31,534][994321] Avg episode reward: [(0, '197.000')] +[2023-07-08 15:13:31,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007184_3678208.pth... +[2023-07-08 15:13:31,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006696_3428352.pth +[2023-07-08 15:13:32,326][994606] Updated weights for policy 0, policy_version 7200 (0.0005) +[2023-07-08 15:13:36,534][994321] Fps is (10 sec: 7782.5, 60 sec: 8192.0, 300 sec: 8178.1). Total num frames: 3715072. Throughput: 0: 8357.7. Samples: 3704880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:13:36,534][994321] Avg episode reward: [(0, '188.172')] +[2023-07-08 15:13:37,584][994606] Updated weights for policy 0, policy_version 7280 (0.0005) +[2023-07-08 15:13:41,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8260.3, 300 sec: 8192.0). Total num frames: 3760128. Throughput: 0: 8359.5. Samples: 3755440. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:13:41,534][994321] Avg episode reward: [(0, '202.843')] +[2023-07-08 15:13:42,451][994606] Updated weights for policy 0, policy_version 7360 (0.0005) +[2023-07-08 15:13:46,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8328.5, 300 sec: 8205.9). Total num frames: 3801088. Throughput: 0: 8316.9. Samples: 3778824. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:13:46,534][994321] Avg episode reward: [(0, '183.480')] +[2023-07-08 15:13:46,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007424_3801088.pth... +[2023-07-08 15:13:46,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006944_3555328.pth +[2023-07-08 15:13:47,588][994606] Updated weights for policy 0, policy_version 7440 (0.0005) +[2023-07-08 15:13:51,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8260.3, 300 sec: 8192.0). Total num frames: 3837952. Throughput: 0: 8171.1. Samples: 3825624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:13:51,534][994321] Avg episode reward: [(0, '197.392')] +[2023-07-08 15:13:52,829][994606] Updated weights for policy 0, policy_version 7520 (0.0005) +[2023-07-08 15:13:56,534][994321] Fps is (10 sec: 7782.5, 60 sec: 8260.3, 300 sec: 8192.0). Total num frames: 3878912. Throughput: 0: 8124.7. Samples: 3873916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:13:56,534][994321] Avg episode reward: [(0, '198.972')] +[2023-07-08 15:13:57,736][994606] Updated weights for policy 0, policy_version 7600 (0.0005) +[2023-07-08 15:14:01,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 3919872. Throughput: 0: 8136.1. Samples: 3899400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:14:01,534][994321] Avg episode reward: [(0, '193.568')] +[2023-07-08 15:14:01,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007656_3919872.pth... +[2023-07-08 15:14:01,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007184_3678208.pth +[2023-07-08 15:14:02,952][994606] Updated weights for policy 0, policy_version 7680 (0.0005) +[2023-07-08 15:14:06,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8178.1). Total num frames: 3960832. Throughput: 0: 8034.6. Samples: 3946568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:14:06,534][994321] Avg episode reward: [(0, '183.758')] +[2023-07-08 15:14:07,982][994606] Updated weights for policy 0, policy_version 7760 (0.0005) +[2023-07-08 15:14:11,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.7, 300 sec: 8192.0). Total num frames: 4001792. Throughput: 0: 8082.1. Samples: 3997128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:14:11,534][994321] Avg episode reward: [(0, '198.908')] +[2023-07-08 15:14:12,838][994606] Updated weights for policy 0, policy_version 7840 (0.0005) +[2023-07-08 15:14:16,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.7, 300 sec: 8205.9). Total num frames: 4042752. Throughput: 0: 8096.6. Samples: 4022088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:14:16,534][994321] Avg episode reward: [(0, '181.957')] +[2023-07-08 15:14:16,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007896_4042752.pth... +[2023-07-08 15:14:16,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007424_3801088.pth +[2023-07-08 15:14:17,726][994606] Updated weights for policy 0, policy_version 7920 (0.0005) +[2023-07-08 15:14:21,534][994321] Fps is (10 sec: 8601.7, 60 sec: 8192.0, 300 sec: 8219.8). Total num frames: 4087808. Throughput: 0: 8173.4. Samples: 4072684. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:14:21,534][994321] Avg episode reward: [(0, '196.313')] +[2023-07-08 15:14:22,589][994606] Updated weights for policy 0, policy_version 8000 (0.0005) +[2023-07-08 15:14:26,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8123.7, 300 sec: 8192.0). Total num frames: 4124672. Throughput: 0: 8115.6. Samples: 4120640. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:14:26,534][994321] Avg episode reward: [(0, '194.491')] +[2023-07-08 15:14:27,588][994606] Updated weights for policy 0, policy_version 8080 (0.0006) +[2023-07-08 15:14:31,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8123.7, 300 sec: 8192.0). Total num frames: 4165632. Throughput: 0: 8158.4. Samples: 4145952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:14:31,534][994321] Avg episode reward: [(0, '194.023')] +[2023-07-08 15:14:31,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008136_4165632.pth... +[2023-07-08 15:14:31,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007656_3919872.pth +[2023-07-08 15:14:32,911][994606] Updated weights for policy 0, policy_version 8160 (0.0005) +[2023-07-08 15:14:36,534][994321] Fps is (10 sec: 8191.8, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 4206592. Throughput: 0: 8153.6. Samples: 4192536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:14:36,534][994321] Avg episode reward: [(0, '172.446')] +[2023-07-08 15:14:37,836][994606] Updated weights for policy 0, policy_version 8240 (0.0005) +[2023-07-08 15:14:41,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.7, 300 sec: 8192.0). Total num frames: 4247552. Throughput: 0: 8205.1. Samples: 4243144. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 15:14:41,534][994321] Avg episode reward: [(0, '171.379')] +[2023-07-08 15:14:42,846][994606] Updated weights for policy 0, policy_version 8320 (0.0005) +[2023-07-08 15:14:46,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8123.7, 300 sec: 8205.9). Total num frames: 4288512. Throughput: 0: 8191.0. Samples: 4267996. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 15:14:46,534][994321] Avg episode reward: [(0, '168.761')] +[2023-07-08 15:14:46,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008376_4288512.pth... +[2023-07-08 15:14:46,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007896_4042752.pth +[2023-07-08 15:14:47,720][994606] Updated weights for policy 0, policy_version 8400 (0.0005) +[2023-07-08 15:14:51,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8260.3, 300 sec: 8219.8). Total num frames: 4333568. Throughput: 0: 8237.5. Samples: 4317256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:14:51,534][994321] Avg episode reward: [(0, '180.457')] +[2023-07-08 15:14:52,446][994606] Updated weights for policy 0, policy_version 8480 (0.0005) +[2023-07-08 15:14:56,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8192.0, 300 sec: 8205.9). Total num frames: 4370432. Throughput: 0: 8233.7. Samples: 4367644. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:14:56,534][994321] Avg episode reward: [(0, '205.047')] +[2023-07-08 15:14:57,448][994606] Updated weights for policy 0, policy_version 8560 (0.0005) +[2023-07-08 15:15:01,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8260.3, 300 sec: 8219.8). Total num frames: 4415488. Throughput: 0: 8287.1. Samples: 4395008. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:15:01,534][994321] Avg episode reward: [(0, '192.495')] +[2023-07-08 15:15:01,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008624_4415488.pth... +[2023-07-08 15:15:01,541][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008136_4165632.pth +[2023-07-08 15:15:02,392][994606] Updated weights for policy 0, policy_version 8640 (0.0005) +[2023-07-08 15:15:06,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8192.0, 300 sec: 8219.8). Total num frames: 4452352. Throughput: 0: 8215.2. Samples: 4442368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:15:06,534][994321] Avg episode reward: [(0, '196.513')] +[2023-07-08 15:15:07,591][994606] Updated weights for policy 0, policy_version 8720 (0.0005) +[2023-07-08 15:15:11,534][994321] Fps is (10 sec: 7782.5, 60 sec: 8192.0, 300 sec: 8219.8). Total num frames: 4493312. Throughput: 0: 8188.0. Samples: 4489100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:15:11,534][994321] Avg episode reward: [(0, '198.037')] +[2023-07-08 15:15:12,732][994606] Updated weights for policy 0, policy_version 8800 (0.0005) +[2023-07-08 15:15:16,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8219.8). Total num frames: 4534272. Throughput: 0: 8172.4. Samples: 4513708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:15:16,534][994321] Avg episode reward: [(0, '188.836')] +[2023-07-08 15:15:16,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008856_4534272.pth... +[2023-07-08 15:15:16,541][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008376_4288512.pth +[2023-07-08 15:15:17,766][994606] Updated weights for policy 0, policy_version 8880 (0.0005) +[2023-07-08 15:15:21,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.7, 300 sec: 8219.8). Total num frames: 4575232. Throughput: 0: 8229.9. Samples: 4562880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:15:21,534][994321] Avg episode reward: [(0, '187.980')] +[2023-07-08 15:15:22,870][994606] Updated weights for policy 0, policy_version 8960 (0.0006) +[2023-07-08 15:15:26,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8260.3, 300 sec: 8233.7). Total num frames: 4620288. Throughput: 0: 8222.8. Samples: 4613172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:15:26,534][994321] Avg episode reward: [(0, '191.088')] +[2023-07-08 15:15:27,510][994606] Updated weights for policy 0, policy_version 9040 (0.0005) +[2023-07-08 15:15:31,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8192.0, 300 sec: 8219.8). Total num frames: 4657152. Throughput: 0: 8238.1. Samples: 4638708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:15:31,534][994321] Avg episode reward: [(0, '199.425')] +[2023-07-08 15:15:31,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009096_4657152.pth... +[2023-07-08 15:15:31,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008624_4415488.pth +[2023-07-08 15:15:32,872][994606] Updated weights for policy 0, policy_version 9120 (0.0004) +[2023-07-08 15:15:36,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8192.0, 300 sec: 8219.8). Total num frames: 4698112. Throughput: 0: 8174.7. Samples: 4685116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:15:36,534][994321] Avg episode reward: [(0, '204.620')] +[2023-07-08 15:15:37,821][994606] Updated weights for policy 0, policy_version 9200 (0.0005) +[2023-07-08 15:15:41,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8219.8). Total num frames: 4739072. Throughput: 0: 8162.7. Samples: 4734968. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:15:41,534][994321] Avg episode reward: [(0, '189.444')] +[2023-07-08 15:15:42,736][994606] Updated weights for policy 0, policy_version 9280 (0.0005) +[2023-07-08 15:15:46,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8219.8). Total num frames: 4780032. Throughput: 0: 8100.5. Samples: 4759532. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:15:46,534][994321] Avg episode reward: [(0, '196.271')] +[2023-07-08 15:15:46,546][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009344_4784128.pth... +[2023-07-08 15:15:46,548][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008856_4534272.pth +[2023-07-08 15:15:47,499][994606] Updated weights for policy 0, policy_version 9360 (0.0005) +[2023-07-08 15:15:51,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8192.0, 300 sec: 8247.5). Total num frames: 4825088. Throughput: 0: 8228.7. Samples: 4812660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:15:51,534][994321] Avg episode reward: [(0, '192.579')] +[2023-07-08 15:15:52,475][994606] Updated weights for policy 0, policy_version 9440 (0.0005) +[2023-07-08 15:15:56,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8260.3, 300 sec: 8233.7). Total num frames: 4866048. Throughput: 0: 8232.2. Samples: 4859552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:15:56,534][994321] Avg episode reward: [(0, '190.352')] +[2023-07-08 15:15:57,550][994606] Updated weights for policy 0, policy_version 9520 (0.0005) +[2023-07-08 15:16:01,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8123.7, 300 sec: 8205.9). Total num frames: 4902912. Throughput: 0: 8225.6. Samples: 4883860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:16:01,534][994321] Avg episode reward: [(0, '194.289')] +[2023-07-08 15:16:01,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009576_4902912.pth... +[2023-07-08 15:16:01,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009096_4657152.pth +[2023-07-08 15:16:02,620][994606] Updated weights for policy 0, policy_version 9600 (0.0005) +[2023-07-08 15:16:06,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8260.3, 300 sec: 8233.7). Total num frames: 4947968. Throughput: 0: 8214.9. Samples: 4932552. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:16:06,534][994321] Avg episode reward: [(0, '204.426')] +[2023-07-08 15:16:07,452][994606] Updated weights for policy 0, policy_version 9680 (0.0005) +[2023-07-08 15:16:11,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8260.3, 300 sec: 8233.7). Total num frames: 4988928. Throughput: 0: 8272.7. Samples: 4985444. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:16:11,534][994321] Avg episode reward: [(0, '189.906')] +[2023-07-08 15:16:12,128][994606] Updated weights for policy 0, policy_version 9760 (0.0004) +[2023-07-08 15:16:16,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8260.3, 300 sec: 8233.7). Total num frames: 5029888. Throughput: 0: 8243.1. Samples: 5009648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:16:16,534][994321] Avg episode reward: [(0, '183.732')] +[2023-07-08 15:16:16,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009824_5029888.pth... +[2023-07-08 15:16:16,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009344_4784128.pth +[2023-07-08 15:16:17,107][994606] Updated weights for policy 0, policy_version 9840 (0.0005) +[2023-07-08 15:16:21,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8260.3, 300 sec: 8247.5). Total num frames: 5070848. Throughput: 0: 8300.2. Samples: 5058624. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 15:16:21,534][994321] Avg episode reward: [(0, '196.789')] +[2023-07-08 15:16:22,231][994606] Updated weights for policy 0, policy_version 9920 (0.0005) +[2023-07-08 15:16:26,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8192.0, 300 sec: 8233.7). Total num frames: 5111808. Throughput: 0: 8323.4. Samples: 5109520. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 15:16:26,534][994321] Avg episode reward: [(0, '198.932')] +[2023-07-08 15:16:27,068][994606] Updated weights for policy 0, policy_version 10000 (0.0005) +[2023-07-08 15:16:31,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8260.3, 300 sec: 8247.5). Total num frames: 5152768. Throughput: 0: 8297.7. Samples: 5132928. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 15:16:31,534][994321] Avg episode reward: [(0, '177.660')] +[2023-07-08 15:16:31,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010064_5152768.pth... +[2023-07-08 15:16:31,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009576_4902912.pth +[2023-07-08 15:16:32,223][994606] Updated weights for policy 0, policy_version 10080 (0.0005) +[2023-07-08 15:16:36,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8260.3, 300 sec: 8247.5). Total num frames: 5193728. Throughput: 0: 8185.3. Samples: 5181000. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 15:16:36,534][994321] Avg episode reward: [(0, '176.497')] +[2023-07-08 15:16:37,379][994606] Updated weights for policy 0, policy_version 10160 (0.0005) +[2023-07-08 15:16:41,534][994321] Fps is (10 sec: 8601.7, 60 sec: 8328.5, 300 sec: 8247.5). Total num frames: 5238784. Throughput: 0: 8282.7. Samples: 5232272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:16:41,534][994321] Avg episode reward: [(0, '182.718')] +[2023-07-08 15:16:41,961][994606] Updated weights for policy 0, policy_version 10240 (0.0006) +[2023-07-08 15:16:46,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8328.5, 300 sec: 8247.5). Total num frames: 5279744. Throughput: 0: 8340.1. Samples: 5259164. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:16:46,534][994321] Avg episode reward: [(0, '160.766')] +[2023-07-08 15:16:46,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010312_5279744.pth... +[2023-07-08 15:16:46,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009824_5029888.pth +[2023-07-08 15:16:46,848][994606] Updated weights for policy 0, policy_version 10320 (0.0005) +[2023-07-08 15:16:51,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8260.3, 300 sec: 8233.7). Total num frames: 5320704. Throughput: 0: 8335.6. Samples: 5307656. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:16:51,534][994321] Avg episode reward: [(0, '185.657')] +[2023-07-08 15:16:51,758][994606] Updated weights for policy 0, policy_version 10400 (0.0005) +[2023-07-08 15:16:56,317][994606] Updated weights for policy 0, policy_version 10480 (0.0005) +[2023-07-08 15:16:56,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8328.5, 300 sec: 8247.5). Total num frames: 5365760. Throughput: 0: 8317.0. Samples: 5359712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:16:56,534][994321] Avg episode reward: [(0, '194.157')] +[2023-07-08 15:17:00,966][994606] Updated weights for policy 0, policy_version 10560 (0.0005) +[2023-07-08 15:17:01,534][994321] Fps is (10 sec: 9011.2, 60 sec: 8465.1, 300 sec: 8261.4). Total num frames: 5410816. Throughput: 0: 8370.3. Samples: 5386312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:17:01,534][994321] Avg episode reward: [(0, '186.559')] +[2023-07-08 15:17:01,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010568_5410816.pth... +[2023-07-08 15:17:01,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010064_5152768.pth +[2023-07-08 15:17:06,064][994606] Updated weights for policy 0, policy_version 10640 (0.0005) +[2023-07-08 15:17:06,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8328.5, 300 sec: 8247.5). Total num frames: 5447680. Throughput: 0: 8411.6. Samples: 5437148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:17:06,534][994321] Avg episode reward: [(0, '196.519')] +[2023-07-08 15:17:11,139][994606] Updated weights for policy 0, policy_version 10720 (0.0005) +[2023-07-08 15:17:11,534][994321] Fps is (10 sec: 7782.5, 60 sec: 8328.5, 300 sec: 8247.5). Total num frames: 5488640. Throughput: 0: 8353.2. Samples: 5485412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:17:11,534][994321] Avg episode reward: [(0, '196.902')] +[2023-07-08 15:17:16,119][994606] Updated weights for policy 0, policy_version 10800 (0.0005) +[2023-07-08 15:17:16,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8328.5, 300 sec: 8233.7). Total num frames: 5529600. Throughput: 0: 8361.3. Samples: 5509184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:17:16,534][994321] Avg episode reward: [(0, '186.644')] +[2023-07-08 15:17:16,536][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010800_5529600.pth... +[2023-07-08 15:17:16,538][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010312_5279744.pth +[2023-07-08 15:17:21,232][994606] Updated weights for policy 0, policy_version 10880 (0.0005) +[2023-07-08 15:17:21,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8328.5, 300 sec: 8233.7). Total num frames: 5570560. Throughput: 0: 8384.0. Samples: 5558280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:17:21,534][994321] Avg episode reward: [(0, '187.307')] +[2023-07-08 15:17:26,388][994606] Updated weights for policy 0, policy_version 10960 (0.0006) +[2023-07-08 15:17:26,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8328.5, 300 sec: 8233.7). Total num frames: 5611520. Throughput: 0: 8329.7. Samples: 5607108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:17:26,534][994321] Avg episode reward: [(0, '198.351')] +[2023-07-08 15:17:31,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8260.3, 300 sec: 8219.8). Total num frames: 5648384. Throughput: 0: 8259.9. Samples: 5630860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:17:31,534][994321] Avg episode reward: [(0, '187.056')] +[2023-07-08 15:17:31,541][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011040_5652480.pth... +[2023-07-08 15:17:31,541][994606] Updated weights for policy 0, policy_version 11040 (0.0005) +[2023-07-08 15:17:31,542][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010568_5410816.pth +[2023-07-08 15:17:36,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8260.3, 300 sec: 8219.8). Total num frames: 5689344. Throughput: 0: 8210.3. Samples: 5677120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:17:36,534][994321] Avg episode reward: [(0, '191.429')] +[2023-07-08 15:17:36,834][994606] Updated weights for policy 0, policy_version 11120 (0.0005) +[2023-07-08 15:17:41,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8192.0, 300 sec: 8233.7). Total num frames: 5730304. Throughput: 0: 8137.5. Samples: 5725900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:17:41,534][994321] Avg episode reward: [(0, '186.335')] +[2023-07-08 15:17:41,743][994606] Updated weights for policy 0, policy_version 11200 (0.0005) +[2023-07-08 15:17:46,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8233.7). Total num frames: 5771264. Throughput: 0: 8103.5. Samples: 5750968. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:17:46,534][994321] Avg episode reward: [(0, '193.053')] +[2023-07-08 15:17:46,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011272_5771264.pth... +[2023-07-08 15:17:46,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010800_5529600.pth +[2023-07-08 15:17:46,611][994606] Updated weights for policy 0, policy_version 11280 (0.0004) +[2023-07-08 15:17:51,514][994606] Updated weights for policy 0, policy_version 11360 (0.0005) +[2023-07-08 15:17:51,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8260.3, 300 sec: 8247.5). Total num frames: 5816320. Throughput: 0: 8080.3. Samples: 5800760. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:17:51,534][994321] Avg episode reward: [(0, '204.090')] +[2023-07-08 15:17:56,414][994606] Updated weights for policy 0, policy_version 11440 (0.0006) +[2023-07-08 15:17:56,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8192.0, 300 sec: 8233.7). Total num frames: 5857280. Throughput: 0: 8157.6. Samples: 5852504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:17:56,534][994321] Avg episode reward: [(0, '211.294')] +[2023-07-08 15:17:56,535][994562] Saving new best policy, reward=211.294! +[2023-07-08 15:18:01,534][994321] Fps is (10 sec: 7782.5, 60 sec: 8055.5, 300 sec: 8219.8). Total num frames: 5894144. Throughput: 0: 8165.2. Samples: 5876620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:18:01,534][994321] Avg episode reward: [(0, '206.274')] +[2023-07-08 15:18:01,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011512_5894144.pth... +[2023-07-08 15:18:01,538][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011040_5652480.pth +[2023-07-08 15:18:01,607][994606] Updated weights for policy 0, policy_version 11520 (0.0005) +[2023-07-08 15:18:06,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8123.7, 300 sec: 8205.9). Total num frames: 5935104. Throughput: 0: 8098.2. Samples: 5922700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:18:06,534][994321] Avg episode reward: [(0, '203.898')] +[2023-07-08 15:18:06,819][994606] Updated weights for policy 0, policy_version 11600 (0.0005) +[2023-07-08 15:18:11,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.7, 300 sec: 8205.9). Total num frames: 5976064. Throughput: 0: 8075.5. Samples: 5970504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:18:11,534][994321] Avg episode reward: [(0, '211.343')] +[2023-07-08 15:18:11,535][994562] Saving new best policy, reward=211.343! +[2023-07-08 15:18:11,980][994606] Updated weights for policy 0, policy_version 11680 (0.0006) +[2023-07-08 15:18:16,534][994321] Fps is (10 sec: 7782.3, 60 sec: 8055.4, 300 sec: 8192.0). Total num frames: 6012928. Throughput: 0: 8032.5. Samples: 5992324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:18:16,535][994321] Avg episode reward: [(0, '214.555')] +[2023-07-08 15:18:16,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011744_6012928.pth... +[2023-07-08 15:18:16,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011272_5771264.pth +[2023-07-08 15:18:16,540][994562] Saving new best policy, reward=214.555! +[2023-07-08 15:18:17,458][994606] Updated weights for policy 0, policy_version 11760 (0.0005) +[2023-07-08 15:18:21,534][994321] Fps is (10 sec: 7372.8, 60 sec: 7987.2, 300 sec: 8178.1). Total num frames: 6049792. Throughput: 0: 8059.8. Samples: 6039808. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 15:18:21,534][994321] Avg episode reward: [(0, '211.107')] +[2023-07-08 15:18:22,684][994606] Updated weights for policy 0, policy_version 11840 (0.0005) +[2023-07-08 15:18:26,534][994321] Fps is (10 sec: 7782.6, 60 sec: 7987.2, 300 sec: 8178.1). Total num frames: 6090752. Throughput: 0: 8024.5. Samples: 6087004. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 15:18:26,534][994321] Avg episode reward: [(0, '198.043')] +[2023-07-08 15:18:27,638][994606] Updated weights for policy 0, policy_version 11920 (0.0005) +[2023-07-08 15:18:31,534][994321] Fps is (10 sec: 8191.8, 60 sec: 8055.4, 300 sec: 8192.0). Total num frames: 6131712. Throughput: 0: 8005.8. Samples: 6111232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:18:31,535][994321] Avg episode reward: [(0, '209.288')] +[2023-07-08 15:18:31,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011976_6131712.pth... +[2023-07-08 15:18:31,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011512_5894144.pth +[2023-07-08 15:18:32,696][994606] Updated weights for policy 0, policy_version 12000 (0.0005) +[2023-07-08 15:18:36,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8055.5, 300 sec: 8178.1). Total num frames: 6172672. Throughput: 0: 7991.7. Samples: 6160384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:18:36,534][994321] Avg episode reward: [(0, '206.032')] +[2023-07-08 15:18:37,941][994606] Updated weights for policy 0, policy_version 12080 (0.0005) +[2023-07-08 15:18:41,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8055.5, 300 sec: 8178.1). Total num frames: 6213632. Throughput: 0: 7907.6. Samples: 6208344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:18:41,534][994321] Avg episode reward: [(0, '196.580')] +[2023-07-08 15:18:42,929][994606] Updated weights for policy 0, policy_version 12160 (0.0005) +[2023-07-08 15:18:46,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8055.5, 300 sec: 8192.0). Total num frames: 6254592. Throughput: 0: 7902.0. Samples: 6232212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:18:46,534][994321] Avg episode reward: [(0, '230.738')] +[2023-07-08 15:18:46,536][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012216_6254592.pth... +[2023-07-08 15:18:46,538][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011744_6012928.pth +[2023-07-08 15:18:46,538][994562] Saving new best policy, reward=230.738! +[2023-07-08 15:18:47,702][994606] Updated weights for policy 0, policy_version 12240 (0.0005) +[2023-07-08 15:18:51,534][994321] Fps is (10 sec: 8192.0, 60 sec: 7987.2, 300 sec: 8192.0). Total num frames: 6295552. Throughput: 0: 8007.8. Samples: 6283052. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 15:18:51,534][994321] Avg episode reward: [(0, '208.564')] +[2023-07-08 15:18:52,933][994606] Updated weights for policy 0, policy_version 12320 (0.0005) +[2023-07-08 15:18:56,534][994321] Fps is (10 sec: 8191.9, 60 sec: 7987.2, 300 sec: 8192.0). Total num frames: 6336512. Throughput: 0: 8012.4. Samples: 6331064. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 15:18:56,534][994321] Avg episode reward: [(0, '212.497')] +[2023-07-08 15:18:57,903][994606] Updated weights for policy 0, policy_version 12400 (0.0005) +[2023-07-08 15:19:01,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8055.5, 300 sec: 8192.0). Total num frames: 6377472. Throughput: 0: 8093.6. Samples: 6356536. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:19:01,535][994321] Avg episode reward: [(0, '194.832')] +[2023-07-08 15:19:01,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012456_6377472.pth... +[2023-07-08 15:19:01,541][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011976_6131712.pth +[2023-07-08 15:19:02,814][994606] Updated weights for policy 0, policy_version 12480 (0.0005) +[2023-07-08 15:19:06,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8055.5, 300 sec: 8192.0). Total num frames: 6418432. Throughput: 0: 8132.9. Samples: 6405788. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:19:06,535][994321] Avg episode reward: [(0, '204.194')] +[2023-07-08 15:19:07,997][994606] Updated weights for policy 0, policy_version 12560 (0.0005) +[2023-07-08 15:19:11,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8055.5, 300 sec: 8192.0). Total num frames: 6459392. Throughput: 0: 8146.7. Samples: 6453608. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 15:19:11,534][994321] Avg episode reward: [(0, '210.158')] +[2023-07-08 15:19:12,803][994606] Updated weights for policy 0, policy_version 12640 (0.0005) +[2023-07-08 15:19:16,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8123.7, 300 sec: 8178.1). Total num frames: 6500352. Throughput: 0: 8190.8. Samples: 6479816. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 15:19:16,534][994321] Avg episode reward: [(0, '221.859')] +[2023-07-08 15:19:16,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012696_6500352.pth... +[2023-07-08 15:19:16,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012216_6254592.pth +[2023-07-08 15:19:17,758][994606] Updated weights for policy 0, policy_version 12720 (0.0005) +[2023-07-08 15:19:21,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 6541312. Throughput: 0: 8191.7. Samples: 6529012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:19:21,535][994321] Avg episode reward: [(0, '227.926')] +[2023-07-08 15:19:22,625][994606] Updated weights for policy 0, policy_version 12800 (0.0005) +[2023-07-08 15:19:26,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 6582272. Throughput: 0: 8218.7. Samples: 6578184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:19:26,535][994321] Avg episode reward: [(0, '216.197')] +[2023-07-08 15:19:27,755][994606] Updated weights for policy 0, policy_version 12880 (0.0005) +[2023-07-08 15:19:31,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 6623232. Throughput: 0: 8235.8. Samples: 6602824. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:19:31,535][994321] Avg episode reward: [(0, '211.594')] +[2023-07-08 15:19:31,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012936_6623232.pth... +[2023-07-08 15:19:31,541][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012456_6377472.pth +[2023-07-08 15:19:32,781][994606] Updated weights for policy 0, policy_version 12960 (0.0005) +[2023-07-08 15:19:36,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 6664192. Throughput: 0: 8194.9. Samples: 6651824. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:19:36,551][994321] Avg episode reward: [(0, '225.361')] +[2023-07-08 15:19:37,772][994606] Updated weights for policy 0, policy_version 13040 (0.0005) +[2023-07-08 15:19:41,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 6705152. Throughput: 0: 8209.0. Samples: 6700468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:19:41,535][994321] Avg episode reward: [(0, '210.713')] +[2023-07-08 15:19:42,944][994606] Updated weights for policy 0, policy_version 13120 (0.0005) +[2023-07-08 15:19:46,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8178.1). Total num frames: 6746112. Throughput: 0: 8147.4. Samples: 6723168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:19:46,535][994321] Avg episode reward: [(0, '224.326')] +[2023-07-08 15:19:46,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013176_6746112.pth... +[2023-07-08 15:19:46,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012696_6500352.pth +[2023-07-08 15:19:47,749][994606] Updated weights for policy 0, policy_version 13200 (0.0005) +[2023-07-08 15:19:51,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 6787072. Throughput: 0: 8196.0. Samples: 6774608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:19:51,535][994321] Avg episode reward: [(0, '210.606')] +[2023-07-08 15:19:52,739][994606] Updated weights for policy 0, policy_version 13280 (0.0005) +[2023-07-08 15:19:56,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8178.1). Total num frames: 6828032. Throughput: 0: 8229.4. Samples: 6823932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:19:56,535][994321] Avg episode reward: [(0, '212.650')] +[2023-07-08 15:19:57,886][994606] Updated weights for policy 0, policy_version 13360 (0.0005) +[2023-07-08 15:20:01,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 6868992. Throughput: 0: 8172.1. Samples: 6847560. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:20:01,535][994321] Avg episode reward: [(0, '237.171')] +[2023-07-08 15:20:01,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013416_6868992.pth... +[2023-07-08 15:20:01,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012936_6623232.pth +[2023-07-08 15:20:01,541][994562] Saving new best policy, reward=237.171! +[2023-07-08 15:20:02,535][994606] Updated weights for policy 0, policy_version 13440 (0.0005) +[2023-07-08 15:20:06,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8260.3, 300 sec: 8205.9). Total num frames: 6914048. Throughput: 0: 8229.9. Samples: 6899360. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:20:06,534][994321] Avg episode reward: [(0, '227.889')] +[2023-07-08 15:20:07,161][994606] Updated weights for policy 0, policy_version 13520 (0.0005) +[2023-07-08 15:20:11,534][994321] Fps is (10 sec: 9011.2, 60 sec: 8328.5, 300 sec: 8219.8). Total num frames: 6959104. Throughput: 0: 8319.2. Samples: 6952548. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 15:20:11,534][994321] Avg episode reward: [(0, '253.565')] +[2023-07-08 15:20:11,535][994562] Saving new best policy, reward=253.565! +[2023-07-08 15:20:11,988][994606] Updated weights for policy 0, policy_version 13600 (0.0005) +[2023-07-08 15:20:16,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8260.3, 300 sec: 8205.9). Total num frames: 6995968. Throughput: 0: 8302.9. Samples: 6976452. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 15:20:16,534][994321] Avg episode reward: [(0, '238.811')] +[2023-07-08 15:20:16,536][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013664_6995968.pth... +[2023-07-08 15:20:16,538][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013176_6746112.pth +[2023-07-08 15:20:17,186][994606] Updated weights for policy 0, policy_version 13680 (0.0005) +[2023-07-08 15:20:21,534][994321] Fps is (10 sec: 7782.3, 60 sec: 8260.3, 300 sec: 8192.0). Total num frames: 7036928. Throughput: 0: 8272.4. Samples: 7024084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:20:21,534][994321] Avg episode reward: [(0, '222.220')] +[2023-07-08 15:20:22,379][994606] Updated weights for policy 0, policy_version 13760 (0.0005) +[2023-07-08 15:20:26,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8260.3, 300 sec: 8205.9). Total num frames: 7077888. Throughput: 0: 8286.9. Samples: 7073376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:20:26,534][994321] Avg episode reward: [(0, '235.037')] +[2023-07-08 15:20:27,006][994606] Updated weights for policy 0, policy_version 13840 (0.0005) +[2023-07-08 15:20:31,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8328.5, 300 sec: 8219.8). Total num frames: 7122944. Throughput: 0: 8404.2. Samples: 7101356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:20:31,534][994321] Avg episode reward: [(0, '224.092')] +[2023-07-08 15:20:31,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013912_7122944.pth... +[2023-07-08 15:20:31,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013416_6868992.pth +[2023-07-08 15:20:31,980][994606] Updated weights for policy 0, policy_version 13920 (0.0005) +[2023-07-08 15:20:36,534][994321] Fps is (10 sec: 8601.4, 60 sec: 8328.5, 300 sec: 8219.8). Total num frames: 7163904. Throughput: 0: 8336.2. Samples: 7149736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:20:36,534][994321] Avg episode reward: [(0, '229.130')] +[2023-07-08 15:20:37,014][994606] Updated weights for policy 0, policy_version 14000 (0.0005) +[2023-07-08 15:20:41,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8260.3, 300 sec: 8205.9). Total num frames: 7200768. Throughput: 0: 8319.7. Samples: 7198320. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 15:20:41,534][994321] Avg episode reward: [(0, '226.709')] +[2023-07-08 15:20:42,054][994606] Updated weights for policy 0, policy_version 14080 (0.0005) +[2023-07-08 15:20:46,534][994321] Fps is (10 sec: 7782.5, 60 sec: 8260.3, 300 sec: 8192.0). Total num frames: 7241728. Throughput: 0: 8304.3. Samples: 7221256. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-07-08 15:20:46,534][994321] Avg episode reward: [(0, '238.024')] +[2023-07-08 15:20:46,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014144_7241728.pth... +[2023-07-08 15:20:46,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013664_6995968.pth +[2023-07-08 15:20:47,387][994606] Updated weights for policy 0, policy_version 14160 (0.0006) +[2023-07-08 15:20:51,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8260.3, 300 sec: 8192.0). Total num frames: 7282688. Throughput: 0: 8210.7. Samples: 7268840. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:20:51,535][994321] Avg episode reward: [(0, '233.854')] +[2023-07-08 15:20:52,429][994606] Updated weights for policy 0, policy_version 14240 (0.0005) +[2023-07-08 15:20:56,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8260.3, 300 sec: 8205.9). Total num frames: 7323648. Throughput: 0: 8126.0. Samples: 7318216. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:20:56,534][994321] Avg episode reward: [(0, '215.659')] +[2023-07-08 15:20:57,388][994606] Updated weights for policy 0, policy_version 14320 (0.0005) +[2023-07-08 15:21:01,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8260.3, 300 sec: 8192.0). Total num frames: 7364608. Throughput: 0: 8128.7. Samples: 7342244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:21:01,534][994321] Avg episode reward: [(0, '233.157')] +[2023-07-08 15:21:01,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014384_7364608.pth... +[2023-07-08 15:21:01,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013912_7122944.pth +[2023-07-08 15:21:02,297][994606] Updated weights for policy 0, policy_version 14400 (0.0006) +[2023-07-08 15:21:06,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 7405568. Throughput: 0: 8205.8. Samples: 7393344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:21:06,534][994321] Avg episode reward: [(0, '238.277')] +[2023-07-08 15:21:07,235][994606] Updated weights for policy 0, policy_version 14480 (0.0005) +[2023-07-08 15:21:11,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.7, 300 sec: 8192.0). Total num frames: 7446528. Throughput: 0: 8195.5. Samples: 7442176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:21:11,534][994321] Avg episode reward: [(0, '222.478')] +[2023-07-08 15:21:12,380][994606] Updated weights for policy 0, policy_version 14560 (0.0005) +[2023-07-08 15:21:16,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 7487488. Throughput: 0: 8121.4. Samples: 7466820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:21:16,534][994321] Avg episode reward: [(0, '226.196')] +[2023-07-08 15:21:16,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014624_7487488.pth... +[2023-07-08 15:21:16,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014144_7241728.pth +[2023-07-08 15:21:17,568][994606] Updated weights for policy 0, policy_version 14640 (0.0005) +[2023-07-08 15:21:21,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8123.7, 300 sec: 8178.1). Total num frames: 7524352. Throughput: 0: 8070.2. Samples: 7512892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:21:21,534][994321] Avg episode reward: [(0, '231.290')] +[2023-07-08 15:21:22,589][994606] Updated weights for policy 0, policy_version 14720 (0.0005) +[2023-07-08 15:21:26,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 7569408. Throughput: 0: 8097.8. Samples: 7562720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:21:26,534][994321] Avg episode reward: [(0, '223.281')] +[2023-07-08 15:21:27,610][994606] Updated weights for policy 0, policy_version 14800 (0.0005) +[2023-07-08 15:21:31,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8123.7, 300 sec: 8192.0). Total num frames: 7610368. Throughput: 0: 8125.1. Samples: 7586884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:21:31,534][994321] Avg episode reward: [(0, '214.643')] +[2023-07-08 15:21:31,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014864_7610368.pth... +[2023-07-08 15:21:31,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014384_7364608.pth +[2023-07-08 15:21:32,060][994606] Updated weights for policy 0, policy_version 14880 (0.0005) +[2023-07-08 15:21:36,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.7, 300 sec: 8178.1). Total num frames: 7651328. Throughput: 0: 8225.9. Samples: 7639004. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 15:21:36,534][994321] Avg episode reward: [(0, '217.252')] +[2023-07-08 15:21:37,439][994606] Updated weights for policy 0, policy_version 14960 (0.0005) +[2023-07-08 15:21:41,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8178.1). Total num frames: 7692288. Throughput: 0: 8199.0. Samples: 7687172. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 15:21:41,534][994321] Avg episode reward: [(0, '220.401')] +[2023-07-08 15:21:42,510][994606] Updated weights for policy 0, policy_version 15040 (0.0004) +[2023-07-08 15:21:46,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8178.1). Total num frames: 7733248. Throughput: 0: 8170.8. Samples: 7709928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:21:46,534][994321] Avg episode reward: [(0, '236.481')] +[2023-07-08 15:21:46,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015104_7733248.pth... +[2023-07-08 15:21:46,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014624_7487488.pth +[2023-07-08 15:21:47,156][994606] Updated weights for policy 0, policy_version 15120 (0.0005) +[2023-07-08 15:21:51,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8164.2). Total num frames: 7774208. Throughput: 0: 8189.1. Samples: 7761852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:21:51,534][994321] Avg episode reward: [(0, '223.893')] +[2023-07-08 15:21:52,222][994606] Updated weights for policy 0, policy_version 15200 (0.0005) +[2023-07-08 15:21:56,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8192.0, 300 sec: 8150.3). Total num frames: 7815168. Throughput: 0: 8197.6. Samples: 7811068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:21:56,534][994321] Avg episode reward: [(0, '235.787')] +[2023-07-08 15:21:57,318][994606] Updated weights for policy 0, policy_version 15280 (0.0005) +[2023-07-08 15:22:01,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8192.0, 300 sec: 8164.2). Total num frames: 7856128. Throughput: 0: 8172.7. Samples: 7834592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:22:01,534][994321] Avg episode reward: [(0, '231.151')] +[2023-07-08 15:22:01,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015344_7856128.pth... +[2023-07-08 15:22:01,541][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014864_7610368.pth +[2023-07-08 15:22:02,625][994606] Updated weights for policy 0, policy_version 15360 (0.0004) +[2023-07-08 15:22:06,534][994321] Fps is (10 sec: 7782.5, 60 sec: 8123.7, 300 sec: 8150.3). Total num frames: 7892992. Throughput: 0: 8197.3. Samples: 7881768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:22:06,534][994321] Avg episode reward: [(0, '224.128')] +[2023-07-08 15:22:07,730][994606] Updated weights for policy 0, policy_version 15440 (0.0005) +[2023-07-08 15:22:11,534][994321] Fps is (10 sec: 7782.5, 60 sec: 8123.7, 300 sec: 8150.3). Total num frames: 7933952. Throughput: 0: 8154.9. Samples: 7929688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:22:11,534][994321] Avg episode reward: [(0, '227.071')] +[2023-07-08 15:22:12,816][994606] Updated weights for policy 0, policy_version 15520 (0.0005) +[2023-07-08 15:22:16,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8123.7, 300 sec: 8150.3). Total num frames: 7974912. Throughput: 0: 8166.5. Samples: 7954376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:22:16,534][994321] Avg episode reward: [(0, '229.857')] +[2023-07-08 15:22:16,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015576_7974912.pth... +[2023-07-08 15:22:16,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015104_7733248.pth +[2023-07-08 15:22:17,689][994606] Updated weights for policy 0, policy_version 15600 (0.0005) +[2023-07-08 15:22:21,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8150.3). Total num frames: 8015872. Throughput: 0: 8103.4. Samples: 8003656. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:22:21,534][994321] Avg episode reward: [(0, '223.492')] +[2023-07-08 15:22:22,698][994606] Updated weights for policy 0, policy_version 15680 (0.0006) +[2023-07-08 15:22:26,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8123.8, 300 sec: 8164.2). Total num frames: 8056832. Throughput: 0: 8141.7. Samples: 8053548. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:22:26,534][994321] Avg episode reward: [(0, '221.742')] +[2023-07-08 15:22:27,564][994606] Updated weights for policy 0, policy_version 15760 (0.0005) +[2023-07-08 15:22:31,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8192.0, 300 sec: 8178.1). Total num frames: 8101888. Throughput: 0: 8206.5. Samples: 8079220. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:22:31,534][994321] Avg episode reward: [(0, '228.286')] +[2023-07-08 15:22:31,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015824_8101888.pth... +[2023-07-08 15:22:31,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015344_7856128.pth +[2023-07-08 15:22:32,658][994606] Updated weights for policy 0, policy_version 15840 (0.0005) +[2023-07-08 15:22:36,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8123.7, 300 sec: 8164.2). Total num frames: 8138752. Throughput: 0: 8101.6. Samples: 8126424. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-07-08 15:22:36,534][994321] Avg episode reward: [(0, '225.351')] +[2023-07-08 15:22:37,794][994606] Updated weights for policy 0, policy_version 15920 (0.0005) +[2023-07-08 15:22:41,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8123.7, 300 sec: 8164.2). Total num frames: 8179712. Throughput: 0: 8100.1. Samples: 8175572. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:22:41,534][994321] Avg episode reward: [(0, '224.327')] +[2023-07-08 15:22:42,725][994606] Updated weights for policy 0, policy_version 16000 (0.0005) +[2023-07-08 15:22:46,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.7, 300 sec: 8150.3). Total num frames: 8220672. Throughput: 0: 8124.1. Samples: 8200176. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:22:46,534][994321] Avg episode reward: [(0, '239.904')] +[2023-07-08 15:22:46,536][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016056_8220672.pth... +[2023-07-08 15:22:46,538][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015576_7974912.pth +[2023-07-08 15:22:47,710][994606] Updated weights for policy 0, policy_version 16080 (0.0005) +[2023-07-08 15:22:51,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.7, 300 sec: 8150.3). Total num frames: 8261632. Throughput: 0: 8166.5. Samples: 8249260. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:22:51,534][994321] Avg episode reward: [(0, '223.213')] +[2023-07-08 15:22:52,823][994606] Updated weights for policy 0, policy_version 16160 (0.0005) +[2023-07-08 15:22:56,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.7, 300 sec: 8164.2). Total num frames: 8302592. Throughput: 0: 8195.4. Samples: 8298480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:22:56,534][994321] Avg episode reward: [(0, '229.037')] +[2023-07-08 15:22:57,634][994606] Updated weights for policy 0, policy_version 16240 (0.0005) +[2023-07-08 15:23:01,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8123.8, 300 sec: 8164.2). Total num frames: 8343552. Throughput: 0: 8207.2. Samples: 8323700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:23:01,534][994321] Avg episode reward: [(0, '225.866')] +[2023-07-08 15:23:01,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016296_8343552.pth... +[2023-07-08 15:23:01,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015824_8101888.pth +[2023-07-08 15:23:02,679][994606] Updated weights for policy 0, policy_version 16320 (0.0005) +[2023-07-08 15:23:06,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8164.2). Total num frames: 8384512. Throughput: 0: 8185.4. Samples: 8372000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:23:06,534][994321] Avg episode reward: [(0, '232.106')] +[2023-07-08 15:23:07,719][994606] Updated weights for policy 0, policy_version 16400 (0.0005) +[2023-07-08 15:23:11,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8192.0, 300 sec: 8178.1). Total num frames: 8425472. Throughput: 0: 8171.3. Samples: 8421256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:23:11,534][994321] Avg episode reward: [(0, '235.300')] +[2023-07-08 15:23:12,867][994606] Updated weights for policy 0, policy_version 16480 (0.0005) +[2023-07-08 15:23:16,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 8466432. Throughput: 0: 8151.0. Samples: 8446016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:23:16,534][994321] Avg episode reward: [(0, '229.815')] +[2023-07-08 15:23:16,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016536_8466432.pth... +[2023-07-08 15:23:16,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016056_8220672.pth +[2023-07-08 15:23:17,699][994606] Updated weights for policy 0, policy_version 16560 (0.0005) +[2023-07-08 15:23:21,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 8507392. Throughput: 0: 8190.5. Samples: 8494996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:23:21,534][994321] Avg episode reward: [(0, '237.519')] +[2023-07-08 15:23:22,915][994606] Updated weights for policy 0, policy_version 16640 (0.0005) +[2023-07-08 15:23:26,534][994321] Fps is (10 sec: 8601.7, 60 sec: 8260.3, 300 sec: 8205.9). Total num frames: 8552448. Throughput: 0: 8211.8. Samples: 8545104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:23:26,534][994321] Avg episode reward: [(0, '231.703')] +[2023-07-08 15:23:27,548][994606] Updated weights for policy 0, policy_version 16720 (0.0005) +[2023-07-08 15:23:31,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.7, 300 sec: 8192.0). Total num frames: 8589312. Throughput: 0: 8211.6. Samples: 8569700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:23:31,534][994321] Avg episode reward: [(0, '222.137')] +[2023-07-08 15:23:31,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016776_8589312.pth... +[2023-07-08 15:23:31,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016296_8343552.pth +[2023-07-08 15:23:32,632][994606] Updated weights for policy 0, policy_version 16800 (0.0005) +[2023-07-08 15:23:36,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8260.2, 300 sec: 8205.9). Total num frames: 8634368. Throughput: 0: 8221.0. Samples: 8619208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:23:36,535][994321] Avg episode reward: [(0, '236.335')] +[2023-07-08 15:23:37,473][994606] Updated weights for policy 0, policy_version 16880 (0.0005) +[2023-07-08 15:23:41,534][994321] Fps is (10 sec: 8601.7, 60 sec: 8260.3, 300 sec: 8205.9). Total num frames: 8675328. Throughput: 0: 8275.2. Samples: 8670864. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:23:41,534][994321] Avg episode reward: [(0, '235.371')] +[2023-07-08 15:23:42,442][994606] Updated weights for policy 0, policy_version 16960 (0.0006) +[2023-07-08 15:23:46,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8260.3, 300 sec: 8205.9). Total num frames: 8716288. Throughput: 0: 8238.5. Samples: 8694432. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:23:46,534][994321] Avg episode reward: [(0, '240.212')] +[2023-07-08 15:23:46,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017024_8716288.pth... +[2023-07-08 15:23:46,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016536_8466432.pth +[2023-07-08 15:23:47,559][994606] Updated weights for policy 0, policy_version 17040 (0.0005) +[2023-07-08 15:23:51,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 8753152. Throughput: 0: 8209.0. Samples: 8741404. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:23:51,535][994321] Avg episode reward: [(0, '232.717')] +[2023-07-08 15:23:52,834][994606] Updated weights for policy 0, policy_version 17120 (0.0005) +[2023-07-08 15:23:56,534][994321] Fps is (10 sec: 7782.5, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 8794112. Throughput: 0: 8152.4. Samples: 8788112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:23:56,535][994321] Avg episode reward: [(0, '236.997')] +[2023-07-08 15:23:57,949][994606] Updated weights for policy 0, policy_version 17200 (0.0005) +[2023-07-08 15:24:01,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 8835072. Throughput: 0: 8164.0. Samples: 8813396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:24:01,535][994321] Avg episode reward: [(0, '222.288')] +[2023-07-08 15:24:01,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017256_8835072.pth... +[2023-07-08 15:24:01,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016776_8589312.pth +[2023-07-08 15:24:03,071][994606] Updated weights for policy 0, policy_version 17280 (0.0005) +[2023-07-08 15:24:06,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8123.7, 300 sec: 8178.1). Total num frames: 8871936. Throughput: 0: 8104.8. Samples: 8859712. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:24:06,535][994321] Avg episode reward: [(0, '241.350')] +[2023-07-08 15:24:08,252][994606] Updated weights for policy 0, policy_version 17360 (0.0005) +[2023-07-08 15:24:11,534][994321] Fps is (10 sec: 7782.3, 60 sec: 8123.7, 300 sec: 8178.1). Total num frames: 8912896. Throughput: 0: 8083.7. Samples: 8908872. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:24:11,535][994321] Avg episode reward: [(0, '236.221')] +[2023-07-08 15:24:13,392][994606] Updated weights for policy 0, policy_version 17440 (0.0006) +[2023-07-08 15:24:16,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8123.7, 300 sec: 8178.1). Total num frames: 8953856. Throughput: 0: 8057.1. Samples: 8932272. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-07-08 15:24:16,535][994321] Avg episode reward: [(0, '234.216')] +[2023-07-08 15:24:16,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017488_8953856.pth... +[2023-07-08 15:24:16,541][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017024_8716288.pth +[2023-07-08 15:24:18,173][994606] Updated weights for policy 0, policy_version 17520 (0.0005) +[2023-07-08 15:24:21,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 8998912. Throughput: 0: 8124.0. Samples: 8984788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:24:21,535][994321] Avg episode reward: [(0, '239.166')] +[2023-07-08 15:24:22,824][994606] Updated weights for policy 0, policy_version 17600 (0.0005) +[2023-07-08 15:24:26,534][994321] Fps is (10 sec: 8601.7, 60 sec: 8123.7, 300 sec: 8192.0). Total num frames: 9039872. Throughput: 0: 8083.9. Samples: 9034640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:24:26,535][994321] Avg episode reward: [(0, '234.693')] +[2023-07-08 15:24:27,894][994606] Updated weights for policy 0, policy_version 17680 (0.0004) +[2023-07-08 15:24:31,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 9080832. Throughput: 0: 8104.1. Samples: 9059116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:24:31,535][994321] Avg episode reward: [(0, '228.706')] +[2023-07-08 15:24:31,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017736_9080832.pth... +[2023-07-08 15:24:31,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017256_8835072.pth +[2023-07-08 15:24:32,671][994606] Updated weights for policy 0, policy_version 17760 (0.0005) +[2023-07-08 15:24:36,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8192.0, 300 sec: 8205.9). Total num frames: 9125888. Throughput: 0: 8194.4. Samples: 9110152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:24:36,535][994321] Avg episode reward: [(0, '236.921')] +[2023-07-08 15:24:37,449][994606] Updated weights for policy 0, policy_version 17840 (0.0005) +[2023-07-08 15:24:41,534][994321] Fps is (10 sec: 8601.7, 60 sec: 8192.0, 300 sec: 8205.9). Total num frames: 9166848. Throughput: 0: 8345.6. Samples: 9163664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:24:41,534][994321] Avg episode reward: [(0, '233.822')] +[2023-07-08 15:24:42,157][994606] Updated weights for policy 0, policy_version 17920 (0.0006) +[2023-07-08 15:24:46,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8260.3, 300 sec: 8219.8). Total num frames: 9211904. Throughput: 0: 8349.9. Samples: 9189140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:24:46,534][994321] Avg episode reward: [(0, '244.859')] +[2023-07-08 15:24:46,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017992_9211904.pth... +[2023-07-08 15:24:46,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017488_8953856.pth +[2023-07-08 15:24:46,802][994606] Updated weights for policy 0, policy_version 18000 (0.0005) +[2023-07-08 15:24:51,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8328.5, 300 sec: 8219.8). Total num frames: 9252864. Throughput: 0: 8455.5. Samples: 9240208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:24:51,534][994321] Avg episode reward: [(0, '249.386')] +[2023-07-08 15:24:51,650][994606] Updated weights for policy 0, policy_version 18080 (0.0005) +[2023-07-08 15:24:56,524][994606] Updated weights for policy 0, policy_version 18160 (0.0005) +[2023-07-08 15:24:56,534][994321] Fps is (10 sec: 8601.7, 60 sec: 8396.8, 300 sec: 8233.7). Total num frames: 9297920. Throughput: 0: 8504.0. Samples: 9291552. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 15:24:56,534][994321] Avg episode reward: [(0, '243.096')] +[2023-07-08 15:25:01,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8328.5, 300 sec: 8205.9). Total num frames: 9334784. Throughput: 0: 8520.0. Samples: 9315672. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 15:25:01,534][994321] Avg episode reward: [(0, '239.359')] +[2023-07-08 15:25:01,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018232_9334784.pth... +[2023-07-08 15:25:01,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017736_9080832.pth +[2023-07-08 15:25:01,647][994606] Updated weights for policy 0, policy_version 18240 (0.0005) +[2023-07-08 15:25:06,303][994606] Updated weights for policy 0, policy_version 18320 (0.0005) +[2023-07-08 15:25:06,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8465.1, 300 sec: 8205.9). Total num frames: 9379840. Throughput: 0: 8441.6. Samples: 9364660. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-07-08 15:25:06,534][994321] Avg episode reward: [(0, '256.128')] +[2023-07-08 15:25:06,535][994562] Saving new best policy, reward=256.128! +[2023-07-08 15:25:11,453][994606] Updated weights for policy 0, policy_version 18400 (0.0006) +[2023-07-08 15:25:11,534][994321] Fps is (10 sec: 8601.6, 60 sec: 8465.1, 300 sec: 8219.8). Total num frames: 9420800. Throughput: 0: 8444.7. Samples: 9414652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:25:11,534][994321] Avg episode reward: [(0, '241.193')] +[2023-07-08 15:25:16,243][994606] Updated weights for policy 0, policy_version 18480 (0.0005) +[2023-07-08 15:25:16,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8465.1, 300 sec: 8219.8). Total num frames: 9461760. Throughput: 0: 8444.6. Samples: 9439124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:25:16,534][994321] Avg episode reward: [(0, '246.639')] +[2023-07-08 15:25:16,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018480_9461760.pth... +[2023-07-08 15:25:16,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017992_9211904.pth +[2023-07-08 15:25:21,298][994606] Updated weights for policy 0, policy_version 18560 (0.0005) +[2023-07-08 15:25:21,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8396.8, 300 sec: 8219.8). Total num frames: 9502720. Throughput: 0: 8449.7. Samples: 9490388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:25:21,535][994321] Avg episode reward: [(0, '231.191')] +[2023-07-08 15:25:26,382][994606] Updated weights for policy 0, policy_version 18640 (0.0005) +[2023-07-08 15:25:26,534][994321] Fps is (10 sec: 8192.2, 60 sec: 8396.8, 300 sec: 8205.9). Total num frames: 9543680. Throughput: 0: 8348.2. Samples: 9539332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:25:26,535][994321] Avg episode reward: [(0, '252.708')] +[2023-07-08 15:25:31,141][994606] Updated weights for policy 0, policy_version 18720 (0.0006) +[2023-07-08 15:25:31,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8396.8, 300 sec: 8205.9). Total num frames: 9584640. Throughput: 0: 8352.1. Samples: 9564984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:25:31,535][994321] Avg episode reward: [(0, '238.122')] +[2023-07-08 15:25:31,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018720_9584640.pth... +[2023-07-08 15:25:31,539][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018232_9334784.pth +[2023-07-08 15:25:35,838][994606] Updated weights for policy 0, policy_version 18800 (0.0005) +[2023-07-08 15:25:36,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8396.8, 300 sec: 8233.7). Total num frames: 9629696. Throughput: 0: 8329.2. Samples: 9615020. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 15:25:36,535][994321] Avg episode reward: [(0, '243.689')] +[2023-07-08 15:25:40,885][994606] Updated weights for policy 0, policy_version 18880 (0.0005) +[2023-07-08 15:25:41,534][994321] Fps is (10 sec: 8601.5, 60 sec: 8396.8, 300 sec: 8233.7). Total num frames: 9670656. Throughput: 0: 8290.2. Samples: 9664612. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 15:25:41,534][994321] Avg episode reward: [(0, '232.168')] +[2023-07-08 15:25:45,866][994606] Updated weights for policy 0, policy_version 18960 (0.0005) +[2023-07-08 15:25:46,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8328.5, 300 sec: 8233.7). Total num frames: 9711616. Throughput: 0: 8339.6. Samples: 9690952. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-07-08 15:25:46,534][994321] Avg episode reward: [(0, '241.855')] +[2023-07-08 15:25:46,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018968_9711616.pth... +[2023-07-08 15:25:46,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018480_9461760.pth +[2023-07-08 15:25:50,997][994606] Updated weights for policy 0, policy_version 19040 (0.0005) +[2023-07-08 15:25:51,534][994321] Fps is (10 sec: 7782.5, 60 sec: 8260.3, 300 sec: 8219.8). Total num frames: 9748480. Throughput: 0: 8309.9. Samples: 9738604. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 15:25:51,534][994321] Avg episode reward: [(0, '224.046')] +[2023-07-08 15:25:56,438][994606] Updated weights for policy 0, policy_version 19120 (0.0006) +[2023-07-08 15:25:56,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8192.0, 300 sec: 8219.8). Total num frames: 9789440. Throughput: 0: 8211.8. Samples: 9784184. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-07-08 15:25:56,534][994321] Avg episode reward: [(0, '240.012')] +[2023-07-08 15:26:01,442][994606] Updated weights for policy 0, policy_version 19200 (0.0005) +[2023-07-08 15:26:01,534][994321] Fps is (10 sec: 8191.9, 60 sec: 8260.3, 300 sec: 8219.8). Total num frames: 9830400. Throughput: 0: 8190.1. Samples: 9807676. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:26:01,534][994321] Avg episode reward: [(0, '226.944')] +[2023-07-08 15:26:01,537][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019200_9830400.pth... +[2023-07-08 15:26:01,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018720_9584640.pth +[2023-07-08 15:26:06,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8123.7, 300 sec: 8205.9). Total num frames: 9867264. Throughput: 0: 8144.4. Samples: 9856884. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:26:06,534][994321] Avg episode reward: [(0, '241.173')] +[2023-07-08 15:26:06,536][994606] Updated weights for policy 0, policy_version 19280 (0.0005) +[2023-07-08 15:26:11,534][994321] Fps is (10 sec: 7782.4, 60 sec: 8123.7, 300 sec: 8205.9). Total num frames: 9908224. Throughput: 0: 8108.2. Samples: 9904200. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-07-08 15:26:11,535][994321] Avg episode reward: [(0, '233.525')] +[2023-07-08 15:26:11,679][994606] Updated weights for policy 0, policy_version 19360 (0.0005) +[2023-07-08 15:26:16,534][994321] Fps is (10 sec: 8192.0, 60 sec: 8123.7, 300 sec: 8219.8). Total num frames: 9949184. Throughput: 0: 8081.1. Samples: 9928632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:26:16,535][994321] Avg episode reward: [(0, '230.256')] +[2023-07-08 15:26:16,538][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019432_9949184.pth... +[2023-07-08 15:26:16,540][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018968_9711616.pth +[2023-07-08 15:26:16,841][994606] Updated weights for policy 0, policy_version 19440 (0.0005) +[2023-07-08 15:26:21,534][994321] Fps is (10 sec: 8192.1, 60 sec: 8123.8, 300 sec: 8205.9). Total num frames: 9990144. Throughput: 0: 8058.4. Samples: 9977648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-07-08 15:26:21,551][994321] Avg episode reward: [(0, '237.090')] +[2023-07-08 15:26:21,693][994606] Updated weights for policy 0, policy_version 19520 (0.0005) +[2023-07-08 15:26:22,964][994562] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 +[2023-07-08 15:26:22,965][994610] Stopping RolloutWorker_w3... +[2023-07-08 15:26:22,965][994662] Stopping RolloutWorker_w4... +[2023-07-08 15:26:22,965][994608] Stopping RolloutWorker_w0... +[2023-07-08 15:26:22,965][994607] Stopping RolloutWorker_w1... +[2023-07-08 15:26:22,965][994611] Stopping RolloutWorker_w5... +[2023-07-08 15:26:22,965][994609] Stopping RolloutWorker_w2... +[2023-07-08 15:26:22,966][994610] Loop rollout_proc3_evt_loop terminating... +[2023-07-08 15:26:22,965][994724] Stopping RolloutWorker_w6... +[2023-07-08 15:26:22,966][994608] Loop rollout_proc0_evt_loop terminating... +[2023-07-08 15:26:22,966][994662] Loop rollout_proc4_evt_loop terminating... +[2023-07-08 15:26:22,966][994607] Loop rollout_proc1_evt_loop terminating... +[2023-07-08 15:26:22,966][994611] Loop rollout_proc5_evt_loop terminating... +[2023-07-08 15:26:22,965][994738] Stopping RolloutWorker_w7... +[2023-07-08 15:26:22,965][994321] Component RolloutWorker_w3 stopped! +[2023-07-08 15:26:22,966][994724] Loop rollout_proc6_evt_loop terminating... +[2023-07-08 15:26:22,966][994609] Loop rollout_proc2_evt_loop terminating... +[2023-07-08 15:26:22,966][994562] Stopping Batcher_0... +[2023-07-08 15:26:22,966][994738] Loop rollout_proc7_evt_loop terminating... +[2023-07-08 15:26:22,966][994321] Component RolloutWorker_w4 stopped! +[2023-07-08 15:26:22,966][994562] Loop batcher_evt_loop terminating... +[2023-07-08 15:26:22,966][994321] Component RolloutWorker_w1 stopped! +[2023-07-08 15:26:22,966][994321] Component RolloutWorker_w5 stopped! +[2023-07-08 15:26:22,967][994321] Component RolloutWorker_w0 stopped! +[2023-07-08 15:26:22,967][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019544_10006528.pth... +[2023-07-08 15:26:22,967][994321] Component RolloutWorker_w2 stopped! +[2023-07-08 15:26:22,967][994321] Component RolloutWorker_w6 stopped! +[2023-07-08 15:26:22,967][994321] Component RolloutWorker_w7 stopped! +[2023-07-08 15:26:22,967][994321] Component Batcher_0 stopped! +[2023-07-08 15:26:22,970][994562] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019200_9830400.pth +[2023-07-08 15:26:22,971][994562] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019544_10006528.pth... +[2023-07-08 15:26:22,973][994562] Stopping LearnerWorker_p0... +[2023-07-08 15:26:22,974][994562] Loop learner_proc0_evt_loop terminating... +[2023-07-08 15:26:22,974][994321] Component LearnerWorker_p0 stopped! +[2023-07-08 15:26:23,029][994606] Weights refcount: 2 0 +[2023-07-08 15:26:23,030][994606] Stopping InferenceWorker_p0-w0... +[2023-07-08 15:26:23,030][994606] Loop inference_proc0-0_evt_loop terminating... +[2023-07-08 15:26:23,030][994321] Component InferenceWorker_p0-w0 stopped! +[2023-07-08 15:26:23,031][994321] Waiting for process learner_proc0 to stop... +[2023-07-08 15:26:23,702][994321] Waiting for process inference_proc0-0 to join... +[2023-07-08 15:26:23,702][994321] Waiting for process rollout_proc0 to join... +[2023-07-08 15:26:23,703][994321] Waiting for process rollout_proc1 to join... +[2023-07-08 15:26:23,703][994321] Waiting for process rollout_proc2 to join... +[2023-07-08 15:26:23,703][994321] Waiting for process rollout_proc3 to join... +[2023-07-08 15:26:23,703][994321] Waiting for process rollout_proc4 to join... +[2023-07-08 15:26:23,704][994321] Waiting for process rollout_proc5 to join... +[2023-07-08 15:26:23,704][994321] Waiting for process rollout_proc6 to join... +[2023-07-08 15:26:23,704][994321] Waiting for process rollout_proc7 to join... +[2023-07-08 15:26:23,705][994321] Batcher 0 profile tree view: +batching: 1.8416, releasing_batches: 1.5460 +[2023-07-08 15:26:23,705][994321] InferenceWorker_p0-w0 profile tree view: +wait_policy: 0.0052 + wait_policy_total: 497.2761 +update_model: 14.0616 weight_update: 0.0005 -one_step: 0.0006 - handle_policy_step: 581.8655 - deserialize: 23.8642, stack: 6.1317, obs_to_device_normalize: 105.7983, forward: 291.1840, send_messages: 38.9478 - prepare_outputs: 66.1446 - to_cpu: 10.1733 -[2023-07-07 22:44:00,966][754029] Learner 0 profile tree view: -misc: 0.0110, prepare_batch: 9.4976 -train: 96.9600 - epoch_init: 0.0347, minibatch_init: 1.3214, losses_postprocess: 1.2922, kl_divergence: 0.4435, after_optimizer: 0.6397 - calculate_losses: 41.2681 - losses_init: 0.0327, forward_head: 16.1197, bptt_initial: 0.1412, bptt: 0.1380, tail: 11.6243, advantages_returns: 0.8905, losses: 10.9071 - update: 50.3779 - clip: 5.9484 -[2023-07-07 22:44:00,966][754029] RolloutWorker_w0 profile tree view: -wait_for_trajectories: 0.2905, enqueue_policy_requests: 12.7622, env_step: 775.2068, overhead: 19.5771, complete_rollouts: 0.3137 -save_policy_outputs: 38.4339 - split_output_tensors: 13.2252 -[2023-07-07 22:44:00,966][754029] RolloutWorker_w7 profile tree view: -wait_for_trajectories: 0.2854, enqueue_policy_requests: 12.7767, env_step: 775.8060, overhead: 20.1599, complete_rollouts: 0.3325 -save_policy_outputs: 38.3716 - split_output_tensors: 13.1080 -[2023-07-07 22:44:00,966][754029] Loop Runner_EvtLoop terminating... -[2023-07-07 22:44:00,967][754029] Runner profile tree view: -main_loop: 1088.9653 -[2023-07-07 22:44:00,967][754029] Collected {0: 10006528}, FPS: 9189.0 +one_step: 0.0007 + handle_policy_step: 639.5570 + deserialize: 26.4253, stack: 6.8323, obs_to_device_normalize: 115.2911, forward: 317.6661, send_messages: 45.9212 + prepare_outputs: 71.2082 + to_cpu: 10.9389 +[2023-07-08 15:26:23,705][994321] Learner 0 profile tree view: +misc: 0.0097, prepare_batch: 8.2726 +train: 85.8422 + epoch_init: 0.0340, minibatch_init: 1.1990, losses_postprocess: 1.2559, kl_divergence: 0.4167, after_optimizer: 0.6288 + calculate_losses: 36.1888 + losses_init: 0.0312, forward_head: 13.7874, bptt_initial: 0.1249, bptt: 0.1260, tail: 10.5518, advantages_returns: 0.8118, losses: 9.4544 + update: 44.6623 + clip: 5.3864 +[2023-07-08 15:26:23,705][994321] RolloutWorker_w0 profile tree view: +wait_for_trajectories: 0.4595, enqueue_policy_requests: 15.4754, env_step: 793.2932, overhead: 21.8072, complete_rollouts: 0.3991 +save_policy_outputs: 43.4683 + split_output_tensors: 14.9225 +[2023-07-08 15:26:23,706][994321] RolloutWorker_w7 profile tree view: +wait_for_trajectories: 0.4249, enqueue_policy_requests: 15.1410, env_step: 795.0662, overhead: 22.0262, complete_rollouts: 0.4316 +save_policy_outputs: 43.2239 + split_output_tensors: 14.7876 +[2023-07-08 15:26:23,706][994321] Loop Runner_EvtLoop terminating... +[2023-07-08 15:26:23,706][994321] Runner profile tree view: +main_loop: 1234.2376 +[2023-07-08 15:26:23,706][994321] Collected {0: 10006528}, FPS: 8107.5