| [2023-07-17 01:12:11,340][285366] Saving configuration to /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/config.json... |
| [2023-07-17 01:12:11,360][285366] Rollout worker 0 uses device cpu |
| [2023-07-17 01:12:11,361][285366] Rollout worker 1 uses device cpu |
| [2023-07-17 01:12:11,361][285366] Rollout worker 2 uses device cpu |
| [2023-07-17 01:12:11,361][285366] Rollout worker 3 uses device cpu |
| [2023-07-17 01:12:11,361][285366] Rollout worker 4 uses device cpu |
| [2023-07-17 01:12:11,361][285366] Rollout worker 5 uses device cpu |
| [2023-07-17 01:12:11,361][285366] Rollout worker 6 uses device cpu |
| [2023-07-17 01:12:11,362][285366] Rollout worker 7 uses device cpu |
| [2023-07-17 01:12:11,362][285366] In synchronous mode, we only accumulate one batch. Setting num_batches_to_accumulate to 1 |
| [2023-07-17 01:12:11,380][285366] InferenceWorker_p0-w0: min num requests: 2 |
| [2023-07-17 01:12:11,411][285366] Starting all processes... |
| [2023-07-17 01:12:11,411][285366] Starting process learner_proc0 |
| [2023-07-17 01:12:11,459][285366] Starting all processes... |
| [2023-07-17 01:12:11,503][285366] Starting process inference_proc0-0 |
| [2023-07-17 01:12:11,513][285366] Starting process rollout_proc0 |
| [2023-07-17 01:12:11,513][285366] Starting process rollout_proc1 |
| [2023-07-17 01:12:11,513][285366] Starting process rollout_proc2 |
| [2023-07-17 01:12:11,514][285366] Starting process rollout_proc3 |
| [2023-07-17 01:12:11,514][285366] Starting process rollout_proc4 |
| [2023-07-17 01:12:11,514][285366] Starting process rollout_proc5 |
| [2023-07-17 01:12:11,514][285366] Starting process rollout_proc6 |
| [2023-07-17 01:12:11,514][285366] Starting process rollout_proc7 |
| [2023-07-17 01:12:13,287][285607] Starting seed is not provided |
| [2023-07-17 01:12:13,287][285607] Initializing actor-critic model on device cpu |
| [2023-07-17 01:12:13,287][285607] RunningMeanStd input shape: (39,) |
| [2023-07-17 01:12:13,288][285607] RunningMeanStd input shape: (1,) |
| [2023-07-17 01:12:13,344][285607] Created Actor Critic model with architecture: |
| [2023-07-17 01:12:13,344][285607] ActorCriticSharedWeights( |
| (obs_normalizer): ObservationNormalizer( |
| (running_mean_std): RunningMeanStdDictInPlace( |
| (running_mean_std): ModuleDict( |
| (obs): RunningMeanStdInPlace() |
| ) |
| ) |
| ) |
| (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) |
| (encoder): MultiInputEncoder( |
| (encoders): ModuleDict( |
| (obs): MlpEncoder( |
| (mlp_head): RecursiveScriptModule( |
| original_name=Sequential |
| (0): RecursiveScriptModule(original_name=Linear) |
| (1): RecursiveScriptModule(original_name=Tanh) |
| (2): RecursiveScriptModule(original_name=Linear) |
| (3): RecursiveScriptModule(original_name=Tanh) |
| ) |
| ) |
| ) |
| ) |
| (core): ModelCoreIdentity() |
| (decoder): MlpDecoder( |
| (mlp): Identity() |
| ) |
| (critic_linear): Linear(in_features=64, out_features=1, bias=True) |
| (action_parameterization): ActionParameterizationContinuousNonAdaptiveStddev( |
| (distribution_linear): Linear(in_features=64, out_features=4, bias=True) |
| ) |
| ) |
| [2023-07-17 01:12:13,428][285720] Worker 6 uses CPU cores [24, 25, 26, 27] |
| [2023-07-17 01:12:13,561][285752] Worker 7 uses CPU cores [28, 29, 30, 31] |
| [2023-07-17 01:12:13,594][285653] Worker 2 uses CPU cores [8, 9, 10, 11] |
| [2023-07-17 01:12:13,660][285607] Using optimizer <class 'torch.optim.adam.Adam'> |
| [2023-07-17 01:12:13,661][285607] No checkpoints found |
| [2023-07-17 01:12:13,661][285607] Did not load from checkpoint, starting from scratch! |
| [2023-07-17 01:12:13,661][285607] Initialized policy 0 weights for model version 0 |
| [2023-07-17 01:12:13,663][285607] LearnerWorker_p0 finished initialization! |
| [2023-07-17 01:12:13,664][285651] RunningMeanStd input shape: (39,) |
| [2023-07-17 01:12:13,664][285651] RunningMeanStd input shape: (1,) |
| [2023-07-17 01:12:13,719][285652] Worker 1 uses CPU cores [4, 5, 6, 7] |
| [2023-07-17 01:12:13,720][285366] Inference worker 0-0 is ready! |
| [2023-07-17 01:12:13,720][285366] All inference workers are ready! Signal rollout workers to start! |
| [2023-07-17 01:12:13,785][285654] Worker 0 uses CPU cores [0, 1, 2, 3] |
| [2023-07-17 01:12:13,832][285657] Worker 4 uses CPU cores [16, 17, 18, 19] |
| [2023-07-17 01:12:13,923][285655] Worker 3 uses CPU cores [12, 13, 14, 15] |
| [2023-07-17 01:12:14,102][285656] Worker 5 uses CPU cores [20, 21, 22, 23] |
| [2023-07-17 01:12:14,344][285366] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) |
| [2023-07-17 01:12:15,058][285653] Decorrelating experience for 0 frames... |
| [2023-07-17 01:12:15,065][285653] Decorrelating experience for 64 frames... |
| [2023-07-17 01:12:15,079][285752] Decorrelating experience for 0 frames... |
| [2023-07-17 01:12:15,082][285720] Decorrelating experience for 0 frames... |
| [2023-07-17 01:12:15,086][285752] Decorrelating experience for 64 frames... |
| [2023-07-17 01:12:15,088][285720] Decorrelating experience for 64 frames... |
| [2023-07-17 01:12:15,093][285653] Decorrelating experience for 128 frames... |
| [2023-07-17 01:12:15,100][285652] Decorrelating experience for 0 frames... |
| [2023-07-17 01:12:15,107][285652] Decorrelating experience for 64 frames... |
| [2023-07-17 01:12:15,114][285752] Decorrelating experience for 128 frames... |
| [2023-07-17 01:12:15,116][285720] Decorrelating experience for 128 frames... |
| [2023-07-17 01:12:15,134][285652] Decorrelating experience for 128 frames... |
| [2023-07-17 01:12:15,148][285653] Decorrelating experience for 192 frames... |
| [2023-07-17 01:12:15,168][285752] Decorrelating experience for 192 frames... |
| [2023-07-17 01:12:15,170][285720] Decorrelating experience for 192 frames... |
| [2023-07-17 01:12:15,187][285654] Decorrelating experience for 0 frames... |
| [2023-07-17 01:12:15,189][285652] Decorrelating experience for 192 frames... |
| [2023-07-17 01:12:15,194][285654] Decorrelating experience for 64 frames... |
| [2023-07-17 01:12:15,222][285654] Decorrelating experience for 128 frames... |
| [2023-07-17 01:12:15,227][285657] Decorrelating experience for 0 frames... |
| [2023-07-17 01:12:15,233][285657] Decorrelating experience for 64 frames... |
| [2023-07-17 01:12:15,261][285657] Decorrelating experience for 128 frames... |
| [2023-07-17 01:12:15,277][285654] Decorrelating experience for 192 frames... |
| [2023-07-17 01:12:15,317][285657] Decorrelating experience for 192 frames... |
| [2023-07-17 01:12:15,332][285655] Decorrelating experience for 0 frames... |
| [2023-07-17 01:12:15,338][285655] Decorrelating experience for 64 frames... |
| [2023-07-17 01:12:15,367][285655] Decorrelating experience for 128 frames... |
| [2023-07-17 01:12:15,422][285655] Decorrelating experience for 192 frames... |
| [2023-07-17 01:12:15,476][285656] Decorrelating experience for 0 frames... |
| [2023-07-17 01:12:15,483][285656] Decorrelating experience for 64 frames... |
| [2023-07-17 01:12:15,511][285656] Decorrelating experience for 128 frames... |
| [2023-07-17 01:12:15,566][285656] Decorrelating experience for 192 frames... |
| [2023-07-17 01:12:16,474][285653] Decorrelating experience for 256 frames... |
| [2023-07-17 01:12:16,502][285720] Decorrelating experience for 256 frames... |
| [2023-07-17 01:12:16,505][285752] Decorrelating experience for 256 frames... |
| [2023-07-17 01:12:16,511][285652] Decorrelating experience for 256 frames... |
| [2023-07-17 01:12:16,576][285653] Decorrelating experience for 320 frames... |
| [2023-07-17 01:12:16,604][285720] Decorrelating experience for 320 frames... |
| [2023-07-17 01:12:16,609][285752] Decorrelating experience for 320 frames... |
| [2023-07-17 01:12:16,612][285654] Decorrelating experience for 256 frames... |
| [2023-07-17 01:12:16,614][285652] Decorrelating experience for 320 frames... |
| [2023-07-17 01:12:16,642][285657] Decorrelating experience for 256 frames... |
| [2023-07-17 01:12:16,704][285653] Decorrelating experience for 384 frames... |
| [2023-07-17 01:12:16,714][285654] Decorrelating experience for 320 frames... |
| [2023-07-17 01:12:16,733][285720] Decorrelating experience for 384 frames... |
| [2023-07-17 01:12:16,738][285752] Decorrelating experience for 384 frames... |
| [2023-07-17 01:12:16,744][285657] Decorrelating experience for 320 frames... |
| [2023-07-17 01:12:16,745][285652] Decorrelating experience for 384 frames... |
| [2023-07-17 01:12:16,745][285655] Decorrelating experience for 256 frames... |
| [2023-07-17 01:12:16,844][285654] Decorrelating experience for 384 frames... |
| [2023-07-17 01:12:16,847][285655] Decorrelating experience for 320 frames... |
| [2023-07-17 01:12:16,853][285653] Decorrelating experience for 448 frames... |
| [2023-07-17 01:12:16,874][285657] Decorrelating experience for 384 frames... |
| [2023-07-17 01:12:16,884][285720] Decorrelating experience for 448 frames... |
| [2023-07-17 01:12:16,886][285752] Decorrelating experience for 448 frames... |
| [2023-07-17 01:12:16,887][285656] Decorrelating experience for 256 frames... |
| [2023-07-17 01:12:16,896][285652] Decorrelating experience for 448 frames... |
| [2023-07-17 01:12:16,975][285655] Decorrelating experience for 384 frames... |
| [2023-07-17 01:12:16,989][285656] Decorrelating experience for 320 frames... |
| [2023-07-17 01:12:16,993][285654] Decorrelating experience for 448 frames... |
| [2023-07-17 01:12:17,026][285657] Decorrelating experience for 448 frames... |
| [2023-07-17 01:12:17,118][285656] Decorrelating experience for 384 frames... |
| [2023-07-17 01:12:17,124][285655] Decorrelating experience for 448 frames... |
| [2023-07-17 01:12:17,265][285656] Decorrelating experience for 448 frames... |
| [2023-07-17 01:12:19,344][285366] Fps is (10 sec: 4096.1, 60 sec: 4096.1, 300 sec: 4096.1). Total num frames: 20480. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:12:19,344][285366] Avg episode reward: [(0, '69.434')] |
| [2023-07-17 01:12:20,462][285651] Updated weights for policy 0, policy_version 80 (0.0004) |
| [2023-07-17 01:12:23,162][285651] Updated weights for policy 0, policy_version 160 (0.0004) |
| [2023-07-17 01:12:24,344][285366] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 98304. Throughput: 0: 7570.4. Samples: 75704. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 01:12:24,344][285366] Avg episode reward: [(0, '99.252')] |
| [2023-07-17 01:12:24,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000000192_98304.pth... |
| [2023-07-17 01:12:25,878][285651] Updated weights for policy 0, policy_version 240 (0.0004) |
| [2023-07-17 01:12:28,578][285651] Updated weights for policy 0, policy_version 320 (0.0004) |
| [2023-07-17 01:12:29,344][285366] Fps is (10 sec: 15155.1, 60 sec: 11468.8, 300 sec: 11468.8). Total num frames: 172032. Throughput: 0: 11101.4. Samples: 166520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:12:29,344][285366] Avg episode reward: [(0, '114.093')] |
| [2023-07-17 01:12:29,345][285607] Saving new best policy, reward=114.093! |
| [2023-07-17 01:12:31,334][285651] Updated weights for policy 0, policy_version 400 (0.0004) |
| [2023-07-17 01:12:31,372][285366] Heartbeat connected on Batcher_0 |
| [2023-07-17 01:12:31,376][285366] Heartbeat connected on LearnerWorker_p0 |
| [2023-07-17 01:12:31,381][285366] Heartbeat connected on InferenceWorker_p0-w0 |
| [2023-07-17 01:12:31,387][285366] Heartbeat connected on RolloutWorker_w0 |
| [2023-07-17 01:12:31,397][285366] Heartbeat connected on RolloutWorker_w1 |
| [2023-07-17 01:12:31,398][285366] Heartbeat connected on RolloutWorker_w3 |
| [2023-07-17 01:12:31,399][285366] Heartbeat connected on RolloutWorker_w2 |
| [2023-07-17 01:12:31,401][285366] Heartbeat connected on RolloutWorker_w4 |
| [2023-07-17 01:12:31,405][285366] Heartbeat connected on RolloutWorker_w5 |
| [2023-07-17 01:12:31,408][285366] Heartbeat connected on RolloutWorker_w6 |
| [2023-07-17 01:12:31,411][285366] Heartbeat connected on RolloutWorker_w7 |
| [2023-07-17 01:12:34,094][285651] Updated weights for policy 0, policy_version 480 (0.0004) |
| [2023-07-17 01:12:34,344][285366] Fps is (10 sec: 14745.7, 60 sec: 12288.1, 300 sec: 12288.1). Total num frames: 245760. Throughput: 0: 10561.1. Samples: 211220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:12:34,344][285366] Avg episode reward: [(0, '117.500')] |
| [2023-07-17 01:12:34,352][285607] Saving new best policy, reward=117.500! |
| [2023-07-17 01:12:36,771][285651] Updated weights for policy 0, policy_version 560 (0.0004) |
| [2023-07-17 01:12:39,344][285366] Fps is (10 sec: 15155.2, 60 sec: 12943.4, 300 sec: 12943.4). Total num frames: 323584. Throughput: 0: 12053.5. Samples: 301336. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:12:39,344][285366] Avg episode reward: [(0, '117.813')] |
| [2023-07-17 01:12:39,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000000632_323584.pth... |
| [2023-07-17 01:12:39,350][285607] Saving new best policy, reward=117.813! |
| [2023-07-17 01:12:39,530][285651] Updated weights for policy 0, policy_version 640 (0.0004) |
| [2023-07-17 01:12:42,324][285651] Updated weights for policy 0, policy_version 720 (0.0004) |
| [2023-07-17 01:12:44,344][285366] Fps is (10 sec: 15155.1, 60 sec: 13243.8, 300 sec: 13243.8). Total num frames: 397312. Throughput: 0: 12973.2. Samples: 389196. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 01:12:44,344][285366] Avg episode reward: [(0, '119.741')] |
| [2023-07-17 01:12:44,345][285607] Saving new best policy, reward=119.741! |
| [2023-07-17 01:12:45,108][285651] Updated weights for policy 0, policy_version 800 (0.0004) |
| [2023-07-17 01:12:47,876][285651] Updated weights for policy 0, policy_version 880 (0.0004) |
| [2023-07-17 01:12:49,344][285366] Fps is (10 sec: 14745.6, 60 sec: 13458.3, 300 sec: 13458.3). Total num frames: 471040. Throughput: 0: 12405.3. Samples: 434184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:12:49,345][285366] Avg episode reward: [(0, '118.089')] |
| [2023-07-17 01:12:50,621][285651] Updated weights for policy 0, policy_version 960 (0.0004) |
| [2023-07-17 01:12:53,267][285651] Updated weights for policy 0, policy_version 1040 (0.0004) |
| [2023-07-17 01:12:54,344][285366] Fps is (10 sec: 15155.2, 60 sec: 13721.6, 300 sec: 13721.6). Total num frames: 548864. Throughput: 0: 13108.8. Samples: 524352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:12:54,346][285366] Avg episode reward: [(0, '119.946')] |
| [2023-07-17 01:12:54,348][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000001072_548864.pth... |
| [2023-07-17 01:12:54,351][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000000192_98304.pth |
| [2023-07-17 01:12:54,351][285607] Saving new best policy, reward=119.946! |
| [2023-07-17 01:12:55,970][285651] Updated weights for policy 0, policy_version 1120 (0.0004) |
| [2023-07-17 01:12:58,719][285651] Updated weights for policy 0, policy_version 1200 (0.0004) |
| [2023-07-17 01:12:59,344][285366] Fps is (10 sec: 15155.3, 60 sec: 13835.4, 300 sec: 13835.4). Total num frames: 622592. Throughput: 0: 13655.0. Samples: 614472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:12:59,345][285366] Avg episode reward: [(0, '121.109')] |
| [2023-07-17 01:12:59,345][285607] Saving new best policy, reward=121.109! |
| [2023-07-17 01:13:01,495][285651] Updated weights for policy 0, policy_version 1280 (0.0004) |
| [2023-07-17 01:13:04,206][285651] Updated weights for policy 0, policy_version 1360 (0.0004) |
| [2023-07-17 01:13:04,344][285366] Fps is (10 sec: 14745.6, 60 sec: 13926.4, 300 sec: 13926.4). Total num frames: 696320. Throughput: 0: 14654.7. Samples: 659464. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:13:04,345][285366] Avg episode reward: [(0, '121.679')] |
| [2023-07-17 01:13:04,345][285607] Saving new best policy, reward=121.679! |
| [2023-07-17 01:13:06,947][285651] Updated weights for policy 0, policy_version 1440 (0.0004) |
| [2023-07-17 01:13:09,344][285366] Fps is (10 sec: 14745.3, 60 sec: 14000.9, 300 sec: 14000.9). Total num frames: 770048. Throughput: 0: 14964.8. Samples: 749120. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:13:09,344][285366] Avg episode reward: [(0, '121.629')] |
| [2023-07-17 01:13:09,348][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000001504_770048.pth... |
| [2023-07-17 01:13:09,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000000632_323584.pth |
| [2023-07-17 01:13:09,767][285651] Updated weights for policy 0, policy_version 1520 (0.0004) |
| [2023-07-17 01:13:12,516][285651] Updated weights for policy 0, policy_version 1600 (0.0004) |
| [2023-07-17 01:13:14,344][285366] Fps is (10 sec: 14745.6, 60 sec: 14062.9, 300 sec: 14062.9). Total num frames: 843776. Throughput: 0: 14913.0. Samples: 837604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:13:14,344][285366] Avg episode reward: [(0, '122.019')] |
| [2023-07-17 01:13:14,345][285607] Saving new best policy, reward=122.019! |
| [2023-07-17 01:13:15,200][285651] Updated weights for policy 0, policy_version 1680 (0.0004) |
| [2023-07-17 01:13:17,996][285651] Updated weights for policy 0, policy_version 1760 (0.0004) |
| [2023-07-17 01:13:19,344][285366] Fps is (10 sec: 15155.5, 60 sec: 15018.7, 300 sec: 14178.5). Total num frames: 921600. Throughput: 0: 14923.2. Samples: 882764. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 01:13:19,344][285366] Avg episode reward: [(0, '122.384')] |
| [2023-07-17 01:13:19,344][285607] Saving new best policy, reward=122.384! |
| [2023-07-17 01:13:20,848][285651] Updated weights for policy 0, policy_version 1840 (0.0004) |
| [2023-07-17 01:13:23,816][285651] Updated weights for policy 0, policy_version 1920 (0.0005) |
| [2023-07-17 01:13:24,344][285366] Fps is (10 sec: 14336.0, 60 sec: 14813.9, 300 sec: 14101.9). Total num frames: 987136. Throughput: 0: 14825.3. Samples: 968476. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 01:13:24,344][285366] Avg episode reward: [(0, '122.486')] |
| [2023-07-17 01:13:24,348][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000001928_987136.pth... |
| [2023-07-17 01:13:24,351][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000001072_548864.pth |
| [2023-07-17 01:13:24,351][285607] Saving new best policy, reward=122.486! |
| [2023-07-17 01:13:26,949][285651] Updated weights for policy 0, policy_version 2000 (0.0005) |
| [2023-07-17 01:13:29,344][285366] Fps is (10 sec: 13107.1, 60 sec: 14677.3, 300 sec: 14035.6). Total num frames: 1052672. Throughput: 0: 14639.9. Samples: 1047992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:13:29,344][285366] Avg episode reward: [(0, '123.075')] |
| [2023-07-17 01:13:29,345][285607] Saving new best policy, reward=123.075! |
| [2023-07-17 01:13:30,059][285651] Updated weights for policy 0, policy_version 2080 (0.0005) |
| [2023-07-17 01:13:33,128][285651] Updated weights for policy 0, policy_version 2160 (0.0005) |
| [2023-07-17 01:13:34,344][285366] Fps is (10 sec: 13516.9, 60 sec: 14609.1, 300 sec: 14028.8). Total num frames: 1122304. Throughput: 0: 14520.5. Samples: 1087608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:13:34,344][285366] Avg episode reward: [(0, '122.845')] |
| [2023-07-17 01:13:36,169][285651] Updated weights for policy 0, policy_version 2240 (0.0005) |
| [2023-07-17 01:13:39,258][285651] Updated weights for policy 0, policy_version 2320 (0.0005) |
| [2023-07-17 01:13:39,344][285366] Fps is (10 sec: 13516.8, 60 sec: 14404.3, 300 sec: 13974.6). Total num frames: 1187840. Throughput: 0: 14290.5. Samples: 1167424. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 01:13:39,344][285366] Avg episode reward: [(0, '122.976')] |
| [2023-07-17 01:13:39,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000002320_1187840.pth... |
| [2023-07-17 01:13:39,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000001504_770048.pth |
| [2023-07-17 01:13:42,308][285651] Updated weights for policy 0, policy_version 2400 (0.0005) |
| [2023-07-17 01:13:44,344][285366] Fps is (10 sec: 13107.1, 60 sec: 14267.7, 300 sec: 13926.4). Total num frames: 1253376. Throughput: 0: 14086.6. Samples: 1248368. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 01:13:44,344][285366] Avg episode reward: [(0, '123.483')] |
| [2023-07-17 01:13:44,345][285607] Saving new best policy, reward=123.483! |
| [2023-07-17 01:13:45,331][285651] Updated weights for policy 0, policy_version 2480 (0.0005) |
| [2023-07-17 01:13:48,503][285651] Updated weights for policy 0, policy_version 2560 (0.0005) |
| [2023-07-17 01:13:49,344][285366] Fps is (10 sec: 13107.2, 60 sec: 14131.2, 300 sec: 13883.3). Total num frames: 1318912. Throughput: 0: 13943.3. Samples: 1286912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:13:49,344][285366] Avg episode reward: [(0, '123.641')] |
| [2023-07-17 01:13:49,345][285607] Saving new best policy, reward=123.641! |
| [2023-07-17 01:13:51,595][285651] Updated weights for policy 0, policy_version 2640 (0.0005) |
| [2023-07-17 01:13:54,344][285366] Fps is (10 sec: 13107.2, 60 sec: 13926.4, 300 sec: 13844.5). Total num frames: 1384448. Throughput: 0: 13748.1. Samples: 1367784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:13:54,344][285366] Avg episode reward: [(0, '122.152')] |
| [2023-07-17 01:13:54,381][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000002712_1388544.pth... |
| [2023-07-17 01:13:54,383][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000001928_987136.pth |
| [2023-07-17 01:13:54,682][285651] Updated weights for policy 0, policy_version 2720 (0.0005) |
| [2023-07-17 01:13:57,791][285651] Updated weights for policy 0, policy_version 2800 (0.0005) |
| [2023-07-17 01:13:59,344][285366] Fps is (10 sec: 13516.8, 60 sec: 13858.1, 300 sec: 13848.4). Total num frames: 1454080. Throughput: 0: 13531.5. Samples: 1446520. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 01:13:59,344][285366] Avg episode reward: [(0, '122.400')] |
| [2023-07-17 01:14:00,827][285651] Updated weights for policy 0, policy_version 2880 (0.0005) |
| [2023-07-17 01:14:03,905][285651] Updated weights for policy 0, policy_version 2960 (0.0005) |
| [2023-07-17 01:14:04,344][285366] Fps is (10 sec: 13516.9, 60 sec: 13721.6, 300 sec: 13814.7). Total num frames: 1519616. Throughput: 0: 13425.5. Samples: 1486912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:14:04,344][285366] Avg episode reward: [(0, '122.886')] |
| [2023-07-17 01:14:06,945][285651] Updated weights for policy 0, policy_version 3040 (0.0005) |
| [2023-07-17 01:14:09,344][285366] Fps is (10 sec: 13107.1, 60 sec: 13585.1, 300 sec: 13783.9). Total num frames: 1585152. Throughput: 0: 13308.9. Samples: 1567376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:14:09,344][285366] Avg episode reward: [(0, '123.836')] |
| [2023-07-17 01:14:09,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000003096_1585152.pth... |
| [2023-07-17 01:14:09,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000002320_1187840.pth |
| [2023-07-17 01:14:09,350][285607] Saving new best policy, reward=123.836! |
| [2023-07-17 01:14:10,104][285651] Updated weights for policy 0, policy_version 3120 (0.0005) |
| [2023-07-17 01:14:13,210][285651] Updated weights for policy 0, policy_version 3200 (0.0005) |
| [2023-07-17 01:14:14,344][285366] Fps is (10 sec: 13107.2, 60 sec: 13448.5, 300 sec: 13755.7). Total num frames: 1650688. Throughput: 0: 13282.0. Samples: 1645680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:14:14,344][285366] Avg episode reward: [(0, '123.137')] |
| [2023-07-17 01:14:16,236][285651] Updated weights for policy 0, policy_version 3280 (0.0005) |
| [2023-07-17 01:14:19,119][285651] Updated weights for policy 0, policy_version 3360 (0.0004) |
| [2023-07-17 01:14:19,344][285366] Fps is (10 sec: 13516.8, 60 sec: 13312.0, 300 sec: 13762.6). Total num frames: 1720320. Throughput: 0: 13315.5. Samples: 1686808. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 01:14:19,344][285366] Avg episode reward: [(0, '123.213')] |
| [2023-07-17 01:14:21,856][285651] Updated weights for policy 0, policy_version 3440 (0.0004) |
| [2023-07-17 01:14:24,344][285366] Fps is (10 sec: 14335.9, 60 sec: 13448.5, 300 sec: 13800.4). Total num frames: 1794048. Throughput: 0: 13484.1. Samples: 1774208. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:14:24,344][285366] Avg episode reward: [(0, '123.566')] |
| [2023-07-17 01:14:24,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000003504_1794048.pth... |
| [2023-07-17 01:14:24,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000002712_1388544.pth |
| [2023-07-17 01:14:24,874][285651] Updated weights for policy 0, policy_version 3520 (0.0004) |
| [2023-07-17 01:14:27,941][285651] Updated weights for policy 0, policy_version 3600 (0.0005) |
| [2023-07-17 01:14:29,344][285366] Fps is (10 sec: 13926.4, 60 sec: 13448.5, 300 sec: 13774.7). Total num frames: 1859584. Throughput: 0: 13458.4. Samples: 1853996. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 01:14:29,345][285366] Avg episode reward: [(0, '123.765')] |
| [2023-07-17 01:14:31,036][285651] Updated weights for policy 0, policy_version 3680 (0.0005) |
| [2023-07-17 01:14:34,201][285651] Updated weights for policy 0, policy_version 3760 (0.0005) |
| [2023-07-17 01:14:34,344][285366] Fps is (10 sec: 13107.3, 60 sec: 13380.3, 300 sec: 13750.9). Total num frames: 1925120. Throughput: 0: 13475.1. Samples: 1893292. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 01:14:34,345][285366] Avg episode reward: [(0, '123.705')] |
| [2023-07-17 01:14:37,244][285651] Updated weights for policy 0, policy_version 3840 (0.0005) |
| [2023-07-17 01:14:39,344][285366] Fps is (10 sec: 13107.1, 60 sec: 13380.2, 300 sec: 13728.7). Total num frames: 1990656. Throughput: 0: 13456.7. Samples: 1973336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:14:39,345][285366] Avg episode reward: [(0, '123.890')] |
| [2023-07-17 01:14:39,393][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000003896_1994752.pth... |
| [2023-07-17 01:14:39,395][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000003096_1585152.pth |
| [2023-07-17 01:14:39,396][285607] Saving new best policy, reward=123.890! |
| [2023-07-17 01:14:40,296][285651] Updated weights for policy 0, policy_version 3920 (0.0005) |
| [2023-07-17 01:14:43,360][285651] Updated weights for policy 0, policy_version 4000 (0.0005) |
| [2023-07-17 01:14:44,344][285366] Fps is (10 sec: 13516.8, 60 sec: 13448.5, 300 sec: 13735.3). Total num frames: 2060288. Throughput: 0: 13478.7. Samples: 2053064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:14:44,345][285366] Avg episode reward: [(0, '124.232')] |
| [2023-07-17 01:14:44,345][285607] Saving new best policy, reward=124.232! |
| [2023-07-17 01:14:46,454][285651] Updated weights for policy 0, policy_version 4080 (0.0005) |
| [2023-07-17 01:14:49,239][285651] Updated weights for policy 0, policy_version 4160 (0.0003) |
| [2023-07-17 01:14:49,344][285366] Fps is (10 sec: 13926.5, 60 sec: 13516.8, 300 sec: 13741.4). Total num frames: 2129920. Throughput: 0: 13471.5. Samples: 2093128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:14:49,346][285366] Avg episode reward: [(0, '123.162')] |
| [2023-07-17 01:14:52,086][285651] Updated weights for policy 0, policy_version 4240 (0.0004) |
| [2023-07-17 01:14:54,344][285366] Fps is (10 sec: 14336.1, 60 sec: 13653.4, 300 sec: 13772.8). Total num frames: 2203648. Throughput: 0: 13620.2. Samples: 2180284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:14:54,345][285366] Avg episode reward: [(0, '123.784')] |
| [2023-07-17 01:14:54,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000004304_2203648.pth... |
| [2023-07-17 01:14:54,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000003504_1794048.pth |
| [2023-07-17 01:14:54,916][285651] Updated weights for policy 0, policy_version 4320 (0.0004) |
| [2023-07-17 01:14:57,698][285651] Updated weights for policy 0, policy_version 4400 (0.0004) |
| [2023-07-17 01:14:59,344][285366] Fps is (10 sec: 14336.1, 60 sec: 13653.3, 300 sec: 13777.5). Total num frames: 2273280. Throughput: 0: 13846.4. Samples: 2268768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:14:59,345][285366] Avg episode reward: [(0, '124.036')] |
| [2023-07-17 01:15:00,519][285651] Updated weights for policy 0, policy_version 4480 (0.0004) |
| [2023-07-17 01:15:03,388][285651] Updated weights for policy 0, policy_version 4560 (0.0004) |
| [2023-07-17 01:15:04,344][285366] Fps is (10 sec: 14336.0, 60 sec: 13789.9, 300 sec: 13805.9). Total num frames: 2347008. Throughput: 0: 13869.6. Samples: 2310940. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 01:15:04,344][285366] Avg episode reward: [(0, '123.954')] |
| [2023-07-17 01:15:06,271][285651] Updated weights for policy 0, policy_version 4640 (0.0004) |
| [2023-07-17 01:15:09,344][285366] Fps is (10 sec: 13926.3, 60 sec: 13789.9, 300 sec: 13786.0). Total num frames: 2412544. Throughput: 0: 13804.2. Samples: 2395396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:15:09,344][285366] Avg episode reward: [(0, '123.827')] |
| [2023-07-17 01:15:09,375][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000004720_2416640.pth... |
| [2023-07-17 01:15:09,376][285651] Updated weights for policy 0, policy_version 4720 (0.0005) |
| [2023-07-17 01:15:09,378][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000003896_1994752.pth |
| [2023-07-17 01:15:12,361][285651] Updated weights for policy 0, policy_version 4800 (0.0005) |
| [2023-07-17 01:15:14,344][285366] Fps is (10 sec: 13516.9, 60 sec: 13858.1, 300 sec: 13789.9). Total num frames: 2482176. Throughput: 0: 13868.0. Samples: 2478056. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 01:15:14,344][285366] Avg episode reward: [(0, '122.662')] |
| [2023-07-17 01:15:15,149][285651] Updated weights for policy 0, policy_version 4880 (0.0004) |
| [2023-07-17 01:15:18,057][285651] Updated weights for policy 0, policy_version 4960 (0.0004) |
| [2023-07-17 01:15:19,344][285366] Fps is (10 sec: 14336.0, 60 sec: 13926.4, 300 sec: 13815.7). Total num frames: 2555904. Throughput: 0: 13956.3. Samples: 2521324. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 01:15:19,344][285366] Avg episode reward: [(0, '122.389')] |
| [2023-07-17 01:15:20,879][285651] Updated weights for policy 0, policy_version 5040 (0.0004) |
| [2023-07-17 01:15:23,732][285651] Updated weights for policy 0, policy_version 5120 (0.0004) |
| [2023-07-17 01:15:24,344][285366] Fps is (10 sec: 14745.5, 60 sec: 13926.4, 300 sec: 13840.2). Total num frames: 2629632. Throughput: 0: 14082.8. Samples: 2607060. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:15:24,344][285366] Avg episode reward: [(0, '119.700')] |
| [2023-07-17 01:15:24,346][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000005136_2629632.pth... |
| [2023-07-17 01:15:24,349][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000004304_2203648.pth |
| [2023-07-17 01:15:26,512][285651] Updated weights for policy 0, policy_version 5200 (0.0004) |
| [2023-07-17 01:15:29,296][285651] Updated weights for policy 0, policy_version 5280 (0.0004) |
| [2023-07-17 01:15:29,344][285366] Fps is (10 sec: 14745.5, 60 sec: 14062.9, 300 sec: 13863.4). Total num frames: 2703360. Throughput: 0: 14270.6. Samples: 2695240. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 01:15:29,344][285366] Avg episode reward: [(0, '122.185')] |
| [2023-07-17 01:15:32,052][285651] Updated weights for policy 0, policy_version 5360 (0.0004) |
| [2023-07-17 01:15:34,344][285366] Fps is (10 sec: 14745.5, 60 sec: 14199.5, 300 sec: 13885.4). Total num frames: 2777088. Throughput: 0: 14379.6. Samples: 2740208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:15:34,344][285366] Avg episode reward: [(0, '122.369')] |
| [2023-07-17 01:15:34,766][285651] Updated weights for policy 0, policy_version 5440 (0.0004) |
| [2023-07-17 01:15:37,472][285651] Updated weights for policy 0, policy_version 5520 (0.0003) |
| [2023-07-17 01:15:39,344][285366] Fps is (10 sec: 14745.6, 60 sec: 14336.0, 300 sec: 13906.4). Total num frames: 2850816. Throughput: 0: 14447.0. Samples: 2830400. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 01:15:39,344][285366] Avg episode reward: [(0, '122.211')] |
| [2023-07-17 01:15:39,356][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000005576_2854912.pth... |
| [2023-07-17 01:15:39,358][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000004720_2416640.pth |
| [2023-07-17 01:15:40,185][285651] Updated weights for policy 0, policy_version 5600 (0.0004) |
| [2023-07-17 01:15:42,962][285651] Updated weights for policy 0, policy_version 5680 (0.0004) |
| [2023-07-17 01:15:44,344][285366] Fps is (10 sec: 14745.7, 60 sec: 14404.3, 300 sec: 13926.4). Total num frames: 2924544. Throughput: 0: 14475.9. Samples: 2920184. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 01:15:44,344][285366] Avg episode reward: [(0, '122.707')] |
| [2023-07-17 01:15:45,791][285651] Updated weights for policy 0, policy_version 5760 (0.0004) |
| [2023-07-17 01:15:48,543][285651] Updated weights for policy 0, policy_version 5840 (0.0004) |
| [2023-07-17 01:15:49,344][285366] Fps is (10 sec: 14745.7, 60 sec: 14472.5, 300 sec: 13945.5). Total num frames: 2998272. Throughput: 0: 14506.6. Samples: 2963736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:15:49,344][285366] Avg episode reward: [(0, '122.994')] |
| [2023-07-17 01:15:51,258][285651] Updated weights for policy 0, policy_version 5920 (0.0003) |
| [2023-07-17 01:15:54,009][285651] Updated weights for policy 0, policy_version 6000 (0.0004) |
| [2023-07-17 01:15:54,344][285366] Fps is (10 sec: 15155.1, 60 sec: 14540.8, 300 sec: 13982.3). Total num frames: 3076096. Throughput: 0: 14622.5. Samples: 3053408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:15:54,344][285366] Avg episode reward: [(0, '122.519')] |
| [2023-07-17 01:15:54,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000006008_3076096.pth... |
| [2023-07-17 01:15:54,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000005136_2629632.pth |
| [2023-07-17 01:15:56,769][285651] Updated weights for policy 0, policy_version 6080 (0.0004) |
| [2023-07-17 01:15:59,344][285366] Fps is (10 sec: 15155.2, 60 sec: 14609.1, 300 sec: 13999.2). Total num frames: 3149824. Throughput: 0: 14762.1. Samples: 3142352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:15:59,344][285366] Avg episode reward: [(0, '122.741')] |
| [2023-07-17 01:15:59,532][285651] Updated weights for policy 0, policy_version 6160 (0.0004) |
| [2023-07-17 01:16:02,318][285651] Updated weights for policy 0, policy_version 6240 (0.0004) |
| [2023-07-17 01:16:04,344][285366] Fps is (10 sec: 14745.7, 60 sec: 14609.1, 300 sec: 14015.4). Total num frames: 3223552. Throughput: 0: 14786.0. Samples: 3186696. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 01:16:04,344][285366] Avg episode reward: [(0, '123.045')] |
| [2023-07-17 01:16:05,073][285651] Updated weights for policy 0, policy_version 6320 (0.0004) |
| [2023-07-17 01:16:07,857][285651] Updated weights for policy 0, policy_version 6400 (0.0004) |
| [2023-07-17 01:16:09,344][285366] Fps is (10 sec: 14745.5, 60 sec: 14745.6, 300 sec: 14031.0). Total num frames: 3297280. Throughput: 0: 14844.0. Samples: 3275040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:16:09,344][285366] Avg episode reward: [(0, '122.669')] |
| [2023-07-17 01:16:09,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000006440_3297280.pth... |
| [2023-07-17 01:16:09,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000005576_2854912.pth |
| [2023-07-17 01:16:10,604][285651] Updated weights for policy 0, policy_version 6480 (0.0004) |
| [2023-07-17 01:16:13,500][285651] Updated weights for policy 0, policy_version 6560 (0.0004) |
| [2023-07-17 01:16:14,344][285366] Fps is (10 sec: 14745.6, 60 sec: 14813.8, 300 sec: 14045.9). Total num frames: 3371008. Throughput: 0: 14833.9. Samples: 3362764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:16:14,344][285366] Avg episode reward: [(0, '123.294')] |
| [2023-07-17 01:16:16,230][285651] Updated weights for policy 0, policy_version 6640 (0.0004) |
| [2023-07-17 01:16:18,967][285651] Updated weights for policy 0, policy_version 6720 (0.0004) |
| [2023-07-17 01:16:19,344][285366] Fps is (10 sec: 14745.7, 60 sec: 14813.9, 300 sec: 14060.2). Total num frames: 3444736. Throughput: 0: 14837.3. Samples: 3407884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:16:19,344][285366] Avg episode reward: [(0, '123.663')] |
| [2023-07-17 01:16:21,906][285651] Updated weights for policy 0, policy_version 6800 (0.0005) |
| [2023-07-17 01:16:24,344][285366] Fps is (10 sec: 13926.4, 60 sec: 14677.3, 300 sec: 14041.1). Total num frames: 3510272. Throughput: 0: 14726.2. Samples: 3493080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:16:24,344][285366] Avg episode reward: [(0, '122.937')] |
| [2023-07-17 01:16:24,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000006856_3510272.pth... |
| [2023-07-17 01:16:24,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000006008_3076096.pth |
| [2023-07-17 01:16:25,088][285651] Updated weights for policy 0, policy_version 6880 (0.0005) |
| [2023-07-17 01:16:28,301][285651] Updated weights for policy 0, policy_version 6960 (0.0005) |
| [2023-07-17 01:16:29,344][285366] Fps is (10 sec: 13107.2, 60 sec: 14540.8, 300 sec: 14022.8). Total num frames: 3575808. Throughput: 0: 14422.0. Samples: 3569176. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:16:29,344][285366] Avg episode reward: [(0, '122.785')] |
| [2023-07-17 01:16:31,382][285651] Updated weights for policy 0, policy_version 7040 (0.0005) |
| [2023-07-17 01:16:34,344][285366] Fps is (10 sec: 13107.2, 60 sec: 14404.3, 300 sec: 14005.2). Total num frames: 3641344. Throughput: 0: 14334.4. Samples: 3608784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:16:34,344][285366] Avg episode reward: [(0, '123.128')] |
| [2023-07-17 01:16:34,514][285651] Updated weights for policy 0, policy_version 7120 (0.0005) |
| [2023-07-17 01:16:37,445][285651] Updated weights for policy 0, policy_version 7200 (0.0004) |
| [2023-07-17 01:16:39,344][285366] Fps is (10 sec: 13516.7, 60 sec: 14336.0, 300 sec: 14003.7). Total num frames: 3710976. Throughput: 0: 14157.7. Samples: 3690504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:16:39,344][285366] Avg episode reward: [(0, '123.563')] |
| [2023-07-17 01:16:39,348][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000007248_3710976.pth... |
| [2023-07-17 01:16:39,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000006440_3297280.pth |
| [2023-07-17 01:16:40,576][285651] Updated weights for policy 0, policy_version 7280 (0.0005) |
| [2023-07-17 01:16:43,658][285651] Updated weights for policy 0, policy_version 7360 (0.0005) |
| [2023-07-17 01:16:44,344][285366] Fps is (10 sec: 13516.8, 60 sec: 14199.5, 300 sec: 13987.1). Total num frames: 3776512. Throughput: 0: 13937.8. Samples: 3769552. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 01:16:44,345][285366] Avg episode reward: [(0, '123.906')] |
| [2023-07-17 01:16:46,685][285651] Updated weights for policy 0, policy_version 7440 (0.0005) |
| [2023-07-17 01:16:49,344][285366] Fps is (10 sec: 13107.3, 60 sec: 14062.9, 300 sec: 13971.1). Total num frames: 3842048. Throughput: 0: 13850.7. Samples: 3809976. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 01:16:49,344][285366] Avg episode reward: [(0, '123.946')] |
| [2023-07-17 01:16:49,734][285651] Updated weights for policy 0, policy_version 7520 (0.0004) |
| [2023-07-17 01:16:52,808][285651] Updated weights for policy 0, policy_version 7600 (0.0005) |
| [2023-07-17 01:16:54,344][285366] Fps is (10 sec: 13516.8, 60 sec: 13926.4, 300 sec: 13970.3). Total num frames: 3911680. Throughput: 0: 13684.1. Samples: 3890824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:16:54,344][285366] Avg episode reward: [(0, '123.900')] |
| [2023-07-17 01:16:54,348][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000007640_3911680.pth... |
| [2023-07-17 01:16:54,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000006856_3510272.pth |
| [2023-07-17 01:16:55,884][285651] Updated weights for policy 0, policy_version 7680 (0.0005) |
| [2023-07-17 01:16:59,037][285651] Updated weights for policy 0, policy_version 7760 (0.0005) |
| [2023-07-17 01:16:59,344][285366] Fps is (10 sec: 13107.2, 60 sec: 13721.6, 300 sec: 13940.8). Total num frames: 3973120. Throughput: 0: 13477.6. Samples: 3969256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:16:59,344][285366] Avg episode reward: [(0, '123.685')] |
| [2023-07-17 01:17:02,204][285651] Updated weights for policy 0, policy_version 7840 (0.0005) |
| [2023-07-17 01:17:04,344][285366] Fps is (10 sec: 12697.6, 60 sec: 13585.1, 300 sec: 13926.4). Total num frames: 4038656. Throughput: 0: 13335.9. Samples: 4008000. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 01:17:04,345][285366] Avg episode reward: [(0, '124.004')] |
| [2023-07-17 01:17:05,297][285651] Updated weights for policy 0, policy_version 7920 (0.0005) |
| [2023-07-17 01:17:08,443][285651] Updated weights for policy 0, policy_version 8000 (0.0005) |
| [2023-07-17 01:17:09,344][285366] Fps is (10 sec: 13107.1, 60 sec: 13448.5, 300 sec: 13912.5). Total num frames: 4104192. Throughput: 0: 13208.6. Samples: 4087468. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 01:17:09,345][285366] Avg episode reward: [(0, '123.479')] |
| [2023-07-17 01:17:09,362][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000008024_4108288.pth... |
| [2023-07-17 01:17:09,365][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000007248_3710976.pth |
| [2023-07-17 01:17:11,595][285651] Updated weights for policy 0, policy_version 8080 (0.0005) |
| [2023-07-17 01:17:14,344][285366] Fps is (10 sec: 13516.8, 60 sec: 13380.3, 300 sec: 14079.1). Total num frames: 4173824. Throughput: 0: 13330.4. Samples: 4169044. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 01:17:14,344][285366] Avg episode reward: [(0, '124.322')] |
| [2023-07-17 01:17:14,345][285607] Saving new best policy, reward=124.322! |
| [2023-07-17 01:17:14,395][285651] Updated weights for policy 0, policy_version 8160 (0.0004) |
| [2023-07-17 01:17:17,334][285651] Updated weights for policy 0, policy_version 8240 (0.0005) |
| [2023-07-17 01:17:19,344][285366] Fps is (10 sec: 13926.4, 60 sec: 13312.0, 300 sec: 14051.4). Total num frames: 4243456. Throughput: 0: 13377.1. Samples: 4210752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:17:19,344][285366] Avg episode reward: [(0, '124.216')] |
| [2023-07-17 01:17:20,392][285651] Updated weights for policy 0, policy_version 8320 (0.0005) |
| [2023-07-17 01:17:23,421][285651] Updated weights for policy 0, policy_version 8400 (0.0005) |
| [2023-07-17 01:17:24,344][285366] Fps is (10 sec: 13926.2, 60 sec: 13380.2, 300 sec: 14037.5). Total num frames: 4313088. Throughput: 0: 13370.3. Samples: 4292168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:17:24,344][285366] Avg episode reward: [(0, '124.181')] |
| [2023-07-17 01:17:24,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000008424_4313088.pth... |
| [2023-07-17 01:17:24,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000007640_3911680.pth |
| [2023-07-17 01:17:26,497][285651] Updated weights for policy 0, policy_version 8480 (0.0005) |
| [2023-07-17 01:17:29,344][285366] Fps is (10 sec: 13516.7, 60 sec: 13380.3, 300 sec: 14009.7). Total num frames: 4378624. Throughput: 0: 13369.5. Samples: 4371180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:17:29,344][285366] Avg episode reward: [(0, '124.345')] |
| [2023-07-17 01:17:29,345][285607] Saving new best policy, reward=124.345! |
| [2023-07-17 01:17:29,644][285651] Updated weights for policy 0, policy_version 8560 (0.0005) |
| [2023-07-17 01:17:32,755][285651] Updated weights for policy 0, policy_version 8640 (0.0005) |
| [2023-07-17 01:17:34,344][285366] Fps is (10 sec: 12697.8, 60 sec: 13312.0, 300 sec: 13954.2). Total num frames: 4440064. Throughput: 0: 13355.1. Samples: 4410956. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:17:34,345][285366] Avg episode reward: [(0, '124.671')] |
| [2023-07-17 01:17:34,351][285607] Saving new best policy, reward=124.671! |
| [2023-07-17 01:17:35,947][285651] Updated weights for policy 0, policy_version 8720 (0.0005) |
| [2023-07-17 01:17:38,959][285651] Updated weights for policy 0, policy_version 8800 (0.0005) |
| [2023-07-17 01:17:39,344][285366] Fps is (10 sec: 13107.2, 60 sec: 13312.0, 300 sec: 13940.3). Total num frames: 4509696. Throughput: 0: 13297.8. Samples: 4489224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:17:39,345][285366] Avg episode reward: [(0, '124.955')] |
| [2023-07-17 01:17:39,348][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000008808_4509696.pth... |
| [2023-07-17 01:17:39,351][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000008024_4108288.pth |
| [2023-07-17 01:17:39,352][285607] Saving new best policy, reward=124.955! |
| [2023-07-17 01:17:42,102][285651] Updated weights for policy 0, policy_version 8880 (0.0005) |
| [2023-07-17 01:17:44,344][285366] Fps is (10 sec: 13107.1, 60 sec: 13243.7, 300 sec: 13898.6). Total num frames: 4571136. Throughput: 0: 13285.7. Samples: 4567112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:17:44,345][285366] Avg episode reward: [(0, '124.603')] |
| [2023-07-17 01:17:45,308][285651] Updated weights for policy 0, policy_version 8960 (0.0005) |
| [2023-07-17 01:17:48,505][285651] Updated weights for policy 0, policy_version 9040 (0.0005) |
| [2023-07-17 01:17:49,344][285366] Fps is (10 sec: 12697.7, 60 sec: 13243.7, 300 sec: 13857.0). Total num frames: 4636672. Throughput: 0: 13297.0. Samples: 4606364. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 01:17:49,344][285366] Avg episode reward: [(0, '125.106')] |
| [2023-07-17 01:17:49,345][285607] Saving new best policy, reward=125.106! |
| [2023-07-17 01:17:51,725][285651] Updated weights for policy 0, policy_version 9120 (0.0005) |
| [2023-07-17 01:17:54,344][285366] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13829.2). Total num frames: 4702208. Throughput: 0: 13229.5. Samples: 4682796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:17:54,345][285366] Avg episode reward: [(0, '124.938')] |
| [2023-07-17 01:17:54,348][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000009184_4702208.pth... |
| [2023-07-17 01:17:54,351][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000008424_4313088.pth |
| [2023-07-17 01:17:54,906][285651] Updated weights for policy 0, policy_version 9200 (0.0005) |
| [2023-07-17 01:17:58,098][285651] Updated weights for policy 0, policy_version 9280 (0.0005) |
| [2023-07-17 01:17:59,344][285366] Fps is (10 sec: 13107.2, 60 sec: 13243.7, 300 sec: 13801.4). Total num frames: 4767744. Throughput: 0: 13123.8. Samples: 4759616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:17:59,345][285366] Avg episode reward: [(0, '126.299')] |
| [2023-07-17 01:17:59,345][285607] Saving new best policy, reward=126.299! |
| [2023-07-17 01:18:01,263][285651] Updated weights for policy 0, policy_version 9360 (0.0005) |
| [2023-07-17 01:18:04,344][285366] Fps is (10 sec: 12697.6, 60 sec: 13175.5, 300 sec: 13759.8). Total num frames: 4829184. Throughput: 0: 13057.8. Samples: 4798352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:18:04,344][285366] Avg episode reward: [(0, '139.910')] |
| [2023-07-17 01:18:04,345][285607] Saving new best policy, reward=139.910! |
| [2023-07-17 01:18:04,512][285651] Updated weights for policy 0, policy_version 9440 (0.0005) |
| [2023-07-17 01:18:07,747][285651] Updated weights for policy 0, policy_version 9520 (0.0005) |
| [2023-07-17 01:18:09,344][285366] Fps is (10 sec: 12697.6, 60 sec: 13175.5, 300 sec: 13732.0). Total num frames: 4894720. Throughput: 0: 12936.4. Samples: 4874304. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 01:18:09,344][285366] Avg episode reward: [(0, '158.382')] |
| [2023-07-17 01:18:09,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000009560_4894720.pth... |
| [2023-07-17 01:18:09,349][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000008808_4509696.pth |
| [2023-07-17 01:18:09,349][285607] Saving new best policy, reward=158.382! |
| [2023-07-17 01:18:10,939][285651] Updated weights for policy 0, policy_version 9600 (0.0005) |
| [2023-07-17 01:18:14,245][285651] Updated weights for policy 0, policy_version 9680 (0.0005) |
| [2023-07-17 01:18:14,344][285366] Fps is (10 sec: 12697.7, 60 sec: 13038.9, 300 sec: 13676.5). Total num frames: 4956160. Throughput: 0: 12864.3. Samples: 4950072. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 01:18:14,344][285366] Avg episode reward: [(0, '196.959')] |
| [2023-07-17 01:18:14,344][285607] Saving new best policy, reward=196.959! |
| [2023-07-17 01:18:17,508][285651] Updated weights for policy 0, policy_version 9760 (0.0005) |
| [2023-07-17 01:18:19,344][285366] Fps is (10 sec: 12288.1, 60 sec: 12902.4, 300 sec: 13662.6). Total num frames: 5017600. Throughput: 0: 12819.7. Samples: 4987844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:18:19,344][285366] Avg episode reward: [(0, '192.693')] |
| [2023-07-17 01:18:20,715][285651] Updated weights for policy 0, policy_version 9840 (0.0005) |
| [2023-07-17 01:18:23,959][285651] Updated weights for policy 0, policy_version 9920 (0.0004) |
| [2023-07-17 01:18:24,344][285366] Fps is (10 sec: 12697.6, 60 sec: 12834.2, 300 sec: 13662.6). Total num frames: 5083136. Throughput: 0: 12762.4. Samples: 5063532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:18:24,344][285366] Avg episode reward: [(0, '225.990')] |
| [2023-07-17 01:18:24,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000009928_5083136.pth... |
| [2023-07-17 01:18:24,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000009184_4702208.pth |
| [2023-07-17 01:18:24,350][285607] Saving new best policy, reward=225.990! |
| [2023-07-17 01:18:27,106][285651] Updated weights for policy 0, policy_version 10000 (0.0005) |
| [2023-07-17 01:18:29,344][285366] Fps is (10 sec: 12697.6, 60 sec: 12765.9, 300 sec: 13634.8). Total num frames: 5144576. Throughput: 0: 12752.2. Samples: 5140960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:18:29,344][285366] Avg episode reward: [(0, '182.839')] |
| [2023-07-17 01:18:30,317][285651] Updated weights for policy 0, policy_version 10080 (0.0005) |
| [2023-07-17 01:18:33,525][285651] Updated weights for policy 0, policy_version 10160 (0.0005) |
| [2023-07-17 01:18:34,344][285366] Fps is (10 sec: 12697.6, 60 sec: 12834.1, 300 sec: 13634.8). Total num frames: 5210112. Throughput: 0: 12736.3. Samples: 5179500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:18:34,344][285366] Avg episode reward: [(0, '190.425')] |
| [2023-07-17 01:18:36,753][285651] Updated weights for policy 0, policy_version 10240 (0.0005) |
| [2023-07-17 01:18:39,344][285366] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 13620.9). Total num frames: 5271552. Throughput: 0: 12720.8. Samples: 5255232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:18:39,344][285366] Avg episode reward: [(0, '226.801')] |
| [2023-07-17 01:18:39,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000010296_5271552.pth... |
| [2023-07-17 01:18:39,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000009560_4894720.pth |
| [2023-07-17 01:18:39,351][285607] Saving new best policy, reward=226.801! |
| [2023-07-17 01:18:40,090][285651] Updated weights for policy 0, policy_version 10320 (0.0005) |
| [2023-07-17 01:18:43,352][285651] Updated weights for policy 0, policy_version 10400 (0.0005) |
| [2023-07-17 01:18:44,344][285366] Fps is (10 sec: 12697.7, 60 sec: 12765.9, 300 sec: 13620.9). Total num frames: 5337088. Throughput: 0: 12669.9. Samples: 5329760. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 01:18:44,344][285366] Avg episode reward: [(0, '240.963')] |
| [2023-07-17 01:18:44,344][285607] Saving new best policy, reward=240.963! |
| [2023-07-17 01:18:46,563][285651] Updated weights for policy 0, policy_version 10480 (0.0005) |
| [2023-07-17 01:18:49,344][285366] Fps is (10 sec: 13107.3, 60 sec: 12765.9, 300 sec: 13620.9). Total num frames: 5402624. Throughput: 0: 12667.2. Samples: 5368376. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 01:18:49,344][285366] Avg episode reward: [(0, '300.811')] |
| [2023-07-17 01:18:49,345][285607] Saving new best policy, reward=300.811! |
| [2023-07-17 01:18:49,523][285651] Updated weights for policy 0, policy_version 10560 (0.0005) |
| [2023-07-17 01:18:52,681][285651] Updated weights for policy 0, policy_version 10640 (0.0005) |
| [2023-07-17 01:18:54,344][285366] Fps is (10 sec: 13107.1, 60 sec: 12765.9, 300 sec: 13607.1). Total num frames: 5468160. Throughput: 0: 12756.5. Samples: 5448348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:18:54,344][285366] Avg episode reward: [(0, '190.615')] |
| [2023-07-17 01:18:54,348][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000010680_5468160.pth... |
| [2023-07-17 01:18:54,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000009928_5083136.pth |
| [2023-07-17 01:18:55,976][285651] Updated weights for policy 0, policy_version 10720 (0.0005) |
| [2023-07-17 01:18:59,139][285651] Updated weights for policy 0, policy_version 10800 (0.0005) |
| [2023-07-17 01:18:59,344][285366] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 13593.2). Total num frames: 5529600. Throughput: 0: 12784.5. Samples: 5525376. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:18:59,344][285366] Avg episode reward: [(0, '196.431')] |
| [2023-07-17 01:19:02,100][285651] Updated weights for policy 0, policy_version 10880 (0.0004) |
| [2023-07-17 01:19:04,344][285366] Fps is (10 sec: 13107.2, 60 sec: 12834.1, 300 sec: 13607.1). Total num frames: 5599232. Throughput: 0: 12838.6. Samples: 5565580. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:19:04,344][285366] Avg episode reward: [(0, '252.272')] |
| [2023-07-17 01:19:05,120][285651] Updated weights for policy 0, policy_version 10960 (0.0004) |
| [2023-07-17 01:19:08,194][285651] Updated weights for policy 0, policy_version 11040 (0.0005) |
| [2023-07-17 01:19:09,344][285366] Fps is (10 sec: 13516.8, 60 sec: 12834.1, 300 sec: 13607.0). Total num frames: 5664768. Throughput: 0: 12950.6. Samples: 5646308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:19:09,344][285366] Avg episode reward: [(0, '241.948')] |
| [2023-07-17 01:19:09,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000011064_5664768.pth... |
| [2023-07-17 01:19:09,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000010296_5271552.pth |
| [2023-07-17 01:19:11,396][285651] Updated weights for policy 0, policy_version 11120 (0.0005) |
| [2023-07-17 01:19:14,344][285366] Fps is (10 sec: 12697.7, 60 sec: 12834.1, 300 sec: 13579.3). Total num frames: 5726208. Throughput: 0: 12915.9. Samples: 5722176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:19:14,344][285366] Avg episode reward: [(0, '216.523')] |
| [2023-07-17 01:19:14,737][285651] Updated weights for policy 0, policy_version 11200 (0.0005) |
| [2023-07-17 01:19:18,108][285651] Updated weights for policy 0, policy_version 11280 (0.0006) |
| [2023-07-17 01:19:19,344][285366] Fps is (10 sec: 12287.9, 60 sec: 12834.1, 300 sec: 13537.6). Total num frames: 5787648. Throughput: 0: 12877.2. Samples: 5758976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:19:19,344][285366] Avg episode reward: [(0, '395.861')] |
| [2023-07-17 01:19:19,347][285607] Saving new best policy, reward=395.861! |
| [2023-07-17 01:19:21,176][285651] Updated weights for policy 0, policy_version 11360 (0.0005) |
| [2023-07-17 01:19:24,154][285651] Updated weights for policy 0, policy_version 11440 (0.0004) |
| [2023-07-17 01:19:24,344][285366] Fps is (10 sec: 13107.0, 60 sec: 12902.4, 300 sec: 13551.5). Total num frames: 5857280. Throughput: 0: 12948.5. Samples: 5837916. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 01:19:24,344][285366] Avg episode reward: [(0, '427.962')] |
| [2023-07-17 01:19:24,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000011440_5857280.pth... |
| [2023-07-17 01:19:24,349][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000010680_5468160.pth |
| [2023-07-17 01:19:24,350][285607] Saving new best policy, reward=427.962! |
| [2023-07-17 01:19:27,123][285651] Updated weights for policy 0, policy_version 11520 (0.0004) |
| [2023-07-17 01:19:29,344][285366] Fps is (10 sec: 13926.6, 60 sec: 13038.9, 300 sec: 13565.4). Total num frames: 5926912. Throughput: 0: 13128.7. Samples: 5920552. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 01:19:29,344][285366] Avg episode reward: [(0, '382.411')] |
| [2023-07-17 01:19:30,221][285651] Updated weights for policy 0, policy_version 11600 (0.0005) |
| [2023-07-17 01:19:33,527][285651] Updated weights for policy 0, policy_version 11680 (0.0005) |
| [2023-07-17 01:19:34,344][285366] Fps is (10 sec: 13107.3, 60 sec: 12970.7, 300 sec: 13551.5). Total num frames: 5988352. Throughput: 0: 13114.8. Samples: 5958544. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:19:34,344][285366] Avg episode reward: [(0, '392.686')] |
| [2023-07-17 01:19:36,760][285651] Updated weights for policy 0, policy_version 11760 (0.0005) |
| [2023-07-17 01:19:39,344][285366] Fps is (10 sec: 12697.6, 60 sec: 13038.9, 300 sec: 13537.6). Total num frames: 6053888. Throughput: 0: 13007.6. Samples: 6033688. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:19:39,344][285366] Avg episode reward: [(0, '463.424')] |
| [2023-07-17 01:19:39,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000011824_6053888.pth... |
| [2023-07-17 01:19:39,349][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000011064_5664768.pth |
| [2023-07-17 01:19:39,349][285607] Saving new best policy, reward=463.424! |
| [2023-07-17 01:19:39,977][285651] Updated weights for policy 0, policy_version 11840 (0.0005) |
| [2023-07-17 01:19:43,201][285651] Updated weights for policy 0, policy_version 11920 (0.0005) |
| [2023-07-17 01:19:44,344][285366] Fps is (10 sec: 12697.6, 60 sec: 12970.7, 300 sec: 13509.9). Total num frames: 6115328. Throughput: 0: 12995.2. Samples: 6110160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:19:44,344][285366] Avg episode reward: [(0, '480.849')] |
| [2023-07-17 01:19:44,344][285607] Saving new best policy, reward=480.849! |
| [2023-07-17 01:19:46,452][285651] Updated weights for policy 0, policy_version 12000 (0.0005) |
| [2023-07-17 01:19:49,344][285366] Fps is (10 sec: 12288.0, 60 sec: 12902.4, 300 sec: 13468.2). Total num frames: 6176768. Throughput: 0: 12944.8. Samples: 6148096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:19:49,344][285366] Avg episode reward: [(0, '401.990')] |
| [2023-07-17 01:19:49,725][285651] Updated weights for policy 0, policy_version 12080 (0.0005) |
| [2023-07-17 01:19:53,129][285651] Updated weights for policy 0, policy_version 12160 (0.0005) |
| [2023-07-17 01:19:54,344][285366] Fps is (10 sec: 12288.1, 60 sec: 12834.1, 300 sec: 13440.4). Total num frames: 6238208. Throughput: 0: 12789.5. Samples: 6221836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:19:54,344][285366] Avg episode reward: [(0, '404.027')] |
| [2023-07-17 01:19:54,346][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000012184_6238208.pth... |
| [2023-07-17 01:19:54,349][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000011440_5857280.pth |
| [2023-07-17 01:19:56,354][285651] Updated weights for policy 0, policy_version 12240 (0.0005) |
| [2023-07-17 01:19:59,344][285366] Fps is (10 sec: 12697.5, 60 sec: 12902.4, 300 sec: 13412.7). Total num frames: 6303744. Throughput: 0: 12763.1. Samples: 6296516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:19:59,357][285366] Avg episode reward: [(0, '449.436')] |
| [2023-07-17 01:19:59,628][285651] Updated weights for policy 0, policy_version 12320 (0.0005) |
| [2023-07-17 01:20:02,808][285651] Updated weights for policy 0, policy_version 12400 (0.0005) |
| [2023-07-17 01:20:04,344][285366] Fps is (10 sec: 12697.5, 60 sec: 12765.9, 300 sec: 13398.8). Total num frames: 6365184. Throughput: 0: 12816.2. Samples: 6335704. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 01:20:04,344][285366] Avg episode reward: [(0, '414.528')] |
| [2023-07-17 01:20:06,049][285651] Updated weights for policy 0, policy_version 12480 (0.0005) |
| [2023-07-17 01:20:09,344][285366] Fps is (10 sec: 12288.1, 60 sec: 12697.6, 300 sec: 13371.0). Total num frames: 6426624. Throughput: 0: 12719.9. Samples: 6410312. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 01:20:09,345][285366] Avg episode reward: [(0, '430.252')] |
| [2023-07-17 01:20:09,373][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000012560_6430720.pth... |
| [2023-07-17 01:20:09,373][285651] Updated weights for policy 0, policy_version 12560 (0.0005) |
| [2023-07-17 01:20:09,375][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000011824_6053888.pth |
| [2023-07-17 01:20:12,603][285651] Updated weights for policy 0, policy_version 12640 (0.0005) |
| [2023-07-17 01:20:14,344][285366] Fps is (10 sec: 12697.6, 60 sec: 12765.8, 300 sec: 13343.2). Total num frames: 6492160. Throughput: 0: 12575.5. Samples: 6486452. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:20:14,345][285366] Avg episode reward: [(0, '460.078')] |
| [2023-07-17 01:20:15,911][285651] Updated weights for policy 0, policy_version 12720 (0.0005) |
| [2023-07-17 01:20:19,204][285651] Updated weights for policy 0, policy_version 12800 (0.0005) |
| [2023-07-17 01:20:19,344][285366] Fps is (10 sec: 12697.6, 60 sec: 12765.9, 300 sec: 13301.6). Total num frames: 6553600. Throughput: 0: 12553.2. Samples: 6523440. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:20:19,345][285366] Avg episode reward: [(0, '473.387')] |
| [2023-07-17 01:20:22,454][285651] Updated weights for policy 0, policy_version 12880 (0.0005) |
| [2023-07-17 01:20:24,344][285366] Fps is (10 sec: 12288.0, 60 sec: 12629.3, 300 sec: 13259.9). Total num frames: 6615040. Throughput: 0: 12554.8. Samples: 6598656. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:20:24,345][285366] Avg episode reward: [(0, '519.089')] |
| [2023-07-17 01:20:24,348][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000012920_6615040.pth... |
| [2023-07-17 01:20:24,351][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000012184_6238208.pth |
| [2023-07-17 01:20:24,351][285607] Saving new best policy, reward=519.089! |
| [2023-07-17 01:20:25,718][285651] Updated weights for policy 0, policy_version 12960 (0.0005) |
| [2023-07-17 01:20:28,980][285651] Updated weights for policy 0, policy_version 13040 (0.0005) |
| [2023-07-17 01:20:29,344][285366] Fps is (10 sec: 12697.6, 60 sec: 12561.0, 300 sec: 13232.2). Total num frames: 6680576. Throughput: 0: 12524.4. Samples: 6673760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:20:29,345][285366] Avg episode reward: [(0, '304.856')] |
| [2023-07-17 01:20:32,249][285651] Updated weights for policy 0, policy_version 13120 (0.0005) |
| [2023-07-17 01:20:34,344][285366] Fps is (10 sec: 12697.7, 60 sec: 12561.1, 300 sec: 13190.5). Total num frames: 6742016. Throughput: 0: 12511.5. Samples: 6711112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:20:34,344][285366] Avg episode reward: [(0, '396.889')] |
| [2023-07-17 01:20:35,518][285651] Updated weights for policy 0, policy_version 13200 (0.0005) |
| [2023-07-17 01:20:38,743][285651] Updated weights for policy 0, policy_version 13280 (0.0005) |
| [2023-07-17 01:20:39,344][285366] Fps is (10 sec: 12288.1, 60 sec: 12492.8, 300 sec: 13148.9). Total num frames: 6803456. Throughput: 0: 12562.2. Samples: 6787136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:20:39,344][285366] Avg episode reward: [(0, '357.222')] |
| [2023-07-17 01:20:39,355][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000013296_6807552.pth... |
| [2023-07-17 01:20:39,357][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000012560_6430720.pth |
| [2023-07-17 01:20:41,928][285651] Updated weights for policy 0, policy_version 13360 (0.0005) |
| [2023-07-17 01:20:44,344][285366] Fps is (10 sec: 12697.6, 60 sec: 12561.1, 300 sec: 13121.1). Total num frames: 6868992. Throughput: 0: 12624.6. Samples: 6864624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:20:44,345][285366] Avg episode reward: [(0, '440.926')] |
| [2023-07-17 01:20:45,077][285651] Updated weights for policy 0, policy_version 13440 (0.0005) |
| [2023-07-17 01:20:48,050][285651] Updated weights for policy 0, policy_version 13520 (0.0004) |
| [2023-07-17 01:20:49,344][285366] Fps is (10 sec: 13516.7, 60 sec: 12697.6, 300 sec: 13093.3). Total num frames: 6938624. Throughput: 0: 12621.7. Samples: 6903680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:20:49,345][285366] Avg episode reward: [(0, '495.861')] |
| [2023-07-17 01:20:50,912][285651] Updated weights for policy 0, policy_version 13600 (0.0004) |
| [2023-07-17 01:20:53,795][285651] Updated weights for policy 0, policy_version 13680 (0.0003) |
| [2023-07-17 01:20:54,344][285366] Fps is (10 sec: 13926.3, 60 sec: 12834.1, 300 sec: 13079.4). Total num frames: 7008256. Throughput: 0: 12866.7. Samples: 6989316. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 01:20:54,345][285366] Avg episode reward: [(0, '515.366')] |
| [2023-07-17 01:20:54,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000013688_7008256.pth... |
| [2023-07-17 01:20:54,349][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000012920_6615040.pth |
| [2023-07-17 01:20:56,693][285651] Updated weights for policy 0, policy_version 13760 (0.0004) |
| [2023-07-17 01:20:59,344][285366] Fps is (10 sec: 13926.5, 60 sec: 12902.4, 300 sec: 13065.5). Total num frames: 7077888. Throughput: 0: 13052.2. Samples: 7073800. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 01:20:59,345][285366] Avg episode reward: [(0, '474.468')] |
| [2023-07-17 01:20:59,724][285651] Updated weights for policy 0, policy_version 13840 (0.0004) |
| [2023-07-17 01:21:03,044][285651] Updated weights for policy 0, policy_version 13920 (0.0005) |
| [2023-07-17 01:21:04,344][285366] Fps is (10 sec: 13516.9, 60 sec: 12970.7, 300 sec: 13037.8). Total num frames: 7143424. Throughput: 0: 13050.9. Samples: 7110728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:21:04,344][285366] Avg episode reward: [(0, '522.290')] |
| [2023-07-17 01:21:04,345][285607] Saving new best policy, reward=522.290! |
| [2023-07-17 01:21:06,203][285651] Updated weights for policy 0, policy_version 14000 (0.0005) |
| [2023-07-17 01:21:09,344][285366] Fps is (10 sec: 12697.5, 60 sec: 12970.7, 300 sec: 12996.1). Total num frames: 7204864. Throughput: 0: 13088.4. Samples: 7187632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:21:09,345][285366] Avg episode reward: [(0, '559.435')] |
| [2023-07-17 01:21:09,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000014072_7204864.pth... |
| [2023-07-17 01:21:09,349][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000013296_6807552.pth |
| [2023-07-17 01:21:09,349][285607] Saving new best policy, reward=559.435! |
| [2023-07-17 01:21:09,400][285651] Updated weights for policy 0, policy_version 14080 (0.0005) |
| [2023-07-17 01:21:12,343][285651] Updated weights for policy 0, policy_version 14160 (0.0004) |
| [2023-07-17 01:21:14,344][285366] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 12982.2). Total num frames: 7274496. Throughput: 0: 13246.2. Samples: 7269836. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 01:21:14,344][285366] Avg episode reward: [(0, '572.645')] |
| [2023-07-17 01:21:14,379][285607] Saving new best policy, reward=572.645! |
| [2023-07-17 01:21:15,368][285651] Updated weights for policy 0, policy_version 14240 (0.0004) |
| [2023-07-17 01:21:18,701][285651] Updated weights for policy 0, policy_version 14320 (0.0005) |
| [2023-07-17 01:21:19,344][285366] Fps is (10 sec: 13516.9, 60 sec: 13107.2, 300 sec: 12982.2). Total num frames: 7340032. Throughput: 0: 13265.6. Samples: 7308064. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 01:21:19,345][285366] Avg episode reward: [(0, '600.744')] |
| [2023-07-17 01:21:19,345][285607] Saving new best policy, reward=600.744! |
| [2023-07-17 01:21:21,902][285651] Updated weights for policy 0, policy_version 14400 (0.0005) |
| [2023-07-17 01:21:24,344][285366] Fps is (10 sec: 12697.6, 60 sec: 13107.2, 300 sec: 12968.4). Total num frames: 7401472. Throughput: 0: 13260.8. Samples: 7383872. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:21:24,345][285366] Avg episode reward: [(0, '598.940')] |
| [2023-07-17 01:21:24,348][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000014456_7401472.pth... |
| [2023-07-17 01:21:24,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000013688_7008256.pth |
| [2023-07-17 01:21:25,085][285651] Updated weights for policy 0, policy_version 14480 (0.0005) |
| [2023-07-17 01:21:28,264][285651] Updated weights for policy 0, policy_version 14560 (0.0005) |
| [2023-07-17 01:21:29,344][285366] Fps is (10 sec: 12697.6, 60 sec: 13107.2, 300 sec: 12968.4). Total num frames: 7467008. Throughput: 0: 13249.3. Samples: 7460844. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:21:29,345][285366] Avg episode reward: [(0, '601.067')] |
| [2023-07-17 01:21:29,345][285607] Saving new best policy, reward=601.067! |
| [2023-07-17 01:21:31,362][285651] Updated weights for policy 0, policy_version 14640 (0.0004) |
| [2023-07-17 01:21:34,344][285366] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 12954.5). Total num frames: 7532544. Throughput: 0: 13268.3. Samples: 7500752. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 01:21:34,344][285366] Avg episode reward: [(0, '566.995')] |
| [2023-07-17 01:21:34,362][285651] Updated weights for policy 0, policy_version 14720 (0.0004) |
| [2023-07-17 01:21:37,333][285651] Updated weights for policy 0, policy_version 14800 (0.0004) |
| [2023-07-17 01:21:39,344][285366] Fps is (10 sec: 13516.7, 60 sec: 13312.0, 300 sec: 12968.4). Total num frames: 7602176. Throughput: 0: 13205.7. Samples: 7583572. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 01:21:39,363][285366] Avg episode reward: [(0, '605.410')] |
| [2023-07-17 01:21:39,393][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000014856_7606272.pth... |
| [2023-07-17 01:21:39,395][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000014072_7204864.pth |
| [2023-07-17 01:21:39,395][285607] Saving new best policy, reward=605.410! |
| [2023-07-17 01:21:40,265][285651] Updated weights for policy 0, policy_version 14880 (0.0004) |
| [2023-07-17 01:21:43,226][285651] Updated weights for policy 0, policy_version 14960 (0.0004) |
| [2023-07-17 01:21:44,344][285366] Fps is (10 sec: 13926.5, 60 sec: 13380.3, 300 sec: 12982.2). Total num frames: 7671808. Throughput: 0: 13180.5. Samples: 7666924. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 01:21:44,345][285366] Avg episode reward: [(0, '576.567')] |
| [2023-07-17 01:21:46,177][285651] Updated weights for policy 0, policy_version 15040 (0.0004) |
| [2023-07-17 01:21:49,115][285651] Updated weights for policy 0, policy_version 15120 (0.0004) |
| [2023-07-17 01:21:49,344][285366] Fps is (10 sec: 13926.5, 60 sec: 13380.3, 300 sec: 12982.2). Total num frames: 7741440. Throughput: 0: 13289.1. Samples: 7708736. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 01:21:49,344][285366] Avg episode reward: [(0, '599.064')] |
| [2023-07-17 01:21:52,008][285651] Updated weights for policy 0, policy_version 15200 (0.0004) |
| [2023-07-17 01:21:54,344][285366] Fps is (10 sec: 13926.4, 60 sec: 13380.3, 300 sec: 13010.0). Total num frames: 7811072. Throughput: 0: 13446.1. Samples: 7792704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:21:54,344][285366] Avg episode reward: [(0, '614.812')] |
| [2023-07-17 01:21:54,364][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000015264_7815168.pth... |
| [2023-07-17 01:21:54,365][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000014456_7401472.pth |
| [2023-07-17 01:21:54,365][285607] Saving new best policy, reward=614.812! |
| [2023-07-17 01:21:54,974][285651] Updated weights for policy 0, policy_version 15280 (0.0004) |
| [2023-07-17 01:21:57,971][285651] Updated weights for policy 0, policy_version 15360 (0.0004) |
| [2023-07-17 01:21:59,344][285366] Fps is (10 sec: 13926.4, 60 sec: 13380.3, 300 sec: 13023.9). Total num frames: 7880704. Throughput: 0: 13454.8. Samples: 7875304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:21:59,345][285366] Avg episode reward: [(0, '611.354')] |
| [2023-07-17 01:22:00,993][285651] Updated weights for policy 0, policy_version 15440 (0.0005) |
| [2023-07-17 01:22:03,903][285651] Updated weights for policy 0, policy_version 15520 (0.0004) |
| [2023-07-17 01:22:04,344][285366] Fps is (10 sec: 13926.3, 60 sec: 13448.5, 300 sec: 13037.8). Total num frames: 7950336. Throughput: 0: 13518.4. Samples: 7916392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:22:04,345][285366] Avg episode reward: [(0, '615.946')] |
| [2023-07-17 01:22:04,345][285607] Saving new best policy, reward=615.946! |
| [2023-07-17 01:22:06,813][285651] Updated weights for policy 0, policy_version 15600 (0.0004) |
| [2023-07-17 01:22:09,344][285366] Fps is (10 sec: 13926.3, 60 sec: 13585.1, 300 sec: 13037.8). Total num frames: 8019968. Throughput: 0: 13703.6. Samples: 8000536. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 01:22:09,345][285366] Avg episode reward: [(0, '606.377')] |
| [2023-07-17 01:22:09,379][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000015672_8024064.pth... |
| [2023-07-17 01:22:09,383][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000014856_7606272.pth |
| [2023-07-17 01:22:09,663][285651] Updated weights for policy 0, policy_version 15680 (0.0004) |
| [2023-07-17 01:22:12,646][285651] Updated weights for policy 0, policy_version 15760 (0.0004) |
| [2023-07-17 01:22:14,344][285366] Fps is (10 sec: 13926.4, 60 sec: 13585.1, 300 sec: 13037.8). Total num frames: 8089600. Throughput: 0: 13867.0. Samples: 8084860. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 01:22:14,345][285366] Avg episode reward: [(0, '580.940')] |
| [2023-07-17 01:22:15,578][285651] Updated weights for policy 0, policy_version 15840 (0.0004) |
| [2023-07-17 01:22:18,535][285651] Updated weights for policy 0, policy_version 15920 (0.0004) |
| [2023-07-17 01:22:19,344][285366] Fps is (10 sec: 13926.5, 60 sec: 13653.3, 300 sec: 13037.8). Total num frames: 8159232. Throughput: 0: 13903.0. Samples: 8126384. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 01:22:19,344][285366] Avg episode reward: [(0, '631.537')] |
| [2023-07-17 01:22:19,345][285607] Saving new best policy, reward=631.537! |
| [2023-07-17 01:22:21,448][285651] Updated weights for policy 0, policy_version 16000 (0.0004) |
| [2023-07-17 01:22:24,339][285651] Updated weights for policy 0, policy_version 16080 (0.0004) |
| [2023-07-17 01:22:24,344][285366] Fps is (10 sec: 14335.9, 60 sec: 13858.1, 300 sec: 13065.5). Total num frames: 8232960. Throughput: 0: 13920.1. Samples: 8209976. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 01:22:24,345][285366] Avg episode reward: [(0, '622.590')] |
| [2023-07-17 01:22:24,348][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000016080_8232960.pth... |
| [2023-07-17 01:22:24,351][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000015264_7815168.pth |
| [2023-07-17 01:22:27,312][285651] Updated weights for policy 0, policy_version 16160 (0.0004) |
| [2023-07-17 01:22:29,344][285366] Fps is (10 sec: 14336.0, 60 sec: 13926.4, 300 sec: 13093.3). Total num frames: 8302592. Throughput: 0: 13945.3. Samples: 8294464. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 01:22:29,345][285366] Avg episode reward: [(0, '600.207')] |
| [2023-07-17 01:22:30,239][285651] Updated weights for policy 0, policy_version 16240 (0.0004) |
| [2023-07-17 01:22:33,149][285651] Updated weights for policy 0, policy_version 16320 (0.0004) |
| [2023-07-17 01:22:34,344][285366] Fps is (10 sec: 13926.5, 60 sec: 13994.7, 300 sec: 13093.3). Total num frames: 8372224. Throughput: 0: 13935.2. Samples: 8335820. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 01:22:34,344][285366] Avg episode reward: [(0, '611.158')] |
| [2023-07-17 01:22:36,085][285651] Updated weights for policy 0, policy_version 16400 (0.0004) |
| [2023-07-17 01:22:39,018][285651] Updated weights for policy 0, policy_version 16480 (0.0004) |
| [2023-07-17 01:22:39,344][285366] Fps is (10 sec: 13926.3, 60 sec: 13994.7, 300 sec: 13121.1). Total num frames: 8441856. Throughput: 0: 13955.3. Samples: 8420692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:22:39,344][285366] Avg episode reward: [(0, '589.033')] |
| [2023-07-17 01:22:39,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000016488_8441856.pth... |
| [2023-07-17 01:22:39,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000015672_8024064.pth |
| [2023-07-17 01:22:41,981][285651] Updated weights for policy 0, policy_version 16560 (0.0004) |
| [2023-07-17 01:22:44,344][285366] Fps is (10 sec: 13516.7, 60 sec: 13926.4, 300 sec: 13121.1). Total num frames: 8507392. Throughput: 0: 13947.1. Samples: 8502924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:22:44,344][285366] Avg episode reward: [(0, '583.695')] |
| [2023-07-17 01:22:44,957][285651] Updated weights for policy 0, policy_version 16640 (0.0005) |
| [2023-07-17 01:22:48,088][285651] Updated weights for policy 0, policy_version 16720 (0.0005) |
| [2023-07-17 01:22:49,344][285366] Fps is (10 sec: 13107.3, 60 sec: 13858.1, 300 sec: 13121.1). Total num frames: 8572928. Throughput: 0: 13925.5. Samples: 8543040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:22:49,344][285366] Avg episode reward: [(0, '591.792')] |
| [2023-07-17 01:22:51,327][285651] Updated weights for policy 0, policy_version 16800 (0.0005) |
| [2023-07-17 01:22:54,344][285366] Fps is (10 sec: 13107.2, 60 sec: 13789.8, 300 sec: 13121.1). Total num frames: 8638464. Throughput: 0: 13769.8. Samples: 8620176. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:22:54,344][285366] Avg episode reward: [(0, '600.855')] |
| [2023-07-17 01:22:54,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000016872_8638464.pth... |
| [2023-07-17 01:22:54,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000016080_8232960.pth |
| [2023-07-17 01:22:54,437][285651] Updated weights for policy 0, policy_version 16880 (0.0005) |
| [2023-07-17 01:22:57,419][285651] Updated weights for policy 0, policy_version 16960 (0.0004) |
| [2023-07-17 01:22:59,344][285366] Fps is (10 sec: 13516.8, 60 sec: 13789.9, 300 sec: 13148.9). Total num frames: 8708096. Throughput: 0: 13718.1. Samples: 8702172. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:22:59,344][285366] Avg episode reward: [(0, '585.386')] |
| [2023-07-17 01:23:00,355][285651] Updated weights for policy 0, policy_version 17040 (0.0004) |
| [2023-07-17 01:23:03,322][285651] Updated weights for policy 0, policy_version 17120 (0.0004) |
| [2023-07-17 01:23:04,344][285366] Fps is (10 sec: 13926.5, 60 sec: 13789.9, 300 sec: 13162.7). Total num frames: 8777728. Throughput: 0: 13711.7. Samples: 8743412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:23:04,345][285366] Avg episode reward: [(0, '582.228')] |
| [2023-07-17 01:23:06,260][285651] Updated weights for policy 0, policy_version 17200 (0.0004) |
| [2023-07-17 01:23:09,177][285651] Updated weights for policy 0, policy_version 17280 (0.0004) |
| [2023-07-17 01:23:09,344][285366] Fps is (10 sec: 13926.3, 60 sec: 13789.9, 300 sec: 13190.5). Total num frames: 8847360. Throughput: 0: 13710.4. Samples: 8826944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:23:09,345][285366] Avg episode reward: [(0, '570.487')] |
| [2023-07-17 01:23:09,348][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000017280_8847360.pth... |
| [2023-07-17 01:23:09,351][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000016488_8441856.pth |
| [2023-07-17 01:23:12,382][285651] Updated weights for policy 0, policy_version 17360 (0.0005) |
| [2023-07-17 01:23:14,344][285366] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13190.5). Total num frames: 8908800. Throughput: 0: 13570.1. Samples: 8905120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:23:14,345][285366] Avg episode reward: [(0, '585.754')] |
| [2023-07-17 01:23:15,667][285651] Updated weights for policy 0, policy_version 17440 (0.0006) |
| [2023-07-17 01:23:18,908][285651] Updated weights for policy 0, policy_version 17520 (0.0005) |
| [2023-07-17 01:23:19,344][285366] Fps is (10 sec: 12697.6, 60 sec: 13585.1, 300 sec: 13190.5). Total num frames: 8974336. Throughput: 0: 13495.6. Samples: 8943120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:23:19,344][285366] Avg episode reward: [(0, '550.536')] |
| [2023-07-17 01:23:22,147][285651] Updated weights for policy 0, policy_version 17600 (0.0006) |
| [2023-07-17 01:23:24,344][285366] Fps is (10 sec: 12697.6, 60 sec: 13380.3, 300 sec: 13190.5). Total num frames: 9035776. Throughput: 0: 13298.1. Samples: 9019108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 01:23:24,344][285366] Avg episode reward: [(0, '573.005')] |
| [2023-07-17 01:23:24,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000017648_9035776.pth... |
| [2023-07-17 01:23:24,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000016872_8638464.pth |
| [2023-07-17 01:23:25,490][285651] Updated weights for policy 0, policy_version 17680 (0.0006) |
| [2023-07-17 01:23:28,395][285651] Updated weights for policy 0, policy_version 17760 (0.0004) |
| [2023-07-17 01:23:29,344][285366] Fps is (10 sec: 13107.2, 60 sec: 13380.3, 300 sec: 13204.4). Total num frames: 9105408. Throughput: 0: 13223.2. Samples: 9097968. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:23:29,344][285366] Avg episode reward: [(0, '545.783')] |
| [2023-07-17 01:23:31,239][285651] Updated weights for policy 0, policy_version 17840 (0.0004) |
| [2023-07-17 01:23:34,179][285651] Updated weights for policy 0, policy_version 17920 (0.0004) |
| [2023-07-17 01:23:34,344][285366] Fps is (10 sec: 13926.5, 60 sec: 13380.3, 300 sec: 13232.2). Total num frames: 9175040. Throughput: 0: 13295.0. Samples: 9141316. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:23:34,344][285366] Avg episode reward: [(0, '548.506')] |
| [2023-07-17 01:23:37,113][285651] Updated weights for policy 0, policy_version 18000 (0.0004) |
| [2023-07-17 01:23:39,344][285366] Fps is (10 sec: 13926.3, 60 sec: 13380.3, 300 sec: 13246.0). Total num frames: 9244672. Throughput: 0: 13441.1. Samples: 9225028. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 01:23:39,344][285366] Avg episode reward: [(0, '559.871')] |
| [2023-07-17 01:23:39,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000018056_9244672.pth... |
| [2023-07-17 01:23:39,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000017280_8847360.pth |
| [2023-07-17 01:23:39,995][285651] Updated weights for policy 0, policy_version 18080 (0.0004) |
| [2023-07-17 01:23:43,017][285651] Updated weights for policy 0, policy_version 18160 (0.0004) |
| [2023-07-17 01:23:44,343][285366] Fps is (10 sec: 13926.5, 60 sec: 13448.6, 300 sec: 13259.9). Total num frames: 9314304. Throughput: 0: 13454.8. Samples: 9307636. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 01:23:44,345][285366] Avg episode reward: [(0, '590.257')] |
| [2023-07-17 01:23:46,165][285651] Updated weights for policy 0, policy_version 18240 (0.0005) |
| [2023-07-17 01:23:49,344][285366] Fps is (10 sec: 13107.2, 60 sec: 13380.2, 300 sec: 13246.0). Total num frames: 9375744. Throughput: 0: 13414.6. Samples: 9347072. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 01:23:49,345][285366] Avg episode reward: [(0, '603.941')] |
| [2023-07-17 01:23:49,392][285651] Updated weights for policy 0, policy_version 18320 (0.0005) |
| [2023-07-17 01:23:52,676][285651] Updated weights for policy 0, policy_version 18400 (0.0005) |
| [2023-07-17 01:23:54,344][285366] Fps is (10 sec: 12287.8, 60 sec: 13312.0, 300 sec: 13246.0). Total num frames: 9437184. Throughput: 0: 13213.1. Samples: 9421536. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 01:23:54,344][285366] Avg episode reward: [(0, '618.386')] |
| [2023-07-17 01:23:54,347][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000018440_9441280.pth... |
| [2023-07-17 01:23:54,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000017648_9035776.pth |
| [2023-07-17 01:23:55,931][285651] Updated weights for policy 0, policy_version 18480 (0.0006) |
| [2023-07-17 01:23:58,885][285651] Updated weights for policy 0, policy_version 18560 (0.0004) |
| [2023-07-17 01:23:59,344][285366] Fps is (10 sec: 13107.4, 60 sec: 13312.0, 300 sec: 13246.0). Total num frames: 9506816. Throughput: 0: 13235.1. Samples: 9500700. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 01:23:59,345][285366] Avg episode reward: [(0, '616.639')] |
| [2023-07-17 01:24:01,945][285651] Updated weights for policy 0, policy_version 18640 (0.0005) |
| [2023-07-17 01:24:04,344][285366] Fps is (10 sec: 13517.0, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 9572352. Throughput: 0: 13288.5. Samples: 9541100. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 01:24:04,344][285366] Avg episode reward: [(0, '606.312')] |
| [2023-07-17 01:24:05,236][285651] Updated weights for policy 0, policy_version 18720 (0.0005) |
| [2023-07-17 01:24:08,497][285651] Updated weights for policy 0, policy_version 18800 (0.0005) |
| [2023-07-17 01:24:09,344][285366] Fps is (10 sec: 12697.4, 60 sec: 13107.2, 300 sec: 13246.0). Total num frames: 9633792. Throughput: 0: 13270.6. Samples: 9616284. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 01:24:09,366][285366] Avg episode reward: [(0, '611.278')] |
| [2023-07-17 01:24:09,369][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000018816_9633792.pth... |
| [2023-07-17 01:24:09,372][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000018056_9244672.pth |
| [2023-07-17 01:24:11,477][285651] Updated weights for policy 0, policy_version 18880 (0.0004) |
| [2023-07-17 01:24:14,344][285366] Fps is (10 sec: 13107.1, 60 sec: 13243.7, 300 sec: 13273.8). Total num frames: 9703424. Throughput: 0: 13325.7. Samples: 9697624. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 01:24:14,344][285366] Avg episode reward: [(0, '589.382')] |
| [2023-07-17 01:24:14,475][285651] Updated weights for policy 0, policy_version 18960 (0.0005) |
| [2023-07-17 01:24:17,396][285651] Updated weights for policy 0, policy_version 19040 (0.0004) |
| [2023-07-17 01:24:19,344][285366] Fps is (10 sec: 13926.5, 60 sec: 13312.0, 300 sec: 13273.8). Total num frames: 9773056. Throughput: 0: 13294.2. Samples: 9739556. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 01:24:19,358][285366] Avg episode reward: [(0, '604.227')] |
| [2023-07-17 01:24:20,319][285651] Updated weights for policy 0, policy_version 19120 (0.0004) |
| [2023-07-17 01:24:23,263][285651] Updated weights for policy 0, policy_version 19200 (0.0004) |
| [2023-07-17 01:24:24,344][285366] Fps is (10 sec: 13926.2, 60 sec: 13448.5, 300 sec: 13273.8). Total num frames: 9842688. Throughput: 0: 13278.6. Samples: 9822564. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 01:24:24,344][285366] Avg episode reward: [(0, '609.328')] |
| [2023-07-17 01:24:24,348][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000019224_9842688.pth... |
| [2023-07-17 01:24:24,350][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000018440_9441280.pth |
| [2023-07-17 01:24:26,266][285651] Updated weights for policy 0, policy_version 19280 (0.0005) |
| [2023-07-17 01:24:29,292][285651] Updated weights for policy 0, policy_version 19360 (0.0004) |
| [2023-07-17 01:24:29,344][285366] Fps is (10 sec: 13926.4, 60 sec: 13448.5, 300 sec: 13301.6). Total num frames: 9912320. Throughput: 0: 13282.9. Samples: 9905368. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 01:24:29,344][285366] Avg episode reward: [(0, '601.418')] |
| [2023-07-17 01:24:32,501][285651] Updated weights for policy 0, policy_version 19440 (0.0005) |
| [2023-07-17 01:24:34,344][285366] Fps is (10 sec: 13107.3, 60 sec: 13312.0, 300 sec: 13287.7). Total num frames: 9973760. Throughput: 0: 13274.5. Samples: 9944424. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 01:24:34,344][285366] Avg episode reward: [(0, '595.085')] |
| [2023-07-17 01:24:35,664][285651] Updated weights for policy 0, policy_version 19520 (0.0005) |
| [2023-07-17 01:24:36,671][285607] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 |
| [2023-07-17 01:24:36,672][285655] Stopping RolloutWorker_w3... |
| [2023-07-17 01:24:36,672][285652] Stopping RolloutWorker_w1... |
| [2023-07-17 01:24:36,672][285657] Stopping RolloutWorker_w4... |
| [2023-07-17 01:24:36,672][285656] Stopping RolloutWorker_w5... |
| [2023-07-17 01:24:36,672][285720] Stopping RolloutWorker_w6... |
| [2023-07-17 01:24:36,672][285654] Stopping RolloutWorker_w0... |
| [2023-07-17 01:24:36,672][285752] Stopping RolloutWorker_w7... |
| [2023-07-17 01:24:36,672][285653] Stopping RolloutWorker_w2... |
| [2023-07-17 01:24:36,672][285655] Loop rollout_proc3_evt_loop terminating... |
| [2023-07-17 01:24:36,672][285652] Loop rollout_proc1_evt_loop terminating... |
| [2023-07-17 01:24:36,672][285656] Loop rollout_proc5_evt_loop terminating... |
| [2023-07-17 01:24:36,672][285657] Loop rollout_proc4_evt_loop terminating... |
| [2023-07-17 01:24:36,672][285720] Loop rollout_proc6_evt_loop terminating... |
| [2023-07-17 01:24:36,672][285654] Loop rollout_proc0_evt_loop terminating... |
| [2023-07-17 01:24:36,672][285752] Loop rollout_proc7_evt_loop terminating... |
| [2023-07-17 01:24:36,672][285653] Loop rollout_proc2_evt_loop terminating... |
| [2023-07-17 01:24:36,672][285366] Component RolloutWorker_w3 stopped! |
| [2023-07-17 01:24:36,673][285366] Component RolloutWorker_w4 stopped! |
| [2023-07-17 01:24:36,673][285607] Stopping Batcher_0... |
| [2023-07-17 01:24:36,673][285366] Component RolloutWorker_w6 stopped! |
| [2023-07-17 01:24:36,673][285366] Component RolloutWorker_w1 stopped! |
| [2023-07-17 01:24:36,673][285607] Loop batcher_evt_loop terminating... |
| [2023-07-17 01:24:36,674][285366] Component RolloutWorker_w5 stopped! |
| [2023-07-17 01:24:36,673][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000019544_10006528.pth... |
| [2023-07-17 01:24:36,674][285366] Component RolloutWorker_w0 stopped! |
| [2023-07-17 01:24:36,674][285366] Component RolloutWorker_w7 stopped! |
| [2023-07-17 01:24:36,674][285366] Component RolloutWorker_w2 stopped! |
| [2023-07-17 01:24:36,674][285366] Component Batcher_0 stopped! |
| [2023-07-17 01:24:36,676][285607] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000018816_9633792.pth |
| [2023-07-17 01:24:36,677][285607] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000019544_10006528.pth... |
| [2023-07-17 01:24:36,679][285607] Stopping LearnerWorker_p0... |
| [2023-07-17 01:24:36,680][285607] Loop learner_proc0_evt_loop terminating... |
| [2023-07-17 01:24:36,680][285366] Component LearnerWorker_p0 stopped! |
| [2023-07-17 01:24:36,738][285651] Weights refcount: 2 0 |
| [2023-07-17 01:24:36,739][285651] Stopping InferenceWorker_p0-w0... |
| [2023-07-17 01:24:36,740][285651] Loop inference_proc0-0_evt_loop terminating... |
| [2023-07-17 01:24:36,740][285366] Component InferenceWorker_p0-w0 stopped! |
| [2023-07-17 01:24:36,740][285366] Waiting for process learner_proc0 to stop... |
| [2023-07-17 01:24:37,280][285366] Waiting for process inference_proc0-0 to join... |
| [2023-07-17 01:24:37,296][285366] Waiting for process rollout_proc0 to join... |
| [2023-07-17 01:24:37,296][285366] Waiting for process rollout_proc1 to join... |
| [2023-07-17 01:24:37,296][285366] Waiting for process rollout_proc2 to join... |
| [2023-07-17 01:24:37,296][285366] Waiting for process rollout_proc3 to join... |
| [2023-07-17 01:24:37,297][285366] Waiting for process rollout_proc4 to join... |
| [2023-07-17 01:24:37,297][285366] Waiting for process rollout_proc5 to join... |
| [2023-07-17 01:24:37,297][285366] Waiting for process rollout_proc6 to join... |
| [2023-07-17 01:24:37,297][285366] Waiting for process rollout_proc7 to join... |
| [2023-07-17 01:24:37,297][285366] Batcher 0 profile tree view: |
| batching: 1.8364, releasing_batches: 1.6288 |
| [2023-07-17 01:24:37,297][285366] InferenceWorker_p0-w0 profile tree view: |
| wait_policy: 0.0051 |
| wait_policy_total: 240.3535 |
| update_model: 9.7942 |
| weight_update: 0.0005 |
| one_step: 0.0006 |
| handle_policy_step: 442.2087 |
| deserialize: 18.8726, stack: 4.6258, obs_to_device_normalize: 78.6490, forward: 216.8483, send_messages: 35.0300 |
| prepare_outputs: 50.4467 |
| to_cpu: 7.6827 |
| [2023-07-17 01:24:37,297][285366] Learner 0 profile tree view: |
| misc: 0.0107, prepare_batch: 9.5273 |
| train: 97.1004 |
| epoch_init: 0.0347, minibatch_init: 1.3723, losses_postprocess: 1.2884, kl_divergence: 0.4350, after_optimizer: 0.5776 |
| calculate_losses: 41.4344 |
| losses_init: 0.0332, forward_head: 16.1917, bptt_initial: 0.1434, bptt: 0.1416, tail: 11.6519, advantages_returns: 0.8915, losses: 10.9274 |
| update: 50.3420 |
| clip: 5.9398 |
| [2023-07-17 01:24:37,298][285366] RolloutWorker_w0 profile tree view: |
| wait_for_trajectories: 0.2724, enqueue_policy_requests: 12.2577, env_step: 478.9028, overhead: 19.4063, complete_rollouts: 0.3161 |
| save_policy_outputs: 38.1381 |
| split_output_tensors: 12.9879 |
| [2023-07-17 01:24:37,298][285366] RolloutWorker_w7 profile tree view: |
| wait_for_trajectories: 0.2620, enqueue_policy_requests: 12.4813, env_step: 481.9282, overhead: 19.7011, complete_rollouts: 0.3153 |
| save_policy_outputs: 38.0531 |
| split_output_tensors: 13.2893 |
| [2023-07-17 01:24:37,298][285366] Loop Runner_EvtLoop terminating... |
| [2023-07-17 01:24:37,298][285366] Runner profile tree view: |
| main_loop: 745.8890 |
| [2023-07-17 01:24:37,298][285366] Collected {0: 10006528}, FPS: 13415.6 |
|
|