| [2023-07-17 06:20:14,603][348097] Saving configuration to /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/config.json... |
| [2023-07-17 06:20:14,620][348097] Rollout worker 0 uses device cpu |
| [2023-07-17 06:20:14,621][348097] Rollout worker 1 uses device cpu |
| [2023-07-17 06:20:14,621][348097] Rollout worker 2 uses device cpu |
| [2023-07-17 06:20:14,621][348097] Rollout worker 3 uses device cpu |
| [2023-07-17 06:20:14,621][348097] Rollout worker 4 uses device cpu |
| [2023-07-17 06:20:14,621][348097] Rollout worker 5 uses device cpu |
| [2023-07-17 06:20:14,621][348097] Rollout worker 6 uses device cpu |
| [2023-07-17 06:20:14,621][348097] Rollout worker 7 uses device cpu |
| [2023-07-17 06:20:14,622][348097] In synchronous mode, we only accumulate one batch. Setting num_batches_to_accumulate to 1 |
| [2023-07-17 06:20:14,634][348097] InferenceWorker_p0-w0: min num requests: 2 |
| [2023-07-17 06:20:14,663][348097] Starting all processes... |
| [2023-07-17 06:20:14,663][348097] Starting process learner_proc0 |
| [2023-07-17 06:20:14,712][348097] Starting all processes... |
| [2023-07-17 06:20:14,757][348097] Starting process inference_proc0-0 |
| [2023-07-17 06:20:14,757][348097] Starting process rollout_proc0 |
| [2023-07-17 06:20:14,757][348097] Starting process rollout_proc1 |
| [2023-07-17 06:20:14,757][348097] Starting process rollout_proc2 |
| [2023-07-17 06:20:14,757][348097] Starting process rollout_proc3 |
| [2023-07-17 06:20:14,758][348097] Starting process rollout_proc4 |
| [2023-07-17 06:20:14,758][348097] Starting process rollout_proc5 |
| [2023-07-17 06:20:14,758][348097] Starting process rollout_proc6 |
| [2023-07-17 06:20:14,759][348097] Starting process rollout_proc7 |
| [2023-07-17 06:20:16,573][348338] Starting seed is not provided |
| [2023-07-17 06:20:16,574][348338] Initializing actor-critic model on device cpu |
| [2023-07-17 06:20:16,574][348338] RunningMeanStd input shape: (39,) |
| [2023-07-17 06:20:16,574][348338] RunningMeanStd input shape: (1,) |
| [2023-07-17 06:20:16,629][348338] Created Actor Critic model with architecture: |
| [2023-07-17 06:20:16,629][348338] ActorCriticSharedWeights( |
| (obs_normalizer): ObservationNormalizer( |
| (running_mean_std): RunningMeanStdDictInPlace( |
| (running_mean_std): ModuleDict( |
| (obs): RunningMeanStdInPlace() |
| ) |
| ) |
| ) |
| (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) |
| (encoder): MultiInputEncoder( |
| (encoders): ModuleDict( |
| (obs): MlpEncoder( |
| (mlp_head): RecursiveScriptModule( |
| original_name=Sequential |
| (0): RecursiveScriptModule(original_name=Linear) |
| (1): RecursiveScriptModule(original_name=Tanh) |
| (2): RecursiveScriptModule(original_name=Linear) |
| (3): RecursiveScriptModule(original_name=Tanh) |
| ) |
| ) |
| ) |
| ) |
| (core): ModelCoreIdentity() |
| (decoder): MlpDecoder( |
| (mlp): Identity() |
| ) |
| (critic_linear): Linear(in_features=64, out_features=1, bias=True) |
| (action_parameterization): ActionParameterizationContinuousNonAdaptiveStddev( |
| (distribution_linear): Linear(in_features=64, out_features=4, bias=True) |
| ) |
| ) |
| [2023-07-17 06:20:16,719][348385] Worker 2 uses CPU cores [8, 9, 10, 11] |
| [2023-07-17 06:20:16,814][348393] Worker 5 uses CPU cores [20, 21, 22, 23] |
| [2023-07-17 06:20:16,881][348493] Worker 6 uses CPU cores [24, 25, 26, 27] |
| [2023-07-17 06:20:16,930][348338] Using optimizer <class 'torch.optim.adam.Adam'> |
| [2023-07-17 06:20:16,930][348338] No checkpoints found |
| [2023-07-17 06:20:16,931][348338] Did not load from checkpoint, starting from scratch! |
| [2023-07-17 06:20:16,931][348338] Initialized policy 0 weights for model version 0 |
| [2023-07-17 06:20:16,932][348338] LearnerWorker_p0 finished initialization! |
| [2023-07-17 06:20:17,013][348383] Worker 0 uses CPU cores [0, 1, 2, 3] |
| [2023-07-17 06:20:17,035][348382] RunningMeanStd input shape: (39,) |
| [2023-07-17 06:20:17,036][348382] RunningMeanStd input shape: (1,) |
| [2023-07-17 06:20:17,091][348097] Inference worker 0-0 is ready! |
| [2023-07-17 06:20:17,091][348097] All inference workers are ready! Signal rollout workers to start! |
| [2023-07-17 06:20:17,167][348384] Worker 1 uses CPU cores [4, 5, 6, 7] |
| [2023-07-17 06:20:17,177][348387] Worker 4 uses CPU cores [16, 17, 18, 19] |
| [2023-07-17 06:20:17,313][348386] Worker 3 uses CPU cores [12, 13, 14, 15] |
| [2023-07-17 06:20:17,444][348451] Worker 7 uses CPU cores [28, 29, 30, 31] |
| [2023-07-17 06:20:17,652][348097] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) |
| [2023-07-17 06:20:18,338][348385] Decorrelating experience for 0 frames... |
| [2023-07-17 06:20:18,340][348493] Decorrelating experience for 0 frames... |
| [2023-07-17 06:20:18,346][348383] Decorrelating experience for 0 frames... |
| [2023-07-17 06:20:18,348][348393] Decorrelating experience for 0 frames... |
| [2023-07-17 06:20:18,350][348385] Decorrelating experience for 64 frames... |
| [2023-07-17 06:20:18,353][348493] Decorrelating experience for 64 frames... |
| [2023-07-17 06:20:18,359][348383] Decorrelating experience for 64 frames... |
| [2023-07-17 06:20:18,361][348393] Decorrelating experience for 64 frames... |
| [2023-07-17 06:20:18,390][348385] Decorrelating experience for 128 frames... |
| [2023-07-17 06:20:18,392][348493] Decorrelating experience for 128 frames... |
| [2023-07-17 06:20:18,398][348383] Decorrelating experience for 128 frames... |
| [2023-07-17 06:20:18,400][348393] Decorrelating experience for 128 frames... |
| [2023-07-17 06:20:18,455][348384] Decorrelating experience for 0 frames... |
| [2023-07-17 06:20:18,467][348385] Decorrelating experience for 192 frames... |
| [2023-07-17 06:20:18,467][348384] Decorrelating experience for 64 frames... |
| [2023-07-17 06:20:18,470][348493] Decorrelating experience for 192 frames... |
| [2023-07-17 06:20:18,476][348383] Decorrelating experience for 192 frames... |
| [2023-07-17 06:20:18,478][348393] Decorrelating experience for 192 frames... |
| [2023-07-17 06:20:18,491][348387] Decorrelating experience for 0 frames... |
| [2023-07-17 06:20:18,503][348387] Decorrelating experience for 64 frames... |
| [2023-07-17 06:20:18,507][348384] Decorrelating experience for 128 frames... |
| [2023-07-17 06:20:18,542][348387] Decorrelating experience for 128 frames... |
| [2023-07-17 06:20:18,584][348384] Decorrelating experience for 192 frames... |
| [2023-07-17 06:20:18,609][348386] Decorrelating experience for 0 frames... |
| [2023-07-17 06:20:18,620][348387] Decorrelating experience for 192 frames... |
| [2023-07-17 06:20:18,621][348386] Decorrelating experience for 64 frames... |
| [2023-07-17 06:20:18,662][348386] Decorrelating experience for 128 frames... |
| [2023-07-17 06:20:18,740][348386] Decorrelating experience for 192 frames... |
| [2023-07-17 06:20:18,752][348451] Decorrelating experience for 0 frames... |
| [2023-07-17 06:20:18,765][348451] Decorrelating experience for 64 frames... |
| [2023-07-17 06:20:18,804][348451] Decorrelating experience for 128 frames... |
| [2023-07-17 06:20:18,883][348451] Decorrelating experience for 192 frames... |
| [2023-07-17 06:20:19,722][348383] Decorrelating experience for 256 frames... |
| [2023-07-17 06:20:19,724][348493] Decorrelating experience for 256 frames... |
| [2023-07-17 06:20:19,725][348385] Decorrelating experience for 256 frames... |
| [2023-07-17 06:20:19,729][348393] Decorrelating experience for 256 frames... |
| [2023-07-17 06:20:19,828][348384] Decorrelating experience for 256 frames... |
| [2023-07-17 06:20:19,857][348387] Decorrelating experience for 256 frames... |
| [2023-07-17 06:20:19,865][348383] Decorrelating experience for 320 frames... |
| [2023-07-17 06:20:19,867][348385] Decorrelating experience for 320 frames... |
| [2023-07-17 06:20:19,868][348493] Decorrelating experience for 320 frames... |
| [2023-07-17 06:20:19,872][348393] Decorrelating experience for 320 frames... |
| [2023-07-17 06:20:19,971][348384] Decorrelating experience for 320 frames... |
| [2023-07-17 06:20:19,988][348386] Decorrelating experience for 256 frames... |
| [2023-07-17 06:20:20,000][348387] Decorrelating experience for 320 frames... |
| [2023-07-17 06:20:20,046][348383] Decorrelating experience for 384 frames... |
| [2023-07-17 06:20:20,048][348385] Decorrelating experience for 384 frames... |
| [2023-07-17 06:20:20,050][348493] Decorrelating experience for 384 frames... |
| [2023-07-17 06:20:20,053][348393] Decorrelating experience for 384 frames... |
| [2023-07-17 06:20:20,121][348451] Decorrelating experience for 256 frames... |
| [2023-07-17 06:20:20,133][348386] Decorrelating experience for 320 frames... |
| [2023-07-17 06:20:20,151][348384] Decorrelating experience for 384 frames... |
| [2023-07-17 06:20:20,181][348387] Decorrelating experience for 384 frames... |
| [2023-07-17 06:20:20,253][348383] Decorrelating experience for 448 frames... |
| [2023-07-17 06:20:20,255][348385] Decorrelating experience for 448 frames... |
| [2023-07-17 06:20:20,257][348493] Decorrelating experience for 448 frames... |
| [2023-07-17 06:20:20,259][348393] Decorrelating experience for 448 frames... |
| [2023-07-17 06:20:20,263][348451] Decorrelating experience for 320 frames... |
| [2023-07-17 06:20:20,315][348386] Decorrelating experience for 384 frames... |
| [2023-07-17 06:20:20,358][348384] Decorrelating experience for 448 frames... |
| [2023-07-17 06:20:20,388][348387] Decorrelating experience for 448 frames... |
| [2023-07-17 06:20:20,444][348451] Decorrelating experience for 384 frames... |
| [2023-07-17 06:20:20,524][348386] Decorrelating experience for 448 frames... |
| [2023-07-17 06:20:20,652][348451] Decorrelating experience for 448 frames... |
| [2023-07-17 06:20:22,652][348097] Fps is (10 sec: 2457.7, 60 sec: 2457.7, 300 sec: 2457.7). Total num frames: 12288. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 06:20:22,652][348097] Avg episode reward: [(0, '58.146')] |
| [2023-07-17 06:20:25,073][348382] Updated weights for policy 0, policy_version 80 (0.0006) |
| [2023-07-17 06:20:27,652][348097] Fps is (10 sec: 6553.6, 60 sec: 6553.6, 300 sec: 6553.6). Total num frames: 65536. Throughput: 0: 5215.2. Samples: 52152. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:20:27,652][348097] Avg episode reward: [(0, '97.208')] |
| [2023-07-17 06:20:27,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000000128_65536.pth... |
| [2023-07-17 06:20:29,076][348382] Updated weights for policy 0, policy_version 160 (0.0005) |
| [2023-07-17 06:20:32,652][348097] Fps is (10 sec: 10239.9, 60 sec: 7645.9, 300 sec: 7645.9). Total num frames: 114688. Throughput: 0: 7601.4. Samples: 114020. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 06:20:32,652][348097] Avg episode reward: [(0, '199.163')] |
| [2023-07-17 06:20:32,658][348338] Saving new best policy, reward=199.163! |
| [2023-07-17 06:20:33,086][348382] Updated weights for policy 0, policy_version 240 (0.0005) |
| [2023-07-17 06:20:34,628][348097] Heartbeat connected on Batcher_0 |
| [2023-07-17 06:20:34,630][348097] Heartbeat connected on LearnerWorker_p0 |
| [2023-07-17 06:20:34,635][348097] Heartbeat connected on InferenceWorker_p0-w0 |
| [2023-07-17 06:20:34,642][348097] Heartbeat connected on RolloutWorker_w0 |
| [2023-07-17 06:20:34,644][348097] Heartbeat connected on RolloutWorker_w1 |
| [2023-07-17 06:20:34,650][348097] Heartbeat connected on RolloutWorker_w2 |
| [2023-07-17 06:20:34,653][348097] Heartbeat connected on RolloutWorker_w3 |
| [2023-07-17 06:20:34,656][348097] Heartbeat connected on RolloutWorker_w4 |
| [2023-07-17 06:20:34,658][348097] Heartbeat connected on RolloutWorker_w5 |
| [2023-07-17 06:20:34,661][348097] Heartbeat connected on RolloutWorker_w6 |
| [2023-07-17 06:20:34,664][348097] Heartbeat connected on RolloutWorker_w7 |
| [2023-07-17 06:20:37,340][348382] Updated weights for policy 0, policy_version 320 (0.0005) |
| [2023-07-17 06:20:37,652][348097] Fps is (10 sec: 9830.5, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 163840. Throughput: 0: 7118.0. Samples: 142360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:20:37,653][348097] Avg episode reward: [(0, '391.902')] |
| [2023-07-17 06:20:37,654][348338] Saving new best policy, reward=391.902! |
| [2023-07-17 06:20:41,717][348382] Updated weights for policy 0, policy_version 400 (0.0005) |
| [2023-07-17 06:20:42,652][348097] Fps is (10 sec: 9830.3, 60 sec: 8519.7, 300 sec: 8519.7). Total num frames: 212992. Throughput: 0: 7961.6. Samples: 199040. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:20:42,653][348097] Avg episode reward: [(0, '488.149')] |
| [2023-07-17 06:20:42,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000000416_212992.pth... |
| [2023-07-17 06:20:42,657][348338] Saving new best policy, reward=488.149! |
| [2023-07-17 06:20:46,167][348382] Updated weights for policy 0, policy_version 480 (0.0006) |
| [2023-07-17 06:20:47,652][348097] Fps is (10 sec: 9420.8, 60 sec: 8601.6, 300 sec: 8601.6). Total num frames: 258048. Throughput: 0: 8489.2. Samples: 254676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:20:47,652][348097] Avg episode reward: [(0, '518.442')] |
| [2023-07-17 06:20:47,653][348338] Saving new best policy, reward=518.442! |
| [2023-07-17 06:20:50,484][348382] Updated weights for policy 0, policy_version 560 (0.0006) |
| [2023-07-17 06:20:52,652][348097] Fps is (10 sec: 9421.0, 60 sec: 8777.2, 300 sec: 8777.2). Total num frames: 307200. Throughput: 0: 8089.4. Samples: 283128. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:20:52,652][348097] Avg episode reward: [(0, '480.916')] |
| [2023-07-17 06:20:54,645][348382] Updated weights for policy 0, policy_version 640 (0.0005) |
| [2023-07-17 06:20:57,652][348097] Fps is (10 sec: 9830.4, 60 sec: 8908.8, 300 sec: 8908.8). Total num frames: 356352. Throughput: 0: 8557.9. Samples: 342316. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:20:57,652][348097] Avg episode reward: [(0, '533.466')] |
| [2023-07-17 06:20:57,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000000696_356352.pth... |
| [2023-07-17 06:20:57,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000000128_65536.pth |
| [2023-07-17 06:20:57,658][348338] Saving new best policy, reward=533.466! |
| [2023-07-17 06:20:58,809][348382] Updated weights for policy 0, policy_version 720 (0.0004) |
| [2023-07-17 06:21:02,652][348097] Fps is (10 sec: 9830.4, 60 sec: 9011.2, 300 sec: 9011.2). Total num frames: 405504. Throughput: 0: 8921.6. Samples: 401472. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:21:02,652][348097] Avg episode reward: [(0, '519.759')] |
| [2023-07-17 06:21:02,925][348382] Updated weights for policy 0, policy_version 800 (0.0005) |
| [2023-07-17 06:21:07,049][348382] Updated weights for policy 0, policy_version 880 (0.0004) |
| [2023-07-17 06:21:07,652][348097] Fps is (10 sec: 9830.4, 60 sec: 9093.1, 300 sec: 9093.1). Total num frames: 454656. Throughput: 0: 9592.0. Samples: 431640. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:21:07,652][348097] Avg episode reward: [(0, '558.310')] |
| [2023-07-17 06:21:07,653][348338] Saving new best policy, reward=558.310! |
| [2023-07-17 06:21:11,136][348382] Updated weights for policy 0, policy_version 960 (0.0004) |
| [2023-07-17 06:21:12,652][348097] Fps is (10 sec: 9830.3, 60 sec: 9160.1, 300 sec: 9160.1). Total num frames: 503808. Throughput: 0: 9763.9. Samples: 491528. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 06:21:12,652][348097] Avg episode reward: [(0, '548.310')] |
| [2023-07-17 06:21:12,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000000984_503808.pth... |
| [2023-07-17 06:21:12,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000000416_212992.pth |
| [2023-07-17 06:21:15,206][348382] Updated weights for policy 0, policy_version 1040 (0.0005) |
| [2023-07-17 06:21:17,652][348097] Fps is (10 sec: 9830.4, 60 sec: 9216.0, 300 sec: 9216.0). Total num frames: 552960. Throughput: 0: 9720.6. Samples: 551448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:21:17,653][348097] Avg episode reward: [(0, '554.198')] |
| [2023-07-17 06:21:19,409][348382] Updated weights for policy 0, policy_version 1120 (0.0005) |
| [2023-07-17 06:21:22,652][348097] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9263.3). Total num frames: 602112. Throughput: 0: 9740.4. Samples: 580676. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:21:22,653][348097] Avg episode reward: [(0, '552.931')] |
| [2023-07-17 06:21:23,496][348382] Updated weights for policy 0, policy_version 1200 (0.0005) |
| [2023-07-17 06:21:27,650][348382] Updated weights for policy 0, policy_version 1280 (0.0005) |
| [2023-07-17 06:21:27,652][348097] Fps is (10 sec: 10239.9, 60 sec: 9830.4, 300 sec: 9362.3). Total num frames: 655360. Throughput: 0: 9803.9. Samples: 640212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:21:27,652][348097] Avg episode reward: [(0, '554.815')] |
| [2023-07-17 06:21:27,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000001280_655360.pth... |
| [2023-07-17 06:21:27,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000000696_356352.pth |
| [2023-07-17 06:21:31,864][348382] Updated weights for policy 0, policy_version 1360 (0.0005) |
| [2023-07-17 06:21:32,652][348097] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9338.9). Total num frames: 700416. Throughput: 0: 9872.0. Samples: 698916. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:21:32,653][348097] Avg episode reward: [(0, '565.550')] |
| [2023-07-17 06:21:32,654][348338] Saving new best policy, reward=565.550! |
| [2023-07-17 06:21:35,996][348382] Updated weights for policy 0, policy_version 1440 (0.0005) |
| [2023-07-17 06:21:37,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9369.6). Total num frames: 749568. Throughput: 0: 9903.5. Samples: 728784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:21:37,653][348097] Avg episode reward: [(0, '566.683')] |
| [2023-07-17 06:21:37,661][348338] Saving new best policy, reward=566.683! |
| [2023-07-17 06:21:40,234][348382] Updated weights for policy 0, policy_version 1520 (0.0005) |
| [2023-07-17 06:21:42,652][348097] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9396.7). Total num frames: 798720. Throughput: 0: 9873.3. Samples: 786616. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:21:42,653][348097] Avg episode reward: [(0, '568.909')] |
| [2023-07-17 06:21:42,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000001560_798720.pth... |
| [2023-07-17 06:21:42,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000000984_503808.pth |
| [2023-07-17 06:21:42,659][348338] Saving new best policy, reward=568.909! |
| [2023-07-17 06:21:44,418][348382] Updated weights for policy 0, policy_version 1600 (0.0005) |
| [2023-07-17 06:21:47,652][348097] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9420.8). Total num frames: 847872. Throughput: 0: 9861.9. Samples: 845260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:21:47,652][348097] Avg episode reward: [(0, '566.440')] |
| [2023-07-17 06:21:48,635][348382] Updated weights for policy 0, policy_version 1680 (0.0005) |
| [2023-07-17 06:21:52,652][348097] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9442.4). Total num frames: 897024. Throughput: 0: 9837.9. Samples: 874348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:21:52,652][348097] Avg episode reward: [(0, '559.340')] |
| [2023-07-17 06:21:52,877][348382] Updated weights for policy 0, policy_version 1760 (0.0005) |
| [2023-07-17 06:21:57,018][348382] Updated weights for policy 0, policy_version 1840 (0.0005) |
| [2023-07-17 06:21:57,652][348097] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9461.8). Total num frames: 946176. Throughput: 0: 9814.1. Samples: 933164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:21:57,652][348097] Avg episode reward: [(0, '570.584')] |
| [2023-07-17 06:21:57,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000001848_946176.pth... |
| [2023-07-17 06:21:57,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000001280_655360.pth |
| [2023-07-17 06:21:57,658][348338] Saving new best policy, reward=570.584! |
| [2023-07-17 06:22:01,194][348382] Updated weights for policy 0, policy_version 1920 (0.0004) |
| [2023-07-17 06:22:02,652][348097] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9479.3). Total num frames: 995328. Throughput: 0: 9799.6. Samples: 992432. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 06:22:02,652][348097] Avg episode reward: [(0, '563.434')] |
| [2023-07-17 06:22:05,251][348382] Updated weights for policy 0, policy_version 2000 (0.0004) |
| [2023-07-17 06:22:07,652][348097] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9495.3). Total num frames: 1044480. Throughput: 0: 9823.6. Samples: 1022740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:22:07,652][348097] Avg episode reward: [(0, '573.878')] |
| [2023-07-17 06:22:07,653][348338] Saving new best policy, reward=573.878! |
| [2023-07-17 06:22:09,602][348382] Updated weights for policy 0, policy_version 2080 (0.0006) |
| [2023-07-17 06:22:12,652][348097] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9474.2). Total num frames: 1089536. Throughput: 0: 9726.9. Samples: 1077924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:22:12,652][348097] Avg episode reward: [(0, '571.711')] |
| [2023-07-17 06:22:12,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000002128_1089536.pth... |
| [2023-07-17 06:22:12,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000001560_798720.pth |
| [2023-07-17 06:22:13,967][348382] Updated weights for policy 0, policy_version 2160 (0.0005) |
| [2023-07-17 06:22:17,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9489.1). Total num frames: 1138688. Throughput: 0: 9706.1. Samples: 1135692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:22:17,652][348097] Avg episode reward: [(0, '574.705')] |
| [2023-07-17 06:22:17,653][348338] Saving new best policy, reward=574.705! |
| [2023-07-17 06:22:18,331][348382] Updated weights for policy 0, policy_version 2240 (0.0006) |
| [2023-07-17 06:22:22,543][348382] Updated weights for policy 0, policy_version 2320 (0.0005) |
| [2023-07-17 06:22:22,652][348097] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9502.7). Total num frames: 1187840. Throughput: 0: 9678.1. Samples: 1164300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:22:22,652][348097] Avg episode reward: [(0, '570.255')] |
| [2023-07-17 06:22:26,747][348382] Updated weights for policy 0, policy_version 2400 (0.0005) |
| [2023-07-17 06:22:27,652][348097] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9515.3). Total num frames: 1236992. Throughput: 0: 9690.3. Samples: 1222680. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:22:27,652][348097] Avg episode reward: [(0, '572.969')] |
| [2023-07-17 06:22:27,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000002416_1236992.pth... |
| [2023-07-17 06:22:27,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000001848_946176.pth |
| [2023-07-17 06:22:30,989][348382] Updated weights for policy 0, policy_version 2480 (0.0005) |
| [2023-07-17 06:22:32,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9496.7). Total num frames: 1282048. Throughput: 0: 9668.4. Samples: 1280336. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 06:22:32,652][348097] Avg episode reward: [(0, '572.836')] |
| [2023-07-17 06:22:35,489][348382] Updated weights for policy 0, policy_version 2560 (0.0006) |
| [2023-07-17 06:22:37,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9508.6). Total num frames: 1331200. Throughput: 0: 9623.9. Samples: 1307424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:22:37,652][348097] Avg episode reward: [(0, '575.494')] |
| [2023-07-17 06:22:37,653][348338] Saving new best policy, reward=575.494! |
| [2023-07-17 06:22:39,849][348382] Updated weights for policy 0, policy_version 2640 (0.0005) |
| [2023-07-17 06:22:42,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9491.4). Total num frames: 1376256. Throughput: 0: 9565.9. Samples: 1363628. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:22:42,652][348097] Avg episode reward: [(0, '571.795')] |
| [2023-07-17 06:22:42,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000002688_1376256.pth... |
| [2023-07-17 06:22:42,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000002128_1089536.pth |
| [2023-07-17 06:22:44,321][348382] Updated weights for policy 0, policy_version 2720 (0.0005) |
| [2023-07-17 06:22:47,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9475.4). Total num frames: 1421312. Throughput: 0: 9492.1. Samples: 1419576. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:22:47,652][348097] Avg episode reward: [(0, '573.999')] |
| [2023-07-17 06:22:48,640][348382] Updated weights for policy 0, policy_version 2800 (0.0005) |
| [2023-07-17 06:22:52,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9486.9). Total num frames: 1470464. Throughput: 0: 9435.3. Samples: 1447328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:22:52,652][348097] Avg episode reward: [(0, '574.171')] |
| [2023-07-17 06:22:53,110][348382] Updated weights for policy 0, policy_version 2880 (0.0005) |
| [2023-07-17 06:22:57,554][348382] Updated weights for policy 0, policy_version 2960 (0.0006) |
| [2023-07-17 06:22:57,652][348097] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9472.0). Total num frames: 1515520. Throughput: 0: 9441.2. Samples: 1502780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:22:57,653][348097] Avg episode reward: [(0, '568.999')] |
| [2023-07-17 06:22:57,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000002960_1515520.pth... |
| [2023-07-17 06:22:57,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000002416_1236992.pth |
| [2023-07-17 06:23:01,945][348382] Updated weights for policy 0, policy_version 3040 (0.0005) |
| [2023-07-17 06:23:02,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9458.0). Total num frames: 1560576. Throughput: 0: 9387.4. Samples: 1558124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:23:02,652][348097] Avg episode reward: [(0, '570.652')] |
| [2023-07-17 06:23:06,372][348382] Updated weights for policy 0, policy_version 3120 (0.0005) |
| [2023-07-17 06:23:07,652][348097] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9469.0). Total num frames: 1609728. Throughput: 0: 9357.1. Samples: 1585372. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:23:07,652][348097] Avg episode reward: [(0, '574.476')] |
| [2023-07-17 06:23:10,541][348382] Updated weights for policy 0, policy_version 3200 (0.0005) |
| [2023-07-17 06:23:12,652][348097] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9455.9). Total num frames: 1654784. Throughput: 0: 9357.4. Samples: 1643764. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 06:23:12,652][348097] Avg episode reward: [(0, '577.451')] |
| [2023-07-17 06:23:12,686][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000003240_1658880.pth... |
| [2023-07-17 06:23:12,688][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000002688_1376256.pth |
| [2023-07-17 06:23:12,689][348338] Saving new best policy, reward=577.451! |
| [2023-07-17 06:23:14,797][348382] Updated weights for policy 0, policy_version 3280 (0.0005) |
| [2023-07-17 06:23:17,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9466.3). Total num frames: 1703936. Throughput: 0: 9368.0. Samples: 1701896. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 06:23:17,652][348097] Avg episode reward: [(0, '580.050')] |
| [2023-07-17 06:23:17,653][348338] Saving new best policy, reward=580.050! |
| [2023-07-17 06:23:19,046][348382] Updated weights for policy 0, policy_version 3360 (0.0005) |
| [2023-07-17 06:23:22,652][348097] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9476.2). Total num frames: 1753088. Throughput: 0: 9406.1. Samples: 1730700. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:23:22,652][348097] Avg episode reward: [(0, '582.970')] |
| [2023-07-17 06:23:22,653][348338] Saving new best policy, reward=582.970! |
| [2023-07-17 06:23:23,187][348382] Updated weights for policy 0, policy_version 3440 (0.0004) |
| [2023-07-17 06:23:27,399][348382] Updated weights for policy 0, policy_version 3520 (0.0004) |
| [2023-07-17 06:23:27,652][348097] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9485.5). Total num frames: 1802240. Throughput: 0: 9475.3. Samples: 1790016. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 06:23:27,652][348097] Avg episode reward: [(0, '584.272')] |
| [2023-07-17 06:23:27,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000003520_1802240.pth... |
| [2023-07-17 06:23:27,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000002960_1515520.pth |
| [2023-07-17 06:23:27,659][348338] Saving new best policy, reward=584.272! |
| [2023-07-17 06:23:31,595][348382] Updated weights for policy 0, policy_version 3600 (0.0005) |
| [2023-07-17 06:23:32,652][348097] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9494.3). Total num frames: 1851392. Throughput: 0: 9520.0. Samples: 1847976. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 06:23:32,653][348097] Avg episode reward: [(0, '586.085')] |
| [2023-07-17 06:23:32,653][348338] Saving new best policy, reward=586.085! |
| [2023-07-17 06:23:36,031][348382] Updated weights for policy 0, policy_version 3680 (0.0005) |
| [2023-07-17 06:23:37,652][348097] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9482.2). Total num frames: 1896448. Throughput: 0: 9525.5. Samples: 1875976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:23:37,652][348097] Avg episode reward: [(0, '586.733')] |
| [2023-07-17 06:23:37,653][348338] Saving new best policy, reward=586.733! |
| [2023-07-17 06:23:40,524][348382] Updated weights for policy 0, policy_version 3760 (0.0005) |
| [2023-07-17 06:23:42,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9470.8). Total num frames: 1941504. Throughput: 0: 9505.5. Samples: 1930528. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 06:23:42,653][348097] Avg episode reward: [(0, '583.266')] |
| [2023-07-17 06:23:42,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000003792_1941504.pth... |
| [2023-07-17 06:23:42,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000003240_1658880.pth |
| [2023-07-17 06:23:45,004][348382] Updated weights for policy 0, policy_version 3840 (0.0005) |
| [2023-07-17 06:23:47,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9459.8). Total num frames: 1986560. Throughput: 0: 9497.3. Samples: 1985500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:23:47,653][348097] Avg episode reward: [(0, '576.964')] |
| [2023-07-17 06:23:49,495][348382] Updated weights for policy 0, policy_version 3920 (0.0005) |
| [2023-07-17 06:23:52,652][348097] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9468.4). Total num frames: 2035712. Throughput: 0: 9493.1. Samples: 2012560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:23:52,653][348097] Avg episode reward: [(0, '581.536')] |
| [2023-07-17 06:23:53,978][348382] Updated weights for policy 0, policy_version 4000 (0.0005) |
| [2023-07-17 06:23:57,652][348097] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9458.0). Total num frames: 2080768. Throughput: 0: 9414.0. Samples: 2067396. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 06:23:57,652][348097] Avg episode reward: [(0, '572.926')] |
| [2023-07-17 06:23:57,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000004064_2080768.pth... |
| [2023-07-17 06:23:57,657][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000003520_1802240.pth |
| [2023-07-17 06:23:58,564][348382] Updated weights for policy 0, policy_version 4080 (0.0005) |
| [2023-07-17 06:24:02,652][348097] Fps is (10 sec: 8601.6, 60 sec: 9352.5, 300 sec: 9429.9). Total num frames: 2121728. Throughput: 0: 9306.0. Samples: 2120668. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 06:24:02,653][348097] Avg episode reward: [(0, '565.815')] |
| [2023-07-17 06:24:03,152][348382] Updated weights for policy 0, policy_version 4160 (0.0005) |
| [2023-07-17 06:24:07,604][348382] Updated weights for policy 0, policy_version 4240 (0.0005) |
| [2023-07-17 06:24:07,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9438.6). Total num frames: 2170880. Throughput: 0: 9282.7. Samples: 2148424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:24:07,653][348097] Avg episode reward: [(0, '577.042')] |
| [2023-07-17 06:24:12,040][348382] Updated weights for policy 0, policy_version 4320 (0.0004) |
| [2023-07-17 06:24:12,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9429.5). Total num frames: 2215936. Throughput: 0: 9182.7. Samples: 2203240. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 06:24:12,653][348097] Avg episode reward: [(0, '585.144')] |
| [2023-07-17 06:24:12,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000004328_2215936.pth... |
| [2023-07-17 06:24:12,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000003792_1941504.pth |
| [2023-07-17 06:24:16,550][348382] Updated weights for policy 0, policy_version 4400 (0.0004) |
| [2023-07-17 06:24:17,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9420.8). Total num frames: 2260992. Throughput: 0: 9101.0. Samples: 2257520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:24:17,653][348097] Avg episode reward: [(0, '588.790')] |
| [2023-07-17 06:24:17,653][348338] Saving new best policy, reward=588.790! |
| [2023-07-17 06:24:21,207][348382] Updated weights for policy 0, policy_version 4480 (0.0005) |
| [2023-07-17 06:24:22,652][348097] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9412.4). Total num frames: 2306048. Throughput: 0: 9085.4. Samples: 2284820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:24:22,652][348097] Avg episode reward: [(0, '586.389')] |
| [2023-07-17 06:24:25,868][348382] Updated weights for policy 0, policy_version 4560 (0.0005) |
| [2023-07-17 06:24:27,652][348097] Fps is (10 sec: 8601.6, 60 sec: 9079.5, 300 sec: 9388.0). Total num frames: 2347008. Throughput: 0: 9035.7. Samples: 2337132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:24:27,652][348097] Avg episode reward: [(0, '581.783')] |
| [2023-07-17 06:24:27,662][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000004592_2351104.pth... |
| [2023-07-17 06:24:27,664][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000004064_2080768.pth |
| [2023-07-17 06:24:30,384][348382] Updated weights for policy 0, policy_version 4640 (0.0005) |
| [2023-07-17 06:24:32,652][348097] Fps is (10 sec: 9011.1, 60 sec: 9079.5, 300 sec: 9396.7). Total num frames: 2396160. Throughput: 0: 9029.5. Samples: 2391828. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 06:24:32,652][348097] Avg episode reward: [(0, '591.940')] |
| [2023-07-17 06:24:32,653][348338] Saving new best policy, reward=591.940! |
| [2023-07-17 06:24:34,925][348382] Updated weights for policy 0, policy_version 4720 (0.0005) |
| [2023-07-17 06:24:37,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9373.5). Total num frames: 2437120. Throughput: 0: 9020.4. Samples: 2418476. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:24:37,652][348097] Avg episode reward: [(0, '586.946')] |
| [2023-07-17 06:24:39,582][348382] Updated weights for policy 0, policy_version 4800 (0.0005) |
| [2023-07-17 06:24:42,652][348097] Fps is (10 sec: 8601.6, 60 sec: 9011.2, 300 sec: 9366.7). Total num frames: 2482176. Throughput: 0: 8980.9. Samples: 2471536. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:24:42,652][348097] Avg episode reward: [(0, '580.113')] |
| [2023-07-17 06:24:42,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000004848_2482176.pth... |
| [2023-07-17 06:24:42,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000004328_2215936.pth |
| [2023-07-17 06:24:44,110][348382] Updated weights for policy 0, policy_version 4880 (0.0005) |
| [2023-07-17 06:24:47,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9360.1). Total num frames: 2527232. Throughput: 0: 8995.1. Samples: 2525448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:24:47,653][348097] Avg episode reward: [(0, '583.937')] |
| [2023-07-17 06:24:48,695][348382] Updated weights for policy 0, policy_version 4960 (0.0005) |
| [2023-07-17 06:24:52,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9353.8). Total num frames: 2572288. Throughput: 0: 8967.6. Samples: 2551968. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 06:24:52,652][348097] Avg episode reward: [(0, '587.156')] |
| [2023-07-17 06:24:53,379][348382] Updated weights for policy 0, policy_version 5040 (0.0005) |
| [2023-07-17 06:24:57,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9347.7). Total num frames: 2617344. Throughput: 0: 8916.8. Samples: 2604496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:24:57,653][348097] Avg episode reward: [(0, '585.670')] |
| [2023-07-17 06:24:57,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000005112_2617344.pth... |
| [2023-07-17 06:24:57,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000004592_2351104.pth |
| [2023-07-17 06:24:58,110][348382] Updated weights for policy 0, policy_version 5120 (0.0005) |
| [2023-07-17 06:25:02,652][348097] Fps is (10 sec: 8601.6, 60 sec: 8942.9, 300 sec: 9327.4). Total num frames: 2658304. Throughput: 0: 8874.0. Samples: 2656848. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:25:02,652][348097] Avg episode reward: [(0, '585.996')] |
| [2023-07-17 06:25:02,772][348382] Updated weights for policy 0, policy_version 5200 (0.0006) |
| [2023-07-17 06:25:07,237][348382] Updated weights for policy 0, policy_version 5280 (0.0005) |
| [2023-07-17 06:25:07,652][348097] Fps is (10 sec: 8601.7, 60 sec: 8874.7, 300 sec: 9321.9). Total num frames: 2703360. Throughput: 0: 8867.2. Samples: 2683844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:25:07,652][348097] Avg episode reward: [(0, '577.949')] |
| [2023-07-17 06:25:11,941][348382] Updated weights for policy 0, policy_version 5360 (0.0005) |
| [2023-07-17 06:25:12,652][348097] Fps is (10 sec: 9011.3, 60 sec: 8874.7, 300 sec: 9316.7). Total num frames: 2748416. Throughput: 0: 8892.2. Samples: 2737280. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 06:25:12,652][348097] Avg episode reward: [(0, '585.154')] |
| [2023-07-17 06:25:12,654][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000005368_2748416.pth... |
| [2023-07-17 06:25:12,656][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000004848_2482176.pth |
| [2023-07-17 06:25:16,627][348382] Updated weights for policy 0, policy_version 5440 (0.0006) |
| [2023-07-17 06:25:17,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8874.7, 300 sec: 9427.7). Total num frames: 2793472. Throughput: 0: 8835.8. Samples: 2789440. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 06:25:17,652][348097] Avg episode reward: [(0, '584.118')] |
| [2023-07-17 06:25:21,381][348382] Updated weights for policy 0, policy_version 5520 (0.0006) |
| [2023-07-17 06:25:22,652][348097] Fps is (10 sec: 8601.6, 60 sec: 8806.4, 300 sec: 9386.1). Total num frames: 2834432. Throughput: 0: 8823.5. Samples: 2815532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:25:22,652][348097] Avg episode reward: [(0, '583.019')] |
| [2023-07-17 06:25:26,115][348382] Updated weights for policy 0, policy_version 5600 (0.0006) |
| [2023-07-17 06:25:27,652][348097] Fps is (10 sec: 8601.5, 60 sec: 8874.7, 300 sec: 9372.2). Total num frames: 2879488. Throughput: 0: 8794.1. Samples: 2867272. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 06:25:27,652][348097] Avg episode reward: [(0, '580.990')] |
| [2023-07-17 06:25:27,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000005624_2879488.pth... |
| [2023-07-17 06:25:27,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000005112_2617344.pth |
| [2023-07-17 06:25:30,821][348382] Updated weights for policy 0, policy_version 5680 (0.0006) |
| [2023-07-17 06:25:32,652][348097] Fps is (10 sec: 8601.6, 60 sec: 8738.1, 300 sec: 9344.4). Total num frames: 2920448. Throughput: 0: 8757.0. Samples: 2919512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:25:32,652][348097] Avg episode reward: [(0, '585.625')] |
| [2023-07-17 06:25:35,564][348382] Updated weights for policy 0, policy_version 5760 (0.0005) |
| [2023-07-17 06:25:37,652][348097] Fps is (10 sec: 8601.7, 60 sec: 8806.4, 300 sec: 9330.6). Total num frames: 2965504. Throughput: 0: 8736.2. Samples: 2945096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:25:37,652][348097] Avg episode reward: [(0, '586.370')] |
| [2023-07-17 06:25:40,054][348382] Updated weights for policy 0, policy_version 5840 (0.0005) |
| [2023-07-17 06:25:42,652][348097] Fps is (10 sec: 9011.1, 60 sec: 8806.4, 300 sec: 9330.5). Total num frames: 3010560. Throughput: 0: 8785.2. Samples: 2999828. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 06:25:42,652][348097] Avg episode reward: [(0, '589.058')] |
| [2023-07-17 06:25:42,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000005880_3010560.pth... |
| [2023-07-17 06:25:42,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000005368_2748416.pth |
| [2023-07-17 06:25:44,532][348382] Updated weights for policy 0, policy_version 5920 (0.0005) |
| [2023-07-17 06:25:47,652][348097] Fps is (10 sec: 9420.8, 60 sec: 8874.7, 300 sec: 9330.5). Total num frames: 3059712. Throughput: 0: 8862.0. Samples: 3055636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:25:47,653][348097] Avg episode reward: [(0, '588.085')] |
| [2023-07-17 06:25:48,855][348382] Updated weights for policy 0, policy_version 6000 (0.0005) |
| [2023-07-17 06:25:52,652][348097] Fps is (10 sec: 9420.9, 60 sec: 8874.7, 300 sec: 9316.7). Total num frames: 3104768. Throughput: 0: 8900.2. Samples: 3084352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:25:52,653][348097] Avg episode reward: [(0, '579.653')] |
| [2023-07-17 06:25:53,259][348382] Updated weights for policy 0, policy_version 6080 (0.0005) |
| [2023-07-17 06:25:57,643][348382] Updated weights for policy 0, policy_version 6160 (0.0005) |
| [2023-07-17 06:25:57,652][348097] Fps is (10 sec: 9420.7, 60 sec: 8942.9, 300 sec: 9316.7). Total num frames: 3153920. Throughput: 0: 8948.2. Samples: 3139952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:25:57,652][348097] Avg episode reward: [(0, '581.511')] |
| [2023-07-17 06:25:57,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000006160_3153920.pth... |
| [2023-07-17 06:25:57,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000005624_2879488.pth |
| [2023-07-17 06:26:01,937][348382] Updated weights for policy 0, policy_version 6240 (0.0005) |
| [2023-07-17 06:26:02,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9011.2, 300 sec: 9302.8). Total num frames: 3198976. Throughput: 0: 9048.3. Samples: 3196612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:26:02,653][348097] Avg episode reward: [(0, '578.571')] |
| [2023-07-17 06:26:06,330][348382] Updated weights for policy 0, policy_version 6320 (0.0005) |
| [2023-07-17 06:26:07,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9288.9). Total num frames: 3244032. Throughput: 0: 9085.5. Samples: 3224380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:26:07,653][348097] Avg episode reward: [(0, '583.801')] |
| [2023-07-17 06:26:10,889][348382] Updated weights for policy 0, policy_version 6400 (0.0006) |
| [2023-07-17 06:26:12,652][348097] Fps is (10 sec: 9011.1, 60 sec: 9011.2, 300 sec: 9275.0). Total num frames: 3289088. Throughput: 0: 9151.8. Samples: 3279104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:26:12,652][348097] Avg episode reward: [(0, '589.026')] |
| [2023-07-17 06:26:12,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000006424_3289088.pth... |
| [2023-07-17 06:26:12,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000005880_3010560.pth |
| [2023-07-17 06:26:15,511][348382] Updated weights for policy 0, policy_version 6480 (0.0006) |
| [2023-07-17 06:26:17,652][348097] Fps is (10 sec: 9011.3, 60 sec: 9011.2, 300 sec: 9261.1). Total num frames: 3334144. Throughput: 0: 9170.8. Samples: 3332200. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:26:17,653][348097] Avg episode reward: [(0, '588.735')] |
| [2023-07-17 06:26:20,063][348382] Updated weights for policy 0, policy_version 6560 (0.0005) |
| [2023-07-17 06:26:22,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9247.2). Total num frames: 3383296. Throughput: 0: 9200.3. Samples: 3359112. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:26:22,653][348097] Avg episode reward: [(0, '586.451')] |
| [2023-07-17 06:26:24,426][348382] Updated weights for policy 0, policy_version 6640 (0.0005) |
| [2023-07-17 06:26:27,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9247.2). Total num frames: 3428352. Throughput: 0: 9245.0. Samples: 3415852. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:26:27,652][348097] Avg episode reward: [(0, '587.074')] |
| [2023-07-17 06:26:27,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000006696_3428352.pth... |
| [2023-07-17 06:26:27,657][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000006160_3153920.pth |
| [2023-07-17 06:26:28,910][348382] Updated weights for policy 0, policy_version 6720 (0.0005) |
| [2023-07-17 06:26:32,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9233.4). Total num frames: 3473408. Throughput: 0: 9212.2. Samples: 3470184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:26:32,652][348097] Avg episode reward: [(0, '584.543')] |
| [2023-07-17 06:26:33,305][348382] Updated weights for policy 0, policy_version 6800 (0.0005) |
| [2023-07-17 06:26:37,580][348382] Updated weights for policy 0, policy_version 6880 (0.0005) |
| [2023-07-17 06:26:37,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9233.4). Total num frames: 3522560. Throughput: 0: 9203.7. Samples: 3498520. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 06:26:37,653][348097] Avg episode reward: [(0, '577.768')] |
| [2023-07-17 06:26:42,026][348382] Updated weights for policy 0, policy_version 6960 (0.0005) |
| [2023-07-17 06:26:42,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9219.5). Total num frames: 3567616. Throughput: 0: 9232.2. Samples: 3555400. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 06:26:42,652][348097] Avg episode reward: [(0, '582.479')] |
| [2023-07-17 06:26:42,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000006968_3567616.pth... |
| [2023-07-17 06:26:42,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000006424_3289088.pth |
| [2023-07-17 06:26:46,507][348382] Updated weights for policy 0, policy_version 7040 (0.0005) |
| [2023-07-17 06:26:47,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9205.6). Total num frames: 3612672. Throughput: 0: 9194.5. Samples: 3610364. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:26:47,652][348097] Avg episode reward: [(0, '583.230')] |
| [2023-07-17 06:26:50,947][348382] Updated weights for policy 0, policy_version 7120 (0.0005) |
| [2023-07-17 06:26:52,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9191.7). Total num frames: 3657728. Throughput: 0: 9191.9. Samples: 3638016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:26:52,652][348097] Avg episode reward: [(0, '584.435')] |
| [2023-07-17 06:26:55,307][348382] Updated weights for policy 0, policy_version 7200 (0.0005) |
| [2023-07-17 06:26:57,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9177.8). Total num frames: 3702784. Throughput: 0: 9217.2. Samples: 3693880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:26:57,652][348097] Avg episode reward: [(0, '588.160')] |
| [2023-07-17 06:26:57,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000007240_3706880.pth... |
| [2023-07-17 06:26:57,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000006696_3428352.pth |
| [2023-07-17 06:26:59,915][348382] Updated weights for policy 0, policy_version 7280 (0.0005) |
| [2023-07-17 06:27:02,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9177.8). Total num frames: 3751936. Throughput: 0: 9236.4. Samples: 3747840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:27:02,652][348097] Avg episode reward: [(0, '587.152')] |
| [2023-07-17 06:27:04,453][348382] Updated weights for policy 0, policy_version 7360 (0.0005) |
| [2023-07-17 06:27:07,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9177.8). Total num frames: 3796992. Throughput: 0: 9227.4. Samples: 3774344. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 06:27:07,652][348097] Avg episode reward: [(0, '586.281')] |
| [2023-07-17 06:27:08,931][348382] Updated weights for policy 0, policy_version 7440 (0.0005) |
| [2023-07-17 06:27:12,652][348097] Fps is (10 sec: 9011.0, 60 sec: 9216.0, 300 sec: 9163.9). Total num frames: 3842048. Throughput: 0: 9190.3. Samples: 3829416. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 06:27:12,653][348097] Avg episode reward: [(0, '582.292')] |
| [2023-07-17 06:27:12,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000007504_3842048.pth... |
| [2023-07-17 06:27:12,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000006968_3567616.pth |
| [2023-07-17 06:27:13,426][348382] Updated weights for policy 0, policy_version 7520 (0.0005) |
| [2023-07-17 06:27:17,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9150.0). Total num frames: 3887104. Throughput: 0: 9189.5. Samples: 3883712. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 06:27:17,653][348097] Avg episode reward: [(0, '582.081')] |
| [2023-07-17 06:27:17,922][348382] Updated weights for policy 0, policy_version 7600 (0.0005) |
| [2023-07-17 06:27:22,515][348382] Updated weights for policy 0, policy_version 7680 (0.0006) |
| [2023-07-17 06:27:22,652][348097] Fps is (10 sec: 9011.3, 60 sec: 9147.7, 300 sec: 9136.2). Total num frames: 3932160. Throughput: 0: 9167.7. Samples: 3911068. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:27:22,662][348097] Avg episode reward: [(0, '585.786')] |
| [2023-07-17 06:27:27,137][348382] Updated weights for policy 0, policy_version 7760 (0.0006) |
| [2023-07-17 06:27:27,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9136.2). Total num frames: 3977216. Throughput: 0: 9088.9. Samples: 3964400. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:27:27,652][348097] Avg episode reward: [(0, '584.881')] |
| [2023-07-17 06:27:27,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000007768_3977216.pth... |
| [2023-07-17 06:27:27,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000007240_3706880.pth |
| [2023-07-17 06:27:31,685][348382] Updated weights for policy 0, policy_version 7840 (0.0006) |
| [2023-07-17 06:27:32,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9122.3). Total num frames: 4022272. Throughput: 0: 9062.9. Samples: 4018192. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:27:32,653][348097] Avg episode reward: [(0, '591.811')] |
| [2023-07-17 06:27:36,291][348382] Updated weights for policy 0, policy_version 7920 (0.0005) |
| [2023-07-17 06:27:37,652][348097] Fps is (10 sec: 8601.6, 60 sec: 9011.2, 300 sec: 9108.4). Total num frames: 4063232. Throughput: 0: 9033.2. Samples: 4044512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:27:37,653][348097] Avg episode reward: [(0, '581.459')] |
| [2023-07-17 06:27:40,808][348382] Updated weights for policy 0, policy_version 8000 (0.0006) |
| [2023-07-17 06:27:42,652][348097] Fps is (10 sec: 9011.1, 60 sec: 9079.5, 300 sec: 9122.3). Total num frames: 4112384. Throughput: 0: 9003.4. Samples: 4099032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:27:42,652][348097] Avg episode reward: [(0, '576.008')] |
| [2023-07-17 06:27:42,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000008032_4112384.pth... |
| [2023-07-17 06:27:42,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000007504_3842048.pth |
| [2023-07-17 06:27:45,281][348382] Updated weights for policy 0, policy_version 8080 (0.0005) |
| [2023-07-17 06:27:47,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9079.5, 300 sec: 9108.4). Total num frames: 4157440. Throughput: 0: 9028.8. Samples: 4154136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:27:47,653][348097] Avg episode reward: [(0, '581.559')] |
| [2023-07-17 06:27:49,688][348382] Updated weights for policy 0, policy_version 8160 (0.0005) |
| [2023-07-17 06:27:52,652][348097] Fps is (10 sec: 9011.3, 60 sec: 9079.5, 300 sec: 9108.4). Total num frames: 4202496. Throughput: 0: 9059.4. Samples: 4182016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:27:52,653][348097] Avg episode reward: [(0, '582.333')] |
| [2023-07-17 06:27:54,165][348382] Updated weights for policy 0, policy_version 8240 (0.0005) |
| [2023-07-17 06:27:57,652][348097] Fps is (10 sec: 9420.7, 60 sec: 9147.7, 300 sec: 9122.3). Total num frames: 4251648. Throughput: 0: 9064.9. Samples: 4237336. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:27:57,652][348097] Avg episode reward: [(0, '584.618')] |
| [2023-07-17 06:27:57,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000008304_4251648.pth... |
| [2023-07-17 06:27:57,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000007768_3977216.pth |
| [2023-07-17 06:27:58,527][348382] Updated weights for policy 0, policy_version 8320 (0.0005) |
| [2023-07-17 06:28:02,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9079.5, 300 sec: 9108.4). Total num frames: 4296704. Throughput: 0: 9111.3. Samples: 4293720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:28:02,652][348097] Avg episode reward: [(0, '585.986')] |
| [2023-07-17 06:28:02,832][348382] Updated weights for policy 0, policy_version 8400 (0.0005) |
| [2023-07-17 06:28:07,272][348382] Updated weights for policy 0, policy_version 8480 (0.0005) |
| [2023-07-17 06:28:07,652][348097] Fps is (10 sec: 9011.3, 60 sec: 9079.5, 300 sec: 9108.4). Total num frames: 4341760. Throughput: 0: 9139.2. Samples: 4322332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:28:07,652][348097] Avg episode reward: [(0, '582.754')] |
| [2023-07-17 06:28:11,871][348382] Updated weights for policy 0, policy_version 8560 (0.0005) |
| [2023-07-17 06:28:12,652][348097] Fps is (10 sec: 9011.1, 60 sec: 9079.5, 300 sec: 9094.5). Total num frames: 4386816. Throughput: 0: 9146.0. Samples: 4375972. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:28:12,652][348097] Avg episode reward: [(0, '587.467')] |
| [2023-07-17 06:28:12,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000008568_4386816.pth... |
| [2023-07-17 06:28:12,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000008032_4112384.pth |
| [2023-07-17 06:28:16,195][348382] Updated weights for policy 0, policy_version 8640 (0.0005) |
| [2023-07-17 06:28:17,652][348097] Fps is (10 sec: 9420.7, 60 sec: 9147.7, 300 sec: 9094.5). Total num frames: 4435968. Throughput: 0: 9200.1. Samples: 4432196. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:28:17,652][348097] Avg episode reward: [(0, '585.974')] |
| [2023-07-17 06:28:20,473][348382] Updated weights for policy 0, policy_version 8720 (0.0005) |
| [2023-07-17 06:28:22,652][348097] Fps is (10 sec: 9830.4, 60 sec: 9216.0, 300 sec: 9094.5). Total num frames: 4485120. Throughput: 0: 9255.4. Samples: 4461004. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:28:22,652][348097] Avg episode reward: [(0, '580.573')] |
| [2023-07-17 06:28:24,745][348382] Updated weights for policy 0, policy_version 8800 (0.0005) |
| [2023-07-17 06:28:27,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9080.6). Total num frames: 4530176. Throughput: 0: 9321.7. Samples: 4518508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:28:27,652][348097] Avg episode reward: [(0, '587.054')] |
| [2023-07-17 06:28:27,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000008848_4530176.pth... |
| [2023-07-17 06:28:27,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000008304_4251648.pth |
| [2023-07-17 06:28:29,096][348382] Updated weights for policy 0, policy_version 8880 (0.0005) |
| [2023-07-17 06:28:32,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9094.5). Total num frames: 4579328. Throughput: 0: 9357.9. Samples: 4575240. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 06:28:32,652][348097] Avg episode reward: [(0, '585.332')] |
| [2023-07-17 06:28:33,465][348382] Updated weights for policy 0, policy_version 8960 (0.0005) |
| [2023-07-17 06:28:37,652][348097] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9094.5). Total num frames: 4624384. Throughput: 0: 9374.9. Samples: 4603888. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 06:28:37,652][348097] Avg episode reward: [(0, '588.109')] |
| [2023-07-17 06:28:37,755][348382] Updated weights for policy 0, policy_version 9040 (0.0005) |
| [2023-07-17 06:28:41,976][348382] Updated weights for policy 0, policy_version 9120 (0.0005) |
| [2023-07-17 06:28:42,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9108.4). Total num frames: 4673536. Throughput: 0: 9421.9. Samples: 4661320. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 06:28:42,652][348097] Avg episode reward: [(0, '589.388')] |
| [2023-07-17 06:28:42,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000009128_4673536.pth... |
| [2023-07-17 06:28:42,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000008568_4386816.pth |
| [2023-07-17 06:28:46,411][348382] Updated weights for policy 0, policy_version 9200 (0.0005) |
| [2023-07-17 06:28:47,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9094.5). Total num frames: 4718592. Throughput: 0: 9415.6. Samples: 4717424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:28:47,652][348097] Avg episode reward: [(0, '588.081')] |
| [2023-07-17 06:28:50,750][348382] Updated weights for policy 0, policy_version 9280 (0.0005) |
| [2023-07-17 06:28:52,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9108.4). Total num frames: 4767744. Throughput: 0: 9397.2. Samples: 4745208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:28:52,652][348097] Avg episode reward: [(0, '585.168')] |
| [2023-07-17 06:28:55,332][348382] Updated weights for policy 0, policy_version 9360 (0.0005) |
| [2023-07-17 06:28:57,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9122.3). Total num frames: 4812800. Throughput: 0: 9423.4. Samples: 4800024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:28:57,652][348097] Avg episode reward: [(0, '586.877')] |
| [2023-07-17 06:28:57,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000009400_4812800.pth... |
| [2023-07-17 06:28:57,657][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000008848_4530176.pth |
| [2023-07-17 06:28:59,819][348382] Updated weights for policy 0, policy_version 9440 (0.0006) |
| [2023-07-17 06:29:02,652][348097] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9108.4). Total num frames: 4857856. Throughput: 0: 9368.4. Samples: 4853772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:29:02,652][348097] Avg episode reward: [(0, '591.802')] |
| [2023-07-17 06:29:04,467][348382] Updated weights for policy 0, policy_version 9520 (0.0006) |
| [2023-07-17 06:29:07,652][348097] Fps is (10 sec: 8601.6, 60 sec: 9284.3, 300 sec: 9094.5). Total num frames: 4898816. Throughput: 0: 9311.8. Samples: 4880036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:29:07,652][348097] Avg episode reward: [(0, '589.119')] |
| [2023-07-17 06:29:09,024][348382] Updated weights for policy 0, policy_version 9600 (0.0006) |
| [2023-07-17 06:29:12,652][348097] Fps is (10 sec: 8601.5, 60 sec: 9284.3, 300 sec: 9094.5). Total num frames: 4943872. Throughput: 0: 9242.7. Samples: 4934432. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 06:29:12,652][348097] Avg episode reward: [(0, '583.814')] |
| [2023-07-17 06:29:12,672][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000009664_4947968.pth... |
| [2023-07-17 06:29:12,674][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000009128_4673536.pth |
| [2023-07-17 06:29:13,571][348382] Updated weights for policy 0, policy_version 9680 (0.0006) |
| [2023-07-17 06:29:17,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9108.4). Total num frames: 4993024. Throughput: 0: 9193.2. Samples: 4988936. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:29:17,652][348097] Avg episode reward: [(0, '578.964')] |
| [2023-07-17 06:29:18,028][348382] Updated weights for policy 0, policy_version 9760 (0.0006) |
| [2023-07-17 06:29:22,381][348382] Updated weights for policy 0, policy_version 9840 (0.0005) |
| [2023-07-17 06:29:22,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9122.3). Total num frames: 5038080. Throughput: 0: 9186.9. Samples: 5017300. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:29:22,653][348097] Avg episode reward: [(0, '590.656')] |
| [2023-07-17 06:29:26,714][348382] Updated weights for policy 0, policy_version 9920 (0.0005) |
| [2023-07-17 06:29:27,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9122.3). Total num frames: 5087232. Throughput: 0: 9159.3. Samples: 5073488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:29:27,652][348097] Avg episode reward: [(0, '583.859')] |
| [2023-07-17 06:29:27,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000009936_5087232.pth... |
| [2023-07-17 06:29:27,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000009400_4812800.pth |
| [2023-07-17 06:29:31,103][348382] Updated weights for policy 0, policy_version 10000 (0.0006) |
| [2023-07-17 06:29:32,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9136.2). Total num frames: 5132288. Throughput: 0: 9160.3. Samples: 5129636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:29:32,653][348097] Avg episode reward: [(0, '591.675')] |
| [2023-07-17 06:29:35,505][348382] Updated weights for policy 0, policy_version 10080 (0.0005) |
| [2023-07-17 06:29:37,652][348097] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9136.2). Total num frames: 5177344. Throughput: 0: 9155.0. Samples: 5157184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:29:37,652][348097] Avg episode reward: [(0, '590.321')] |
| [2023-07-17 06:29:39,951][348382] Updated weights for policy 0, policy_version 10160 (0.0005) |
| [2023-07-17 06:29:42,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9150.0). Total num frames: 5226496. Throughput: 0: 9169.4. Samples: 5212648. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:29:42,652][348097] Avg episode reward: [(0, '586.493')] |
| [2023-07-17 06:29:42,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000010208_5226496.pth... |
| [2023-07-17 06:29:42,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000009664_4947968.pth |
| [2023-07-17 06:29:44,289][348382] Updated weights for policy 0, policy_version 10240 (0.0005) |
| [2023-07-17 06:29:47,652][348097] Fps is (10 sec: 9420.7, 60 sec: 9216.0, 300 sec: 9150.0). Total num frames: 5271552. Throughput: 0: 9236.7. Samples: 5269424. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:29:47,652][348097] Avg episode reward: [(0, '583.883')] |
| [2023-07-17 06:29:48,661][348382] Updated weights for policy 0, policy_version 10320 (0.0005) |
| [2023-07-17 06:29:52,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9150.1). Total num frames: 5316608. Throughput: 0: 9274.5. Samples: 5297388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:29:52,652][348097] Avg episode reward: [(0, '589.796')] |
| [2023-07-17 06:29:53,128][348382] Updated weights for policy 0, policy_version 10400 (0.0005) |
| [2023-07-17 06:29:57,494][348382] Updated weights for policy 0, policy_version 10480 (0.0005) |
| [2023-07-17 06:29:57,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9177.8). Total num frames: 5365760. Throughput: 0: 9294.8. Samples: 5352696. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:29:57,652][348097] Avg episode reward: [(0, '591.139')] |
| [2023-07-17 06:29:57,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000010480_5365760.pth... |
| [2023-07-17 06:29:57,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000009936_5087232.pth |
| [2023-07-17 06:30:01,828][348382] Updated weights for policy 0, policy_version 10560 (0.0005) |
| [2023-07-17 06:30:02,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9177.8). Total num frames: 5410816. Throughput: 0: 9349.2. Samples: 5409652. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:30:02,652][348097] Avg episode reward: [(0, '579.851')] |
| [2023-07-17 06:30:06,109][348382] Updated weights for policy 0, policy_version 10640 (0.0005) |
| [2023-07-17 06:30:07,652][348097] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9191.7). Total num frames: 5459968. Throughput: 0: 9362.8. Samples: 5438624. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:30:07,652][348097] Avg episode reward: [(0, '590.661')] |
| [2023-07-17 06:30:10,463][348382] Updated weights for policy 0, policy_version 10720 (0.0005) |
| [2023-07-17 06:30:12,652][348097] Fps is (10 sec: 9830.3, 60 sec: 9420.8, 300 sec: 9205.6). Total num frames: 5509120. Throughput: 0: 9366.5. Samples: 5494984. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:30:12,653][348097] Avg episode reward: [(0, '586.138')] |
| [2023-07-17 06:30:12,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000010760_5509120.pth... |
| [2023-07-17 06:30:12,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000010208_5226496.pth |
| [2023-07-17 06:30:14,735][348382] Updated weights for policy 0, policy_version 10800 (0.0005) |
| [2023-07-17 06:30:17,652][348097] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9219.5). Total num frames: 5554176. Throughput: 0: 9387.4. Samples: 5552068. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:30:17,653][348097] Avg episode reward: [(0, '588.264')] |
| [2023-07-17 06:30:19,093][348382] Updated weights for policy 0, policy_version 10880 (0.0005) |
| [2023-07-17 06:30:22,652][348097] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9233.4). Total num frames: 5603328. Throughput: 0: 9399.6. Samples: 5580168. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:30:22,652][348097] Avg episode reward: [(0, '589.250')] |
| [2023-07-17 06:30:23,424][348382] Updated weights for policy 0, policy_version 10960 (0.0005) |
| [2023-07-17 06:30:27,652][348097] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9247.2). Total num frames: 5648384. Throughput: 0: 9421.3. Samples: 5636608. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:30:27,653][348097] Avg episode reward: [(0, '584.820')] |
| [2023-07-17 06:30:27,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000011032_5648384.pth... |
| [2023-07-17 06:30:27,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000010480_5365760.pth |
| [2023-07-17 06:30:27,923][348382] Updated weights for policy 0, policy_version 11040 (0.0005) |
| [2023-07-17 06:30:32,529][348382] Updated weights for policy 0, policy_version 11120 (0.0005) |
| [2023-07-17 06:30:32,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9247.2). Total num frames: 5693440. Throughput: 0: 9337.6. Samples: 5689616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:30:32,653][348097] Avg episode reward: [(0, '593.442')] |
| [2023-07-17 06:30:32,653][348338] Saving new best policy, reward=593.442! |
| [2023-07-17 06:30:37,042][348382] Updated weights for policy 0, policy_version 11200 (0.0005) |
| [2023-07-17 06:30:37,652][348097] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9247.2). Total num frames: 5738496. Throughput: 0: 9333.2. Samples: 5717384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:30:37,652][348097] Avg episode reward: [(0, '592.328')] |
| [2023-07-17 06:30:41,537][348382] Updated weights for policy 0, policy_version 11280 (0.0005) |
| [2023-07-17 06:30:42,652][348097] Fps is (10 sec: 9011.1, 60 sec: 9284.2, 300 sec: 9233.4). Total num frames: 5783552. Throughput: 0: 9306.6. Samples: 5771492. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 06:30:42,652][348097] Avg episode reward: [(0, '592.386')] |
| [2023-07-17 06:30:42,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000011296_5783552.pth... |
| [2023-07-17 06:30:42,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000010760_5509120.pth |
| [2023-07-17 06:30:46,050][348382] Updated weights for policy 0, policy_version 11360 (0.0005) |
| [2023-07-17 06:30:47,652][348097] Fps is (10 sec: 9011.1, 60 sec: 9284.3, 300 sec: 9233.4). Total num frames: 5828608. Throughput: 0: 9268.4. Samples: 5826732. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 06:30:47,653][348097] Avg episode reward: [(0, '592.204')] |
| [2023-07-17 06:30:50,457][348382] Updated weights for policy 0, policy_version 11440 (0.0004) |
| [2023-07-17 06:30:52,652][348097] Fps is (10 sec: 9011.3, 60 sec: 9284.3, 300 sec: 9219.5). Total num frames: 5873664. Throughput: 0: 9240.6. Samples: 5854452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:30:52,653][348097] Avg episode reward: [(0, '586.189')] |
| [2023-07-17 06:30:55,048][348382] Updated weights for policy 0, policy_version 11520 (0.0005) |
| [2023-07-17 06:30:57,652][348097] Fps is (10 sec: 9011.1, 60 sec: 9216.0, 300 sec: 9219.5). Total num frames: 5918720. Throughput: 0: 9181.0. Samples: 5908128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:30:57,653][348097] Avg episode reward: [(0, '589.558')] |
| [2023-07-17 06:30:57,657][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000011560_5918720.pth... |
| [2023-07-17 06:30:57,660][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000011032_5648384.pth |
| [2023-07-17 06:30:59,472][348382] Updated weights for policy 0, policy_version 11600 (0.0005) |
| [2023-07-17 06:31:02,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9233.4). Total num frames: 5967872. Throughput: 0: 9149.3. Samples: 5963784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:31:02,652][348097] Avg episode reward: [(0, '584.046')] |
| [2023-07-17 06:31:03,862][348382] Updated weights for policy 0, policy_version 11680 (0.0004) |
| [2023-07-17 06:31:07,652][348097] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9233.4). Total num frames: 6012928. Throughput: 0: 9161.9. Samples: 5992452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:31:07,653][348097] Avg episode reward: [(0, '589.170')] |
| [2023-07-17 06:31:08,202][348382] Updated weights for policy 0, policy_version 11760 (0.0004) |
| [2023-07-17 06:31:12,585][348382] Updated weights for policy 0, policy_version 11840 (0.0005) |
| [2023-07-17 06:31:12,652][348097] Fps is (10 sec: 9420.7, 60 sec: 9216.0, 300 sec: 9247.2). Total num frames: 6062080. Throughput: 0: 9157.6. Samples: 6048700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:31:12,653][348097] Avg episode reward: [(0, '588.906')] |
| [2023-07-17 06:31:12,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000011840_6062080.pth... |
| [2023-07-17 06:31:12,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000011296_5783552.pth |
| [2023-07-17 06:31:16,910][348382] Updated weights for policy 0, policy_version 11920 (0.0005) |
| [2023-07-17 06:31:17,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9233.4). Total num frames: 6107136. Throughput: 0: 9233.9. Samples: 6105140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:31:17,653][348097] Avg episode reward: [(0, '585.143')] |
| [2023-07-17 06:31:21,382][348382] Updated weights for policy 0, policy_version 12000 (0.0005) |
| [2023-07-17 06:31:22,652][348097] Fps is (10 sec: 9011.3, 60 sec: 9147.7, 300 sec: 9233.4). Total num frames: 6152192. Throughput: 0: 9209.3. Samples: 6131804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:31:22,653][348097] Avg episode reward: [(0, '585.736')] |
| [2023-07-17 06:31:25,748][348382] Updated weights for policy 0, policy_version 12080 (0.0004) |
| [2023-07-17 06:31:27,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9247.2). Total num frames: 6201344. Throughput: 0: 9273.4. Samples: 6188796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:31:27,653][348097] Avg episode reward: [(0, '587.333')] |
| [2023-07-17 06:31:27,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000012112_6201344.pth... |
| [2023-07-17 06:31:27,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000011560_5918720.pth |
| [2023-07-17 06:31:30,106][348382] Updated weights for policy 0, policy_version 12160 (0.0004) |
| [2023-07-17 06:31:32,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9233.4). Total num frames: 6246400. Throughput: 0: 9284.7. Samples: 6244544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:31:32,653][348097] Avg episode reward: [(0, '594.522')] |
| [2023-07-17 06:31:32,653][348338] Saving new best policy, reward=594.522! |
| [2023-07-17 06:31:34,500][348382] Updated weights for policy 0, policy_version 12240 (0.0005) |
| [2023-07-17 06:31:37,652][348097] Fps is (10 sec: 9420.9, 60 sec: 9284.3, 300 sec: 9247.2). Total num frames: 6295552. Throughput: 0: 9299.7. Samples: 6272940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:31:37,653][348097] Avg episode reward: [(0, '584.661')] |
| [2023-07-17 06:31:38,874][348382] Updated weights for policy 0, policy_version 12320 (0.0005) |
| [2023-07-17 06:31:42,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9247.2). Total num frames: 6340608. Throughput: 0: 9354.2. Samples: 6329064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:31:42,653][348097] Avg episode reward: [(0, '584.060')] |
| [2023-07-17 06:31:42,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000012384_6340608.pth... |
| [2023-07-17 06:31:42,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000011840_6062080.pth |
| [2023-07-17 06:31:43,162][348382] Updated weights for policy 0, policy_version 12400 (0.0005) |
| [2023-07-17 06:31:47,462][348382] Updated weights for policy 0, policy_version 12480 (0.0005) |
| [2023-07-17 06:31:47,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9261.1). Total num frames: 6389760. Throughput: 0: 9393.6. Samples: 6386496. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:31:47,652][348097] Avg episode reward: [(0, '580.114')] |
| [2023-07-17 06:31:51,911][348382] Updated weights for policy 0, policy_version 12560 (0.0005) |
| [2023-07-17 06:31:52,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9261.1). Total num frames: 6434816. Throughput: 0: 9376.6. Samples: 6414400. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:31:52,652][348097] Avg episode reward: [(0, '584.389')] |
| [2023-07-17 06:31:56,381][348382] Updated weights for policy 0, policy_version 12640 (0.0005) |
| [2023-07-17 06:31:57,652][348097] Fps is (10 sec: 9011.1, 60 sec: 9352.5, 300 sec: 9247.2). Total num frames: 6479872. Throughput: 0: 9342.1. Samples: 6469096. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:31:57,652][348097] Avg episode reward: [(0, '585.822')] |
| [2023-07-17 06:31:57,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000012656_6479872.pth... |
| [2023-07-17 06:31:57,657][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000012112_6201344.pth |
| [2023-07-17 06:32:00,713][348382] Updated weights for policy 0, policy_version 12720 (0.0005) |
| [2023-07-17 06:32:02,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9261.1). Total num frames: 6529024. Throughput: 0: 9348.2. Samples: 6525808. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:32:02,653][348097] Avg episode reward: [(0, '582.333')] |
| [2023-07-17 06:32:05,120][348382] Updated weights for policy 0, policy_version 12800 (0.0005) |
| [2023-07-17 06:32:07,652][348097] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9261.1). Total num frames: 6574080. Throughput: 0: 9374.7. Samples: 6553664. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:32:07,652][348097] Avg episode reward: [(0, '591.975')] |
| [2023-07-17 06:32:09,751][348382] Updated weights for policy 0, policy_version 12880 (0.0005) |
| [2023-07-17 06:32:12,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9261.1). Total num frames: 6619136. Throughput: 0: 9291.5. Samples: 6606912. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:32:12,652][348097] Avg episode reward: [(0, '588.983')] |
| [2023-07-17 06:32:12,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000012928_6619136.pth... |
| [2023-07-17 06:32:12,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000012384_6340608.pth |
| [2023-07-17 06:32:14,351][348382] Updated weights for policy 0, policy_version 12960 (0.0005) |
| [2023-07-17 06:32:17,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9261.1). Total num frames: 6664192. Throughput: 0: 9236.1. Samples: 6660168. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 06:32:17,652][348097] Avg episode reward: [(0, '589.351')] |
| [2023-07-17 06:32:18,958][348382] Updated weights for policy 0, policy_version 13040 (0.0005) |
| [2023-07-17 06:32:22,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9261.1). Total num frames: 6709248. Throughput: 0: 9196.3. Samples: 6686772. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:32:22,652][348097] Avg episode reward: [(0, '584.051')] |
| [2023-07-17 06:32:23,532][348382] Updated weights for policy 0, policy_version 13120 (0.0005) |
| [2023-07-17 06:32:27,652][348097] Fps is (10 sec: 8601.6, 60 sec: 9147.7, 300 sec: 9247.2). Total num frames: 6750208. Throughput: 0: 9145.2. Samples: 6740596. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:32:27,652][348097] Avg episode reward: [(0, '585.654')] |
| [2023-07-17 06:32:27,663][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000013192_6754304.pth... |
| [2023-07-17 06:32:27,665][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000012656_6479872.pth |
| [2023-07-17 06:32:28,111][348382] Updated weights for policy 0, policy_version 13200 (0.0006) |
| [2023-07-17 06:32:32,652][348097] Fps is (10 sec: 8601.6, 60 sec: 9147.7, 300 sec: 9261.1). Total num frames: 6795264. Throughput: 0: 9064.9. Samples: 6794416. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:32:32,653][348097] Avg episode reward: [(0, '587.857')] |
| [2023-07-17 06:32:32,695][348382] Updated weights for policy 0, policy_version 13280 (0.0005) |
| [2023-07-17 06:32:37,324][348382] Updated weights for policy 0, policy_version 13360 (0.0005) |
| [2023-07-17 06:32:37,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9247.2). Total num frames: 6840320. Throughput: 0: 9031.5. Samples: 6820816. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 06:32:37,653][348097] Avg episode reward: [(0, '586.522')] |
| [2023-07-17 06:32:41,905][348382] Updated weights for policy 0, policy_version 13440 (0.0005) |
| [2023-07-17 06:32:42,652][348097] Fps is (10 sec: 9011.1, 60 sec: 9079.5, 300 sec: 9247.2). Total num frames: 6885376. Throughput: 0: 9000.9. Samples: 6874136. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 06:32:42,653][348097] Avg episode reward: [(0, '586.126')] |
| [2023-07-17 06:32:42,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000013448_6885376.pth... |
| [2023-07-17 06:32:42,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000012928_6619136.pth |
| [2023-07-17 06:32:46,508][348382] Updated weights for policy 0, policy_version 13520 (0.0005) |
| [2023-07-17 06:32:47,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9247.2). Total num frames: 6930432. Throughput: 0: 8931.4. Samples: 6927720. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 06:32:47,653][348097] Avg episode reward: [(0, '583.547')] |
| [2023-07-17 06:32:51,021][348382] Updated weights for policy 0, policy_version 13600 (0.0005) |
| [2023-07-17 06:32:52,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9233.4). Total num frames: 6975488. Throughput: 0: 8920.2. Samples: 6955072. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 06:32:52,653][348097] Avg episode reward: [(0, '588.890')] |
| [2023-07-17 06:32:55,678][348382] Updated weights for policy 0, policy_version 13680 (0.0005) |
| [2023-07-17 06:32:57,652][348097] Fps is (10 sec: 9011.1, 60 sec: 9011.2, 300 sec: 9233.4). Total num frames: 7020544. Throughput: 0: 8919.0. Samples: 7008268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:32:57,653][348097] Avg episode reward: [(0, '583.218')] |
| [2023-07-17 06:32:57,657][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000013712_7020544.pth... |
| [2023-07-17 06:32:57,660][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000013192_6754304.pth |
| [2023-07-17 06:33:00,284][348382] Updated weights for policy 0, policy_version 13760 (0.0005) |
| [2023-07-17 06:33:02,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9233.4). Total num frames: 7065600. Throughput: 0: 8920.0. Samples: 7061568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:33:02,653][348097] Avg episode reward: [(0, '588.680')] |
| [2023-07-17 06:33:04,853][348382] Updated weights for policy 0, policy_version 13840 (0.0005) |
| [2023-07-17 06:33:07,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9233.4). Total num frames: 7110656. Throughput: 0: 8932.3. Samples: 7088724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:33:07,653][348097] Avg episode reward: [(0, '589.275')] |
| [2023-07-17 06:33:09,497][348382] Updated weights for policy 0, policy_version 13920 (0.0005) |
| [2023-07-17 06:33:12,652][348097] Fps is (10 sec: 8601.6, 60 sec: 8874.7, 300 sec: 9205.6). Total num frames: 7151616. Throughput: 0: 8916.7. Samples: 7141848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:33:12,653][348097] Avg episode reward: [(0, '582.041')] |
| [2023-07-17 06:33:12,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000013968_7151616.pth... |
| [2023-07-17 06:33:12,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000013448_6885376.pth |
| [2023-07-17 06:33:14,091][348382] Updated weights for policy 0, policy_version 14000 (0.0005) |
| [2023-07-17 06:33:17,652][348097] Fps is (10 sec: 8601.7, 60 sec: 8874.7, 300 sec: 9191.7). Total num frames: 7196672. Throughput: 0: 8916.4. Samples: 7195652. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 06:33:17,652][348097] Avg episode reward: [(0, '591.519')] |
| [2023-07-17 06:33:18,705][348382] Updated weights for policy 0, policy_version 14080 (0.0005) |
| [2023-07-17 06:33:22,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8874.7, 300 sec: 9191.7). Total num frames: 7241728. Throughput: 0: 8901.0. Samples: 7221360. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 06:33:22,652][348097] Avg episode reward: [(0, '587.661')] |
| [2023-07-17 06:33:23,344][348382] Updated weights for policy 0, policy_version 14160 (0.0005) |
| [2023-07-17 06:33:27,652][348097] Fps is (10 sec: 9011.1, 60 sec: 8942.9, 300 sec: 9177.8). Total num frames: 7286784. Throughput: 0: 8901.6. Samples: 7274708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:33:27,653][348097] Avg episode reward: [(0, '588.284')] |
| [2023-07-17 06:33:27,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000014232_7286784.pth... |
| [2023-07-17 06:33:27,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000013712_7020544.pth |
| [2023-07-17 06:33:27,948][348382] Updated weights for policy 0, policy_version 14240 (0.0005) |
| [2023-07-17 06:33:32,565][348382] Updated weights for policy 0, policy_version 14320 (0.0005) |
| [2023-07-17 06:33:32,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9177.8). Total num frames: 7331840. Throughput: 0: 8891.5. Samples: 7327840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:33:32,652][348097] Avg episode reward: [(0, '586.021')] |
| [2023-07-17 06:33:37,156][348382] Updated weights for policy 0, policy_version 14400 (0.0005) |
| [2023-07-17 06:33:37,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9163.9). Total num frames: 7376896. Throughput: 0: 8877.7. Samples: 7354568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:33:37,652][348097] Avg episode reward: [(0, '586.904')] |
| [2023-07-17 06:33:41,667][348382] Updated weights for policy 0, policy_version 14480 (0.0005) |
| [2023-07-17 06:33:42,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9163.9). Total num frames: 7421952. Throughput: 0: 8902.2. Samples: 7408868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:33:42,652][348097] Avg episode reward: [(0, '590.077')] |
| [2023-07-17 06:33:42,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000014496_7421952.pth... |
| [2023-07-17 06:33:42,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000013968_7151616.pth |
| [2023-07-17 06:33:46,330][348382] Updated weights for policy 0, policy_version 14560 (0.0005) |
| [2023-07-17 06:33:47,652][348097] Fps is (10 sec: 8601.6, 60 sec: 8874.7, 300 sec: 9136.2). Total num frames: 7462912. Throughput: 0: 8900.4. Samples: 7462088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:33:47,652][348097] Avg episode reward: [(0, '590.408')] |
| [2023-07-17 06:33:50,894][348382] Updated weights for policy 0, policy_version 14640 (0.0005) |
| [2023-07-17 06:33:52,652][348097] Fps is (10 sec: 8601.6, 60 sec: 8874.7, 300 sec: 9136.2). Total num frames: 7507968. Throughput: 0: 8890.1. Samples: 7488780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:33:52,652][348097] Avg episode reward: [(0, '578.454')] |
| [2023-07-17 06:33:55,436][348382] Updated weights for policy 0, policy_version 14720 (0.0005) |
| [2023-07-17 06:33:57,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8874.7, 300 sec: 9136.2). Total num frames: 7553024. Throughput: 0: 8913.8. Samples: 7542968. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:33:57,653][348097] Avg episode reward: [(0, '588.361')] |
| [2023-07-17 06:33:57,707][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000014760_7557120.pth... |
| [2023-07-17 06:33:57,710][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000014232_7286784.pth |
| [2023-07-17 06:33:59,963][348382] Updated weights for policy 0, policy_version 14800 (0.0005) |
| [2023-07-17 06:34:02,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8874.7, 300 sec: 9150.0). Total num frames: 7598080. Throughput: 0: 8919.4. Samples: 7597024. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:34:02,652][348097] Avg episode reward: [(0, '584.590')] |
| [2023-07-17 06:34:04,580][348382] Updated weights for policy 0, policy_version 14880 (0.0005) |
| [2023-07-17 06:34:07,652][348097] Fps is (10 sec: 9011.3, 60 sec: 8874.7, 300 sec: 9150.1). Total num frames: 7643136. Throughput: 0: 8925.7. Samples: 7623016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:34:07,652][348097] Avg episode reward: [(0, '584.423')] |
| [2023-07-17 06:34:09,211][348382] Updated weights for policy 0, policy_version 14960 (0.0006) |
| [2023-07-17 06:34:12,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9136.2). Total num frames: 7688192. Throughput: 0: 8932.4. Samples: 7676664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:34:12,652][348097] Avg episode reward: [(0, '587.926')] |
| [2023-07-17 06:34:12,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000015016_7688192.pth... |
| [2023-07-17 06:34:12,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000014496_7421952.pth |
| [2023-07-17 06:34:13,720][348382] Updated weights for policy 0, policy_version 15040 (0.0005) |
| [2023-07-17 06:34:17,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9136.2). Total num frames: 7733248. Throughput: 0: 8964.1. Samples: 7731224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:34:17,652][348097] Avg episode reward: [(0, '581.699')] |
| [2023-07-17 06:34:18,307][348382] Updated weights for policy 0, policy_version 15120 (0.0005) |
| [2023-07-17 06:34:22,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9122.3). Total num frames: 7778304. Throughput: 0: 8961.5. Samples: 7757836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:34:22,652][348097] Avg episode reward: [(0, '588.598')] |
| [2023-07-17 06:34:22,933][348382] Updated weights for policy 0, policy_version 15200 (0.0005) |
| [2023-07-17 06:34:27,599][348382] Updated weights for policy 0, policy_version 15280 (0.0005) |
| [2023-07-17 06:34:27,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9122.3). Total num frames: 7823360. Throughput: 0: 8932.9. Samples: 7810848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:34:27,652][348097] Avg episode reward: [(0, '584.936')] |
| [2023-07-17 06:34:27,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000015280_7823360.pth... |
| [2023-07-17 06:34:27,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000014760_7557120.pth |
| [2023-07-17 06:34:32,233][348382] Updated weights for policy 0, policy_version 15360 (0.0005) |
| [2023-07-17 06:34:32,652][348097] Fps is (10 sec: 8601.7, 60 sec: 8874.7, 300 sec: 9108.4). Total num frames: 7864320. Throughput: 0: 8925.0. Samples: 7863712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:34:32,652][348097] Avg episode reward: [(0, '583.055')] |
| [2023-07-17 06:34:36,792][348382] Updated weights for policy 0, policy_version 15440 (0.0005) |
| [2023-07-17 06:34:37,652][348097] Fps is (10 sec: 8601.6, 60 sec: 8874.7, 300 sec: 9094.5). Total num frames: 7909376. Throughput: 0: 8930.8. Samples: 7890664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:34:37,652][348097] Avg episode reward: [(0, '582.839')] |
| [2023-07-17 06:34:41,465][348382] Updated weights for policy 0, policy_version 15520 (0.0005) |
| [2023-07-17 06:34:42,652][348097] Fps is (10 sec: 9011.1, 60 sec: 8874.7, 300 sec: 9094.5). Total num frames: 7954432. Throughput: 0: 8894.6. Samples: 7943224. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:34:42,652][348097] Avg episode reward: [(0, '587.400')] |
| [2023-07-17 06:34:42,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000015536_7954432.pth... |
| [2023-07-17 06:34:42,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000015016_7688192.pth |
| [2023-07-17 06:34:46,029][348382] Updated weights for policy 0, policy_version 15600 (0.0005) |
| [2023-07-17 06:34:47,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9094.5). Total num frames: 7999488. Throughput: 0: 8892.7. Samples: 7997196. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 06:34:47,652][348097] Avg episode reward: [(0, '589.102')] |
| [2023-07-17 06:34:50,432][348382] Updated weights for policy 0, policy_version 15680 (0.0005) |
| [2023-07-17 06:34:52,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9011.2, 300 sec: 9094.5). Total num frames: 8048640. Throughput: 0: 8942.7. Samples: 8025436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:34:52,652][348097] Avg episode reward: [(0, '590.905')] |
| [2023-07-17 06:34:54,884][348382] Updated weights for policy 0, policy_version 15760 (0.0005) |
| [2023-07-17 06:34:57,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9011.2, 300 sec: 9094.5). Total num frames: 8093696. Throughput: 0: 8969.9. Samples: 8080308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:34:57,652][348097] Avg episode reward: [(0, '591.500')] |
| [2023-07-17 06:34:57,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000015808_8093696.pth... |
| [2023-07-17 06:34:57,657][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000015280_7823360.pth |
| [2023-07-17 06:34:59,511][348382] Updated weights for policy 0, policy_version 15840 (0.0005) |
| [2023-07-17 06:35:02,652][348097] Fps is (10 sec: 8601.6, 60 sec: 8942.9, 300 sec: 9066.7). Total num frames: 8134656. Throughput: 0: 8951.6. Samples: 8134048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:35:02,652][348097] Avg episode reward: [(0, '588.754')] |
| [2023-07-17 06:35:04,123][348382] Updated weights for policy 0, policy_version 15920 (0.0005) |
| [2023-07-17 06:35:07,652][348097] Fps is (10 sec: 8601.6, 60 sec: 8942.9, 300 sec: 9052.9). Total num frames: 8179712. Throughput: 0: 8940.0. Samples: 8160136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:35:07,652][348097] Avg episode reward: [(0, '592.695')] |
| [2023-07-17 06:35:08,682][348382] Updated weights for policy 0, policy_version 16000 (0.0005) |
| [2023-07-17 06:35:12,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9052.9). Total num frames: 8224768. Throughput: 0: 8957.5. Samples: 8213936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:35:12,652][348097] Avg episode reward: [(0, '588.479')] |
| [2023-07-17 06:35:12,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000016064_8224768.pth... |
| [2023-07-17 06:35:12,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000015536_7954432.pth |
| [2023-07-17 06:35:13,241][348382] Updated weights for policy 0, policy_version 16080 (0.0005) |
| [2023-07-17 06:35:17,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9039.0). Total num frames: 8269824. Throughput: 0: 8978.1. Samples: 8267728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:35:17,652][348097] Avg episode reward: [(0, '591.384')] |
| [2023-07-17 06:35:17,841][348382] Updated weights for policy 0, policy_version 16160 (0.0005) |
| [2023-07-17 06:35:22,385][348382] Updated weights for policy 0, policy_version 16240 (0.0005) |
| [2023-07-17 06:35:22,652][348097] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9039.0). Total num frames: 8314880. Throughput: 0: 8973.5. Samples: 8294472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:35:22,652][348097] Avg episode reward: [(0, '585.778')] |
| [2023-07-17 06:35:26,741][348382] Updated weights for policy 0, policy_version 16320 (0.0004) |
| [2023-07-17 06:35:27,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9011.2, 300 sec: 9052.9). Total num frames: 8364032. Throughput: 0: 9035.4. Samples: 8349816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:35:27,653][348097] Avg episode reward: [(0, '593.242')] |
| [2023-07-17 06:35:27,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000016336_8364032.pth... |
| [2023-07-17 06:35:27,660][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000015808_8093696.pth |
| [2023-07-17 06:35:31,201][348382] Updated weights for policy 0, policy_version 16400 (0.0005) |
| [2023-07-17 06:35:32,652][348097] Fps is (10 sec: 9420.9, 60 sec: 9079.5, 300 sec: 9052.9). Total num frames: 8409088. Throughput: 0: 9068.7. Samples: 8405288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:35:32,652][348097] Avg episode reward: [(0, '583.044')] |
| [2023-07-17 06:35:35,725][348382] Updated weights for policy 0, policy_version 16480 (0.0005) |
| [2023-07-17 06:35:37,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9052.9). Total num frames: 8454144. Throughput: 0: 9051.1. Samples: 8432736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:35:37,652][348097] Avg episode reward: [(0, '583.682')] |
| [2023-07-17 06:35:40,198][348382] Updated weights for policy 0, policy_version 16560 (0.0005) |
| [2023-07-17 06:35:42,652][348097] Fps is (10 sec: 9011.1, 60 sec: 9079.5, 300 sec: 9052.9). Total num frames: 8499200. Throughput: 0: 9037.0. Samples: 8486976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:35:42,653][348097] Avg episode reward: [(0, '587.883')] |
| [2023-07-17 06:35:42,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000016600_8499200.pth... |
| [2023-07-17 06:35:42,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000016064_8224768.pth |
| [2023-07-17 06:35:44,731][348382] Updated weights for policy 0, policy_version 16640 (0.0005) |
| [2023-07-17 06:35:47,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9052.9). Total num frames: 8544256. Throughput: 0: 9033.8. Samples: 8540568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:35:47,652][348097] Avg episode reward: [(0, '591.602')] |
| [2023-07-17 06:35:49,199][348382] Updated weights for policy 0, policy_version 16720 (0.0005) |
| [2023-07-17 06:35:52,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9052.9). Total num frames: 8589312. Throughput: 0: 9083.5. Samples: 8568896. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 06:35:52,653][348097] Avg episode reward: [(0, '592.528')] |
| [2023-07-17 06:35:53,753][348382] Updated weights for policy 0, policy_version 16800 (0.0006) |
| [2023-07-17 06:35:57,652][348097] Fps is (10 sec: 9011.1, 60 sec: 9011.2, 300 sec: 9039.0). Total num frames: 8634368. Throughput: 0: 9090.9. Samples: 8623028. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 06:35:57,653][348097] Avg episode reward: [(0, '591.825')] |
| [2023-07-17 06:35:57,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000016864_8634368.pth... |
| [2023-07-17 06:35:57,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000016336_8364032.pth |
| [2023-07-17 06:35:58,317][348382] Updated weights for policy 0, policy_version 16880 (0.0006) |
| [2023-07-17 06:36:02,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9039.0). Total num frames: 8679424. Throughput: 0: 9069.2. Samples: 8675844. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:36:02,652][348097] Avg episode reward: [(0, '591.625')] |
| [2023-07-17 06:36:02,993][348382] Updated weights for policy 0, policy_version 16960 (0.0006) |
| [2023-07-17 06:36:07,540][348382] Updated weights for policy 0, policy_version 17040 (0.0006) |
| [2023-07-17 06:36:07,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9025.1). Total num frames: 8724480. Throughput: 0: 9073.9. Samples: 8702796. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:36:07,652][348097] Avg episode reward: [(0, '588.707')] |
| [2023-07-17 06:36:12,051][348382] Updated weights for policy 0, policy_version 17120 (0.0005) |
| [2023-07-17 06:36:12,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9025.1). Total num frames: 8769536. Throughput: 0: 9055.5. Samples: 8757312. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 06:36:12,653][348097] Avg episode reward: [(0, '585.411')] |
| [2023-07-17 06:36:12,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000017128_8769536.pth... |
| [2023-07-17 06:36:12,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000016600_8499200.pth |
| [2023-07-17 06:36:16,657][348382] Updated weights for policy 0, policy_version 17200 (0.0006) |
| [2023-07-17 06:36:17,652][348097] Fps is (10 sec: 9011.3, 60 sec: 9079.5, 300 sec: 9025.1). Total num frames: 8814592. Throughput: 0: 9006.2. Samples: 8810568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:36:17,653][348097] Avg episode reward: [(0, '583.489')] |
| [2023-07-17 06:36:21,133][348382] Updated weights for policy 0, policy_version 17280 (0.0005) |
| [2023-07-17 06:36:22,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9011.2). Total num frames: 8859648. Throughput: 0: 9024.3. Samples: 8838828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:36:22,653][348097] Avg episode reward: [(0, '587.736')] |
| [2023-07-17 06:36:25,641][348382] Updated weights for policy 0, policy_version 17360 (0.0005) |
| [2023-07-17 06:36:27,652][348097] Fps is (10 sec: 9011.1, 60 sec: 9011.2, 300 sec: 9011.2). Total num frames: 8904704. Throughput: 0: 9011.4. Samples: 8892488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:36:27,653][348097] Avg episode reward: [(0, '584.122')] |
| [2023-07-17 06:36:27,657][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000017392_8904704.pth... |
| [2023-07-17 06:36:27,660][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000016864_8634368.pth |
| [2023-07-17 06:36:30,192][348382] Updated weights for policy 0, policy_version 17440 (0.0005) |
| [2023-07-17 06:36:32,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 8997.3). Total num frames: 8949760. Throughput: 0: 9020.7. Samples: 8946500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:36:32,653][348097] Avg episode reward: [(0, '584.921')] |
| [2023-07-17 06:36:34,751][348382] Updated weights for policy 0, policy_version 17520 (0.0005) |
| [2023-07-17 06:36:37,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 8997.3). Total num frames: 8994816. Throughput: 0: 8999.7. Samples: 8973884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:36:37,653][348097] Avg episode reward: [(0, '590.262')] |
| [2023-07-17 06:36:39,328][348382] Updated weights for policy 0, policy_version 17600 (0.0005) |
| [2023-07-17 06:36:42,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 8983.4). Total num frames: 9039872. Throughput: 0: 8985.5. Samples: 9027376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:36:42,653][348097] Avg episode reward: [(0, '586.763')] |
| [2023-07-17 06:36:42,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000017656_9039872.pth... |
| [2023-07-17 06:36:42,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000017128_8769536.pth |
| [2023-07-17 06:36:43,923][348382] Updated weights for policy 0, policy_version 17680 (0.0005) |
| [2023-07-17 06:36:47,652][348097] Fps is (10 sec: 8601.6, 60 sec: 8942.9, 300 sec: 8969.5). Total num frames: 9080832. Throughput: 0: 8997.2. Samples: 9080720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:36:47,653][348097] Avg episode reward: [(0, '592.989')] |
| [2023-07-17 06:36:48,561][348382] Updated weights for policy 0, policy_version 17760 (0.0005) |
| [2023-07-17 06:36:52,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 8983.4). Total num frames: 9129984. Throughput: 0: 9010.6. Samples: 9108272. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 06:36:52,653][348097] Avg episode reward: [(0, '587.088')] |
| [2023-07-17 06:36:52,874][348382] Updated weights for policy 0, policy_version 17840 (0.0005) |
| [2023-07-17 06:36:57,267][348382] Updated weights for policy 0, policy_version 17920 (0.0005) |
| [2023-07-17 06:36:57,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9011.2, 300 sec: 8969.5). Total num frames: 9175040. Throughput: 0: 9048.7. Samples: 9164504. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 06:36:57,652][348097] Avg episode reward: [(0, '588.585')] |
| [2023-07-17 06:36:57,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000017920_9175040.pth... |
| [2023-07-17 06:36:57,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000017392_8904704.pth |
| [2023-07-17 06:37:01,777][348382] Updated weights for policy 0, policy_version 18000 (0.0005) |
| [2023-07-17 06:37:02,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 8969.5). Total num frames: 9220096. Throughput: 0: 9084.5. Samples: 9219372. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 06:37:02,652][348097] Avg episode reward: [(0, '591.456')] |
| [2023-07-17 06:37:06,144][348382] Updated weights for policy 0, policy_version 18080 (0.0005) |
| [2023-07-17 06:37:07,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9079.5, 300 sec: 8983.4). Total num frames: 9269248. Throughput: 0: 9076.2. Samples: 9247256. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:37:07,652][348097] Avg episode reward: [(0, '588.720')] |
| [2023-07-17 06:37:10,471][348382] Updated weights for policy 0, policy_version 18160 (0.0004) |
| [2023-07-17 06:37:12,652][348097] Fps is (10 sec: 9830.4, 60 sec: 9147.7, 300 sec: 8997.3). Total num frames: 9318400. Throughput: 0: 9143.8. Samples: 9303960. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 06:37:12,652][348097] Avg episode reward: [(0, '593.871')] |
| [2023-07-17 06:37:12,655][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000018200_9318400.pth... |
| [2023-07-17 06:37:12,657][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000017656_9039872.pth |
| [2023-07-17 06:37:14,787][348382] Updated weights for policy 0, policy_version 18240 (0.0005) |
| [2023-07-17 06:37:17,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 8997.3). Total num frames: 9363456. Throughput: 0: 9198.9. Samples: 9360452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:37:17,653][348097] Avg episode reward: [(0, '590.601')] |
| [2023-07-17 06:37:19,169][348382] Updated weights for policy 0, policy_version 18320 (0.0005) |
| [2023-07-17 06:37:22,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9011.2). Total num frames: 9408512. Throughput: 0: 9216.7. Samples: 9388636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:37:22,652][348097] Avg episode reward: [(0, '594.121')] |
| [2023-07-17 06:37:23,520][348382] Updated weights for policy 0, policy_version 18400 (0.0005) |
| [2023-07-17 06:37:27,652][348097] Fps is (10 sec: 9420.7, 60 sec: 9216.0, 300 sec: 9025.1). Total num frames: 9457664. Throughput: 0: 9286.7. Samples: 9445280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:37:27,653][348097] Avg episode reward: [(0, '591.723')] |
| [2023-07-17 06:37:27,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000018472_9457664.pth... |
| [2023-07-17 06:37:27,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000017920_9175040.pth |
| [2023-07-17 06:37:27,962][348382] Updated weights for policy 0, policy_version 18480 (0.0005) |
| [2023-07-17 06:37:32,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9011.2). Total num frames: 9498624. Throughput: 0: 9277.1. Samples: 9498192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:37:32,653][348097] Avg episode reward: [(0, '592.456')] |
| [2023-07-17 06:37:32,703][348382] Updated weights for policy 0, policy_version 18560 (0.0005) |
| [2023-07-17 06:37:37,305][348382] Updated weights for policy 0, policy_version 18640 (0.0005) |
| [2023-07-17 06:37:37,652][348097] Fps is (10 sec: 8601.6, 60 sec: 9147.7, 300 sec: 9011.2). Total num frames: 9543680. Throughput: 0: 9246.8. Samples: 9524376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:37:37,653][348097] Avg episode reward: [(0, '588.345')] |
| [2023-07-17 06:37:41,802][348382] Updated weights for policy 0, policy_version 18720 (0.0005) |
| [2023-07-17 06:37:42,652][348097] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9011.2). Total num frames: 9588736. Throughput: 0: 9186.0. Samples: 9577876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:37:42,653][348097] Avg episode reward: [(0, '584.864')] |
| [2023-07-17 06:37:42,679][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000018736_9592832.pth... |
| [2023-07-17 06:37:42,681][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000018200_9318400.pth |
| [2023-07-17 06:37:46,137][348382] Updated weights for policy 0, policy_version 18800 (0.0004) |
| [2023-07-17 06:37:47,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9025.1). Total num frames: 9637888. Throughput: 0: 9227.5. Samples: 9634608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:37:47,652][348097] Avg episode reward: [(0, '596.847')] |
| [2023-07-17 06:37:47,653][348338] Saving new best policy, reward=596.847! |
| [2023-07-17 06:37:50,511][348382] Updated weights for policy 0, policy_version 18880 (0.0005) |
| [2023-07-17 06:37:52,652][348097] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9025.1). Total num frames: 9682944. Throughput: 0: 9229.2. Samples: 9662572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:37:52,652][348097] Avg episode reward: [(0, '589.330')] |
| [2023-07-17 06:37:54,924][348382] Updated weights for policy 0, policy_version 18960 (0.0005) |
| [2023-07-17 06:37:57,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9039.0). Total num frames: 9732096. Throughput: 0: 9229.2. Samples: 9719276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:37:57,652][348097] Avg episode reward: [(0, '589.509')] |
| [2023-07-17 06:37:57,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000019008_9732096.pth... |
| [2023-07-17 06:37:57,658][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000018472_9457664.pth |
| [2023-07-17 06:37:59,208][348382] Updated weights for policy 0, policy_version 19040 (0.0005) |
| [2023-07-17 06:38:02,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9039.0). Total num frames: 9777152. Throughput: 0: 9233.3. Samples: 9775952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:38:02,653][348097] Avg episode reward: [(0, '591.489')] |
| [2023-07-17 06:38:03,572][348382] Updated weights for policy 0, policy_version 19120 (0.0005) |
| [2023-07-17 06:38:07,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9066.7). Total num frames: 9826304. Throughput: 0: 9225.8. Samples: 9803796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 06:38:07,653][348097] Avg episode reward: [(0, '589.454')] |
| [2023-07-17 06:38:07,884][348382] Updated weights for policy 0, policy_version 19200 (0.0005) |
| [2023-07-17 06:38:12,168][348382] Updated weights for policy 0, policy_version 19280 (0.0004) |
| [2023-07-17 06:38:12,652][348097] Fps is (10 sec: 9830.3, 60 sec: 9284.3, 300 sec: 9080.6). Total num frames: 9875456. Throughput: 0: 9244.5. Samples: 9861284. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 06:38:12,653][348097] Avg episode reward: [(0, '593.510')] |
| [2023-07-17 06:38:12,656][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000019288_9875456.pth... |
| [2023-07-17 06:38:12,659][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000018736_9592832.pth |
| [2023-07-17 06:38:16,432][348382] Updated weights for policy 0, policy_version 19360 (0.0004) |
| [2023-07-17 06:38:17,652][348097] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9080.6). Total num frames: 9920512. Throughput: 0: 9346.4. Samples: 9918780. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 06:38:17,653][348097] Avg episode reward: [(0, '587.185')] |
| [2023-07-17 06:38:20,704][348382] Updated weights for policy 0, policy_version 19440 (0.0004) |
| [2023-07-17 06:38:22,652][348097] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9094.5). Total num frames: 9969664. Throughput: 0: 9412.2. Samples: 9947924. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 06:38:22,656][348097] Avg episode reward: [(0, '586.513')] |
| [2023-07-17 06:38:25,030][348382] Updated weights for policy 0, policy_version 19520 (0.0004) |
| [2023-07-17 06:38:25,863][348338] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000002 |
| [2023-07-17 06:38:26,316][348338] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 |
| [2023-07-17 06:38:26,317][348493] Stopping RolloutWorker_w6... |
| [2023-07-17 06:38:26,317][348386] Stopping RolloutWorker_w3... |
| [2023-07-17 06:38:26,317][348384] Stopping RolloutWorker_w1... |
| [2023-07-17 06:38:26,317][348387] Stopping RolloutWorker_w4... |
| [2023-07-17 06:38:26,317][348393] Stopping RolloutWorker_w5... |
| [2023-07-17 06:38:26,317][348385] Stopping RolloutWorker_w2... |
| [2023-07-17 06:38:26,317][348383] Stopping RolloutWorker_w0... |
| [2023-07-17 06:38:26,317][348493] Loop rollout_proc6_evt_loop terminating... |
| [2023-07-17 06:38:26,318][348386] Loop rollout_proc3_evt_loop terminating... |
| [2023-07-17 06:38:26,318][348384] Loop rollout_proc1_evt_loop terminating... |
| [2023-07-17 06:38:26,317][348451] Stopping RolloutWorker_w7... |
| [2023-07-17 06:38:26,318][348393] Loop rollout_proc5_evt_loop terminating... |
| [2023-07-17 06:38:26,318][348385] Loop rollout_proc2_evt_loop terminating... |
| [2023-07-17 06:38:26,318][348383] Loop rollout_proc0_evt_loop terminating... |
| [2023-07-17 06:38:26,318][348387] Loop rollout_proc4_evt_loop terminating... |
| [2023-07-17 06:38:26,318][348451] Loop rollout_proc7_evt_loop terminating... |
| [2023-07-17 06:38:26,317][348097] Component RolloutWorker_w6 stopped! |
| [2023-07-17 06:38:26,318][348338] Stopping Batcher_0... |
| [2023-07-17 06:38:26,318][348338] Loop batcher_evt_loop terminating... |
| [2023-07-17 06:38:26,318][348097] Component RolloutWorker_w3 stopped! |
| [2023-07-17 06:38:26,318][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000019544_10006528.pth... |
| [2023-07-17 06:38:26,318][348097] Component RolloutWorker_w4 stopped! |
| [2023-07-17 06:38:26,319][348097] Component RolloutWorker_w1 stopped! |
| [2023-07-17 06:38:26,319][348097] Component RolloutWorker_w2 stopped! |
| [2023-07-17 06:38:26,319][348097] Component RolloutWorker_w5 stopped! |
| [2023-07-17 06:38:26,320][348097] Component RolloutWorker_w0 stopped! |
| [2023-07-17 06:38:26,320][348097] Component RolloutWorker_w7 stopped! |
| [2023-07-17 06:38:26,320][348097] Component Batcher_0 stopped! |
| [2023-07-17 06:38:26,321][348338] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000019008_9732096.pth |
| [2023-07-17 06:38:26,321][348338] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/window-close-v2/checkpoint_p0/checkpoint_000019544_10006528.pth... |
| [2023-07-17 06:38:26,324][348338] Stopping LearnerWorker_p0... |
| [2023-07-17 06:38:26,324][348338] Loop learner_proc0_evt_loop terminating... |
| [2023-07-17 06:38:26,324][348097] Component LearnerWorker_p0 stopped! |
| [2023-07-17 06:38:26,377][348382] Weights refcount: 2 0 |
| [2023-07-17 06:38:26,378][348382] Stopping InferenceWorker_p0-w0... |
| [2023-07-17 06:38:26,379][348382] Loop inference_proc0-0_evt_loop terminating... |
| [2023-07-17 06:38:26,379][348097] Component InferenceWorker_p0-w0 stopped! |
| [2023-07-17 06:38:26,379][348097] Waiting for process learner_proc0 to stop... |
| [2023-07-17 06:38:26,910][348097] Waiting for process inference_proc0-0 to join... |
| [2023-07-17 06:38:26,922][348097] Waiting for process rollout_proc0 to join... |
| [2023-07-17 06:38:26,922][348097] Waiting for process rollout_proc1 to join... |
| [2023-07-17 06:38:26,922][348097] Waiting for process rollout_proc2 to join... |
| [2023-07-17 06:38:26,922][348097] Waiting for process rollout_proc3 to join... |
| [2023-07-17 06:38:26,923][348097] Waiting for process rollout_proc4 to join... |
| [2023-07-17 06:38:26,923][348097] Waiting for process rollout_proc5 to join... |
| [2023-07-17 06:38:26,923][348097] Waiting for process rollout_proc6 to join... |
| [2023-07-17 06:38:26,923][348097] Waiting for process rollout_proc7 to join... |
| [2023-07-17 06:38:26,923][348097] Batcher 0 profile tree view: |
| batching: 1.8758, releasing_batches: 1.6745 |
| [2023-07-17 06:38:26,923][348097] InferenceWorker_p0-w0 profile tree view: |
| wait_policy: 0.0051 |
| wait_policy_total: 443.5758 |
| update_model: 13.3257 |
| weight_update: 0.0004 |
| one_step: 0.0006 |
| handle_policy_step: 564.1415 |
| deserialize: 23.3498, stack: 5.9729, obs_to_device_normalize: 103.3066, forward: 279.5177, send_messages: 39.0305 |
| prepare_outputs: 64.1039 |
| to_cpu: 9.9784 |
| [2023-07-17 06:38:26,924][348097] Learner 0 profile tree view: |
| misc: 0.0099, prepare_batch: 9.3418 |
| train: 97.2531 |
| epoch_init: 0.0374, minibatch_init: 1.3262, losses_postprocess: 1.3001, kl_divergence: 0.4494, after_optimizer: 0.6425 |
| calculate_losses: 41.5518 |
| losses_init: 0.0324, forward_head: 16.2875, bptt_initial: 0.1409, bptt: 0.1269, tail: 11.6812, advantages_returns: 0.8933, losses: 10.9457 |
| update: 50.3328 |
| clip: 5.9925 |
| [2023-07-17 06:38:26,924][348097] RolloutWorker_w0 profile tree view: |
| wait_for_trajectories: 0.2898, enqueue_policy_requests: 12.8086, env_step: 825.4394, overhead: 19.9522, complete_rollouts: 0.3289 |
| save_policy_outputs: 38.2603 |
| split_output_tensors: 13.2536 |
| [2023-07-17 06:38:26,924][348097] RolloutWorker_w7 profile tree view: |
| wait_for_trajectories: 0.2724, enqueue_policy_requests: 12.5506, env_step: 825.0107, overhead: 20.3152, complete_rollouts: 0.3184 |
| save_policy_outputs: 38.2072 |
| split_output_tensors: 13.0543 |
| [2023-07-17 06:38:26,924][348097] Loop Runner_EvtLoop terminating... |
| [2023-07-17 06:38:26,925][348097] Runner profile tree view: |
| main_loop: 1092.2628 |
| [2023-07-17 06:38:26,925][348097] Collected {0: 10006528}, FPS: 9161.3 |