| [2023-07-17 05:13:30,859][335310] Saving configuration to /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/config.json... |
| [2023-07-17 05:13:30,876][335310] Rollout worker 0 uses device cpu |
| [2023-07-17 05:13:30,877][335310] Rollout worker 1 uses device cpu |
| [2023-07-17 05:13:30,877][335310] Rollout worker 2 uses device cpu |
| [2023-07-17 05:13:30,877][335310] Rollout worker 3 uses device cpu |
| [2023-07-17 05:13:30,877][335310] Rollout worker 4 uses device cpu |
| [2023-07-17 05:13:30,877][335310] Rollout worker 5 uses device cpu |
| [2023-07-17 05:13:30,877][335310] Rollout worker 6 uses device cpu |
| [2023-07-17 05:13:30,878][335310] Rollout worker 7 uses device cpu |
| [2023-07-17 05:13:30,878][335310] In synchronous mode, we only accumulate one batch. Setting num_batches_to_accumulate to 1 |
| [2023-07-17 05:13:30,896][335310] InferenceWorker_p0-w0: min num requests: 2 |
| [2023-07-17 05:13:30,925][335310] Starting all processes... |
| [2023-07-17 05:13:30,926][335310] Starting process learner_proc0 |
| [2023-07-17 05:13:30,974][335310] Starting all processes... |
| [2023-07-17 05:13:31,017][335310] Starting process inference_proc0-0 |
| [2023-07-17 05:13:31,028][335310] Starting process rollout_proc0 |
| [2023-07-17 05:13:31,029][335310] Starting process rollout_proc1 |
| [2023-07-17 05:13:31,029][335310] Starting process rollout_proc2 |
| [2023-07-17 05:13:31,029][335310] Starting process rollout_proc3 |
| [2023-07-17 05:13:31,029][335310] Starting process rollout_proc4 |
| [2023-07-17 05:13:31,029][335310] Starting process rollout_proc5 |
| [2023-07-17 05:13:31,029][335310] Starting process rollout_proc6 |
| [2023-07-17 05:13:31,029][335310] Starting process rollout_proc7 |
| [2023-07-17 05:13:32,855][335551] Starting seed is not provided |
| [2023-07-17 05:13:32,855][335551] Initializing actor-critic model on device cpu |
| [2023-07-17 05:13:32,855][335551] RunningMeanStd input shape: (39,) |
| [2023-07-17 05:13:32,855][335551] RunningMeanStd input shape: (1,) |
| [2023-07-17 05:13:32,865][335598] Worker 0 uses CPU cores [0, 1, 2, 3] |
| [2023-07-17 05:13:32,924][335551] Created Actor Critic model with architecture: |
| [2023-07-17 05:13:32,924][335551] ActorCriticSharedWeights( |
| (obs_normalizer): ObservationNormalizer( |
| (running_mean_std): RunningMeanStdDictInPlace( |
| (running_mean_std): ModuleDict( |
| (obs): RunningMeanStdInPlace() |
| ) |
| ) |
| ) |
| (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) |
| (encoder): MultiInputEncoder( |
| (encoders): ModuleDict( |
| (obs): MlpEncoder( |
| (mlp_head): RecursiveScriptModule( |
| original_name=Sequential |
| (0): RecursiveScriptModule(original_name=Linear) |
| (1): RecursiveScriptModule(original_name=Tanh) |
| (2): RecursiveScriptModule(original_name=Linear) |
| (3): RecursiveScriptModule(original_name=Tanh) |
| ) |
| ) |
| ) |
| ) |
| (core): ModelCoreIdentity() |
| (decoder): MlpDecoder( |
| (mlp): Identity() |
| ) |
| (critic_linear): Linear(in_features=64, out_features=1, bias=True) |
| (action_parameterization): ActionParameterizationContinuousNonAdaptiveStddev( |
| (distribution_linear): Linear(in_features=64, out_features=4, bias=True) |
| ) |
| ) |
| [2023-07-17 05:13:33,013][335665] Worker 6 uses CPU cores [24, 25, 26, 27] |
| [2023-07-17 05:13:33,039][335633] Worker 7 uses CPU cores [28, 29, 30, 31] |
| [2023-07-17 05:13:33,102][335597] Worker 2 uses CPU cores [8, 9, 10, 11] |
| [2023-07-17 05:13:33,236][335551] Using optimizer <class 'torch.optim.adam.Adam'> |
| [2023-07-17 05:13:33,237][335551] No checkpoints found |
| [2023-07-17 05:13:33,237][335551] Did not load from checkpoint, starting from scratch! |
| [2023-07-17 05:13:33,237][335551] Initialized policy 0 weights for model version 0 |
| [2023-07-17 05:13:33,238][335551] LearnerWorker_p0 finished initialization! |
| [2023-07-17 05:13:33,285][335599] Worker 3 uses CPU cores [12, 13, 14, 15] |
| [2023-07-17 05:13:33,337][335595] RunningMeanStd input shape: (39,) |
| [2023-07-17 05:13:33,337][335595] RunningMeanStd input shape: (1,) |
| [2023-07-17 05:13:33,399][335310] Inference worker 0-0 is ready! |
| [2023-07-17 05:13:33,399][335310] All inference workers are ready! Signal rollout workers to start! |
| [2023-07-17 05:13:33,438][335600] Worker 5 uses CPU cores [20, 21, 22, 23] |
| [2023-07-17 05:13:33,456][335601] Worker 4 uses CPU cores [16, 17, 18, 19] |
| [2023-07-17 05:13:33,598][335596] Worker 1 uses CPU cores [4, 5, 6, 7] |
| [2023-07-17 05:13:33,849][335310] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) |
| [2023-07-17 05:13:34,782][335598] Decorrelating experience for 0 frames... |
| [2023-07-17 05:13:34,791][335598] Decorrelating experience for 64 frames... |
| [2023-07-17 05:13:34,808][335597] Decorrelating experience for 0 frames... |
| [2023-07-17 05:13:34,810][335665] Decorrelating experience for 0 frames... |
| [2023-07-17 05:13:34,813][335599] Decorrelating experience for 0 frames... |
| [2023-07-17 05:13:34,813][335633] Decorrelating experience for 0 frames... |
| [2023-07-17 05:13:34,817][335597] Decorrelating experience for 64 frames... |
| [2023-07-17 05:13:34,819][335665] Decorrelating experience for 64 frames... |
| [2023-07-17 05:13:34,822][335633] Decorrelating experience for 64 frames... |
| [2023-07-17 05:13:34,822][335599] Decorrelating experience for 64 frames... |
| [2023-07-17 05:13:34,831][335598] Decorrelating experience for 128 frames... |
| [2023-07-17 05:13:34,858][335597] Decorrelating experience for 128 frames... |
| [2023-07-17 05:13:34,859][335665] Decorrelating experience for 128 frames... |
| [2023-07-17 05:13:34,859][335600] Decorrelating experience for 0 frames... |
| [2023-07-17 05:13:34,862][335633] Decorrelating experience for 128 frames... |
| [2023-07-17 05:13:34,862][335599] Decorrelating experience for 128 frames... |
| [2023-07-17 05:13:34,868][335600] Decorrelating experience for 64 frames... |
| [2023-07-17 05:13:34,909][335600] Decorrelating experience for 128 frames... |
| [2023-07-17 05:13:34,910][335601] Decorrelating experience for 0 frames... |
| [2023-07-17 05:13:34,911][335598] Decorrelating experience for 192 frames... |
| [2023-07-17 05:13:34,919][335601] Decorrelating experience for 64 frames... |
| [2023-07-17 05:13:34,937][335597] Decorrelating experience for 192 frames... |
| [2023-07-17 05:13:34,938][335665] Decorrelating experience for 192 frames... |
| [2023-07-17 05:13:34,941][335599] Decorrelating experience for 192 frames... |
| [2023-07-17 05:13:34,941][335633] Decorrelating experience for 192 frames... |
| [2023-07-17 05:13:34,959][335601] Decorrelating experience for 128 frames... |
| [2023-07-17 05:13:34,988][335600] Decorrelating experience for 192 frames... |
| [2023-07-17 05:13:35,022][335596] Decorrelating experience for 0 frames... |
| [2023-07-17 05:13:35,031][335596] Decorrelating experience for 64 frames... |
| [2023-07-17 05:13:35,038][335601] Decorrelating experience for 192 frames... |
| [2023-07-17 05:13:35,071][335596] Decorrelating experience for 128 frames... |
| [2023-07-17 05:13:35,151][335596] Decorrelating experience for 192 frames... |
| [2023-07-17 05:13:36,288][335598] Decorrelating experience for 256 frames... |
| [2023-07-17 05:13:36,345][335597] Decorrelating experience for 256 frames... |
| [2023-07-17 05:13:36,351][335665] Decorrelating experience for 256 frames... |
| [2023-07-17 05:13:36,352][335633] Decorrelating experience for 256 frames... |
| [2023-07-17 05:13:36,353][335599] Decorrelating experience for 256 frames... |
| [2023-07-17 05:13:36,360][335600] Decorrelating experience for 256 frames... |
| [2023-07-17 05:13:36,414][335601] Decorrelating experience for 256 frames... |
| [2023-07-17 05:13:36,438][335598] Decorrelating experience for 320 frames... |
| [2023-07-17 05:13:36,496][335597] Decorrelating experience for 320 frames... |
| [2023-07-17 05:13:36,498][335665] Decorrelating experience for 320 frames... |
| [2023-07-17 05:13:36,501][335633] Decorrelating experience for 320 frames... |
| [2023-07-17 05:13:36,504][335599] Decorrelating experience for 320 frames... |
| [2023-07-17 05:13:36,509][335600] Decorrelating experience for 320 frames... |
| [2023-07-17 05:13:36,526][335596] Decorrelating experience for 256 frames... |
| [2023-07-17 05:13:36,563][335601] Decorrelating experience for 320 frames... |
| [2023-07-17 05:13:36,626][335598] Decorrelating experience for 384 frames... |
| [2023-07-17 05:13:36,675][335596] Decorrelating experience for 320 frames... |
| [2023-07-17 05:13:36,685][335597] Decorrelating experience for 384 frames... |
| [2023-07-17 05:13:36,686][335665] Decorrelating experience for 384 frames... |
| [2023-07-17 05:13:36,688][335633] Decorrelating experience for 384 frames... |
| [2023-07-17 05:13:36,692][335599] Decorrelating experience for 384 frames... |
| [2023-07-17 05:13:36,697][335600] Decorrelating experience for 384 frames... |
| [2023-07-17 05:13:36,750][335601] Decorrelating experience for 384 frames... |
| [2023-07-17 05:13:36,845][335598] Decorrelating experience for 448 frames... |
| [2023-07-17 05:13:36,864][335596] Decorrelating experience for 384 frames... |
| [2023-07-17 05:13:36,903][335665] Decorrelating experience for 448 frames... |
| [2023-07-17 05:13:36,905][335597] Decorrelating experience for 448 frames... |
| [2023-07-17 05:13:36,908][335633] Decorrelating experience for 448 frames... |
| [2023-07-17 05:13:36,911][335599] Decorrelating experience for 448 frames... |
| [2023-07-17 05:13:36,914][335600] Decorrelating experience for 448 frames... |
| [2023-07-17 05:13:36,966][335601] Decorrelating experience for 448 frames... |
| [2023-07-17 05:13:37,082][335596] Decorrelating experience for 448 frames... |
| [2023-07-17 05:13:38,849][335310] Fps is (10 sec: 1638.4, 60 sec: 1638.4, 300 sec: 1638.4). Total num frames: 8192. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:13:38,850][335310] Avg episode reward: [(0, '2.184')] |
| [2023-07-17 05:13:41,750][335595] Updated weights for policy 0, policy_version 80 (0.0004) |
| [2023-07-17 05:13:43,849][335310] Fps is (10 sec: 6144.0, 60 sec: 6144.0, 300 sec: 6144.0). Total num frames: 61440. Throughput: 0: 4818.4. Samples: 48184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:13:43,850][335310] Avg episode reward: [(0, '4.942')] |
| [2023-07-17 05:13:43,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000000120_61440.pth... |
| [2023-07-17 05:13:45,848][335595] Updated weights for policy 0, policy_version 160 (0.0005) |
| [2023-07-17 05:13:48,849][335310] Fps is (10 sec: 10240.0, 60 sec: 7372.8, 300 sec: 7372.8). Total num frames: 110592. Throughput: 0: 7243.5. Samples: 108652. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 05:13:48,850][335310] Avg episode reward: [(0, '7.975')] |
| [2023-07-17 05:13:48,851][335551] Saving new best policy, reward=7.975! |
| [2023-07-17 05:13:49,856][335595] Updated weights for policy 0, policy_version 240 (0.0005) |
| [2023-07-17 05:13:50,888][335310] Heartbeat connected on Batcher_0 |
| [2023-07-17 05:13:50,891][335310] Heartbeat connected on LearnerWorker_p0 |
| [2023-07-17 05:13:50,897][335310] Heartbeat connected on InferenceWorker_p0-w0 |
| [2023-07-17 05:13:50,906][335310] Heartbeat connected on RolloutWorker_w0 |
| [2023-07-17 05:13:50,909][335310] Heartbeat connected on RolloutWorker_w1 |
| [2023-07-17 05:13:50,912][335310] Heartbeat connected on RolloutWorker_w2 |
| [2023-07-17 05:13:50,914][335310] Heartbeat connected on RolloutWorker_w3 |
| [2023-07-17 05:13:50,917][335310] Heartbeat connected on RolloutWorker_w4 |
| [2023-07-17 05:13:50,920][335310] Heartbeat connected on RolloutWorker_w5 |
| [2023-07-17 05:13:50,923][335310] Heartbeat connected on RolloutWorker_w6 |
| [2023-07-17 05:13:50,927][335310] Heartbeat connected on RolloutWorker_w7 |
| [2023-07-17 05:13:53,849][335310] Fps is (10 sec: 9830.4, 60 sec: 7987.2, 300 sec: 7987.2). Total num frames: 159744. Throughput: 0: 6963.2. Samples: 139264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:13:53,850][335310] Avg episode reward: [(0, '18.841')] |
| [2023-07-17 05:13:53,851][335551] Saving new best policy, reward=18.841! |
| [2023-07-17 05:13:54,012][335595] Updated weights for policy 0, policy_version 320 (0.0005) |
| [2023-07-17 05:13:58,302][335595] Updated weights for policy 0, policy_version 400 (0.0005) |
| [2023-07-17 05:13:58,849][335310] Fps is (10 sec: 9830.4, 60 sec: 8355.8, 300 sec: 8355.8). Total num frames: 208896. Throughput: 0: 7864.6. Samples: 196616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:13:58,850][335310] Avg episode reward: [(0, '44.234')] |
| [2023-07-17 05:13:58,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000000408_208896.pth... |
| [2023-07-17 05:13:58,856][335551] Saving new best policy, reward=44.234! |
| [2023-07-17 05:14:02,685][335595] Updated weights for policy 0, policy_version 480 (0.0005) |
| [2023-07-17 05:14:03,849][335310] Fps is (10 sec: 9420.7, 60 sec: 8465.1, 300 sec: 8465.1). Total num frames: 253952. Throughput: 0: 8429.3. Samples: 252880. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 05:14:03,850][335310] Avg episode reward: [(0, '39.389')] |
| [2023-07-17 05:14:07,039][335595] Updated weights for policy 0, policy_version 560 (0.0005) |
| [2023-07-17 05:14:08,849][335310] Fps is (10 sec: 9420.8, 60 sec: 8660.1, 300 sec: 8660.1). Total num frames: 303104. Throughput: 0: 8043.0. Samples: 281504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:14:08,850][335310] Avg episode reward: [(0, '60.071')] |
| [2023-07-17 05:14:08,850][335551] Saving new best policy, reward=60.071! |
| [2023-07-17 05:14:11,316][335595] Updated weights for policy 0, policy_version 640 (0.0005) |
| [2023-07-17 05:14:13,849][335310] Fps is (10 sec: 9420.7, 60 sec: 8704.0, 300 sec: 8704.0). Total num frames: 348160. Throughput: 0: 8462.9. Samples: 338516. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 05:14:13,850][335310] Avg episode reward: [(0, '51.658')] |
| [2023-07-17 05:14:13,873][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000000688_352256.pth... |
| [2023-07-17 05:14:13,876][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000000120_61440.pth |
| [2023-07-17 05:14:15,587][335595] Updated weights for policy 0, policy_version 720 (0.0005) |
| [2023-07-17 05:14:18,849][335310] Fps is (10 sec: 9420.8, 60 sec: 8829.2, 300 sec: 8829.2). Total num frames: 397312. Throughput: 0: 8779.6. Samples: 395080. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 05:14:18,850][335310] Avg episode reward: [(0, '71.899')] |
| [2023-07-17 05:14:18,850][335551] Saving new best policy, reward=71.899! |
| [2023-07-17 05:14:19,942][335595] Updated weights for policy 0, policy_version 800 (0.0005) |
| [2023-07-17 05:14:23,849][335310] Fps is (10 sec: 9830.7, 60 sec: 8929.3, 300 sec: 8929.3). Total num frames: 446464. Throughput: 0: 9414.0. Samples: 423628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:14:23,850][335310] Avg episode reward: [(0, '109.884')] |
| [2023-07-17 05:14:23,850][335551] Saving new best policy, reward=109.884! |
| [2023-07-17 05:14:24,228][335595] Updated weights for policy 0, policy_version 880 (0.0005) |
| [2023-07-17 05:14:28,551][335595] Updated weights for policy 0, policy_version 960 (0.0005) |
| [2023-07-17 05:14:28,849][335310] Fps is (10 sec: 9420.7, 60 sec: 8936.7, 300 sec: 8936.7). Total num frames: 491520. Throughput: 0: 9614.6. Samples: 480840. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 05:14:28,850][335310] Avg episode reward: [(0, '107.919')] |
| [2023-07-17 05:14:28,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000000960_491520.pth... |
| [2023-07-17 05:14:28,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000000408_208896.pth |
| [2023-07-17 05:14:32,861][335595] Updated weights for policy 0, policy_version 1040 (0.0005) |
| [2023-07-17 05:14:33,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9011.2, 300 sec: 9011.2). Total num frames: 540672. Throughput: 0: 9536.1. Samples: 537776. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 05:14:33,850][335310] Avg episode reward: [(0, '126.256')] |
| [2023-07-17 05:14:33,850][335551] Saving new best policy, reward=126.256! |
| [2023-07-17 05:14:37,222][335595] Updated weights for policy 0, policy_version 1120 (0.0005) |
| [2023-07-17 05:14:38,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9011.2). Total num frames: 585728. Throughput: 0: 9481.5. Samples: 565932. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 05:14:38,850][335310] Avg episode reward: [(0, '148.312')] |
| [2023-07-17 05:14:38,850][335551] Saving new best policy, reward=148.312! |
| [2023-07-17 05:14:41,512][335595] Updated weights for policy 0, policy_version 1200 (0.0005) |
| [2023-07-17 05:14:43,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9069.7). Total num frames: 634880. Throughput: 0: 9474.2. Samples: 622956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:14:43,850][335310] Avg episode reward: [(0, '156.938')] |
| [2023-07-17 05:14:43,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000001240_634880.pth... |
| [2023-07-17 05:14:43,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000000688_352256.pth |
| [2023-07-17 05:14:43,856][335551] Saving new best policy, reward=156.938! |
| [2023-07-17 05:14:45,783][335595] Updated weights for policy 0, policy_version 1280 (0.0005) |
| [2023-07-17 05:14:48,849][335310] Fps is (10 sec: 9830.3, 60 sec: 9557.3, 300 sec: 9120.4). Total num frames: 684032. Throughput: 0: 9504.5. Samples: 680584. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 05:14:48,850][335310] Avg episode reward: [(0, '147.037')] |
| [2023-07-17 05:14:50,089][335595] Updated weights for policy 0, policy_version 1360 (0.0005) |
| [2023-07-17 05:14:53,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9113.6). Total num frames: 729088. Throughput: 0: 9494.4. Samples: 708752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:14:53,850][335310] Avg episode reward: [(0, '189.592')] |
| [2023-07-17 05:14:53,850][335551] Saving new best policy, reward=189.592! |
| [2023-07-17 05:14:54,406][335595] Updated weights for policy 0, policy_version 1440 (0.0005) |
| [2023-07-17 05:14:58,849][335310] Fps is (10 sec: 9011.3, 60 sec: 9420.8, 300 sec: 9107.6). Total num frames: 774144. Throughput: 0: 9492.4. Samples: 765672. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 05:14:58,850][335310] Avg episode reward: [(0, '208.057')] |
| [2023-07-17 05:14:58,880][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000001520_778240.pth... |
| [2023-07-17 05:14:58,880][335595] Updated weights for policy 0, policy_version 1520 (0.0005) |
| [2023-07-17 05:14:58,882][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000000960_491520.pth |
| [2023-07-17 05:14:58,882][335551] Saving new best policy, reward=208.057! |
| [2023-07-17 05:15:03,330][335595] Updated weights for policy 0, policy_version 1600 (0.0005) |
| [2023-07-17 05:15:03,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9147.7). Total num frames: 823296. Throughput: 0: 9443.1. Samples: 820020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:15:03,850][335310] Avg episode reward: [(0, '217.721')] |
| [2023-07-17 05:15:03,851][335551] Saving new best policy, reward=217.721! |
| [2023-07-17 05:15:07,775][335595] Updated weights for policy 0, policy_version 1680 (0.0006) |
| [2023-07-17 05:15:08,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9140.6). Total num frames: 868352. Throughput: 0: 9429.1. Samples: 847936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:15:08,855][335310] Avg episode reward: [(0, '243.923')] |
| [2023-07-17 05:15:08,856][335551] Saving new best policy, reward=243.923! |
| [2023-07-17 05:15:12,310][335595] Updated weights for policy 0, policy_version 1760 (0.0005) |
| [2023-07-17 05:15:13,849][335310] Fps is (10 sec: 9011.3, 60 sec: 9420.8, 300 sec: 9134.1). Total num frames: 913408. Throughput: 0: 9364.8. Samples: 902256. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 05:15:13,850][335310] Avg episode reward: [(0, '255.448')] |
| [2023-07-17 05:15:13,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000001784_913408.pth... |
| [2023-07-17 05:15:13,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000001240_634880.pth |
| [2023-07-17 05:15:13,856][335551] Saving new best policy, reward=255.448! |
| [2023-07-17 05:15:16,887][335595] Updated weights for policy 0, policy_version 1840 (0.0005) |
| [2023-07-17 05:15:18,849][335310] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9128.2). Total num frames: 958464. Throughput: 0: 9295.1. Samples: 956052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:15:18,850][335310] Avg episode reward: [(0, '284.476')] |
| [2023-07-17 05:15:18,851][335551] Saving new best policy, reward=284.476! |
| [2023-07-17 05:15:21,327][335595] Updated weights for policy 0, policy_version 1920 (0.0005) |
| [2023-07-17 05:15:23,849][335310] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9122.9). Total num frames: 1003520. Throughput: 0: 9288.2. Samples: 983900. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 05:15:23,850][335310] Avg episode reward: [(0, '290.489')] |
| [2023-07-17 05:15:23,851][335551] Saving new best policy, reward=290.489! |
| [2023-07-17 05:15:25,743][335595] Updated weights for policy 0, policy_version 2000 (0.0005) |
| [2023-07-17 05:15:28,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9153.7). Total num frames: 1052672. Throughput: 0: 9273.6. Samples: 1040268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:15:28,850][335310] Avg episode reward: [(0, '296.958')] |
| [2023-07-17 05:15:28,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000002056_1052672.pth... |
| [2023-07-17 05:15:28,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000001520_778240.pth |
| [2023-07-17 05:15:28,857][335551] Saving new best policy, reward=296.958! |
| [2023-07-17 05:15:30,117][335595] Updated weights for policy 0, policy_version 2080 (0.0005) |
| [2023-07-17 05:15:33,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9147.7). Total num frames: 1097728. Throughput: 0: 9221.1. Samples: 1095532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:15:33,850][335310] Avg episode reward: [(0, '333.954')] |
| [2023-07-17 05:15:33,851][335551] Saving new best policy, reward=333.954! |
| [2023-07-17 05:15:34,555][335595] Updated weights for policy 0, policy_version 2160 (0.0005) |
| [2023-07-17 05:15:38,849][335310] Fps is (10 sec: 9011.3, 60 sec: 9284.3, 300 sec: 9142.3). Total num frames: 1142784. Throughput: 0: 9214.3. Samples: 1123396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:15:38,850][335310] Avg episode reward: [(0, '366.417')] |
| [2023-07-17 05:15:38,850][335551] Saving new best policy, reward=366.417! |
| [2023-07-17 05:15:38,957][335595] Updated weights for policy 0, policy_version 2240 (0.0005) |
| [2023-07-17 05:15:43,379][335595] Updated weights for policy 0, policy_version 2320 (0.0005) |
| [2023-07-17 05:15:43,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9284.3, 300 sec: 9168.7). Total num frames: 1191936. Throughput: 0: 9192.0. Samples: 1179312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:15:43,850][335310] Avg episode reward: [(0, '394.952')] |
| [2023-07-17 05:15:43,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000002328_1191936.pth... |
| [2023-07-17 05:15:43,855][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000001784_913408.pth |
| [2023-07-17 05:15:43,855][335551] Saving new best policy, reward=394.952! |
| [2023-07-17 05:15:47,770][335595] Updated weights for policy 0, policy_version 2400 (0.0006) |
| [2023-07-17 05:15:48,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9216.0, 300 sec: 9162.9). Total num frames: 1236992. Throughput: 0: 9221.6. Samples: 1234992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:15:48,850][335310] Avg episode reward: [(0, '426.014')] |
| [2023-07-17 05:15:48,850][335551] Saving new best policy, reward=426.014! |
| [2023-07-17 05:15:52,157][335595] Updated weights for policy 0, policy_version 2480 (0.0005) |
| [2023-07-17 05:15:53,849][335310] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9157.5). Total num frames: 1282048. Throughput: 0: 9223.7. Samples: 1263004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:15:53,850][335310] Avg episode reward: [(0, '423.155')] |
| [2023-07-17 05:15:56,559][335595] Updated weights for policy 0, policy_version 2560 (0.0006) |
| [2023-07-17 05:15:58,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9180.7). Total num frames: 1331200. Throughput: 0: 9260.4. Samples: 1318976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:15:58,850][335310] Avg episode reward: [(0, '414.503')] |
| [2023-07-17 05:15:58,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000002600_1331200.pth... |
| [2023-07-17 05:15:58,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000002056_1052672.pth |
| [2023-07-17 05:16:00,883][335595] Updated weights for policy 0, policy_version 2640 (0.0006) |
| [2023-07-17 05:16:03,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9175.0). Total num frames: 1376256. Throughput: 0: 9335.4. Samples: 1376144. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 05:16:03,850][335310] Avg episode reward: [(0, '384.782')] |
| [2023-07-17 05:16:05,276][335595] Updated weights for policy 0, policy_version 2720 (0.0005) |
| [2023-07-17 05:16:08,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9196.2). Total num frames: 1425408. Throughput: 0: 9332.1. Samples: 1403844. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:16:08,850][335310] Avg episode reward: [(0, '410.423')] |
| [2023-07-17 05:16:09,607][335595] Updated weights for policy 0, policy_version 2800 (0.0006) |
| [2023-07-17 05:16:13,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9190.4). Total num frames: 1470464. Throughput: 0: 9333.1. Samples: 1460256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:16:13,850][335310] Avg episode reward: [(0, '436.468')] |
| [2023-07-17 05:16:13,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000002872_1470464.pth... |
| [2023-07-17 05:16:13,855][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000002328_1191936.pth |
| [2023-07-17 05:16:13,856][335551] Saving new best policy, reward=436.468! |
| [2023-07-17 05:16:14,004][335595] Updated weights for policy 0, policy_version 2880 (0.0005) |
| [2023-07-17 05:16:18,435][335595] Updated weights for policy 0, policy_version 2960 (0.0006) |
| [2023-07-17 05:16:18,849][335310] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9185.0). Total num frames: 1515520. Throughput: 0: 9334.5. Samples: 1515584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:16:18,850][335310] Avg episode reward: [(0, '441.345')] |
| [2023-07-17 05:16:18,889][335551] Saving new best policy, reward=441.345! |
| [2023-07-17 05:16:22,833][335595] Updated weights for policy 0, policy_version 3040 (0.0005) |
| [2023-07-17 05:16:23,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9204.0). Total num frames: 1564672. Throughput: 0: 9351.0. Samples: 1544192. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 05:16:23,850][335310] Avg episode reward: [(0, '450.067')] |
| [2023-07-17 05:16:23,850][335551] Saving new best policy, reward=450.067! |
| [2023-07-17 05:16:27,270][335595] Updated weights for policy 0, policy_version 3120 (0.0005) |
| [2023-07-17 05:16:28,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9284.3, 300 sec: 9198.4). Total num frames: 1609728. Throughput: 0: 9325.0. Samples: 1598936. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:16:28,850][335310] Avg episode reward: [(0, '418.239')] |
| [2023-07-17 05:16:28,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000003144_1609728.pth... |
| [2023-07-17 05:16:28,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000002600_1331200.pth |
| [2023-07-17 05:16:31,448][335595] Updated weights for policy 0, policy_version 3200 (0.0005) |
| [2023-07-17 05:16:33,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9216.0). Total num frames: 1658880. Throughput: 0: 9417.2. Samples: 1658764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:16:33,850][335310] Avg episode reward: [(0, '428.357')] |
| [2023-07-17 05:16:35,550][335595] Updated weights for policy 0, policy_version 3280 (0.0004) |
| [2023-07-17 05:16:38,849][335310] Fps is (10 sec: 9830.6, 60 sec: 9420.8, 300 sec: 9232.6). Total num frames: 1708032. Throughput: 0: 9441.3. Samples: 1687864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:16:38,850][335310] Avg episode reward: [(0, '438.145')] |
| [2023-07-17 05:16:39,689][335595] Updated weights for policy 0, policy_version 3360 (0.0005) |
| [2023-07-17 05:16:43,773][335595] Updated weights for policy 0, policy_version 3440 (0.0005) |
| [2023-07-17 05:16:43,849][335310] Fps is (10 sec: 10239.9, 60 sec: 9489.1, 300 sec: 9269.9). Total num frames: 1761280. Throughput: 0: 9529.0. Samples: 1747780. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 05:16:43,850][335310] Avg episode reward: [(0, '441.160')] |
| [2023-07-17 05:16:43,852][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000003440_1761280.pth... |
| [2023-07-17 05:16:43,855][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000002872_1470464.pth |
| [2023-07-17 05:16:47,946][335595] Updated weights for policy 0, policy_version 3520 (0.0005) |
| [2023-07-17 05:16:48,849][335310] Fps is (10 sec: 10239.9, 60 sec: 9557.3, 300 sec: 9284.3). Total num frames: 1810432. Throughput: 0: 9568.6. Samples: 1806732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:16:48,850][335310] Avg episode reward: [(0, '446.303')] |
| [2023-07-17 05:16:52,060][335595] Updated weights for policy 0, policy_version 3600 (0.0005) |
| [2023-07-17 05:16:53,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9297.9). Total num frames: 1859584. Throughput: 0: 9622.6. Samples: 1836860. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 05:16:53,850][335310] Avg episode reward: [(0, '473.094')] |
| [2023-07-17 05:16:53,850][335551] Saving new best policy, reward=473.094! |
| [2023-07-17 05:16:56,157][335595] Updated weights for policy 0, policy_version 3680 (0.0004) |
| [2023-07-17 05:16:58,849][335310] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9310.9). Total num frames: 1908736. Throughput: 0: 9699.0. Samples: 1896712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:16:58,850][335310] Avg episode reward: [(0, '476.325')] |
| [2023-07-17 05:16:58,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000003728_1908736.pth... |
| [2023-07-17 05:16:58,855][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000003144_1609728.pth |
| [2023-07-17 05:16:58,856][335551] Saving new best policy, reward=476.325! |
| [2023-07-17 05:17:00,325][335595] Updated weights for policy 0, policy_version 3760 (0.0005) |
| [2023-07-17 05:17:03,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9323.3). Total num frames: 1957888. Throughput: 0: 9801.1. Samples: 1956636. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 05:17:03,850][335310] Avg episode reward: [(0, '471.797')] |
| [2023-07-17 05:17:04,354][335595] Updated weights for policy 0, policy_version 3840 (0.0004) |
| [2023-07-17 05:17:08,473][335595] Updated weights for policy 0, policy_version 3920 (0.0004) |
| [2023-07-17 05:17:08,849][335310] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9335.1). Total num frames: 2007040. Throughput: 0: 9831.8. Samples: 1986624. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 05:17:08,850][335310] Avg episode reward: [(0, '468.677')] |
| [2023-07-17 05:17:12,741][335595] Updated weights for policy 0, policy_version 4000 (0.0006) |
| [2023-07-17 05:17:13,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9346.3). Total num frames: 2056192. Throughput: 0: 9912.1. Samples: 2044980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:17:13,850][335310] Avg episode reward: [(0, '471.084')] |
| [2023-07-17 05:17:13,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000004016_2056192.pth... |
| [2023-07-17 05:17:13,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000003440_1761280.pth |
| [2023-07-17 05:17:17,089][335595] Updated weights for policy 0, policy_version 4080 (0.0005) |
| [2023-07-17 05:17:18,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9338.9). Total num frames: 2101248. Throughput: 0: 9834.4. Samples: 2101312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:17:18,850][335310] Avg episode reward: [(0, '473.415')] |
| [2023-07-17 05:17:21,495][335595] Updated weights for policy 0, policy_version 4160 (0.0005) |
| [2023-07-17 05:17:23,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9349.6). Total num frames: 2150400. Throughput: 0: 9823.4. Samples: 2129920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:17:23,850][335310] Avg episode reward: [(0, '499.511')] |
| [2023-07-17 05:17:23,850][335551] Saving new best policy, reward=499.511! |
| [2023-07-17 05:17:25,840][335595] Updated weights for policy 0, policy_version 4240 (0.0005) |
| [2023-07-17 05:17:28,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9342.4). Total num frames: 2195456. Throughput: 0: 9735.2. Samples: 2185864. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 05:17:28,850][335310] Avg episode reward: [(0, '454.626')] |
| [2023-07-17 05:17:28,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000004288_2195456.pth... |
| [2023-07-17 05:17:28,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000003728_1908736.pth |
| [2023-07-17 05:17:30,241][335595] Updated weights for policy 0, policy_version 4320 (0.0005) |
| [2023-07-17 05:17:33,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 9352.5). Total num frames: 2244608. Throughput: 0: 9675.7. Samples: 2242140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:17:33,850][335310] Avg episode reward: [(0, '495.835')] |
| [2023-07-17 05:17:34,559][335595] Updated weights for policy 0, policy_version 4400 (0.0006) |
| [2023-07-17 05:17:38,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9345.6). Total num frames: 2289664. Throughput: 0: 9625.6. Samples: 2270012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:17:38,850][335310] Avg episode reward: [(0, '464.031')] |
| [2023-07-17 05:17:38,929][335595] Updated weights for policy 0, policy_version 4480 (0.0005) |
| [2023-07-17 05:17:43,309][335595] Updated weights for policy 0, policy_version 4560 (0.0005) |
| [2023-07-17 05:17:43,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9625.6, 300 sec: 9355.3). Total num frames: 2338816. Throughput: 0: 9552.9. Samples: 2326592. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 05:17:43,850][335310] Avg episode reward: [(0, '490.903')] |
| [2023-07-17 05:17:43,852][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000004568_2338816.pth... |
| [2023-07-17 05:17:43,855][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000004016_2056192.pth |
| [2023-07-17 05:17:47,654][335595] Updated weights for policy 0, policy_version 4640 (0.0004) |
| [2023-07-17 05:17:48,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9348.5). Total num frames: 2383872. Throughput: 0: 9485.4. Samples: 2383480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:17:48,850][335310] Avg episode reward: [(0, '496.232')] |
| [2023-07-17 05:17:52,022][335595] Updated weights for policy 0, policy_version 4720 (0.0005) |
| [2023-07-17 05:17:53,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9357.8). Total num frames: 2433024. Throughput: 0: 9437.9. Samples: 2411332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:17:53,850][335310] Avg episode reward: [(0, '470.382')] |
| [2023-07-17 05:17:56,142][335595] Updated weights for policy 0, policy_version 4800 (0.0005) |
| [2023-07-17 05:17:58,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9366.7). Total num frames: 2482176. Throughput: 0: 9443.8. Samples: 2469952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:17:58,850][335310] Avg episode reward: [(0, '501.081')] |
| [2023-07-17 05:17:58,852][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000004848_2482176.pth... |
| [2023-07-17 05:17:58,855][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000004288_2195456.pth |
| [2023-07-17 05:17:58,856][335551] Saving new best policy, reward=501.081! |
| [2023-07-17 05:18:00,483][335595] Updated weights for policy 0, policy_version 4880 (0.0005) |
| [2023-07-17 05:18:03,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9360.1). Total num frames: 2527232. Throughput: 0: 9465.1. Samples: 2527240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:18:03,850][335310] Avg episode reward: [(0, '476.773')] |
| [2023-07-17 05:18:04,680][335595] Updated weights for policy 0, policy_version 4960 (0.0005) |
| [2023-07-17 05:18:08,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9368.7). Total num frames: 2576384. Throughput: 0: 9473.1. Samples: 2556208. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 05:18:08,850][335310] Avg episode reward: [(0, '484.045')] |
| [2023-07-17 05:18:09,020][335595] Updated weights for policy 0, policy_version 5040 (0.0005) |
| [2023-07-17 05:18:13,361][335595] Updated weights for policy 0, policy_version 5120 (0.0005) |
| [2023-07-17 05:18:13,849][335310] Fps is (10 sec: 9830.2, 60 sec: 9489.1, 300 sec: 9376.9). Total num frames: 2625536. Throughput: 0: 9492.4. Samples: 2613024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:18:13,850][335310] Avg episode reward: [(0, '478.669')] |
| [2023-07-17 05:18:13,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000005128_2625536.pth... |
| [2023-07-17 05:18:13,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000004568_2338816.pth |
| [2023-07-17 05:18:17,730][335595] Updated weights for policy 0, policy_version 5200 (0.0005) |
| [2023-07-17 05:18:18,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9370.5). Total num frames: 2670592. Throughput: 0: 9492.5. Samples: 2669304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:18:18,850][335310] Avg episode reward: [(0, '486.757')] |
| [2023-07-17 05:18:22,092][335595] Updated weights for policy 0, policy_version 5280 (0.0005) |
| [2023-07-17 05:18:23,849][335310] Fps is (10 sec: 9011.3, 60 sec: 9420.8, 300 sec: 9364.3). Total num frames: 2715648. Throughput: 0: 9487.7. Samples: 2696960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:18:23,850][335310] Avg episode reward: [(0, '483.037')] |
| [2023-07-17 05:18:26,511][335595] Updated weights for policy 0, policy_version 5360 (0.0005) |
| [2023-07-17 05:18:28,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9372.2). Total num frames: 2764800. Throughput: 0: 9477.7. Samples: 2753088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:18:28,850][335310] Avg episode reward: [(0, '492.020')] |
| [2023-07-17 05:18:28,852][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000005400_2764800.pth... |
| [2023-07-17 05:18:28,854][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000004848_2482176.pth |
| [2023-07-17 05:18:30,792][335595] Updated weights for policy 0, policy_version 5440 (0.0005) |
| [2023-07-17 05:18:33,849][335310] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9511.1). Total num frames: 2813952. Throughput: 0: 9481.4. Samples: 2810144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:18:33,850][335310] Avg episode reward: [(0, '486.097')] |
| [2023-07-17 05:18:35,107][335595] Updated weights for policy 0, policy_version 5520 (0.0005) |
| [2023-07-17 05:18:38,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9483.3). Total num frames: 2859008. Throughput: 0: 9493.4. Samples: 2838536. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:18:38,850][335310] Avg episode reward: [(0, '494.541')] |
| [2023-07-17 05:18:39,448][335595] Updated weights for policy 0, policy_version 5600 (0.0005) |
| [2023-07-17 05:18:43,827][335595] Updated weights for policy 0, policy_version 5680 (0.0005) |
| [2023-07-17 05:18:43,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9483.3). Total num frames: 2908160. Throughput: 0: 9453.8. Samples: 2895372. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 05:18:43,850][335310] Avg episode reward: [(0, '496.589')] |
| [2023-07-17 05:18:43,852][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000005680_2908160.pth... |
| [2023-07-17 05:18:43,855][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000005128_2625536.pth |
| [2023-07-17 05:18:48,235][335595] Updated weights for policy 0, policy_version 5760 (0.0005) |
| [2023-07-17 05:18:48,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9469.4). Total num frames: 2953216. Throughput: 0: 9417.6. Samples: 2951032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:18:48,850][335310] Avg episode reward: [(0, '493.104')] |
| [2023-07-17 05:18:52,699][335595] Updated weights for policy 0, policy_version 5840 (0.0005) |
| [2023-07-17 05:18:53,849][335310] Fps is (10 sec: 9011.3, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 2998272. Throughput: 0: 9387.1. Samples: 2978628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:18:53,850][335310] Avg episode reward: [(0, '487.136')] |
| [2023-07-17 05:18:57,075][335595] Updated weights for policy 0, policy_version 5920 (0.0005) |
| [2023-07-17 05:18:58,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9469.4). Total num frames: 3047424. Throughput: 0: 9370.6. Samples: 3034700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:18:58,850][335310] Avg episode reward: [(0, '472.475')] |
| [2023-07-17 05:18:58,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000005952_3047424.pth... |
| [2023-07-17 05:18:58,855][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000005400_2764800.pth |
| [2023-07-17 05:19:01,339][335595] Updated weights for policy 0, policy_version 6000 (0.0005) |
| [2023-07-17 05:19:03,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 3092480. Throughput: 0: 9386.8. Samples: 3091712. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 05:19:03,850][335310] Avg episode reward: [(0, '489.487')] |
| [2023-07-17 05:19:05,569][335595] Updated weights for policy 0, policy_version 6080 (0.0005) |
| [2023-07-17 05:19:08,849][335310] Fps is (10 sec: 9830.5, 60 sec: 9489.1, 300 sec: 9483.3). Total num frames: 3145728. Throughput: 0: 9431.5. Samples: 3121376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:19:08,850][335310] Avg episode reward: [(0, '474.764')] |
| [2023-07-17 05:19:09,660][335595] Updated weights for policy 0, policy_version 6160 (0.0004) |
| [2023-07-17 05:19:13,756][335595] Updated weights for policy 0, policy_version 6240 (0.0004) |
| [2023-07-17 05:19:13,849][335310] Fps is (10 sec: 10240.1, 60 sec: 9489.1, 300 sec: 9483.3). Total num frames: 3194880. Throughput: 0: 9540.9. Samples: 3182428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:19:13,850][335310] Avg episode reward: [(0, '471.555')] |
| [2023-07-17 05:19:13,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000006240_3194880.pth... |
| [2023-07-17 05:19:13,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000005680_2908160.pth |
| [2023-07-17 05:19:18,144][335595] Updated weights for policy 0, policy_version 6320 (0.0006) |
| [2023-07-17 05:19:18,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9469.4). Total num frames: 3239936. Throughput: 0: 9526.8. Samples: 3238848. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 05:19:18,850][335310] Avg episode reward: [(0, '475.861')] |
| [2023-07-17 05:19:22,473][335595] Updated weights for policy 0, policy_version 6400 (0.0005) |
| [2023-07-17 05:19:23,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9483.3). Total num frames: 3289088. Throughput: 0: 9521.5. Samples: 3267004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:19:23,850][335310] Avg episode reward: [(0, '479.277')] |
| [2023-07-17 05:19:26,912][335595] Updated weights for policy 0, policy_version 6480 (0.0005) |
| [2023-07-17 05:19:28,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9469.4). Total num frames: 3334144. Throughput: 0: 9494.6. Samples: 3322628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:19:28,930][335310] Avg episode reward: [(0, '482.722')] |
| [2023-07-17 05:19:28,934][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000006512_3334144.pth... |
| [2023-07-17 05:19:28,937][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000005952_3047424.pth |
| [2023-07-17 05:19:31,367][335595] Updated weights for policy 0, policy_version 6560 (0.0005) |
| [2023-07-17 05:19:33,849][335310] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9469.4). Total num frames: 3379200. Throughput: 0: 9497.0. Samples: 3378396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:19:33,850][335310] Avg episode reward: [(0, '485.542')] |
| [2023-07-17 05:19:35,726][335595] Updated weights for policy 0, policy_version 6640 (0.0005) |
| [2023-07-17 05:19:38,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9469.4). Total num frames: 3428352. Throughput: 0: 9511.8. Samples: 3406660. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 05:19:38,850][335310] Avg episode reward: [(0, '468.483')] |
| [2023-07-17 05:19:40,085][335595] Updated weights for policy 0, policy_version 6720 (0.0005) |
| [2023-07-17 05:19:43,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 3473408. Throughput: 0: 9528.4. Samples: 3463476. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:19:43,850][335310] Avg episode reward: [(0, '481.035')] |
| [2023-07-17 05:19:43,869][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000006792_3477504.pth... |
| [2023-07-17 05:19:43,871][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000006240_3194880.pth |
| [2023-07-17 05:19:44,269][335595] Updated weights for policy 0, policy_version 6800 (0.0005) |
| [2023-07-17 05:19:48,432][335595] Updated weights for policy 0, policy_version 6880 (0.0005) |
| [2023-07-17 05:19:48,849][335310] Fps is (10 sec: 9421.0, 60 sec: 9489.1, 300 sec: 9469.4). Total num frames: 3522560. Throughput: 0: 9575.8. Samples: 3522624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:19:48,850][335310] Avg episode reward: [(0, '461.683')] |
| [2023-07-17 05:19:52,614][335595] Updated weights for policy 0, policy_version 6960 (0.0004) |
| [2023-07-17 05:19:53,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9483.3). Total num frames: 3571712. Throughput: 0: 9570.1. Samples: 3552028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:19:53,850][335310] Avg episode reward: [(0, '480.192')] |
| [2023-07-17 05:19:56,941][335595] Updated weights for policy 0, policy_version 7040 (0.0005) |
| [2023-07-17 05:19:58,849][335310] Fps is (10 sec: 9830.3, 60 sec: 9557.3, 300 sec: 9483.3). Total num frames: 3620864. Throughput: 0: 9471.5. Samples: 3608648. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 05:19:58,850][335310] Avg episode reward: [(0, '497.630')] |
| [2023-07-17 05:19:58,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000007072_3620864.pth... |
| [2023-07-17 05:19:58,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000006512_3334144.pth |
| [2023-07-17 05:20:01,399][335595] Updated weights for policy 0, policy_version 7120 (0.0005) |
| [2023-07-17 05:20:03,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9483.3). Total num frames: 3665920. Throughput: 0: 9472.5. Samples: 3665108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:20:03,850][335310] Avg episode reward: [(0, '489.145')] |
| [2023-07-17 05:20:05,681][335595] Updated weights for policy 0, policy_version 7200 (0.0005) |
| [2023-07-17 05:20:08,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 3715072. Throughput: 0: 9477.5. Samples: 3693492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:20:08,850][335310] Avg episode reward: [(0, '505.117')] |
| [2023-07-17 05:20:08,851][335551] Saving new best policy, reward=505.117! |
| [2023-07-17 05:20:10,103][335595] Updated weights for policy 0, policy_version 7280 (0.0005) |
| [2023-07-17 05:20:13,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 3760128. Throughput: 0: 9459.9. Samples: 3748324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:20:13,850][335310] Avg episode reward: [(0, '486.362')] |
| [2023-07-17 05:20:13,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000007344_3760128.pth... |
| [2023-07-17 05:20:13,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000006792_3477504.pth |
| [2023-07-17 05:20:14,556][335595] Updated weights for policy 0, policy_version 7360 (0.0005) |
| [2023-07-17 05:20:18,849][335310] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 3805184. Throughput: 0: 9472.4. Samples: 3804652. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 05:20:18,851][335310] Avg episode reward: [(0, '453.102')] |
| [2023-07-17 05:20:18,940][335595] Updated weights for policy 0, policy_version 7440 (0.0005) |
| [2023-07-17 05:20:23,310][335595] Updated weights for policy 0, policy_version 7520 (0.0005) |
| [2023-07-17 05:20:23,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 3854336. Throughput: 0: 9477.4. Samples: 3833140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:20:23,850][335310] Avg episode reward: [(0, '489.923')] |
| [2023-07-17 05:20:27,641][335595] Updated weights for policy 0, policy_version 7600 (0.0005) |
| [2023-07-17 05:20:28,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 3899392. Throughput: 0: 9465.7. Samples: 3889432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:20:28,850][335310] Avg episode reward: [(0, '490.752')] |
| [2023-07-17 05:20:28,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000007616_3899392.pth... |
| [2023-07-17 05:20:28,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000007072_3620864.pth |
| [2023-07-17 05:20:32,016][335595] Updated weights for policy 0, policy_version 7680 (0.0005) |
| [2023-07-17 05:20:33,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9511.1). Total num frames: 3948544. Throughput: 0: 9391.6. Samples: 3945248. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 05:20:33,850][335310] Avg episode reward: [(0, '487.811')] |
| [2023-07-17 05:20:36,462][335595] Updated weights for policy 0, policy_version 7760 (0.0005) |
| [2023-07-17 05:20:38,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 3993600. Throughput: 0: 9357.8. Samples: 3973128. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 05:20:38,850][335310] Avg episode reward: [(0, '499.029')] |
| [2023-07-17 05:20:40,916][335595] Updated weights for policy 0, policy_version 7840 (0.0005) |
| [2023-07-17 05:20:43,849][335310] Fps is (10 sec: 9011.1, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 4038656. Throughput: 0: 9327.5. Samples: 4028388. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 05:20:43,850][335310] Avg episode reward: [(0, '490.176')] |
| [2023-07-17 05:20:43,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000007888_4038656.pth... |
| [2023-07-17 05:20:43,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000007344_3760128.pth |
| [2023-07-17 05:20:45,325][335595] Updated weights for policy 0, policy_version 7920 (0.0005) |
| [2023-07-17 05:20:48,849][335310] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9497.2). Total num frames: 4083712. Throughput: 0: 9303.7. Samples: 4083776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:20:48,850][335310] Avg episode reward: [(0, '486.757')] |
| [2023-07-17 05:20:49,744][335595] Updated weights for policy 0, policy_version 8000 (0.0005) |
| [2023-07-17 05:20:53,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9497.2). Total num frames: 4132864. Throughput: 0: 9303.9. Samples: 4112168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:20:53,850][335310] Avg episode reward: [(0, '490.310')] |
| [2023-07-17 05:20:54,188][335595] Updated weights for policy 0, policy_version 8080 (0.0005) |
| [2023-07-17 05:20:58,639][335595] Updated weights for policy 0, policy_version 8160 (0.0005) |
| [2023-07-17 05:20:58,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9497.2). Total num frames: 4177920. Throughput: 0: 9297.5. Samples: 4166712. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 05:20:58,850][335310] Avg episode reward: [(0, '499.788')] |
| [2023-07-17 05:20:58,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000008160_4177920.pth... |
| [2023-07-17 05:20:58,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000007616_3899392.pth |
| [2023-07-17 05:21:02,979][335595] Updated weights for policy 0, policy_version 8240 (0.0005) |
| [2023-07-17 05:21:03,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9497.2). Total num frames: 4227072. Throughput: 0: 9297.7. Samples: 4223048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:21:03,850][335310] Avg episode reward: [(0, '487.946')] |
| [2023-07-17 05:21:07,380][335595] Updated weights for policy 0, policy_version 8320 (0.0005) |
| [2023-07-17 05:21:08,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9284.3, 300 sec: 9497.2). Total num frames: 4272128. Throughput: 0: 9300.2. Samples: 4251648. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:21:08,850][335310] Avg episode reward: [(0, '484.799')] |
| [2023-07-17 05:21:11,751][335595] Updated weights for policy 0, policy_version 8400 (0.0005) |
| [2023-07-17 05:21:13,849][335310] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9497.2). Total num frames: 4317184. Throughput: 0: 9289.1. Samples: 4307440. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:21:13,850][335310] Avg episode reward: [(0, '479.634')] |
| [2023-07-17 05:21:13,892][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000008440_4321280.pth... |
| [2023-07-17 05:21:13,895][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000007888_4038656.pth |
| [2023-07-17 05:21:16,037][335595] Updated weights for policy 0, policy_version 8480 (0.0005) |
| [2023-07-17 05:21:18,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9497.2). Total num frames: 4366336. Throughput: 0: 9348.6. Samples: 4365936. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 05:21:18,850][335310] Avg episode reward: [(0, '500.253')] |
| [2023-07-17 05:21:20,136][335595] Updated weights for policy 0, policy_version 8560 (0.0004) |
| [2023-07-17 05:21:23,849][335310] Fps is (10 sec: 10240.0, 60 sec: 9420.8, 300 sec: 9524.9). Total num frames: 4419584. Throughput: 0: 9378.8. Samples: 4395176. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:21:23,850][335310] Avg episode reward: [(0, '491.635')] |
| [2023-07-17 05:21:24,240][335595] Updated weights for policy 0, policy_version 8640 (0.0004) |
| [2023-07-17 05:21:28,387][335595] Updated weights for policy 0, policy_version 8720 (0.0004) |
| [2023-07-17 05:21:28,849][335310] Fps is (10 sec: 10240.0, 60 sec: 9489.1, 300 sec: 9524.9). Total num frames: 4468736. Throughput: 0: 9498.2. Samples: 4455808. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:21:28,850][335310] Avg episode reward: [(0, '503.065')] |
| [2023-07-17 05:21:28,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000008728_4468736.pth... |
| [2023-07-17 05:21:28,855][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000008160_4177920.pth |
| [2023-07-17 05:21:32,765][335595] Updated weights for policy 0, policy_version 8800 (0.0005) |
| [2023-07-17 05:21:33,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9511.0). Total num frames: 4513792. Throughput: 0: 9517.8. Samples: 4512076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:21:33,850][335310] Avg episode reward: [(0, '494.070')] |
| [2023-07-17 05:21:36,980][335595] Updated weights for policy 0, policy_version 8880 (0.0005) |
| [2023-07-17 05:21:38,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 4562944. Throughput: 0: 9537.7. Samples: 4541364. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 05:21:38,850][335310] Avg episode reward: [(0, '500.539')] |
| [2023-07-17 05:21:41,118][335595] Updated weights for policy 0, policy_version 8960 (0.0005) |
| [2023-07-17 05:21:43,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9497.2). Total num frames: 4612096. Throughput: 0: 9631.8. Samples: 4600144. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) |
| [2023-07-17 05:21:43,850][335310] Avg episode reward: [(0, '488.598')] |
| [2023-07-17 05:21:43,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000009008_4612096.pth... |
| [2023-07-17 05:21:43,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000008440_4321280.pth |
| [2023-07-17 05:21:45,257][335595] Updated weights for policy 0, policy_version 9040 (0.0004) |
| [2023-07-17 05:21:48,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9497.2). Total num frames: 4661248. Throughput: 0: 9713.2. Samples: 4660144. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 05:21:48,851][335310] Avg episode reward: [(0, '495.800')] |
| [2023-07-17 05:21:49,393][335595] Updated weights for policy 0, policy_version 9120 (0.0004) |
| [2023-07-17 05:21:53,477][335595] Updated weights for policy 0, policy_version 9200 (0.0004) |
| [2023-07-17 05:21:53,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9497.2). Total num frames: 4710400. Throughput: 0: 9737.0. Samples: 4689816. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 05:21:53,850][335310] Avg episode reward: [(0, '489.062')] |
| [2023-07-17 05:21:57,713][335595] Updated weights for policy 0, policy_version 9280 (0.0005) |
| [2023-07-17 05:21:58,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9497.2). Total num frames: 4759552. Throughput: 0: 9808.7. Samples: 4748832. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 05:21:58,850][335310] Avg episode reward: [(0, '507.400')] |
| [2023-07-17 05:21:58,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000009296_4759552.pth... |
| [2023-07-17 05:21:58,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000008728_4468736.pth |
| [2023-07-17 05:21:58,857][335551] Saving new best policy, reward=507.400! |
| [2023-07-17 05:22:02,102][335595] Updated weights for policy 0, policy_version 9360 (0.0005) |
| [2023-07-17 05:22:03,849][335310] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9497.2). Total num frames: 4808704. Throughput: 0: 9750.4. Samples: 4804704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:22:03,850][335310] Avg episode reward: [(0, '502.434')] |
| [2023-07-17 05:22:06,458][335595] Updated weights for policy 0, policy_version 9440 (0.0005) |
| [2023-07-17 05:22:08,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9483.3). Total num frames: 4853760. Throughput: 0: 9735.9. Samples: 4833292. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 05:22:08,850][335310] Avg episode reward: [(0, '497.403')] |
| [2023-07-17 05:22:10,797][335595] Updated weights for policy 0, policy_version 9520 (0.0005) |
| [2023-07-17 05:22:13,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9497.2). Total num frames: 4902912. Throughput: 0: 9653.1. Samples: 4890196. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 05:22:13,850][335310] Avg episode reward: [(0, '502.112')] |
| [2023-07-17 05:22:13,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000009576_4902912.pth... |
| [2023-07-17 05:22:13,855][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000009008_4612096.pth |
| [2023-07-17 05:22:15,084][335595] Updated weights for policy 0, policy_version 9600 (0.0005) |
| [2023-07-17 05:22:18,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9483.3). Total num frames: 4947968. Throughput: 0: 9670.3. Samples: 4947240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:22:18,850][335310] Avg episode reward: [(0, '499.281')] |
| [2023-07-17 05:22:19,429][335595] Updated weights for policy 0, policy_version 9680 (0.0005) |
| [2023-07-17 05:22:23,811][335595] Updated weights for policy 0, policy_version 9760 (0.0005) |
| [2023-07-17 05:22:23,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9497.2). Total num frames: 4997120. Throughput: 0: 9642.5. Samples: 4975276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:22:23,850][335310] Avg episode reward: [(0, '505.856')] |
| [2023-07-17 05:22:28,125][335595] Updated weights for policy 0, policy_version 9840 (0.0005) |
| [2023-07-17 05:22:28,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9483.3). Total num frames: 5042176. Throughput: 0: 9583.7. Samples: 5031412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:22:28,850][335310] Avg episode reward: [(0, '517.455')] |
| [2023-07-17 05:22:28,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000009848_5042176.pth... |
| [2023-07-17 05:22:28,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000009296_4759552.pth |
| [2023-07-17 05:22:28,857][335551] Saving new best policy, reward=517.455! |
| [2023-07-17 05:22:32,431][335595] Updated weights for policy 0, policy_version 9920 (0.0005) |
| [2023-07-17 05:22:33,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9497.2). Total num frames: 5091328. Throughput: 0: 9520.4. Samples: 5088560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:22:33,850][335310] Avg episode reward: [(0, '508.459')] |
| [2023-07-17 05:22:36,791][335595] Updated weights for policy 0, policy_version 10000 (0.0005) |
| [2023-07-17 05:22:38,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9483.3). Total num frames: 5136384. Throughput: 0: 9481.6. Samples: 5116488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:22:38,850][335310] Avg episode reward: [(0, '516.941')] |
| [2023-07-17 05:22:41,212][335595] Updated weights for policy 0, policy_version 10080 (0.0005) |
| [2023-07-17 05:22:43,849][335310] Fps is (10 sec: 9011.2, 60 sec: 9489.1, 300 sec: 9483.3). Total num frames: 5181440. Throughput: 0: 9417.0. Samples: 5172596. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:22:43,850][335310] Avg episode reward: [(0, '520.668')] |
| [2023-07-17 05:22:43,870][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000010128_5185536.pth... |
| [2023-07-17 05:22:43,872][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000009576_4902912.pth |
| [2023-07-17 05:22:43,872][335551] Saving new best policy, reward=520.668! |
| [2023-07-17 05:22:45,600][335595] Updated weights for policy 0, policy_version 10160 (0.0005) |
| [2023-07-17 05:22:48,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9483.3). Total num frames: 5230592. Throughput: 0: 9424.9. Samples: 5228824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:22:48,850][335310] Avg episode reward: [(0, '505.581')] |
| [2023-07-17 05:22:49,858][335595] Updated weights for policy 0, policy_version 10240 (0.0005) |
| [2023-07-17 05:22:53,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9483.3). Total num frames: 5279744. Throughput: 0: 9452.8. Samples: 5258668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:22:53,850][335310] Avg episode reward: [(0, '492.824')] |
| [2023-07-17 05:22:53,984][335595] Updated weights for policy 0, policy_version 10320 (0.0004) |
| [2023-07-17 05:22:58,051][335595] Updated weights for policy 0, policy_version 10400 (0.0004) |
| [2023-07-17 05:22:58,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 5328896. Throughput: 0: 9521.8. Samples: 5318680. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 05:22:58,850][335310] Avg episode reward: [(0, '510.486')] |
| [2023-07-17 05:22:58,871][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000010416_5332992.pth... |
| [2023-07-17 05:22:58,872][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000009848_5042176.pth |
| [2023-07-17 05:23:02,221][335595] Updated weights for policy 0, policy_version 10480 (0.0005) |
| [2023-07-17 05:23:03,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 5378048. Throughput: 0: 9569.2. Samples: 5377852. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 05:23:03,850][335310] Avg episode reward: [(0, '505.573')] |
| [2023-07-17 05:23:06,369][335595] Updated weights for policy 0, policy_version 10560 (0.0004) |
| [2023-07-17 05:23:08,849][335310] Fps is (10 sec: 10240.0, 60 sec: 9625.6, 300 sec: 9511.1). Total num frames: 5431296. Throughput: 0: 9597.1. Samples: 5407148. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:23:08,850][335310] Avg episode reward: [(0, '488.544')] |
| [2023-07-17 05:23:10,489][335595] Updated weights for policy 0, policy_version 10640 (0.0004) |
| [2023-07-17 05:23:13,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9511.0). Total num frames: 5476352. Throughput: 0: 9667.0. Samples: 5466428. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 05:23:13,850][335310] Avg episode reward: [(0, '506.999')] |
| [2023-07-17 05:23:13,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000010696_5476352.pth... |
| [2023-07-17 05:23:13,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000010128_5185536.pth |
| [2023-07-17 05:23:14,850][335595] Updated weights for policy 0, policy_version 10720 (0.0005) |
| [2023-07-17 05:23:18,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9524.9). Total num frames: 5525504. Throughput: 0: 9631.8. Samples: 5521992. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 05:23:18,850][335310] Avg episode reward: [(0, '510.863')] |
| [2023-07-17 05:23:19,213][335595] Updated weights for policy 0, policy_version 10800 (0.0005) |
| [2023-07-17 05:23:23,536][335595] Updated weights for policy 0, policy_version 10880 (0.0005) |
| [2023-07-17 05:23:23,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9511.1). Total num frames: 5570560. Throughput: 0: 9644.5. Samples: 5550492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:23:23,850][335310] Avg episode reward: [(0, '515.538')] |
| [2023-07-17 05:23:27,932][335595] Updated weights for policy 0, policy_version 10960 (0.0006) |
| [2023-07-17 05:23:28,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9511.1). Total num frames: 5619712. Throughput: 0: 9662.8. Samples: 5607424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:23:28,850][335310] Avg episode reward: [(0, '507.357')] |
| [2023-07-17 05:23:28,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000010976_5619712.pth... |
| [2023-07-17 05:23:28,855][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000010416_5332992.pth |
| [2023-07-17 05:23:32,263][335595] Updated weights for policy 0, policy_version 11040 (0.0005) |
| [2023-07-17 05:23:33,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9511.1). Total num frames: 5664768. Throughput: 0: 9675.2. Samples: 5664208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:23:33,850][335310] Avg episode reward: [(0, '493.517')] |
| [2023-07-17 05:23:36,385][335595] Updated weights for policy 0, policy_version 11120 (0.0005) |
| [2023-07-17 05:23:38,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9511.1). Total num frames: 5713920. Throughput: 0: 9666.8. Samples: 5693672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:23:38,849][335310] Avg episode reward: [(0, '498.549')] |
| [2023-07-17 05:23:40,517][335595] Updated weights for policy 0, policy_version 11200 (0.0005) |
| [2023-07-17 05:23:43,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9524.9). Total num frames: 5763072. Throughput: 0: 9673.7. Samples: 5753996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:23:43,849][335310] Avg episode reward: [(0, '498.831')] |
| [2023-07-17 05:23:43,871][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000011264_5767168.pth... |
| [2023-07-17 05:23:43,874][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000010696_5476352.pth |
| [2023-07-17 05:23:44,756][335595] Updated weights for policy 0, policy_version 11280 (0.0005) |
| [2023-07-17 05:23:48,849][335310] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9538.8). Total num frames: 5812224. Throughput: 0: 9578.1. Samples: 5808864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:23:48,850][335310] Avg episode reward: [(0, '492.381')] |
| [2023-07-17 05:23:49,206][335595] Updated weights for policy 0, policy_version 11360 (0.0004) |
| [2023-07-17 05:23:53,544][335595] Updated weights for policy 0, policy_version 11440 (0.0004) |
| [2023-07-17 05:23:53,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9524.9). Total num frames: 5857280. Throughput: 0: 9549.3. Samples: 5836864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:23:53,850][335310] Avg episode reward: [(0, '498.990')] |
| [2023-07-17 05:23:57,973][335595] Updated weights for policy 0, policy_version 11520 (0.0005) |
| [2023-07-17 05:23:58,849][335310] Fps is (10 sec: 9011.2, 60 sec: 9557.4, 300 sec: 9524.9). Total num frames: 5902336. Throughput: 0: 9498.2. Samples: 5893848. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:23:58,850][335310] Avg episode reward: [(0, '482.074')] |
| [2023-07-17 05:23:58,875][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000011536_5906432.pth... |
| [2023-07-17 05:23:58,877][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000010976_5619712.pth |
| [2023-07-17 05:24:02,326][335595] Updated weights for policy 0, policy_version 11600 (0.0005) |
| [2023-07-17 05:24:03,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9511.0). Total num frames: 5951488. Throughput: 0: 9506.1. Samples: 5949768. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:24:03,850][335310] Avg episode reward: [(0, '508.461')] |
| [2023-07-17 05:24:06,788][335595] Updated weights for policy 0, policy_version 11680 (0.0006) |
| [2023-07-17 05:24:08,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 5996544. Throughput: 0: 9473.3. Samples: 5976792. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 05:24:08,850][335310] Avg episode reward: [(0, '510.483')] |
| [2023-07-17 05:24:11,288][335595] Updated weights for policy 0, policy_version 11760 (0.0005) |
| [2023-07-17 05:24:13,849][335310] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 6041600. Throughput: 0: 9430.4. Samples: 6031792. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 05:24:13,850][335310] Avg episode reward: [(0, '500.205')] |
| [2023-07-17 05:24:13,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000011800_6041600.pth... |
| [2023-07-17 05:24:13,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000011264_5767168.pth |
| [2023-07-17 05:24:15,643][335595] Updated weights for policy 0, policy_version 11840 (0.0005) |
| [2023-07-17 05:24:18,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 6090752. Throughput: 0: 9432.5. Samples: 6088672. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:24:18,850][335310] Avg episode reward: [(0, '502.237')] |
| [2023-07-17 05:24:19,967][335595] Updated weights for policy 0, policy_version 11920 (0.0006) |
| [2023-07-17 05:24:23,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 6135808. Throughput: 0: 9400.3. Samples: 6116688. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 05:24:23,850][335310] Avg episode reward: [(0, '496.052')] |
| [2023-07-17 05:24:24,326][335595] Updated weights for policy 0, policy_version 12000 (0.0005) |
| [2023-07-17 05:24:28,764][335595] Updated weights for policy 0, policy_version 12080 (0.0006) |
| [2023-07-17 05:24:28,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9511.1). Total num frames: 6184960. Throughput: 0: 9305.3. Samples: 6172736. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 05:24:28,850][335310] Avg episode reward: [(0, '494.166')] |
| [2023-07-17 05:24:28,854][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000012080_6184960.pth... |
| [2023-07-17 05:24:28,857][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000011536_5906432.pth |
| [2023-07-17 05:24:33,069][335595] Updated weights for policy 0, policy_version 12160 (0.0005) |
| [2023-07-17 05:24:33,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 6230016. Throughput: 0: 9353.1. Samples: 6229752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:24:33,850][335310] Avg episode reward: [(0, '499.734')] |
| [2023-07-17 05:24:37,432][335595] Updated weights for policy 0, policy_version 12240 (0.0006) |
| [2023-07-17 05:24:38,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9511.0). Total num frames: 6279168. Throughput: 0: 9355.1. Samples: 6257844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:24:38,850][335310] Avg episode reward: [(0, '501.822')] |
| [2023-07-17 05:24:41,722][335595] Updated weights for policy 0, policy_version 12320 (0.0005) |
| [2023-07-17 05:24:43,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9497.2). Total num frames: 6324224. Throughput: 0: 9350.4. Samples: 6314616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:24:43,850][335310] Avg episode reward: [(0, '502.583')] |
| [2023-07-17 05:24:43,869][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000012360_6328320.pth... |
| [2023-07-17 05:24:43,871][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000011800_6041600.pth |
| [2023-07-17 05:24:46,032][335595] Updated weights for policy 0, policy_version 12400 (0.0005) |
| [2023-07-17 05:24:48,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9497.2). Total num frames: 6373376. Throughput: 0: 9379.0. Samples: 6371824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:24:48,850][335310] Avg episode reward: [(0, '503.082')] |
| [2023-07-17 05:24:50,217][335595] Updated weights for policy 0, policy_version 12480 (0.0004) |
| [2023-07-17 05:24:53,849][335310] Fps is (10 sec: 9830.5, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 6422528. Throughput: 0: 9445.3. Samples: 6401832. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 05:24:53,850][335310] Avg episode reward: [(0, '508.680')] |
| [2023-07-17 05:24:54,313][335595] Updated weights for policy 0, policy_version 12560 (0.0004) |
| [2023-07-17 05:24:58,369][335595] Updated weights for policy 0, policy_version 12640 (0.0004) |
| [2023-07-17 05:24:58,849][335310] Fps is (10 sec: 10239.9, 60 sec: 9557.3, 300 sec: 9524.9). Total num frames: 6475776. Throughput: 0: 9567.1. Samples: 6462312. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 05:24:58,850][335310] Avg episode reward: [(0, '501.921')] |
| [2023-07-17 05:24:58,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000012648_6475776.pth... |
| [2023-07-17 05:24:58,855][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000012080_6184960.pth |
| [2023-07-17 05:25:02,645][335595] Updated weights for policy 0, policy_version 12720 (0.0005) |
| [2023-07-17 05:25:03,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9511.1). Total num frames: 6520832. Throughput: 0: 9591.6. Samples: 6520292. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 05:25:03,850][335310] Avg episode reward: [(0, '511.297')] |
| [2023-07-17 05:25:06,822][335595] Updated weights for policy 0, policy_version 12800 (0.0005) |
| [2023-07-17 05:25:08,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9524.9). Total num frames: 6569984. Throughput: 0: 9619.7. Samples: 6549576. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 05:25:08,850][335310] Avg episode reward: [(0, '513.337')] |
| [2023-07-17 05:25:11,096][335595] Updated weights for policy 0, policy_version 12880 (0.0005) |
| [2023-07-17 05:25:13,849][335310] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9538.8). Total num frames: 6619136. Throughput: 0: 9647.1. Samples: 6606856. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 05:25:13,850][335310] Avg episode reward: [(0, '505.223')] |
| [2023-07-17 05:25:13,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000012928_6619136.pth... |
| [2023-07-17 05:25:13,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000012360_6328320.pth |
| [2023-07-17 05:25:15,532][335595] Updated weights for policy 0, policy_version 12960 (0.0005) |
| [2023-07-17 05:25:18,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9524.9). Total num frames: 6664192. Throughput: 0: 9640.3. Samples: 6663564. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 05:25:18,850][335310] Avg episode reward: [(0, '514.319')] |
| [2023-07-17 05:25:19,797][335595] Updated weights for policy 0, policy_version 13040 (0.0005) |
| [2023-07-17 05:25:23,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9538.8). Total num frames: 6713344. Throughput: 0: 9645.1. Samples: 6691872. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 05:25:23,850][335310] Avg episode reward: [(0, '501.583')] |
| [2023-07-17 05:25:24,125][335595] Updated weights for policy 0, policy_version 13120 (0.0005) |
| [2023-07-17 05:25:28,556][335595] Updated weights for policy 0, policy_version 13200 (0.0005) |
| [2023-07-17 05:25:28,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9524.9). Total num frames: 6758400. Throughput: 0: 9624.2. Samples: 6747704. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 05:25:28,850][335310] Avg episode reward: [(0, '514.339')] |
| [2023-07-17 05:25:28,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000013200_6758400.pth... |
| [2023-07-17 05:25:28,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000012648_6475776.pth |
| [2023-07-17 05:25:32,952][335595] Updated weights for policy 0, policy_version 13280 (0.0005) |
| [2023-07-17 05:25:33,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9538.8). Total num frames: 6807552. Throughput: 0: 9593.4. Samples: 6803528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:25:33,850][335310] Avg episode reward: [(0, '514.815')] |
| [2023-07-17 05:25:37,405][335595] Updated weights for policy 0, policy_version 13360 (0.0005) |
| [2023-07-17 05:25:38,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9538.8). Total num frames: 6852608. Throughput: 0: 9552.8. Samples: 6831708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:25:38,850][335310] Avg episode reward: [(0, '515.150')] |
| [2023-07-17 05:25:41,755][335595] Updated weights for policy 0, policy_version 13440 (0.0005) |
| [2023-07-17 05:25:43,849][335310] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9538.8). Total num frames: 6897664. Throughput: 0: 9451.2. Samples: 6887616. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 05:25:43,850][335310] Avg episode reward: [(0, '505.640')] |
| [2023-07-17 05:25:43,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000013472_6897664.pth... |
| [2023-07-17 05:25:43,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000012928_6619136.pth |
| [2023-07-17 05:25:46,107][335595] Updated weights for policy 0, policy_version 13520 (0.0005) |
| [2023-07-17 05:25:48,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9538.8). Total num frames: 6946816. Throughput: 0: 9405.9. Samples: 6943556. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 05:25:48,850][335310] Avg episode reward: [(0, '510.619')] |
| [2023-07-17 05:25:50,512][335595] Updated weights for policy 0, policy_version 13600 (0.0005) |
| [2023-07-17 05:25:53,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9538.8). Total num frames: 6991872. Throughput: 0: 9385.2. Samples: 6971912. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 05:25:53,850][335310] Avg episode reward: [(0, '510.610')] |
| [2023-07-17 05:25:54,857][335595] Updated weights for policy 0, policy_version 13680 (0.0005) |
| [2023-07-17 05:25:58,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9538.8). Total num frames: 7041024. Throughput: 0: 9375.5. Samples: 7028752. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 05:25:58,850][335310] Avg episode reward: [(0, '504.306')] |
| [2023-07-17 05:25:58,854][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000013752_7041024.pth... |
| [2023-07-17 05:25:58,855][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000013200_6758400.pth |
| [2023-07-17 05:25:59,209][335595] Updated weights for policy 0, policy_version 13760 (0.0005) |
| [2023-07-17 05:26:03,549][335595] Updated weights for policy 0, policy_version 13840 (0.0006) |
| [2023-07-17 05:26:03,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9538.8). Total num frames: 7086080. Throughput: 0: 9375.8. Samples: 7085476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:26:03,850][335310] Avg episode reward: [(0, '517.125')] |
| [2023-07-17 05:26:07,939][335595] Updated weights for policy 0, policy_version 13920 (0.0005) |
| [2023-07-17 05:26:08,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9552.7). Total num frames: 7135232. Throughput: 0: 9363.5. Samples: 7113228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:26:08,850][335310] Avg episode reward: [(0, '523.437')] |
| [2023-07-17 05:26:08,851][335551] Saving new best policy, reward=523.437! |
| [2023-07-17 05:26:12,273][335595] Updated weights for policy 0, policy_version 14000 (0.0005) |
| [2023-07-17 05:26:13,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9538.8). Total num frames: 7180288. Throughput: 0: 9372.0. Samples: 7169444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:26:13,850][335310] Avg episode reward: [(0, '524.295')] |
| [2023-07-17 05:26:13,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000014024_7180288.pth... |
| [2023-07-17 05:26:13,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000013472_6897664.pth |
| [2023-07-17 05:26:13,857][335551] Saving new best policy, reward=524.295! |
| [2023-07-17 05:26:16,679][335595] Updated weights for policy 0, policy_version 14080 (0.0005) |
| [2023-07-17 05:26:18,849][335310] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9511.1). Total num frames: 7225344. Throughput: 0: 9373.9. Samples: 7225352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:26:18,850][335310] Avg episode reward: [(0, '524.164')] |
| [2023-07-17 05:26:21,096][335595] Updated weights for policy 0, policy_version 14160 (0.0005) |
| [2023-07-17 05:26:23,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9511.1). Total num frames: 7274496. Throughput: 0: 9373.5. Samples: 7253516. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 05:26:23,850][335310] Avg episode reward: [(0, '522.501')] |
| [2023-07-17 05:26:25,343][335595] Updated weights for policy 0, policy_version 14240 (0.0005) |
| [2023-07-17 05:26:28,849][335310] Fps is (10 sec: 9830.3, 60 sec: 9420.8, 300 sec: 9524.9). Total num frames: 7323648. Throughput: 0: 9416.5. Samples: 7311360. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 05:26:28,850][335310] Avg episode reward: [(0, '519.627')] |
| [2023-07-17 05:26:28,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000014304_7323648.pth... |
| [2023-07-17 05:26:28,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000013752_7041024.pth |
| [2023-07-17 05:26:29,647][335595] Updated weights for policy 0, policy_version 14320 (0.0005) |
| [2023-07-17 05:26:33,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9511.1). Total num frames: 7368704. Throughput: 0: 9421.7. Samples: 7367532. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 05:26:33,850][335310] Avg episode reward: [(0, '525.636')] |
| [2023-07-17 05:26:33,851][335551] Saving new best policy, reward=525.636! |
| [2023-07-17 05:26:34,010][335595] Updated weights for policy 0, policy_version 14400 (0.0005) |
| [2023-07-17 05:26:38,365][335595] Updated weights for policy 0, policy_version 14480 (0.0005) |
| [2023-07-17 05:26:38,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9511.1). Total num frames: 7417856. Throughput: 0: 9420.0. Samples: 7395812. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) |
| [2023-07-17 05:26:38,850][335310] Avg episode reward: [(0, '516.731')] |
| [2023-07-17 05:26:42,794][335595] Updated weights for policy 0, policy_version 14560 (0.0005) |
| [2023-07-17 05:26:43,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 7462912. Throughput: 0: 9392.3. Samples: 7451404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:26:43,850][335310] Avg episode reward: [(0, '499.756')] |
| [2023-07-17 05:26:43,854][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000014576_7462912.pth... |
| [2023-07-17 05:26:43,857][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000014024_7180288.pth |
| [2023-07-17 05:26:47,157][335595] Updated weights for policy 0, policy_version 14640 (0.0005) |
| [2023-07-17 05:26:48,849][335310] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9483.3). Total num frames: 7507968. Throughput: 0: 9388.3. Samples: 7507948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:26:48,850][335310] Avg episode reward: [(0, '510.533')] |
| [2023-07-17 05:26:51,510][335595] Updated weights for policy 0, policy_version 14720 (0.0005) |
| [2023-07-17 05:26:53,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 7557120. Throughput: 0: 9398.7. Samples: 7536172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:26:53,850][335310] Avg episode reward: [(0, '527.917')] |
| [2023-07-17 05:26:53,851][335551] Saving new best policy, reward=527.917! |
| [2023-07-17 05:26:55,929][335595] Updated weights for policy 0, policy_version 14800 (0.0005) |
| [2023-07-17 05:26:58,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9469.4). Total num frames: 7602176. Throughput: 0: 9391.1. Samples: 7592044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:26:58,850][335310] Avg episode reward: [(0, '519.093')] |
| [2023-07-17 05:26:58,854][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000014848_7602176.pth... |
| [2023-07-17 05:26:58,857][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000014304_7323648.pth |
| [2023-07-17 05:27:00,321][335595] Updated weights for policy 0, policy_version 14880 (0.0005) |
| [2023-07-17 05:27:03,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 7651328. Throughput: 0: 9376.5. Samples: 7647296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:27:03,850][335310] Avg episode reward: [(0, '529.246')] |
| [2023-07-17 05:27:03,851][335551] Saving new best policy, reward=529.246! |
| [2023-07-17 05:27:04,704][335595] Updated weights for policy 0, policy_version 14960 (0.0005) |
| [2023-07-17 05:27:08,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9469.4). Total num frames: 7696384. Throughput: 0: 9386.4. Samples: 7675904. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 05:27:08,850][335310] Avg episode reward: [(0, '526.036')] |
| [2023-07-17 05:27:09,116][335595] Updated weights for policy 0, policy_version 15040 (0.0005) |
| [2023-07-17 05:27:13,480][335595] Updated weights for policy 0, policy_version 15120 (0.0005) |
| [2023-07-17 05:27:13,849][335310] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9469.4). Total num frames: 7741440. Throughput: 0: 9343.7. Samples: 7731828. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) |
| [2023-07-17 05:27:13,850][335310] Avg episode reward: [(0, '524.597')] |
| [2023-07-17 05:27:13,854][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000015120_7741440.pth... |
| [2023-07-17 05:27:13,857][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000014576_7462912.pth |
| [2023-07-17 05:27:17,877][335595] Updated weights for policy 0, policy_version 15200 (0.0005) |
| [2023-07-17 05:27:18,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9469.4). Total num frames: 7790592. Throughput: 0: 9340.4. Samples: 7787852. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:27:18,851][335310] Avg episode reward: [(0, '520.917')] |
| [2023-07-17 05:27:22,001][335595] Updated weights for policy 0, policy_version 15280 (0.0004) |
| [2023-07-17 05:27:23,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 7839744. Throughput: 0: 9376.8. Samples: 7817768. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:27:23,850][335310] Avg episode reward: [(0, '508.376')] |
| [2023-07-17 05:27:26,223][335595] Updated weights for policy 0, policy_version 15360 (0.0005) |
| [2023-07-17 05:27:28,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 7888896. Throughput: 0: 9449.2. Samples: 7876616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:27:28,850][335310] Avg episode reward: [(0, '518.317')] |
| [2023-07-17 05:27:28,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000015408_7888896.pth... |
| [2023-07-17 05:27:28,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000014848_7602176.pth |
| [2023-07-17 05:27:30,367][335595] Updated weights for policy 0, policy_version 15440 (0.0004) |
| [2023-07-17 05:27:33,849][335310] Fps is (10 sec: 9830.5, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 7938048. Throughput: 0: 9482.5. Samples: 7934660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:27:33,850][335310] Avg episode reward: [(0, '519.225')] |
| [2023-07-17 05:27:34,568][335595] Updated weights for policy 0, policy_version 15520 (0.0005) |
| [2023-07-17 05:27:38,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 7983104. Throughput: 0: 9504.6. Samples: 7963880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:27:38,850][335310] Avg episode reward: [(0, '515.295')] |
| [2023-07-17 05:27:38,896][335595] Updated weights for policy 0, policy_version 15600 (0.0006) |
| [2023-07-17 05:27:43,204][335595] Updated weights for policy 0, policy_version 15680 (0.0005) |
| [2023-07-17 05:27:43,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 8032256. Throughput: 0: 9527.1. Samples: 8020764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:27:43,850][335310] Avg episode reward: [(0, '516.493')] |
| [2023-07-17 05:27:43,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000015688_8032256.pth... |
| [2023-07-17 05:27:43,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000015120_7741440.pth |
| [2023-07-17 05:27:47,563][335595] Updated weights for policy 0, policy_version 15760 (0.0005) |
| [2023-07-17 05:27:48,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9483.3). Total num frames: 8077312. Throughput: 0: 9557.3. Samples: 8077376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:27:48,850][335310] Avg episode reward: [(0, '514.697')] |
| [2023-07-17 05:27:51,941][335595] Updated weights for policy 0, policy_version 15840 (0.0005) |
| [2023-07-17 05:27:53,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9483.3). Total num frames: 8126464. Throughput: 0: 9552.8. Samples: 8105780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:27:53,850][335310] Avg episode reward: [(0, '512.469')] |
| [2023-07-17 05:27:56,312][335595] Updated weights for policy 0, policy_version 15920 (0.0005) |
| [2023-07-17 05:27:58,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9469.4). Total num frames: 8171520. Throughput: 0: 9550.8. Samples: 8161612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:27:58,850][335310] Avg episode reward: [(0, '519.215')] |
| [2023-07-17 05:27:58,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000015960_8171520.pth... |
| [2023-07-17 05:27:58,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000015408_7888896.pth |
| [2023-07-17 05:28:00,652][335595] Updated weights for policy 0, policy_version 16000 (0.0005) |
| [2023-07-17 05:28:03,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9455.5). Total num frames: 8220672. Throughput: 0: 9561.2. Samples: 8218104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:28:03,850][335310] Avg episode reward: [(0, '518.064')] |
| [2023-07-17 05:28:04,999][335595] Updated weights for policy 0, policy_version 16080 (0.0005) |
| [2023-07-17 05:28:08,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9455.5). Total num frames: 8265728. Throughput: 0: 9529.3. Samples: 8246584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:28:08,850][335310] Avg episode reward: [(0, '500.839')] |
| [2023-07-17 05:28:09,332][335595] Updated weights for policy 0, policy_version 16160 (0.0005) |
| [2023-07-17 05:28:13,616][335595] Updated weights for policy 0, policy_version 16240 (0.0005) |
| [2023-07-17 05:28:13,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9455.5). Total num frames: 8314880. Throughput: 0: 9486.2. Samples: 8303496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:28:13,850][335310] Avg episode reward: [(0, '516.987')] |
| [2023-07-17 05:28:13,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000016240_8314880.pth... |
| [2023-07-17 05:28:13,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000015688_8032256.pth |
| [2023-07-17 05:28:18,010][335595] Updated weights for policy 0, policy_version 16320 (0.0005) |
| [2023-07-17 05:28:18,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9455.5). Total num frames: 8359936. Throughput: 0: 9452.0. Samples: 8360000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:28:18,850][335310] Avg episode reward: [(0, '511.501')] |
| [2023-07-17 05:28:22,303][335595] Updated weights for policy 0, policy_version 16400 (0.0005) |
| [2023-07-17 05:28:23,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9455.5). Total num frames: 8409088. Throughput: 0: 9439.8. Samples: 8388672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:28:23,850][335310] Avg episode reward: [(0, '516.452')] |
| [2023-07-17 05:28:26,655][335595] Updated weights for policy 0, policy_version 16480 (0.0005) |
| [2023-07-17 05:28:28,849][335310] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9469.4). Total num frames: 8458240. Throughput: 0: 9436.9. Samples: 8445428. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:28:28,850][335310] Avg episode reward: [(0, '511.319')] |
| [2023-07-17 05:28:28,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000016520_8458240.pth... |
| [2023-07-17 05:28:28,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000015960_8171520.pth |
| [2023-07-17 05:28:30,947][335595] Updated weights for policy 0, policy_version 16560 (0.0005) |
| [2023-07-17 05:28:33,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 8503296. Throughput: 0: 9441.3. Samples: 8502236. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) |
| [2023-07-17 05:28:33,850][335310] Avg episode reward: [(0, '509.237')] |
| [2023-07-17 05:28:35,299][335595] Updated weights for policy 0, policy_version 16640 (0.0005) |
| [2023-07-17 05:28:38,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9455.5). Total num frames: 8552448. Throughput: 0: 9437.9. Samples: 8530484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:28:38,850][335310] Avg episode reward: [(0, '496.186')] |
| [2023-07-17 05:28:39,669][335595] Updated weights for policy 0, policy_version 16720 (0.0005) |
| [2023-07-17 05:28:43,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9441.6). Total num frames: 8597504. Throughput: 0: 9444.7. Samples: 8586624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:28:43,850][335310] Avg episode reward: [(0, '509.345')] |
| [2023-07-17 05:28:43,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000016792_8597504.pth... |
| [2023-07-17 05:28:43,855][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000016240_8314880.pth |
| [2023-07-17 05:28:43,991][335595] Updated weights for policy 0, policy_version 16800 (0.0005) |
| [2023-07-17 05:28:48,352][335595] Updated weights for policy 0, policy_version 16880 (0.0005) |
| [2023-07-17 05:28:48,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9455.5). Total num frames: 8646656. Throughput: 0: 9444.1. Samples: 8643088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:28:48,850][335310] Avg episode reward: [(0, '491.148')] |
| [2023-07-17 05:28:52,664][335595] Updated weights for policy 0, policy_version 16960 (0.0005) |
| [2023-07-17 05:28:53,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 8691712. Throughput: 0: 9454.8. Samples: 8672052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:28:53,850][335310] Avg episode reward: [(0, '504.252')] |
| [2023-07-17 05:28:57,049][335595] Updated weights for policy 0, policy_version 17040 (0.0005) |
| [2023-07-17 05:28:58,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9455.5). Total num frames: 8740864. Throughput: 0: 9442.9. Samples: 8728428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:28:58,850][335310] Avg episode reward: [(0, '506.716')] |
| [2023-07-17 05:28:58,854][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000017072_8740864.pth... |
| [2023-07-17 05:28:58,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000016520_8458240.pth |
| [2023-07-17 05:29:01,451][335595] Updated weights for policy 0, policy_version 17120 (0.0005) |
| [2023-07-17 05:29:03,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 8785920. Throughput: 0: 9418.2. Samples: 8783820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:29:03,850][335310] Avg episode reward: [(0, '494.401')] |
| [2023-07-17 05:29:05,838][335595] Updated weights for policy 0, policy_version 17200 (0.0005) |
| [2023-07-17 05:29:08,849][335310] Fps is (10 sec: 9011.3, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 8830976. Throughput: 0: 9408.4. Samples: 8812048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:29:08,850][335310] Avg episode reward: [(0, '499.514')] |
| [2023-07-17 05:29:10,172][335595] Updated weights for policy 0, policy_version 17280 (0.0005) |
| [2023-07-17 05:29:13,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 8880128. Throughput: 0: 9394.8. Samples: 8868192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:29:13,850][335310] Avg episode reward: [(0, '510.081')] |
| [2023-07-17 05:29:13,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000017344_8880128.pth... |
| [2023-07-17 05:29:13,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000016792_8597504.pth |
| [2023-07-17 05:29:14,577][335595] Updated weights for policy 0, policy_version 17360 (0.0005) |
| [2023-07-17 05:29:18,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 8925184. Throughput: 0: 9384.8. Samples: 8924552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:29:18,850][335310] Avg episode reward: [(0, '511.483')] |
| [2023-07-17 05:29:18,978][335595] Updated weights for policy 0, policy_version 17440 (0.0005) |
| [2023-07-17 05:29:23,357][335595] Updated weights for policy 0, policy_version 17520 (0.0005) |
| [2023-07-17 05:29:23,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 8974336. Throughput: 0: 9371.7. Samples: 8952212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:29:23,850][335310] Avg episode reward: [(0, '500.270')] |
| [2023-07-17 05:29:27,699][335595] Updated weights for policy 0, policy_version 17600 (0.0005) |
| [2023-07-17 05:29:28,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9455.5). Total num frames: 9019392. Throughput: 0: 9381.0. Samples: 9008768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:29:28,850][335310] Avg episode reward: [(0, '502.726')] |
| [2023-07-17 05:29:28,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000017616_9019392.pth... |
| [2023-07-17 05:29:28,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000017072_8740864.pth |
| [2023-07-17 05:29:32,104][335595] Updated weights for policy 0, policy_version 17680 (0.0005) |
| [2023-07-17 05:29:33,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 9068544. Throughput: 0: 9365.0. Samples: 9064512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:29:33,850][335310] Avg episode reward: [(0, '509.201')] |
| [2023-07-17 05:29:36,431][335595] Updated weights for policy 0, policy_version 17760 (0.0005) |
| [2023-07-17 05:29:38,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9455.5). Total num frames: 9113600. Throughput: 0: 9358.5. Samples: 9093184. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 05:29:38,850][335310] Avg episode reward: [(0, '510.394')] |
| [2023-07-17 05:29:40,821][335595] Updated weights for policy 0, policy_version 17840 (0.0005) |
| [2023-07-17 05:29:43,849][335310] Fps is (10 sec: 9011.1, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 9158656. Throughput: 0: 9362.1. Samples: 9149724. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 05:29:43,850][335310] Avg episode reward: [(0, '516.211')] |
| [2023-07-17 05:29:43,858][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000017896_9162752.pth... |
| [2023-07-17 05:29:43,860][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000017344_8880128.pth |
| [2023-07-17 05:29:45,196][335595] Updated weights for policy 0, policy_version 17920 (0.0005) |
| [2023-07-17 05:29:48,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 9207808. Throughput: 0: 9373.7. Samples: 9205636. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 05:29:48,850][335310] Avg episode reward: [(0, '504.566')] |
| [2023-07-17 05:29:49,525][335595] Updated weights for policy 0, policy_version 18000 (0.0005) |
| [2023-07-17 05:29:53,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 9252864. Throughput: 0: 9359.4. Samples: 9233220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:29:53,850][335310] Avg episode reward: [(0, '503.212')] |
| [2023-07-17 05:29:54,049][335595] Updated weights for policy 0, policy_version 18080 (0.0005) |
| [2023-07-17 05:29:58,455][335595] Updated weights for policy 0, policy_version 18160 (0.0005) |
| [2023-07-17 05:29:58,849][335310] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9413.9). Total num frames: 9297920. Throughput: 0: 9346.2. Samples: 9288772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:29:58,850][335310] Avg episode reward: [(0, '501.128')] |
| [2023-07-17 05:29:58,882][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000018168_9302016.pth... |
| [2023-07-17 05:29:58,883][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000017616_9019392.pth |
| [2023-07-17 05:30:02,725][335595] Updated weights for policy 0, policy_version 18240 (0.0005) |
| [2023-07-17 05:30:03,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 9347072. Throughput: 0: 9360.7. Samples: 9345784. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 05:30:03,850][335310] Avg episode reward: [(0, '505.102')] |
| [2023-07-17 05:30:07,083][335595] Updated weights for policy 0, policy_version 18320 (0.0005) |
| [2023-07-17 05:30:08,849][335310] Fps is (10 sec: 9830.5, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 9396224. Throughput: 0: 9372.5. Samples: 9373972. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 05:30:08,850][335310] Avg episode reward: [(0, '513.228')] |
| [2023-07-17 05:30:11,459][335595] Updated weights for policy 0, policy_version 18400 (0.0005) |
| [2023-07-17 05:30:13,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 9441280. Throughput: 0: 9350.4. Samples: 9429536. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) |
| [2023-07-17 05:30:13,850][335310] Avg episode reward: [(0, '514.487')] |
| [2023-07-17 05:30:13,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000018440_9441280.pth... |
| [2023-07-17 05:30:13,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000017896_9162752.pth |
| [2023-07-17 05:30:15,907][335595] Updated weights for policy 0, policy_version 18480 (0.0005) |
| [2023-07-17 05:30:18,849][335310] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9400.0). Total num frames: 9486336. Throughput: 0: 9353.4. Samples: 9485416. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 05:30:18,850][335310] Avg episode reward: [(0, '513.865')] |
| [2023-07-17 05:30:20,329][335595] Updated weights for policy 0, policy_version 18560 (0.0005) |
| [2023-07-17 05:30:23,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 9535488. Throughput: 0: 9337.4. Samples: 9513368. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 05:30:23,850][335310] Avg episode reward: [(0, '515.949')] |
| [2023-07-17 05:30:24,535][335595] Updated weights for policy 0, policy_version 18640 (0.0005) |
| [2023-07-17 05:30:28,849][335310] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9400.0). Total num frames: 9580544. Throughput: 0: 9358.0. Samples: 9570832. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) |
| [2023-07-17 05:30:28,850][335310] Avg episode reward: [(0, '516.135')] |
| [2023-07-17 05:30:28,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000018712_9580544.pth... |
| [2023-07-17 05:30:28,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000018168_9302016.pth |
| [2023-07-17 05:30:28,942][335595] Updated weights for policy 0, policy_version 18720 (0.0005) |
| [2023-07-17 05:30:33,364][335595] Updated weights for policy 0, policy_version 18800 (0.0005) |
| [2023-07-17 05:30:33,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 9629696. Throughput: 0: 9343.8. Samples: 9626108. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 05:30:33,850][335310] Avg episode reward: [(0, '516.042')] |
| [2023-07-17 05:30:37,829][335595] Updated weights for policy 0, policy_version 18880 (0.0005) |
| [2023-07-17 05:30:38,849][335310] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 9674752. Throughput: 0: 9353.4. Samples: 9654124. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) |
| [2023-07-17 05:30:38,850][335310] Avg episode reward: [(0, '511.759')] |
| [2023-07-17 05:30:42,356][335595] Updated weights for policy 0, policy_version 18960 (0.0004) |
| [2023-07-17 05:30:43,849][335310] Fps is (10 sec: 9011.1, 60 sec: 9352.5, 300 sec: 9400.0). Total num frames: 9719808. Throughput: 0: 9318.8. Samples: 9708120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:30:43,850][335310] Avg episode reward: [(0, '514.517')] |
| [2023-07-17 05:30:43,853][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000018984_9719808.pth... |
| [2023-07-17 05:30:43,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000018440_9441280.pth |
| [2023-07-17 05:30:46,790][335595] Updated weights for policy 0, policy_version 19040 (0.0005) |
| [2023-07-17 05:30:48,849][335310] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9400.0). Total num frames: 9764864. Throughput: 0: 9299.1. Samples: 9764244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:30:48,850][335310] Avg episode reward: [(0, '520.146')] |
| [2023-07-17 05:30:51,247][335595] Updated weights for policy 0, policy_version 19120 (0.0005) |
| [2023-07-17 05:30:53,849][335310] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9400.0). Total num frames: 9814016. Throughput: 0: 9271.7. Samples: 9791200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:30:53,850][335310] Avg episode reward: [(0, '511.701')] |
| [2023-07-17 05:30:55,472][335595] Updated weights for policy 0, policy_version 19200 (0.0005) |
| [2023-07-17 05:30:58,850][335310] Fps is (10 sec: 9830.1, 60 sec: 9420.8, 300 sec: 9413.8). Total num frames: 9863168. Throughput: 0: 9339.8. Samples: 9849828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:30:58,850][335310] Avg episode reward: [(0, '517.748')] |
| [2023-07-17 05:30:58,854][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000019264_9863168.pth... |
| [2023-07-17 05:30:58,856][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000018712_9580544.pth |
| [2023-07-17 05:30:59,621][335595] Updated weights for policy 0, policy_version 19280 (0.0004) |
| [2023-07-17 05:31:03,849][335310] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 9912320. Throughput: 0: 9397.5. Samples: 9908304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:31:03,849][335595] Updated weights for policy 0, policy_version 19360 (0.0005) |
| [2023-07-17 05:31:03,850][335310] Avg episode reward: [(0, '505.885')] |
| [2023-07-17 05:31:08,073][335595] Updated weights for policy 0, policy_version 19440 (0.0004) |
| [2023-07-17 05:31:08,849][335310] Fps is (10 sec: 9421.1, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 9957376. Throughput: 0: 9435.6. Samples: 9937972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) |
| [2023-07-17 05:31:08,850][335310] Avg episode reward: [(0, '515.946')] |
| [2023-07-17 05:31:12,214][335595] Updated weights for policy 0, policy_version 19520 (0.0004) |
| [2023-07-17 05:31:13,475][335551] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 |
| [2023-07-17 05:31:13,476][335600] Stopping RolloutWorker_w5... |
| [2023-07-17 05:31:13,476][335597] Stopping RolloutWorker_w2... |
| [2023-07-17 05:31:13,476][335599] Stopping RolloutWorker_w3... |
| [2023-07-17 05:31:13,476][335601] Stopping RolloutWorker_w4... |
| [2023-07-17 05:31:13,476][335596] Stopping RolloutWorker_w1... |
| [2023-07-17 05:31:13,476][335600] Loop rollout_proc5_evt_loop terminating... |
| [2023-07-17 05:31:13,476][335665] Stopping RolloutWorker_w6... |
| [2023-07-17 05:31:13,476][335598] Stopping RolloutWorker_w0... |
| [2023-07-17 05:31:13,476][335597] Loop rollout_proc2_evt_loop terminating... |
| [2023-07-17 05:31:13,476][335633] Stopping RolloutWorker_w7... |
| [2023-07-17 05:31:13,476][335599] Loop rollout_proc3_evt_loop terminating... |
| [2023-07-17 05:31:13,476][335596] Loop rollout_proc1_evt_loop terminating... |
| [2023-07-17 05:31:13,476][335601] Loop rollout_proc4_evt_loop terminating... |
| [2023-07-17 05:31:13,476][335665] Loop rollout_proc6_evt_loop terminating... |
| [2023-07-17 05:31:13,476][335598] Loop rollout_proc0_evt_loop terminating... |
| [2023-07-17 05:31:13,476][335633] Loop rollout_proc7_evt_loop terminating... |
| [2023-07-17 05:31:13,476][335310] Component RolloutWorker_w5 stopped! |
| [2023-07-17 05:31:13,476][335310] Component RolloutWorker_w2 stopped! |
| [2023-07-17 05:31:13,476][335551] Stopping Batcher_0... |
| [2023-07-17 05:31:13,477][335310] Component RolloutWorker_w3 stopped! |
| [2023-07-17 05:31:13,477][335551] Loop batcher_evt_loop terminating... |
| [2023-07-17 05:31:13,477][335310] Component RolloutWorker_w6 stopped! |
| [2023-07-17 05:31:13,477][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000019544_10006528.pth... |
| [2023-07-17 05:31:13,477][335310] Component RolloutWorker_w4 stopped! |
| [2023-07-17 05:31:13,478][335310] Component RolloutWorker_w1 stopped! |
| [2023-07-17 05:31:13,478][335310] Component RolloutWorker_w0 stopped! |
| [2023-07-17 05:31:13,478][335310] Component RolloutWorker_w7 stopped! |
| [2023-07-17 05:31:13,478][335310] Component Batcher_0 stopped! |
| [2023-07-17 05:31:13,480][335551] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000018984_9719808.pth |
| [2023-07-17 05:31:13,480][335551] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000019544_10006528.pth... |
| [2023-07-17 05:31:13,482][335551] Stopping LearnerWorker_p0... |
| [2023-07-17 05:31:13,483][335551] Loop learner_proc0_evt_loop terminating... |
| [2023-07-17 05:31:13,483][335310] Component LearnerWorker_p0 stopped! |
| [2023-07-17 05:31:13,539][335595] Weights refcount: 2 0 |
| [2023-07-17 05:31:13,540][335595] Stopping InferenceWorker_p0-w0... |
| [2023-07-17 05:31:13,541][335595] Loop inference_proc0-0_evt_loop terminating... |
| [2023-07-17 05:31:13,541][335310] Component InferenceWorker_p0-w0 stopped! |
| [2023-07-17 05:31:13,541][335310] Waiting for process learner_proc0 to stop... |
| [2023-07-17 05:31:14,071][335310] Waiting for process inference_proc0-0 to join... |
| [2023-07-17 05:31:14,079][335310] Waiting for process rollout_proc0 to join... |
| [2023-07-17 05:31:14,079][335310] Waiting for process rollout_proc1 to join... |
| [2023-07-17 05:31:14,079][335310] Waiting for process rollout_proc2 to join... |
| [2023-07-17 05:31:14,080][335310] Waiting for process rollout_proc3 to join... |
| [2023-07-17 05:31:14,080][335310] Waiting for process rollout_proc4 to join... |
| [2023-07-17 05:31:14,080][335310] Waiting for process rollout_proc5 to join... |
| [2023-07-17 05:31:14,080][335310] Waiting for process rollout_proc6 to join... |
| [2023-07-17 05:31:14,080][335310] Waiting for process rollout_proc7 to join... |
| [2023-07-17 05:31:14,080][335310] Batcher 0 profile tree view: |
| batching: 1.8273, releasing_batches: 1.5976 |
| [2023-07-17 05:31:14,081][335310] InferenceWorker_p0-w0 profile tree view: |
| wait_policy: 0.0051 |
| wait_policy_total: 421.6592 |
| update_model: 12.5927 |
| weight_update: 0.0004 |
| one_step: 0.0007 |
| handle_policy_step: 557.4167 |
| deserialize: 23.7273, stack: 6.0270, obs_to_device_normalize: 101.2733, forward: 275.4663, send_messages: 39.1386 |
| prepare_outputs: 63.5996 |
| to_cpu: 9.7406 |
| [2023-07-17 05:31:14,081][335310] Learner 0 profile tree view: |
| misc: 0.0104, prepare_batch: 10.7012 |
| train: 111.0191 |
| epoch_init: 0.0391, minibatch_init: 1.5246, losses_postprocess: 1.4763, kl_divergence: 0.5123, after_optimizer: 0.7139 |
| calculate_losses: 47.5982 |
| losses_init: 0.0452, forward_head: 18.7123, bptt_initial: 0.1612, bptt: 0.1467, tail: 13.4350, advantages_returns: 1.0194, losses: 12.4351 |
| update: 57.3253 |
| clip: 6.8060 |
| [2023-07-17 05:31:14,081][335310] RolloutWorker_w0 profile tree view: |
| wait_for_trajectories: 0.2771, enqueue_policy_requests: 12.8438, env_step: 784.6904, overhead: 19.8430, complete_rollouts: 0.3447 |
| save_policy_outputs: 38.8160 |
| split_output_tensors: 13.3999 |
| [2023-07-17 05:31:14,081][335310] RolloutWorker_w7 profile tree view: |
| wait_for_trajectories: 0.2697, enqueue_policy_requests: 13.2978, env_step: 780.4665, overhead: 19.6541, complete_rollouts: 0.3294 |
| save_policy_outputs: 39.7095 |
| split_output_tensors: 13.3507 |
| [2023-07-17 05:31:14,081][335310] Loop Runner_EvtLoop terminating... |
| [2023-07-17 05:31:14,081][335310] Runner profile tree view: |
| main_loop: 1063.1574 |
| [2023-07-17 05:31:14,082][335310] Collected {0: 10006528}, FPS: 9412.1 |
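
The final `FPS: 9412.1` figure appears to be the total collected frames divided by the runner's `main_loop` time (10006528 / 1063.1574 ≈ 9412.1). The sketch below checks that arithmetic from the log text. The helper name `summarize_run` and the regular expressions are illustrative assumptions, not part of the training framework, and they assume a single policy with index 0 as in this run.

```python
import re

def summarize_run(log_text):
    """Extract the end-of-run totals from the log and recompute the average FPS.

    Hypothetical helper: the patterns match the line formats seen in this log only.
    """
    # "Collected {0: 10006528}, FPS: 9412.1" -> frames for policy 0 and reported FPS
    collected = re.search(r"Collected \{0: (\d+)\}, FPS: ([\d.]+)", log_text)
    # "main_loop: 1063.1574" -> wall-clock seconds spent in the runner's main loop
    main_loop = re.search(r"main_loop: ([\d.]+)", log_text)
    if collected is None or main_loop is None:
        raise ValueError("summary lines not found in log text")
    frames = int(collected.group(1))
    reported_fps = float(collected.group(2))
    seconds = float(main_loop.group(1))
    return {
        "frames": frames,
        "seconds": seconds,
        "reported_fps": reported_fps,
        "computed_fps": frames / seconds,
    }

sample = """
| [2023-07-17 05:31:14,081][335310] Runner profile tree view: |
| main_loop: 1063.1574 |
| [2023-07-17 05:31:14,082][335310] Collected {0: 10006528}, FPS: 9412.1 |
"""
summary = summarize_run(sample)
print(summary)
```

The computed value should agree with the reported one to within rounding; a larger gap would suggest the two numbers were taken over different time windows.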