diff --git "a/sf_log.txt" "b/sf_log.txt" --- "a/sf_log.txt" +++ "b/sf_log.txt" @@ -1,32 +1,39 @@ -[2023-09-26 21:53:11,745][51558] Saving configuration to ./train_atari/atari_privateye/config.json... -[2023-09-26 21:53:12,061][51558] Rollout worker 0 uses device cpu -[2023-09-26 21:53:12,062][51558] Rollout worker 1 uses device cpu -[2023-09-26 21:53:12,062][51558] Rollout worker 2 uses device cpu -[2023-09-26 21:53:12,062][51558] Rollout worker 3 uses device cpu -[2023-09-26 21:53:12,063][51558] Rollout worker 4 uses device cpu -[2023-09-26 21:53:12,063][51558] Rollout worker 5 uses device cpu -[2023-09-26 21:53:12,063][51558] Rollout worker 6 uses device cpu -[2023-09-26 21:53:12,063][51558] Rollout worker 7 uses device cpu -[2023-09-26 21:53:12,064][51558] In synchronous mode, we only accumulate one batch. Setting num_batches_to_accumulate to 1 -[2023-09-26 21:53:12,107][51558] Using GPUs [0] for process 0 (actually maps to GPUs [0]) -[2023-09-26 21:53:12,107][51558] InferenceWorker_p0-w0: min num requests: 1 -[2023-09-26 21:53:12,111][51558] Using GPUs [1] for process 1 (actually maps to GPUs [1]) -[2023-09-26 21:53:12,111][51558] InferenceWorker_p1-w0: min num requests: 1 -[2023-09-26 21:53:12,134][51558] Starting all processes... -[2023-09-26 21:53:12,135][51558] Starting process learner_proc0 -[2023-09-26 21:53:13,735][51558] Starting process learner_proc1 -[2023-09-26 21:53:13,738][52310] Using GPUs [0] for process 0 (actually maps to GPUs [0]) -[2023-09-26 21:53:13,738][52310] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 -[2023-09-26 21:53:13,757][52310] Num visible devices: 1 -[2023-09-26 21:53:13,781][52310] Starting seed is not provided -[2023-09-26 21:53:13,781][52310] Using GPUs [0] for process 0 (actually maps to GPUs [0]) -[2023-09-26 21:53:13,782][52310] Initializing actor-critic model on device cuda:0 -[2023-09-26 21:53:13,782][52310] RunningMeanStd input shape: (4, 84, 84) -[2023-09-26 21:53:13,783][52310] RunningMeanStd input shape: (1,) -[2023-09-26 21:53:13,795][52310] ConvEncoder: input_channels=4 -[2023-09-26 21:53:13,965][52310] Conv encoder output size: 512 -[2023-09-26 21:53:13,967][52310] Created Actor Critic model with architecture: -[2023-09-26 21:53:13,967][52310] ActorCriticSharedWeights( +[2023-10-14 04:59:42,455][99942] Saving configuration to ./train_atari/atari_privateye_APPO/config.json... +[2023-10-14 04:59:42,772][99942] Rollout worker 0 uses device cpu +[2023-10-14 04:59:42,773][99942] Rollout worker 1 uses device cpu +[2023-10-14 04:59:42,774][99942] Rollout worker 2 uses device cpu +[2023-10-14 04:59:42,774][99942] Rollout worker 3 uses device cpu +[2023-10-14 04:59:42,775][99942] Rollout worker 4 uses device cpu +[2023-10-14 04:59:42,775][99942] Rollout worker 5 uses device cpu +[2023-10-14 04:59:42,775][99942] Rollout worker 6 uses device cpu +[2023-10-14 04:59:42,776][99942] Rollout worker 7 uses device cpu +[2023-10-14 04:59:42,776][99942] Rollout worker 8 uses device cpu +[2023-10-14 04:59:42,777][99942] Rollout worker 9 uses device cpu +[2023-10-14 04:59:42,777][99942] Rollout worker 10 uses device cpu +[2023-10-14 04:59:42,778][99942] Rollout worker 11 uses device cpu +[2023-10-14 04:59:42,778][99942] Rollout worker 12 uses device cpu +[2023-10-14 04:59:42,779][99942] Rollout worker 13 uses device cpu +[2023-10-14 04:59:42,779][99942] Rollout worker 14 uses device cpu +[2023-10-14 04:59:42,779][99942] Rollout worker 15 uses device cpu +[2023-10-14 04:59:43,068][99942] Using GPUs [0] for process 0 (actually maps to GPUs [0]) +[2023-10-14 04:59:43,068][99942] InferenceWorker_p0-w0: min num requests: 2 +[2023-10-14 04:59:43,071][99942] Using GPUs [1] for process 1 (actually maps to GPUs [1]) +[2023-10-14 04:59:43,072][99942] InferenceWorker_p1-w0: min num requests: 2 +[2023-10-14 04:59:43,121][99942] Starting all processes... +[2023-10-14 04:59:43,121][99942] Starting process learner_proc0 +[2023-10-14 04:59:44,824][99942] Starting process learner_proc1 +[2023-10-14 04:59:44,829][100560] Using GPUs [0] for process 0 (actually maps to GPUs [0]) +[2023-10-14 04:59:44,829][100560] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 +[2023-10-14 04:59:44,848][100560] Num visible devices: 1 +[2023-10-14 04:59:44,869][100560] Setting fixed seed 1234 +[2023-10-14 04:59:44,870][100560] Using GPUs [0] for process 0 (actually maps to GPUs [0]) +[2023-10-14 04:59:44,870][100560] Initializing actor-critic model on device cuda:0 +[2023-10-14 04:59:44,871][100560] RunningMeanStd input shape: (4, 84, 84) +[2023-10-14 04:59:44,871][100560] RunningMeanStd input shape: (1,) +[2023-10-14 04:59:44,882][100560] ConvEncoder: input_channels=4 +[2023-10-14 04:59:45,062][100560] Conv encoder output size: 512 +[2023-10-14 04:59:45,064][100560] Created Actor Critic model with architecture: +[2023-10-14 04:59:45,064][100560] ActorCriticSharedWeights( (obs_normalizer): ObservationNormalizer( (running_mean_std): RunningMeanStdDictInPlace( (running_mean_std): ModuleDict( @@ -67,35 +74,41 @@ (distribution_linear): Linear(in_features=512, out_features=18, bias=True) ) ) -[2023-09-26 21:53:14,545][52310] Using optimizer -[2023-09-26 21:53:14,546][52310] No checkpoints found -[2023-09-26 21:53:14,546][52310] Did not load from checkpoint, starting from scratch! -[2023-09-26 21:53:14,546][52310] Initialized policy 0 weights for model version 0 -[2023-09-26 21:53:14,548][52310] LearnerWorker_p0 finished initialization! -[2023-09-26 21:53:14,549][52310] Using GPUs [0] for process 0 (actually maps to GPUs [0]) -[2023-09-26 21:53:15,390][51558] Starting all processes... -[2023-09-26 21:53:15,393][52398] Using GPUs [1] for process 1 (actually maps to GPUs [1]) -[2023-09-26 21:53:15,393][52398] Set environment var CUDA_VISIBLE_DEVICES to '1' (GPU indices [1]) for learning process 1 -[2023-09-26 21:53:15,397][51558] Starting process inference_proc0-0 -[2023-09-26 21:53:15,398][51558] Starting process inference_proc1-0 -[2023-09-26 21:53:15,398][51558] Starting process rollout_proc0 -[2023-09-26 21:53:15,398][51558] Starting process rollout_proc1 -[2023-09-26 21:53:15,412][52398] Num visible devices: 1 -[2023-09-26 21:53:15,398][51558] Starting process rollout_proc2 -[2023-09-26 21:53:15,399][51558] Starting process rollout_proc3 -[2023-09-26 21:53:15,434][52398] Starting seed is not provided -[2023-09-26 21:53:15,434][52398] Using GPUs [0] for process 1 (actually maps to GPUs [1]) -[2023-09-26 21:53:15,435][52398] Initializing actor-critic model on device cuda:0 -[2023-09-26 21:53:15,435][52398] RunningMeanStd input shape: (4, 84, 84) -[2023-09-26 21:53:15,435][52398] RunningMeanStd input shape: (1,) -[2023-09-26 21:53:15,399][51558] Starting process rollout_proc4 -[2023-09-26 21:53:15,403][51558] Starting process rollout_proc5 -[2023-09-26 21:53:15,406][51558] Starting process rollout_proc6 -[2023-09-26 21:53:15,407][51558] Starting process rollout_proc7 -[2023-09-26 21:53:15,448][52398] ConvEncoder: input_channels=4 -[2023-09-26 21:53:15,789][52398] Conv encoder output size: 512 -[2023-09-26 21:53:15,792][52398] Created Actor Critic model with architecture: -[2023-09-26 21:53:15,793][52398] ActorCriticSharedWeights( +[2023-10-14 04:59:45,620][100560] Using optimizer +[2023-10-14 04:59:45,621][100560] No checkpoints found +[2023-10-14 04:59:45,621][100560] Did not load from checkpoint, starting from scratch! +[2023-10-14 04:59:45,621][100560] Initialized policy 0 weights for model version 0 +[2023-10-14 04:59:45,623][100560] LearnerWorker_p0 finished initialization! +[2023-10-14 04:59:45,623][100560] Using GPUs [0] for process 0 (actually maps to GPUs [0]) +[2023-10-14 04:59:46,628][99942] Starting all processes... +[2023-10-14 04:59:46,631][100681] Using GPUs [1] for process 1 (actually maps to GPUs [1]) +[2023-10-14 04:59:46,631][100681] Set environment var CUDA_VISIBLE_DEVICES to '1' (GPU indices [1]) for learning process 1 +[2023-10-14 04:59:46,636][99942] Starting process inference_proc0-0 +[2023-10-14 04:59:46,637][99942] Starting process inference_proc1-0 +[2023-10-14 04:59:46,637][99942] Starting process rollout_proc0 +[2023-10-14 04:59:46,649][100681] Num visible devices: 1 +[2023-10-14 04:59:46,637][99942] Starting process rollout_proc1 +[2023-10-14 04:59:46,637][99942] Starting process rollout_proc2 +[2023-10-14 04:59:46,668][100681] Setting fixed seed 1234 +[2023-10-14 04:59:46,638][99942] Starting process rollout_proc3 +[2023-10-14 04:59:46,669][100681] Using GPUs [0] for process 1 (actually maps to GPUs [1]) +[2023-10-14 04:59:46,669][100681] Initializing actor-critic model on device cuda:0 +[2023-10-14 04:59:46,638][99942] Starting process rollout_proc4 +[2023-10-14 04:59:46,670][100681] RunningMeanStd input shape: (4, 84, 84) +[2023-10-14 04:59:46,670][100681] RunningMeanStd input shape: (1,) +[2023-10-14 04:59:46,645][99942] Starting process rollout_proc5 +[2023-10-14 04:59:46,647][99942] Starting process rollout_proc6 +[2023-10-14 04:59:46,648][99942] Starting process rollout_proc7 +[2023-10-14 04:59:46,649][99942] Starting process rollout_proc8 +[2023-10-14 04:59:46,654][99942] Starting process rollout_proc9 +[2023-10-14 04:59:46,683][100681] ConvEncoder: input_channels=4 +[2023-10-14 04:59:46,654][99942] Starting process rollout_proc10 +[2023-10-14 04:59:46,664][99942] Starting process rollout_proc11 +[2023-10-14 04:59:46,665][99942] Starting process rollout_proc12 +[2023-10-14 04:59:46,666][99942] Starting process rollout_proc13 +[2023-10-14 04:59:47,161][100681] Conv encoder output size: 512 +[2023-10-14 04:59:47,164][100681] Created Actor Critic model with architecture: +[2023-10-14 04:59:47,165][100681] ActorCriticSharedWeights( (obs_normalizer): ObservationNormalizer( (running_mean_std): RunningMeanStdDictInPlace( (running_mean_std): ModuleDict( @@ -136,2137 +149,26337 @@ (distribution_linear): Linear(in_features=512, out_features=18, bias=True) ) ) -[2023-09-26 21:53:16,497][52398] Using optimizer -[2023-09-26 21:53:16,498][52398] No checkpoints found -[2023-09-26 21:53:16,498][52398] Did not load from checkpoint, starting from scratch! -[2023-09-26 21:53:16,498][52398] Initialized policy 1 weights for model version 0 -[2023-09-26 21:53:16,500][52398] LearnerWorker_p1 finished initialization! -[2023-09-26 21:53:16,500][52398] Using GPUs [0] for process 1 (actually maps to GPUs [1]) -[2023-09-26 21:53:17,378][52541] Using GPUs [1] for process 1 (actually maps to GPUs [1]) -[2023-09-26 21:53:17,378][52541] Set environment var CUDA_VISIBLE_DEVICES to '1' (GPU indices [1]) for inference process 1 -[2023-09-26 21:53:17,386][52587] Worker 7 uses CPU cores [28, 29, 30, 31] -[2023-09-26 21:53:17,390][52586] Worker 5 uses CPU cores [20, 21, 22, 23] -[2023-09-26 21:53:17,390][52584] Worker 4 uses CPU cores [16, 17, 18, 19] -[2023-09-26 21:53:17,396][52541] Num visible devices: 1 -[2023-09-26 21:53:17,404][52540] Using GPUs [0] for process 0 (actually maps to GPUs [0]) -[2023-09-26 21:53:17,404][52540] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 -[2023-09-26 21:53:17,410][52580] Worker 1 uses CPU cores [4, 5, 6, 7] -[2023-09-26 21:53:17,410][52582] Worker 2 uses CPU cores [8, 9, 10, 11] -[2023-09-26 21:53:17,423][52540] Num visible devices: 1 -[2023-09-26 21:53:17,429][52583] Worker 3 uses CPU cores [12, 13, 14, 15] -[2023-09-26 21:53:17,476][52576] Worker 0 uses CPU cores [0, 1, 2, 3] -[2023-09-26 21:53:17,599][52585] Worker 6 uses CPU cores [24, 25, 26, 27] -[2023-09-26 21:53:17,914][51558] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan, 1: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) -[2023-09-26 21:53:18,028][52540] RunningMeanStd input shape: (4, 84, 84) -[2023-09-26 21:53:18,028][52540] RunningMeanStd input shape: (1,) -[2023-09-26 21:53:18,039][52540] ConvEncoder: input_channels=4 -[2023-09-26 21:53:18,045][52541] RunningMeanStd input shape: (4, 84, 84) -[2023-09-26 21:53:18,045][52541] RunningMeanStd input shape: (1,) -[2023-09-26 21:53:18,057][52541] ConvEncoder: input_channels=4 -[2023-09-26 21:53:18,135][52540] Conv encoder output size: 512 -[2023-09-26 21:53:18,141][51558] Inference worker 0-0 is ready! -[2023-09-26 21:53:18,156][52541] Conv encoder output size: 512 -[2023-09-26 21:53:18,162][51558] Inference worker 1-0 is ready! -[2023-09-26 21:53:18,163][51558] All inference workers are ready! Signal rollout workers to start! -[2023-09-26 21:53:18,637][52580] Decorrelating experience for 0 frames... -[2023-09-26 21:53:18,637][52587] Decorrelating experience for 0 frames... -[2023-09-26 21:53:18,638][52584] Decorrelating experience for 0 frames... -[2023-09-26 21:53:18,641][52582] Decorrelating experience for 0 frames... -[2023-09-26 21:53:18,642][52585] Decorrelating experience for 0 frames... -[2023-09-26 21:53:18,643][52586] Decorrelating experience for 0 frames... -[2023-09-26 21:53:18,644][52583] Decorrelating experience for 0 frames... -[2023-09-26 21:53:18,725][52576] Decorrelating experience for 0 frames... -[2023-09-26 21:53:22,914][51558] Fps is (10 sec: 1638.4, 60 sec: 1638.4, 300 sec: 1638.4). Total num frames: 8192. Throughput: 0: 204.8, 1: 204.8. Samples: 2048. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:53:27,914][51558] Fps is (10 sec: 3276.9, 60 sec: 3276.9, 300 sec: 3276.9). Total num frames: 32768. Throughput: 0: 405.6, 1: 401.4. Samples: 8070. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:53:32,095][51558] Heartbeat connected on Batcher_0 -[2023-09-26 21:53:32,098][51558] Heartbeat connected on LearnerWorker_p0 -[2023-09-26 21:53:32,101][51558] Heartbeat connected on Batcher_1 -[2023-09-26 21:53:32,103][51558] Heartbeat connected on LearnerWorker_p1 -[2023-09-26 21:53:32,109][51558] Heartbeat connected on InferenceWorker_p0-w0 -[2023-09-26 21:53:32,113][51558] Heartbeat connected on InferenceWorker_p1-w0 -[2023-09-26 21:53:32,114][51558] Heartbeat connected on RolloutWorker_w0 -[2023-09-26 21:53:32,118][51558] Heartbeat connected on RolloutWorker_w1 -[2023-09-26 21:53:32,119][51558] Heartbeat connected on RolloutWorker_w2 -[2023-09-26 21:53:32,124][51558] Heartbeat connected on RolloutWorker_w3 -[2023-09-26 21:53:32,127][51558] Heartbeat connected on RolloutWorker_w4 -[2023-09-26 21:53:32,130][51558] Heartbeat connected on RolloutWorker_w5 -[2023-09-26 21:53:32,131][51558] Heartbeat connected on RolloutWorker_w6 -[2023-09-26 21:53:32,133][51558] Heartbeat connected on RolloutWorker_w7 -[2023-09-26 21:53:32,914][51558] Fps is (10 sec: 5734.3, 60 sec: 4369.1, 300 sec: 4369.1). Total num frames: 65536. Throughput: 0: 410.9, 1: 411.9. Samples: 12343. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:53:35,401][52541] Updated weights for policy 1, policy_version 160 (0.0016) -[2023-09-26 21:53:35,402][52540] Updated weights for policy 0, policy_version 160 (0.0018) -[2023-09-26 21:53:37,914][51558] Fps is (10 sec: 5734.3, 60 sec: 4505.6, 300 sec: 4505.6). Total num frames: 90112. Throughput: 0: 548.6, 1: 548.2. Samples: 21934. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:53:37,915][51558] Avg episode reward: [(0, '-13.750'), (1, '-12.500')] -[2023-09-26 21:53:42,914][51558] Fps is (10 sec: 5734.4, 60 sec: 4915.2, 300 sec: 4915.2). Total num frames: 122880. Throughput: 0: 618.3, 1: 619.6. Samples: 30946. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:53:42,915][51558] Avg episode reward: [(0, '-13.750'), (1, '-12.500')] -[2023-09-26 21:53:47,914][51558] Fps is (10 sec: 6553.7, 60 sec: 5188.3, 300 sec: 5188.3). Total num frames: 155648. Throughput: 0: 596.9, 1: 597.3. Samples: 35825. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:53:47,915][51558] Avg episode reward: [(0, '-13.750'), (1, '-12.500')] -[2023-09-26 21:53:48,747][52541] Updated weights for policy 1, policy_version 320 (0.0016) -[2023-09-26 21:53:48,747][52540] Updated weights for policy 0, policy_version 320 (0.0017) -[2023-09-26 21:53:52,914][51558] Fps is (10 sec: 6553.7, 60 sec: 5383.3, 300 sec: 5383.3). Total num frames: 188416. Throughput: 0: 640.1, 1: 640.1. Samples: 44806. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:53:52,915][51558] Avg episode reward: [(0, '-6.500'), (1, '-5.750')] -[2023-09-26 21:53:57,914][51558] Fps is (10 sec: 5734.3, 60 sec: 5324.8, 300 sec: 5324.8). Total num frames: 212992. Throughput: 0: 672.0, 1: 674.0. Samples: 53838. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:53:57,915][51558] Avg episode reward: [(0, '-6.500'), (1, '-5.750')] -[2023-09-26 21:53:57,916][52310] Saving new best policy, reward=-6.500! -[2023-09-26 21:53:58,062][52398] Saving new best policy, reward=-5.750! -[2023-09-26 21:54:02,327][52540] Updated weights for policy 0, policy_version 480 (0.0018) -[2023-09-26 21:54:02,328][52541] Updated weights for policy 1, policy_version 480 (0.0017) -[2023-09-26 21:54:02,914][51558] Fps is (10 sec: 5734.4, 60 sec: 5461.4, 300 sec: 5461.4). Total num frames: 245760. Throughput: 0: 648.6, 1: 649.4. Samples: 58412. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 21:54:02,915][51558] Avg episode reward: [(0, '-6.500'), (1, '-5.750')] -[2023-09-26 21:54:07,914][51558] Fps is (10 sec: 6553.7, 60 sec: 5570.6, 300 sec: 5570.6). Total num frames: 278528. Throughput: 0: 728.2, 1: 728.2. Samples: 67584. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 21:54:07,915][51558] Avg episode reward: [(0, '-4.000'), (1, '-3.500')] -[2023-09-26 21:54:07,917][52310] Saving new best policy, reward=-4.000! -[2023-09-26 21:54:07,917][52398] Saving new best policy, reward=-3.500! -[2023-09-26 21:54:12,914][51558] Fps is (10 sec: 6553.6, 60 sec: 5659.9, 300 sec: 5659.9). Total num frames: 311296. Throughput: 0: 765.0, 1: 766.3. Samples: 76978. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:54:12,915][51558] Avg episode reward: [(0, '-4.000'), (1, '-3.500')] -[2023-09-26 21:54:15,707][52540] Updated weights for policy 0, policy_version 640 (0.0019) -[2023-09-26 21:54:15,707][52541] Updated weights for policy 1, policy_version 640 (0.0019) -[2023-09-26 21:54:17,914][51558] Fps is (10 sec: 5734.3, 60 sec: 5597.9, 300 sec: 5597.9). Total num frames: 335872. Throughput: 0: 767.6, 1: 766.9. Samples: 81395. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) -[2023-09-26 21:54:17,915][51558] Avg episode reward: [(0, '-4.000'), (1, '-3.500')] -[2023-09-26 21:54:22,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6007.4, 300 sec: 5671.4). Total num frames: 368640. Throughput: 0: 759.0, 1: 759.8. Samples: 90279. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:54:22,915][51558] Avg episode reward: [(0, '-2.812'), (1, '-2.375')] -[2023-09-26 21:54:22,919][52398] Saving new best policy, reward=-2.375! -[2023-09-26 21:54:22,919][52310] Saving new best policy, reward=-2.812! -[2023-09-26 21:54:27,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 5734.4). Total num frames: 401408. Throughput: 0: 764.5, 1: 762.6. Samples: 99663. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:54:27,915][51558] Avg episode reward: [(0, '-2.812'), (1, '-2.375')] -[2023-09-26 21:54:29,137][52541] Updated weights for policy 1, policy_version 800 (0.0017) -[2023-09-26 21:54:29,137][52540] Updated weights for policy 0, policy_version 800 (0.0016) -[2023-09-26 21:54:32,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6007.5, 300 sec: 5679.8). Total num frames: 425984. Throughput: 0: 761.2, 1: 761.6. Samples: 104349. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:54:32,915][51558] Avg episode reward: [(0, '-3.450'), (1, '-1.700')] -[2023-09-26 21:54:32,916][52398] Saving new best policy, reward=-1.700! -[2023-09-26 21:54:37,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 5734.4). Total num frames: 458752. Throughput: 0: 761.4, 1: 761.8. Samples: 113349. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 21:54:37,915][51558] Avg episode reward: [(0, '-3.450'), (1, '-1.700')] -[2023-09-26 21:54:42,396][52541] Updated weights for policy 1, policy_version 960 (0.0016) -[2023-09-26 21:54:42,396][52540] Updated weights for policy 0, policy_version 960 (0.0018) -[2023-09-26 21:54:42,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 5782.6). Total num frames: 491520. Throughput: 0: 768.0, 1: 766.2. Samples: 122876. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 21:54:42,915][51558] Avg episode reward: [(0, '-3.450'), (1, '-1.700')] -[2023-09-26 21:54:47,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 5825.4). Total num frames: 524288. Throughput: 0: 766.0, 1: 765.0. Samples: 127307. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:54:47,915][51558] Avg episode reward: [(0, '-4.542'), (1, '-1.250')] -[2023-09-26 21:54:47,917][52398] Saving new best policy, reward=-1.250! -[2023-09-26 21:54:52,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 5863.8). Total num frames: 557056. Throughput: 0: 770.7, 1: 771.0. Samples: 136960. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 21:54:52,915][51558] Avg episode reward: [(0, '-4.542'), (1, '-1.250')] -[2023-09-26 21:54:55,471][52540] Updated weights for policy 0, policy_version 1120 (0.0016) -[2023-09-26 21:54:55,472][52541] Updated weights for policy 1, policy_version 1120 (0.0017) -[2023-09-26 21:54:57,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 5816.3). Total num frames: 581632. Throughput: 0: 765.8, 1: 765.9. Samples: 145908. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:54:57,915][51558] Avg episode reward: [(0, '-4.542'), (1, '-1.250')] -[2023-09-26 21:55:02,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 5851.4). Total num frames: 614400. Throughput: 0: 769.4, 1: 770.6. Samples: 150694. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:55:02,915][51558] Avg episode reward: [(0, '-3.893'), (1, '-0.929')] -[2023-09-26 21:55:02,916][52398] Saving new best policy, reward=-0.929! -[2023-09-26 21:55:07,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 5883.4). Total num frames: 647168. Throughput: 0: 772.1, 1: 771.3. Samples: 159730. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:55:07,914][51558] Avg episode reward: [(0, '-3.893'), (1, '-0.929')] -[2023-09-26 21:55:07,917][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000001264_323584.pth... -[2023-09-26 21:55:07,918][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000001264_323584.pth... -[2023-09-26 21:55:09,129][52541] Updated weights for policy 1, policy_version 1280 (0.0015) -[2023-09-26 21:55:09,129][52540] Updated weights for policy 0, policy_version 1280 (0.0016) -[2023-09-26 21:55:12,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6007.5, 300 sec: 5841.3). Total num frames: 671744. Throughput: 0: 766.8, 1: 768.6. Samples: 168759. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 21:55:12,915][51558] Avg episode reward: [(0, '-3.893'), (1, '-0.929')] -[2023-09-26 21:55:17,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 5870.9). Total num frames: 704512. Throughput: 0: 766.4, 1: 767.1. Samples: 173358. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:55:17,915][51558] Avg episode reward: [(0, '-3.375'), (1, '-1.625')] -[2023-09-26 21:55:22,317][52540] Updated weights for policy 0, policy_version 1440 (0.0017) -[2023-09-26 21:55:22,317][52541] Updated weights for policy 1, policy_version 1440 (0.0016) -[2023-09-26 21:55:22,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 5898.3). Total num frames: 737280. Throughput: 0: 770.3, 1: 769.6. Samples: 182641. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 21:55:22,914][51558] Avg episode reward: [(0, '-3.375'), (1, '-1.625')] -[2023-09-26 21:55:27,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 5923.4). Total num frames: 770048. Throughput: 0: 766.4, 1: 767.4. Samples: 191900. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 21:55:27,915][51558] Avg episode reward: [(0, '-3.375'), (1, '-1.625')] -[2023-09-26 21:55:32,914][51558] Fps is (10 sec: 5734.2, 60 sec: 6144.0, 300 sec: 5886.1). Total num frames: 794624. Throughput: 0: 770.0, 1: 770.1. Samples: 196608. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-09-26 21:55:32,915][51558] Avg episode reward: [(0, '-3.028'), (1, '-1.361')] -[2023-09-26 21:55:35,740][52540] Updated weights for policy 0, policy_version 1600 (0.0018) -[2023-09-26 21:55:35,740][52541] Updated weights for policy 1, policy_version 1600 (0.0017) -[2023-09-26 21:55:37,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 5909.9). Total num frames: 827392. Throughput: 0: 762.0, 1: 762.3. Samples: 205551. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 21:55:37,915][51558] Avg episode reward: [(0, '-3.028'), (1, '-1.361')] -[2023-09-26 21:55:42,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 5932.1). Total num frames: 860160. Throughput: 0: 768.4, 1: 767.9. Samples: 215040. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:55:42,915][51558] Avg episode reward: [(0, '-2.775'), (1, '-1.225')] -[2023-09-26 21:55:42,916][52310] Saving new best policy, reward=-2.775! -[2023-09-26 21:55:47,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 5952.9). Total num frames: 892928. Throughput: 0: 762.9, 1: 762.6. Samples: 219343. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 21:55:47,915][51558] Avg episode reward: [(0, '-2.775'), (1, '-1.225')] -[2023-09-26 21:55:49,166][52540] Updated weights for policy 0, policy_version 1760 (0.0017) -[2023-09-26 21:55:49,166][52541] Updated weights for policy 1, policy_version 1760 (0.0016) -[2023-09-26 21:55:52,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6007.5, 300 sec: 5919.4). Total num frames: 917504. Throughput: 0: 765.8, 1: 765.9. Samples: 228654. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:55:52,914][51558] Avg episode reward: [(0, '-2.775'), (1, '-1.225')] -[2023-09-26 21:55:57,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 5939.2). Total num frames: 950272. Throughput: 0: 765.5, 1: 765.5. Samples: 237652. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:55:57,915][51558] Avg episode reward: [(0, '-4.114'), (1, '-1.114')] -[2023-09-26 21:56:02,418][52541] Updated weights for policy 1, policy_version 1920 (0.0017) -[2023-09-26 21:56:02,419][52540] Updated weights for policy 0, policy_version 1920 (0.0018) -[2023-09-26 21:56:02,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 5957.8). Total num frames: 983040. Throughput: 0: 769.0, 1: 766.4. Samples: 242453. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:56:02,915][51558] Avg episode reward: [(0, '-4.114'), (1, '-1.114')] -[2023-09-26 21:56:07,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 5975.3). Total num frames: 1015808. Throughput: 0: 769.4, 1: 769.8. Samples: 251904. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 21:56:07,915][51558] Avg episode reward: [(0, '-4.114'), (1, '-1.114')] -[2023-09-26 21:56:12,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6280.5, 300 sec: 5991.9). Total num frames: 1048576. Throughput: 0: 771.9, 1: 771.2. Samples: 261342. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) -[2023-09-26 21:56:12,915][51558] Avg episode reward: [(0, '-3.750'), (1, '-1.021')] -[2023-09-26 21:56:15,518][52540] Updated weights for policy 0, policy_version 2080 (0.0015) -[2023-09-26 21:56:15,519][52541] Updated weights for policy 1, policy_version 2080 (0.0018) -[2023-09-26 21:56:17,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 5962.0). Total num frames: 1073152. Throughput: 0: 771.7, 1: 771.3. Samples: 266041. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 21:56:17,915][51558] Avg episode reward: [(0, '-3.750'), (1, '-1.021')] -[2023-09-26 21:56:22,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 5977.9). Total num frames: 1105920. Throughput: 0: 773.0, 1: 772.7. Samples: 275106. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 21:56:22,915][51558] Avg episode reward: [(0, '-3.750'), (1, '-1.021')] -[2023-09-26 21:56:27,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 5993.1). Total num frames: 1138688. Throughput: 0: 773.7, 1: 773.6. Samples: 284670. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 21:56:27,915][51558] Avg episode reward: [(0, '-3.442'), (1, '-0.923')] -[2023-09-26 21:56:27,916][52398] Saving new best policy, reward=-0.923! -[2023-09-26 21:56:28,718][52540] Updated weights for policy 0, policy_version 2240 (0.0018) -[2023-09-26 21:56:28,719][52541] Updated weights for policy 1, policy_version 2240 (0.0018) -[2023-09-26 21:56:32,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6007.5). Total num frames: 1171456. Throughput: 0: 774.8, 1: 774.5. Samples: 289062. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:56:32,915][51558] Avg episode reward: [(0, '-3.442'), (1, '-0.923')] -[2023-09-26 21:56:37,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 5980.2). Total num frames: 1196032. Throughput: 0: 774.0, 1: 773.2. Samples: 298281. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:56:37,914][51558] Avg episode reward: [(0, '-3.442'), (1, '-0.923')] -[2023-09-26 21:56:42,083][52541] Updated weights for policy 1, policy_version 2400 (0.0017) -[2023-09-26 21:56:42,083][52540] Updated weights for policy 0, policy_version 2400 (0.0018) -[2023-09-26 21:56:42,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 5994.1). Total num frames: 1228800. Throughput: 0: 774.0, 1: 773.8. Samples: 307303. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:56:42,915][51558] Avg episode reward: [(0, '-3.196'), (1, '-0.821')] -[2023-09-26 21:56:42,917][52398] Saving new best policy, reward=-0.821! -[2023-09-26 21:56:47,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6144.0, 300 sec: 6007.5). Total num frames: 1261568. Throughput: 0: 771.6, 1: 773.1. Samples: 311964. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 21:56:47,915][51558] Avg episode reward: [(0, '-3.196'), (1, '-0.821')] -[2023-09-26 21:56:52,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 5982.1). Total num frames: 1286144. Throughput: 0: 769.0, 1: 767.5. Samples: 321049. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:56:52,916][51558] Avg episode reward: [(0, '-2.983'), (1, '-0.767')] -[2023-09-26 21:56:52,958][52398] Saving new best policy, reward=-0.767! -[2023-09-26 21:56:55,667][52541] Updated weights for policy 1, policy_version 2560 (0.0016) -[2023-09-26 21:56:55,668][52540] Updated weights for policy 0, policy_version 2560 (0.0016) -[2023-09-26 21:56:57,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 5995.1). Total num frames: 1318912. Throughput: 0: 762.5, 1: 763.1. Samples: 329992. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:56:57,915][51558] Avg episode reward: [(0, '-2.983'), (1, '-0.767')] -[2023-09-26 21:57:02,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6144.0, 300 sec: 6007.5). Total num frames: 1351680. Throughput: 0: 765.4, 1: 766.3. Samples: 334970. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 21:57:02,915][51558] Avg episode reward: [(0, '-2.983'), (1, '-0.767')] -[2023-09-26 21:57:07,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6019.3). Total num frames: 1384448. Throughput: 0: 766.4, 1: 766.0. Samples: 344064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 21:57:07,915][51558] Avg episode reward: [(0, '-2.766'), (1, '-0.719')] -[2023-09-26 21:57:07,919][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000002704_692224.pth... -[2023-09-26 21:57:07,919][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000002704_692224.pth... -[2023-09-26 21:57:07,955][52310] Saving new best policy, reward=-2.766! -[2023-09-26 21:57:07,955][52398] Saving new best policy, reward=-0.719! -[2023-09-26 21:57:08,952][52541] Updated weights for policy 1, policy_version 2720 (0.0017) -[2023-09-26 21:57:08,952][52540] Updated weights for policy 0, policy_version 2720 (0.0019) -[2023-09-26 21:57:12,914][51558] Fps is (10 sec: 6143.9, 60 sec: 6075.7, 300 sec: 6013.3). Total num frames: 1413120. Throughput: 0: 762.5, 1: 763.1. Samples: 353321. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:57:12,915][51558] Avg episode reward: [(0, '-2.766'), (1, '-0.719')] -[2023-09-26 21:57:17,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6007.5). Total num frames: 1441792. Throughput: 0: 763.8, 1: 766.2. Samples: 357911. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:57:17,915][51558] Avg episode reward: [(0, '-2.766'), (1, '-0.719')] -[2023-09-26 21:57:22,188][52541] Updated weights for policy 1, policy_version 2880 (0.0017) -[2023-09-26 21:57:22,188][52540] Updated weights for policy 0, policy_version 2880 (0.0016) -[2023-09-26 21:57:22,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6144.0, 300 sec: 6018.6). Total num frames: 1474560. Throughput: 0: 762.8, 1: 764.4. Samples: 367008. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 21:57:22,915][51558] Avg episode reward: [(0, '-2.588'), (1, '-0.662')] -[2023-09-26 21:57:22,920][52310] Saving new best policy, reward=-2.588! -[2023-09-26 21:57:22,920][52398] Saving new best policy, reward=-0.662! -[2023-09-26 21:57:27,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6029.3). Total num frames: 1507328. Throughput: 0: 767.6, 1: 767.8. Samples: 376392. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:57:27,915][51558] Avg episode reward: [(0, '-2.588'), (1, '-0.662')] -[2023-09-26 21:57:32,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6007.5, 300 sec: 6007.5). Total num frames: 1531904. Throughput: 0: 765.3, 1: 765.9. Samples: 380871. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:57:32,915][51558] Avg episode reward: [(0, '-2.588'), (1, '-0.662')] -[2023-09-26 21:57:35,736][52540] Updated weights for policy 0, policy_version 3040 (0.0017) -[2023-09-26 21:57:35,737][52541] Updated weights for policy 1, policy_version 3040 (0.0016) -[2023-09-26 21:57:37,914][51558] Fps is (10 sec: 5734.6, 60 sec: 6144.0, 300 sec: 6018.0). Total num frames: 1564672. Throughput: 0: 764.7, 1: 766.6. Samples: 389957. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:57:37,915][51558] Avg episode reward: [(0, '-2.431'), (1, '-0.597')] -[2023-09-26 21:57:37,917][52310] Saving new best policy, reward=-2.431! -[2023-09-26 21:57:37,917][52398] Saving new best policy, reward=-0.597! -[2023-09-26 21:57:42,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6028.1). Total num frames: 1597440. Throughput: 0: 771.1, 1: 770.4. Samples: 399360. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:57:42,915][51558] Avg episode reward: [(0, '-2.431'), (1, '-0.597')] -[2023-09-26 21:57:47,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6037.8). Total num frames: 1630208. Throughput: 0: 765.2, 1: 765.1. Samples: 403832. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:57:47,915][51558] Avg episode reward: [(0, '-2.431'), (1, '-0.597')] -[2023-09-26 21:57:48,834][52541] Updated weights for policy 1, policy_version 3200 (0.0016) -[2023-09-26 21:57:48,834][52540] Updated weights for policy 0, policy_version 3200 (0.0017) -[2023-09-26 21:57:52,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.6, 300 sec: 6047.2). Total num frames: 1662976. Throughput: 0: 773.7, 1: 773.7. Samples: 413696. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 21:57:52,915][51558] Avg episode reward: [(0, '-2.697'), (1, '-0.553')] -[2023-09-26 21:57:52,920][52398] Saving new best policy, reward=-0.553! -[2023-09-26 21:57:57,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6056.2). Total num frames: 1695744. Throughput: 0: 776.3, 1: 776.3. Samples: 423189. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 21:57:57,915][51558] Avg episode reward: [(0, '-2.697'), (1, '-0.553')] -[2023-09-26 21:58:01,730][52540] Updated weights for policy 0, policy_version 3360 (0.0017) -[2023-09-26 21:58:01,730][52541] Updated weights for policy 1, policy_version 3360 (0.0015) -[2023-09-26 21:58:02,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6036.2). Total num frames: 1720320. Throughput: 0: 779.9, 1: 777.4. Samples: 427988. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 21:58:02,915][51558] Avg episode reward: [(0, '-2.663'), (1, '-0.475')] -[2023-09-26 21:58:02,915][52398] Saving new best policy, reward=-0.475! -[2023-09-26 21:58:07,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6045.1). Total num frames: 1753088. Throughput: 0: 773.8, 1: 773.4. Samples: 436628. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 21:58:07,915][51558] Avg episode reward: [(0, '-2.663'), (1, '-0.475')] -[2023-09-26 21:58:12,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6212.3, 300 sec: 6053.7). Total num frames: 1785856. Throughput: 0: 773.0, 1: 773.7. Samples: 445994. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:58:12,915][51558] Avg episode reward: [(0, '-2.663'), (1, '-0.475')] -[2023-09-26 21:58:15,390][52540] Updated weights for policy 0, policy_version 3520 (0.0017) -[2023-09-26 21:58:15,391][52541] Updated weights for policy 1, policy_version 3520 (0.0016) -[2023-09-26 21:58:17,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 1810432. Throughput: 0: 774.7, 1: 773.8. Samples: 450552. Policy #0 lag: (min: 8.0, avg: 8.0, max: 8.0) -[2023-09-26 21:58:17,915][51558] Avg episode reward: [(0, '-2.631'), (1, '-0.417')] -[2023-09-26 21:58:17,916][52398] Saving new best policy, reward=-0.417! -[2023-09-26 21:58:22,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 1843200. Throughput: 0: 770.0, 1: 769.8. Samples: 459244. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:58:22,915][51558] Avg episode reward: [(0, '-2.631'), (1, '-0.417')] -[2023-09-26 21:58:27,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 1875968. Throughput: 0: 773.7, 1: 773.7. Samples: 468992. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:58:27,915][51558] Avg episode reward: [(0, '-2.631'), (1, '-0.417')] -[2023-09-26 21:58:28,600][52541] Updated weights for policy 1, policy_version 3680 (0.0018) -[2023-09-26 21:58:28,600][52540] Updated weights for policy 0, policy_version 3680 (0.0017) -[2023-09-26 21:58:32,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6280.5, 300 sec: 6164.8). Total num frames: 1908736. Throughput: 0: 775.3, 1: 775.4. Samples: 473611. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 21:58:32,915][51558] Avg episode reward: [(0, '-2.511'), (1, '-0.352')] -[2023-09-26 21:58:32,915][52398] Saving new best policy, reward=-0.352! -[2023-09-26 21:58:37,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6164.8). Total num frames: 1941504. Throughput: 0: 769.0, 1: 769.8. Samples: 482943. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 21:58:37,915][51558] Avg episode reward: [(0, '-2.511'), (1, '-0.352')] -[2023-09-26 21:58:41,748][52540] Updated weights for policy 0, policy_version 3840 (0.0018) -[2023-09-26 21:58:41,749][52541] Updated weights for policy 1, policy_version 3840 (0.0018) -[2023-09-26 21:58:42,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 1966080. Throughput: 0: 768.0, 1: 767.7. Samples: 492297. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 21:58:42,914][51558] Avg episode reward: [(0, '-2.511'), (1, '-0.352')] -[2023-09-26 21:58:47,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 1998848. Throughput: 0: 768.0, 1: 768.5. Samples: 497132. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 21:58:47,915][51558] Avg episode reward: [(0, '-2.402'), (1, '-0.522')] -[2023-09-26 21:58:47,916][52310] Saving new best policy, reward=-2.402! -[2023-09-26 21:58:52,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 2031616. Throughput: 0: 769.3, 1: 769.1. Samples: 505856. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:58:52,915][51558] Avg episode reward: [(0, '-2.402'), (1, '-0.522')] -[2023-09-26 21:58:55,274][52541] Updated weights for policy 1, policy_version 4000 (0.0018) -[2023-09-26 21:58:55,276][52540] Updated weights for policy 0, policy_version 4000 (0.0016) -[2023-09-26 21:58:57,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6007.5, 300 sec: 6137.1). Total num frames: 2056192. Throughput: 0: 767.0, 1: 764.8. Samples: 514924. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:58:57,915][51558] Avg episode reward: [(0, '-2.402'), (1, '-0.522')] -[2023-09-26 21:59:02,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2088960. Throughput: 0: 763.3, 1: 762.0. Samples: 519191. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 21:59:02,915][51558] Avg episode reward: [(0, '-2.281'), (1, '-0.844')] -[2023-09-26 21:59:02,916][52310] Saving new best policy, reward=-2.281! -[2023-09-26 21:59:07,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2121728. Throughput: 0: 767.9, 1: 768.2. Samples: 528366. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:59:07,914][51558] Avg episode reward: [(0, '-2.281'), (1, '-0.844')] -[2023-09-26 21:59:07,918][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000004144_1060864.pth... -[2023-09-26 21:59:07,918][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000004144_1060864.pth... -[2023-09-26 21:59:07,947][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000001264_323584.pth -[2023-09-26 21:59:07,956][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000001264_323584.pth -[2023-09-26 21:59:09,005][52540] Updated weights for policy 0, policy_version 4160 (0.0015) -[2023-09-26 21:59:09,006][52541] Updated weights for policy 1, policy_version 4160 (0.0017) -[2023-09-26 21:59:12,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6007.5, 300 sec: 6137.1). Total num frames: 2146304. Throughput: 0: 761.6, 1: 762.0. Samples: 537554. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:59:12,915][51558] Avg episode reward: [(0, '-2.150'), (1, '-0.770')] -[2023-09-26 21:59:12,980][52310] Saving new best policy, reward=-2.150! -[2023-09-26 21:59:17,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2179072. Throughput: 0: 761.4, 1: 762.8. Samples: 542197. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:59:17,915][51558] Avg episode reward: [(0, '-2.150'), (1, '-0.770')] -[2023-09-26 21:59:22,521][52540] Updated weights for policy 0, policy_version 4320 (0.0013) -[2023-09-26 21:59:22,521][52541] Updated weights for policy 1, policy_version 4320 (0.0015) -[2023-09-26 21:59:22,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2211840. Throughput: 0: 757.2, 1: 756.5. Samples: 551059. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 21:59:22,915][51558] Avg episode reward: [(0, '-2.150'), (1, '-0.770')] -[2023-09-26 21:59:27,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6007.5, 300 sec: 6137.1). Total num frames: 2236416. Throughput: 0: 753.3, 1: 753.6. Samples: 560109. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:59:27,915][51558] Avg episode reward: [(0, '-1.560'), (1, '-0.230')] -[2023-09-26 21:59:27,925][52398] Saving new best policy, reward=-0.230! -[2023-09-26 21:59:27,945][52310] Saving new best policy, reward=-1.560! -[2023-09-26 21:59:32,914][51558] Fps is (10 sec: 5734.6, 60 sec: 6007.5, 300 sec: 6137.1). Total num frames: 2269184. Throughput: 0: 753.6, 1: 753.6. Samples: 564953. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:59:32,914][51558] Avg episode reward: [(0, '-1.560'), (1, '-0.230')] -[2023-09-26 21:59:35,966][52541] Updated weights for policy 1, policy_version 4480 (0.0015) -[2023-09-26 21:59:35,966][52540] Updated weights for policy 0, policy_version 4480 (0.0015) -[2023-09-26 21:59:37,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6007.5, 300 sec: 6137.1). Total num frames: 2301952. Throughput: 0: 754.7, 1: 755.0. Samples: 573793. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 21:59:37,915][51558] Avg episode reward: [(0, '-1.560'), (1, '-0.230')] -[2023-09-26 21:59:42,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2334720. Throughput: 0: 757.2, 1: 759.2. Samples: 583163. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:59:42,915][51558] Avg episode reward: [(0, '-1.560'), (1, '-0.230')] -[2023-09-26 21:59:47,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6007.5, 300 sec: 6109.3). Total num frames: 2359296. Throughput: 0: 761.3, 1: 762.1. Samples: 587746. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:59:47,915][51558] Avg episode reward: [(0, '-1.560'), (1, '-0.230')] -[2023-09-26 21:59:49,204][52540] Updated weights for policy 0, policy_version 4640 (0.0018) -[2023-09-26 21:59:49,204][52541] Updated weights for policy 1, policy_version 4640 (0.0015) -[2023-09-26 21:59:52,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6007.5, 300 sec: 6137.1). Total num frames: 2392064. Throughput: 0: 765.2, 1: 764.7. Samples: 597210. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:59:52,915][51558] Avg episode reward: [(0, '-1.560'), (1, '-0.230')] -[2023-09-26 21:59:57,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2424832. Throughput: 0: 763.9, 1: 764.0. Samples: 606307. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 21:59:57,915][51558] Avg episode reward: [(0, '-1.560'), (1, '-0.240')] -[2023-09-26 22:00:02,428][52540] Updated weights for policy 0, policy_version 4800 (0.0018) -[2023-09-26 22:00:02,429][52541] Updated weights for policy 1, policy_version 4800 (0.0018) -[2023-09-26 22:00:02,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2457600. Throughput: 0: 764.5, 1: 762.6. Samples: 610917. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:00:02,915][51558] Avg episode reward: [(0, '-1.560'), (1, '-0.240')] -[2023-09-26 22:00:07,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 2490368. Throughput: 0: 770.3, 1: 771.0. Samples: 620417. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:00:07,915][51558] Avg episode reward: [(0, '-1.560'), (1, '-0.240')] -[2023-09-26 22:00:12,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2514944. Throughput: 0: 766.7, 1: 766.2. Samples: 629090. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:00:12,915][51558] Avg episode reward: [(0, '-1.560'), (1, '-0.260')] -[2023-09-26 22:00:15,973][52540] Updated weights for policy 0, policy_version 4960 (0.0017) -[2023-09-26 22:00:15,973][52541] Updated weights for policy 1, policy_version 4960 (0.0017) -[2023-09-26 22:00:17,914][51558] Fps is (10 sec: 5734.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2547712. Throughput: 0: 765.0, 1: 764.7. Samples: 633789. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:00:17,915][51558] Avg episode reward: [(0, '-1.560'), (1, '-0.260')] -[2023-09-26 22:00:22,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2580480. Throughput: 0: 767.6, 1: 768.2. Samples: 642905. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:00:22,915][51558] Avg episode reward: [(0, '-1.560'), (1, '-0.260')] -[2023-09-26 22:00:27,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2605056. Throughput: 0: 764.6, 1: 764.0. Samples: 651951. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:00:27,915][51558] Avg episode reward: [(0, '-1.290'), (1, '-0.280')] -[2023-09-26 22:00:28,057][52310] Saving new best policy, reward=-1.290! -[2023-09-26 22:00:29,377][52541] Updated weights for policy 1, policy_version 5120 (0.0017) -[2023-09-26 22:00:29,377][52540] Updated weights for policy 0, policy_version 5120 (0.0017) -[2023-09-26 22:00:32,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2637824. Throughput: 0: 767.2, 1: 769.0. Samples: 656872. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:00:32,915][51558] Avg episode reward: [(0, '-1.290'), (1, '-0.280')] -[2023-09-26 22:00:37,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2670592. Throughput: 0: 760.6, 1: 761.0. Samples: 665686. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:00:37,915][51558] Avg episode reward: [(0, '-0.850'), (1, '-0.280')] -[2023-09-26 22:00:37,923][52310] Saving new best policy, reward=-0.850! -[2023-09-26 22:00:42,615][52540] Updated weights for policy 0, policy_version 5280 (0.0017) -[2023-09-26 22:00:42,616][52541] Updated weights for policy 1, policy_version 5280 (0.0018) -[2023-09-26 22:00:42,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2703360. Throughput: 0: 767.6, 1: 767.1. Samples: 675369. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:00:42,915][51558] Avg episode reward: [(0, '-0.850'), (1, '-0.280')] -[2023-09-26 22:00:47,914][51558] Fps is (10 sec: 6143.9, 60 sec: 6212.3, 300 sec: 6150.9). Total num frames: 2732032. Throughput: 0: 766.9, 1: 766.9. Samples: 679936. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) -[2023-09-26 22:00:47,915][51558] Avg episode reward: [(0, '-0.850'), (1, '-0.280')] -[2023-09-26 22:00:52,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2760704. Throughput: 0: 763.2, 1: 763.5. Samples: 689121. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:00:52,915][51558] Avg episode reward: [(0, '-0.810'), (1, '-0.280')] -[2023-09-26 22:00:52,930][52310] Saving new best policy, reward=-0.810! -[2023-09-26 22:00:56,092][52541] Updated weights for policy 1, policy_version 5440 (0.0019) -[2023-09-26 22:00:56,092][52540] Updated weights for policy 0, policy_version 5440 (0.0018) -[2023-09-26 22:00:57,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2793472. Throughput: 0: 769.5, 1: 769.2. Samples: 698329. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:00:57,915][51558] Avg episode reward: [(0, '-0.810'), (1, '-0.280')] -[2023-09-26 22:01:02,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2826240. Throughput: 0: 763.3, 1: 763.1. Samples: 702477. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:01:02,915][51558] Avg episode reward: [(0, '-0.810'), (1, '-0.280')] -[2023-09-26 22:01:07,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6007.5, 300 sec: 6109.3). Total num frames: 2850816. Throughput: 0: 767.8, 1: 768.2. Samples: 712022. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:01:07,915][51558] Avg episode reward: [(0, '-0.780'), (1, '0.010')] -[2023-09-26 22:01:07,927][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000005568_1425408.pth... -[2023-09-26 22:01:07,928][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000005568_1425408.pth... -[2023-09-26 22:01:07,967][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000002704_692224.pth -[2023-09-26 22:01:07,967][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000002704_692224.pth -[2023-09-26 22:01:07,970][52398] Saving new best policy, reward=0.010! -[2023-09-26 22:01:07,971][52310] Saving new best policy, reward=-0.780! -[2023-09-26 22:01:09,547][52541] Updated weights for policy 1, policy_version 5600 (0.0015) -[2023-09-26 22:01:09,547][52540] Updated weights for policy 0, policy_version 5600 (0.0018) -[2023-09-26 22:01:12,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2883584. Throughput: 0: 767.0, 1: 767.1. Samples: 720986. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:01:12,915][51558] Avg episode reward: [(0, '-0.780'), (1, '0.010')] -[2023-09-26 22:01:17,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2916352. Throughput: 0: 764.2, 1: 763.6. Samples: 725625. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:01:17,915][51558] Avg episode reward: [(0, '-0.780'), (1, '0.010')] -[2023-09-26 22:01:22,724][52541] Updated weights for policy 1, policy_version 5760 (0.0019) -[2023-09-26 22:01:22,724][52540] Updated weights for policy 0, policy_version 5760 (0.0019) -[2023-09-26 22:01:22,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 2949120. Throughput: 0: 772.6, 1: 772.6. Samples: 735218. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:01:22,915][51558] Avg episode reward: [(0, '-0.730'), (1, '-0.080')] -[2023-09-26 22:01:22,925][52310] Saving new best policy, reward=-0.730! -[2023-09-26 22:01:27,914][51558] Fps is (10 sec: 5734.6, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 2973696. Throughput: 0: 766.0, 1: 766.5. Samples: 744333. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:01:27,914][51558] Avg episode reward: [(0, '-0.730'), (1, '-0.080')] -[2023-09-26 22:01:32,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6137.0). Total num frames: 3006464. Throughput: 0: 768.1, 1: 768.2. Samples: 749068. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:01:32,915][51558] Avg episode reward: [(0, '-0.730'), (1, '-0.080')] -[2023-09-26 22:01:36,078][52540] Updated weights for policy 0, policy_version 5920 (0.0017) -[2023-09-26 22:01:36,078][52541] Updated weights for policy 1, policy_version 5920 (0.0017) -[2023-09-26 22:01:37,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3039232. Throughput: 0: 764.5, 1: 763.9. Samples: 757899. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) -[2023-09-26 22:01:37,915][51558] Avg episode reward: [(0, '-0.690'), (1, '-0.050')] -[2023-09-26 22:01:37,926][52310] Saving new best policy, reward=-0.690! -[2023-09-26 22:01:42,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3072000. Throughput: 0: 768.2, 1: 768.4. Samples: 767474. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-09-26 22:01:42,915][51558] Avg episode reward: [(0, '-0.690'), (1, '-0.050')] -[2023-09-26 22:01:47,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6144.0, 300 sec: 6150.9). Total num frames: 3100672. Throughput: 0: 773.6, 1: 773.5. Samples: 772096. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-09-26 22:01:47,915][51558] Avg episode reward: [(0, '0.040'), (1, '-0.010')] -[2023-09-26 22:01:47,916][52310] Saving new best policy, reward=0.040! -[2023-09-26 22:01:49,196][52540] Updated weights for policy 0, policy_version 6080 (0.0016) -[2023-09-26 22:01:49,196][52541] Updated weights for policy 1, policy_version 6080 (0.0018) -[2023-09-26 22:01:52,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3129344. Throughput: 0: 772.7, 1: 772.4. Samples: 781550. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:01:52,915][51558] Avg episode reward: [(0, '0.040'), (1, '-0.010')] -[2023-09-26 22:01:57,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3162112. Throughput: 0: 776.2, 1: 776.0. Samples: 790835. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:01:57,915][51558] Avg episode reward: [(0, '0.040'), (1, '-0.010')] -[2023-09-26 22:02:02,381][52540] Updated weights for policy 0, policy_version 6240 (0.0016) -[2023-09-26 22:02:02,381][52541] Updated weights for policy 1, policy_version 6240 (0.0017) -[2023-09-26 22:02:02,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3194880. Throughput: 0: 775.9, 1: 775.5. Samples: 795438. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-09-26 22:02:02,915][51558] Avg episode reward: [(0, '0.070'), (1, '0.020')] -[2023-09-26 22:02:02,916][52398] Saving new best policy, reward=0.020! -[2023-09-26 22:02:02,916][52310] Saving new best policy, reward=0.070! -[2023-09-26 22:02:07,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.6, 300 sec: 6150.9). Total num frames: 3227648. Throughput: 0: 773.2, 1: 773.6. Samples: 804825. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-09-26 22:02:07,915][51558] Avg episode reward: [(0, '0.070'), (1, '0.020')] -[2023-09-26 22:02:12,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3252224. Throughput: 0: 773.7, 1: 774.2. Samples: 813989. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:02:12,915][51558] Avg episode reward: [(0, '0.070'), (1, '0.020')] -[2023-09-26 22:02:15,616][52540] Updated weights for policy 0, policy_version 6400 (0.0017) -[2023-09-26 22:02:15,616][52541] Updated weights for policy 1, policy_version 6400 (0.0017) -[2023-09-26 22:02:17,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3284992. Throughput: 0: 774.1, 1: 774.3. Samples: 818745. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:02:17,915][51558] Avg episode reward: [(0, '0.090'), (1, '0.050')] -[2023-09-26 22:02:17,917][52310] Saving new best policy, reward=0.090! -[2023-09-26 22:02:17,917][52398] Saving new best policy, reward=0.050! -[2023-09-26 22:02:22,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3317760. Throughput: 0: 775.8, 1: 775.7. Samples: 827715. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:02:22,915][51558] Avg episode reward: [(0, '0.090'), (1, '0.050')] -[2023-09-26 22:02:27,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6280.5, 300 sec: 6164.8). Total num frames: 3350528. Throughput: 0: 773.4, 1: 774.7. Samples: 837140. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:02:27,914][51558] Avg episode reward: [(0, '0.090'), (1, '0.050')] -[2023-09-26 22:02:28,934][52540] Updated weights for policy 0, policy_version 6560 (0.0015) -[2023-09-26 22:02:28,935][52541] Updated weights for policy 1, policy_version 6560 (0.0018) -[2023-09-26 22:02:32,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3375104. Throughput: 0: 773.7, 1: 773.7. Samples: 841728. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:02:32,915][51558] Avg episode reward: [(0, '-0.130'), (1, '0.070')] -[2023-09-26 22:02:32,961][52398] Saving new best policy, reward=0.070! -[2023-09-26 22:02:37,914][51558] Fps is (10 sec: 5734.2, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3407872. Throughput: 0: 772.2, 1: 769.6. Samples: 850930. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:02:37,915][51558] Avg episode reward: [(0, '-0.130'), (1, '0.070')] -[2023-09-26 22:02:42,254][52540] Updated weights for policy 0, policy_version 6720 (0.0015) -[2023-09-26 22:02:42,261][52541] Updated weights for policy 1, policy_version 6720 (0.0018) -[2023-09-26 22:02:42,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3440640. Throughput: 0: 770.4, 1: 770.2. Samples: 860160. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:02:42,915][51558] Avg episode reward: [(0, '-0.120'), (1, '0.080')] -[2023-09-26 22:02:42,917][52398] Saving new best policy, reward=0.080! -[2023-09-26 22:02:47,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6212.3, 300 sec: 6137.1). Total num frames: 3473408. Throughput: 0: 767.6, 1: 768.2. Samples: 864547. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:02:47,915][51558] Avg episode reward: [(0, '-0.100'), (1, '0.090')] -[2023-09-26 22:02:47,917][52398] Saving new best policy, reward=0.090! -[2023-09-26 22:02:52,914][51558] Fps is (10 sec: 6143.9, 60 sec: 6212.3, 300 sec: 6123.2). Total num frames: 3502080. Throughput: 0: 771.6, 1: 767.9. Samples: 874105. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:02:52,915][51558] Avg episode reward: [(0, '-0.100'), (1, '0.090')] -[2023-09-26 22:02:55,631][52541] Updated weights for policy 1, policy_version 6880 (0.0019) -[2023-09-26 22:02:55,632][52540] Updated weights for policy 0, policy_version 6880 (0.0020) -[2023-09-26 22:02:57,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3530752. Throughput: 0: 766.8, 1: 766.2. Samples: 882976. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:02:57,915][51558] Avg episode reward: [(0, '-0.090'), (1, '0.120')] -[2023-09-26 22:02:57,917][52398] Saving new best policy, reward=0.120! -[2023-09-26 22:03:02,914][51558] Fps is (10 sec: 6144.2, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3563520. Throughput: 0: 767.1, 1: 766.2. Samples: 887743. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:03:02,915][51558] Avg episode reward: [(0, '-0.090'), (1, '0.120')] -[2023-09-26 22:03:07,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3596288. Throughput: 0: 770.4, 1: 770.1. Samples: 897036. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:03:07,915][51558] Avg episode reward: [(0, '-0.090'), (1, '0.120')] -[2023-09-26 22:03:07,925][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000007024_1798144.pth... -[2023-09-26 22:03:07,925][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000007024_1798144.pth... -[2023-09-26 22:03:07,959][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000004144_1060864.pth -[2023-09-26 22:03:07,965][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000004144_1060864.pth -[2023-09-26 22:03:08,824][52541] Updated weights for policy 1, policy_version 7040 (0.0017) -[2023-09-26 22:03:08,824][52540] Updated weights for policy 0, policy_version 7040 (0.0018) -[2023-09-26 22:03:12,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6164.8). Total num frames: 3629056. Throughput: 0: 771.9, 1: 768.5. Samples: 906458. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:03:12,915][51558] Avg episode reward: [(0, '-0.060'), (1, '0.140')] -[2023-09-26 22:03:12,916][52398] Saving new best policy, reward=0.140! -[2023-09-26 22:03:17,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3653632. Throughput: 0: 772.4, 1: 772.0. Samples: 911230. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:03:17,915][51558] Avg episode reward: [(0, '-0.060'), (1, '0.140')] -[2023-09-26 22:03:22,031][52540] Updated weights for policy 0, policy_version 7200 (0.0016) -[2023-09-26 22:03:22,031][52541] Updated weights for policy 1, policy_version 7200 (0.0017) -[2023-09-26 22:03:22,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3686400. Throughput: 0: 770.9, 1: 773.0. Samples: 920405. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:03:22,915][51558] Avg episode reward: [(0, '-0.060'), (1, '0.140')] -[2023-09-26 22:03:27,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3719168. Throughput: 0: 773.7, 1: 773.7. Samples: 929792. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:03:27,915][51558] Avg episode reward: [(0, '-0.040'), (1, '0.150')] -[2023-09-26 22:03:27,915][52398] Saving new best policy, reward=0.150! -[2023-09-26 22:03:32,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6137.1). Total num frames: 3751936. Throughput: 0: 770.9, 1: 770.2. Samples: 933898. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:03:32,915][51558] Avg episode reward: [(0, '-0.040'), (1, '0.150')] -[2023-09-26 22:03:35,508][52541] Updated weights for policy 1, policy_version 7360 (0.0018) -[2023-09-26 22:03:35,508][52540] Updated weights for policy 0, policy_version 7360 (0.0018) -[2023-09-26 22:03:37,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3776512. Throughput: 0: 767.0, 1: 769.6. Samples: 943253. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:03:37,915][51558] Avg episode reward: [(0, '-0.040'), (1, '0.150')] -[2023-09-26 22:03:42,914][51558] Fps is (10 sec: 4915.2, 60 sec: 6007.5, 300 sec: 6109.3). Total num frames: 3801088. Throughput: 0: 749.9, 1: 750.0. Samples: 950471. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:03:42,915][51558] Avg episode reward: [(0, '0.280'), (1, '0.150')] -[2023-09-26 22:03:42,916][52310] Saving new best policy, reward=0.280! -[2023-09-26 22:03:47,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6007.5, 300 sec: 6109.3). Total num frames: 3833856. Throughput: 0: 747.3, 1: 748.2. Samples: 955039. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:03:47,915][51558] Avg episode reward: [(0, '0.280'), (1, '0.150')] -[2023-09-26 22:03:50,035][52541] Updated weights for policy 1, policy_version 7520 (0.0017) -[2023-09-26 22:03:50,035][52540] Updated weights for policy 0, policy_version 7520 (0.0016) -[2023-09-26 22:03:52,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6075.7, 300 sec: 6137.1). Total num frames: 3866624. Throughput: 0: 750.8, 1: 750.8. Samples: 964608. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:03:52,915][51558] Avg episode reward: [(0, '0.280'), (1, '0.150')] -[2023-09-26 22:03:57,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 3899392. Throughput: 0: 748.3, 1: 751.0. Samples: 973928. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:03:57,915][51558] Avg episode reward: [(0, '0.400'), (1, '0.120')] -[2023-09-26 22:03:57,916][52310] Saving new best policy, reward=0.400! -[2023-09-26 22:04:02,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6007.5, 300 sec: 6109.3). Total num frames: 3923968. Throughput: 0: 747.7, 1: 748.8. Samples: 978572. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:04:02,915][51558] Avg episode reward: [(0, '0.400'), (1, '0.120')] -[2023-09-26 22:04:03,196][52540] Updated weights for policy 0, policy_version 7680 (0.0017) -[2023-09-26 22:04:03,197][52541] Updated weights for policy 1, policy_version 7680 (0.0018) -[2023-09-26 22:04:07,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6007.5, 300 sec: 6137.1). Total num frames: 3956736. Throughput: 0: 748.9, 1: 748.3. Samples: 987779. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 22:04:07,915][51558] Avg episode reward: [(0, '0.500'), (1, '0.110')] -[2023-09-26 22:04:07,927][52310] Saving new best policy, reward=0.500! -[2023-09-26 22:04:12,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6007.5, 300 sec: 6137.1). Total num frames: 3989504. Throughput: 0: 749.1, 1: 749.9. Samples: 997249. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:04:12,915][51558] Avg episode reward: [(0, '0.520'), (1, '0.110')] -[2023-09-26 22:04:12,917][52310] Saving new best policy, reward=0.520! -[2023-09-26 22:04:16,564][52540] Updated weights for policy 0, policy_version 7840 (0.0017) -[2023-09-26 22:04:16,564][52541] Updated weights for policy 1, policy_version 7840 (0.0017) -[2023-09-26 22:04:17,914][51558] Fps is (10 sec: 6144.1, 60 sec: 6075.7, 300 sec: 6123.2). Total num frames: 4018176. Throughput: 0: 750.8, 1: 750.8. Samples: 1001472. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:04:17,915][51558] Avg episode reward: [(0, '0.520'), (1, '0.110')] -[2023-09-26 22:04:22,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6007.4, 300 sec: 6137.1). Total num frames: 4046848. Throughput: 0: 746.6, 1: 747.4. Samples: 1010481. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:04:22,916][51558] Avg episode reward: [(0, '0.550'), (1, '0.100')] -[2023-09-26 22:04:22,924][52310] Saving new best policy, reward=0.550! -[2023-09-26 22:04:27,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6007.4, 300 sec: 6137.1). Total num frames: 4079616. Throughput: 0: 771.6, 1: 771.3. Samples: 1019904. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:04:27,915][51558] Avg episode reward: [(0, '0.550'), (1, '0.100')] -[2023-09-26 22:04:29,896][52540] Updated weights for policy 0, policy_version 8000 (0.0017) -[2023-09-26 22:04:29,896][52541] Updated weights for policy 1, policy_version 8000 (0.0017) -[2023-09-26 22:04:32,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6007.5, 300 sec: 6137.1). Total num frames: 4112384. Throughput: 0: 771.7, 1: 771.8. Samples: 1024496. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-09-26 22:04:32,915][51558] Avg episode reward: [(0, '0.550'), (1, '0.100')] -[2023-09-26 22:04:37,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6007.5, 300 sec: 6109.3). Total num frames: 4136960. Throughput: 0: 766.8, 1: 766.1. Samples: 1033592. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-09-26 22:04:37,915][51558] Avg episode reward: [(0, '0.590'), (1, '0.300')] -[2023-09-26 22:04:37,924][52310] Saving new best policy, reward=0.590! -[2023-09-26 22:04:37,924][52398] Saving new best policy, reward=0.300! -[2023-09-26 22:04:42,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 4169728. Throughput: 0: 761.5, 1: 760.8. Samples: 1042432. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:04:42,915][51558] Avg episode reward: [(0, '0.590'), (1, '0.300')] -[2023-09-26 22:04:43,605][52541] Updated weights for policy 1, policy_version 8160 (0.0015) -[2023-09-26 22:04:43,605][52540] Updated weights for policy 0, policy_version 8160 (0.0017) -[2023-09-26 22:04:47,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 4202496. Throughput: 0: 760.9, 1: 760.8. Samples: 1047047. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:04:47,915][51558] Avg episode reward: [(0, '0.590'), (1, '0.300')] -[2023-09-26 22:04:52,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 4235264. Throughput: 0: 766.5, 1: 766.5. Samples: 1056762. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:04:52,915][51558] Avg episode reward: [(0, '0.610'), (1, '0.650')] -[2023-09-26 22:04:52,921][52310] Saving new best policy, reward=0.610! -[2023-09-26 22:04:52,921][52398] Saving new best policy, reward=0.650! -[2023-09-26 22:04:56,676][52540] Updated weights for policy 0, policy_version 8320 (0.0018) -[2023-09-26 22:04:56,676][52541] Updated weights for policy 1, policy_version 8320 (0.0015) -[2023-09-26 22:04:57,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6007.5, 300 sec: 6109.3). Total num frames: 4259840. Throughput: 0: 761.6, 1: 762.1. Samples: 1065815. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:04:57,915][51558] Avg episode reward: [(0, '0.610'), (1, '0.650')] -[2023-09-26 22:05:02,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 4292608. Throughput: 0: 760.5, 1: 760.4. Samples: 1069913. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-09-26 22:05:02,915][51558] Avg episode reward: [(0, '0.610'), (1, '0.650')] -[2023-09-26 22:05:07,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6075.7, 300 sec: 6123.2). Total num frames: 4321280. Throughput: 0: 761.8, 1: 762.0. Samples: 1079050. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-09-26 22:05:07,915][51558] Avg episode reward: [(0, '0.610'), (1, '0.490')] -[2023-09-26 22:05:07,927][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000008448_2162688.pth... -[2023-09-26 22:05:07,927][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000008448_2162688.pth... -[2023-09-26 22:05:07,955][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000005568_1425408.pth -[2023-09-26 22:05:07,963][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000005568_1425408.pth -[2023-09-26 22:05:10,635][52540] Updated weights for policy 0, policy_version 8480 (0.0018) -[2023-09-26 22:05:10,636][52541] Updated weights for policy 1, policy_version 8480 (0.0018) -[2023-09-26 22:05:12,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6007.5, 300 sec: 6109.3). Total num frames: 4349952. Throughput: 0: 754.6, 1: 755.0. Samples: 1087835. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) -[2023-09-26 22:05:12,915][51558] Avg episode reward: [(0, '0.610'), (1, '0.490')] -[2023-09-26 22:05:17,914][51558] Fps is (10 sec: 6144.1, 60 sec: 6075.7, 300 sec: 6109.3). Total num frames: 4382720. Throughput: 0: 757.5, 1: 757.1. Samples: 1092651. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) -[2023-09-26 22:05:17,915][51558] Avg episode reward: [(0, '0.610'), (1, '0.490')] -[2023-09-26 22:05:22,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 4415488. Throughput: 0: 757.8, 1: 758.5. Samples: 1101825. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:05:22,914][51558] Avg episode reward: [(0, '0.610'), (1, '0.470')] -[2023-09-26 22:05:23,901][52540] Updated weights for policy 0, policy_version 8640 (0.0017) -[2023-09-26 22:05:23,902][52541] Updated weights for policy 1, policy_version 8640 (0.0017) -[2023-09-26 22:05:27,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 4448256. Throughput: 0: 765.0, 1: 765.1. Samples: 1111288. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:05:27,915][51558] Avg episode reward: [(0, '0.610'), (1, '0.470')] -[2023-09-26 22:05:32,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6007.5, 300 sec: 6109.3). Total num frames: 4472832. Throughput: 0: 765.4, 1: 764.0. Samples: 1115871. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:05:32,915][51558] Avg episode reward: [(0, '0.620'), (1, '0.470')] -[2023-09-26 22:05:32,916][52310] Saving new best policy, reward=0.620! -[2023-09-26 22:05:37,532][52541] Updated weights for policy 1, policy_version 8800 (0.0019) -[2023-09-26 22:05:37,532][52540] Updated weights for policy 0, policy_version 8800 (0.0016) -[2023-09-26 22:05:37,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 4505600. Throughput: 0: 751.9, 1: 752.3. Samples: 1124451. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:05:37,915][51558] Avg episode reward: [(0, '0.620'), (1, '0.470')] -[2023-09-26 22:05:42,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6007.5, 300 sec: 6095.4). Total num frames: 4530176. Throughput: 0: 751.1, 1: 750.3. Samples: 1133377. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:05:42,915][51558] Avg episode reward: [(0, '0.620'), (1, '0.470')] -[2023-09-26 22:05:47,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6007.5, 300 sec: 6109.3). Total num frames: 4562944. Throughput: 0: 758.0, 1: 758.2. Samples: 1138141. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:05:47,915][51558] Avg episode reward: [(0, '0.610'), (1, '0.460')] -[2023-09-26 22:05:50,961][52540] Updated weights for policy 0, policy_version 8960 (0.0015) -[2023-09-26 22:05:50,962][52541] Updated weights for policy 1, policy_version 8960 (0.0015) -[2023-09-26 22:05:52,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6007.4, 300 sec: 6109.3). Total num frames: 4595712. Throughput: 0: 757.7, 1: 757.5. Samples: 1147234. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:05:52,915][51558] Avg episode reward: [(0, '0.610'), (1, '0.460')] -[2023-09-26 22:05:57,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 4628480. Throughput: 0: 766.0, 1: 765.3. Samples: 1156746. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:05:57,915][51558] Avg episode reward: [(0, '0.610'), (1, '0.460')] -[2023-09-26 22:06:02,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 4661248. Throughput: 0: 761.8, 1: 761.8. Samples: 1161216. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:06:02,915][51558] Avg episode reward: [(0, '0.620'), (1, '0.480')] -[2023-09-26 22:06:04,250][52541] Updated weights for policy 1, policy_version 9120 (0.0019) -[2023-09-26 22:06:04,250][52540] Updated weights for policy 0, policy_version 9120 (0.0019) -[2023-09-26 22:06:07,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6075.8, 300 sec: 6109.3). Total num frames: 4685824. Throughput: 0: 761.3, 1: 762.4. Samples: 1170389. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:06:07,915][51558] Avg episode reward: [(0, '0.620'), (1, '0.480')] -[2023-09-26 22:06:12,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 4718592. Throughput: 0: 759.6, 1: 759.5. Samples: 1179648. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:06:12,915][51558] Avg episode reward: [(0, '0.620'), (1, '0.480')] -[2023-09-26 22:06:17,594][52541] Updated weights for policy 1, policy_version 9280 (0.0017) -[2023-09-26 22:06:17,594][52540] Updated weights for policy 0, policy_version 9280 (0.0019) -[2023-09-26 22:06:17,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 4751360. Throughput: 0: 756.6, 1: 757.9. Samples: 1184020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:06:17,915][51558] Avg episode reward: [(0, '0.630'), (1, '0.500')] -[2023-09-26 22:06:17,917][52310] Saving new best policy, reward=0.630! -[2023-09-26 22:06:22,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 4784128. Throughput: 0: 768.8, 1: 768.7. Samples: 1193638. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:06:22,915][51558] Avg episode reward: [(0, '0.630'), (1, '0.500')] -[2023-09-26 22:06:27,914][51558] Fps is (10 sec: 5734.6, 60 sec: 6007.5, 300 sec: 6109.3). Total num frames: 4808704. Throughput: 0: 769.6, 1: 769.5. Samples: 1202639. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:06:27,914][51558] Avg episode reward: [(0, '0.630'), (1, '0.500')] -[2023-09-26 22:06:31,200][52540] Updated weights for policy 0, policy_version 9440 (0.0017) -[2023-09-26 22:06:31,201][52541] Updated weights for policy 1, policy_version 9440 (0.0018) -[2023-09-26 22:06:32,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 4841472. Throughput: 0: 761.4, 1: 762.5. Samples: 1206716. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:06:32,915][51558] Avg episode reward: [(0, '0.630'), (1, '0.480')] -[2023-09-26 22:06:37,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 4874240. Throughput: 0: 769.4, 1: 768.2. Samples: 1216427. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:06:37,915][51558] Avg episode reward: [(0, '0.630'), (1, '0.480')] -[2023-09-26 22:06:42,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6212.3, 300 sec: 6109.3). Total num frames: 4902912. Throughput: 0: 765.3, 1: 766.1. Samples: 1225660. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:06:42,915][51558] Avg episode reward: [(0, '0.630'), (1, '0.480')] -[2023-09-26 22:06:44,335][52541] Updated weights for policy 1, policy_version 9600 (0.0017) -[2023-09-26 22:06:44,336][52540] Updated weights for policy 0, policy_version 9600 (0.0018) -[2023-09-26 22:06:47,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 4931584. Throughput: 0: 768.4, 1: 768.6. Samples: 1230382. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:06:47,914][51558] Avg episode reward: [(0, '0.630'), (1, '0.480')] -[2023-09-26 22:06:52,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 4964352. Throughput: 0: 771.6, 1: 770.5. Samples: 1239782. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:06:52,915][51558] Avg episode reward: [(0, '0.630'), (1, '0.480')] -[2023-09-26 22:06:57,366][52540] Updated weights for policy 0, policy_version 9760 (0.0015) -[2023-09-26 22:06:57,366][52541] Updated weights for policy 1, policy_version 9760 (0.0016) -[2023-09-26 22:06:57,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 4997120. Throughput: 0: 773.7, 1: 773.7. Samples: 1249279. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) -[2023-09-26 22:06:57,915][51558] Avg episode reward: [(0, '0.630'), (1, '0.490')] -[2023-09-26 22:07:02,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 5029888. Throughput: 0: 772.1, 1: 772.0. Samples: 1253504. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) -[2023-09-26 22:07:02,915][51558] Avg episode reward: [(0, '0.630'), (1, '0.490')] -[2023-09-26 22:07:07,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 5054464. Throughput: 0: 770.4, 1: 768.4. Samples: 1262884. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:07:07,915][51558] Avg episode reward: [(0, '0.630'), (1, '0.490')] -[2023-09-26 22:07:07,923][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000009872_2527232.pth... -[2023-09-26 22:07:07,923][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000009872_2527232.pth... -[2023-09-26 22:07:07,958][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000007024_1798144.pth -[2023-09-26 22:07:07,961][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000007024_1798144.pth -[2023-09-26 22:07:11,155][52540] Updated weights for policy 0, policy_version 9920 (0.0016) -[2023-09-26 22:07:11,155][52541] Updated weights for policy 1, policy_version 9920 (0.0016) -[2023-09-26 22:07:12,914][51558] Fps is (10 sec: 5734.2, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 5087232. Throughput: 0: 766.7, 1: 766.8. Samples: 1271644. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:07:12,915][51558] Avg episode reward: [(0, '0.630'), (1, '0.590')] -[2023-09-26 22:07:17,914][51558] Fps is (10 sec: 6144.1, 60 sec: 6075.8, 300 sec: 6095.4). Total num frames: 5115904. Throughput: 0: 769.4, 1: 768.2. Samples: 1275909. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:07:17,915][51558] Avg episode reward: [(0, '0.630'), (1, '0.590')] -[2023-09-26 22:07:22,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6007.5, 300 sec: 6081.5). Total num frames: 5144576. Throughput: 0: 765.2, 1: 766.2. Samples: 1285339. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:07:22,915][51558] Avg episode reward: [(0, '0.630'), (1, '0.590')] -[2023-09-26 22:07:24,460][52541] Updated weights for policy 1, policy_version 10080 (0.0018) -[2023-09-26 22:07:24,460][52540] Updated weights for policy 0, policy_version 10080 (0.0018) -[2023-09-26 22:07:27,914][51558] Fps is (10 sec: 6143.9, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 5177344. Throughput: 0: 763.4, 1: 762.8. Samples: 1294338. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:07:27,915][51558] Avg episode reward: [(0, '0.650'), (1, '0.600')] -[2023-09-26 22:07:27,916][52310] Saving new best policy, reward=0.650! -[2023-09-26 22:07:32,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 5210112. Throughput: 0: 763.7, 1: 763.9. Samples: 1299124. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:07:32,915][51558] Avg episode reward: [(0, '0.650'), (1, '0.600')] -[2023-09-26 22:07:37,714][52540] Updated weights for policy 0, policy_version 10240 (0.0017) -[2023-09-26 22:07:37,714][52541] Updated weights for policy 1, policy_version 10240 (0.0017) -[2023-09-26 22:07:37,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 5242880. Throughput: 0: 765.5, 1: 765.2. Samples: 1308663. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:07:37,915][51558] Avg episode reward: [(0, '0.650'), (1, '0.600')] -[2023-09-26 22:07:42,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6075.7, 300 sec: 6081.5). Total num frames: 5267456. Throughput: 0: 758.4, 1: 758.8. Samples: 1317553. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:07:42,915][51558] Avg episode reward: [(0, '0.660'), (1, '0.600')] -[2023-09-26 22:07:43,047][52310] Saving new best policy, reward=0.660! -[2023-09-26 22:07:47,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6095.4). Total num frames: 5300224. Throughput: 0: 765.3, 1: 765.9. Samples: 1322407. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:07:47,915][51558] Avg episode reward: [(0, '0.660'), (1, '0.600')] -[2023-09-26 22:07:51,110][52541] Updated weights for policy 1, policy_version 10400 (0.0017) -[2023-09-26 22:07:51,110][52540] Updated weights for policy 0, policy_version 10400 (0.0018) -[2023-09-26 22:07:52,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 5332992. Throughput: 0: 758.8, 1: 760.9. Samples: 1331267. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:07:52,915][51558] Avg episode reward: [(0, '0.660'), (1, '0.600')] -[2023-09-26 22:07:57,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 5365760. Throughput: 0: 769.6, 1: 769.1. Samples: 1340887. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:07:57,914][51558] Avg episode reward: [(0, '0.660'), (1, '0.610')] -[2023-09-26 22:08:02,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6007.4, 300 sec: 6081.5). Total num frames: 5390336. Throughput: 0: 772.5, 1: 771.8. Samples: 1345399. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:08:02,915][51558] Avg episode reward: [(0, '0.660'), (1, '0.610')] -[2023-09-26 22:08:04,415][52540] Updated weights for policy 0, policy_version 10560 (0.0018) -[2023-09-26 22:08:04,415][52541] Updated weights for policy 1, policy_version 10560 (0.0017) -[2023-09-26 22:08:07,914][51558] Fps is (10 sec: 5734.2, 60 sec: 6144.0, 300 sec: 6081.5). Total num frames: 5423104. Throughput: 0: 769.4, 1: 769.4. Samples: 1354586. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:08:07,915][51558] Avg episode reward: [(0, '0.670'), (1, '0.610')] -[2023-09-26 22:08:07,928][52310] Saving new best policy, reward=0.670! -[2023-09-26 22:08:12,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 5455872. Throughput: 0: 773.8, 1: 773.8. Samples: 1363980. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:08:12,915][51558] Avg episode reward: [(0, '0.670'), (1, '0.610')] -[2023-09-26 22:08:17,465][52540] Updated weights for policy 0, policy_version 10720 (0.0017) -[2023-09-26 22:08:17,465][52541] Updated weights for policy 1, policy_version 10720 (0.0015) -[2023-09-26 22:08:17,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6212.3, 300 sec: 6109.3). Total num frames: 5488640. Throughput: 0: 773.4, 1: 773.4. Samples: 1368732. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:08:17,915][51558] Avg episode reward: [(0, '0.670'), (1, '0.610')] -[2023-09-26 22:08:22,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.6, 300 sec: 6109.3). Total num frames: 5521408. Throughput: 0: 771.2, 1: 773.7. Samples: 1378184. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:08:22,914][51558] Avg episode reward: [(0, '0.930'), (1, '0.610')] -[2023-09-26 22:08:22,923][52310] Saving new best policy, reward=0.930! -[2023-09-26 22:08:27,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6081.5). Total num frames: 5545984. Throughput: 0: 772.0, 1: 772.4. Samples: 1387050. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:08:27,915][51558] Avg episode reward: [(0, '0.930'), (1, '0.610')] -[2023-09-26 22:08:30,907][52540] Updated weights for policy 0, policy_version 10880 (0.0016) -[2023-09-26 22:08:30,907][52541] Updated weights for policy 1, policy_version 10880 (0.0017) -[2023-09-26 22:08:32,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 5578752. Throughput: 0: 772.6, 1: 769.6. Samples: 1391804. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:08:32,915][51558] Avg episode reward: [(0, '0.930'), (1, '0.610')] -[2023-09-26 22:08:37,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 5611520. Throughput: 0: 773.2, 1: 772.7. Samples: 1400832. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:08:37,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.630')] -[2023-09-26 22:08:37,922][52310] Saving new best policy, reward=0.940! -[2023-09-26 22:08:42,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 5636096. Throughput: 0: 768.0, 1: 766.8. Samples: 1409953. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:08:42,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.630')] -[2023-09-26 22:08:44,446][52540] Updated weights for policy 0, policy_version 11040 (0.0017) -[2023-09-26 22:08:44,447][52541] Updated weights for policy 1, policy_version 11040 (0.0018) -[2023-09-26 22:08:47,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 5668864. Throughput: 0: 767.6, 1: 767.9. Samples: 1414497. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:08:47,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.630')] -[2023-09-26 22:08:52,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 5701632. Throughput: 0: 765.3, 1: 765.5. Samples: 1423472. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:08:52,915][51558] Avg episode reward: [(0, '0.950'), (1, '0.640')] -[2023-09-26 22:08:52,924][52310] Saving new best policy, reward=0.950! -[2023-09-26 22:08:57,881][52541] Updated weights for policy 1, policy_version 11200 (0.0018) -[2023-09-26 22:08:57,881][52540] Updated weights for policy 0, policy_version 11200 (0.0018) -[2023-09-26 22:08:57,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 5734400. Throughput: 0: 764.3, 1: 764.3. Samples: 1432767. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:08:57,915][51558] Avg episode reward: [(0, '0.950'), (1, '0.640')] -[2023-09-26 22:09:02,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 5758976. Throughput: 0: 761.7, 1: 761.7. Samples: 1437284. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:09:02,914][51558] Avg episode reward: [(0, '0.950'), (1, '0.640')] -[2023-09-26 22:09:07,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 5791744. Throughput: 0: 756.1, 1: 754.0. Samples: 1446139. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:09:07,915][51558] Avg episode reward: [(0, '0.950'), (1, '0.650')] -[2023-09-26 22:09:07,927][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000011312_2895872.pth... -[2023-09-26 22:09:07,927][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000011312_2895872.pth... -[2023-09-26 22:09:07,961][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000008448_2162688.pth -[2023-09-26 22:09:07,965][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000008448_2162688.pth -[2023-09-26 22:09:11,338][52541] Updated weights for policy 1, policy_version 11360 (0.0018) -[2023-09-26 22:09:11,338][52540] Updated weights for policy 0, policy_version 11360 (0.0019) -[2023-09-26 22:09:12,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6123.2). Total num frames: 5824512. Throughput: 0: 763.2, 1: 762.9. Samples: 1455726. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:09:12,915][51558] Avg episode reward: [(0, '0.950'), (1, '0.650')] -[2023-09-26 22:09:17,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 5857280. Throughput: 0: 759.2, 1: 761.4. Samples: 1460229. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:09:17,915][51558] Avg episode reward: [(0, '0.950'), (1, '0.650')] -[2023-09-26 22:09:22,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6007.4, 300 sec: 6109.3). Total num frames: 5881856. Throughput: 0: 762.1, 1: 761.2. Samples: 1469380. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:09:22,915][51558] Avg episode reward: [(0, '0.960'), (1, '0.660')] -[2023-09-26 22:09:22,925][52310] Saving new best policy, reward=0.960! -[2023-09-26 22:09:22,925][52398] Saving new best policy, reward=0.660! -[2023-09-26 22:09:24,605][52541] Updated weights for policy 1, policy_version 11520 (0.0018) -[2023-09-26 22:09:24,605][52540] Updated weights for policy 0, policy_version 11520 (0.0020) -[2023-09-26 22:09:27,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 5914624. Throughput: 0: 762.8, 1: 763.9. Samples: 1478656. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:09:27,915][51558] Avg episode reward: [(0, '0.960'), (1, '0.660')] -[2023-09-26 22:09:32,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 5947392. Throughput: 0: 762.0, 1: 762.9. Samples: 1483120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:09:32,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.690')] -[2023-09-26 22:09:32,915][52310] Saving new best policy, reward=0.980! -[2023-09-26 22:09:32,915][52398] Saving new best policy, reward=0.690! -[2023-09-26 22:09:37,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6007.5, 300 sec: 6109.3). Total num frames: 5971968. Throughput: 0: 765.0, 1: 764.6. Samples: 1492302. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:09:37,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.690')] -[2023-09-26 22:09:38,110][52541] Updated weights for policy 1, policy_version 11680 (0.0017) -[2023-09-26 22:09:38,110][52540] Updated weights for policy 0, policy_version 11680 (0.0017) -[2023-09-26 22:09:42,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 6004736. Throughput: 0: 761.7, 1: 761.8. Samples: 1501322. Policy #0 lag: (min: 8.0, avg: 8.0, max: 8.0) -[2023-09-26 22:09:42,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.690')] -[2023-09-26 22:09:47,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 6037504. Throughput: 0: 762.8, 1: 762.8. Samples: 1505935. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:09:47,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.720')] -[2023-09-26 22:09:47,916][52398] Saving new best policy, reward=0.720! -[2023-09-26 22:09:51,413][52541] Updated weights for policy 1, policy_version 11840 (0.0018) -[2023-09-26 22:09:51,413][52540] Updated weights for policy 0, policy_version 11840 (0.0017) -[2023-09-26 22:09:52,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 6070272. Throughput: 0: 770.3, 1: 770.5. Samples: 1515473. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:09:52,914][51558] Avg episode reward: [(0, '0.980'), (1, '0.720')] -[2023-09-26 22:09:57,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6007.5, 300 sec: 6109.3). Total num frames: 6094848. Throughput: 0: 762.0, 1: 761.9. Samples: 1524303. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:09:57,914][51558] Avg episode reward: [(0, '0.980'), (1, '0.720')] -[2023-09-26 22:10:02,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6123.2). Total num frames: 6127616. Throughput: 0: 763.4, 1: 762.2. Samples: 1528881. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:10:02,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.740')] -[2023-09-26 22:10:02,916][52398] Saving new best policy, reward=0.740! -[2023-09-26 22:10:05,119][52541] Updated weights for policy 1, policy_version 12000 (0.0015) -[2023-09-26 22:10:05,119][52540] Updated weights for policy 0, policy_version 12000 (0.0017) -[2023-09-26 22:10:07,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 6160384. Throughput: 0: 762.2, 1: 763.2. Samples: 1538022. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:10:07,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.740')] -[2023-09-26 22:10:12,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6007.5, 300 sec: 6109.3). Total num frames: 6184960. Throughput: 0: 754.0, 1: 754.4. Samples: 1546531. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:10:12,914][51558] Avg episode reward: [(0, '0.980'), (1, '0.740')] -[2023-09-26 22:10:17,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6007.5, 300 sec: 6109.3). Total num frames: 6217728. Throughput: 0: 754.7, 1: 754.8. Samples: 1551048. Policy #0 lag: (min: 8.0, avg: 8.0, max: 8.0) -[2023-09-26 22:10:17,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.750')] -[2023-09-26 22:10:17,916][52310] Saving new best policy, reward=0.990! -[2023-09-26 22:10:17,917][52398] Saving new best policy, reward=0.750! -[2023-09-26 22:10:18,736][52540] Updated weights for policy 0, policy_version 12160 (0.0017) -[2023-09-26 22:10:18,736][52541] Updated weights for policy 1, policy_version 12160 (0.0017) -[2023-09-26 22:10:22,914][51558] Fps is (10 sec: 6143.9, 60 sec: 6075.7, 300 sec: 6095.4). Total num frames: 6246400. Throughput: 0: 757.2, 1: 755.3. Samples: 1560364. Policy #0 lag: (min: 8.0, avg: 8.0, max: 8.0) -[2023-09-26 22:10:22,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.750')] -[2023-09-26 22:10:27,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6007.5, 300 sec: 6109.3). Total num frames: 6275072. Throughput: 0: 752.6, 1: 752.7. Samples: 1569060. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:10:27,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.750')] -[2023-09-26 22:10:32,198][52540] Updated weights for policy 0, policy_version 12320 (0.0015) -[2023-09-26 22:10:32,198][52541] Updated weights for policy 1, policy_version 12320 (0.0018) -[2023-09-26 22:10:32,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6007.4, 300 sec: 6109.3). Total num frames: 6307840. Throughput: 0: 755.8, 1: 755.5. Samples: 1573945. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:10:32,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.760')] -[2023-09-26 22:10:32,916][52398] Saving new best policy, reward=0.760! -[2023-09-26 22:10:37,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 6340608. Throughput: 0: 752.6, 1: 752.7. Samples: 1583210. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:10:37,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.760')] -[2023-09-26 22:10:42,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 6373376. Throughput: 0: 764.8, 1: 765.4. Samples: 1593159. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:10:42,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.780')] -[2023-09-26 22:10:42,916][52398] Saving new best policy, reward=0.780! -[2023-09-26 22:10:45,068][52540] Updated weights for policy 0, policy_version 12480 (0.0018) -[2023-09-26 22:10:45,069][52541] Updated weights for policy 1, policy_version 12480 (0.0017) -[2023-09-26 22:10:47,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 6406144. Throughput: 0: 762.4, 1: 764.2. Samples: 1597579. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:10:47,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.780')] -[2023-09-26 22:10:52,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6007.4, 300 sec: 6109.3). Total num frames: 6430720. Throughput: 0: 764.3, 1: 765.5. Samples: 1606862. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:10:52,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.780')] -[2023-09-26 22:10:57,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 6463488. Throughput: 0: 770.7, 1: 770.2. Samples: 1615872. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:10:57,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.940')] -[2023-09-26 22:10:57,915][52398] Saving new best policy, reward=0.940! -[2023-09-26 22:10:58,483][52541] Updated weights for policy 1, policy_version 12640 (0.0018) -[2023-09-26 22:10:58,483][52540] Updated weights for policy 0, policy_version 12640 (0.0019) -[2023-09-26 22:11:02,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 6496256. Throughput: 0: 773.3, 1: 773.0. Samples: 1620633. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:11:02,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.940')] -[2023-09-26 22:11:07,914][51558] Fps is (10 sec: 6143.9, 60 sec: 6075.7, 300 sec: 6123.2). Total num frames: 6524928. Throughput: 0: 771.9, 1: 775.1. Samples: 1629977. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:11:07,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.940')] -[2023-09-26 22:11:07,928][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000012752_3264512.pth... -[2023-09-26 22:11:07,928][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000012752_3264512.pth... -[2023-09-26 22:11:07,957][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000009872_2527232.pth -[2023-09-26 22:11:07,960][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000009872_2527232.pth -[2023-09-26 22:11:11,897][52541] Updated weights for policy 1, policy_version 12800 (0.0017) -[2023-09-26 22:11:11,898][52540] Updated weights for policy 0, policy_version 12800 (0.0018) -[2023-09-26 22:11:12,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 6553600. Throughput: 0: 774.9, 1: 776.0. Samples: 1638853. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:11:12,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.960')] -[2023-09-26 22:11:12,916][52398] Saving new best policy, reward=0.960! -[2023-09-26 22:11:17,914][51558] Fps is (10 sec: 6144.1, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 6586368. Throughput: 0: 776.1, 1: 776.1. Samples: 1643792. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:11:17,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.960')] -[2023-09-26 22:11:22,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6212.3, 300 sec: 6137.1). Total num frames: 6619136. Throughput: 0: 772.9, 1: 772.3. Samples: 1652744. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:11:22,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.960')] -[2023-09-26 22:11:25,266][52541] Updated weights for policy 1, policy_version 12960 (0.0016) -[2023-09-26 22:11:25,267][52540] Updated weights for policy 0, policy_version 12960 (0.0016) -[2023-09-26 22:11:27,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 6643712. Throughput: 0: 762.9, 1: 763.1. Samples: 1661827. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:11:27,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.960')] -[2023-09-26 22:11:32,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 6676480. Throughput: 0: 765.0, 1: 764.7. Samples: 1666414. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:11:32,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.960')] -[2023-09-26 22:11:37,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6123.2). Total num frames: 6709248. Throughput: 0: 765.1, 1: 764.5. Samples: 1675694. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:11:37,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.960')] -[2023-09-26 22:11:38,554][52540] Updated weights for policy 0, policy_version 13120 (0.0016) -[2023-09-26 22:11:38,555][52541] Updated weights for policy 1, policy_version 13120 (0.0017) -[2023-09-26 22:11:42,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 6742016. Throughput: 0: 772.5, 1: 770.2. Samples: 1685291. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:11:42,915][51558] Avg episode reward: [(0, '0.960'), (1, '0.980')] -[2023-09-26 22:11:42,916][52398] Saving new best policy, reward=0.980! -[2023-09-26 22:11:47,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 6774784. Throughput: 0: 766.6, 1: 766.7. Samples: 1689629. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:11:47,915][51558] Avg episode reward: [(0, '0.960'), (1, '0.980')] -[2023-09-26 22:11:51,691][52541] Updated weights for policy 1, policy_version 13280 (0.0018) -[2023-09-26 22:11:51,691][52540] Updated weights for policy 0, policy_version 13280 (0.0020) -[2023-09-26 22:11:52,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 6799360. Throughput: 0: 770.6, 1: 769.5. Samples: 1699283. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:11:52,915][51558] Avg episode reward: [(0, '0.950'), (1, '0.980')] -[2023-09-26 22:11:57,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6109.3). Total num frames: 6832128. Throughput: 0: 775.2, 1: 773.9. Samples: 1708566. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:11:57,914][51558] Avg episode reward: [(0, '0.950'), (1, '0.980')] -[2023-09-26 22:12:02,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 6864896. Throughput: 0: 775.2, 1: 775.8. Samples: 1713588. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:12:02,915][51558] Avg episode reward: [(0, '0.950'), (1, '0.980')] -[2023-09-26 22:12:04,602][52540] Updated weights for policy 0, policy_version 13440 (0.0015) -[2023-09-26 22:12:04,602][52541] Updated weights for policy 1, policy_version 13440 (0.0016) -[2023-09-26 22:12:07,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6212.3, 300 sec: 6137.1). Total num frames: 6897664. Throughput: 0: 778.6, 1: 779.5. Samples: 1722859. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:12:07,915][51558] Avg episode reward: [(0, '0.950'), (1, '0.980')] -[2023-09-26 22:12:12,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.6, 300 sec: 6150.9). Total num frames: 6930432. Throughput: 0: 785.2, 1: 784.8. Samples: 1732474. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:12:12,914][51558] Avg episode reward: [(0, '0.950'), (1, '0.980')] -[2023-09-26 22:12:17,740][52541] Updated weights for policy 1, policy_version 13600 (0.0018) -[2023-09-26 22:12:17,740][52540] Updated weights for policy 0, policy_version 13600 (0.0017) -[2023-09-26 22:12:17,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.5, 300 sec: 6164.8). Total num frames: 6963200. Throughput: 0: 781.2, 1: 780.9. Samples: 1736710. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:12:17,914][51558] Avg episode reward: [(0, '0.950'), (1, '0.980')] -[2023-09-26 22:12:22,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 6987776. Throughput: 0: 787.2, 1: 786.4. Samples: 1746503. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:12:22,915][51558] Avg episode reward: [(0, '0.950'), (1, '0.990')] -[2023-09-26 22:12:22,925][52398] Saving new best policy, reward=0.990! -[2023-09-26 22:12:27,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6280.5, 300 sec: 6137.1). Total num frames: 7020544. Throughput: 0: 777.0, 1: 779.8. Samples: 1755349. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:12:27,915][51558] Avg episode reward: [(0, '0.950'), (1, '0.990')] -[2023-09-26 22:12:31,086][52540] Updated weights for policy 0, policy_version 13760 (0.0016) -[2023-09-26 22:12:31,087][52541] Updated weights for policy 1, policy_version 13760 (0.0017) -[2023-09-26 22:12:32,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.6, 300 sec: 6137.1). Total num frames: 7053312. Throughput: 0: 780.8, 1: 782.0. Samples: 1759957. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:12:32,914][51558] Avg episode reward: [(0, '0.950'), (1, '0.990')] -[2023-09-26 22:12:37,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6164.8). Total num frames: 7086080. Throughput: 0: 780.0, 1: 779.7. Samples: 1769472. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:12:37,916][51558] Avg episode reward: [(0, '0.950'), (1, '0.990')] -[2023-09-26 22:12:42,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7110656. Throughput: 0: 776.8, 1: 776.5. Samples: 1778465. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:12:42,915][51558] Avg episode reward: [(0, '0.950'), (1, '0.990')] -[2023-09-26 22:12:44,512][52540] Updated weights for policy 0, policy_version 13920 (0.0018) -[2023-09-26 22:12:44,512][52541] Updated weights for policy 1, policy_version 13920 (0.0017) -[2023-09-26 22:12:47,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7143424. Throughput: 0: 771.4, 1: 769.8. Samples: 1782941. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-09-26 22:12:47,915][51558] Avg episode reward: [(0, '0.950'), (1, '0.990')] -[2023-09-26 22:12:52,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6137.1). Total num frames: 7176192. Throughput: 0: 768.7, 1: 767.9. Samples: 1792005. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-09-26 22:12:52,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:12:57,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6212.2, 300 sec: 6150.9). Total num frames: 7204864. Throughput: 0: 764.2, 1: 763.7. Samples: 1801231. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-09-26 22:12:57,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:12:57,942][52540] Updated weights for policy 0, policy_version 14080 (0.0017) -[2023-09-26 22:12:57,942][52541] Updated weights for policy 1, policy_version 14080 (0.0017) -[2023-09-26 22:13:02,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7233536. Throughput: 0: 769.3, 1: 770.3. Samples: 1805993. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-09-26 22:13:02,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:13:07,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7266304. Throughput: 0: 764.5, 1: 765.4. Samples: 1815348. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-09-26 22:13:07,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:13:07,925][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000014192_3633152.pth... -[2023-09-26 22:13:07,926][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000014192_3633152.pth... -[2023-09-26 22:13:07,961][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000011312_2895872.pth -[2023-09-26 22:13:07,961][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000011312_2895872.pth -[2023-09-26 22:13:10,963][52541] Updated weights for policy 1, policy_version 14240 (0.0016) -[2023-09-26 22:13:10,963][52540] Updated weights for policy 0, policy_version 14240 (0.0015) -[2023-09-26 22:13:12,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7299072. Throughput: 0: 771.6, 1: 771.1. Samples: 1824768. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) -[2023-09-26 22:13:12,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:13:17,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7331840. Throughput: 0: 770.3, 1: 769.6. Samples: 1829251. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:13:17,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:13:22,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7356416. Throughput: 0: 767.8, 1: 766.8. Samples: 1838532. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:13:22,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:13:24,297][52540] Updated weights for policy 0, policy_version 14400 (0.0017) -[2023-09-26 22:13:24,297][52541] Updated weights for policy 1, policy_version 14400 (0.0016) -[2023-09-26 22:13:27,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7389184. Throughput: 0: 771.3, 1: 771.8. Samples: 1847908. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:13:27,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:13:32,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7421952. Throughput: 0: 774.8, 1: 775.5. Samples: 1852703. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:13:32,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:13:37,505][52540] Updated weights for policy 0, policy_version 14560 (0.0017) -[2023-09-26 22:13:37,505][52541] Updated weights for policy 1, policy_version 14560 (0.0017) -[2023-09-26 22:13:37,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 7454720. Throughput: 0: 773.7, 1: 773.7. Samples: 1861637. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:13:37,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:13:42,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6212.3, 300 sec: 6150.9). Total num frames: 7483392. Throughput: 0: 775.3, 1: 773.6. Samples: 1870931. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:13:42,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:13:47,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7512064. Throughput: 0: 772.0, 1: 769.0. Samples: 1875335. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:13:47,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:13:51,254][52541] Updated weights for policy 1, policy_version 14720 (0.0017) -[2023-09-26 22:13:51,254][52540] Updated weights for policy 0, policy_version 14720 (0.0018) -[2023-09-26 22:13:52,914][51558] Fps is (10 sec: 6144.1, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7544832. Throughput: 0: 764.9, 1: 764.2. Samples: 1884160. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:13:52,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:13:57,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6075.7, 300 sec: 6137.1). Total num frames: 7569408. Throughput: 0: 760.9, 1: 760.5. Samples: 1893233. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:13:57,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:14:02,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7602176. Throughput: 0: 762.3, 1: 761.9. Samples: 1897838. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:14:02,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:14:04,777][52541] Updated weights for policy 1, policy_version 14880 (0.0017) -[2023-09-26 22:14:04,778][52540] Updated weights for policy 0, policy_version 14880 (0.0017) -[2023-09-26 22:14:07,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7634944. Throughput: 0: 758.3, 1: 760.0. Samples: 1906856. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:14:07,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:14:12,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7667712. Throughput: 0: 760.7, 1: 759.9. Samples: 1916334. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:14:12,914][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:14:17,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6007.5, 300 sec: 6137.1). Total num frames: 7692288. Throughput: 0: 759.0, 1: 759.3. Samples: 1921024. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:14:17,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:14:17,999][52540] Updated weights for policy 0, policy_version 15040 (0.0018) -[2023-09-26 22:14:17,999][52541] Updated weights for policy 1, policy_version 15040 (0.0017) -[2023-09-26 22:14:22,914][51558] Fps is (10 sec: 5734.2, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7725056. Throughput: 0: 764.2, 1: 763.9. Samples: 1930401. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:14:22,916][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:14:27,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7757824. Throughput: 0: 760.6, 1: 762.1. Samples: 1939456. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:14:27,914][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:14:31,297][52541] Updated weights for policy 1, policy_version 15200 (0.0017) -[2023-09-26 22:14:31,298][52540] Updated weights for policy 0, policy_version 15200 (0.0018) -[2023-09-26 22:14:32,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 7790592. Throughput: 0: 760.7, 1: 763.1. Samples: 1943906. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:14:32,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:14:37,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 7823360. Throughput: 0: 769.2, 1: 769.8. Samples: 1953418. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:14:37,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:14:42,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6075.7, 300 sec: 6137.1). Total num frames: 7847936. Throughput: 0: 769.6, 1: 770.3. Samples: 1962531. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:14:42,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:14:44,543][52540] Updated weights for policy 0, policy_version 15360 (0.0016) -[2023-09-26 22:14:44,543][52541] Updated weights for policy 1, policy_version 15360 (0.0017) -[2023-09-26 22:14:47,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7880704. Throughput: 0: 770.9, 1: 771.7. Samples: 1967255. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:14:47,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:14:52,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 7913472. Throughput: 0: 772.4, 1: 772.2. Samples: 1976360. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:14:52,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:14:57,836][52540] Updated weights for policy 0, policy_version 15520 (0.0017) -[2023-09-26 22:14:57,836][52541] Updated weights for policy 1, policy_version 15520 (0.0018) -[2023-09-26 22:14:57,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6164.8). Total num frames: 7946240. Throughput: 0: 770.5, 1: 771.0. Samples: 1985705. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:14:57,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:15:02,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 7970816. Throughput: 0: 771.6, 1: 771.5. Samples: 1990461. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:15:02,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:15:07,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8003584. Throughput: 0: 771.6, 1: 772.2. Samples: 1999876. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:15:07,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.990')] -[2023-09-26 22:15:07,924][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000015632_4001792.pth... -[2023-09-26 22:15:07,924][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000015632_4001792.pth... -[2023-09-26 22:15:07,953][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000012752_3264512.pth -[2023-09-26 22:15:07,964][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000012752_3264512.pth -[2023-09-26 22:15:11,105][52540] Updated weights for policy 0, policy_version 15680 (0.0018) -[2023-09-26 22:15:11,106][52541] Updated weights for policy 1, policy_version 15680 (0.0016) -[2023-09-26 22:15:12,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8036352. Throughput: 0: 772.8, 1: 771.9. Samples: 2008971. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 22:15:12,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.840')] -[2023-09-26 22:15:17,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6280.5, 300 sec: 6178.7). Total num frames: 8069120. Throughput: 0: 770.1, 1: 769.6. Samples: 2013193. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 22:15:17,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.840')] -[2023-09-26 22:15:22,914][51558] Fps is (10 sec: 6143.9, 60 sec: 6212.3, 300 sec: 6178.7). Total num frames: 8097792. Throughput: 0: 772.4, 1: 773.0. Samples: 2022962. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 22:15:22,916][51558] Avg episode reward: [(0, '0.590'), (1, '0.840')] -[2023-09-26 22:15:24,284][52541] Updated weights for policy 1, policy_version 15840 (0.0016) -[2023-09-26 22:15:24,284][52540] Updated weights for policy 0, policy_version 15840 (0.0016) -[2023-09-26 22:15:27,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8126464. Throughput: 0: 769.2, 1: 769.3. Samples: 2031761. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 22:15:27,915][51558] Avg episode reward: [(0, '0.590'), (1, '0.840')] -[2023-09-26 22:15:32,914][51558] Fps is (10 sec: 6144.1, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8159232. Throughput: 0: 771.1, 1: 769.7. Samples: 2036590. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:15:32,915][51558] Avg episode reward: [(0, '0.590'), (1, '0.840')] -[2023-09-26 22:15:37,474][52540] Updated weights for policy 0, policy_version 16000 (0.0016) -[2023-09-26 22:15:37,475][52541] Updated weights for policy 1, policy_version 16000 (0.0018) -[2023-09-26 22:15:37,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8192000. Throughput: 0: 773.5, 1: 773.1. Samples: 2045955. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:15:37,915][51558] Avg episode reward: [(0, '0.590'), (1, '0.840')] -[2023-09-26 22:15:42,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6164.8). Total num frames: 8224768. Throughput: 0: 776.9, 1: 777.2. Samples: 2055639. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:15:42,915][51558] Avg episode reward: [(0, '0.590'), (1, '0.840')] -[2023-09-26 22:15:47,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 8257536. Throughput: 0: 775.8, 1: 775.9. Samples: 2060288. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:15:47,915][51558] Avg episode reward: [(0, '0.590'), (1, '0.840')] -[2023-09-26 22:15:50,568][52540] Updated weights for policy 0, policy_version 16160 (0.0018) -[2023-09-26 22:15:50,568][52541] Updated weights for policy 1, policy_version 16160 (0.0016) -[2023-09-26 22:15:52,914][51558] Fps is (10 sec: 5734.2, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8282112. Throughput: 0: 775.3, 1: 775.1. Samples: 2069642. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:15:52,916][51558] Avg episode reward: [(0, '0.410'), (1, '0.840')] -[2023-09-26 22:15:57,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8314880. Throughput: 0: 775.2, 1: 776.5. Samples: 2078798. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:15:57,915][51558] Avg episode reward: [(0, '0.410'), (1, '0.840')] -[2023-09-26 22:16:02,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6280.5, 300 sec: 6178.7). Total num frames: 8347648. Throughput: 0: 782.1, 1: 782.6. Samples: 2083605. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:16:02,915][51558] Avg episode reward: [(0, '0.410'), (1, '0.840')] -[2023-09-26 22:16:03,778][52541] Updated weights for policy 1, policy_version 16320 (0.0017) -[2023-09-26 22:16:03,779][52540] Updated weights for policy 0, policy_version 16320 (0.0016) -[2023-09-26 22:16:07,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 8380416. Throughput: 0: 779.4, 1: 778.2. Samples: 2093056. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:16:07,916][51558] Avg episode reward: [(0, '0.140'), (1, '0.840')] -[2023-09-26 22:16:12,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.6, 300 sec: 6192.6). Total num frames: 8413184. Throughput: 0: 785.1, 1: 786.0. Samples: 2102457. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:16:12,914][51558] Avg episode reward: [(0, '0.140'), (1, '0.840')] -[2023-09-26 22:16:16,965][52540] Updated weights for policy 0, policy_version 16480 (0.0019) -[2023-09-26 22:16:16,965][52541] Updated weights for policy 1, policy_version 16480 (0.0019) -[2023-09-26 22:16:17,914][51558] Fps is (10 sec: 5734.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8437760. Throughput: 0: 782.2, 1: 781.9. Samples: 2106973. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:16:17,915][51558] Avg episode reward: [(0, '-0.780'), (1, '0.840')] -[2023-09-26 22:16:22,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6212.3, 300 sec: 6192.6). Total num frames: 8470528. Throughput: 0: 774.8, 1: 775.1. Samples: 2115700. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:16:22,915][51558] Avg episode reward: [(0, '-0.780'), (1, '0.840')] -[2023-09-26 22:16:27,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 8503296. Throughput: 0: 776.6, 1: 776.6. Samples: 2125536. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:16:27,915][51558] Avg episode reward: [(0, '-0.780'), (1, '0.840')] -[2023-09-26 22:16:30,203][52540] Updated weights for policy 0, policy_version 16640 (0.0018) -[2023-09-26 22:16:30,203][52541] Updated weights for policy 1, policy_version 16640 (0.0020) -[2023-09-26 22:16:32,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.6, 300 sec: 6192.6). Total num frames: 8536064. Throughput: 0: 773.7, 1: 773.7. Samples: 2129922. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:16:32,914][51558] Avg episode reward: [(0, '-1.350'), (1, '0.840')] -[2023-09-26 22:16:37,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8560640. Throughput: 0: 773.8, 1: 773.5. Samples: 2139272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:16:37,915][51558] Avg episode reward: [(0, '-1.350'), (1, '0.840')] -[2023-09-26 22:16:42,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8593408. Throughput: 0: 773.0, 1: 772.6. Samples: 2148352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:16:42,914][51558] Avg episode reward: [(0, '-1.350'), (1, '0.840')] -[2023-09-26 22:16:43,665][52541] Updated weights for policy 1, policy_version 16800 (0.0019) -[2023-09-26 22:16:43,665][52540] Updated weights for policy 0, policy_version 16800 (0.0017) -[2023-09-26 22:16:47,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 8626176. Throughput: 0: 765.8, 1: 765.7. Samples: 2152522. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:16:47,915][51558] Avg episode reward: [(0, '-1.840'), (1, '0.830')] -[2023-09-26 22:16:52,914][51558] Fps is (10 sec: 5734.2, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8650752. Throughput: 0: 764.4, 1: 767.0. Samples: 2161969. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:16:52,915][51558] Avg episode reward: [(0, '-1.840'), (1, '0.830')] -[2023-09-26 22:16:57,129][52540] Updated weights for policy 0, policy_version 16960 (0.0018) -[2023-09-26 22:16:57,129][52541] Updated weights for policy 1, policy_version 16960 (0.0019) -[2023-09-26 22:16:57,914][51558] Fps is (10 sec: 5734.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8683520. Throughput: 0: 761.1, 1: 760.2. Samples: 2170914. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:16:57,914][51558] Avg episode reward: [(0, '-1.840'), (1, '0.830')] -[2023-09-26 22:17:02,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8716288. Throughput: 0: 762.6, 1: 763.7. Samples: 2175656. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:17:02,915][51558] Avg episode reward: [(0, '-1.840'), (1, '0.830')] -[2023-09-26 22:17:07,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8749056. Throughput: 0: 772.6, 1: 772.2. Samples: 2185216. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:17:07,915][51558] Avg episode reward: [(0, '-1.840'), (1, '0.830')] -[2023-09-26 22:17:07,925][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000017088_4374528.pth... -[2023-09-26 22:17:07,927][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000017088_4374528.pth... -[2023-09-26 22:17:07,963][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000014192_3633152.pth -[2023-09-26 22:17:07,966][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000014192_3633152.pth -[2023-09-26 22:17:10,405][52540] Updated weights for policy 0, policy_version 17120 (0.0017) -[2023-09-26 22:17:10,405][52541] Updated weights for policy 1, policy_version 17120 (0.0018) -[2023-09-26 22:17:12,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8781824. Throughput: 0: 765.2, 1: 765.2. Samples: 2194406. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:17:12,915][51558] Avg episode reward: [(0, '-1.840'), (1, '0.830')] -[2023-09-26 22:17:17,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8806400. Throughput: 0: 772.1, 1: 771.7. Samples: 2199395. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:17:17,916][51558] Avg episode reward: [(0, '-2.040'), (1, '0.820')] -[2023-09-26 22:17:22,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8839168. Throughput: 0: 771.1, 1: 771.6. Samples: 2208693. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:17:22,915][51558] Avg episode reward: [(0, '-2.040'), (1, '0.820')] -[2023-09-26 22:17:23,164][52541] Updated weights for policy 1, policy_version 17280 (0.0018) -[2023-09-26 22:17:23,164][52540] Updated weights for policy 0, policy_version 17280 (0.0017) -[2023-09-26 22:17:27,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8871936. Throughput: 0: 775.6, 1: 776.0. Samples: 2218174. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:17:27,915][51558] Avg episode reward: [(0, '-2.010'), (1, '0.820')] -[2023-09-26 22:17:32,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8904704. Throughput: 0: 782.3, 1: 782.1. Samples: 2222921. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:17:32,915][51558] Avg episode reward: [(0, '-2.010'), (1, '0.820')] -[2023-09-26 22:17:36,439][52540] Updated weights for policy 0, policy_version 17440 (0.0018) -[2023-09-26 22:17:36,439][52541] Updated weights for policy 1, policy_version 17440 (0.0018) -[2023-09-26 22:17:37,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 8937472. Throughput: 0: 782.9, 1: 779.5. Samples: 2232278. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:17:37,915][51558] Avg episode reward: [(0, '-2.010'), (1, '0.820')] -[2023-09-26 22:17:42,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8962048. Throughput: 0: 779.5, 1: 779.3. Samples: 2241059. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:17:42,915][51558] Avg episode reward: [(0, '-2.370'), (1, '0.820')] -[2023-09-26 22:17:47,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 8994816. Throughput: 0: 781.0, 1: 781.7. Samples: 2245980. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:17:47,915][51558] Avg episode reward: [(0, '-2.370'), (1, '0.820')] -[2023-09-26 22:17:49,683][52540] Updated weights for policy 0, policy_version 17600 (0.0017) -[2023-09-26 22:17:49,683][52541] Updated weights for policy 1, policy_version 17600 (0.0017) -[2023-09-26 22:17:52,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6178.7). Total num frames: 9027584. Throughput: 0: 777.1, 1: 777.4. Samples: 2255170. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:17:52,915][51558] Avg episode reward: [(0, '-2.370'), (1, '0.820')] -[2023-09-26 22:17:57,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 9060352. Throughput: 0: 781.8, 1: 782.3. Samples: 2264792. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:17:57,915][51558] Avg episode reward: [(0, '-2.500'), (1, '0.820')] -[2023-09-26 22:18:02,802][52541] Updated weights for policy 1, policy_version 17760 (0.0015) -[2023-09-26 22:18:02,803][52540] Updated weights for policy 0, policy_version 17760 (0.0018) -[2023-09-26 22:18:02,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 9093120. Throughput: 0: 775.4, 1: 775.9. Samples: 2269204. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:18:02,915][51558] Avg episode reward: [(0, '-2.500'), (1, '0.820')] -[2023-09-26 22:18:07,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 9117696. Throughput: 0: 779.3, 1: 778.6. Samples: 2278797. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:18:07,915][51558] Avg episode reward: [(0, '-2.500'), (1, '0.820')] -[2023-09-26 22:18:12,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 9150464. Throughput: 0: 778.5, 1: 778.3. Samples: 2288230. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:18:12,915][51558] Avg episode reward: [(0, '-2.500'), (1, '0.830')] -[2023-09-26 22:18:15,724][52541] Updated weights for policy 1, policy_version 17920 (0.0017) -[2023-09-26 22:18:15,724][52540] Updated weights for policy 0, policy_version 17920 (0.0017) -[2023-09-26 22:18:17,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6280.6, 300 sec: 6192.6). Total num frames: 9183232. Throughput: 0: 781.3, 1: 780.3. Samples: 2293193. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:18:17,915][51558] Avg episode reward: [(0, '-2.500'), (1, '0.830')] -[2023-09-26 22:18:22,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.6, 300 sec: 6192.6). Total num frames: 9216000. Throughput: 0: 773.8, 1: 774.7. Samples: 2301960. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:18:22,914][51558] Avg episode reward: [(0, '-2.760'), (1, '0.360')] -[2023-09-26 22:18:27,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 9248768. Throughput: 0: 786.2, 1: 786.2. Samples: 2311814. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 22:18:27,915][51558] Avg episode reward: [(0, '-2.760'), (1, '0.360')] -[2023-09-26 22:18:28,891][52540] Updated weights for policy 0, policy_version 18080 (0.0014) -[2023-09-26 22:18:28,891][52541] Updated weights for policy 1, policy_version 18080 (0.0017) -[2023-09-26 22:18:32,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 9281536. Throughput: 0: 781.9, 1: 780.5. Samples: 2316289. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 22:18:32,915][51558] Avg episode reward: [(0, '-2.760'), (1, '0.360')] -[2023-09-26 22:18:37,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6178.7). Total num frames: 9306112. Throughput: 0: 783.5, 1: 783.6. Samples: 2325686. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 22:18:37,915][51558] Avg episode reward: [(0, '-2.940'), (1, '0.360')] -[2023-09-26 22:18:42,373][52540] Updated weights for policy 0, policy_version 18240 (0.0016) -[2023-09-26 22:18:42,373][52541] Updated weights for policy 1, policy_version 18240 (0.0017) -[2023-09-26 22:18:42,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 9338880. Throughput: 0: 777.5, 1: 776.4. Samples: 2334714. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-09-26 22:18:42,915][51558] Avg episode reward: [(0, '-2.940'), (1, '0.360')] -[2023-09-26 22:18:47,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 9371648. Throughput: 0: 774.9, 1: 775.0. Samples: 2338947. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-09-26 22:18:47,915][51558] Avg episode reward: [(0, '-2.940'), (1, '0.360')] -[2023-09-26 22:18:52,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 9396224. Throughput: 0: 776.6, 1: 778.1. Samples: 2348757. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-09-26 22:18:52,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:18:55,771][52540] Updated weights for policy 0, policy_version 18400 (0.0017) -[2023-09-26 22:18:55,772][52541] Updated weights for policy 1, policy_version 18400 (0.0018) -[2023-09-26 22:18:57,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 9428992. Throughput: 0: 767.6, 1: 767.8. Samples: 2357322. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:18:57,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:19:02,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 9461760. Throughput: 0: 765.9, 1: 767.2. Samples: 2362183. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:19:02,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:19:07,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 9494528. Throughput: 0: 773.6, 1: 773.6. Samples: 2371584. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:19:07,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:19:07,929][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000018544_4747264.pth... -[2023-09-26 22:19:07,929][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000018544_4747264.pth... -[2023-09-26 22:19:07,966][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000015632_4001792.pth -[2023-09-26 22:19:07,967][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000015632_4001792.pth -[2023-09-26 22:19:08,838][52540] Updated weights for policy 0, policy_version 18560 (0.0016) -[2023-09-26 22:19:08,839][52541] Updated weights for policy 1, policy_version 18560 (0.0018) -[2023-09-26 22:19:12,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 9527296. Throughput: 0: 767.4, 1: 767.6. Samples: 2380888. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:19:12,914][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:19:17,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 9551872. Throughput: 0: 771.2, 1: 772.4. Samples: 2385749. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:19:17,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:19:22,242][52540] Updated weights for policy 0, policy_version 18720 (0.0017) -[2023-09-26 22:19:22,242][52541] Updated weights for policy 1, policy_version 18720 (0.0017) -[2023-09-26 22:19:22,914][51558] Fps is (10 sec: 5734.2, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 9584640. Throughput: 0: 765.0, 1: 764.8. Samples: 2394531. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:19:22,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:19:27,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 9617408. Throughput: 0: 768.1, 1: 769.6. Samples: 2403914. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:19:27,914][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:19:32,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6007.5, 300 sec: 6164.8). Total num frames: 9641984. Throughput: 0: 772.4, 1: 772.1. Samples: 2408448. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:19:32,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:19:35,717][52541] Updated weights for policy 1, policy_version 18880 (0.0015) -[2023-09-26 22:19:35,717][52540] Updated weights for policy 0, policy_version 18880 (0.0018) -[2023-09-26 22:19:37,914][51558] Fps is (10 sec: 5734.2, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 9674752. Throughput: 0: 764.2, 1: 763.4. Samples: 2417497. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:19:37,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:19:42,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 9707520. Throughput: 0: 773.1, 1: 772.7. Samples: 2426880. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:19:42,914][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:19:47,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 9740288. Throughput: 0: 767.8, 1: 767.4. Samples: 2431267. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:19:47,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:19:48,955][52540] Updated weights for policy 0, policy_version 19040 (0.0017) -[2023-09-26 22:19:48,955][52541] Updated weights for policy 1, policy_version 19040 (0.0017) -[2023-09-26 22:19:52,914][51558] Fps is (10 sec: 5734.2, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 9764864. Throughput: 0: 768.3, 1: 769.2. Samples: 2440769. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:19:52,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:19:57,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 9797632. Throughput: 0: 766.5, 1: 766.7. Samples: 2449881. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:19:57,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:20:02,201][52540] Updated weights for policy 0, policy_version 19200 (0.0017) -[2023-09-26 22:20:02,202][52541] Updated weights for policy 1, policy_version 19200 (0.0017) -[2023-09-26 22:20:02,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 9830400. Throughput: 0: 765.5, 1: 764.4. Samples: 2454595. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:20:02,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:20:07,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 9863168. Throughput: 0: 770.1, 1: 770.4. Samples: 2463853. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:20:07,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:20:12,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 9895936. Throughput: 0: 773.8, 1: 772.8. Samples: 2473511. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:20:12,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:20:15,282][52540] Updated weights for policy 0, policy_version 19360 (0.0017) -[2023-09-26 22:20:15,282][52541] Updated weights for policy 1, policy_version 19360 (0.0017) -[2023-09-26 22:20:17,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6178.7). Total num frames: 9920512. Throughput: 0: 773.7, 1: 773.7. Samples: 2478080. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:20:17,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:20:22,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 9953280. Throughput: 0: 774.2, 1: 773.7. Samples: 2487153. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:20:22,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:20:27,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 9986048. Throughput: 0: 773.7, 1: 773.7. Samples: 2496512. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:20:27,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:20:28,743][52540] Updated weights for policy 0, policy_version 19520 (0.0018) -[2023-09-26 22:20:28,743][52541] Updated weights for policy 1, policy_version 19520 (0.0019) -[2023-09-26 22:20:32,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 10018816. Throughput: 0: 771.2, 1: 771.4. Samples: 2500685. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:20:32,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:20:37,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.6, 300 sec: 6192.6). Total num frames: 10051584. Throughput: 0: 774.1, 1: 773.8. Samples: 2510424. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:20:37,915][51558] Avg episode reward: [(0, '-3.040'), (1, '0.360')] -[2023-09-26 22:20:41,910][52540] Updated weights for policy 0, policy_version 19680 (0.0017) -[2023-09-26 22:20:41,911][52541] Updated weights for policy 1, policy_version 19680 (0.0017) -[2023-09-26 22:20:42,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 10076160. Throughput: 0: 773.6, 1: 773.4. Samples: 2519498. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:20:42,915][51558] Avg episode reward: [(0, '-3.160'), (1, '0.360')] -[2023-09-26 22:20:47,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 10108928. Throughput: 0: 775.4, 1: 775.5. Samples: 2524386. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:20:47,915][51558] Avg episode reward: [(0, '-3.160'), (1, '0.360')] -[2023-09-26 22:20:52,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 10141696. Throughput: 0: 776.9, 1: 776.9. Samples: 2533777. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:20:52,915][51558] Avg episode reward: [(0, '-3.160'), (1, '0.360')] -[2023-09-26 22:20:54,851][52541] Updated weights for policy 1, policy_version 19840 (0.0018) -[2023-09-26 22:20:54,851][52540] Updated weights for policy 0, policy_version 19840 (0.0018) -[2023-09-26 22:20:57,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 10174464. Throughput: 0: 776.4, 1: 777.6. Samples: 2543438. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:20:57,915][51558] Avg episode reward: [(0, '-3.240'), (1, '0.510')] -[2023-09-26 22:21:02,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 10207232. Throughput: 0: 773.8, 1: 774.1. Samples: 2547736. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:21:02,915][51558] Avg episode reward: [(0, '-3.240'), (1, '0.510')] -[2023-09-26 22:21:07,879][52540] Updated weights for policy 0, policy_version 20000 (0.0017) -[2023-09-26 22:21:07,879][52541] Updated weights for policy 1, policy_version 20000 (0.0016) -[2023-09-26 22:21:07,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 10240000. Throughput: 0: 782.5, 1: 783.9. Samples: 2557640. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:21:07,915][51558] Avg episode reward: [(0, '-3.240'), (1, '0.510')] -[2023-09-26 22:21:07,927][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000020000_5120000.pth... -[2023-09-26 22:21:07,927][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000020000_5120000.pth... -[2023-09-26 22:21:07,964][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000017088_4374528.pth -[2023-09-26 22:21:07,965][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000017088_4374528.pth -[2023-09-26 22:21:12,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 10264576. Throughput: 0: 779.8, 1: 780.2. Samples: 2566713. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:21:12,915][51558] Avg episode reward: [(0, '-2.890'), (1, '0.510')] -[2023-09-26 22:21:17,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 10297344. Throughput: 0: 789.5, 1: 789.4. Samples: 2571733. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:21:17,915][51558] Avg episode reward: [(0, '-2.890'), (1, '0.510')] -[2023-09-26 22:21:20,792][52540] Updated weights for policy 0, policy_version 20160 (0.0018) -[2023-09-26 22:21:20,792][52541] Updated weights for policy 1, policy_version 20160 (0.0019) -[2023-09-26 22:21:22,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 10330112. Throughput: 0: 786.4, 1: 786.1. Samples: 2581187. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:21:22,915][51558] Avg episode reward: [(0, '-2.890'), (1, '0.500')] -[2023-09-26 22:21:27,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 10362880. Throughput: 0: 791.5, 1: 791.2. Samples: 2590720. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:21:27,915][51558] Avg episode reward: [(0, '-2.890'), (1, '0.480')] -[2023-09-26 22:21:32,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 10395648. Throughput: 0: 787.6, 1: 787.6. Samples: 2595269. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:21:32,915][51558] Avg episode reward: [(0, '-2.890'), (1, '0.480')] -[2023-09-26 22:21:33,881][52540] Updated weights for policy 0, policy_version 20320 (0.0016) -[2023-09-26 22:21:33,881][52541] Updated weights for policy 1, policy_version 20320 (0.0017) -[2023-09-26 22:21:37,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 10428416. Throughput: 0: 790.3, 1: 789.9. Samples: 2604885. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:21:37,915][51558] Avg episode reward: [(0, '-2.710'), (1, '0.470')] -[2023-09-26 22:21:42,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6417.1, 300 sec: 6220.4). Total num frames: 10461184. Throughput: 0: 788.3, 1: 787.5. Samples: 2614347. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:21:42,915][51558] Avg episode reward: [(0, '-2.710'), (1, '0.470')] -[2023-09-26 22:21:46,949][52541] Updated weights for policy 1, policy_version 20480 (0.0017) -[2023-09-26 22:21:46,949][52540] Updated weights for policy 0, policy_version 20480 (0.0017) -[2023-09-26 22:21:47,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 10485760. Throughput: 0: 796.2, 1: 793.0. Samples: 2619250. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:21:47,915][51558] Avg episode reward: [(0, '-2.710'), (1, '0.470')] -[2023-09-26 22:21:52,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 10518528. Throughput: 0: 782.1, 1: 781.3. Samples: 2627992. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:21:52,915][51558] Avg episode reward: [(0, '-2.440'), (1, '0.470')] -[2023-09-26 22:21:57,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 10551296. Throughput: 0: 786.2, 1: 788.8. Samples: 2637592. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:21:57,915][51558] Avg episode reward: [(0, '-2.440'), (1, '0.470')] -[2023-09-26 22:22:00,441][52541] Updated weights for policy 1, policy_version 20640 (0.0017) -[2023-09-26 22:22:00,441][52540] Updated weights for policy 0, policy_version 20640 (0.0018) -[2023-09-26 22:22:02,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 10575872. Throughput: 0: 778.6, 1: 779.7. Samples: 2641856. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:22:02,915][51558] Avg episode reward: [(0, '-2.440'), (1, '0.470')] -[2023-09-26 22:22:07,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 10608640. Throughput: 0: 772.8, 1: 772.5. Samples: 2650724. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:22:07,914][51558] Avg episode reward: [(0, '-1.520'), (1, '0.460')] -[2023-09-26 22:22:12,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6280.6, 300 sec: 6220.4). Total num frames: 10641408. Throughput: 0: 773.7, 1: 773.7. Samples: 2660352. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:22:12,914][51558] Avg episode reward: [(0, '-1.520'), (1, '0.460')] -[2023-09-26 22:22:13,640][52540] Updated weights for policy 0, policy_version 20800 (0.0016) -[2023-09-26 22:22:13,641][52541] Updated weights for policy 1, policy_version 20800 (0.0016) -[2023-09-26 22:22:17,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 10674176. Throughput: 0: 769.4, 1: 769.5. Samples: 2664519. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:22:17,915][51558] Avg episode reward: [(0, '-1.520'), (1, '0.460')] -[2023-09-26 22:22:22,914][51558] Fps is (10 sec: 5734.2, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 10698752. Throughput: 0: 767.8, 1: 768.0. Samples: 2673999. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:22:22,915][51558] Avg episode reward: [(0, '-0.940'), (1, '0.460')] -[2023-09-26 22:22:27,142][52541] Updated weights for policy 1, policy_version 20960 (0.0017) -[2023-09-26 22:22:27,142][52540] Updated weights for policy 0, policy_version 20960 (0.0017) -[2023-09-26 22:22:27,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 10731520. Throughput: 0: 761.9, 1: 761.5. Samples: 2682900. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:22:27,915][51558] Avg episode reward: [(0, '-0.940'), (1, '0.460')] -[2023-09-26 22:22:32,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 10764288. Throughput: 0: 758.9, 1: 762.1. Samples: 2687696. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:22:32,915][51558] Avg episode reward: [(0, '-0.740'), (1, '0.470')] -[2023-09-26 22:22:37,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 10797056. Throughput: 0: 769.4, 1: 768.9. Samples: 2697216. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:22:37,915][51558] Avg episode reward: [(0, '-0.450'), (1, '0.470')] -[2023-09-26 22:22:40,382][52540] Updated weights for policy 0, policy_version 21120 (0.0016) -[2023-09-26 22:22:40,382][52541] Updated weights for policy 1, policy_version 21120 (0.0016) -[2023-09-26 22:22:42,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6007.5, 300 sec: 6192.6). Total num frames: 10821632. Throughput: 0: 763.4, 1: 761.4. Samples: 2706205. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:22:42,915][51558] Avg episode reward: [(0, '-0.450'), (1, '0.470')] -[2023-09-26 22:22:47,914][51558] Fps is (10 sec: 5734.6, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 10854400. Throughput: 0: 770.8, 1: 770.5. Samples: 2711216. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:22:47,914][51558] Avg episode reward: [(0, '-0.450'), (1, '0.470')] -[2023-09-26 22:22:52,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 10887168. Throughput: 0: 776.5, 1: 776.8. Samples: 2720622. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:22:52,915][51558] Avg episode reward: [(0, '-0.450'), (1, '0.470')] -[2023-09-26 22:22:53,263][52540] Updated weights for policy 0, policy_version 21280 (0.0017) -[2023-09-26 22:22:53,263][52541] Updated weights for policy 1, policy_version 21280 (0.0018) -[2023-09-26 22:22:57,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 10919936. Throughput: 0: 773.7, 1: 773.7. Samples: 2729987. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:22:57,915][51558] Avg episode reward: [(0, '-0.450'), (1, '0.470')] -[2023-09-26 22:23:02,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 10952704. Throughput: 0: 777.2, 1: 778.2. Samples: 2734511. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:23:02,915][51558] Avg episode reward: [(0, '-0.250'), (1, '0.480')] -[2023-09-26 22:23:06,462][52540] Updated weights for policy 0, policy_version 21440 (0.0017) -[2023-09-26 22:23:06,462][52541] Updated weights for policy 1, policy_version 21440 (0.0017) -[2023-09-26 22:23:07,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 10985472. Throughput: 0: 780.3, 1: 779.7. Samples: 2744199. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:23:07,915][51558] Avg episode reward: [(0, '-0.250'), (1, '0.480')] -[2023-09-26 22:23:07,922][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000021456_5492736.pth... -[2023-09-26 22:23:07,922][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000021456_5492736.pth... -[2023-09-26 22:23:07,959][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000018544_4747264.pth -[2023-09-26 22:23:07,960][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000018544_4747264.pth -[2023-09-26 22:23:12,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 11010048. Throughput: 0: 782.4, 1: 781.7. Samples: 2753284. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:23:12,915][51558] Avg episode reward: [(0, '-0.250'), (1, '0.480')] -[2023-09-26 22:23:17,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 11042816. Throughput: 0: 780.0, 1: 782.0. Samples: 2757983. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:23:17,915][51558] Avg episode reward: [(0, '-0.250'), (1, '0.480')] -[2023-09-26 22:23:19,575][52540] Updated weights for policy 0, policy_version 21600 (0.0017) -[2023-09-26 22:23:19,576][52541] Updated weights for policy 1, policy_version 21600 (0.0017) -[2023-09-26 22:23:22,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 11075584. Throughput: 0: 779.2, 1: 780.1. Samples: 2767388. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:23:22,916][51558] Avg episode reward: [(0, '-0.250'), (1, '0.480')] -[2023-09-26 22:23:27,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 11108352. Throughput: 0: 784.4, 1: 782.9. Samples: 2776732. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:23:27,915][51558] Avg episode reward: [(0, '-0.250'), (1, '0.480')] -[2023-09-26 22:23:32,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6212.2, 300 sec: 6206.5). Total num frames: 11137024. Throughput: 0: 778.1, 1: 777.3. Samples: 2781208. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:23:32,915][51558] Avg episode reward: [(0, '0.120'), (1, '0.470')] -[2023-09-26 22:23:32,918][52540] Updated weights for policy 0, policy_version 21760 (0.0018) -[2023-09-26 22:23:32,918][52541] Updated weights for policy 1, policy_version 21760 (0.0016) -[2023-09-26 22:23:37,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 11165696. Throughput: 0: 777.5, 1: 777.9. Samples: 2790613. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:23:37,915][51558] Avg episode reward: [(0, '0.120'), (1, '0.470')] -[2023-09-26 22:23:42,914][51558] Fps is (10 sec: 6144.1, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 11198464. Throughput: 0: 773.8, 1: 773.8. Samples: 2799626. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:23:42,915][51558] Avg episode reward: [(0, '0.250'), (1, '0.460')] -[2023-09-26 22:23:46,006][52540] Updated weights for policy 0, policy_version 21920 (0.0017) -[2023-09-26 22:23:46,006][52541] Updated weights for policy 1, policy_version 21920 (0.0018) -[2023-09-26 22:23:47,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 11231232. Throughput: 0: 778.3, 1: 777.4. Samples: 2804520. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:23:47,915][51558] Avg episode reward: [(0, '0.250'), (1, '0.460')] -[2023-09-26 22:23:52,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 11264000. Throughput: 0: 775.0, 1: 775.4. Samples: 2813965. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:23:52,915][51558] Avg episode reward: [(0, '0.250'), (1, '0.460')] -[2023-09-26 22:23:57,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 11296768. Throughput: 0: 776.5, 1: 777.2. Samples: 2823203. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:23:57,915][51558] Avg episode reward: [(0, '0.250'), (1, '0.440')] -[2023-09-26 22:23:59,226][52541] Updated weights for policy 1, policy_version 22080 (0.0017) -[2023-09-26 22:23:59,227][52540] Updated weights for policy 0, policy_version 22080 (0.0015) -[2023-09-26 22:24:02,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 11321344. Throughput: 0: 778.8, 1: 776.5. Samples: 2827971. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:24:02,915][51558] Avg episode reward: [(0, '0.250'), (1, '0.440')] -[2023-09-26 22:24:07,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 11354112. Throughput: 0: 776.7, 1: 776.0. Samples: 2837261. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:24:07,915][51558] Avg episode reward: [(0, '0.250'), (1, '0.440')] -[2023-09-26 22:24:12,473][52541] Updated weights for policy 1, policy_version 22240 (0.0016) -[2023-09-26 22:24:12,473][52540] Updated weights for policy 0, policy_version 22240 (0.0017) -[2023-09-26 22:24:12,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 11386880. Throughput: 0: 776.4, 1: 776.4. Samples: 2846607. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:24:12,915][51558] Avg episode reward: [(0, '0.510'), (1, '0.900')] -[2023-09-26 22:24:17,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 11419648. Throughput: 0: 773.7, 1: 773.6. Samples: 2850838. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:24:17,915][51558] Avg episode reward: [(0, '0.510'), (1, '0.900')] -[2023-09-26 22:24:22,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 11444224. Throughput: 0: 778.4, 1: 775.9. Samples: 2860556. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:24:22,915][51558] Avg episode reward: [(0, '0.510'), (1, '0.900')] -[2023-09-26 22:24:25,824][52540] Updated weights for policy 0, policy_version 22400 (0.0018) -[2023-09-26 22:24:25,824][52541] Updated weights for policy 1, policy_version 22400 (0.0016) -[2023-09-26 22:24:27,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 11476992. Throughput: 0: 773.6, 1: 773.7. Samples: 2869253. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:24:27,915][51558] Avg episode reward: [(0, '0.700'), (1, '0.900')] -[2023-09-26 22:24:32,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6212.3, 300 sec: 6220.4). Total num frames: 11509760. Throughput: 0: 773.9, 1: 773.4. Samples: 2874152. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:24:32,915][51558] Avg episode reward: [(0, '0.700'), (1, '0.900')] -[2023-09-26 22:24:37,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 11542528. Throughput: 0: 770.4, 1: 772.6. Samples: 2883402. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:24:37,915][51558] Avg episode reward: [(0, '0.690'), (1, '0.900')] -[2023-09-26 22:24:39,195][52540] Updated weights for policy 0, policy_version 22560 (0.0016) -[2023-09-26 22:24:39,195][52541] Updated weights for policy 1, policy_version 22560 (0.0017) -[2023-09-26 22:24:42,914][51558] Fps is (10 sec: 5734.6, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 11567104. Throughput: 0: 766.6, 1: 767.1. Samples: 2892218. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:24:42,914][51558] Avg episode reward: [(0, '0.790'), (1, '0.900')] -[2023-09-26 22:24:47,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 11599872. Throughput: 0: 768.4, 1: 768.6. Samples: 2897137. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:24:47,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.900')] -[2023-09-26 22:24:52,440][52540] Updated weights for policy 0, policy_version 22720 (0.0015) -[2023-09-26 22:24:52,440][52541] Updated weights for policy 1, policy_version 22720 (0.0015) -[2023-09-26 22:24:52,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 11632640. Throughput: 0: 766.2, 1: 766.6. Samples: 2906237. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:24:52,914][51558] Avg episode reward: [(0, '0.790'), (1, '0.900')] -[2023-09-26 22:24:57,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 11665408. Throughput: 0: 765.2, 1: 766.5. Samples: 2915530. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:24:57,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.900')] -[2023-09-26 22:25:02,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 11689984. Throughput: 0: 773.6, 1: 773.3. Samples: 2920448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:25:02,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.900')] -[2023-09-26 22:25:05,771][52540] Updated weights for policy 0, policy_version 22880 (0.0016) -[2023-09-26 22:25:05,772][52541] Updated weights for policy 1, policy_version 22880 (0.0017) -[2023-09-26 22:25:07,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 11722752. Throughput: 0: 763.4, 1: 765.7. Samples: 2929365. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:25:07,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.900')] -[2023-09-26 22:25:07,924][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000022896_5861376.pth... -[2023-09-26 22:25:07,924][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000022896_5861376.pth... -[2023-09-26 22:25:07,958][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000020000_5120000.pth -[2023-09-26 22:25:07,966][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000020000_5120000.pth -[2023-09-26 22:25:12,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 11755520. Throughput: 0: 773.7, 1: 773.4. Samples: 2938872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:25:12,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.900')] -[2023-09-26 22:25:17,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 11788288. Throughput: 0: 767.0, 1: 767.5. Samples: 2943205. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:25:17,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.900')] -[2023-09-26 22:25:18,907][52540] Updated weights for policy 0, policy_version 23040 (0.0017) -[2023-09-26 22:25:18,907][52541] Updated weights for policy 1, policy_version 23040 (0.0018) -[2023-09-26 22:25:22,914][51558] Fps is (10 sec: 6144.1, 60 sec: 6212.3, 300 sec: 6206.5). Total num frames: 11816960. Throughput: 0: 771.6, 1: 768.8. Samples: 2952722. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:25:22,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.900')] -[2023-09-26 22:25:27,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 11845632. Throughput: 0: 769.3, 1: 768.5. Samples: 2961419. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:25:27,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.900')] -[2023-09-26 22:25:32,447][52541] Updated weights for policy 1, policy_version 23200 (0.0017) -[2023-09-26 22:25:32,447][52540] Updated weights for policy 0, policy_version 23200 (0.0017) -[2023-09-26 22:25:32,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 11878400. Throughput: 0: 767.0, 1: 767.0. Samples: 2966165. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:25:32,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.900')] -[2023-09-26 22:25:37,915][51558] Fps is (10 sec: 6553.3, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 11911168. Throughput: 0: 768.8, 1: 769.8. Samples: 2975476. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 22:25:37,916][51558] Avg episode reward: [(0, '0.790'), (1, '0.900')] -[2023-09-26 22:25:42,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 11935744. Throughput: 0: 766.2, 1: 765.8. Samples: 2984466. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 22:25:42,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.900')] -[2023-09-26 22:25:45,694][52541] Updated weights for policy 1, policy_version 23360 (0.0017) -[2023-09-26 22:25:45,694][52540] Updated weights for policy 0, policy_version 23360 (0.0018) -[2023-09-26 22:25:47,914][51558] Fps is (10 sec: 5734.6, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 11968512. Throughput: 0: 766.6, 1: 766.8. Samples: 2989452. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 22:25:47,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.810')] -[2023-09-26 22:25:52,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 12001280. Throughput: 0: 770.9, 1: 770.6. Samples: 2998733. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:25:52,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.810')] -[2023-09-26 22:25:57,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 12034048. Throughput: 0: 773.1, 1: 773.1. Samples: 3008451. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:25:57,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.810')] -[2023-09-26 22:25:58,729][52540] Updated weights for policy 0, policy_version 23520 (0.0017) -[2023-09-26 22:25:58,729][52541] Updated weights for policy 1, policy_version 23520 (0.0016) -[2023-09-26 22:26:02,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 12066816. Throughput: 0: 772.5, 1: 772.6. Samples: 3012734. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:26:02,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.570')] -[2023-09-26 22:26:07,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 12099584. Throughput: 0: 775.5, 1: 775.6. Samples: 3022523. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:26:07,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.570')] -[2023-09-26 22:26:11,784][52540] Updated weights for policy 0, policy_version 23680 (0.0014) -[2023-09-26 22:26:11,785][52541] Updated weights for policy 1, policy_version 23680 (0.0019) -[2023-09-26 22:26:12,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 12124160. Throughput: 0: 781.2, 1: 780.4. Samples: 3031693. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:26:12,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.570')] -[2023-09-26 22:26:17,914][51558] Fps is (10 sec: 5734.6, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 12156928. Throughput: 0: 776.1, 1: 775.6. Samples: 3035990. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:26:17,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.520')] -[2023-09-26 22:26:22,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6212.3, 300 sec: 6192.6). Total num frames: 12189696. Throughput: 0: 777.5, 1: 775.8. Samples: 3045376. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:26:22,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.520')] -[2023-09-26 22:26:25,141][52540] Updated weights for policy 0, policy_version 23840 (0.0017) -[2023-09-26 22:26:25,141][52541] Updated weights for policy 1, policy_version 23840 (0.0016) -[2023-09-26 22:26:27,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 12222464. Throughput: 0: 782.6, 1: 781.3. Samples: 3054842. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:26:27,915][51558] Avg episode reward: [(0, '0.790'), (1, '0.520')] -[2023-09-26 22:26:32,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 12247040. Throughput: 0: 776.7, 1: 774.2. Samples: 3059242. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:26:32,915][51558] Avg episode reward: [(0, '0.910'), (1, '0.520')] -[2023-09-26 22:26:37,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 12279808. Throughput: 0: 769.9, 1: 770.2. Samples: 3068039. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:26:37,915][51558] Avg episode reward: [(0, '0.910'), (1, '0.520')] -[2023-09-26 22:26:38,852][52541] Updated weights for policy 1, policy_version 24000 (0.0017) -[2023-09-26 22:26:38,852][52540] Updated weights for policy 0, policy_version 24000 (0.0018) -[2023-09-26 22:26:42,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 12312576. Throughput: 0: 764.8, 1: 766.4. Samples: 3077358. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:26:42,915][51558] Avg episode reward: [(0, '0.910'), (1, '0.520')] -[2023-09-26 22:26:47,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 12337152. Throughput: 0: 769.0, 1: 767.6. Samples: 3081884. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:26:47,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.520')] -[2023-09-26 22:26:52,039][52541] Updated weights for policy 1, policy_version 24160 (0.0018) -[2023-09-26 22:26:52,040][52540] Updated weights for policy 0, policy_version 24160 (0.0017) -[2023-09-26 22:26:52,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 12369920. Throughput: 0: 762.4, 1: 763.4. Samples: 3091183. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:26:52,914][51558] Avg episode reward: [(0, '0.990'), (1, '0.520')] -[2023-09-26 22:26:57,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 12402688. Throughput: 0: 766.1, 1: 766.8. Samples: 3100672. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-09-26 22:26:57,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.520')] -[2023-09-26 22:27:02,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 12435456. Throughput: 0: 764.4, 1: 765.1. Samples: 3104817. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-09-26 22:27:02,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.520')] -[2023-09-26 22:27:05,301][52540] Updated weights for policy 0, policy_version 24320 (0.0019) -[2023-09-26 22:27:05,302][52541] Updated weights for policy 1, policy_version 24320 (0.0018) -[2023-09-26 22:27:07,914][51558] Fps is (10 sec: 5734.2, 60 sec: 6007.5, 300 sec: 6164.8). Total num frames: 12460032. Throughput: 0: 769.8, 1: 766.8. Samples: 3114522. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) -[2023-09-26 22:27:07,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.520')] -[2023-09-26 22:27:07,928][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000024336_6230016.pth... -[2023-09-26 22:27:07,964][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000021456_5492736.pth -[2023-09-26 22:27:08,053][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000024352_6234112.pth... -[2023-09-26 22:27:08,080][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000021456_5492736.pth -[2023-09-26 22:27:12,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 12492800. Throughput: 0: 759.1, 1: 760.0. Samples: 3123200. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) -[2023-09-26 22:27:12,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.550')] -[2023-09-26 22:27:17,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 12525568. Throughput: 0: 759.0, 1: 761.3. Samples: 3127655. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) -[2023-09-26 22:27:17,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.550')] -[2023-09-26 22:27:18,911][52540] Updated weights for policy 0, policy_version 24480 (0.0017) -[2023-09-26 22:27:18,912][52541] Updated weights for policy 1, policy_version 24480 (0.0016) -[2023-09-26 22:27:22,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 12558336. Throughput: 0: 770.1, 1: 768.2. Samples: 3137259. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) -[2023-09-26 22:27:22,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.550')] -[2023-09-26 22:27:27,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6007.5, 300 sec: 6164.8). Total num frames: 12582912. Throughput: 0: 767.8, 1: 766.9. Samples: 3146419. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) -[2023-09-26 22:27:27,914][51558] Avg episode reward: [(0, '0.990'), (1, '0.560')] -[2023-09-26 22:27:32,133][52541] Updated weights for policy 1, policy_version 24640 (0.0017) -[2023-09-26 22:27:32,133][52540] Updated weights for policy 0, policy_version 24640 (0.0016) -[2023-09-26 22:27:32,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 12615680. Throughput: 0: 768.5, 1: 770.0. Samples: 3151118. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:27:32,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.560')] -[2023-09-26 22:27:37,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 12648448. Throughput: 0: 767.0, 1: 766.7. Samples: 3160197. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:27:37,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.560')] -[2023-09-26 22:27:42,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 12681216. Throughput: 0: 768.3, 1: 767.9. Samples: 3169802. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:27:42,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.560')] -[2023-09-26 22:27:45,211][52541] Updated weights for policy 1, policy_version 24800 (0.0017) -[2023-09-26 22:27:45,211][52540] Updated weights for policy 0, policy_version 24800 (0.0017) -[2023-09-26 22:27:47,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 12713984. Throughput: 0: 773.4, 1: 773.0. Samples: 3174402. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:27:47,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.560')] -[2023-09-26 22:27:52,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 12738560. Throughput: 0: 767.8, 1: 771.7. Samples: 3183798. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:27:52,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.560')] -[2023-09-26 22:27:57,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 12771328. Throughput: 0: 773.7, 1: 773.7. Samples: 3192832. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:27:57,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:27:58,610][52541] Updated weights for policy 1, policy_version 24960 (0.0016) -[2023-09-26 22:27:58,610][52540] Updated weights for policy 0, policy_version 24960 (0.0016) -[2023-09-26 22:28:02,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 12804096. Throughput: 0: 772.5, 1: 772.8. Samples: 3197195. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:28:02,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:28:07,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 12828672. Throughput: 0: 770.1, 1: 772.0. Samples: 3206653. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:28:07,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:28:11,834][52540] Updated weights for policy 0, policy_version 25120 (0.0017) -[2023-09-26 22:28:11,834][52541] Updated weights for policy 1, policy_version 25120 (0.0017) -[2023-09-26 22:28:12,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 12861440. Throughput: 0: 773.0, 1: 772.7. Samples: 3215975. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:28:12,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:28:17,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 12894208. Throughput: 0: 772.6, 1: 773.6. Samples: 3220698. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:28:17,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:28:22,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 12926976. Throughput: 0: 774.9, 1: 774.9. Samples: 3229936. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:28:22,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:28:25,220][52541] Updated weights for policy 1, policy_version 25280 (0.0018) -[2023-09-26 22:28:25,220][52540] Updated weights for policy 0, policy_version 25280 (0.0016) -[2023-09-26 22:28:27,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6212.2, 300 sec: 6164.8). Total num frames: 12955648. Throughput: 0: 767.0, 1: 769.3. Samples: 3238935. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:28:27,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:28:32,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 12984320. Throughput: 0: 768.3, 1: 769.0. Samples: 3243578. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:28:32,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:28:37,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 13017088. Throughput: 0: 761.2, 1: 760.9. Samples: 3252293. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:28:37,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:28:38,719][52540] Updated weights for policy 0, policy_version 25440 (0.0017) -[2023-09-26 22:28:38,719][52541] Updated weights for policy 1, policy_version 25440 (0.0016) -[2023-09-26 22:28:42,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 13049856. Throughput: 0: 766.8, 1: 767.7. Samples: 3261882. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:28:42,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:28:47,914][51558] Fps is (10 sec: 6144.1, 60 sec: 6075.7, 300 sec: 6150.9). Total num frames: 13078528. Throughput: 0: 770.9, 1: 770.6. Samples: 3266560. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:28:47,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:28:52,050][52540] Updated weights for policy 0, policy_version 25600 (0.0017) -[2023-09-26 22:28:52,051][52541] Updated weights for policy 1, policy_version 25600 (0.0019) -[2023-09-26 22:28:52,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 13107200. Throughput: 0: 766.8, 1: 766.5. Samples: 3275653. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:28:52,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:28:57,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 13139968. Throughput: 0: 767.0, 1: 765.0. Samples: 3284914. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:28:57,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:29:02,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 13172736. Throughput: 0: 760.7, 1: 759.2. Samples: 3289097. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:29:02,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:29:05,498][52541] Updated weights for policy 1, policy_version 25760 (0.0017) -[2023-09-26 22:29:05,498][52540] Updated weights for policy 0, policy_version 25760 (0.0018) -[2023-09-26 22:29:07,914][51558] Fps is (10 sec: 5734.2, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 13197312. Throughput: 0: 762.8, 1: 761.0. Samples: 3298504. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:29:07,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:29:08,085][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000025792_6602752.pth... -[2023-09-26 22:29:08,091][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000025792_6602752.pth... -[2023-09-26 22:29:08,113][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000022896_5861376.pth -[2023-09-26 22:29:08,125][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000022896_5861376.pth -[2023-09-26 22:29:12,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 13230080. Throughput: 0: 766.9, 1: 765.0. Samples: 3307870. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:29:12,914][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:29:17,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 13262848. Throughput: 0: 761.9, 1: 761.4. Samples: 3312125. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:29:17,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.550')] -[2023-09-26 22:29:18,801][52540] Updated weights for policy 0, policy_version 25920 (0.0017) -[2023-09-26 22:29:18,802][52541] Updated weights for policy 1, policy_version 25920 (0.0018) -[2023-09-26 22:29:22,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 13295616. Throughput: 0: 772.8, 1: 771.9. Samples: 3321807. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:29:22,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.550')] -[2023-09-26 22:29:27,914][51558] Fps is (10 sec: 6144.1, 60 sec: 6144.0, 300 sec: 6150.9). Total num frames: 13324288. Throughput: 0: 768.1, 1: 768.4. Samples: 3331025. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:29:27,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.550')] -[2023-09-26 22:29:31,933][52541] Updated weights for policy 1, policy_version 26080 (0.0015) -[2023-09-26 22:29:31,934][52540] Updated weights for policy 0, policy_version 26080 (0.0017) -[2023-09-26 22:29:32,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 13352960. Throughput: 0: 769.7, 1: 769.2. Samples: 3335808. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:29:32,914][51558] Avg episode reward: [(0, '0.990'), (1, '0.560')] -[2023-09-26 22:29:37,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 13385728. Throughput: 0: 767.6, 1: 768.1. Samples: 3344758. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:29:37,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.560')] -[2023-09-26 22:29:42,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 13418496. Throughput: 0: 772.7, 1: 774.4. Samples: 3354531. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:29:42,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.560')] -[2023-09-26 22:29:45,145][52540] Updated weights for policy 0, policy_version 26240 (0.0017) -[2023-09-26 22:29:45,145][52541] Updated weights for policy 1, policy_version 26240 (0.0016) -[2023-09-26 22:29:47,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6212.2, 300 sec: 6164.8). Total num frames: 13451264. Throughput: 0: 773.7, 1: 773.8. Samples: 3358736. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:29:47,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:29:52,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 13475840. Throughput: 0: 769.7, 1: 772.2. Samples: 3367891. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:29:52,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:29:57,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 13508608. Throughput: 0: 769.8, 1: 769.8. Samples: 3377152. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:29:57,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:29:58,532][52540] Updated weights for policy 0, policy_version 26400 (0.0017) -[2023-09-26 22:29:58,533][52541] Updated weights for policy 1, policy_version 26400 (0.0018) -[2023-09-26 22:30:02,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 13541376. Throughput: 0: 773.9, 1: 774.1. Samples: 3381787. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:30:02,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:30:07,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.6, 300 sec: 6164.8). Total num frames: 13574144. Throughput: 0: 773.7, 1: 773.2. Samples: 3391419. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:30:07,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:30:11,739][52541] Updated weights for policy 1, policy_version 26560 (0.0018) -[2023-09-26 22:30:11,739][52540] Updated weights for policy 0, policy_version 26560 (0.0018) -[2023-09-26 22:30:12,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 13598720. Throughput: 0: 771.3, 1: 771.1. Samples: 3400432. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:30:12,915][51558] Avg episode reward: [(0, '0.990'), (1, '0.570')] -[2023-09-26 22:30:17,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6150.9). Total num frames: 13631488. Throughput: 0: 769.3, 1: 769.8. Samples: 3405069. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:30:17,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.570')] -[2023-09-26 22:30:22,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 13664256. Throughput: 0: 769.9, 1: 769.4. Samples: 3414027. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:30:22,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.570')] -[2023-09-26 22:30:25,076][52540] Updated weights for policy 0, policy_version 26720 (0.0016) -[2023-09-26 22:30:25,078][52541] Updated weights for policy 1, policy_version 26720 (0.0016) -[2023-09-26 22:30:27,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6212.3, 300 sec: 6164.8). Total num frames: 13697024. Throughput: 0: 769.2, 1: 769.6. Samples: 3423776. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:30:27,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.570')] -[2023-09-26 22:30:32,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 13721600. Throughput: 0: 773.6, 1: 773.3. Samples: 3428345. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:30:32,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.570')] -[2023-09-26 22:30:37,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 13754368. Throughput: 0: 771.4, 1: 771.0. Samples: 3437301. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:30:37,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.570')] -[2023-09-26 22:30:38,387][52540] Updated weights for policy 0, policy_version 26880 (0.0018) -[2023-09-26 22:30:38,387][52541] Updated weights for policy 1, policy_version 26880 (0.0018) -[2023-09-26 22:30:42,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 13787136. Throughput: 0: 773.7, 1: 773.1. Samples: 3446756. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:30:42,914][51558] Avg episode reward: [(0, '0.980'), (1, '0.570')] -[2023-09-26 22:30:47,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 13819904. Throughput: 0: 769.5, 1: 769.7. Samples: 3451052. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:30:47,914][51558] Avg episode reward: [(0, '0.980'), (1, '0.570')] -[2023-09-26 22:30:51,626][52541] Updated weights for policy 1, policy_version 27040 (0.0017) -[2023-09-26 22:30:51,626][52540] Updated weights for policy 0, policy_version 27040 (0.0018) -[2023-09-26 22:30:52,914][51558] Fps is (10 sec: 6143.8, 60 sec: 6212.3, 300 sec: 6150.9). Total num frames: 13848576. Throughput: 0: 770.4, 1: 771.0. Samples: 3460785. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:30:52,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.570')] -[2023-09-26 22:30:57,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 13877248. Throughput: 0: 769.6, 1: 768.6. Samples: 3469651. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:30:57,914][51558] Avg episode reward: [(0, '0.980'), (1, '0.560')] -[2023-09-26 22:31:02,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 13910016. Throughput: 0: 769.3, 1: 769.6. Samples: 3474319. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:31:02,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.560')] -[2023-09-26 22:31:04,820][52541] Updated weights for policy 1, policy_version 27200 (0.0018) -[2023-09-26 22:31:04,821][52540] Updated weights for policy 0, policy_version 27200 (0.0018) -[2023-09-26 22:31:07,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 13942784. Throughput: 0: 774.0, 1: 774.3. Samples: 3483699. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:31:07,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.560')] -[2023-09-26 22:31:07,927][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000027232_6971392.pth... -[2023-09-26 22:31:07,927][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000027232_6971392.pth... -[2023-09-26 22:31:07,958][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000024336_6230016.pth -[2023-09-26 22:31:07,963][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000024352_6234112.pth -[2023-09-26 22:31:12,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6164.8). Total num frames: 13975552. Throughput: 0: 774.2, 1: 774.4. Samples: 3493467. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:31:12,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.530')] -[2023-09-26 22:31:17,914][51558] Fps is (10 sec: 6144.1, 60 sec: 6212.3, 300 sec: 6150.9). Total num frames: 14004224. Throughput: 0: 773.7, 1: 773.8. Samples: 3497984. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:31:17,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.530')] -[2023-09-26 22:31:17,960][52541] Updated weights for policy 1, policy_version 27360 (0.0015) -[2023-09-26 22:31:17,960][52540] Updated weights for policy 0, policy_version 27360 (0.0017) -[2023-09-26 22:31:22,914][51558] Fps is (10 sec: 5734.2, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 14032896. Throughput: 0: 776.6, 1: 777.4. Samples: 3507234. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:31:22,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.530')] -[2023-09-26 22:31:27,914][51558] Fps is (10 sec: 6144.1, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14065664. Throughput: 0: 773.7, 1: 774.3. Samples: 3516416. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:31:27,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.530')] -[2023-09-26 22:31:31,319][52540] Updated weights for policy 0, policy_version 27520 (0.0018) -[2023-09-26 22:31:31,319][52541] Updated weights for policy 1, policy_version 27520 (0.0016) -[2023-09-26 22:31:32,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6280.5, 300 sec: 6164.8). Total num frames: 14098432. Throughput: 0: 774.8, 1: 774.8. Samples: 3520786. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:31:32,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.530')] -[2023-09-26 22:31:37,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6164.8). Total num frames: 14131200. Throughput: 0: 773.5, 1: 774.6. Samples: 3530453. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:31:37,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.620')] -[2023-09-26 22:31:42,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14155776. Throughput: 0: 775.3, 1: 775.9. Samples: 3539456. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:31:42,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.620')] -[2023-09-26 22:31:44,494][52540] Updated weights for policy 0, policy_version 27680 (0.0018) -[2023-09-26 22:31:44,494][52541] Updated weights for policy 1, policy_version 27680 (0.0018) -[2023-09-26 22:31:47,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14188544. Throughput: 0: 778.1, 1: 777.8. Samples: 3544337. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:31:47,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.620')] -[2023-09-26 22:31:52,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6212.3, 300 sec: 6164.8). Total num frames: 14221312. Throughput: 0: 774.1, 1: 774.1. Samples: 3553367. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:31:52,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.860')] -[2023-09-26 22:31:57,769][52540] Updated weights for policy 0, policy_version 27840 (0.0017) -[2023-09-26 22:31:57,769][52541] Updated weights for policy 1, policy_version 27840 (0.0016) -[2023-09-26 22:31:57,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6164.8). Total num frames: 14254080. Throughput: 0: 770.7, 1: 770.3. Samples: 3562809. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:31:57,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.860')] -[2023-09-26 22:32:02,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14278656. Throughput: 0: 771.4, 1: 772.6. Samples: 3567466. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:32:02,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.860')] -[2023-09-26 22:32:07,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14311424. Throughput: 0: 766.1, 1: 765.3. Samples: 3576149. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:32:07,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.910')] -[2023-09-26 22:32:11,308][52540] Updated weights for policy 0, policy_version 28000 (0.0018) -[2023-09-26 22:32:11,309][52541] Updated weights for policy 1, policy_version 28000 (0.0018) -[2023-09-26 22:32:12,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14344192. Throughput: 0: 769.7, 1: 769.5. Samples: 3585678. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:32:12,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.910')] -[2023-09-26 22:32:17,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6212.3, 300 sec: 6164.8). Total num frames: 14376960. Throughput: 0: 771.0, 1: 770.3. Samples: 3590144. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:32:17,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.910')] -[2023-09-26 22:32:22,914][51558] Fps is (10 sec: 5734.6, 60 sec: 6144.1, 300 sec: 6164.8). Total num frames: 14401536. Throughput: 0: 767.6, 1: 767.5. Samples: 3599532. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:32:22,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.910')] -[2023-09-26 22:32:24,561][52540] Updated weights for policy 0, policy_version 28160 (0.0018) -[2023-09-26 22:32:24,561][52541] Updated weights for policy 1, policy_version 28160 (0.0017) -[2023-09-26 22:32:27,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14434304. Throughput: 0: 768.3, 1: 767.8. Samples: 3608581. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:32:27,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.910')] -[2023-09-26 22:32:32,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14467072. Throughput: 0: 764.2, 1: 764.6. Samples: 3613136. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:32:32,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.910')] -[2023-09-26 22:32:37,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6007.5, 300 sec: 6137.1). Total num frames: 14491648. Throughput: 0: 766.3, 1: 766.6. Samples: 3622345. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 22:32:37,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.890')] -[2023-09-26 22:32:37,983][52540] Updated weights for policy 0, policy_version 28320 (0.0016) -[2023-09-26 22:32:37,984][52541] Updated weights for policy 1, policy_version 28320 (0.0017) -[2023-09-26 22:32:42,914][51558] Fps is (10 sec: 5734.2, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 14524416. Throughput: 0: 760.9, 1: 761.1. Samples: 3631302. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 22:32:42,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.890')] -[2023-09-26 22:32:47,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14557184. Throughput: 0: 760.4, 1: 759.6. Samples: 3635866. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 22:32:47,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.890')] -[2023-09-26 22:32:51,428][52541] Updated weights for policy 1, policy_version 28480 (0.0017) -[2023-09-26 22:32:51,428][52540] Updated weights for policy 0, policy_version 28480 (0.0018) -[2023-09-26 22:32:52,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14589952. Throughput: 0: 769.1, 1: 769.0. Samples: 3645363. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) -[2023-09-26 22:32:52,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.890')] -[2023-09-26 22:32:57,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14622720. Throughput: 0: 766.1, 1: 767.2. Samples: 3654673. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:32:57,915][51558] Avg episode reward: [(0, '0.980'), (1, '0.890')] -[2023-09-26 22:33:02,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14647296. Throughput: 0: 769.9, 1: 769.5. Samples: 3659418. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:33:02,915][51558] Avg episode reward: [(0, '0.970'), (1, '0.880')] -[2023-09-26 22:33:04,514][52540] Updated weights for policy 0, policy_version 28640 (0.0018) -[2023-09-26 22:33:04,514][52541] Updated weights for policy 1, policy_version 28640 (0.0017) -[2023-09-26 22:33:07,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14680064. Throughput: 0: 768.0, 1: 768.1. Samples: 3668655. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:33:07,915][51558] Avg episode reward: [(0, '0.970'), (1, '0.880')] -[2023-09-26 22:33:07,926][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000028672_7340032.pth... -[2023-09-26 22:33:07,927][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000028672_7340032.pth... -[2023-09-26 22:33:07,955][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000025792_6602752.pth -[2023-09-26 22:33:07,961][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000025792_6602752.pth -[2023-09-26 22:33:12,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14712832. Throughput: 0: 773.7, 1: 773.6. Samples: 3678208. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:33:12,915][51558] Avg episode reward: [(0, '0.970'), (1, '0.880')] -[2023-09-26 22:33:17,551][52540] Updated weights for policy 0, policy_version 28800 (0.0016) -[2023-09-26 22:33:17,552][52541] Updated weights for policy 1, policy_version 28800 (0.0017) -[2023-09-26 22:33:17,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14745600. Throughput: 0: 771.8, 1: 771.7. Samples: 3682593. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:33:17,915][51558] Avg episode reward: [(0, '0.970'), (1, '0.880')] -[2023-09-26 22:33:22,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6212.2, 300 sec: 6164.8). Total num frames: 14774272. Throughput: 0: 775.9, 1: 776.7. Samples: 3692209. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:33:22,915][51558] Avg episode reward: [(0, '0.970'), (1, '0.880')] -[2023-09-26 22:33:27,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14802944. Throughput: 0: 778.3, 1: 778.4. Samples: 3701352. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:33:27,915][51558] Avg episode reward: [(0, '0.970'), (1, '0.880')] -[2023-09-26 22:33:30,850][52541] Updated weights for policy 1, policy_version 28960 (0.0014) -[2023-09-26 22:33:30,850][52540] Updated weights for policy 0, policy_version 28960 (0.0017) -[2023-09-26 22:33:32,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14835712. Throughput: 0: 780.4, 1: 780.2. Samples: 3706092. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:33:32,915][51558] Avg episode reward: [(0, '0.960'), (1, '0.880')] -[2023-09-26 22:33:37,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.6, 300 sec: 6164.8). Total num frames: 14868480. Throughput: 0: 774.9, 1: 774.4. Samples: 3715081. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:33:37,915][51558] Avg episode reward: [(0, '0.960'), (1, '0.880')] -[2023-09-26 22:33:42,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.6, 300 sec: 6178.7). Total num frames: 14901248. Throughput: 0: 775.5, 1: 774.9. Samples: 3724442. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:33:42,915][51558] Avg episode reward: [(0, '0.960'), (1, '0.880')] -[2023-09-26 22:33:44,155][52540] Updated weights for policy 0, policy_version 29120 (0.0016) -[2023-09-26 22:33:44,155][52541] Updated weights for policy 1, policy_version 29120 (0.0018) -[2023-09-26 22:33:47,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14925824. Throughput: 0: 775.0, 1: 775.6. Samples: 3729198. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:33:47,915][51558] Avg episode reward: [(0, '0.960'), (1, '0.880')] -[2023-09-26 22:33:52,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14958592. Throughput: 0: 775.2, 1: 775.1. Samples: 3738419. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) -[2023-09-26 22:33:52,915][51558] Avg episode reward: [(0, '0.960'), (1, '0.880')] -[2023-09-26 22:33:57,100][52540] Updated weights for policy 0, policy_version 29280 (0.0018) -[2023-09-26 22:33:57,100][52541] Updated weights for policy 1, policy_version 29280 (0.0018) -[2023-09-26 22:33:57,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 14991360. Throughput: 0: 774.5, 1: 774.9. Samples: 3747930. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:33:57,915][51558] Avg episode reward: [(0, '0.960'), (1, '0.880')] -[2023-09-26 22:34:02,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 15024128. Throughput: 0: 778.6, 1: 778.9. Samples: 3752678. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:34:02,915][51558] Avg episode reward: [(0, '0.960'), (1, '0.880')] -[2023-09-26 22:34:07,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 15048704. Throughput: 0: 775.2, 1: 772.7. Samples: 3761865. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:34:07,915][51558] Avg episode reward: [(0, '0.960'), (1, '0.880')] -[2023-09-26 22:34:10,634][52540] Updated weights for policy 0, policy_version 29440 (0.0015) -[2023-09-26 22:34:10,635][52541] Updated weights for policy 1, policy_version 29440 (0.0017) -[2023-09-26 22:34:12,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 15081472. Throughput: 0: 770.9, 1: 771.3. Samples: 3770750. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) -[2023-09-26 22:34:12,915][51558] Avg episode reward: [(0, '0.950'), (1, '0.880')] -[2023-09-26 22:34:17,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 15114240. Throughput: 0: 772.6, 1: 772.2. Samples: 3775610. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:34:17,915][51558] Avg episode reward: [(0, '0.950'), (1, '0.880')] -[2023-09-26 22:34:22,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6212.3, 300 sec: 6178.7). Total num frames: 15147008. Throughput: 0: 774.4, 1: 774.7. Samples: 3784792. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:34:22,915][51558] Avg episode reward: [(0, '0.950'), (1, '0.880')] -[2023-09-26 22:34:23,739][52541] Updated weights for policy 1, policy_version 29600 (0.0019) -[2023-09-26 22:34:23,739][52540] Updated weights for policy 0, policy_version 29600 (0.0019) -[2023-09-26 22:34:27,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 15179776. Throughput: 0: 775.4, 1: 774.8. Samples: 3794203. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:34:27,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.880')] -[2023-09-26 22:34:32,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 15204352. Throughput: 0: 776.2, 1: 775.9. Samples: 3799040. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:34:32,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.880')] -[2023-09-26 22:34:36,979][52540] Updated weights for policy 0, policy_version 29760 (0.0018) -[2023-09-26 22:34:36,980][52541] Updated weights for policy 1, policy_version 29760 (0.0017) -[2023-09-26 22:34:37,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 15237120. Throughput: 0: 774.9, 1: 774.9. Samples: 3808159. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:34:37,915][51558] Avg episode reward: [(0, '0.940'), (1, '0.880')] -[2023-09-26 22:34:42,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 15269888. Throughput: 0: 772.9, 1: 772.5. Samples: 3817472. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:34:42,915][51558] Avg episode reward: [(0, '0.920'), (1, '-0.360')] -[2023-09-26 22:34:47,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 15302656. Throughput: 0: 770.0, 1: 769.7. Samples: 3821963. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:34:47,915][51558] Avg episode reward: [(0, '0.920'), (1, '-0.360')] -[2023-09-26 22:34:50,046][52540] Updated weights for policy 0, policy_version 29920 (0.0017) -[2023-09-26 22:34:50,046][52541] Updated weights for policy 1, policy_version 29920 (0.0018) -[2023-09-26 22:34:52,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 15335424. Throughput: 0: 776.7, 1: 777.6. Samples: 3831808. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:34:52,915][51558] Avg episode reward: [(0, '0.480'), (1, '-0.360')] -[2023-09-26 22:34:57,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 15360000. Throughput: 0: 780.0, 1: 780.1. Samples: 3840955. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:34:57,915][51558] Avg episode reward: [(0, '0.480'), (1, '-0.360')] -[2023-09-26 22:35:02,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 15392768. Throughput: 0: 779.1, 1: 779.9. Samples: 3845764. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:35:02,915][51558] Avg episode reward: [(0, '0.480'), (1, '-0.360')] -[2023-09-26 22:35:03,233][52540] Updated weights for policy 0, policy_version 30080 (0.0016) -[2023-09-26 22:35:03,234][52541] Updated weights for policy 1, policy_version 30080 (0.0014) -[2023-09-26 22:35:07,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 15425536. Throughput: 0: 777.2, 1: 777.0. Samples: 3854727. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:35:07,915][51558] Avg episode reward: [(0, '0.470'), (1, '-0.330')] -[2023-09-26 22:35:07,925][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000030128_7712768.pth... -[2023-09-26 22:35:07,925][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000030128_7712768.pth... -[2023-09-26 22:35:07,962][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000027232_6971392.pth -[2023-09-26 22:35:07,969][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000027232_6971392.pth -[2023-09-26 22:35:12,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 15458304. Throughput: 0: 778.1, 1: 778.8. Samples: 3864267. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:35:12,915][51558] Avg episode reward: [(0, '0.470'), (1, '-0.330')] -[2023-09-26 22:35:16,511][52541] Updated weights for policy 1, policy_version 30240 (0.0018) -[2023-09-26 22:35:16,511][52540] Updated weights for policy 0, policy_version 30240 (0.0015) -[2023-09-26 22:35:17,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 15491072. Throughput: 0: 773.8, 1: 773.8. Samples: 3868681. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:35:17,915][51558] Avg episode reward: [(0, '0.470'), (1, '-0.330')] -[2023-09-26 22:35:22,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 15515648. Throughput: 0: 772.1, 1: 773.5. Samples: 3877712. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:35:22,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.400')] -[2023-09-26 22:35:27,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 15548416. Throughput: 0: 773.7, 1: 773.7. Samples: 3887105. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:35:27,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.400')] -[2023-09-26 22:35:29,854][52540] Updated weights for policy 0, policy_version 30400 (0.0017) -[2023-09-26 22:35:29,854][52541] Updated weights for policy 1, policy_version 30400 (0.0018) -[2023-09-26 22:35:32,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 15581184. Throughput: 0: 774.9, 1: 775.6. Samples: 3891736. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:35:32,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.400')] -[2023-09-26 22:35:37,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 15605760. Throughput: 0: 768.7, 1: 767.7. Samples: 3900947. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:35:37,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.390')] -[2023-09-26 22:35:42,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 15638528. Throughput: 0: 766.9, 1: 766.1. Samples: 3909941. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:35:42,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.390')] -[2023-09-26 22:35:43,341][52540] Updated weights for policy 0, policy_version 30560 (0.0017) -[2023-09-26 22:35:43,341][52541] Updated weights for policy 1, policy_version 30560 (0.0019) -[2023-09-26 22:35:47,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6178.7). Total num frames: 15671296. Throughput: 0: 764.8, 1: 766.0. Samples: 3914648. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:35:47,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.390')] -[2023-09-26 22:35:52,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 15704064. Throughput: 0: 769.4, 1: 769.1. Samples: 3923961. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:35:52,915][51558] Avg episode reward: [(0, '0.440'), (1, '-1.380')] -[2023-09-26 22:35:56,568][52541] Updated weights for policy 1, policy_version 30720 (0.0016) -[2023-09-26 22:35:56,568][52540] Updated weights for policy 0, policy_version 30720 (0.0016) -[2023-09-26 22:35:57,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6212.3, 300 sec: 6178.7). Total num frames: 15732736. Throughput: 0: 765.6, 1: 767.4. Samples: 3933251. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:35:57,915][51558] Avg episode reward: [(0, '0.440'), (1, '-1.380')] -[2023-09-26 22:36:02,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 15761408. Throughput: 0: 769.4, 1: 770.8. Samples: 3937989. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:36:02,915][51558] Avg episode reward: [(0, '0.440'), (1, '-1.380')] -[2023-09-26 22:36:07,914][51558] Fps is (10 sec: 6144.1, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 15794176. Throughput: 0: 769.0, 1: 767.6. Samples: 3946858. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:36:07,915][51558] Avg episode reward: [(0, '0.440'), (1, '-1.380')] -[2023-09-26 22:36:09,860][52541] Updated weights for policy 1, policy_version 30880 (0.0015) -[2023-09-26 22:36:09,861][52540] Updated weights for policy 0, policy_version 30880 (0.0017) -[2023-09-26 22:36:12,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6178.7). Total num frames: 15826944. Throughput: 0: 773.0, 1: 772.7. Samples: 3956659. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:36:12,914][51558] Avg episode reward: [(0, '0.440'), (1, '-1.380')] -[2023-09-26 22:36:17,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 15859712. Throughput: 0: 771.2, 1: 770.5. Samples: 3961113. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:36:17,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.380')] -[2023-09-26 22:36:22,883][52540] Updated weights for policy 0, policy_version 31040 (0.0017) -[2023-09-26 22:36:22,883][52541] Updated weights for policy 1, policy_version 31040 (0.0016) -[2023-09-26 22:36:22,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 15892480. Throughput: 0: 775.1, 1: 775.1. Samples: 3970706. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:36:22,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.380')] -[2023-09-26 22:36:27,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 15917056. Throughput: 0: 775.3, 1: 776.0. Samples: 3979747. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:36:27,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.380')] -[2023-09-26 22:36:32,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 15949824. Throughput: 0: 774.2, 1: 772.8. Samples: 3984264. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:36:32,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.380')] -[2023-09-26 22:36:36,389][52540] Updated weights for policy 0, policy_version 31200 (0.0015) -[2023-09-26 22:36:36,389][52541] Updated weights for policy 1, policy_version 31200 (0.0015) -[2023-09-26 22:36:37,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.6, 300 sec: 6192.6). Total num frames: 15982592. Throughput: 0: 773.1, 1: 773.9. Samples: 3993575. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:36:37,914][51558] Avg episode reward: [(0, '0.450'), (1, '-1.380')] -[2023-09-26 22:36:42,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 16007168. Throughput: 0: 770.1, 1: 768.8. Samples: 4002501. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:36:42,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.380')] -[2023-09-26 22:36:47,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 16039936. Throughput: 0: 770.9, 1: 770.9. Samples: 4007371. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:36:47,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.370')] -[2023-09-26 22:36:49,603][52540] Updated weights for policy 0, policy_version 31360 (0.0018) -[2023-09-26 22:36:49,603][52541] Updated weights for policy 1, policy_version 31360 (0.0017) -[2023-09-26 22:36:52,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 16072704. Throughput: 0: 774.7, 1: 774.5. Samples: 4016574. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:36:52,914][51558] Avg episode reward: [(0, '0.450'), (1, '-1.370')] -[2023-09-26 22:36:57,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6212.3, 300 sec: 6192.6). Total num frames: 16105472. Throughput: 0: 770.9, 1: 770.4. Samples: 4026018. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:36:57,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.370')] -[2023-09-26 22:37:02,888][52541] Updated weights for policy 1, policy_version 31520 (0.0018) -[2023-09-26 22:37:02,888][52540] Updated weights for policy 0, policy_version 31520 (0.0017) -[2023-09-26 22:37:02,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.6, 300 sec: 6192.6). Total num frames: 16138240. Throughput: 0: 770.8, 1: 770.3. Samples: 4030463. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:37:02,914][51558] Avg episode reward: [(0, '0.450'), (1, '-1.340')] -[2023-09-26 22:37:07,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 16162816. Throughput: 0: 767.5, 1: 767.8. Samples: 4039793. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:37:07,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.340')] -[2023-09-26 22:37:07,924][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000031568_8081408.pth... -[2023-09-26 22:37:07,924][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000031568_8081408.pth... -[2023-09-26 22:37:07,959][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000028672_7340032.pth -[2023-09-26 22:37:07,959][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000028672_7340032.pth -[2023-09-26 22:37:12,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 16195584. Throughput: 0: 769.8, 1: 769.5. Samples: 4049016. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:37:12,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.340')] -[2023-09-26 22:37:16,114][52540] Updated weights for policy 0, policy_version 31680 (0.0017) -[2023-09-26 22:37:16,114][52541] Updated weights for policy 1, policy_version 31680 (0.0018) -[2023-09-26 22:37:17,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 16228352. Throughput: 0: 771.4, 1: 771.0. Samples: 4053675. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:37:17,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.340')] -[2023-09-26 22:37:22,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 16261120. Throughput: 0: 774.2, 1: 773.7. Samples: 4063232. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:37:22,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.340')] -[2023-09-26 22:37:27,914][51558] Fps is (10 sec: 6143.9, 60 sec: 6212.3, 300 sec: 6178.7). Total num frames: 16289792. Throughput: 0: 777.5, 1: 776.4. Samples: 4072424. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:37:27,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.340')] -[2023-09-26 22:37:29,304][52540] Updated weights for policy 0, policy_version 31840 (0.0020) -[2023-09-26 22:37:29,306][52541] Updated weights for policy 1, policy_version 31840 (0.0019) -[2023-09-26 22:37:32,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 16318464. Throughput: 0: 775.4, 1: 774.4. Samples: 4077113. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:37:32,914][51558] Avg episode reward: [(0, '0.450'), (1, '-1.340')] -[2023-09-26 22:37:37,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 16351232. Throughput: 0: 772.2, 1: 772.4. Samples: 4086082. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) -[2023-09-26 22:37:37,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.340')] -[2023-09-26 22:37:42,500][52541] Updated weights for policy 1, policy_version 32000 (0.0018) -[2023-09-26 22:37:42,502][52540] Updated weights for policy 0, policy_version 32000 (0.0017) -[2023-09-26 22:37:42,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 16384000. Throughput: 0: 774.2, 1: 775.5. Samples: 4095753. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:37:42,915][51558] Avg episode reward: [(0, '0.440'), (1, '-1.510')] -[2023-09-26 22:37:47,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.6, 300 sec: 6192.6). Total num frames: 16416768. Throughput: 0: 774.0, 1: 774.3. Samples: 4100135. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:37:47,915][51558] Avg episode reward: [(0, '0.440'), (1, '-1.510')] -[2023-09-26 22:37:52,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 16441344. Throughput: 0: 777.4, 1: 777.3. Samples: 4109756. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:37:52,915][51558] Avg episode reward: [(0, '0.440'), (1, '-1.510')] -[2023-09-26 22:37:55,619][52540] Updated weights for policy 0, policy_version 32160 (0.0012) -[2023-09-26 22:37:55,620][52541] Updated weights for policy 1, policy_version 32160 (0.0017) -[2023-09-26 22:37:57,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 16474112. Throughput: 0: 777.9, 1: 777.2. Samples: 4118996. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:37:57,915][51558] Avg episode reward: [(0, '0.440'), (1, '-1.630')] -[2023-09-26 22:38:02,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 16506880. Throughput: 0: 779.7, 1: 780.3. Samples: 4123877. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:38:02,915][51558] Avg episode reward: [(0, '0.440'), (1, '-1.630')] -[2023-09-26 22:38:07,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 16539648. Throughput: 0: 778.6, 1: 778.9. Samples: 4133317. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:38:07,915][51558] Avg episode reward: [(0, '0.440'), (1, '-1.630')] -[2023-09-26 22:38:08,500][52541] Updated weights for policy 1, policy_version 32320 (0.0017) -[2023-09-26 22:38:08,500][52540] Updated weights for policy 0, policy_version 32320 (0.0017) -[2023-09-26 22:38:12,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 16572416. Throughput: 0: 784.1, 1: 785.4. Samples: 4143049. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:38:12,915][51558] Avg episode reward: [(0, '0.440'), (1, '-1.630')] -[2023-09-26 22:38:17,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6280.5, 300 sec: 6206.5). Total num frames: 16605184. Throughput: 0: 779.9, 1: 779.8. Samples: 4147301. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:38:17,915][51558] Avg episode reward: [(0, '0.440'), (1, '-1.630')] -[2023-09-26 22:38:21,604][52541] Updated weights for policy 1, policy_version 32480 (0.0016) -[2023-09-26 22:38:21,604][52540] Updated weights for policy 0, policy_version 32480 (0.0018) -[2023-09-26 22:38:22,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 16637952. Throughput: 0: 788.5, 1: 788.3. Samples: 4157038. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:38:22,915][51558] Avg episode reward: [(0, '0.410'), (1, '-1.610')] -[2023-09-26 22:38:27,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6212.3, 300 sec: 6192.6). Total num frames: 16662528. Throughput: 0: 786.8, 1: 786.3. Samples: 4166546. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:38:27,915][51558] Avg episode reward: [(0, '0.410'), (1, '-1.610')] -[2023-09-26 22:38:32,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 16695296. Throughput: 0: 792.1, 1: 792.5. Samples: 4171440. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:38:32,915][51558] Avg episode reward: [(0, '0.410'), (1, '-1.610')] -[2023-09-26 22:38:34,385][52540] Updated weights for policy 0, policy_version 32640 (0.0017) -[2023-09-26 22:38:34,385][52541] Updated weights for policy 1, policy_version 32640 (0.0015) -[2023-09-26 22:38:37,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.5, 300 sec: 6192.6). Total num frames: 16728064. Throughput: 0: 788.7, 1: 789.9. Samples: 4180791. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:38:37,915][51558] Avg episode reward: [(0, '0.410'), (1, '-1.610')] -[2023-09-26 22:38:42,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 16760832. Throughput: 0: 791.4, 1: 792.1. Samples: 4190254. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:38:42,915][51558] Avg episode reward: [(0, '0.410'), (1, '-1.610')] -[2023-09-26 22:38:47,318][52541] Updated weights for policy 1, policy_version 32800 (0.0017) -[2023-09-26 22:38:47,318][52540] Updated weights for policy 0, policy_version 32800 (0.0017) -[2023-09-26 22:38:47,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 16793600. Throughput: 0: 793.0, 1: 792.3. Samples: 4195215. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:38:47,915][51558] Avg episode reward: [(0, '0.410'), (1, '-1.600')] -[2023-09-26 22:38:52,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6417.1, 300 sec: 6220.4). Total num frames: 16826368. Throughput: 0: 791.6, 1: 791.3. Samples: 4204544. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:38:52,915][51558] Avg episode reward: [(0, '0.420'), (1, '-1.600')] -[2023-09-26 22:38:57,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6417.1, 300 sec: 6220.4). Total num frames: 16859136. Throughput: 0: 789.4, 1: 789.0. Samples: 4214077. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:38:57,915][51558] Avg episode reward: [(0, '0.420'), (1, '-1.600')] -[2023-09-26 22:39:00,412][52540] Updated weights for policy 0, policy_version 32960 (0.0015) -[2023-09-26 22:39:00,413][52541] Updated weights for policy 1, policy_version 32960 (0.0014) -[2023-09-26 22:39:02,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 16883712. Throughput: 0: 795.3, 1: 794.8. Samples: 4218857. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:39:02,915][51558] Avg episode reward: [(0, '0.420'), (1, '-1.600')] -[2023-09-26 22:39:07,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 16916480. Throughput: 0: 790.9, 1: 791.5. Samples: 4228247. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:39:07,915][51558] Avg episode reward: [(0, '0.420'), (1, '-1.600')] -[2023-09-26 22:39:08,082][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000033056_8462336.pth... -[2023-09-26 22:39:08,109][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000030128_7712768.pth -[2023-09-26 22:39:08,114][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000033056_8462336.pth... -[2023-09-26 22:39:08,141][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000030128_7712768.pth -[2023-09-26 22:39:12,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 16949248. Throughput: 0: 788.6, 1: 788.8. Samples: 4237525. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:39:12,915][51558] Avg episode reward: [(0, '0.420'), (1, '-1.600')] -[2023-09-26 22:39:13,394][52540] Updated weights for policy 0, policy_version 33120 (0.0017) -[2023-09-26 22:39:13,394][52541] Updated weights for policy 1, policy_version 33120 (0.0018) -[2023-09-26 22:39:17,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 16982016. Throughput: 0: 787.1, 1: 786.3. Samples: 4242245. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:39:17,915][51558] Avg episode reward: [(0, '0.430'), (1, '-1.610')] -[2023-09-26 22:39:22,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.6, 300 sec: 6220.4). Total num frames: 17014784. Throughput: 0: 787.4, 1: 787.2. Samples: 4251648. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:39:22,915][51558] Avg episode reward: [(0, '0.430'), (1, '-1.610')] -[2023-09-26 22:39:26,480][52540] Updated weights for policy 0, policy_version 33280 (0.0016) -[2023-09-26 22:39:26,481][52541] Updated weights for policy 1, policy_version 33280 (0.0016) -[2023-09-26 22:39:27,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6417.1, 300 sec: 6248.1). Total num frames: 17047552. Throughput: 0: 788.0, 1: 787.5. Samples: 4261155. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:39:27,915][51558] Avg episode reward: [(0, '0.430'), (1, '-1.610')] -[2023-09-26 22:39:32,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 17072128. Throughput: 0: 784.3, 1: 782.7. Samples: 4265729. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:39:32,915][51558] Avg episode reward: [(0, '0.420'), (1, '-1.610')] -[2023-09-26 22:39:37,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 17104896. Throughput: 0: 780.7, 1: 781.0. Samples: 4274819. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:39:37,915][51558] Avg episode reward: [(0, '0.420'), (1, '-1.610')] -[2023-09-26 22:39:39,774][52540] Updated weights for policy 0, policy_version 33440 (0.0015) -[2023-09-26 22:39:39,774][52541] Updated weights for policy 1, policy_version 33440 (0.0015) -[2023-09-26 22:39:42,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6280.6, 300 sec: 6220.4). Total num frames: 17137664. Throughput: 0: 781.9, 1: 780.6. Samples: 4284389. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:39:42,914][51558] Avg episode reward: [(0, '0.420'), (1, '-1.610')] -[2023-09-26 22:39:47,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 17170432. Throughput: 0: 774.0, 1: 774.2. Samples: 4288526. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:39:47,915][51558] Avg episode reward: [(0, '0.410'), (1, '-1.610')] -[2023-09-26 22:39:52,914][51558] Fps is (10 sec: 5734.2, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17195008. Throughput: 0: 771.5, 1: 771.3. Samples: 4297673. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:39:52,915][51558] Avg episode reward: [(0, '0.410'), (1, '-1.610')] -[2023-09-26 22:39:53,287][52541] Updated weights for policy 1, policy_version 33600 (0.0017) -[2023-09-26 22:39:53,287][52540] Updated weights for policy 0, policy_version 33600 (0.0017) -[2023-09-26 22:39:57,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17227776. Throughput: 0: 771.5, 1: 771.2. Samples: 4306947. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:39:57,915][51558] Avg episode reward: [(0, '0.420'), (1, '-2.030')] -[2023-09-26 22:40:02,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 17260544. Throughput: 0: 771.3, 1: 771.5. Samples: 4311669. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:40:02,915][51558] Avg episode reward: [(0, '0.420'), (1, '-2.030')] -[2023-09-26 22:40:06,340][52540] Updated weights for policy 0, policy_version 33760 (0.0017) -[2023-09-26 22:40:06,340][52541] Updated weights for policy 1, policy_version 33760 (0.0016) -[2023-09-26 22:40:07,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.6, 300 sec: 6220.4). Total num frames: 17293312. Throughput: 0: 773.7, 1: 773.7. Samples: 4321280. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:40:07,915][51558] Avg episode reward: [(0, '0.420'), (1, '-2.030')] -[2023-09-26 22:40:12,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 17317888. Throughput: 0: 768.8, 1: 768.6. Samples: 4330340. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:40:12,915][51558] Avg episode reward: [(0, '0.430'), (1, '-2.470')] -[2023-09-26 22:40:17,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17350656. Throughput: 0: 769.8, 1: 771.7. Samples: 4335099. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:40:17,915][51558] Avg episode reward: [(0, '0.430'), (1, '-2.470')] -[2023-09-26 22:40:19,535][52540] Updated weights for policy 0, policy_version 33920 (0.0017) -[2023-09-26 22:40:19,536][52541] Updated weights for policy 1, policy_version 33920 (0.0017) -[2023-09-26 22:40:22,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17383424. Throughput: 0: 770.9, 1: 771.2. Samples: 4344216. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:40:22,915][51558] Avg episode reward: [(0, '0.430'), (1, '-2.470')] -[2023-09-26 22:40:27,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17416192. Throughput: 0: 768.8, 1: 770.3. Samples: 4353650. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:40:27,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.410')] -[2023-09-26 22:40:32,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17440768. Throughput: 0: 773.6, 1: 773.5. Samples: 4358144. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:40:32,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.410')] -[2023-09-26 22:40:32,955][52541] Updated weights for policy 1, policy_version 34080 (0.0018) -[2023-09-26 22:40:32,955][52540] Updated weights for policy 0, policy_version 34080 (0.0017) -[2023-09-26 22:40:37,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17473536. Throughput: 0: 773.3, 1: 773.5. Samples: 4367281. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:40:37,915][51558] Avg episode reward: [(0, '0.450'), (1, '-1.410')] -[2023-09-26 22:40:42,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17506304. Throughput: 0: 773.6, 1: 773.5. Samples: 4376568. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:40:42,915][51558] Avg episode reward: [(0, '0.890'), (1, '-1.420')] -[2023-09-26 22:40:46,389][52540] Updated weights for policy 0, policy_version 34240 (0.0013) -[2023-09-26 22:40:46,390][52541] Updated weights for policy 1, policy_version 34240 (0.0016) -[2023-09-26 22:40:47,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17539072. Throughput: 0: 767.8, 1: 768.1. Samples: 4380788. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:40:47,915][51558] Avg episode reward: [(0, '0.890'), (1, '-1.420')] -[2023-09-26 22:40:52,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6206.5). Total num frames: 17563648. Throughput: 0: 766.9, 1: 767.8. Samples: 4390340. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:40:52,915][51558] Avg episode reward: [(0, '0.900'), (1, '-1.420')] -[2023-09-26 22:40:57,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17596416. Throughput: 0: 765.5, 1: 765.9. Samples: 4399255. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) -[2023-09-26 22:40:57,915][51558] Avg episode reward: [(0, '0.900'), (1, '-1.530')] -[2023-09-26 22:40:59,757][52541] Updated weights for policy 1, policy_version 34400 (0.0017) -[2023-09-26 22:40:59,757][52540] Updated weights for policy 0, policy_version 34400 (0.0018) -[2023-09-26 22:41:02,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17629184. Throughput: 0: 764.5, 1: 764.8. Samples: 4403919. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:41:02,915][51558] Avg episode reward: [(0, '0.900'), (1, '-1.530')] -[2023-09-26 22:41:07,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17661952. Throughput: 0: 769.4, 1: 768.9. Samples: 4413440. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:41:07,915][51558] Avg episode reward: [(0, '0.920'), (1, '-0.670')] -[2023-09-26 22:41:07,925][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000034496_8830976.pth... -[2023-09-26 22:41:07,926][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000034496_8830976.pth... -[2023-09-26 22:41:07,955][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000031568_8081408.pth -[2023-09-26 22:41:07,962][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000031568_8081408.pth -[2023-09-26 22:41:12,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 17686528. Throughput: 0: 763.7, 1: 763.5. Samples: 4422376. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:41:12,915][51558] Avg episode reward: [(0, '0.920'), (1, '-0.670')] -[2023-09-26 22:41:13,124][52540] Updated weights for policy 0, policy_version 34560 (0.0016) -[2023-09-26 22:41:13,124][52541] Updated weights for policy 1, policy_version 34560 (0.0017) -[2023-09-26 22:41:17,914][51558] Fps is (10 sec: 5734.6, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 17719296. Throughput: 0: 766.8, 1: 765.8. Samples: 4427111. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:41:17,914][51558] Avg episode reward: [(0, '0.920'), (1, '-0.670')] -[2023-09-26 22:41:22,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17752064. Throughput: 0: 764.9, 1: 764.5. Samples: 4436104. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) -[2023-09-26 22:41:22,915][51558] Avg episode reward: [(0, '0.910'), (1, '-0.670')] -[2023-09-26 22:41:26,389][52540] Updated weights for policy 0, policy_version 34720 (0.0018) -[2023-09-26 22:41:26,389][52541] Updated weights for policy 1, policy_version 34720 (0.0018) -[2023-09-26 22:41:27,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17784832. Throughput: 0: 766.7, 1: 767.3. Samples: 4445600. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:41:27,915][51558] Avg episode reward: [(0, '0.910'), (1, '-0.670')] -[2023-09-26 22:41:32,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 17809408. Throughput: 0: 770.9, 1: 769.5. Samples: 4450104. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:41:32,915][51558] Avg episode reward: [(0, '0.910'), (1, '-0.670')] -[2023-09-26 22:41:37,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17842176. Throughput: 0: 764.6, 1: 764.7. Samples: 4459159. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:41:37,915][51558] Avg episode reward: [(0, '0.920'), (1, '-0.670')] -[2023-09-26 22:41:39,824][52540] Updated weights for policy 0, policy_version 34880 (0.0017) -[2023-09-26 22:41:39,825][52541] Updated weights for policy 1, policy_version 34880 (0.0016) -[2023-09-26 22:41:42,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17874944. Throughput: 0: 770.4, 1: 769.6. Samples: 4468552. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:41:42,915][51558] Avg episode reward: [(0, '0.920'), (1, '-0.670')] -[2023-09-26 22:41:47,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17907712. Throughput: 0: 766.3, 1: 766.0. Samples: 4472870. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:41:47,915][51558] Avg episode reward: [(0, '0.920'), (1, '-0.670')] -[2023-09-26 22:41:52,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 17932288. Throughput: 0: 766.0, 1: 766.6. Samples: 4482410. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:41:52,915][51558] Avg episode reward: [(0, '0.920'), (1, '-0.670')] -[2023-09-26 22:41:52,960][52541] Updated weights for policy 1, policy_version 35040 (0.0017) -[2023-09-26 22:41:52,960][52540] Updated weights for policy 0, policy_version 35040 (0.0017) -[2023-09-26 22:41:57,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 17965056. Throughput: 0: 770.8, 1: 769.3. Samples: 4491680. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:41:57,915][51558] Avg episode reward: [(0, '0.920'), (1, '-0.670')] -[2023-09-26 22:42:02,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 17997824. Throughput: 0: 769.0, 1: 768.9. Samples: 4496317. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:42:02,915][51558] Avg episode reward: [(0, '0.920'), (1, '-0.670')] -[2023-09-26 22:42:06,226][52541] Updated weights for policy 1, policy_version 35200 (0.0017) -[2023-09-26 22:42:06,226][52540] Updated weights for policy 0, policy_version 35200 (0.0018) -[2023-09-26 22:42:07,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 18030592. Throughput: 0: 772.5, 1: 771.9. Samples: 4505601. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:42:07,915][51558] Avg episode reward: [(0, '0.910'), (1, '-0.670')] -[2023-09-26 22:42:12,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 18063360. Throughput: 0: 770.4, 1: 769.9. Samples: 4514915. Policy #0 lag: (min: 8.0, avg: 8.0, max: 8.0) -[2023-09-26 22:42:12,915][51558] Avg episode reward: [(0, '0.910'), (1, '-0.670')] -[2023-09-26 22:42:17,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 18087936. Throughput: 0: 773.5, 1: 775.7. Samples: 4519819. Policy #0 lag: (min: 8.0, avg: 8.0, max: 8.0) -[2023-09-26 22:42:17,914][51558] Avg episode reward: [(0, '0.910'), (1, '-0.670')] -[2023-09-26 22:42:19,339][52541] Updated weights for policy 1, policy_version 35360 (0.0017) -[2023-09-26 22:42:19,339][52540] Updated weights for policy 0, policy_version 35360 (0.0016) -[2023-09-26 22:42:22,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6206.5). Total num frames: 18120704. Throughput: 0: 777.9, 1: 777.2. Samples: 4529137. Policy #0 lag: (min: 8.0, avg: 8.0, max: 8.0) -[2023-09-26 22:42:22,914][51558] Avg episode reward: [(0, '0.910'), (1, '-0.670')] -[2023-09-26 22:42:27,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 18153472. Throughput: 0: 775.4, 1: 776.0. Samples: 4538368. Policy #0 lag: (min: 8.0, avg: 8.0, max: 8.0) -[2023-09-26 22:42:27,915][51558] Avg episode reward: [(0, '0.910'), (1, '-0.670')] -[2023-09-26 22:42:32,716][52541] Updated weights for policy 1, policy_version 35520 (0.0018) -[2023-09-26 22:42:32,717][52540] Updated weights for policy 0, policy_version 35520 (0.0017) -[2023-09-26 22:42:32,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.6, 300 sec: 6220.4). Total num frames: 18186240. Throughput: 0: 775.7, 1: 775.7. Samples: 4542684. Policy #0 lag: (min: 8.0, avg: 8.0, max: 8.0) -[2023-09-26 22:42:32,915][51558] Avg episode reward: [(0, '0.810'), (1, '-0.670')] -[2023-09-26 22:42:37,914][51558] Fps is (10 sec: 6143.9, 60 sec: 6212.3, 300 sec: 6206.5). Total num frames: 18214912. Throughput: 0: 777.3, 1: 776.6. Samples: 4552333. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:42:37,915][51558] Avg episode reward: [(0, '0.810'), (1, '-0.670')] -[2023-09-26 22:42:42,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 18243584. Throughput: 0: 769.8, 1: 770.9. Samples: 4561014. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:42:42,915][51558] Avg episode reward: [(0, '0.810'), (1, '-0.670')] -[2023-09-26 22:42:46,038][52540] Updated weights for policy 0, policy_version 35680 (0.0017) -[2023-09-26 22:42:46,038][52541] Updated weights for policy 1, policy_version 35680 (0.0016) -[2023-09-26 22:42:47,914][51558] Fps is (10 sec: 6144.0, 60 sec: 6144.0, 300 sec: 6220.4). Total num frames: 18276352. Throughput: 0: 771.6, 1: 772.9. Samples: 4565820. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:42:47,915][51558] Avg episode reward: [(0, '0.810'), (1, '-0.670')] -[2023-09-26 22:42:52,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6280.5, 300 sec: 6220.4). Total num frames: 18309120. Throughput: 0: 773.7, 1: 773.7. Samples: 4575232. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:42:52,915][51558] Avg episode reward: [(0, '0.810'), (1, '-0.670')] -[2023-09-26 22:42:57,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 18333696. Throughput: 0: 774.0, 1: 773.4. Samples: 4584550. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) -[2023-09-26 22:42:57,915][51558] Avg episode reward: [(0, '0.810'), (1, '-0.670')] -[2023-09-26 22:42:59,433][52540] Updated weights for policy 0, policy_version 35840 (0.0017) -[2023-09-26 22:42:59,433][52541] Updated weights for policy 1, policy_version 35840 (0.0016) -[2023-09-26 22:43:02,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 18366464. Throughput: 0: 768.0, 1: 766.6. Samples: 4588874. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:43:02,915][51558] Avg episode reward: [(0, '0.810'), (1, '-0.670')] -[2023-09-26 22:43:07,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 18399232. Throughput: 0: 762.8, 1: 762.6. Samples: 4597779. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:43:07,915][51558] Avg episode reward: [(0, '0.810'), (1, '-0.670')] -[2023-09-26 22:43:07,927][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000035936_9199616.pth... -[2023-09-26 22:43:07,927][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000035936_9199616.pth... -[2023-09-26 22:43:07,963][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000033056_8462336.pth -[2023-09-26 22:43:07,965][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000033056_8462336.pth -[2023-09-26 22:43:12,896][52540] Updated weights for policy 0, policy_version 36000 (0.0017) -[2023-09-26 22:43:12,897][52541] Updated weights for policy 1, policy_version 36000 (0.0017) -[2023-09-26 22:43:12,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 18432000. Throughput: 0: 762.1, 1: 762.3. Samples: 4606967. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:43:12,915][51558] Avg episode reward: [(0, '0.810'), (1, '-0.670')] -[2023-09-26 22:43:17,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 18456576. Throughput: 0: 769.9, 1: 769.8. Samples: 4611971. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:43:17,915][51558] Avg episode reward: [(0, '0.800'), (1, '-0.670')] -[2023-09-26 22:43:22,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 18489344. Throughput: 0: 762.6, 1: 762.7. Samples: 4620975. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:43:22,915][51558] Avg episode reward: [(0, '0.800'), (1, '-0.670')] -[2023-09-26 22:43:26,217][52540] Updated weights for policy 0, policy_version 36160 (0.0019) -[2023-09-26 22:43:26,217][52541] Updated weights for policy 1, policy_version 36160 (0.0020) -[2023-09-26 22:43:27,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 18522112. Throughput: 0: 770.3, 1: 768.9. Samples: 4630276. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:43:27,915][51558] Avg episode reward: [(0, '0.800'), (1, '-0.500')] -[2023-09-26 22:43:32,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6192.6). Total num frames: 18554880. Throughput: 0: 764.6, 1: 764.4. Samples: 4634625. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:43:32,915][51558] Avg episode reward: [(0, '0.800'), (1, '-0.500')] -[2023-09-26 22:43:37,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6075.7, 300 sec: 6164.8). Total num frames: 18579456. Throughput: 0: 764.8, 1: 764.0. Samples: 4644030. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:43:37,915][51558] Avg episode reward: [(0, '0.800'), (1, '-0.500')] -[2023-09-26 22:43:39,700][52540] Updated weights for policy 0, policy_version 36320 (0.0017) -[2023-09-26 22:43:39,700][52541] Updated weights for policy 1, policy_version 36320 (0.0016) -[2023-09-26 22:43:42,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 18612224. Throughput: 0: 759.2, 1: 759.4. Samples: 4652890. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) -[2023-09-26 22:43:42,915][51558] Avg episode reward: [(0, '0.790'), (1, '-0.390')] -[2023-09-26 22:43:47,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 18644992. Throughput: 0: 759.5, 1: 760.1. Samples: 4657257. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:43:47,915][51558] Avg episode reward: [(0, '0.790'), (1, '-0.390')] -[2023-09-26 22:43:52,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6007.5, 300 sec: 6137.1). Total num frames: 18669568. Throughput: 0: 766.8, 1: 766.9. Samples: 4666796. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:43:52,915][51558] Avg episode reward: [(0, '0.790'), (1, '-0.390')] -[2023-09-26 22:43:52,970][52540] Updated weights for policy 0, policy_version 36480 (0.0015) -[2023-09-26 22:43:52,971][52541] Updated weights for policy 1, policy_version 36480 (0.0017) -[2023-09-26 22:43:57,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 18702336. Throughput: 0: 767.7, 1: 768.0. Samples: 4676071. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:43:57,915][51558] Avg episode reward: [(0, '0.790'), (1, '-0.400')] -[2023-09-26 22:44:02,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 18735104. Throughput: 0: 765.8, 1: 766.4. Samples: 4680920. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:44:02,915][51558] Avg episode reward: [(0, '0.790'), (1, '-0.400')] -[2023-09-26 22:44:06,202][52540] Updated weights for policy 0, policy_version 36640 (0.0016) -[2023-09-26 22:44:06,203][52541] Updated weights for policy 1, policy_version 36640 (0.0017) -[2023-09-26 22:44:07,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 18767872. Throughput: 0: 766.0, 1: 766.1. Samples: 4689923. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:44:07,915][51558] Avg episode reward: [(0, '0.790'), (1, '-0.400')] -[2023-09-26 22:44:12,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6007.5, 300 sec: 6137.1). Total num frames: 18792448. Throughput: 0: 762.6, 1: 762.8. Samples: 4698921. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:44:12,915][51558] Avg episode reward: [(0, '0.820'), (1, '-0.400')] -[2023-09-26 22:44:17,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 18825216. Throughput: 0: 764.4, 1: 764.2. Samples: 4703409. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:44:17,915][51558] Avg episode reward: [(0, '0.820'), (1, '-0.400')] -[2023-09-26 22:44:19,811][52540] Updated weights for policy 0, policy_version 36800 (0.0017) -[2023-09-26 22:44:19,811][52541] Updated weights for policy 1, policy_version 36800 (0.0016) -[2023-09-26 22:44:22,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 18857984. Throughput: 0: 761.1, 1: 762.2. Samples: 4712580. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:44:22,915][51558] Avg episode reward: [(0, '0.820'), (1, '-0.400')] -[2023-09-26 22:44:27,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 18890752. Throughput: 0: 769.2, 1: 770.2. Samples: 4722166. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:44:27,915][51558] Avg episode reward: [(0, '0.810'), (1, '-0.400')] -[2023-09-26 22:44:32,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6007.5, 300 sec: 6137.1). Total num frames: 18915328. Throughput: 0: 772.7, 1: 772.3. Samples: 4726784. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) -[2023-09-26 22:44:32,915][51558] Avg episode reward: [(0, '0.810'), (1, '-0.400')] -[2023-09-26 22:44:32,971][52540] Updated weights for policy 0, policy_version 36960 (0.0016) -[2023-09-26 22:44:32,972][52541] Updated weights for policy 1, policy_version 36960 (0.0016) -[2023-09-26 22:44:37,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 18948096. Throughput: 0: 767.4, 1: 767.7. Samples: 4735877. Policy #0 lag: (min: 8.0, avg: 8.0, max: 8.0) -[2023-09-26 22:44:37,914][51558] Avg episode reward: [(0, '0.810'), (1, '-0.400')] -[2023-09-26 22:44:42,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 18980864. Throughput: 0: 768.5, 1: 768.1. Samples: 4745216. Policy #0 lag: (min: 8.0, avg: 8.0, max: 8.0) -[2023-09-26 22:44:42,915][51558] Avg episode reward: [(0, '0.810'), (1, '-0.400')] -[2023-09-26 22:44:46,176][52541] Updated weights for policy 1, policy_version 37120 (0.0015) -[2023-09-26 22:44:46,176][52540] Updated weights for policy 0, policy_version 37120 (0.0018) -[2023-09-26 22:44:47,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 19013632. Throughput: 0: 765.3, 1: 765.0. Samples: 4749784. Policy #0 lag: (min: 8.0, avg: 8.0, max: 8.0) -[2023-09-26 22:44:47,915][51558] Avg episode reward: [(0, '0.810'), (1, '-0.400')] -[2023-09-26 22:44:52,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6164.8). Total num frames: 19046400. Throughput: 0: 770.1, 1: 770.9. Samples: 4759269. Policy #0 lag: (min: 8.0, avg: 8.0, max: 8.0) -[2023-09-26 22:44:52,915][51558] Avg episode reward: [(0, '0.810'), (1, '-0.400')] -[2023-09-26 22:44:57,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 19070976. Throughput: 0: 769.5, 1: 770.5. Samples: 4768224. Policy #0 lag: (min: 8.0, avg: 8.0, max: 8.0) -[2023-09-26 22:44:57,915][51558] Avg episode reward: [(0, '0.810'), (1, '-0.400')] -[2023-09-26 22:44:59,460][52540] Updated weights for policy 0, policy_version 37280 (0.0015) -[2023-09-26 22:44:59,461][52541] Updated weights for policy 1, policy_version 37280 (0.0018) -[2023-09-26 22:45:02,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 19103744. Throughput: 0: 774.3, 1: 775.3. Samples: 4773143. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:45:02,915][51558] Avg episode reward: [(0, '0.810'), (1, '-0.400')] -[2023-09-26 22:45:07,914][51558] Fps is (10 sec: 6553.4, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 19136512. Throughput: 0: 776.1, 1: 776.1. Samples: 4782427. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:45:07,915][51558] Avg episode reward: [(0, '0.810'), (1, '-1.280')] -[2023-09-26 22:45:07,928][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000037376_9568256.pth... -[2023-09-26 22:45:07,928][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000037376_9568256.pth... -[2023-09-26 22:45:07,963][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000034496_8830976.pth -[2023-09-26 22:45:07,964][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000034496_8830976.pth -[2023-09-26 22:45:12,544][52541] Updated weights for policy 1, policy_version 37440 (0.0017) -[2023-09-26 22:45:12,544][52540] Updated weights for policy 0, policy_version 37440 (0.0018) -[2023-09-26 22:45:12,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6164.8). Total num frames: 19169280. Throughput: 0: 775.7, 1: 775.9. Samples: 4791990. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:45:12,915][51558] Avg episode reward: [(0, '0.810'), (1, '-1.280')] -[2023-09-26 22:45:17,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6280.5, 300 sec: 6164.8). Total num frames: 19202048. Throughput: 0: 773.9, 1: 774.4. Samples: 4796455. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:45:17,915][51558] Avg episode reward: [(0, '0.810'), (1, '-1.280')] -[2023-09-26 22:45:22,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 19226624. Throughput: 0: 777.3, 1: 776.8. Samples: 4805813. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:45:22,915][51558] Avg episode reward: [(0, '0.810'), (1, '-1.770')] -[2023-09-26 22:45:25,658][52540] Updated weights for policy 0, policy_version 37600 (0.0018) -[2023-09-26 22:45:25,658][52541] Updated weights for policy 1, policy_version 37600 (0.0016) -[2023-09-26 22:45:27,914][51558] Fps is (10 sec: 5734.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 19259392. Throughput: 0: 776.7, 1: 777.5. Samples: 4815155. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:45:27,915][51558] Avg episode reward: [(0, '0.810'), (1, '-1.770')] -[2023-09-26 22:45:32,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6280.5, 300 sec: 6164.8). Total num frames: 19292160. Throughput: 0: 776.0, 1: 775.6. Samples: 4819605. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:45:32,915][51558] Avg episode reward: [(0, '0.800'), (1, '-2.130')] -[2023-09-26 22:45:37,914][51558] Fps is (10 sec: 6143.9, 60 sec: 6212.2, 300 sec: 6150.9). Total num frames: 19320832. Throughput: 0: 773.1, 1: 774.6. Samples: 4828917. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:45:37,915][51558] Avg episode reward: [(0, '0.810'), (1, '-2.320')] -[2023-09-26 22:45:39,237][52541] Updated weights for policy 1, policy_version 37760 (0.0017) -[2023-09-26 22:45:39,237][52540] Updated weights for policy 0, policy_version 37760 (0.0018) -[2023-09-26 22:45:42,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 19349504. Throughput: 0: 774.4, 1: 775.0. Samples: 4837945. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) -[2023-09-26 22:45:42,915][51558] Avg episode reward: [(0, '0.810'), (1, '-2.320')] -[2023-09-26 22:45:47,914][51558] Fps is (10 sec: 6144.1, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 19382272. Throughput: 0: 769.9, 1: 770.2. Samples: 4842448. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:45:47,915][51558] Avg episode reward: [(0, '0.810'), (1, '-2.350')] -[2023-09-26 22:45:52,667][52541] Updated weights for policy 1, policy_version 37920 (0.0017) -[2023-09-26 22:45:52,667][52540] Updated weights for policy 0, policy_version 37920 (0.0018) -[2023-09-26 22:45:52,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 19415040. Throughput: 0: 770.0, 1: 769.6. Samples: 4851712. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:45:52,915][51558] Avg episode reward: [(0, '0.810'), (1, '-2.350')] -[2023-09-26 22:45:57,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 19439616. Throughput: 0: 763.4, 1: 762.8. Samples: 4860669. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:45:57,915][51558] Avg episode reward: [(0, '0.810'), (1, '-2.350')] -[2023-09-26 22:46:02,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 19472384. Throughput: 0: 765.4, 1: 764.1. Samples: 4865282. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:46:02,915][51558] Avg episode reward: [(0, '0.810'), (1, '-3.460')] -[2023-09-26 22:46:06,163][52541] Updated weights for policy 1, policy_version 38080 (0.0016) -[2023-09-26 22:46:06,164][52540] Updated weights for policy 0, policy_version 38080 (0.0018) -[2023-09-26 22:46:07,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 19505152. Throughput: 0: 760.4, 1: 760.4. Samples: 4874249. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) -[2023-09-26 22:46:07,915][51558] Avg episode reward: [(0, '0.810'), (1, '-3.460')] -[2023-09-26 22:46:12,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 19537920. Throughput: 0: 763.2, 1: 761.9. Samples: 4883781. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:46:12,915][51558] Avg episode reward: [(0, '0.810'), (1, '-3.460')] -[2023-09-26 22:46:17,914][51558] Fps is (10 sec: 5734.6, 60 sec: 6007.5, 300 sec: 6137.1). Total num frames: 19562496. Throughput: 0: 765.1, 1: 762.2. Samples: 4888336. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:46:17,915][51558] Avg episode reward: [(0, '0.810'), (1, '-3.280')] -[2023-09-26 22:46:19,561][52541] Updated weights for policy 1, policy_version 38240 (0.0019) -[2023-09-26 22:46:19,561][52540] Updated weights for policy 0, policy_version 38240 (0.0018) -[2023-09-26 22:46:22,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 19595264. Throughput: 0: 761.8, 1: 760.1. Samples: 4897403. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:46:22,915][51558] Avg episode reward: [(0, '0.810'), (1, '-3.280')] -[2023-09-26 22:46:27,914][51558] Fps is (10 sec: 6553.5, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 19628032. Throughput: 0: 767.8, 1: 766.9. Samples: 4907008. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:46:27,915][51558] Avg episode reward: [(0, '0.810'), (1, '-3.280')] -[2023-09-26 22:46:32,754][52540] Updated weights for policy 0, policy_version 38400 (0.0017) -[2023-09-26 22:46:32,754][52541] Updated weights for policy 1, policy_version 38400 (0.0019) -[2023-09-26 22:46:32,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 19660800. Throughput: 0: 763.6, 1: 762.8. Samples: 4911137. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:46:32,915][51558] Avg episode reward: [(0, '0.790'), (1, '-3.380')] -[2023-09-26 22:46:37,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6075.7, 300 sec: 6137.1). Total num frames: 19685376. Throughput: 0: 765.2, 1: 765.1. Samples: 4920576. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:46:37,915][51558] Avg episode reward: [(0, '0.790'), (1, '-3.380')] -[2023-09-26 22:46:42,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 19718144. Throughput: 0: 765.2, 1: 765.3. Samples: 4929542. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:46:42,915][51558] Avg episode reward: [(0, '0.790'), (1, '-3.380')] -[2023-09-26 22:46:46,187][52540] Updated weights for policy 0, policy_version 38560 (0.0017) -[2023-09-26 22:46:46,187][52541] Updated weights for policy 1, policy_version 38560 (0.0016) -[2023-09-26 22:46:47,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 19750912. Throughput: 0: 763.9, 1: 766.0. Samples: 4934127. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:46:47,915][51558] Avg episode reward: [(0, '0.780'), (1, '-3.270')] -[2023-09-26 22:46:52,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 19783680. Throughput: 0: 771.0, 1: 771.6. Samples: 4943667. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:46:52,914][51558] Avg episode reward: [(0, '0.780'), (1, '-3.270')] -[2023-09-26 22:46:57,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 19808256. Throughput: 0: 765.5, 1: 766.8. Samples: 4952736. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:46:57,915][51558] Avg episode reward: [(0, '0.770'), (1, '-3.070')] -[2023-09-26 22:46:59,517][52541] Updated weights for policy 1, policy_version 38720 (0.0015) -[2023-09-26 22:46:59,518][52540] Updated weights for policy 0, policy_version 38720 (0.0018) -[2023-09-26 22:47:02,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 19841024. Throughput: 0: 765.6, 1: 767.7. Samples: 4957332. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:47:02,915][51558] Avg episode reward: [(0, '0.770'), (1, '-3.070')] -[2023-09-26 22:47:07,914][51558] Fps is (10 sec: 6553.7, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 19873792. Throughput: 0: 766.9, 1: 766.3. Samples: 4966400. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:47:07,915][51558] Avg episode reward: [(0, '0.770'), (1, '-3.070')] -[2023-09-26 22:47:07,924][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000038816_9936896.pth... -[2023-09-26 22:47:07,925][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000038816_9936896.pth... -[2023-09-26 22:47:07,960][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000035936_9199616.pth -[2023-09-26 22:47:07,963][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000035936_9199616.pth -[2023-09-26 22:47:12,882][52541] Updated weights for policy 1, policy_version 38880 (0.0018) -[2023-09-26 22:47:12,882][52540] Updated weights for policy 0, policy_version 38880 (0.0019) -[2023-09-26 22:47:12,914][51558] Fps is (10 sec: 6553.8, 60 sec: 6144.0, 300 sec: 6164.8). Total num frames: 19906560. Throughput: 0: 763.4, 1: 763.6. Samples: 4975719. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:47:12,915][51558] Avg episode reward: [(0, '0.760'), (1, '-3.070')] -[2023-09-26 22:47:17,914][51558] Fps is (10 sec: 5734.4, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 19931136. Throughput: 0: 771.1, 1: 770.2. Samples: 4980495. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:47:17,915][51558] Avg episode reward: [(0, '0.760'), (1, '-3.070')] -[2023-09-26 22:47:22,914][51558] Fps is (10 sec: 5734.3, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 19963904. Throughput: 0: 766.0, 1: 766.5. Samples: 4989541. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:47:22,915][51558] Avg episode reward: [(0, '0.760'), (1, '-3.070')] -[2023-09-26 22:47:26,196][52540] Updated weights for policy 0, policy_version 39040 (0.0017) -[2023-09-26 22:47:26,196][52541] Updated weights for policy 1, policy_version 39040 (0.0017) -[2023-09-26 22:47:27,914][51558] Fps is (10 sec: 6553.6, 60 sec: 6144.0, 300 sec: 6137.1). Total num frames: 19996672. Throughput: 0: 771.7, 1: 769.2. Samples: 4998884. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) -[2023-09-26 22:47:27,915][51558] Avg episode reward: [(0, '0.750'), (1, '-3.070')] -[2023-09-26 22:47:30,169][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000039088_10006528.pth... -[2023-09-26 22:47:30,170][52584] Stopping RolloutWorker_w4... -[2023-09-26 22:47:30,170][52584] Loop rollout_proc4_evt_loop terminating... -[2023-09-26 22:47:30,170][52586] Stopping RolloutWorker_w5... -[2023-09-26 22:47:30,170][52583] Stopping RolloutWorker_w3... -[2023-09-26 22:47:30,170][52580] Stopping RolloutWorker_w1... -[2023-09-26 22:47:30,170][52582] Stopping RolloutWorker_w2... -[2023-09-26 22:47:30,170][52585] Stopping RolloutWorker_w6... -[2023-09-26 22:47:30,170][51558] Component RolloutWorker_w4 stopped! -[2023-09-26 22:47:30,170][52576] Stopping RolloutWorker_w0... -[2023-09-26 22:47:30,170][52586] Loop rollout_proc5_evt_loop terminating... -[2023-09-26 22:47:30,170][52587] Stopping RolloutWorker_w7... -[2023-09-26 22:47:30,170][52583] Loop rollout_proc3_evt_loop terminating... -[2023-09-26 22:47:30,171][52582] Loop rollout_proc2_evt_loop terminating... -[2023-09-26 22:47:30,171][52585] Loop rollout_proc6_evt_loop terminating... -[2023-09-26 22:47:30,171][52580] Loop rollout_proc1_evt_loop terminating... -[2023-09-26 22:47:30,171][52398] Stopping Batcher_1... -[2023-09-26 22:47:30,171][51558] Component RolloutWorker_w5 stopped! -[2023-09-26 22:47:30,171][52576] Loop rollout_proc0_evt_loop terminating... -[2023-09-26 22:47:30,171][52587] Loop rollout_proc7_evt_loop terminating... -[2023-09-26 22:47:30,171][51558] Component RolloutWorker_w3 stopped! -[2023-09-26 22:47:30,172][52398] Loop batcher_evt_loop terminating... -[2023-09-26 22:47:30,172][51558] Component RolloutWorker_w1 stopped! -[2023-09-26 22:47:30,172][51558] Component RolloutWorker_w2 stopped! -[2023-09-26 22:47:30,173][51558] Component RolloutWorker_w6 stopped! -[2023-09-26 22:47:30,174][51558] Component RolloutWorker_w0 stopped! -[2023-09-26 22:47:30,174][51558] Component RolloutWorker_w7 stopped! -[2023-09-26 22:47:30,174][51558] Component Batcher_1 stopped! -[2023-09-26 22:47:30,180][51558] Component Batcher_0 stopped! -[2023-09-26 22:47:30,190][52310] Stopping Batcher_0... -[2023-09-26 22:47:30,199][52310] Loop batcher_evt_loop terminating... -[2023-09-26 22:47:30,199][52310] Removing ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000037376_9568256.pth -[2023-09-26 22:47:30,204][52310] Saving ./train_atari/atari_privateye/checkpoint_p0/checkpoint_000039088_10006528.pth... -[2023-09-26 22:47:30,219][52540] Weights refcount: 2 0 -[2023-09-26 22:47:30,220][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000039088_10006528.pth... -[2023-09-26 22:47:30,220][52540] Stopping InferenceWorker_p0-w0... -[2023-09-26 22:47:30,221][52540] Loop inference_proc0-0_evt_loop terminating... -[2023-09-26 22:47:30,220][51558] Component InferenceWorker_p0-w0 stopped! -[2023-09-26 22:47:30,239][52541] Weights refcount: 2 0 -[2023-09-26 22:47:30,239][52310] Stopping LearnerWorker_p0... -[2023-09-26 22:47:30,240][52310] Loop learner_proc0_evt_loop terminating... -[2023-09-26 22:47:30,240][51558] Component LearnerWorker_p0 stopped! -[2023-09-26 22:47:30,240][52541] Stopping InferenceWorker_p1-w0... -[2023-09-26 22:47:30,241][52541] Loop inference_proc1-0_evt_loop terminating... -[2023-09-26 22:47:30,242][51558] Component InferenceWorker_p1-w0 stopped! -[2023-09-26 22:47:30,250][52398] Removing ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000037376_9568256.pth -[2023-09-26 22:47:30,254][52398] Saving ./train_atari/atari_privateye/checkpoint_p1/checkpoint_000039088_10006528.pth... -[2023-09-26 22:47:30,290][52398] Stopping LearnerWorker_p1... -[2023-09-26 22:47:30,290][52398] Loop learner_proc1_evt_loop terminating... -[2023-09-26 22:47:30,290][51558] Component LearnerWorker_p1 stopped! -[2023-09-26 22:47:30,291][51558] Waiting for process learner_proc0 to stop... -[2023-09-26 22:47:30,921][51558] Waiting for process learner_proc1 to stop... -[2023-09-26 22:47:30,967][51558] Waiting for process inference_proc0-0 to join... -[2023-09-26 22:47:30,968][51558] Waiting for process inference_proc1-0 to join... -[2023-09-26 22:47:30,968][51558] Waiting for process rollout_proc0 to join... -[2023-09-26 22:47:30,968][51558] Waiting for process rollout_proc1 to join... -[2023-09-26 22:47:30,969][51558] Waiting for process rollout_proc2 to join... -[2023-09-26 22:47:30,969][51558] Waiting for process rollout_proc3 to join... -[2023-09-26 22:47:30,969][51558] Waiting for process rollout_proc4 to join... -[2023-09-26 22:47:30,970][51558] Waiting for process rollout_proc5 to join... -[2023-09-26 22:47:30,970][51558] Waiting for process rollout_proc6 to join... -[2023-09-26 22:47:30,970][51558] Waiting for process rollout_proc7 to join... -[2023-09-26 22:47:30,971][51558] Batcher 0 profile tree view: -batching: 21.1759, releasing_batches: 1.7345 -[2023-09-26 22:47:30,971][51558] Batcher 1 profile tree view: -batching: 21.0878, releasing_batches: 1.6841 -[2023-09-26 22:47:30,971][51558] InferenceWorker_p0-w0 profile tree view: -wait_policy: 0.0052 - wait_policy_total: 701.3237 -update_model: 37.4277 - weight_update: 0.0017 -one_step: 0.0012 - handle_policy_step: 2308.8765 - deserialize: 68.4658, stack: 16.3172, obs_to_device_normalize: 559.4521, forward: 1117.6095, send_messages: 93.5046 - prepare_outputs: 303.9908 - to_cpu: 152.0091 -[2023-09-26 22:47:30,971][51558] InferenceWorker_p1-w0 profile tree view: -wait_policy: 0.0052 - wait_policy_total: 670.9589 -update_model: 38.0436 - weight_update: 0.0016 -one_step: 0.0012 - handle_policy_step: 2333.4413 - deserialize: 69.2086, stack: 16.8633, obs_to_device_normalize: 565.8813, forward: 1125.8032, send_messages: 96.6896 - prepare_outputs: 310.0986 - to_cpu: 156.5846 -[2023-09-26 22:47:30,972][51558] Learner 0 profile tree view: -misc: 0.0158, prepare_batch: 32.1217 -train: 460.7382 - epoch_init: 0.1063, minibatch_init: 3.1879, losses_postprocess: 62.8606, kl_divergence: 5.4485, after_optimizer: 22.3786 - calculate_losses: 45.6984 - losses_init: 0.1050, forward_head: 14.5392, bptt_initial: 0.4379, bptt: 0.5027, tail: 10.5001, advantages_returns: 3.1169, losses: 12.8614 - update: 316.8990 - clip: 164.2831 -[2023-09-26 22:47:30,972][51558] Learner 1 profile tree view: -misc: 0.0165, prepare_batch: 32.3717 -train: 460.8345 - epoch_init: 0.1034, minibatch_init: 3.1954, losses_postprocess: 62.6023, kl_divergence: 5.4305, after_optimizer: 22.2315 - calculate_losses: 45.1385 - losses_init: 0.1000, forward_head: 14.4210, bptt_initial: 0.4410, bptt: 0.4564, tail: 10.3457, advantages_returns: 3.0708, losses: 12.6582 - update: 318.0256 - clip: 165.6280 -[2023-09-26 22:47:30,972][51558] RolloutWorker_w0 profile tree view: -wait_for_trajectories: 0.4057, enqueue_policy_requests: 42.2659, env_step: 1229.9625, overhead: 29.6236, complete_rollouts: 1.0913 -save_policy_outputs: 53.8278 - split_output_tensors: 18.6833 -[2023-09-26 22:47:30,972][51558] RolloutWorker_w7 profile tree view: -wait_for_trajectories: 0.4058, enqueue_policy_requests: 43.5271, env_step: 1293.0083, overhead: 29.4735, complete_rollouts: 1.0600 -save_policy_outputs: 54.4438 - split_output_tensors: 18.7554 -[2023-09-26 22:47:30,973][51558] Loop Runner_EvtLoop terminating... -[2023-09-26 22:47:30,973][51558] Runner profile tree view: -main_loop: 3258.8388 -[2023-09-26 22:47:30,973][51558] Collected {0: 10006528, 1: 10006528}, FPS: 6141.2 +[2023-10-14 04:59:47,956][100681] Using optimizer +[2023-10-14 04:59:47,957][100681] No checkpoints found +[2023-10-14 04:59:47,957][100681] Did not load from checkpoint, starting from scratch! +[2023-10-14 04:59:47,957][100681] Initialized policy 1 weights for model version 0 +[2023-10-14 04:59:47,958][100681] LearnerWorker_p1 finished initialization! +[2023-10-14 04:59:47,959][100681] Using GPUs [0] for process 1 (actually maps to GPUs [1]) +[2023-10-14 04:59:48,870][99942] Starting process rollout_proc14 +[2023-10-14 04:59:48,874][100961] Worker 10 uses CPU cores [20, 21] +[2023-10-14 04:59:48,890][99942] Starting process rollout_proc15 +[2023-10-14 04:59:48,895][100956] Worker 6 uses CPU cores [12, 13] +[2023-10-14 04:59:48,919][100954] Worker 4 uses CPU cores [8, 9] +[2023-10-14 04:59:48,929][100951] Worker 1 uses CPU cores [2, 3] +[2023-10-14 04:59:48,936][100917] Using GPUs [1] for process 1 (actually maps to GPUs [1]) +[2023-10-14 04:59:48,936][100917] Set environment var CUDA_VISIBLE_DEVICES to '1' (GPU indices [1]) for inference process 1 +[2023-10-14 04:59:48,937][100953] Worker 2 uses CPU cores [4, 5] +[2023-10-14 04:59:48,957][100917] Num visible devices: 1 +[2023-10-14 04:59:49,093][100959] Worker 8 uses CPU cores [16, 17] +[2023-10-14 04:59:49,245][100955] Worker 5 uses CPU cores [10, 11] +[2023-10-14 04:59:49,363][100962] Worker 12 uses CPU cores [24, 25] +[2023-10-14 04:59:49,377][100957] Worker 3 uses CPU cores [6, 7] +[2023-10-14 04:59:49,381][100960] Worker 9 uses CPU cores [18, 19] +[2023-10-14 04:59:49,404][100958] Worker 7 uses CPU cores [14, 15] +[2023-10-14 04:59:49,569][100964] Worker 13 uses CPU cores [26, 27] +[2023-10-14 04:59:49,577][100936] Using GPUs [0] for process 0 (actually maps to GPUs [0]) +[2023-10-14 04:59:49,577][100936] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 +[2023-10-14 04:59:49,596][100936] Num visible devices: 1 +[2023-10-14 04:59:49,629][100950] Worker 0 uses CPU cores [0, 1] +[2023-10-14 04:59:49,665][100963] Worker 11 uses CPU cores [22, 23] +[2023-10-14 04:59:49,744][100917] RunningMeanStd input shape: (4, 84, 84) +[2023-10-14 04:59:49,745][100917] RunningMeanStd input shape: (1,) +[2023-10-14 04:59:49,764][100917] ConvEncoder: input_channels=4 +[2023-10-14 04:59:49,880][100917] Conv encoder output size: 512 +[2023-10-14 04:59:50,193][100936] RunningMeanStd input shape: (4, 84, 84) +[2023-10-14 04:59:50,194][100936] RunningMeanStd input shape: (1,) +[2023-10-14 04:59:50,205][100936] ConvEncoder: input_channels=4 +[2023-10-14 04:59:50,311][100936] Conv encoder output size: 512 +[2023-10-14 04:59:50,806][101580] Worker 15 uses CPU cores [30, 31] +[2023-10-14 04:59:50,808][99942] Inference worker 1-0 is ready! +[2023-10-14 04:59:50,810][101548] Worker 14 uses CPU cores [28, 29] +[2023-10-14 04:59:50,809][99942] Inference worker 0-0 is ready! +[2023-10-14 04:59:50,810][99942] All inference workers are ready! Signal rollout workers to start! +[2023-10-14 04:59:50,812][100958] EnvRunner 7-0 uses policy 1 +[2023-10-14 04:59:50,812][100956] EnvRunner 6-0 uses policy 0 +[2023-10-14 04:59:50,812][100963] EnvRunner 11-0 uses policy 1 +[2023-10-14 04:59:50,812][100959] EnvRunner 8-0 uses policy 0 +[2023-10-14 04:59:50,812][100964] EnvRunner 13-0 uses policy 1 +[2023-10-14 04:59:50,812][100953] EnvRunner 2-0 uses policy 0 +[2023-10-14 04:59:50,812][100962] EnvRunner 12-0 uses policy 0 +[2023-10-14 04:59:50,812][100955] EnvRunner 5-0 uses policy 1 +[2023-10-14 04:59:50,812][100961] EnvRunner 10-0 uses policy 0 +[2023-10-14 04:59:50,812][100960] EnvRunner 9-0 uses policy 1 +[2023-10-14 04:59:50,812][99942] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan, 1: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) +[2023-10-14 04:59:50,812][100954] EnvRunner 4-0 uses policy 0 +[2023-10-14 04:59:50,812][100950] EnvRunner 0-0 uses policy 0 +[2023-10-14 04:59:50,812][100957] EnvRunner 3-0 uses policy 1 +[2023-10-14 04:59:50,812][100951] EnvRunner 1-0 uses policy 1 +[2023-10-14 04:59:50,938][101548] EnvRunner 14-0 uses policy 0 +[2023-10-14 04:59:51,030][101580] EnvRunner 15-0 uses policy 1 +[2023-10-14 04:59:53,055][99942] Heartbeat connected on Batcher_0 +[2023-10-14 04:59:53,058][99942] Heartbeat connected on LearnerWorker_p0 +[2023-10-14 04:59:53,061][99942] Heartbeat connected on Batcher_1 +[2023-10-14 04:59:53,064][99942] Heartbeat connected on LearnerWorker_p1 +[2023-10-14 04:59:53,072][99942] Heartbeat connected on InferenceWorker_p0-w0 +[2023-10-14 04:59:53,077][99942] Heartbeat connected on RolloutWorker_w0 +[2023-10-14 04:59:53,078][99942] Heartbeat connected on RolloutWorker_w1 +[2023-10-14 04:59:53,084][99942] Heartbeat connected on RolloutWorker_w3 +[2023-10-14 04:59:53,085][99942] Heartbeat connected on RolloutWorker_w2 +[2023-10-14 04:59:53,087][99942] Heartbeat connected on InferenceWorker_p1-w0 +[2023-10-14 04:59:53,090][99942] Heartbeat connected on RolloutWorker_w4 +[2023-10-14 04:59:53,093][99942] Heartbeat connected on RolloutWorker_w5 +[2023-10-14 04:59:53,097][99942] Heartbeat connected on RolloutWorker_w6 +[2023-10-14 04:59:53,098][99942] Heartbeat connected on RolloutWorker_w8 +[2023-10-14 04:59:53,100][99942] Heartbeat connected on RolloutWorker_w7 +[2023-10-14 04:59:53,108][99942] Heartbeat connected on RolloutWorker_w9 +[2023-10-14 04:59:53,108][99942] Heartbeat connected on RolloutWorker_w11 +[2023-10-14 04:59:53,109][99942] Heartbeat connected on RolloutWorker_w10 +[2023-10-14 04:59:53,112][99942] Heartbeat connected on RolloutWorker_w12 +[2023-10-14 04:59:53,114][99942] Heartbeat connected on RolloutWorker_w13 +[2023-10-14 04:59:53,119][99942] Heartbeat connected on RolloutWorker_w14 +[2023-10-14 04:59:53,120][99942] Heartbeat connected on RolloutWorker_w15 +[2023-10-14 04:59:53,512][99942] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 660.7, 1: 374.8. Samples: 2796. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) +[2023-10-14 04:59:58,512][99942] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 999.7, 1: 854.0. Samples: 14274. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) +[2023-10-14 05:00:01,191][100936] Updated weights for policy 0, policy_version 10 (0.0009) +[2023-10-14 05:00:01,413][100917] Updated weights for policy 1, policy_version 10 (0.0009) +[2023-10-14 05:00:01,559][100936] Updated weights for policy 0, policy_version 20 (0.0008) +[2023-10-14 05:00:01,792][100917] Updated weights for policy 1, policy_version 20 (0.0007) +[2023-10-14 05:00:01,928][100936] Updated weights for policy 0, policy_version 30 (0.0010) +[2023-10-14 05:00:02,162][100917] Updated weights for policy 1, policy_version 30 (0.0008) +[2023-10-14 05:00:03,512][99942] Fps is (10 sec: 6553.6, 60 sec: 5160.3, 300 sec: 5160.3). Total num frames: 65536. Throughput: 0: 1236.2, 1: 1177.6. Samples: 30656. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 05:00:04,271][100936] Updated weights for policy 0, policy_version 40 (0.0008) +[2023-10-14 05:00:04,537][100917] Updated weights for policy 1, policy_version 40 (0.0007) +[2023-10-14 05:00:04,649][100936] Updated weights for policy 0, policy_version 50 (0.0009) +[2023-10-14 05:00:04,906][100917] Updated weights for policy 1, policy_version 50 (0.0007) +[2023-10-14 05:00:05,021][100936] Updated weights for policy 0, policy_version 60 (0.0009) +[2023-10-14 05:00:05,280][100917] Updated weights for policy 1, policy_version 60 (0.0007) +[2023-10-14 05:00:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 7405.1, 300 sec: 7405.1). Total num frames: 131072. Throughput: 0: 1446.5, 1: 1397.5. Samples: 50340. Policy #0 lag: (min: 33.0, avg: 33.0, max: 33.0) +[2023-10-14 05:00:08,811][100936] Updated weights for policy 0, policy_version 70 (0.0008) +[2023-10-14 05:00:09,112][100917] Updated weights for policy 1, policy_version 70 (0.0009) +[2023-10-14 05:00:09,180][100936] Updated weights for policy 0, policy_version 80 (0.0009) +[2023-10-14 05:00:09,484][100917] Updated weights for policy 1, policy_version 80 (0.0008) +[2023-10-14 05:00:09,541][100936] Updated weights for policy 0, policy_version 90 (0.0008) +[2023-10-14 05:00:09,855][100917] Updated weights for policy 1, policy_version 90 (0.0009) +[2023-10-14 05:00:13,183][100936] Updated weights for policy 0, policy_version 100 (0.0009) +[2023-10-14 05:00:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 8661.1, 300 sec: 8661.1). Total num frames: 196608. Throughput: 0: 1329.1, 1: 1286.0. Samples: 59362. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 05:00:13,552][100917] Updated weights for policy 1, policy_version 100 (0.0010) +[2023-10-14 05:00:13,553][100936] Updated weights for policy 0, policy_version 110 (0.0009) +[2023-10-14 05:00:13,916][100917] Updated weights for policy 1, policy_version 110 (0.0009) +[2023-10-14 05:00:13,924][100936] Updated weights for policy 0, policy_version 120 (0.0008) +[2023-10-14 05:00:14,292][100917] Updated weights for policy 1, policy_version 120 (0.0008) +[2023-10-14 05:00:18,252][100936] Updated weights for policy 0, policy_version 130 (0.0008) +[2023-10-14 05:00:18,347][100917] Updated weights for policy 1, policy_version 130 (0.0007) +[2023-10-14 05:00:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 9463.7, 300 sec: 9463.7). Total num frames: 262144. Throughput: 0: 1463.2, 1: 1419.1. Samples: 79840. Policy #0 lag: (min: 22.0, avg: 29.5, max: 54.0) +[2023-10-14 05:00:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:00:18,622][100936] Updated weights for policy 0, policy_version 140 (0.0008) +[2023-10-14 05:00:18,715][100917] Updated weights for policy 1, policy_version 140 (0.0010) +[2023-10-14 05:00:18,991][100936] Updated weights for policy 0, policy_version 150 (0.0008) +[2023-10-14 05:00:19,085][100917] Updated weights for policy 1, policy_version 150 (0.0007) +[2023-10-14 05:00:19,361][100560] Saving new best policy, reward=1.000! +[2023-10-14 05:00:19,362][100936] Updated weights for policy 0, policy_version 160 (0.0008) +[2023-10-14 05:00:19,451][100681] Saving new best policy, reward=1.000! +[2023-10-14 05:00:19,452][100917] Updated weights for policy 1, policy_version 160 (0.0008) +[2023-10-14 05:00:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 10020.8, 300 sec: 10020.8). Total num frames: 327680. Throughput: 0: 1543.7, 1: 1514.0. Samples: 99988. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:00:23,513][99942] Avg episode reward: [(0, '-0.188'), (1, '-4.000')] +[2023-10-14 05:00:23,702][100936] Updated weights for policy 0, policy_version 170 (0.0009) +[2023-10-14 05:00:23,754][100917] Updated weights for policy 1, policy_version 170 (0.0009) +[2023-10-14 05:00:24,068][100936] Updated weights for policy 0, policy_version 180 (0.0007) +[2023-10-14 05:00:24,129][100917] Updated weights for policy 1, policy_version 180 (0.0009) +[2023-10-14 05:00:24,435][100936] Updated weights for policy 0, policy_version 190 (0.0009) +[2023-10-14 05:00:24,494][100917] Updated weights for policy 1, policy_version 190 (0.0009) +[2023-10-14 05:00:28,405][100936] Updated weights for policy 0, policy_version 200 (0.0007) +[2023-10-14 05:00:28,420][100917] Updated weights for policy 1, policy_version 200 (0.0009) +[2023-10-14 05:00:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 10430.1, 300 sec: 10430.1). Total num frames: 393216. Throughput: 0: 1459.6, 1: 1432.3. Samples: 109024. Policy #0 lag: (min: 31.0, avg: 38.6, max: 63.0) +[2023-10-14 05:00:28,513][99942] Avg episode reward: [(0, '-0.188'), (1, '-4.000')] +[2023-10-14 05:00:28,781][100936] Updated weights for policy 0, policy_version 210 (0.0009) +[2023-10-14 05:00:28,793][100917] Updated weights for policy 1, policy_version 210 (0.0009) +[2023-10-14 05:00:29,152][100936] Updated weights for policy 0, policy_version 220 (0.0007) +[2023-10-14 05:00:29,155][100917] Updated weights for policy 1, policy_version 220 (0.0009) +[2023-10-14 05:00:33,409][100917] Updated weights for policy 1, policy_version 230 (0.0008) +[2023-10-14 05:00:33,424][100936] Updated weights for policy 0, policy_version 230 (0.0007) +[2023-10-14 05:00:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 10743.5, 300 sec: 10743.5). Total num frames: 458752. Throughput: 0: 1524.3, 1: 1498.9. Samples: 129090. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:00:33,513][99942] Avg episode reward: [(0, '-0.188'), (1, '-4.000')] +[2023-10-14 05:00:33,784][100917] Updated weights for policy 1, policy_version 240 (0.0009) +[2023-10-14 05:00:33,785][100936] Updated weights for policy 0, policy_version 240 (0.0008) +[2023-10-14 05:00:34,158][100936] Updated weights for policy 0, policy_version 250 (0.0008) +[2023-10-14 05:00:34,159][100917] Updated weights for policy 1, policy_version 250 (0.0008) +[2023-10-14 05:00:38,291][100936] Updated weights for policy 0, policy_version 260 (0.0007) +[2023-10-14 05:00:38,301][100917] Updated weights for policy 1, policy_version 260 (0.0009) +[2023-10-14 05:00:38,512][99942] Fps is (10 sec: 13107.4, 60 sec: 10991.4, 300 sec: 10991.4). Total num frames: 524288. Throughput: 0: 1627.7, 1: 1628.0. Samples: 149304. Policy #0 lag: (min: 4.0, avg: 6.9, max: 36.0) +[2023-10-14 05:00:38,512][99942] Avg episode reward: [(0, '-0.188'), (1, '-4.000')] +[2023-10-14 05:00:38,658][100936] Updated weights for policy 0, policy_version 270 (0.0007) +[2023-10-14 05:00:38,669][100917] Updated weights for policy 1, policy_version 270 (0.0007) +[2023-10-14 05:00:39,036][100936] Updated weights for policy 0, policy_version 280 (0.0008) +[2023-10-14 05:00:39,038][100917] Updated weights for policy 1, policy_version 280 (0.0008) +[2023-10-14 05:00:43,333][100936] Updated weights for policy 0, policy_version 290 (0.0008) +[2023-10-14 05:00:43,391][100917] Updated weights for policy 1, policy_version 290 (0.0009) +[2023-10-14 05:00:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 11192.1, 300 sec: 11192.1). Total num frames: 589824. Throughput: 0: 1600.2, 1: 1603.4. Samples: 158434. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 05:00:43,512][99942] Avg episode reward: [(0, '-0.188'), (1, '-4.000')] +[2023-10-14 05:00:43,701][100936] Updated weights for policy 0, policy_version 300 (0.0010) +[2023-10-14 05:00:43,774][100917] Updated weights for policy 1, policy_version 300 (0.0008) +[2023-10-14 05:00:44,071][100936] Updated weights for policy 0, policy_version 310 (0.0007) +[2023-10-14 05:00:44,147][100917] Updated weights for policy 1, policy_version 310 (0.0009) +[2023-10-14 05:00:44,440][100936] Updated weights for policy 0, policy_version 320 (0.0008) +[2023-10-14 05:00:44,531][100917] Updated weights for policy 1, policy_version 320 (0.0008) +[2023-10-14 05:00:48,512][99942] Fps is (10 sec: 13106.9, 60 sec: 11358.0, 300 sec: 11358.0). Total num frames: 655360. Throughput: 0: 1642.5, 1: 1635.0. Samples: 178144. Policy #0 lag: (min: 31.0, avg: 37.1, max: 63.0) +[2023-10-14 05:00:48,513][99942] Avg episode reward: [(0, '0.281'), (1, '-1.656')] +[2023-10-14 05:00:48,735][100936] Updated weights for policy 0, policy_version 330 (0.0007) +[2023-10-14 05:00:48,814][100917] Updated weights for policy 1, policy_version 330 (0.0009) +[2023-10-14 05:00:49,097][100936] Updated weights for policy 0, policy_version 340 (0.0009) +[2023-10-14 05:00:49,189][100917] Updated weights for policy 1, policy_version 340 (0.0009) +[2023-10-14 05:00:49,473][100936] Updated weights for policy 0, policy_version 350 (0.0008) +[2023-10-14 05:00:49,555][100917] Updated weights for policy 1, policy_version 350 (0.0008) +[2023-10-14 05:00:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 12014.9, 300 sec: 11497.5). Total num frames: 720896. Throughput: 0: 1644.4, 1: 1642.0. Samples: 198228. Policy #0 lag: (min: 26.0, avg: 29.0, max: 58.0) +[2023-10-14 05:00:53,513][99942] Avg episode reward: [(0, '0.281'), (1, '-1.656')] +[2023-10-14 05:00:53,571][100936] Updated weights for policy 0, policy_version 360 (0.0008) +[2023-10-14 05:00:53,601][100917] Updated weights for policy 1, policy_version 360 (0.0008) +[2023-10-14 05:00:53,931][100936] Updated weights for policy 0, policy_version 370 (0.0009) +[2023-10-14 05:00:53,971][100917] Updated weights for policy 1, policy_version 370 (0.0008) +[2023-10-14 05:00:54,295][100936] Updated weights for policy 0, policy_version 380 (0.0008) +[2023-10-14 05:00:54,334][100917] Updated weights for policy 1, policy_version 380 (0.0007) +[2023-10-14 05:00:58,371][100936] Updated weights for policy 0, policy_version 390 (0.0008) +[2023-10-14 05:00:58,465][100917] Updated weights for policy 1, policy_version 390 (0.0008) +[2023-10-14 05:00:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 11616.4). Total num frames: 786432. Throughput: 0: 1644.3, 1: 1641.9. Samples: 207238. Policy #0 lag: (min: 31.0, avg: 33.1, max: 63.0) +[2023-10-14 05:00:58,513][99942] Avg episode reward: [(0, '0.281'), (1, '-1.656')] +[2023-10-14 05:00:58,739][100936] Updated weights for policy 0, policy_version 400 (0.0008) +[2023-10-14 05:00:58,843][100917] Updated weights for policy 1, policy_version 400 (0.0008) +[2023-10-14 05:00:59,112][100936] Updated weights for policy 0, policy_version 410 (0.0007) +[2023-10-14 05:00:59,221][100917] Updated weights for policy 1, policy_version 410 (0.0008) +[2023-10-14 05:01:03,231][100936] Updated weights for policy 0, policy_version 420 (0.0009) +[2023-10-14 05:01:03,476][100917] Updated weights for policy 1, policy_version 420 (0.0008) +[2023-10-14 05:01:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 11718.9). Total num frames: 851968. Throughput: 0: 1642.5, 1: 1641.5. Samples: 227622. Policy #0 lag: (min: 22.0, avg: 28.6, max: 54.0) +[2023-10-14 05:01:03,513][99942] Avg episode reward: [(0, '0.281'), (1, '-1.656')] +[2023-10-14 05:01:03,597][100936] Updated weights for policy 0, policy_version 430 (0.0009) +[2023-10-14 05:01:03,845][100917] Updated weights for policy 1, policy_version 430 (0.0008) +[2023-10-14 05:01:03,957][100936] Updated weights for policy 0, policy_version 440 (0.0007) +[2023-10-14 05:01:04,211][100917] Updated weights for policy 1, policy_version 440 (0.0008) +[2023-10-14 05:01:08,046][100936] Updated weights for policy 0, policy_version 450 (0.0010) +[2023-10-14 05:01:08,251][100917] Updated weights for policy 1, policy_version 450 (0.0007) +[2023-10-14 05:01:08,423][100936] Updated weights for policy 0, policy_version 460 (0.0008) +[2023-10-14 05:01:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 11808.3). Total num frames: 917504. Throughput: 0: 1634.3, 1: 1637.6. Samples: 247224. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:01:08,512][99942] Avg episode reward: [(0, '0.281'), (1, '-1.656')] +[2023-10-14 05:01:08,619][100917] Updated weights for policy 1, policy_version 460 (0.0008) +[2023-10-14 05:01:08,789][100936] Updated weights for policy 0, policy_version 470 (0.0008) +[2023-10-14 05:01:08,992][100917] Updated weights for policy 1, policy_version 470 (0.0010) +[2023-10-14 05:01:09,158][100936] Updated weights for policy 0, policy_version 480 (0.0008) +[2023-10-14 05:01:09,354][100917] Updated weights for policy 1, policy_version 480 (0.0011) +[2023-10-14 05:01:13,360][100936] Updated weights for policy 0, policy_version 490 (0.0008) +[2023-10-14 05:01:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 11886.8). Total num frames: 983040. Throughput: 0: 1641.6, 1: 1634.8. Samples: 256460. Policy #0 lag: (min: 1.0, avg: 10.5, max: 33.0) +[2023-10-14 05:01:13,512][99942] Avg episode reward: [(0, '0.521'), (1, '-0.848')] +[2023-10-14 05:01:13,626][100917] Updated weights for policy 1, policy_version 490 (0.0009) +[2023-10-14 05:01:13,733][100936] Updated weights for policy 0, policy_version 500 (0.0007) +[2023-10-14 05:01:13,994][100917] Updated weights for policy 1, policy_version 500 (0.0008) +[2023-10-14 05:01:14,097][100936] Updated weights for policy 0, policy_version 510 (0.0008) +[2023-10-14 05:01:14,359][100917] Updated weights for policy 1, policy_version 510 (0.0008) +[2023-10-14 05:01:18,396][100936] Updated weights for policy 0, policy_version 520 (0.0009) +[2023-10-14 05:01:18,500][100917] Updated weights for policy 1, policy_version 520 (0.0008) +[2023-10-14 05:01:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 11956.4). Total num frames: 1048576. Throughput: 0: 1639.1, 1: 1640.3. Samples: 276662. Policy #0 lag: (min: 26.0, avg: 35.2, max: 58.0) +[2023-10-14 05:01:18,512][99942] Avg episode reward: [(0, '0.521'), (1, '-0.771')] +[2023-10-14 05:01:18,758][100936] Updated weights for policy 0, policy_version 530 (0.0007) +[2023-10-14 05:01:18,872][100917] Updated weights for policy 1, policy_version 530 (0.0008) +[2023-10-14 05:01:19,129][100936] Updated weights for policy 0, policy_version 540 (0.0007) +[2023-10-14 05:01:19,253][100917] Updated weights for policy 1, policy_version 540 (0.0009) +[2023-10-14 05:01:23,275][100917] Updated weights for policy 1, policy_version 550 (0.0008) +[2023-10-14 05:01:23,398][100936] Updated weights for policy 0, policy_version 550 (0.0008) +[2023-10-14 05:01:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 12018.4). Total num frames: 1114112. Throughput: 0: 1635.5, 1: 1641.5. Samples: 296766. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:01:23,513][99942] Avg episode reward: [(0, '0.521'), (1, '-0.771')] +[2023-10-14 05:01:23,639][100917] Updated weights for policy 1, policy_version 560 (0.0008) +[2023-10-14 05:01:23,766][100936] Updated weights for policy 0, policy_version 560 (0.0007) +[2023-10-14 05:01:24,010][100917] Updated weights for policy 1, policy_version 570 (0.0009) +[2023-10-14 05:01:24,133][100936] Updated weights for policy 0, policy_version 570 (0.0007) +[2023-10-14 05:01:28,073][100917] Updated weights for policy 1, policy_version 580 (0.0010) +[2023-10-14 05:01:28,459][100917] Updated weights for policy 1, policy_version 590 (0.0010) +[2023-10-14 05:01:28,485][100936] Updated weights for policy 0, policy_version 580 (0.0008) +[2023-10-14 05:01:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 12074.2). Total num frames: 1179648. Throughput: 0: 1637.1, 1: 1641.9. Samples: 305986. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) +[2023-10-14 05:01:28,513][99942] Avg episode reward: [(0, '0.521'), (1, '-0.771')] +[2023-10-14 05:01:28,832][100917] Updated weights for policy 1, policy_version 600 (0.0010) +[2023-10-14 05:01:28,877][100936] Updated weights for policy 0, policy_version 590 (0.0010) +[2023-10-14 05:01:29,266][100936] Updated weights for policy 0, policy_version 600 (0.0008) +[2023-10-14 05:01:33,056][100917] Updated weights for policy 1, policy_version 610 (0.0010) +[2023-10-14 05:01:33,274][100936] Updated weights for policy 0, policy_version 610 (0.0008) +[2023-10-14 05:01:33,436][100917] Updated weights for policy 1, policy_version 620 (0.0008) +[2023-10-14 05:01:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 12124.5). Total num frames: 1245184. Throughput: 0: 1639.5, 1: 1651.1. Samples: 326222. Policy #0 lag: (min: 17.0, avg: 20.1, max: 49.0) +[2023-10-14 05:01:33,513][99942] Avg episode reward: [(0, '0.521'), (1, '-0.771')] +[2023-10-14 05:01:33,650][100936] Updated weights for policy 0, policy_version 620 (0.0008) +[2023-10-14 05:01:33,815][100917] Updated weights for policy 1, policy_version 630 (0.0007) +[2023-10-14 05:01:34,020][100936] Updated weights for policy 0, policy_version 630 (0.0008) +[2023-10-14 05:01:34,183][100917] Updated weights for policy 1, policy_version 640 (0.0008) +[2023-10-14 05:01:34,393][100936] Updated weights for policy 0, policy_version 640 (0.0008) +[2023-10-14 05:01:38,325][100917] Updated weights for policy 1, policy_version 650 (0.0009) +[2023-10-14 05:01:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 12170.1). Total num frames: 1310720. Throughput: 0: 1638.3, 1: 1651.3. Samples: 346262. Policy #0 lag: (min: 28.0, avg: 28.0, max: 30.0) +[2023-10-14 05:01:38,513][99942] Avg episode reward: [(0, '0.617'), (1, '-0.417')] +[2023-10-14 05:01:38,528][100936] Updated weights for policy 0, policy_version 650 (0.0008) +[2023-10-14 05:01:38,706][100917] Updated weights for policy 1, policy_version 660 (0.0009) +[2023-10-14 05:01:38,907][100936] Updated weights for policy 0, policy_version 660 (0.0010) +[2023-10-14 05:01:39,090][100917] Updated weights for policy 1, policy_version 670 (0.0008) +[2023-10-14 05:01:39,159][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000000672_688128.pth... +[2023-10-14 05:01:39,278][100936] Updated weights for policy 0, policy_version 670 (0.0008) +[2023-10-14 05:01:39,348][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000000672_688128.pth... +[2023-10-14 05:01:43,181][100917] Updated weights for policy 1, policy_version 680 (0.0007) +[2023-10-14 05:01:43,318][100936] Updated weights for policy 0, policy_version 680 (0.0008) +[2023-10-14 05:01:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 12211.7). Total num frames: 1376256. Throughput: 0: 1641.7, 1: 1653.2. Samples: 355512. Policy #0 lag: (min: 31.0, avg: 42.0, max: 63.0) +[2023-10-14 05:01:43,512][99942] Avg episode reward: [(0, '0.641'), (1, '-0.328')] +[2023-10-14 05:01:43,549][100917] Updated weights for policy 1, policy_version 690 (0.0007) +[2023-10-14 05:01:43,699][100936] Updated weights for policy 0, policy_version 690 (0.0008) +[2023-10-14 05:01:43,923][100917] Updated weights for policy 1, policy_version 700 (0.0010) +[2023-10-14 05:01:44,074][100936] Updated weights for policy 0, policy_version 700 (0.0008) +[2023-10-14 05:01:48,031][100917] Updated weights for policy 1, policy_version 710 (0.0009) +[2023-10-14 05:01:48,211][100936] Updated weights for policy 0, policy_version 710 (0.0007) +[2023-10-14 05:01:48,400][100917] Updated weights for policy 1, policy_version 720 (0.0008) +[2023-10-14 05:01:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 12249.7). Total num frames: 1441792. Throughput: 0: 1643.3, 1: 1653.9. Samples: 375996. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 05:01:48,513][99942] Avg episode reward: [(0, '0.641'), (1, '-0.328')] +[2023-10-14 05:01:48,575][100936] Updated weights for policy 0, policy_version 720 (0.0009) +[2023-10-14 05:01:48,775][100917] Updated weights for policy 1, policy_version 730 (0.0007) +[2023-10-14 05:01:48,938][100936] Updated weights for policy 0, policy_version 730 (0.0009) +[2023-10-14 05:01:52,902][100917] Updated weights for policy 1, policy_version 740 (0.0010) +[2023-10-14 05:01:53,036][100936] Updated weights for policy 0, policy_version 740 (0.0008) +[2023-10-14 05:01:53,273][100917] Updated weights for policy 1, policy_version 750 (0.0007) +[2023-10-14 05:01:53,402][100936] Updated weights for policy 0, policy_version 750 (0.0008) +[2023-10-14 05:01:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 12284.7). Total num frames: 1507328. Throughput: 0: 1642.2, 1: 1653.1. Samples: 395510. Policy #0 lag: (min: 4.0, avg: 12.6, max: 36.0) +[2023-10-14 05:01:53,512][99942] Avg episode reward: [(0, '0.641'), (1, '-0.328')] +[2023-10-14 05:01:53,654][100917] Updated weights for policy 1, policy_version 760 (0.0007) +[2023-10-14 05:01:53,768][100936] Updated weights for policy 0, policy_version 760 (0.0010) +[2023-10-14 05:01:57,839][100936] Updated weights for policy 0, policy_version 770 (0.0008) +[2023-10-14 05:01:57,888][100917] Updated weights for policy 1, policy_version 770 (0.0008) +[2023-10-14 05:01:58,209][100936] Updated weights for policy 0, policy_version 780 (0.0008) +[2023-10-14 05:01:58,265][100917] Updated weights for policy 1, policy_version 780 (0.0008) +[2023-10-14 05:01:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 12316.9). Total num frames: 1572864. Throughput: 0: 1646.4, 1: 1657.4. Samples: 405134. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 05:01:58,513][99942] Avg episode reward: [(0, '0.641'), (1, '-0.328')] +[2023-10-14 05:01:58,585][100936] Updated weights for policy 0, policy_version 790 (0.0007) +[2023-10-14 05:01:58,637][100917] Updated weights for policy 1, policy_version 790 (0.0007) +[2023-10-14 05:01:58,948][100936] Updated weights for policy 0, policy_version 800 (0.0008) +[2023-10-14 05:01:59,001][100917] Updated weights for policy 1, policy_version 800 (0.0008) +[2023-10-14 05:02:03,199][100936] Updated weights for policy 0, policy_version 810 (0.0007) +[2023-10-14 05:02:03,237][100917] Updated weights for policy 1, policy_version 810 (0.0008) +[2023-10-14 05:02:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 12346.6). Total num frames: 1638400. Throughput: 0: 1653.2, 1: 1656.2. Samples: 425586. Policy #0 lag: (min: 15.0, avg: 19.8, max: 47.0) +[2023-10-14 05:02:03,512][99942] Avg episode reward: [(0, '0.689'), (1, '-0.181')] +[2023-10-14 05:02:03,568][100936] Updated weights for policy 0, policy_version 820 (0.0007) +[2023-10-14 05:02:03,621][100917] Updated weights for policy 1, policy_version 820 (0.0007) +[2023-10-14 05:02:03,943][100936] Updated weights for policy 0, policy_version 830 (0.0008) +[2023-10-14 05:02:03,991][100917] Updated weights for policy 1, policy_version 830 (0.0008) +[2023-10-14 05:02:07,916][100936] Updated weights for policy 0, policy_version 840 (0.0008) +[2023-10-14 05:02:08,281][100936] Updated weights for policy 0, policy_version 850 (0.0009) +[2023-10-14 05:02:08,300][100917] Updated weights for policy 1, policy_version 840 (0.0008) +[2023-10-14 05:02:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 12374.3). Total num frames: 1703936. Throughput: 0: 1644.8, 1: 1648.7. Samples: 444974. Policy #0 lag: (min: 12.0, avg: 20.5, max: 44.0) +[2023-10-14 05:02:08,513][99942] Avg episode reward: [(0, '0.712'), (1, '-0.125')] +[2023-10-14 05:02:08,647][100936] Updated weights for policy 0, policy_version 860 (0.0008) +[2023-10-14 05:02:08,679][100917] Updated weights for policy 1, policy_version 850 (0.0009) +[2023-10-14 05:02:09,051][100917] Updated weights for policy 1, policy_version 860 (0.0009) +[2023-10-14 05:02:12,924][100936] Updated weights for policy 0, policy_version 870 (0.0007) +[2023-10-14 05:02:13,202][100917] Updated weights for policy 1, policy_version 870 (0.0010) +[2023-10-14 05:02:13,302][100936] Updated weights for policy 0, policy_version 880 (0.0007) +[2023-10-14 05:02:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 12399.9). Total num frames: 1769472. Throughput: 0: 1655.2, 1: 1648.9. Samples: 454672. Policy #0 lag: (min: 8.0, avg: 34.5, max: 40.0) +[2023-10-14 05:02:13,512][99942] Avg episode reward: [(0, '0.712'), (1, '-0.125')] +[2023-10-14 05:02:13,566][100917] Updated weights for policy 1, policy_version 880 (0.0007) +[2023-10-14 05:02:13,673][100936] Updated weights for policy 0, policy_version 890 (0.0007) +[2023-10-14 05:02:13,953][100917] Updated weights for policy 1, policy_version 890 (0.0009) +[2023-10-14 05:02:17,895][100936] Updated weights for policy 0, policy_version 900 (0.0007) +[2023-10-14 05:02:18,166][100917] Updated weights for policy 1, policy_version 900 (0.0009) +[2023-10-14 05:02:18,264][100936] Updated weights for policy 0, policy_version 910 (0.0007) +[2023-10-14 05:02:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 12423.9). Total num frames: 1835008. Throughput: 0: 1659.1, 1: 1647.1. Samples: 474998. Policy #0 lag: (min: 16.0, avg: 39.9, max: 48.0) +[2023-10-14 05:02:18,513][99942] Avg episode reward: [(0, '0.712'), (1, '-0.125')] +[2023-10-14 05:02:18,546][100917] Updated weights for policy 1, policy_version 910 (0.0007) +[2023-10-14 05:02:18,638][100936] Updated weights for policy 0, policy_version 920 (0.0007) +[2023-10-14 05:02:18,922][100917] Updated weights for policy 1, policy_version 920 (0.0007) +[2023-10-14 05:02:22,779][100936] Updated weights for policy 0, policy_version 930 (0.0009) +[2023-10-14 05:02:23,110][100917] Updated weights for policy 1, policy_version 930 (0.0009) +[2023-10-14 05:02:23,148][100936] Updated weights for policy 0, policy_version 940 (0.0009) +[2023-10-14 05:02:23,476][100917] Updated weights for policy 1, policy_version 940 (0.0008) +[2023-10-14 05:02:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 12446.3). Total num frames: 1900544. Throughput: 0: 1651.1, 1: 1645.7. Samples: 494616. Policy #0 lag: (min: 31.0, avg: 32.9, max: 61.0) +[2023-10-14 05:02:23,512][99942] Avg episode reward: [(0, '0.712'), (1, '-0.125')] +[2023-10-14 05:02:23,519][100936] Updated weights for policy 0, policy_version 950 (0.0009) +[2023-10-14 05:02:23,848][100917] Updated weights for policy 1, policy_version 950 (0.0008) +[2023-10-14 05:02:23,891][100936] Updated weights for policy 0, policy_version 960 (0.0008) +[2023-10-14 05:02:24,224][100917] Updated weights for policy 1, policy_version 960 (0.0011) +[2023-10-14 05:02:27,900][100936] Updated weights for policy 0, policy_version 970 (0.0008) +[2023-10-14 05:02:28,273][100936] Updated weights for policy 0, policy_version 980 (0.0008) +[2023-10-14 05:02:28,372][100917] Updated weights for policy 1, policy_version 970 (0.0009) +[2023-10-14 05:02:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 12467.2). Total num frames: 1966080. Throughput: 0: 1665.2, 1: 1644.7. Samples: 504454. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:02:28,513][99942] Avg episode reward: [(0, '0.720'), (1, '-0.047')] +[2023-10-14 05:02:28,638][100936] Updated weights for policy 0, policy_version 990 (0.0007) +[2023-10-14 05:02:28,748][100917] Updated weights for policy 1, policy_version 980 (0.0009) +[2023-10-14 05:02:29,122][100917] Updated weights for policy 1, policy_version 990 (0.0007) +[2023-10-14 05:02:32,860][100936] Updated weights for policy 0, policy_version 1000 (0.0008) +[2023-10-14 05:02:33,221][100936] Updated weights for policy 0, policy_version 1010 (0.0007) +[2023-10-14 05:02:33,222][100917] Updated weights for policy 1, policy_version 1000 (0.0008) +[2023-10-14 05:02:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 12486.9). Total num frames: 2031616. Throughput: 0: 1661.0, 1: 1644.8. Samples: 524758. Policy #0 lag: (min: 21.0, avg: 21.7, max: 39.0) +[2023-10-14 05:02:33,513][99942] Avg episode reward: [(0, '0.760'), (1, '-0.117')] +[2023-10-14 05:02:33,597][100936] Updated weights for policy 0, policy_version 1020 (0.0007) +[2023-10-14 05:02:33,600][100917] Updated weights for policy 1, policy_version 1010 (0.0007) +[2023-10-14 05:02:33,977][100917] Updated weights for policy 1, policy_version 1020 (0.0009) +[2023-10-14 05:02:37,538][100936] Updated weights for policy 0, policy_version 1030 (0.0008) +[2023-10-14 05:02:37,896][100936] Updated weights for policy 0, policy_version 1040 (0.0007) +[2023-10-14 05:02:37,932][100917] Updated weights for policy 1, policy_version 1030 (0.0008) +[2023-10-14 05:02:38,270][100936] Updated weights for policy 0, policy_version 1050 (0.0007) +[2023-10-14 05:02:38,303][100917] Updated weights for policy 1, policy_version 1040 (0.0007) +[2023-10-14 05:02:38,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.4, 300 sec: 12700.8). Total num frames: 2129920. Throughput: 0: 1656.5, 1: 1644.1. Samples: 544038. Policy #0 lag: (min: 3.0, avg: 4.8, max: 33.0) +[2023-10-14 05:02:38,512][99942] Avg episode reward: [(0, '0.760'), (1, '-0.094')] +[2023-10-14 05:02:38,669][100917] Updated weights for policy 1, policy_version 1050 (0.0008) +[2023-10-14 05:02:42,406][100936] Updated weights for policy 0, policy_version 1060 (0.0008) +[2023-10-14 05:02:42,736][100917] Updated weights for policy 1, policy_version 1060 (0.0008) +[2023-10-14 05:02:42,774][100936] Updated weights for policy 0, policy_version 1070 (0.0007) +[2023-10-14 05:02:43,113][100917] Updated weights for policy 1, policy_version 1070 (0.0009) +[2023-10-14 05:02:43,145][100936] Updated weights for policy 0, policy_version 1080 (0.0008) +[2023-10-14 05:02:43,478][100917] Updated weights for policy 1, policy_version 1080 (0.0010) +[2023-10-14 05:02:43,512][99942] Fps is (10 sec: 16384.3, 60 sec: 13653.3, 300 sec: 12712.5). Total num frames: 2195456. Throughput: 0: 1667.0, 1: 1650.0. Samples: 554398. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:02:43,513][99942] Avg episode reward: [(0, '0.760'), (1, '-0.094')] +[2023-10-14 05:02:47,171][100936] Updated weights for policy 0, policy_version 1090 (0.0007) +[2023-10-14 05:02:47,543][100936] Updated weights for policy 0, policy_version 1100 (0.0008) +[2023-10-14 05:02:47,756][100917] Updated weights for policy 1, policy_version 1090 (0.0010) +[2023-10-14 05:02:47,914][100936] Updated weights for policy 0, policy_version 1110 (0.0007) +[2023-10-14 05:02:48,129][100917] Updated weights for policy 1, policy_version 1100 (0.0009) +[2023-10-14 05:02:48,287][100936] Updated weights for policy 0, policy_version 1120 (0.0008) +[2023-10-14 05:02:48,498][100917] Updated weights for policy 1, policy_version 1110 (0.0009) +[2023-10-14 05:02:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 12723.6). Total num frames: 2260992. Throughput: 0: 1664.9, 1: 1642.1. Samples: 574400. Policy #0 lag: (min: 6.0, avg: 13.8, max: 38.0) +[2023-10-14 05:02:48,512][99942] Avg episode reward: [(0, '0.760'), (1, '-0.094')] +[2023-10-14 05:02:48,874][100917] Updated weights for policy 1, policy_version 1120 (0.0007) +[2023-10-14 05:02:52,253][100936] Updated weights for policy 0, policy_version 1130 (0.0008) +[2023-10-14 05:02:52,624][100936] Updated weights for policy 0, policy_version 1140 (0.0008) +[2023-10-14 05:02:52,866][100917] Updated weights for policy 1, policy_version 1130 (0.0008) +[2023-10-14 05:02:52,994][100936] Updated weights for policy 0, policy_version 1150 (0.0008) +[2023-10-14 05:02:53,248][100917] Updated weights for policy 1, policy_version 1140 (0.0007) +[2023-10-14 05:02:53,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13653.2, 300 sec: 12734.1). Total num frames: 2326528. Throughput: 0: 1662.5, 1: 1641.5. Samples: 593654. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:02:53,513][99942] Avg episode reward: [(0, '0.760'), (1, '-0.094')] +[2023-10-14 05:02:53,619][100917] Updated weights for policy 1, policy_version 1150 (0.0008) +[2023-10-14 05:02:57,264][100936] Updated weights for policy 0, policy_version 1160 (0.0008) +[2023-10-14 05:02:57,632][100936] Updated weights for policy 0, policy_version 1170 (0.0008) +[2023-10-14 05:02:57,883][100917] Updated weights for policy 1, policy_version 1160 (0.0008) +[2023-10-14 05:02:57,996][100936] Updated weights for policy 0, policy_version 1180 (0.0008) +[2023-10-14 05:02:58,262][100917] Updated weights for policy 1, policy_version 1170 (0.0009) +[2023-10-14 05:02:58,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 12744.1). Total num frames: 2392064. Throughput: 0: 1669.1, 1: 1652.9. Samples: 604160. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:02:58,513][99942] Avg episode reward: [(0, '0.960'), (1, '0.740')] +[2023-10-14 05:02:58,625][100917] Updated weights for policy 1, policy_version 1180 (0.0009) +[2023-10-14 05:03:02,013][100936] Updated weights for policy 0, policy_version 1190 (0.0007) +[2023-10-14 05:03:02,401][100936] Updated weights for policy 0, policy_version 1200 (0.0007) +[2023-10-14 05:03:02,704][100917] Updated weights for policy 1, policy_version 1190 (0.0008) +[2023-10-14 05:03:02,771][100936] Updated weights for policy 0, policy_version 1210 (0.0008) +[2023-10-14 05:03:03,086][100917] Updated weights for policy 1, policy_version 1200 (0.0007) +[2023-10-14 05:03:03,454][100917] Updated weights for policy 1, policy_version 1210 (0.0010) +[2023-10-14 05:03:03,512][99942] Fps is (10 sec: 13107.8, 60 sec: 13653.3, 300 sec: 12753.5). Total num frames: 2457600. Throughput: 0: 1654.3, 1: 1654.1. Samples: 623878. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:03:03,512][99942] Avg episode reward: [(0, '0.960'), (1, '0.740')] +[2023-10-14 05:03:06,881][100936] Updated weights for policy 0, policy_version 1220 (0.0010) +[2023-10-14 05:03:07,258][100936] Updated weights for policy 0, policy_version 1230 (0.0008) +[2023-10-14 05:03:07,622][100936] Updated weights for policy 0, policy_version 1240 (0.0008) +[2023-10-14 05:03:07,656][100917] Updated weights for policy 1, policy_version 1220 (0.0009) +[2023-10-14 05:03:08,035][100917] Updated weights for policy 1, policy_version 1230 (0.0009) +[2023-10-14 05:03:08,400][100917] Updated weights for policy 1, policy_version 1240 (0.0007) +[2023-10-14 05:03:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 12762.4). Total num frames: 2523136. Throughput: 0: 1652.0, 1: 1644.7. Samples: 642972. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:03:08,513][99942] Avg episode reward: [(0, '0.960'), (1, '0.740')] +[2023-10-14 05:03:12,046][100936] Updated weights for policy 0, policy_version 1250 (0.0009) +[2023-10-14 05:03:12,419][100936] Updated weights for policy 0, policy_version 1260 (0.0008) +[2023-10-14 05:03:12,430][100917] Updated weights for policy 1, policy_version 1250 (0.0007) +[2023-10-14 05:03:12,791][100936] Updated weights for policy 0, policy_version 1270 (0.0008) +[2023-10-14 05:03:12,796][100917] Updated weights for policy 1, policy_version 1260 (0.0009) +[2023-10-14 05:03:13,152][100936] Updated weights for policy 0, policy_version 1280 (0.0008) +[2023-10-14 05:03:13,174][100917] Updated weights for policy 1, policy_version 1270 (0.0008) +[2023-10-14 05:03:13,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 12770.9). Total num frames: 2588672. Throughput: 0: 1659.4, 1: 1653.3. Samples: 653526. Policy #0 lag: (min: 31.0, avg: 31.0, max: 34.0) +[2023-10-14 05:03:13,513][99942] Avg episode reward: [(0, '0.960'), (1, '0.740')] +[2023-10-14 05:03:13,550][100917] Updated weights for policy 1, policy_version 1280 (0.0009) +[2023-10-14 05:03:17,449][100936] Updated weights for policy 0, policy_version 1290 (0.0009) +[2023-10-14 05:03:17,649][100917] Updated weights for policy 1, policy_version 1290 (0.0007) +[2023-10-14 05:03:17,820][100936] Updated weights for policy 0, policy_version 1300 (0.0010) +[2023-10-14 05:03:18,017][100917] Updated weights for policy 1, policy_version 1300 (0.0008) +[2023-10-14 05:03:18,194][100936] Updated weights for policy 0, policy_version 1310 (0.0007) +[2023-10-14 05:03:18,397][100917] Updated weights for policy 1, policy_version 1310 (0.0008) +[2023-10-14 05:03:18,512][99942] Fps is (10 sec: 16384.5, 60 sec: 14199.5, 300 sec: 12936.8). Total num frames: 2686976. Throughput: 0: 1652.6, 1: 1654.6. Samples: 673582. Policy #0 lag: (min: 26.0, avg: 26.1, max: 33.0) +[2023-10-14 05:03:18,512][99942] Avg episode reward: [(0, '0.960'), (1, '0.740')] +[2023-10-14 05:03:22,084][100936] Updated weights for policy 0, policy_version 1320 (0.0009) +[2023-10-14 05:03:22,459][100936] Updated weights for policy 0, policy_version 1330 (0.0009) +[2023-10-14 05:03:22,652][100917] Updated weights for policy 1, policy_version 1320 (0.0007) +[2023-10-14 05:03:22,827][100936] Updated weights for policy 0, policy_version 1340 (0.0009) +[2023-10-14 05:03:23,031][100917] Updated weights for policy 1, policy_version 1330 (0.0008) +[2023-10-14 05:03:23,397][100917] Updated weights for policy 1, policy_version 1340 (0.0008) +[2023-10-14 05:03:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 12786.8). Total num frames: 2719744. Throughput: 0: 1653.1, 1: 1643.1. Samples: 692368. Policy #0 lag: (min: 28.0, avg: 30.1, max: 60.0) +[2023-10-14 05:03:23,512][99942] Avg episode reward: [(0, '0.960'), (1, '0.750')] +[2023-10-14 05:03:27,171][100936] Updated weights for policy 0, policy_version 1350 (0.0008) +[2023-10-14 05:03:27,532][100936] Updated weights for policy 0, policy_version 1360 (0.0008) +[2023-10-14 05:03:27,644][100917] Updated weights for policy 1, policy_version 1350 (0.0008) +[2023-10-14 05:03:27,904][100936] Updated weights for policy 0, policy_version 1370 (0.0007) +[2023-10-14 05:03:28,007][100917] Updated weights for policy 1, policy_version 1360 (0.0008) +[2023-10-14 05:03:28,390][100917] Updated weights for policy 1, policy_version 1370 (0.0008) +[2023-10-14 05:03:28,512][99942] Fps is (10 sec: 9830.3, 60 sec: 13653.3, 300 sec: 12794.1). Total num frames: 2785280. Throughput: 0: 1655.1, 1: 1647.2. Samples: 702998. Policy #0 lag: (min: 10.0, avg: 10.1, max: 18.0) +[2023-10-14 05:03:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.790')] +[2023-10-14 05:03:32,067][100936] Updated weights for policy 0, policy_version 1380 (0.0008) +[2023-10-14 05:03:32,441][100936] Updated weights for policy 0, policy_version 1390 (0.0010) +[2023-10-14 05:03:32,615][100917] Updated weights for policy 1, policy_version 1380 (0.0008) +[2023-10-14 05:03:32,812][100936] Updated weights for policy 0, policy_version 1400 (0.0008) +[2023-10-14 05:03:32,997][100917] Updated weights for policy 1, policy_version 1390 (0.0008) +[2023-10-14 05:03:33,373][100917] Updated weights for policy 1, policy_version 1400 (0.0009) +[2023-10-14 05:03:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 12801.1). Total num frames: 2850816. Throughput: 0: 1647.6, 1: 1652.2. Samples: 722892. Policy #0 lag: (min: 10.0, avg: 17.7, max: 42.0) +[2023-10-14 05:03:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.790')] +[2023-10-14 05:03:36,977][100936] Updated weights for policy 0, policy_version 1410 (0.0008) +[2023-10-14 05:03:37,346][100936] Updated weights for policy 0, policy_version 1420 (0.0007) +[2023-10-14 05:03:37,489][100917] Updated weights for policy 1, policy_version 1410 (0.0008) +[2023-10-14 05:03:37,715][100936] Updated weights for policy 0, policy_version 1430 (0.0009) +[2023-10-14 05:03:37,865][100917] Updated weights for policy 1, policy_version 1420 (0.0010) +[2023-10-14 05:03:38,088][100936] Updated weights for policy 0, policy_version 1440 (0.0007) +[2023-10-14 05:03:38,256][100917] Updated weights for policy 1, policy_version 1430 (0.0009) +[2023-10-14 05:03:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 12807.9). Total num frames: 2916352. Throughput: 0: 1647.4, 1: 1644.8. Samples: 741804. Policy #0 lag: (min: 15.0, avg: 15.9, max: 36.0) +[2023-10-14 05:03:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.790')] +[2023-10-14 05:03:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000001440_1474560.pth... +[2023-10-14 05:03:38,621][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000001440_1474560.pth... +[2023-10-14 05:03:38,622][100917] Updated weights for policy 1, policy_version 1440 (0.0009) +[2023-10-14 05:03:42,023][100936] Updated weights for policy 0, policy_version 1450 (0.0007) +[2023-10-14 05:03:42,399][100936] Updated weights for policy 0, policy_version 1460 (0.0008) +[2023-10-14 05:03:42,688][100917] Updated weights for policy 1, policy_version 1450 (0.0009) +[2023-10-14 05:03:42,762][100936] Updated weights for policy 0, policy_version 1470 (0.0007) +[2023-10-14 05:03:43,060][100917] Updated weights for policy 1, policy_version 1460 (0.0007) +[2023-10-14 05:03:43,444][100917] Updated weights for policy 1, policy_version 1470 (0.0007) +[2023-10-14 05:03:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 12814.3). Total num frames: 2981888. Throughput: 0: 1657.8, 1: 1648.1. Samples: 752926. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:03:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.790')] +[2023-10-14 05:03:46,886][100936] Updated weights for policy 0, policy_version 1480 (0.0009) +[2023-10-14 05:03:47,256][100936] Updated weights for policy 0, policy_version 1490 (0.0008) +[2023-10-14 05:03:47,626][100936] Updated weights for policy 0, policy_version 1500 (0.0010) +[2023-10-14 05:03:47,834][100917] Updated weights for policy 1, policy_version 1480 (0.0009) +[2023-10-14 05:03:48,222][100917] Updated weights for policy 1, policy_version 1490 (0.0009) +[2023-10-14 05:03:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 12820.5). Total num frames: 3047424. Throughput: 0: 1652.1, 1: 1640.1. Samples: 772028. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:03:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 05:03:48,586][100917] Updated weights for policy 1, policy_version 1500 (0.0010) +[2023-10-14 05:03:51,695][100936] Updated weights for policy 0, policy_version 1510 (0.0009) +[2023-10-14 05:03:52,067][100936] Updated weights for policy 0, policy_version 1520 (0.0010) +[2023-10-14 05:03:52,435][100936] Updated weights for policy 0, policy_version 1530 (0.0010) +[2023-10-14 05:03:52,726][100917] Updated weights for policy 1, policy_version 1510 (0.0007) +[2023-10-14 05:03:53,094][100917] Updated weights for policy 1, policy_version 1520 (0.0007) +[2023-10-14 05:03:53,476][100917] Updated weights for policy 1, policy_version 1530 (0.0008) +[2023-10-14 05:03:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 12826.4). Total num frames: 3112960. Throughput: 0: 1655.2, 1: 1642.2. Samples: 791358. Policy #0 lag: (min: 5.0, avg: 6.0, max: 28.0) +[2023-10-14 05:03:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 05:03:56,668][100936] Updated weights for policy 0, policy_version 1540 (0.0008) +[2023-10-14 05:03:57,047][100936] Updated weights for policy 0, policy_version 1550 (0.0009) +[2023-10-14 05:03:57,415][100936] Updated weights for policy 0, policy_version 1560 (0.0009) +[2023-10-14 05:03:57,592][100917] Updated weights for policy 1, policy_version 1540 (0.0007) +[2023-10-14 05:03:57,972][100917] Updated weights for policy 1, policy_version 1550 (0.0009) +[2023-10-14 05:03:58,354][100917] Updated weights for policy 1, policy_version 1560 (0.0008) +[2023-10-14 05:03:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 12832.0). Total num frames: 3178496. Throughput: 0: 1653.8, 1: 1640.9. Samples: 801786. Policy #0 lag: (min: 17.0, avg: 25.0, max: 49.0) +[2023-10-14 05:03:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 05:04:01,480][100936] Updated weights for policy 0, policy_version 1570 (0.0009) +[2023-10-14 05:04:01,841][100936] Updated weights for policy 0, policy_version 1580 (0.0007) +[2023-10-14 05:04:02,214][100936] Updated weights for policy 0, policy_version 1590 (0.0008) +[2023-10-14 05:04:02,434][100917] Updated weights for policy 1, policy_version 1570 (0.0010) +[2023-10-14 05:04:02,584][100936] Updated weights for policy 0, policy_version 1600 (0.0008) +[2023-10-14 05:04:02,804][100917] Updated weights for policy 1, policy_version 1580 (0.0009) +[2023-10-14 05:04:03,183][100917] Updated weights for policy 1, policy_version 1590 (0.0008) +[2023-10-14 05:04:03,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 12837.5). Total num frames: 3244032. Throughput: 0: 1641.0, 1: 1642.8. Samples: 821354. Policy #0 lag: (min: 30.0, avg: 32.8, max: 62.0) +[2023-10-14 05:04:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 05:04:03,549][100917] Updated weights for policy 1, policy_version 1600 (0.0007) +[2023-10-14 05:04:06,802][100936] Updated weights for policy 0, policy_version 1610 (0.0012) +[2023-10-14 05:04:07,169][100936] Updated weights for policy 0, policy_version 1620 (0.0008) +[2023-10-14 05:04:07,545][100936] Updated weights for policy 0, policy_version 1630 (0.0007) +[2023-10-14 05:04:07,599][100917] Updated weights for policy 1, policy_version 1610 (0.0007) +[2023-10-14 05:04:07,981][100917] Updated weights for policy 1, policy_version 1620 (0.0010) +[2023-10-14 05:04:08,354][100917] Updated weights for policy 1, policy_version 1630 (0.0009) +[2023-10-14 05:04:08,512][99942] Fps is (10 sec: 16383.7, 60 sec: 13653.4, 300 sec: 12969.9). Total num frames: 3342336. Throughput: 0: 1653.9, 1: 1641.0. Samples: 840638. Policy #0 lag: (min: 13.0, avg: 14.1, max: 35.0) +[2023-10-14 05:04:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 05:04:11,832][100936] Updated weights for policy 0, policy_version 1640 (0.0008) +[2023-10-14 05:04:12,203][100936] Updated weights for policy 0, policy_version 1650 (0.0008) +[2023-10-14 05:04:12,447][100917] Updated weights for policy 1, policy_version 1640 (0.0007) +[2023-10-14 05:04:12,573][100936] Updated weights for policy 0, policy_version 1660 (0.0009) +[2023-10-14 05:04:12,830][100917] Updated weights for policy 1, policy_version 1650 (0.0009) +[2023-10-14 05:04:13,199][100917] Updated weights for policy 1, policy_version 1660 (0.0008) +[2023-10-14 05:04:13,512][99942] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 12972.5). Total num frames: 3407872. Throughput: 0: 1656.8, 1: 1643.2. Samples: 851494. Policy #0 lag: (min: 4.0, avg: 15.0, max: 36.0) +[2023-10-14 05:04:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 05:04:16,642][100936] Updated weights for policy 0, policy_version 1670 (0.0007) +[2023-10-14 05:04:17,013][100936] Updated weights for policy 0, policy_version 1680 (0.0010) +[2023-10-14 05:04:17,359][100917] Updated weights for policy 1, policy_version 1670 (0.0009) +[2023-10-14 05:04:17,396][100936] Updated weights for policy 0, policy_version 1690 (0.0008) +[2023-10-14 05:04:17,737][100917] Updated weights for policy 1, policy_version 1680 (0.0010) +[2023-10-14 05:04:18,101][100917] Updated weights for policy 1, policy_version 1690 (0.0009) +[2023-10-14 05:04:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 12975.0). Total num frames: 3473408. Throughput: 0: 1642.2, 1: 1644.5. Samples: 870792. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) +[2023-10-14 05:04:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.520')] +[2023-10-14 05:04:21,452][100936] Updated weights for policy 0, policy_version 1700 (0.0009) +[2023-10-14 05:04:21,817][100936] Updated weights for policy 0, policy_version 1710 (0.0010) +[2023-10-14 05:04:22,196][100936] Updated weights for policy 0, policy_version 1720 (0.0009) +[2023-10-14 05:04:22,265][100917] Updated weights for policy 1, policy_version 1700 (0.0007) +[2023-10-14 05:04:22,634][100917] Updated weights for policy 1, policy_version 1710 (0.0008) +[2023-10-14 05:04:23,007][100917] Updated weights for policy 1, policy_version 1720 (0.0008) +[2023-10-14 05:04:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 12977.4). Total num frames: 3538944. Throughput: 0: 1664.1, 1: 1639.4. Samples: 890462. Policy #0 lag: (min: 31.0, avg: 33.4, max: 63.0) +[2023-10-14 05:04:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.510')] +[2023-10-14 05:04:26,332][100936] Updated weights for policy 0, policy_version 1730 (0.0008) +[2023-10-14 05:04:26,705][100936] Updated weights for policy 0, policy_version 1740 (0.0008) +[2023-10-14 05:04:27,086][100936] Updated weights for policy 0, policy_version 1750 (0.0008) +[2023-10-14 05:04:27,277][100917] Updated weights for policy 1, policy_version 1730 (0.0009) +[2023-10-14 05:04:27,456][100936] Updated weights for policy 0, policy_version 1760 (0.0008) +[2023-10-14 05:04:27,654][100917] Updated weights for policy 1, policy_version 1740 (0.0009) +[2023-10-14 05:04:28,035][100917] Updated weights for policy 1, policy_version 1750 (0.0011) +[2023-10-14 05:04:28,416][100917] Updated weights for policy 1, policy_version 1760 (0.0007) +[2023-10-14 05:04:28,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13653.3, 300 sec: 12979.7). Total num frames: 3604480. Throughput: 0: 1655.5, 1: 1642.5. Samples: 901334. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:04:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.510')] +[2023-10-14 05:04:31,653][100936] Updated weights for policy 0, policy_version 1770 (0.0007) +[2023-10-14 05:04:32,021][100936] Updated weights for policy 0, policy_version 1780 (0.0008) +[2023-10-14 05:04:32,408][100936] Updated weights for policy 0, policy_version 1790 (0.0008) +[2023-10-14 05:04:32,482][100917] Updated weights for policy 1, policy_version 1770 (0.0009) +[2023-10-14 05:04:32,865][100917] Updated weights for policy 1, policy_version 1780 (0.0007) +[2023-10-14 05:04:33,237][100917] Updated weights for policy 1, policy_version 1790 (0.0007) +[2023-10-14 05:04:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 12982.0). Total num frames: 3670016. Throughput: 0: 1655.6, 1: 1652.9. Samples: 920912. Policy #0 lag: (min: 27.0, avg: 39.4, max: 40.0) +[2023-10-14 05:04:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.510')] +[2023-10-14 05:04:36,351][100936] Updated weights for policy 0, policy_version 1800 (0.0007) +[2023-10-14 05:04:36,733][100936] Updated weights for policy 0, policy_version 1810 (0.0007) +[2023-10-14 05:04:37,105][100936] Updated weights for policy 0, policy_version 1820 (0.0007) +[2023-10-14 05:04:37,363][100917] Updated weights for policy 1, policy_version 1800 (0.0008) +[2023-10-14 05:04:37,737][100917] Updated weights for policy 1, policy_version 1810 (0.0009) +[2023-10-14 05:04:38,123][100917] Updated weights for policy 1, policy_version 1820 (0.0009) +[2023-10-14 05:04:38,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 12984.2). Total num frames: 3735552. Throughput: 0: 1665.0, 1: 1638.0. Samples: 939992. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:04:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.510')] +[2023-10-14 05:04:41,145][100936] Updated weights for policy 0, policy_version 1830 (0.0009) +[2023-10-14 05:04:41,515][100936] Updated weights for policy 0, policy_version 1840 (0.0009) +[2023-10-14 05:04:41,881][100936] Updated weights for policy 0, policy_version 1850 (0.0008) +[2023-10-14 05:04:42,262][100917] Updated weights for policy 1, policy_version 1830 (0.0008) +[2023-10-14 05:04:42,649][100917] Updated weights for policy 1, policy_version 1840 (0.0007) +[2023-10-14 05:04:43,027][100917] Updated weights for policy 1, policy_version 1850 (0.0007) +[2023-10-14 05:04:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 12986.3). Total num frames: 3801088. Throughput: 0: 1656.2, 1: 1652.1. Samples: 950658. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:04:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.460')] +[2023-10-14 05:04:46,310][100936] Updated weights for policy 0, policy_version 1860 (0.0008) +[2023-10-14 05:04:46,675][100936] Updated weights for policy 0, policy_version 1870 (0.0007) +[2023-10-14 05:04:46,962][100917] Updated weights for policy 1, policy_version 1860 (0.0008) +[2023-10-14 05:04:47,040][100936] Updated weights for policy 0, policy_version 1880 (0.0007) +[2023-10-14 05:04:47,336][100917] Updated weights for policy 1, policy_version 1870 (0.0007) +[2023-10-14 05:04:47,709][100917] Updated weights for policy 1, policy_version 1880 (0.0010) +[2023-10-14 05:04:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13107.2). Total num frames: 3866624. Throughput: 0: 1654.1, 1: 1650.0. Samples: 970042. Policy #0 lag: (min: 26.0, avg: 27.3, max: 44.0) +[2023-10-14 05:04:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.460')] +[2023-10-14 05:04:51,107][100936] Updated weights for policy 0, policy_version 1890 (0.0008) +[2023-10-14 05:04:51,470][100936] Updated weights for policy 0, policy_version 1900 (0.0011) +[2023-10-14 05:04:51,711][100917] Updated weights for policy 1, policy_version 1890 (0.0009) +[2023-10-14 05:04:51,841][100936] Updated weights for policy 0, policy_version 1910 (0.0008) +[2023-10-14 05:04:52,092][100917] Updated weights for policy 1, policy_version 1900 (0.0007) +[2023-10-14 05:04:52,211][100936] Updated weights for policy 0, policy_version 1920 (0.0008) +[2023-10-14 05:04:52,473][100917] Updated weights for policy 1, policy_version 1910 (0.0009) +[2023-10-14 05:04:52,840][100917] Updated weights for policy 1, policy_version 1920 (0.0009) +[2023-10-14 05:04:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 3932160. Throughput: 0: 1664.1, 1: 1650.1. Samples: 989780. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:04:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.460')] +[2023-10-14 05:04:56,178][100936] Updated weights for policy 0, policy_version 1930 (0.0008) +[2023-10-14 05:04:56,553][100936] Updated weights for policy 0, policy_version 1940 (0.0008) +[2023-10-14 05:04:56,920][100936] Updated weights for policy 0, policy_version 1950 (0.0008) +[2023-10-14 05:04:57,099][100917] Updated weights for policy 1, policy_version 1930 (0.0009) +[2023-10-14 05:04:57,471][100917] Updated weights for policy 1, policy_version 1940 (0.0010) +[2023-10-14 05:04:57,855][100917] Updated weights for policy 1, policy_version 1950 (0.0010) +[2023-10-14 05:04:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 3997696. Throughput: 0: 1653.3, 1: 1664.6. Samples: 1000800. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:04:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.460')] +[2023-10-14 05:05:01,100][100936] Updated weights for policy 0, policy_version 1960 (0.0008) +[2023-10-14 05:05:01,478][100936] Updated weights for policy 0, policy_version 1970 (0.0009) +[2023-10-14 05:05:01,851][100936] Updated weights for policy 0, policy_version 1980 (0.0007) +[2023-10-14 05:05:01,863][100917] Updated weights for policy 1, policy_version 1960 (0.0007) +[2023-10-14 05:05:02,241][100917] Updated weights for policy 1, policy_version 1970 (0.0007) +[2023-10-14 05:05:02,620][100917] Updated weights for policy 1, policy_version 1980 (0.0008) +[2023-10-14 05:05:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 4063232. Throughput: 0: 1660.8, 1: 1658.1. Samples: 1020144. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:05:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.510')] +[2023-10-14 05:05:06,016][100936] Updated weights for policy 0, policy_version 1990 (0.0009) +[2023-10-14 05:05:06,394][100936] Updated weights for policy 0, policy_version 2000 (0.0007) +[2023-10-14 05:05:06,740][100917] Updated weights for policy 1, policy_version 1990 (0.0009) +[2023-10-14 05:05:06,756][100936] Updated weights for policy 0, policy_version 2010 (0.0009) +[2023-10-14 05:05:07,125][100917] Updated weights for policy 1, policy_version 2000 (0.0009) +[2023-10-14 05:05:07,499][100917] Updated weights for policy 1, policy_version 2010 (0.0009) +[2023-10-14 05:05:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 4128768. Throughput: 0: 1653.4, 1: 1654.2. Samples: 1039304. Policy #0 lag: (min: 1.0, avg: 7.1, max: 33.0) +[2023-10-14 05:05:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.510')] +[2023-10-14 05:05:10,982][100936] Updated weights for policy 0, policy_version 2020 (0.0008) +[2023-10-14 05:05:11,357][100936] Updated weights for policy 0, policy_version 2030 (0.0010) +[2023-10-14 05:05:11,614][100917] Updated weights for policy 1, policy_version 2020 (0.0009) +[2023-10-14 05:05:11,732][100936] Updated weights for policy 0, policy_version 2040 (0.0008) +[2023-10-14 05:05:11,989][100917] Updated weights for policy 1, policy_version 2030 (0.0009) +[2023-10-14 05:05:12,358][100917] Updated weights for policy 1, policy_version 2040 (0.0009) +[2023-10-14 05:05:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 4194304. Throughput: 0: 1644.3, 1: 1665.1. Samples: 1050258. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:05:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.510')] +[2023-10-14 05:05:16,025][100936] Updated weights for policy 0, policy_version 2050 (0.0009) +[2023-10-14 05:05:16,395][100936] Updated weights for policy 0, policy_version 2060 (0.0007) +[2023-10-14 05:05:16,531][100917] Updated weights for policy 1, policy_version 2050 (0.0008) +[2023-10-14 05:05:16,767][100936] Updated weights for policy 0, policy_version 2070 (0.0008) +[2023-10-14 05:05:16,895][100917] Updated weights for policy 1, policy_version 2060 (0.0007) +[2023-10-14 05:05:17,148][100936] Updated weights for policy 0, policy_version 2080 (0.0008) +[2023-10-14 05:05:17,267][100917] Updated weights for policy 1, policy_version 2070 (0.0007) +[2023-10-14 05:05:17,647][100917] Updated weights for policy 1, policy_version 2080 (0.0010) +[2023-10-14 05:05:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 4259840. Throughput: 0: 1645.3, 1: 1652.0. Samples: 1069290. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:05:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.510')] +[2023-10-14 05:05:21,139][100936] Updated weights for policy 0, policy_version 2090 (0.0009) +[2023-10-14 05:05:21,514][100936] Updated weights for policy 0, policy_version 2100 (0.0008) +[2023-10-14 05:05:21,791][100917] Updated weights for policy 1, policy_version 2090 (0.0007) +[2023-10-14 05:05:21,884][100936] Updated weights for policy 0, policy_version 2110 (0.0008) +[2023-10-14 05:05:22,165][100917] Updated weights for policy 1, policy_version 2100 (0.0007) +[2023-10-14 05:05:22,538][100917] Updated weights for policy 1, policy_version 2110 (0.0007) +[2023-10-14 05:05:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 4325376. Throughput: 0: 1651.4, 1: 1657.2. Samples: 1088880. Policy #0 lag: (min: 26.0, avg: 32.9, max: 58.0) +[2023-10-14 05:05:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.510')] +[2023-10-14 05:05:26,073][100936] Updated weights for policy 0, policy_version 2120 (0.0008) +[2023-10-14 05:05:26,445][100936] Updated weights for policy 0, policy_version 2130 (0.0007) +[2023-10-14 05:05:26,786][100917] Updated weights for policy 1, policy_version 2120 (0.0009) +[2023-10-14 05:05:26,809][100936] Updated weights for policy 0, policy_version 2140 (0.0008) +[2023-10-14 05:05:27,153][100917] Updated weights for policy 1, policy_version 2130 (0.0007) +[2023-10-14 05:05:27,536][100917] Updated weights for policy 1, policy_version 2140 (0.0009) +[2023-10-14 05:05:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 4390912. Throughput: 0: 1647.6, 1: 1662.8. Samples: 1099626. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:05:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.510')] +[2023-10-14 05:05:30,976][100936] Updated weights for policy 0, policy_version 2150 (0.0008) +[2023-10-14 05:05:31,344][100936] Updated weights for policy 0, policy_version 2160 (0.0010) +[2023-10-14 05:05:31,709][100936] Updated weights for policy 0, policy_version 2170 (0.0009) +[2023-10-14 05:05:31,741][100917] Updated weights for policy 1, policy_version 2150 (0.0008) +[2023-10-14 05:05:32,113][100917] Updated weights for policy 1, policy_version 2160 (0.0008) +[2023-10-14 05:05:32,496][100917] Updated weights for policy 1, policy_version 2170 (0.0007) +[2023-10-14 05:05:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 4456448. Throughput: 0: 1658.2, 1: 1648.1. Samples: 1118826. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:05:33,513][99942] Avg episode reward: [(0, '0.800'), (1, '0.510')] +[2023-10-14 05:05:35,839][100936] Updated weights for policy 0, policy_version 2180 (0.0007) +[2023-10-14 05:05:36,206][100936] Updated weights for policy 0, policy_version 2190 (0.0010) +[2023-10-14 05:05:36,577][100936] Updated weights for policy 0, policy_version 2200 (0.0009) +[2023-10-14 05:05:36,735][100917] Updated weights for policy 1, policy_version 2180 (0.0009) +[2023-10-14 05:05:37,115][100917] Updated weights for policy 1, policy_version 2190 (0.0009) +[2023-10-14 05:05:37,488][100917] Updated weights for policy 1, policy_version 2200 (0.0008) +[2023-10-14 05:05:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 4521984. Throughput: 0: 1650.7, 1: 1646.8. Samples: 1138168. Policy #0 lag: (min: 31.0, avg: 32.7, max: 61.0) +[2023-10-14 05:05:38,513][99942] Avg episode reward: [(0, '0.800'), (1, '0.510')] +[2023-10-14 05:05:38,523][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000002208_2260992.pth... +[2023-10-14 05:05:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000002208_2260992.pth... +[2023-10-14 05:05:38,564][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000000672_688128.pth +[2023-10-14 05:05:38,564][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000000672_688128.pth +[2023-10-14 05:05:40,562][100936] Updated weights for policy 0, policy_version 2210 (0.0009) +[2023-10-14 05:05:40,931][100936] Updated weights for policy 0, policy_version 2220 (0.0009) +[2023-10-14 05:05:41,305][100936] Updated weights for policy 0, policy_version 2230 (0.0010) +[2023-10-14 05:05:41,381][100917] Updated weights for policy 1, policy_version 2210 (0.0008) +[2023-10-14 05:05:41,675][100936] Updated weights for policy 0, policy_version 2240 (0.0008) +[2023-10-14 05:05:41,752][100917] Updated weights for policy 1, policy_version 2220 (0.0007) +[2023-10-14 05:05:42,127][100917] Updated weights for policy 1, policy_version 2230 (0.0008) +[2023-10-14 05:05:42,493][100917] Updated weights for policy 1, policy_version 2240 (0.0007) +[2023-10-14 05:05:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 4587520. Throughput: 0: 1638.5, 1: 1651.5. Samples: 1148852. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:05:43,513][99942] Avg episode reward: [(0, '0.690'), (1, '0.510')] +[2023-10-14 05:05:45,792][100936] Updated weights for policy 0, policy_version 2250 (0.0008) +[2023-10-14 05:05:46,156][100936] Updated weights for policy 0, policy_version 2260 (0.0009) +[2023-10-14 05:05:46,523][100936] Updated weights for policy 0, policy_version 2270 (0.0009) +[2023-10-14 05:05:46,654][100917] Updated weights for policy 1, policy_version 2250 (0.0007) +[2023-10-14 05:05:47,024][100917] Updated weights for policy 1, policy_version 2260 (0.0007) +[2023-10-14 05:05:47,406][100917] Updated weights for policy 1, policy_version 2270 (0.0007) +[2023-10-14 05:05:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 4653056. Throughput: 0: 1648.7, 1: 1640.7. Samples: 1168166. Policy #0 lag: (min: 31.0, avg: 31.9, max: 51.0) +[2023-10-14 05:05:48,513][99942] Avg episode reward: [(0, '0.690'), (1, '0.510')] +[2023-10-14 05:05:50,642][100936] Updated weights for policy 0, policy_version 2280 (0.0008) +[2023-10-14 05:05:51,007][100936] Updated weights for policy 0, policy_version 2290 (0.0008) +[2023-10-14 05:05:51,383][100936] Updated weights for policy 0, policy_version 2300 (0.0007) +[2023-10-14 05:05:51,568][100917] Updated weights for policy 1, policy_version 2280 (0.0008) +[2023-10-14 05:05:51,941][100917] Updated weights for policy 1, policy_version 2290 (0.0007) +[2023-10-14 05:05:52,324][100917] Updated weights for policy 1, policy_version 2300 (0.0007) +[2023-10-14 05:05:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 4718592. Throughput: 0: 1655.7, 1: 1648.5. Samples: 1187992. Policy #0 lag: (min: 9.0, avg: 9.7, max: 27.0) +[2023-10-14 05:05:53,513][99942] Avg episode reward: [(0, '0.690'), (1, '0.510')] +[2023-10-14 05:05:55,537][100936] Updated weights for policy 0, policy_version 2310 (0.0009) +[2023-10-14 05:05:55,903][100936] Updated weights for policy 0, policy_version 2320 (0.0007) +[2023-10-14 05:05:56,277][100936] Updated weights for policy 0, policy_version 2330 (0.0009) +[2023-10-14 05:05:56,514][100917] Updated weights for policy 1, policy_version 2310 (0.0008) +[2023-10-14 05:05:56,887][100917] Updated weights for policy 1, policy_version 2320 (0.0009) +[2023-10-14 05:05:57,255][100917] Updated weights for policy 1, policy_version 2330 (0.0009) +[2023-10-14 05:05:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 4784128. Throughput: 0: 1642.0, 1: 1646.1. Samples: 1198222. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:05:58,513][99942] Avg episode reward: [(0, '0.430'), (1, '0.510')] +[2023-10-14 05:06:00,436][100936] Updated weights for policy 0, policy_version 2340 (0.0007) +[2023-10-14 05:06:00,813][100936] Updated weights for policy 0, policy_version 2350 (0.0008) +[2023-10-14 05:06:01,177][100936] Updated weights for policy 0, policy_version 2360 (0.0009) +[2023-10-14 05:06:01,396][100917] Updated weights for policy 1, policy_version 2340 (0.0007) +[2023-10-14 05:06:01,761][100917] Updated weights for policy 1, policy_version 2350 (0.0007) +[2023-10-14 05:06:02,216][100917] Updated weights for policy 1, policy_version 2362 (0.0009) +[2023-10-14 05:06:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 4849664. Throughput: 0: 1657.0, 1: 1639.4. Samples: 1217626. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 05:06:03,513][99942] Avg episode reward: [(0, '0.430'), (1, '0.500')] +[2023-10-14 05:06:05,246][100936] Updated weights for policy 0, policy_version 2370 (0.0010) +[2023-10-14 05:06:05,614][100936] Updated weights for policy 0, policy_version 2380 (0.0008) +[2023-10-14 05:06:05,988][100936] Updated weights for policy 0, policy_version 2390 (0.0009) +[2023-10-14 05:06:06,360][100936] Updated weights for policy 0, policy_version 2400 (0.0010) +[2023-10-14 05:06:06,468][100917] Updated weights for policy 1, policy_version 2372 (0.0007) +[2023-10-14 05:06:06,841][100917] Updated weights for policy 1, policy_version 2382 (0.0008) +[2023-10-14 05:06:07,219][100917] Updated weights for policy 1, policy_version 2392 (0.0007) +[2023-10-14 05:06:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 4915200. Throughput: 0: 1658.1, 1: 1644.4. Samples: 1237492. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:06:08,512][99942] Avg episode reward: [(0, '0.430'), (1, '0.500')] +[2023-10-14 05:06:10,380][100936] Updated weights for policy 0, policy_version 2410 (0.0009) +[2023-10-14 05:06:10,758][100936] Updated weights for policy 0, policy_version 2420 (0.0010) +[2023-10-14 05:06:11,127][100936] Updated weights for policy 0, policy_version 2430 (0.0008) +[2023-10-14 05:06:11,471][100917] Updated weights for policy 1, policy_version 2402 (0.0009) +[2023-10-14 05:06:11,846][100917] Updated weights for policy 1, policy_version 2412 (0.0007) +[2023-10-14 05:06:12,212][100917] Updated weights for policy 1, policy_version 2422 (0.0009) +[2023-10-14 05:06:12,587][100917] Updated weights for policy 1, policy_version 2432 (0.0007) +[2023-10-14 05:06:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 4980736. Throughput: 0: 1644.3, 1: 1647.8. Samples: 1247774. Policy #0 lag: (min: 4.0, avg: 8.4, max: 36.0) +[2023-10-14 05:06:13,513][99942] Avg episode reward: [(0, '0.430'), (1, '0.500')] +[2023-10-14 05:06:15,129][100936] Updated weights for policy 0, policy_version 2440 (0.0009) +[2023-10-14 05:06:15,497][100936] Updated weights for policy 0, policy_version 2450 (0.0010) +[2023-10-14 05:06:15,875][100936] Updated weights for policy 0, policy_version 2460 (0.0007) +[2023-10-14 05:06:16,946][100917] Updated weights for policy 1, policy_version 2442 (0.0010) +[2023-10-14 05:06:17,325][100917] Updated weights for policy 1, policy_version 2452 (0.0010) +[2023-10-14 05:06:17,692][100917] Updated weights for policy 1, policy_version 2462 (0.0007) +[2023-10-14 05:06:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 5046272. Throughput: 0: 1662.0, 1: 1649.5. Samples: 1267846. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:06:18,513][99942] Avg episode reward: [(0, '0.420'), (1, '0.500')] +[2023-10-14 05:06:20,118][100936] Updated weights for policy 0, policy_version 2470 (0.0008) +[2023-10-14 05:06:20,482][100936] Updated weights for policy 0, policy_version 2480 (0.0009) +[2023-10-14 05:06:20,856][100936] Updated weights for policy 0, policy_version 2490 (0.0007) +[2023-10-14 05:06:21,616][100917] Updated weights for policy 1, policy_version 2472 (0.0010) +[2023-10-14 05:06:21,985][100917] Updated weights for policy 1, policy_version 2482 (0.0008) +[2023-10-14 05:06:22,370][100917] Updated weights for policy 1, policy_version 2492 (0.0007) +[2023-10-14 05:06:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 5111808. Throughput: 0: 1667.3, 1: 1653.8. Samples: 1287616. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 05:06:23,513][99942] Avg episode reward: [(0, '0.410'), (1, '0.640')] +[2023-10-14 05:06:25,032][100936] Updated weights for policy 0, policy_version 2500 (0.0008) +[2023-10-14 05:06:25,410][100936] Updated weights for policy 0, policy_version 2510 (0.0007) +[2023-10-14 05:06:25,772][100936] Updated weights for policy 0, policy_version 2520 (0.0008) +[2023-10-14 05:06:26,433][100917] Updated weights for policy 1, policy_version 2502 (0.0008) +[2023-10-14 05:06:26,810][100917] Updated weights for policy 1, policy_version 2512 (0.0009) +[2023-10-14 05:06:27,178][100917] Updated weights for policy 1, policy_version 2522 (0.0009) +[2023-10-14 05:06:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 5177344. Throughput: 0: 1661.0, 1: 1648.6. Samples: 1297784. Policy #0 lag: (min: 18.0, avg: 19.0, max: 38.0) +[2023-10-14 05:06:28,513][99942] Avg episode reward: [(0, '0.410'), (1, '0.640')] +[2023-10-14 05:06:29,770][100936] Updated weights for policy 0, policy_version 2530 (0.0008) +[2023-10-14 05:06:30,141][100936] Updated weights for policy 0, policy_version 2540 (0.0007) +[2023-10-14 05:06:30,512][100936] Updated weights for policy 0, policy_version 2550 (0.0007) +[2023-10-14 05:06:30,892][100936] Updated weights for policy 0, policy_version 2560 (0.0009) +[2023-10-14 05:06:31,320][100917] Updated weights for policy 1, policy_version 2532 (0.0009) +[2023-10-14 05:06:31,697][100917] Updated weights for policy 1, policy_version 2542 (0.0007) +[2023-10-14 05:06:32,078][100917] Updated weights for policy 1, policy_version 2552 (0.0007) +[2023-10-14 05:06:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 5242880. Throughput: 0: 1667.5, 1: 1649.0. Samples: 1317410. Policy #0 lag: (min: 31.0, avg: 31.8, max: 52.0) +[2023-10-14 05:06:33,513][99942] Avg episode reward: [(0, '0.390'), (1, '0.640')] +[2023-10-14 05:06:35,047][100936] Updated weights for policy 0, policy_version 2570 (0.0010) +[2023-10-14 05:06:35,428][100936] Updated weights for policy 0, policy_version 2580 (0.0010) +[2023-10-14 05:06:35,794][100936] Updated weights for policy 0, policy_version 2590 (0.0008) +[2023-10-14 05:06:36,123][100917] Updated weights for policy 1, policy_version 2562 (0.0008) +[2023-10-14 05:06:36,501][100917] Updated weights for policy 1, policy_version 2572 (0.0009) +[2023-10-14 05:06:36,866][100917] Updated weights for policy 1, policy_version 2582 (0.0008) +[2023-10-14 05:06:37,237][100917] Updated weights for policy 1, policy_version 2592 (0.0008) +[2023-10-14 05:06:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 5308416. Throughput: 0: 1668.6, 1: 1652.3. Samples: 1337432. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:06:38,513][99942] Avg episode reward: [(0, '0.390'), (1, '0.640')] +[2023-10-14 05:06:39,816][100936] Updated weights for policy 0, policy_version 2600 (0.0008) +[2023-10-14 05:06:40,194][100936] Updated weights for policy 0, policy_version 2610 (0.0007) +[2023-10-14 05:06:40,566][100936] Updated weights for policy 0, policy_version 2620 (0.0008) +[2023-10-14 05:06:41,393][100917] Updated weights for policy 1, policy_version 2602 (0.0008) +[2023-10-14 05:06:41,772][100917] Updated weights for policy 1, policy_version 2612 (0.0007) +[2023-10-14 05:06:42,140][100917] Updated weights for policy 1, policy_version 2622 (0.0007) +[2023-10-14 05:06:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 5373952. Throughput: 0: 1666.7, 1: 1655.9. Samples: 1347740. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:06:43,513][99942] Avg episode reward: [(0, '0.390'), (1, '0.640')] +[2023-10-14 05:06:44,701][100936] Updated weights for policy 0, policy_version 2630 (0.0010) +[2023-10-14 05:06:45,081][100936] Updated weights for policy 0, policy_version 2640 (0.0009) +[2023-10-14 05:06:45,451][100936] Updated weights for policy 0, policy_version 2650 (0.0007) +[2023-10-14 05:06:46,270][100917] Updated weights for policy 1, policy_version 2632 (0.0010) +[2023-10-14 05:06:46,636][100917] Updated weights for policy 1, policy_version 2642 (0.0009) +[2023-10-14 05:06:47,018][100917] Updated weights for policy 1, policy_version 2652 (0.0008) +[2023-10-14 05:06:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 5439488. Throughput: 0: 1673.2, 1: 1650.9. Samples: 1367212. Policy #0 lag: (min: 31.0, avg: 36.7, max: 63.0) +[2023-10-14 05:06:48,513][99942] Avg episode reward: [(0, '0.390'), (1, '0.640')] +[2023-10-14 05:06:49,675][100936] Updated weights for policy 0, policy_version 2660 (0.0009) +[2023-10-14 05:06:50,044][100936] Updated weights for policy 0, policy_version 2670 (0.0009) +[2023-10-14 05:06:50,426][100936] Updated weights for policy 0, policy_version 2680 (0.0010) +[2023-10-14 05:06:51,194][100917] Updated weights for policy 1, policy_version 2662 (0.0009) +[2023-10-14 05:06:51,570][100917] Updated weights for policy 1, policy_version 2672 (0.0008) +[2023-10-14 05:06:51,942][100917] Updated weights for policy 1, policy_version 2682 (0.0007) +[2023-10-14 05:06:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 5505024. Throughput: 0: 1664.0, 1: 1659.2. Samples: 1387034. Policy #0 lag: (min: 31.0, avg: 31.8, max: 50.0) +[2023-10-14 05:06:53,512][99942] Avg episode reward: [(0, '0.390'), (1, '0.640')] +[2023-10-14 05:06:54,626][100936] Updated weights for policy 0, policy_version 2690 (0.0009) +[2023-10-14 05:06:55,009][100936] Updated weights for policy 0, policy_version 2700 (0.0007) +[2023-10-14 05:06:55,385][100936] Updated weights for policy 0, policy_version 2710 (0.0009) +[2023-10-14 05:06:55,751][100936] Updated weights for policy 0, policy_version 2720 (0.0008) +[2023-10-14 05:06:56,097][100917] Updated weights for policy 1, policy_version 2692 (0.0007) +[2023-10-14 05:06:56,470][100917] Updated weights for policy 1, policy_version 2702 (0.0008) +[2023-10-14 05:06:56,851][100917] Updated weights for policy 1, policy_version 2712 (0.0008) +[2023-10-14 05:06:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 5570560. Throughput: 0: 1663.5, 1: 1655.7. Samples: 1397136. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 05:06:58,513][99942] Avg episode reward: [(0, '0.390'), (1, '0.760')] +[2023-10-14 05:06:59,930][100936] Updated weights for policy 0, policy_version 2730 (0.0007) +[2023-10-14 05:07:00,300][100936] Updated weights for policy 0, policy_version 2740 (0.0008) +[2023-10-14 05:07:00,667][100936] Updated weights for policy 0, policy_version 2750 (0.0009) +[2023-10-14 05:07:00,889][100917] Updated weights for policy 1, policy_version 2722 (0.0008) +[2023-10-14 05:07:01,259][100917] Updated weights for policy 1, policy_version 2732 (0.0008) +[2023-10-14 05:07:01,631][100917] Updated weights for policy 1, policy_version 2742 (0.0011) +[2023-10-14 05:07:02,004][100917] Updated weights for policy 1, policy_version 2752 (0.0010) +[2023-10-14 05:07:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 5636096. Throughput: 0: 1659.7, 1: 1641.8. Samples: 1416412. Policy #0 lag: (min: 15.0, avg: 16.0, max: 37.0) +[2023-10-14 05:07:03,513][99942] Avg episode reward: [(0, '0.390'), (1, '0.770')] +[2023-10-14 05:07:04,770][100936] Updated weights for policy 0, policy_version 2760 (0.0009) +[2023-10-14 05:07:05,144][100936] Updated weights for policy 0, policy_version 2770 (0.0008) +[2023-10-14 05:07:05,506][100936] Updated weights for policy 0, policy_version 2780 (0.0008) +[2023-10-14 05:07:06,125][100917] Updated weights for policy 1, policy_version 2762 (0.0009) +[2023-10-14 05:07:06,507][100917] Updated weights for policy 1, policy_version 2772 (0.0009) +[2023-10-14 05:07:06,873][100917] Updated weights for policy 1, policy_version 2782 (0.0009) +[2023-10-14 05:07:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 5701632. Throughput: 0: 1656.4, 1: 1658.5. Samples: 1436790. Policy #0 lag: (min: 25.0, avg: 37.3, max: 57.0) +[2023-10-14 05:07:08,513][99942] Avg episode reward: [(0, '0.390'), (1, '0.770')] +[2023-10-14 05:07:09,677][100936] Updated weights for policy 0, policy_version 2790 (0.0009) +[2023-10-14 05:07:10,042][100936] Updated weights for policy 0, policy_version 2800 (0.0008) +[2023-10-14 05:07:10,416][100936] Updated weights for policy 0, policy_version 2810 (0.0007) +[2023-10-14 05:07:10,960][100917] Updated weights for policy 1, policy_version 2792 (0.0009) +[2023-10-14 05:07:11,331][100917] Updated weights for policy 1, policy_version 2802 (0.0008) +[2023-10-14 05:07:11,708][100917] Updated weights for policy 1, policy_version 2812 (0.0008) +[2023-10-14 05:07:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 5767168. Throughput: 0: 1658.1, 1: 1654.0. Samples: 1446830. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) +[2023-10-14 05:07:13,513][99942] Avg episode reward: [(0, '0.390'), (1, '0.780')] +[2023-10-14 05:07:14,577][100936] Updated weights for policy 0, policy_version 2820 (0.0008) +[2023-10-14 05:07:14,938][100936] Updated weights for policy 0, policy_version 2830 (0.0008) +[2023-10-14 05:07:15,313][100936] Updated weights for policy 0, policy_version 2840 (0.0009) +[2023-10-14 05:07:15,747][100917] Updated weights for policy 1, policy_version 2822 (0.0009) +[2023-10-14 05:07:16,127][100917] Updated weights for policy 1, policy_version 2832 (0.0009) +[2023-10-14 05:07:16,501][100917] Updated weights for policy 1, policy_version 2842 (0.0010) +[2023-10-14 05:07:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 5832704. Throughput: 0: 1655.2, 1: 1648.2. Samples: 1466060. Policy #0 lag: (min: 31.0, avg: 31.1, max: 37.0) +[2023-10-14 05:07:18,513][99942] Avg episode reward: [(0, '0.390'), (1, '0.780')] +[2023-10-14 05:07:19,513][100936] Updated weights for policy 0, policy_version 2850 (0.0009) +[2023-10-14 05:07:19,888][100936] Updated weights for policy 0, policy_version 2860 (0.0009) +[2023-10-14 05:07:20,261][100936] Updated weights for policy 0, policy_version 2870 (0.0008) +[2023-10-14 05:07:20,638][100936] Updated weights for policy 0, policy_version 2880 (0.0008) +[2023-10-14 05:07:20,734][100917] Updated weights for policy 1, policy_version 2852 (0.0008) +[2023-10-14 05:07:21,118][100917] Updated weights for policy 1, policy_version 2862 (0.0007) +[2023-10-14 05:07:21,486][100917] Updated weights for policy 1, policy_version 2872 (0.0009) +[2023-10-14 05:07:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 5898240. Throughput: 0: 1652.9, 1: 1659.0. Samples: 1486468. Policy #0 lag: (min: 27.0, avg: 46.9, max: 48.0) +[2023-10-14 05:07:23,513][99942] Avg episode reward: [(0, '0.390'), (1, '0.790')] +[2023-10-14 05:07:24,729][100936] Updated weights for policy 0, policy_version 2890 (0.0007) +[2023-10-14 05:07:25,106][100936] Updated weights for policy 0, policy_version 2900 (0.0007) +[2023-10-14 05:07:25,363][100917] Updated weights for policy 1, policy_version 2882 (0.0009) +[2023-10-14 05:07:25,480][100936] Updated weights for policy 0, policy_version 2910 (0.0008) +[2023-10-14 05:07:25,739][100917] Updated weights for policy 1, policy_version 2892 (0.0008) +[2023-10-14 05:07:26,107][100917] Updated weights for policy 1, policy_version 2902 (0.0008) +[2023-10-14 05:07:26,480][100917] Updated weights for policy 1, policy_version 2912 (0.0008) +[2023-10-14 05:07:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 5963776. Throughput: 0: 1654.0, 1: 1645.0. Samples: 1496194. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 05:07:28,513][99942] Avg episode reward: [(0, '0.390'), (1, '0.790')] +[2023-10-14 05:07:29,709][100936] Updated weights for policy 0, policy_version 2920 (0.0008) +[2023-10-14 05:07:30,077][100936] Updated weights for policy 0, policy_version 2930 (0.0008) +[2023-10-14 05:07:30,454][100936] Updated weights for policy 0, policy_version 2940 (0.0008) +[2023-10-14 05:07:30,738][100917] Updated weights for policy 1, policy_version 2922 (0.0008) +[2023-10-14 05:07:31,113][100917] Updated weights for policy 1, policy_version 2932 (0.0008) +[2023-10-14 05:07:31,483][100917] Updated weights for policy 1, policy_version 2942 (0.0007) +[2023-10-14 05:07:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 6029312. Throughput: 0: 1645.5, 1: 1653.5. Samples: 1515664. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-10-14 05:07:33,513][99942] Avg episode reward: [(0, '0.390'), (1, '0.830')] +[2023-10-14 05:07:34,637][100936] Updated weights for policy 0, policy_version 2950 (0.0010) +[2023-10-14 05:07:35,010][100936] Updated weights for policy 0, policy_version 2960 (0.0010) +[2023-10-14 05:07:35,386][100936] Updated weights for policy 0, policy_version 2970 (0.0010) +[2023-10-14 05:07:35,562][100917] Updated weights for policy 1, policy_version 2952 (0.0007) +[2023-10-14 05:07:35,928][100917] Updated weights for policy 1, policy_version 2962 (0.0010) +[2023-10-14 05:07:36,301][100917] Updated weights for policy 1, policy_version 2972 (0.0010) +[2023-10-14 05:07:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 6094848. Throughput: 0: 1646.3, 1: 1660.4. Samples: 1535840. Policy #0 lag: (min: 31.0, avg: 35.2, max: 63.0) +[2023-10-14 05:07:38,513][99942] Avg episode reward: [(0, '0.390'), (1, '0.830')] +[2023-10-14 05:07:38,524][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000002976_3047424.pth... +[2023-10-14 05:07:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000002976_3047424.pth... +[2023-10-14 05:07:38,556][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000001440_1474560.pth +[2023-10-14 05:07:38,563][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000001440_1474560.pth +[2023-10-14 05:07:39,471][100936] Updated weights for policy 0, policy_version 2980 (0.0007) +[2023-10-14 05:07:39,835][100936] Updated weights for policy 0, policy_version 2990 (0.0010) +[2023-10-14 05:07:40,205][100936] Updated weights for policy 0, policy_version 3000 (0.0010) +[2023-10-14 05:07:40,515][100917] Updated weights for policy 1, policy_version 2982 (0.0009) +[2023-10-14 05:07:40,901][100917] Updated weights for policy 1, policy_version 2992 (0.0010) +[2023-10-14 05:07:41,265][100917] Updated weights for policy 1, policy_version 3002 (0.0010) +[2023-10-14 05:07:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 6160384. Throughput: 0: 1646.3, 1: 1642.6. Samples: 1545136. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:07:43,512][99942] Avg episode reward: [(0, '0.390'), (1, '0.960')] +[2023-10-14 05:07:44,279][100936] Updated weights for policy 0, policy_version 3010 (0.0009) +[2023-10-14 05:07:44,656][100936] Updated weights for policy 0, policy_version 3020 (0.0009) +[2023-10-14 05:07:45,012][100936] Updated weights for policy 0, policy_version 3030 (0.0010) +[2023-10-14 05:07:45,364][100917] Updated weights for policy 1, policy_version 3012 (0.0010) +[2023-10-14 05:07:45,387][100936] Updated weights for policy 0, policy_version 3040 (0.0008) +[2023-10-14 05:07:45,730][100917] Updated weights for policy 1, policy_version 3022 (0.0009) +[2023-10-14 05:07:46,102][100917] Updated weights for policy 1, policy_version 3032 (0.0007) +[2023-10-14 05:07:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 6225920. Throughput: 0: 1643.1, 1: 1653.7. Samples: 1564768. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:07:48,513][99942] Avg episode reward: [(0, '0.390'), (1, '0.980')] +[2023-10-14 05:07:49,666][100936] Updated weights for policy 0, policy_version 3050 (0.0010) +[2023-10-14 05:07:50,044][100936] Updated weights for policy 0, policy_version 3060 (0.0009) +[2023-10-14 05:07:50,299][100917] Updated weights for policy 1, policy_version 3042 (0.0008) +[2023-10-14 05:07:50,414][100936] Updated weights for policy 0, policy_version 3070 (0.0007) +[2023-10-14 05:07:50,681][100917] Updated weights for policy 1, policy_version 3052 (0.0007) +[2023-10-14 05:07:51,047][100917] Updated weights for policy 1, policy_version 3062 (0.0008) +[2023-10-14 05:07:51,420][100917] Updated weights for policy 1, policy_version 3072 (0.0009) +[2023-10-14 05:07:53,512][99942] Fps is (10 sec: 13106.6, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 6291456. Throughput: 0: 1640.5, 1: 1652.8. Samples: 1584992. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:07:53,514][99942] Avg episode reward: [(0, '0.390'), (1, '0.980')] +[2023-10-14 05:07:54,603][100936] Updated weights for policy 0, policy_version 3080 (0.0009) +[2023-10-14 05:07:54,979][100936] Updated weights for policy 0, policy_version 3090 (0.0009) +[2023-10-14 05:07:55,346][100936] Updated weights for policy 0, policy_version 3100 (0.0007) +[2023-10-14 05:07:55,544][100917] Updated weights for policy 1, policy_version 3082 (0.0008) +[2023-10-14 05:07:55,921][100917] Updated weights for policy 1, policy_version 3092 (0.0007) +[2023-10-14 05:07:56,291][100917] Updated weights for policy 1, policy_version 3102 (0.0008) +[2023-10-14 05:07:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 6356992. Throughput: 0: 1638.6, 1: 1638.7. Samples: 1594310. Policy #0 lag: (min: 3.0, avg: 3.7, max: 21.0) +[2023-10-14 05:07:58,513][99942] Avg episode reward: [(0, '0.390'), (1, '0.980')] +[2023-10-14 05:07:59,606][100936] Updated weights for policy 0, policy_version 3110 (0.0008) +[2023-10-14 05:07:59,983][100936] Updated weights for policy 0, policy_version 3120 (0.0008) +[2023-10-14 05:08:00,346][100917] Updated weights for policy 1, policy_version 3112 (0.0009) +[2023-10-14 05:08:00,359][100936] Updated weights for policy 0, policy_version 3130 (0.0008) +[2023-10-14 05:08:00,718][100917] Updated weights for policy 1, policy_version 3122 (0.0010) +[2023-10-14 05:08:01,094][100917] Updated weights for policy 1, policy_version 3132 (0.0009) +[2023-10-14 05:08:03,512][99942] Fps is (10 sec: 13107.7, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 6422528. Throughput: 0: 1640.8, 1: 1651.4. Samples: 1614208. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:08:03,513][99942] Avg episode reward: [(0, '0.390'), (1, '0.980')] +[2023-10-14 05:08:04,398][100936] Updated weights for policy 0, policy_version 3140 (0.0010) +[2023-10-14 05:08:04,762][100936] Updated weights for policy 0, policy_version 3150 (0.0009) +[2023-10-14 05:08:05,134][100936] Updated weights for policy 0, policy_version 3160 (0.0007) +[2023-10-14 05:08:05,421][100917] Updated weights for policy 1, policy_version 3142 (0.0009) +[2023-10-14 05:08:05,799][100917] Updated weights for policy 1, policy_version 3152 (0.0009) +[2023-10-14 05:08:06,160][100917] Updated weights for policy 1, policy_version 3162 (0.0007) +[2023-10-14 05:08:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 6488064. Throughput: 0: 1646.8, 1: 1645.0. Samples: 1634598. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:08:08,512][99942] Avg episode reward: [(0, '0.390'), (1, '0.980')] +[2023-10-14 05:08:09,184][100936] Updated weights for policy 0, policy_version 3170 (0.0009) +[2023-10-14 05:08:09,560][100936] Updated weights for policy 0, policy_version 3180 (0.0007) +[2023-10-14 05:08:09,929][100936] Updated weights for policy 0, policy_version 3190 (0.0007) +[2023-10-14 05:08:10,301][100936] Updated weights for policy 0, policy_version 3200 (0.0007) +[2023-10-14 05:08:10,430][100917] Updated weights for policy 1, policy_version 3172 (0.0008) +[2023-10-14 05:08:10,811][100917] Updated weights for policy 1, policy_version 3182 (0.0008) +[2023-10-14 05:08:11,188][100917] Updated weights for policy 1, policy_version 3192 (0.0009) +[2023-10-14 05:08:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 6553600. Throughput: 0: 1645.8, 1: 1641.4. Samples: 1644116. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 05:08:13,513][99942] Avg episode reward: [(0, '0.380'), (1, '0.980')] +[2023-10-14 05:08:14,483][100936] Updated weights for policy 0, policy_version 3210 (0.0011) +[2023-10-14 05:08:14,852][100936] Updated weights for policy 0, policy_version 3220 (0.0012) +[2023-10-14 05:08:15,224][100936] Updated weights for policy 0, policy_version 3230 (0.0011) +[2023-10-14 05:08:15,228][100917] Updated weights for policy 1, policy_version 3202 (0.0010) +[2023-10-14 05:08:15,601][100917] Updated weights for policy 1, policy_version 3212 (0.0010) +[2023-10-14 05:08:15,978][100917] Updated weights for policy 1, policy_version 3222 (0.0010) +[2023-10-14 05:08:16,342][100917] Updated weights for policy 1, policy_version 3232 (0.0010) +[2023-10-14 05:08:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 6619136. Throughput: 0: 1649.0, 1: 1644.6. Samples: 1663876. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:08:18,513][99942] Avg episode reward: [(0, '0.590'), (1, '0.980')] +[2023-10-14 05:08:19,487][100936] Updated weights for policy 0, policy_version 3240 (0.0008) +[2023-10-14 05:08:19,860][100936] Updated weights for policy 0, policy_version 3250 (0.0008) +[2023-10-14 05:08:20,237][100936] Updated weights for policy 0, policy_version 3260 (0.0008) +[2023-10-14 05:08:20,525][100917] Updated weights for policy 1, policy_version 3242 (0.0008) +[2023-10-14 05:08:20,897][100917] Updated weights for policy 1, policy_version 3252 (0.0009) +[2023-10-14 05:08:21,266][100917] Updated weights for policy 1, policy_version 3262 (0.0008) +[2023-10-14 05:08:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 6684672. Throughput: 0: 1651.0, 1: 1640.3. Samples: 1683946. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) +[2023-10-14 05:08:23,513][99942] Avg episode reward: [(0, '0.590'), (1, '0.980')] +[2023-10-14 05:08:24,339][100936] Updated weights for policy 0, policy_version 3270 (0.0007) +[2023-10-14 05:08:24,710][100936] Updated weights for policy 0, policy_version 3280 (0.0007) +[2023-10-14 05:08:25,077][100936] Updated weights for policy 0, policy_version 3290 (0.0011) +[2023-10-14 05:08:25,536][100917] Updated weights for policy 1, policy_version 3272 (0.0010) +[2023-10-14 05:08:25,907][100917] Updated weights for policy 1, policy_version 3282 (0.0008) +[2023-10-14 05:08:26,296][100917] Updated weights for policy 1, policy_version 3292 (0.0007) +[2023-10-14 05:08:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 6750208. Throughput: 0: 1650.9, 1: 1641.3. Samples: 1693286. Policy #0 lag: (min: 31.0, avg: 32.4, max: 58.0) +[2023-10-14 05:08:28,513][99942] Avg episode reward: [(0, '0.700'), (1, '0.980')] +[2023-10-14 05:08:29,203][100936] Updated weights for policy 0, policy_version 3300 (0.0009) +[2023-10-14 05:08:29,577][100936] Updated weights for policy 0, policy_version 3310 (0.0007) +[2023-10-14 05:08:29,951][100936] Updated weights for policy 0, policy_version 3320 (0.0008) +[2023-10-14 05:08:30,610][100917] Updated weights for policy 1, policy_version 3302 (0.0007) +[2023-10-14 05:08:30,990][100917] Updated weights for policy 1, policy_version 3312 (0.0007) +[2023-10-14 05:08:31,359][100917] Updated weights for policy 1, policy_version 3322 (0.0008) +[2023-10-14 05:08:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 6815744. Throughput: 0: 1652.9, 1: 1641.7. Samples: 1713024. Policy #0 lag: (min: 31.0, avg: 32.4, max: 58.0) +[2023-10-14 05:08:33,513][99942] Avg episode reward: [(0, '0.700'), (1, '0.980')] +[2023-10-14 05:08:34,189][100936] Updated weights for policy 0, policy_version 3330 (0.0008) +[2023-10-14 05:08:34,602][100936] Updated weights for policy 0, policy_version 3340 (0.0009) +[2023-10-14 05:08:34,973][100936] Updated weights for policy 0, policy_version 3350 (0.0009) +[2023-10-14 05:08:35,352][100936] Updated weights for policy 0, policy_version 3360 (0.0008) +[2023-10-14 05:08:35,414][100917] Updated weights for policy 1, policy_version 3332 (0.0008) +[2023-10-14 05:08:35,787][100917] Updated weights for policy 1, policy_version 3342 (0.0008) +[2023-10-14 05:08:36,155][100917] Updated weights for policy 1, policy_version 3352 (0.0009) +[2023-10-14 05:08:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 6881280. Throughput: 0: 1652.5, 1: 1641.8. Samples: 1733236. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:08:38,513][99942] Avg episode reward: [(0, '0.900'), (1, '0.730')] +[2023-10-14 05:08:39,583][100936] Updated weights for policy 0, policy_version 3370 (0.0008) +[2023-10-14 05:08:39,955][100936] Updated weights for policy 0, policy_version 3380 (0.0009) +[2023-10-14 05:08:40,326][100936] Updated weights for policy 0, policy_version 3390 (0.0009) +[2023-10-14 05:08:40,344][100917] Updated weights for policy 1, policy_version 3362 (0.0010) +[2023-10-14 05:08:40,705][100917] Updated weights for policy 1, policy_version 3372 (0.0008) +[2023-10-14 05:08:41,087][100917] Updated weights for policy 1, policy_version 3382 (0.0009) +[2023-10-14 05:08:41,455][100917] Updated weights for policy 1, policy_version 3392 (0.0008) +[2023-10-14 05:08:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 6946816. Throughput: 0: 1648.7, 1: 1645.6. Samples: 1742554. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:08:43,513][99942] Avg episode reward: [(0, '0.960'), (1, '0.740')] +[2023-10-14 05:08:44,425][100936] Updated weights for policy 0, policy_version 3400 (0.0007) +[2023-10-14 05:08:44,798][100936] Updated weights for policy 0, policy_version 3410 (0.0007) +[2023-10-14 05:08:45,168][100936] Updated weights for policy 0, policy_version 3420 (0.0007) +[2023-10-14 05:08:45,623][100917] Updated weights for policy 1, policy_version 3402 (0.0010) +[2023-10-14 05:08:46,001][100917] Updated weights for policy 1, policy_version 3412 (0.0008) +[2023-10-14 05:08:46,365][100917] Updated weights for policy 1, policy_version 3422 (0.0009) +[2023-10-14 05:08:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 7012352. Throughput: 0: 1645.6, 1: 1639.0. Samples: 1762012. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 05:08:48,513][99942] Avg episode reward: [(0, '0.960'), (1, '0.740')] +[2023-10-14 05:08:49,265][100936] Updated weights for policy 0, policy_version 3430 (0.0008) +[2023-10-14 05:08:49,634][100936] Updated weights for policy 0, policy_version 3440 (0.0010) +[2023-10-14 05:08:50,008][100936] Updated weights for policy 0, policy_version 3450 (0.0007) +[2023-10-14 05:08:50,403][100917] Updated weights for policy 1, policy_version 3432 (0.0008) +[2023-10-14 05:08:50,768][100917] Updated weights for policy 1, policy_version 3442 (0.0007) +[2023-10-14 05:08:51,138][100917] Updated weights for policy 1, policy_version 3452 (0.0007) +[2023-10-14 05:08:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 7077888. Throughput: 0: 1641.5, 1: 1645.9. Samples: 1782532. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 05:08:53,513][99942] Avg episode reward: [(0, '0.960'), (1, '0.740')] +[2023-10-14 05:08:54,157][100936] Updated weights for policy 0, policy_version 3460 (0.0008) +[2023-10-14 05:08:54,531][100936] Updated weights for policy 0, policy_version 3470 (0.0010) +[2023-10-14 05:08:54,903][100936] Updated weights for policy 0, policy_version 3480 (0.0010) +[2023-10-14 05:08:55,444][100917] Updated weights for policy 1, policy_version 3462 (0.0009) +[2023-10-14 05:08:55,815][100917] Updated weights for policy 1, policy_version 3472 (0.0007) +[2023-10-14 05:08:56,192][100917] Updated weights for policy 1, policy_version 3482 (0.0010) +[2023-10-14 05:08:58,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 7143424. Throughput: 0: 1639.9, 1: 1641.2. Samples: 1791762. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 05:08:58,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.680')] +[2023-10-14 05:08:59,134][100936] Updated weights for policy 0, policy_version 3490 (0.0009) +[2023-10-14 05:08:59,509][100936] Updated weights for policy 0, policy_version 3500 (0.0007) +[2023-10-14 05:08:59,885][100936] Updated weights for policy 0, policy_version 3510 (0.0007) +[2023-10-14 05:09:00,253][100936] Updated weights for policy 0, policy_version 3520 (0.0008) +[2023-10-14 05:09:00,505][100917] Updated weights for policy 1, policy_version 3492 (0.0010) +[2023-10-14 05:09:00,885][100917] Updated weights for policy 1, policy_version 3502 (0.0010) +[2023-10-14 05:09:01,261][100917] Updated weights for policy 1, policy_version 3512 (0.0009) +[2023-10-14 05:09:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 7208960. Throughput: 0: 1640.7, 1: 1635.4. Samples: 1811300. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:09:03,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.680')] +[2023-10-14 05:09:04,417][100936] Updated weights for policy 0, policy_version 3530 (0.0008) +[2023-10-14 05:09:04,792][100936] Updated weights for policy 0, policy_version 3540 (0.0008) +[2023-10-14 05:09:05,162][100936] Updated weights for policy 0, policy_version 3550 (0.0008) +[2023-10-14 05:09:05,388][100917] Updated weights for policy 1, policy_version 3522 (0.0008) +[2023-10-14 05:09:05,766][100917] Updated weights for policy 1, policy_version 3532 (0.0009) +[2023-10-14 05:09:06,136][100917] Updated weights for policy 1, policy_version 3542 (0.0008) +[2023-10-14 05:09:06,512][100917] Updated weights for policy 1, policy_version 3552 (0.0008) +[2023-10-14 05:09:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 7274496. Throughput: 0: 1646.8, 1: 1635.1. Samples: 1831630. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:09:08,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.680')] +[2023-10-14 05:09:09,326][100936] Updated weights for policy 0, policy_version 3560 (0.0008) +[2023-10-14 05:09:09,705][100936] Updated weights for policy 0, policy_version 3570 (0.0009) +[2023-10-14 05:09:10,081][100936] Updated weights for policy 0, policy_version 3580 (0.0009) +[2023-10-14 05:09:10,708][100917] Updated weights for policy 1, policy_version 3562 (0.0009) +[2023-10-14 05:09:11,076][100917] Updated weights for policy 1, policy_version 3572 (0.0008) +[2023-10-14 05:09:11,454][100917] Updated weights for policy 1, policy_version 3582 (0.0007) +[2023-10-14 05:09:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 7340032. Throughput: 0: 1648.8, 1: 1639.7. Samples: 1841266. Policy #0 lag: (min: 31.0, avg: 33.6, max: 63.0) +[2023-10-14 05:09:13,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.680')] +[2023-10-14 05:09:14,207][100936] Updated weights for policy 0, policy_version 3590 (0.0009) +[2023-10-14 05:09:14,586][100936] Updated weights for policy 0, policy_version 3600 (0.0008) +[2023-10-14 05:09:14,958][100936] Updated weights for policy 0, policy_version 3610 (0.0007) +[2023-10-14 05:09:15,550][100917] Updated weights for policy 1, policy_version 3592 (0.0010) +[2023-10-14 05:09:15,924][100917] Updated weights for policy 1, policy_version 3602 (0.0010) +[2023-10-14 05:09:16,292][100917] Updated weights for policy 1, policy_version 3612 (0.0011) +[2023-10-14 05:09:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 7405568. Throughput: 0: 1646.7, 1: 1640.8. Samples: 1860958. Policy #0 lag: (min: 31.0, avg: 33.6, max: 63.0) +[2023-10-14 05:09:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.680')] +[2023-10-14 05:09:19,136][100936] Updated weights for policy 0, policy_version 3620 (0.0009) +[2023-10-14 05:09:19,536][100936] Updated weights for policy 0, policy_version 3630 (0.0009) +[2023-10-14 05:09:19,908][100936] Updated weights for policy 0, policy_version 3640 (0.0010) +[2023-10-14 05:09:20,252][100917] Updated weights for policy 1, policy_version 3622 (0.0010) +[2023-10-14 05:09:20,621][100917] Updated weights for policy 1, policy_version 3632 (0.0010) +[2023-10-14 05:09:20,996][100917] Updated weights for policy 1, policy_version 3642 (0.0011) +[2023-10-14 05:09:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 7471104. Throughput: 0: 1647.8, 1: 1639.8. Samples: 1881178. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:09:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:09:24,035][100936] Updated weights for policy 0, policy_version 3650 (0.0010) +[2023-10-14 05:09:24,406][100936] Updated weights for policy 0, policy_version 3660 (0.0011) +[2023-10-14 05:09:24,787][100936] Updated weights for policy 0, policy_version 3670 (0.0011) +[2023-10-14 05:09:25,150][100936] Updated weights for policy 0, policy_version 3680 (0.0009) +[2023-10-14 05:09:25,340][100917] Updated weights for policy 1, policy_version 3652 (0.0009) +[2023-10-14 05:09:25,711][100917] Updated weights for policy 1, policy_version 3662 (0.0007) +[2023-10-14 05:09:26,072][100917] Updated weights for policy 1, policy_version 3672 (0.0010) +[2023-10-14 05:09:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 7536640. Throughput: 0: 1652.1, 1: 1638.0. Samples: 1890608. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:09:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.670')] +[2023-10-14 05:09:29,332][100936] Updated weights for policy 0, policy_version 3690 (0.0007) +[2023-10-14 05:09:29,699][100936] Updated weights for policy 0, policy_version 3700 (0.0010) +[2023-10-14 05:09:30,075][100936] Updated weights for policy 0, policy_version 3710 (0.0008) +[2023-10-14 05:09:30,173][100917] Updated weights for policy 1, policy_version 3682 (0.0011) +[2023-10-14 05:09:30,546][100917] Updated weights for policy 1, policy_version 3692 (0.0007) +[2023-10-14 05:09:30,934][100917] Updated weights for policy 1, policy_version 3702 (0.0007) +[2023-10-14 05:09:31,309][100917] Updated weights for policy 1, policy_version 3712 (0.0008) +[2023-10-14 05:09:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 7602176. Throughput: 0: 1648.3, 1: 1645.0. Samples: 1910208. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:09:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.670')] +[2023-10-14 05:09:34,194][100936] Updated weights for policy 0, policy_version 3720 (0.0008) +[2023-10-14 05:09:34,568][100936] Updated weights for policy 0, policy_version 3730 (0.0007) +[2023-10-14 05:09:34,935][100936] Updated weights for policy 0, policy_version 3740 (0.0008) +[2023-10-14 05:09:35,393][100917] Updated weights for policy 1, policy_version 3722 (0.0007) +[2023-10-14 05:09:35,764][100917] Updated weights for policy 1, policy_version 3732 (0.0010) +[2023-10-14 05:09:36,138][100917] Updated weights for policy 1, policy_version 3742 (0.0009) +[2023-10-14 05:09:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 7667712. Throughput: 0: 1649.0, 1: 1642.7. Samples: 1930660. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:09:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.670')] +[2023-10-14 05:09:38,521][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000003744_3833856.pth... +[2023-10-14 05:09:38,521][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000003744_3833856.pth... +[2023-10-14 05:09:38,553][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000002208_2260992.pth +[2023-10-14 05:09:38,557][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000002208_2260992.pth +[2023-10-14 05:09:39,107][100936] Updated weights for policy 0, policy_version 3750 (0.0009) +[2023-10-14 05:09:39,487][100936] Updated weights for policy 0, policy_version 3760 (0.0010) +[2023-10-14 05:09:39,863][100936] Updated weights for policy 0, policy_version 3770 (0.0010) +[2023-10-14 05:09:40,232][100917] Updated weights for policy 1, policy_version 3752 (0.0007) +[2023-10-14 05:09:40,600][100917] Updated weights for policy 1, policy_version 3762 (0.0007) +[2023-10-14 05:09:40,971][100917] Updated weights for policy 1, policy_version 3772 (0.0009) +[2023-10-14 05:09:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 7733248. Throughput: 0: 1650.7, 1: 1642.0. Samples: 1939930. Policy #0 lag: (min: 33.0, avg: 47.5, max: 48.0) +[2023-10-14 05:09:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.670')] +[2023-10-14 05:09:43,743][100936] Updated weights for policy 0, policy_version 3780 (0.0009) +[2023-10-14 05:09:44,113][100936] Updated weights for policy 0, policy_version 3790 (0.0008) +[2023-10-14 05:09:44,479][100936] Updated weights for policy 0, policy_version 3800 (0.0007) +[2023-10-14 05:09:45,345][100917] Updated weights for policy 1, policy_version 3782 (0.0010) +[2023-10-14 05:09:45,731][100917] Updated weights for policy 1, policy_version 3792 (0.0008) +[2023-10-14 05:09:46,094][100917] Updated weights for policy 1, policy_version 3802 (0.0008) +[2023-10-14 05:09:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 7798784. Throughput: 0: 1656.2, 1: 1653.3. Samples: 1960226. Policy #0 lag: (min: 33.0, avg: 47.5, max: 48.0) +[2023-10-14 05:09:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.670')] +[2023-10-14 05:09:48,748][100936] Updated weights for policy 0, policy_version 3810 (0.0009) +[2023-10-14 05:09:49,113][100936] Updated weights for policy 0, policy_version 3820 (0.0008) +[2023-10-14 05:09:49,479][100936] Updated weights for policy 0, policy_version 3830 (0.0011) +[2023-10-14 05:09:49,848][100936] Updated weights for policy 0, policy_version 3840 (0.0008) +[2023-10-14 05:09:50,220][100917] Updated weights for policy 1, policy_version 3812 (0.0009) +[2023-10-14 05:09:50,596][100917] Updated weights for policy 1, policy_version 3822 (0.0007) +[2023-10-14 05:09:50,968][100917] Updated weights for policy 1, policy_version 3832 (0.0007) +[2023-10-14 05:09:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 7864320. Throughput: 0: 1652.0, 1: 1657.6. Samples: 1980558. Policy #0 lag: (min: 29.0, avg: 40.1, max: 61.0) +[2023-10-14 05:09:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:09:54,026][100936] Updated weights for policy 0, policy_version 3850 (0.0008) +[2023-10-14 05:09:54,390][100936] Updated weights for policy 0, policy_version 3860 (0.0010) +[2023-10-14 05:09:54,767][100936] Updated weights for policy 0, policy_version 3870 (0.0008) +[2023-10-14 05:09:55,230][100917] Updated weights for policy 1, policy_version 3842 (0.0007) +[2023-10-14 05:09:55,617][100917] Updated weights for policy 1, policy_version 3852 (0.0008) +[2023-10-14 05:09:56,002][100917] Updated weights for policy 1, policy_version 3862 (0.0007) +[2023-10-14 05:09:56,371][100917] Updated weights for policy 1, policy_version 3872 (0.0007) +[2023-10-14 05:09:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 7929856. Throughput: 0: 1653.2, 1: 1649.9. Samples: 1989904. Policy #0 lag: (min: 29.0, avg: 40.1, max: 61.0) +[2023-10-14 05:09:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:09:58,765][100936] Updated weights for policy 0, policy_version 3880 (0.0008) +[2023-10-14 05:09:59,139][100936] Updated weights for policy 0, policy_version 3890 (0.0009) +[2023-10-14 05:09:59,519][100936] Updated weights for policy 0, policy_version 3900 (0.0008) +[2023-10-14 05:10:00,563][100917] Updated weights for policy 1, policy_version 3882 (0.0011) +[2023-10-14 05:10:00,936][100917] Updated weights for policy 1, policy_version 3892 (0.0008) +[2023-10-14 05:10:01,316][100917] Updated weights for policy 1, policy_version 3902 (0.0008) +[2023-10-14 05:10:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 7995392. Throughput: 0: 1656.1, 1: 1649.5. Samples: 2009710. Policy #0 lag: (min: 31.0, avg: 31.1, max: 36.0) +[2023-10-14 05:10:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:10:03,572][100936] Updated weights for policy 0, policy_version 3910 (0.0009) +[2023-10-14 05:10:03,944][100936] Updated weights for policy 0, policy_version 3920 (0.0008) +[2023-10-14 05:10:04,312][100936] Updated weights for policy 0, policy_version 3930 (0.0007) +[2023-10-14 05:10:05,403][100917] Updated weights for policy 1, policy_version 3912 (0.0007) +[2023-10-14 05:10:05,781][100917] Updated weights for policy 1, policy_version 3922 (0.0010) +[2023-10-14 05:10:06,161][100917] Updated weights for policy 1, policy_version 3932 (0.0009) +[2023-10-14 05:10:08,449][100936] Updated weights for policy 0, policy_version 3940 (0.0007) +[2023-10-14 05:10:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 8060928. Throughput: 0: 1660.8, 1: 1647.7. Samples: 2030058. Policy #0 lag: (min: 31.0, avg: 31.1, max: 36.0) +[2023-10-14 05:10:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:10:08,824][100936] Updated weights for policy 0, policy_version 3950 (0.0008) +[2023-10-14 05:10:09,208][100936] Updated weights for policy 0, policy_version 3960 (0.0008) +[2023-10-14 05:10:10,281][100917] Updated weights for policy 1, policy_version 3942 (0.0010) +[2023-10-14 05:10:10,659][100917] Updated weights for policy 1, policy_version 3952 (0.0010) +[2023-10-14 05:10:11,037][100917] Updated weights for policy 1, policy_version 3962 (0.0009) +[2023-10-14 05:10:13,419][100936] Updated weights for policy 0, policy_version 3970 (0.0007) +[2023-10-14 05:10:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 8126464. Throughput: 0: 1660.6, 1: 1643.9. Samples: 2039310. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 05:10:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:10:13,793][100936] Updated weights for policy 0, policy_version 3980 (0.0008) +[2023-10-14 05:10:14,156][100936] Updated weights for policy 0, policy_version 3990 (0.0010) +[2023-10-14 05:10:14,537][100936] Updated weights for policy 0, policy_version 4000 (0.0008) +[2023-10-14 05:10:15,088][100917] Updated weights for policy 1, policy_version 3972 (0.0009) +[2023-10-14 05:10:15,466][100917] Updated weights for policy 1, policy_version 3982 (0.0008) +[2023-10-14 05:10:15,831][100917] Updated weights for policy 1, policy_version 3992 (0.0009) +[2023-10-14 05:10:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 8192000. Throughput: 0: 1666.2, 1: 1648.3. Samples: 2059358. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 05:10:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:10:18,604][100936] Updated weights for policy 0, policy_version 4010 (0.0010) +[2023-10-14 05:10:18,980][100936] Updated weights for policy 0, policy_version 4020 (0.0010) +[2023-10-14 05:10:19,351][100936] Updated weights for policy 0, policy_version 4030 (0.0011) +[2023-10-14 05:10:20,017][100917] Updated weights for policy 1, policy_version 4002 (0.0010) +[2023-10-14 05:10:20,384][100917] Updated weights for policy 1, policy_version 4012 (0.0010) +[2023-10-14 05:10:20,766][100917] Updated weights for policy 1, policy_version 4022 (0.0010) +[2023-10-14 05:10:21,147][100917] Updated weights for policy 1, policy_version 4032 (0.0008) +[2023-10-14 05:10:23,403][100936] Updated weights for policy 0, policy_version 4040 (0.0008) +[2023-10-14 05:10:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 8257536. Throughput: 0: 1659.7, 1: 1649.7. Samples: 2079584. Policy #0 lag: (min: 31.0, avg: 49.8, max: 63.0) +[2023-10-14 05:10:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:10:23,788][100936] Updated weights for policy 0, policy_version 4050 (0.0008) +[2023-10-14 05:10:24,168][100936] Updated weights for policy 0, policy_version 4060 (0.0008) +[2023-10-14 05:10:25,258][100917] Updated weights for policy 1, policy_version 4042 (0.0010) +[2023-10-14 05:10:25,628][100917] Updated weights for policy 1, policy_version 4052 (0.0010) +[2023-10-14 05:10:26,003][100917] Updated weights for policy 1, policy_version 4062 (0.0010) +[2023-10-14 05:10:28,416][100936] Updated weights for policy 0, policy_version 4070 (0.0009) +[2023-10-14 05:10:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 8323072. Throughput: 0: 1665.6, 1: 1645.8. Samples: 2088942. Policy #0 lag: (min: 31.0, avg: 49.8, max: 63.0) +[2023-10-14 05:10:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:10:28,782][100936] Updated weights for policy 0, policy_version 4080 (0.0007) +[2023-10-14 05:10:29,151][100936] Updated weights for policy 0, policy_version 4090 (0.0007) +[2023-10-14 05:10:30,215][100917] Updated weights for policy 1, policy_version 4072 (0.0011) +[2023-10-14 05:10:30,593][100917] Updated weights for policy 1, policy_version 4082 (0.0008) +[2023-10-14 05:10:30,966][100917] Updated weights for policy 1, policy_version 4092 (0.0007) +[2023-10-14 05:10:33,271][100936] Updated weights for policy 0, policy_version 4100 (0.0008) +[2023-10-14 05:10:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 8388608. Throughput: 0: 1661.7, 1: 1648.3. Samples: 2109178. Policy #0 lag: (min: 2.0, avg: 5.8, max: 34.0) +[2023-10-14 05:10:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:10:33,643][100936] Updated weights for policy 0, policy_version 4110 (0.0007) +[2023-10-14 05:10:34,011][100936] Updated weights for policy 0, policy_version 4120 (0.0008) +[2023-10-14 05:10:35,087][100917] Updated weights for policy 1, policy_version 4102 (0.0009) +[2023-10-14 05:10:35,459][100917] Updated weights for policy 1, policy_version 4112 (0.0010) +[2023-10-14 05:10:35,836][100917] Updated weights for policy 1, policy_version 4122 (0.0010) +[2023-10-14 05:10:38,138][100936] Updated weights for policy 0, policy_version 4130 (0.0008) +[2023-10-14 05:10:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 8454144. Throughput: 0: 1652.1, 1: 1645.3. Samples: 2128942. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 05:10:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:10:38,519][100936] Updated weights for policy 0, policy_version 4140 (0.0009) +[2023-10-14 05:10:38,898][100936] Updated weights for policy 0, policy_version 4150 (0.0011) +[2023-10-14 05:10:39,274][100936] Updated weights for policy 0, policy_version 4160 (0.0009) +[2023-10-14 05:10:39,961][100917] Updated weights for policy 1, policy_version 4132 (0.0009) +[2023-10-14 05:10:40,328][100917] Updated weights for policy 1, policy_version 4142 (0.0008) +[2023-10-14 05:10:40,700][100917] Updated weights for policy 1, policy_version 4152 (0.0007) +[2023-10-14 05:10:43,449][100936] Updated weights for policy 0, policy_version 4170 (0.0008) +[2023-10-14 05:10:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 8519680. Throughput: 0: 1656.2, 1: 1640.5. Samples: 2138256. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:10:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:10:43,814][100936] Updated weights for policy 0, policy_version 4180 (0.0011) +[2023-10-14 05:10:44,183][100936] Updated weights for policy 0, policy_version 4190 (0.0011) +[2023-10-14 05:10:44,771][100917] Updated weights for policy 1, policy_version 4162 (0.0008) +[2023-10-14 05:10:45,149][100917] Updated weights for policy 1, policy_version 4172 (0.0010) +[2023-10-14 05:10:45,519][100917] Updated weights for policy 1, policy_version 4182 (0.0010) +[2023-10-14 05:10:45,894][100917] Updated weights for policy 1, policy_version 4192 (0.0010) +[2023-10-14 05:10:48,417][100936] Updated weights for policy 0, policy_version 4200 (0.0009) +[2023-10-14 05:10:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 8585216. Throughput: 0: 1654.4, 1: 1650.0. Samples: 2158410. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:10:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:10:48,795][100936] Updated weights for policy 0, policy_version 4210 (0.0009) +[2023-10-14 05:10:49,167][100936] Updated weights for policy 0, policy_version 4220 (0.0007) +[2023-10-14 05:10:50,188][100917] Updated weights for policy 1, policy_version 4202 (0.0010) +[2023-10-14 05:10:50,560][100917] Updated weights for policy 1, policy_version 4212 (0.0008) +[2023-10-14 05:10:50,934][100917] Updated weights for policy 1, policy_version 4222 (0.0008) +[2023-10-14 05:10:53,291][100936] Updated weights for policy 0, policy_version 4230 (0.0007) +[2023-10-14 05:10:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 8650752. Throughput: 0: 1642.5, 1: 1648.6. Samples: 2178156. Policy #0 lag: (min: 31.0, avg: 32.7, max: 59.0) +[2023-10-14 05:10:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:10:53,673][100936] Updated weights for policy 0, policy_version 4240 (0.0007) +[2023-10-14 05:10:54,040][100936] Updated weights for policy 0, policy_version 4250 (0.0007) +[2023-10-14 05:10:55,042][100917] Updated weights for policy 1, policy_version 4232 (0.0007) +[2023-10-14 05:10:55,427][100917] Updated weights for policy 1, policy_version 4242 (0.0007) +[2023-10-14 05:10:55,799][100917] Updated weights for policy 1, policy_version 4252 (0.0009) +[2023-10-14 05:10:58,086][100936] Updated weights for policy 0, policy_version 4260 (0.0010) +[2023-10-14 05:10:58,455][100936] Updated weights for policy 0, policy_version 4270 (0.0009) +[2023-10-14 05:10:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 8716288. Throughput: 0: 1652.3, 1: 1643.9. Samples: 2187644. Policy #0 lag: (min: 31.0, avg: 32.7, max: 59.0) +[2023-10-14 05:10:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:10:58,822][100936] Updated weights for policy 0, policy_version 4280 (0.0007) +[2023-10-14 05:10:59,806][100917] Updated weights for policy 1, policy_version 4262 (0.0009) +[2023-10-14 05:11:00,177][100917] Updated weights for policy 1, policy_version 4272 (0.0009) +[2023-10-14 05:11:00,555][100917] Updated weights for policy 1, policy_version 4282 (0.0007) +[2023-10-14 05:11:02,682][100936] Updated weights for policy 0, policy_version 4290 (0.0007) +[2023-10-14 05:11:03,058][100936] Updated weights for policy 0, policy_version 4300 (0.0007) +[2023-10-14 05:11:03,422][100936] Updated weights for policy 0, policy_version 4310 (0.0007) +[2023-10-14 05:11:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 8781824. Throughput: 0: 1658.2, 1: 1648.5. Samples: 2208162. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:11:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:11:03,792][100936] Updated weights for policy 0, policy_version 4320 (0.0008) +[2023-10-14 05:11:04,688][100917] Updated weights for policy 1, policy_version 4292 (0.0010) +[2023-10-14 05:11:05,067][100917] Updated weights for policy 1, policy_version 4302 (0.0007) +[2023-10-14 05:11:05,454][100917] Updated weights for policy 1, policy_version 4312 (0.0008) +[2023-10-14 05:11:07,990][100936] Updated weights for policy 0, policy_version 4330 (0.0010) +[2023-10-14 05:11:08,360][100936] Updated weights for policy 0, policy_version 4340 (0.0009) +[2023-10-14 05:11:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 8847360. Throughput: 0: 1645.9, 1: 1647.0. Samples: 2227762. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:11:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:11:08,742][100936] Updated weights for policy 0, policy_version 4350 (0.0008) +[2023-10-14 05:11:09,625][100917] Updated weights for policy 1, policy_version 4322 (0.0008) +[2023-10-14 05:11:09,998][100917] Updated weights for policy 1, policy_version 4332 (0.0008) +[2023-10-14 05:11:10,378][100917] Updated weights for policy 1, policy_version 4342 (0.0007) +[2023-10-14 05:11:10,758][100917] Updated weights for policy 1, policy_version 4352 (0.0010) +[2023-10-14 05:11:12,756][100936] Updated weights for policy 0, policy_version 4360 (0.0009) +[2023-10-14 05:11:13,130][100936] Updated weights for policy 0, policy_version 4370 (0.0007) +[2023-10-14 05:11:13,511][100936] Updated weights for policy 0, policy_version 4380 (0.0010) +[2023-10-14 05:11:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 8912896. Throughput: 0: 1660.4, 1: 1643.3. Samples: 2237608. Policy #0 lag: (min: 31.0, avg: 31.7, max: 49.0) +[2023-10-14 05:11:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:11:14,948][100917] Updated weights for policy 1, policy_version 4362 (0.0010) +[2023-10-14 05:11:15,326][100917] Updated weights for policy 1, policy_version 4372 (0.0007) +[2023-10-14 05:11:15,702][100917] Updated weights for policy 1, policy_version 4382 (0.0007) +[2023-10-14 05:11:17,531][100936] Updated weights for policy 0, policy_version 4390 (0.0007) +[2023-10-14 05:11:17,910][100936] Updated weights for policy 0, policy_version 4400 (0.0007) +[2023-10-14 05:11:18,283][100936] Updated weights for policy 0, policy_version 4410 (0.0007) +[2023-10-14 05:11:18,514][99942] Fps is (10 sec: 16380.8, 60 sec: 13652.9, 300 sec: 13218.2). Total num frames: 9011200. Throughput: 0: 1664.9, 1: 1644.6. Samples: 2258114. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:11:18,515][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:11:19,763][100917] Updated weights for policy 1, policy_version 4392 (0.0007) +[2023-10-14 05:11:20,140][100917] Updated weights for policy 1, policy_version 4402 (0.0010) +[2023-10-14 05:11:20,512][100917] Updated weights for policy 1, policy_version 4412 (0.0009) +[2023-10-14 05:11:22,474][100936] Updated weights for policy 0, policy_version 4420 (0.0007) +[2023-10-14 05:11:22,856][100936] Updated weights for policy 0, policy_version 4430 (0.0009) +[2023-10-14 05:11:23,226][100936] Updated weights for policy 0, policy_version 4440 (0.0010) +[2023-10-14 05:11:23,512][99942] Fps is (10 sec: 13106.6, 60 sec: 13107.1, 300 sec: 13107.2). Total num frames: 9043968. Throughput: 0: 1650.9, 1: 1647.5. Samples: 2277372. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:11:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.680')] +[2023-10-14 05:11:24,648][100917] Updated weights for policy 1, policy_version 4422 (0.0008) +[2023-10-14 05:11:25,009][100917] Updated weights for policy 1, policy_version 4432 (0.0007) +[2023-10-14 05:11:25,382][100917] Updated weights for policy 1, policy_version 4442 (0.0008) +[2023-10-14 05:11:27,434][100936] Updated weights for policy 0, policy_version 4450 (0.0010) +[2023-10-14 05:11:27,801][100936] Updated weights for policy 0, policy_version 4460 (0.0008) +[2023-10-14 05:11:28,179][100936] Updated weights for policy 0, policy_version 4470 (0.0007) +[2023-10-14 05:11:28,512][99942] Fps is (10 sec: 9832.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 9109504. Throughput: 0: 1666.9, 1: 1645.9. Samples: 2287330. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:11:28,512][99942] Avg episode reward: [(0, '0.830'), (1, '0.680')] +[2023-10-14 05:11:28,544][100936] Updated weights for policy 0, policy_version 4480 (0.0008) +[2023-10-14 05:11:29,475][100917] Updated weights for policy 1, policy_version 4452 (0.0007) +[2023-10-14 05:11:29,840][100917] Updated weights for policy 1, policy_version 4462 (0.0010) +[2023-10-14 05:11:30,210][100917] Updated weights for policy 1, policy_version 4472 (0.0011) +[2023-10-14 05:11:32,646][100936] Updated weights for policy 0, policy_version 4490 (0.0010) +[2023-10-14 05:11:33,028][100936] Updated weights for policy 0, policy_version 4500 (0.0009) +[2023-10-14 05:11:33,400][100936] Updated weights for policy 0, policy_version 4510 (0.0008) +[2023-10-14 05:11:33,512][99942] Fps is (10 sec: 16384.5, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 9207808. Throughput: 0: 1666.9, 1: 1649.2. Samples: 2307634. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:11:33,513][99942] Avg episode reward: [(0, '0.830'), (1, '0.920')] +[2023-10-14 05:11:34,338][100917] Updated weights for policy 1, policy_version 4482 (0.0009) +[2023-10-14 05:11:34,749][100917] Updated weights for policy 1, policy_version 4492 (0.0008) +[2023-10-14 05:11:35,135][100917] Updated weights for policy 1, policy_version 4502 (0.0008) +[2023-10-14 05:11:35,513][100917] Updated weights for policy 1, policy_version 4512 (0.0008) +[2023-10-14 05:11:37,482][100936] Updated weights for policy 0, policy_version 4520 (0.0009) +[2023-10-14 05:11:37,843][100936] Updated weights for policy 0, policy_version 4530 (0.0008) +[2023-10-14 05:11:38,222][100936] Updated weights for policy 0, policy_version 4540 (0.0007) +[2023-10-14 05:11:38,512][99942] Fps is (10 sec: 16383.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 9273344. Throughput: 0: 1652.5, 1: 1647.0. Samples: 2326636. Policy #0 lag: (min: 31.0, avg: 32.1, max: 54.0) +[2023-10-14 05:11:38,513][99942] Avg episode reward: [(0, '0.830'), (1, '0.940')] +[2023-10-14 05:11:38,526][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000004544_4653056.pth... +[2023-10-14 05:11:38,527][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000004512_4620288.pth... +[2023-10-14 05:11:38,561][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000002976_3047424.pth +[2023-10-14 05:11:38,561][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000002976_3047424.pth +[2023-10-14 05:11:39,682][100917] Updated weights for policy 1, policy_version 4522 (0.0007) +[2023-10-14 05:11:40,053][100917] Updated weights for policy 1, policy_version 4532 (0.0011) +[2023-10-14 05:11:40,415][100917] Updated weights for policy 1, policy_version 4542 (0.0011) +[2023-10-14 05:11:42,495][100936] Updated weights for policy 0, policy_version 4550 (0.0009) +[2023-10-14 05:11:42,876][100936] Updated weights for policy 0, policy_version 4560 (0.0009) +[2023-10-14 05:11:43,255][100936] Updated weights for policy 0, policy_version 4570 (0.0008) +[2023-10-14 05:11:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 9338880. Throughput: 0: 1668.5, 1: 1642.2. Samples: 2336628. Policy #0 lag: (min: 31.0, avg: 32.1, max: 54.0) +[2023-10-14 05:11:43,513][99942] Avg episode reward: [(0, '0.830'), (1, '0.980')] +[2023-10-14 05:11:44,568][100917] Updated weights for policy 1, policy_version 4552 (0.0010) +[2023-10-14 05:11:44,941][100917] Updated weights for policy 1, policy_version 4562 (0.0011) +[2023-10-14 05:11:45,311][100917] Updated weights for policy 1, policy_version 4572 (0.0008) +[2023-10-14 05:11:47,149][100936] Updated weights for policy 0, policy_version 4580 (0.0009) +[2023-10-14 05:11:47,520][100936] Updated weights for policy 0, policy_version 4590 (0.0009) +[2023-10-14 05:11:47,892][100936] Updated weights for policy 0, policy_version 4600 (0.0008) +[2023-10-14 05:11:48,512][99942] Fps is (10 sec: 13107.8, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 9404416. Throughput: 0: 1652.0, 1: 1645.3. Samples: 2356540. Policy #0 lag: (min: 10.0, avg: 10.2, max: 18.0) +[2023-10-14 05:11:48,512][99942] Avg episode reward: [(0, '0.830'), (1, '0.980')] +[2023-10-14 05:11:49,637][100917] Updated weights for policy 1, policy_version 4582 (0.0008) +[2023-10-14 05:11:50,015][100917] Updated weights for policy 1, policy_version 4592 (0.0007) +[2023-10-14 05:11:50,393][100917] Updated weights for policy 1, policy_version 4602 (0.0009) +[2023-10-14 05:11:52,033][100936] Updated weights for policy 0, policy_version 4610 (0.0009) +[2023-10-14 05:11:52,408][100936] Updated weights for policy 0, policy_version 4620 (0.0009) +[2023-10-14 05:11:52,781][100936] Updated weights for policy 0, policy_version 4630 (0.0010) +[2023-10-14 05:11:53,150][100936] Updated weights for policy 0, policy_version 4640 (0.0009) +[2023-10-14 05:11:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 9469952. Throughput: 0: 1650.1, 1: 1648.8. Samples: 2376216. Policy #0 lag: (min: 10.0, avg: 10.2, max: 18.0) +[2023-10-14 05:11:53,513][99942] Avg episode reward: [(0, '0.830'), (1, '0.980')] +[2023-10-14 05:11:54,560][100917] Updated weights for policy 1, policy_version 4612 (0.0008) +[2023-10-14 05:11:54,939][100917] Updated weights for policy 1, policy_version 4622 (0.0010) +[2023-10-14 05:11:55,307][100917] Updated weights for policy 1, policy_version 4632 (0.0007) +[2023-10-14 05:11:57,359][100936] Updated weights for policy 0, policy_version 4650 (0.0008) +[2023-10-14 05:11:57,726][100936] Updated weights for policy 0, policy_version 4660 (0.0008) +[2023-10-14 05:11:58,097][100936] Updated weights for policy 0, policy_version 4670 (0.0009) +[2023-10-14 05:11:58,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 9535488. Throughput: 0: 1661.7, 1: 1645.8. Samples: 2386446. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:11:58,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:11:59,622][100917] Updated weights for policy 1, policy_version 4642 (0.0008) +[2023-10-14 05:11:59,991][100917] Updated weights for policy 1, policy_version 4652 (0.0010) +[2023-10-14 05:12:00,358][100917] Updated weights for policy 1, policy_version 4662 (0.0010) +[2023-10-14 05:12:00,741][100917] Updated weights for policy 1, policy_version 4672 (0.0008) +[2023-10-14 05:12:02,200][100936] Updated weights for policy 0, policy_version 4680 (0.0010) +[2023-10-14 05:12:02,570][100936] Updated weights for policy 0, policy_version 4690 (0.0008) +[2023-10-14 05:12:02,939][100936] Updated weights for policy 0, policy_version 4700 (0.0009) +[2023-10-14 05:12:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 9601024. Throughput: 0: 1648.1, 1: 1644.1. Samples: 2406254. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:12:03,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:12:04,759][100917] Updated weights for policy 1, policy_version 4682 (0.0009) +[2023-10-14 05:12:05,137][100917] Updated weights for policy 1, policy_version 4692 (0.0010) +[2023-10-14 05:12:05,517][100917] Updated weights for policy 1, policy_version 4702 (0.0010) +[2023-10-14 05:12:07,017][100936] Updated weights for policy 0, policy_version 4710 (0.0009) +[2023-10-14 05:12:07,383][100936] Updated weights for policy 0, policy_version 4720 (0.0009) +[2023-10-14 05:12:07,754][100936] Updated weights for policy 0, policy_version 4730 (0.0011) +[2023-10-14 05:12:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 9666560. Throughput: 0: 1655.3, 1: 1649.2. Samples: 2426072. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 05:12:08,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.980')] +[2023-10-14 05:12:09,711][100917] Updated weights for policy 1, policy_version 4712 (0.0009) +[2023-10-14 05:12:10,087][100917] Updated weights for policy 1, policy_version 4722 (0.0008) +[2023-10-14 05:12:10,462][100917] Updated weights for policy 1, policy_version 4732 (0.0010) +[2023-10-14 05:12:11,786][100936] Updated weights for policy 0, policy_version 4740 (0.0007) +[2023-10-14 05:12:12,157][100936] Updated weights for policy 0, policy_version 4750 (0.0008) +[2023-10-14 05:12:12,536][100936] Updated weights for policy 0, policy_version 4760 (0.0007) +[2023-10-14 05:12:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 9732096. Throughput: 0: 1664.6, 1: 1646.3. Samples: 2436318. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 05:12:13,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.980')] +[2023-10-14 05:12:14,620][100917] Updated weights for policy 1, policy_version 4742 (0.0008) +[2023-10-14 05:12:14,982][100917] Updated weights for policy 1, policy_version 4752 (0.0007) +[2023-10-14 05:12:15,361][100917] Updated weights for policy 1, policy_version 4762 (0.0008) +[2023-10-14 05:12:16,685][100936] Updated weights for policy 0, policy_version 4770 (0.0008) +[2023-10-14 05:12:17,055][100936] Updated weights for policy 0, policy_version 4780 (0.0007) +[2023-10-14 05:12:17,424][100936] Updated weights for policy 0, policy_version 4790 (0.0007) +[2023-10-14 05:12:17,806][100936] Updated weights for policy 0, policy_version 4800 (0.0007) +[2023-10-14 05:12:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.6, 300 sec: 13218.3). Total num frames: 9797632. Throughput: 0: 1647.1, 1: 1646.8. Samples: 2455858. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 05:12:18,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.980')] +[2023-10-14 05:12:19,525][100917] Updated weights for policy 1, policy_version 4772 (0.0008) +[2023-10-14 05:12:19,916][100917] Updated weights for policy 1, policy_version 4782 (0.0010) +[2023-10-14 05:12:20,290][100917] Updated weights for policy 1, policy_version 4792 (0.0009) +[2023-10-14 05:12:21,858][100936] Updated weights for policy 0, policy_version 4810 (0.0008) +[2023-10-14 05:12:22,216][100936] Updated weights for policy 0, policy_version 4820 (0.0008) +[2023-10-14 05:12:22,594][100936] Updated weights for policy 0, policy_version 4830 (0.0007) +[2023-10-14 05:12:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 9863168. Throughput: 0: 1666.6, 1: 1654.6. Samples: 2476092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 05:12:23,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.980')] +[2023-10-14 05:12:24,289][100917] Updated weights for policy 1, policy_version 4802 (0.0007) +[2023-10-14 05:12:24,662][100917] Updated weights for policy 1, policy_version 4812 (0.0007) +[2023-10-14 05:12:25,022][100917] Updated weights for policy 1, policy_version 4822 (0.0010) +[2023-10-14 05:12:25,401][100917] Updated weights for policy 1, policy_version 4832 (0.0009) +[2023-10-14 05:12:26,677][100936] Updated weights for policy 0, policy_version 4840 (0.0009) +[2023-10-14 05:12:27,057][100936] Updated weights for policy 0, policy_version 4850 (0.0009) +[2023-10-14 05:12:27,416][100936] Updated weights for policy 0, policy_version 4860 (0.0009) +[2023-10-14 05:12:28,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 9928704. Throughput: 0: 1670.1, 1: 1655.6. Samples: 2486284. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 05:12:28,512][99942] Avg episode reward: [(0, '0.580'), (1, '0.980')] +[2023-10-14 05:12:29,357][100917] Updated weights for policy 1, policy_version 4842 (0.0010) +[2023-10-14 05:12:29,733][100917] Updated weights for policy 1, policy_version 4852 (0.0010) +[2023-10-14 05:12:30,106][100917] Updated weights for policy 1, policy_version 4862 (0.0009) +[2023-10-14 05:12:31,617][100936] Updated weights for policy 0, policy_version 4870 (0.0009) +[2023-10-14 05:12:31,982][100936] Updated weights for policy 0, policy_version 4880 (0.0011) +[2023-10-14 05:12:32,356][100936] Updated weights for policy 0, policy_version 4890 (0.0008) +[2023-10-14 05:12:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 9994240. Throughput: 0: 1657.7, 1: 1657.6. Samples: 2505730. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 05:12:33,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.980')] +[2023-10-14 05:12:34,341][100917] Updated weights for policy 1, policy_version 4872 (0.0009) +[2023-10-14 05:12:34,715][100917] Updated weights for policy 1, policy_version 4882 (0.0007) +[2023-10-14 05:12:35,085][100917] Updated weights for policy 1, policy_version 4892 (0.0007) +[2023-10-14 05:12:36,499][100936] Updated weights for policy 0, policy_version 4900 (0.0010) +[2023-10-14 05:12:36,870][100936] Updated weights for policy 0, policy_version 4910 (0.0009) +[2023-10-14 05:12:37,242][100936] Updated weights for policy 0, policy_version 4920 (0.0007) +[2023-10-14 05:12:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 10059776. Throughput: 0: 1669.6, 1: 1656.0. Samples: 2525872. Policy #0 lag: (min: 31.0, avg: 35.9, max: 63.0) +[2023-10-14 05:12:38,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.980')] +[2023-10-14 05:12:39,235][100917] Updated weights for policy 1, policy_version 4902 (0.0009) +[2023-10-14 05:12:39,600][100917] Updated weights for policy 1, policy_version 4912 (0.0010) +[2023-10-14 05:12:39,978][100917] Updated weights for policy 1, policy_version 4922 (0.0009) +[2023-10-14 05:12:41,492][100936] Updated weights for policy 0, policy_version 4930 (0.0009) +[2023-10-14 05:12:41,861][100936] Updated weights for policy 0, policy_version 4940 (0.0008) +[2023-10-14 05:12:42,242][100936] Updated weights for policy 0, policy_version 4950 (0.0010) +[2023-10-14 05:12:42,617][100936] Updated weights for policy 0, policy_version 4960 (0.0010) +[2023-10-14 05:12:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 10125312. Throughput: 0: 1665.8, 1: 1657.1. Samples: 2535974. Policy #0 lag: (min: 31.0, avg: 35.9, max: 63.0) +[2023-10-14 05:12:43,512][99942] Avg episode reward: [(0, '0.580'), (1, '0.980')] +[2023-10-14 05:12:44,095][100917] Updated weights for policy 1, policy_version 4932 (0.0009) +[2023-10-14 05:12:44,459][100917] Updated weights for policy 1, policy_version 4942 (0.0010) +[2023-10-14 05:12:44,842][100917] Updated weights for policy 1, policy_version 4952 (0.0011) +[2023-10-14 05:12:46,648][100936] Updated weights for policy 0, policy_version 4970 (0.0008) +[2023-10-14 05:12:47,020][100936] Updated weights for policy 0, policy_version 4980 (0.0008) +[2023-10-14 05:12:47,390][100936] Updated weights for policy 0, policy_version 4990 (0.0009) +[2023-10-14 05:12:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 10190848. Throughput: 0: 1653.6, 1: 1658.2. Samples: 2555286. Policy #0 lag: (min: 8.0, avg: 23.2, max: 40.0) +[2023-10-14 05:12:48,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.980')] +[2023-10-14 05:12:49,099][100917] Updated weights for policy 1, policy_version 4962 (0.0008) +[2023-10-14 05:12:49,480][100917] Updated weights for policy 1, policy_version 4972 (0.0009) +[2023-10-14 05:12:49,860][100917] Updated weights for policy 1, policy_version 4982 (0.0010) +[2023-10-14 05:12:50,240][100917] Updated weights for policy 1, policy_version 4992 (0.0010) +[2023-10-14 05:12:51,382][100936] Updated weights for policy 0, policy_version 5000 (0.0008) +[2023-10-14 05:12:51,762][100936] Updated weights for policy 0, policy_version 5010 (0.0008) +[2023-10-14 05:12:52,132][100936] Updated weights for policy 0, policy_version 5020 (0.0008) +[2023-10-14 05:12:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 10256384. Throughput: 0: 1670.2, 1: 1654.5. Samples: 2575682. Policy #0 lag: (min: 8.0, avg: 23.2, max: 40.0) +[2023-10-14 05:12:53,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:12:54,405][100917] Updated weights for policy 1, policy_version 5002 (0.0009) +[2023-10-14 05:12:54,765][100917] Updated weights for policy 1, policy_version 5012 (0.0009) +[2023-10-14 05:12:55,137][100917] Updated weights for policy 1, policy_version 5022 (0.0009) +[2023-10-14 05:12:56,100][100936] Updated weights for policy 0, policy_version 5030 (0.0007) +[2023-10-14 05:12:56,469][100936] Updated weights for policy 0, policy_version 5040 (0.0007) +[2023-10-14 05:12:56,845][100936] Updated weights for policy 0, policy_version 5050 (0.0008) +[2023-10-14 05:12:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 10321920. Throughput: 0: 1657.2, 1: 1657.5. Samples: 2585476. Policy #0 lag: (min: 27.0, avg: 27.3, max: 39.0) +[2023-10-14 05:12:58,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:12:59,066][100917] Updated weights for policy 1, policy_version 5032 (0.0008) +[2023-10-14 05:12:59,455][100917] Updated weights for policy 1, policy_version 5042 (0.0007) +[2023-10-14 05:12:59,827][100917] Updated weights for policy 1, policy_version 5052 (0.0009) +[2023-10-14 05:13:01,038][100936] Updated weights for policy 0, policy_version 5060 (0.0009) +[2023-10-14 05:13:01,403][100936] Updated weights for policy 0, policy_version 5070 (0.0009) +[2023-10-14 05:13:01,777][100936] Updated weights for policy 0, policy_version 5080 (0.0010) +[2023-10-14 05:13:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 10387456. Throughput: 0: 1655.5, 1: 1662.0. Samples: 2605144. Policy #0 lag: (min: 27.0, avg: 27.3, max: 39.0) +[2023-10-14 05:13:03,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:13:03,903][100917] Updated weights for policy 1, policy_version 5062 (0.0008) +[2023-10-14 05:13:04,276][100917] Updated weights for policy 1, policy_version 5072 (0.0009) +[2023-10-14 05:13:04,654][100917] Updated weights for policy 1, policy_version 5082 (0.0011) +[2023-10-14 05:13:05,866][100936] Updated weights for policy 0, policy_version 5090 (0.0008) +[2023-10-14 05:13:06,240][100936] Updated weights for policy 0, policy_version 5100 (0.0008) +[2023-10-14 05:13:06,598][100936] Updated weights for policy 0, policy_version 5110 (0.0008) +[2023-10-14 05:13:06,982][100936] Updated weights for policy 0, policy_version 5120 (0.0009) +[2023-10-14 05:13:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 10452992. Throughput: 0: 1665.5, 1: 1666.7. Samples: 2626042. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) +[2023-10-14 05:13:08,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:13:08,764][100917] Updated weights for policy 1, policy_version 5092 (0.0010) +[2023-10-14 05:13:09,143][100917] Updated weights for policy 1, policy_version 5102 (0.0009) +[2023-10-14 05:13:09,516][100917] Updated weights for policy 1, policy_version 5112 (0.0009) +[2023-10-14 05:13:11,167][100936] Updated weights for policy 0, policy_version 5130 (0.0011) +[2023-10-14 05:13:11,536][100936] Updated weights for policy 0, policy_version 5140 (0.0008) +[2023-10-14 05:13:11,911][100936] Updated weights for policy 0, policy_version 5150 (0.0008) +[2023-10-14 05:13:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 10518528. Throughput: 0: 1648.4, 1: 1661.9. Samples: 2635250. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) +[2023-10-14 05:13:13,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:13:13,798][100917] Updated weights for policy 1, policy_version 5122 (0.0010) +[2023-10-14 05:13:14,166][100917] Updated weights for policy 1, policy_version 5132 (0.0007) +[2023-10-14 05:13:14,553][100917] Updated weights for policy 1, policy_version 5142 (0.0007) +[2023-10-14 05:13:14,926][100917] Updated weights for policy 1, policy_version 5152 (0.0007) +[2023-10-14 05:13:16,296][100936] Updated weights for policy 0, policy_version 5160 (0.0008) +[2023-10-14 05:13:16,668][100936] Updated weights for policy 0, policy_version 5170 (0.0008) +[2023-10-14 05:13:17,043][100936] Updated weights for policy 0, policy_version 5180 (0.0007) +[2023-10-14 05:13:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 10584064. Throughput: 0: 1657.1, 1: 1655.9. Samples: 2654814. Policy #0 lag: (min: 26.0, avg: 33.4, max: 58.0) +[2023-10-14 05:13:18,512][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:13:19,095][100917] Updated weights for policy 1, policy_version 5162 (0.0008) +[2023-10-14 05:13:19,477][100917] Updated weights for policy 1, policy_version 5172 (0.0007) +[2023-10-14 05:13:19,840][100917] Updated weights for policy 1, policy_version 5182 (0.0010) +[2023-10-14 05:13:21,113][100936] Updated weights for policy 0, policy_version 5190 (0.0008) +[2023-10-14 05:13:21,484][100936] Updated weights for policy 0, policy_version 5200 (0.0007) +[2023-10-14 05:13:21,858][100936] Updated weights for policy 0, policy_version 5210 (0.0009) +[2023-10-14 05:13:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 10649600. Throughput: 0: 1665.5, 1: 1653.3. Samples: 2675214. Policy #0 lag: (min: 26.0, avg: 33.4, max: 58.0) +[2023-10-14 05:13:23,512][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:13:23,944][100917] Updated weights for policy 1, policy_version 5192 (0.0008) +[2023-10-14 05:13:24,318][100917] Updated weights for policy 1, policy_version 5202 (0.0008) +[2023-10-14 05:13:24,694][100917] Updated weights for policy 1, policy_version 5212 (0.0008) +[2023-10-14 05:13:25,768][100936] Updated weights for policy 0, policy_version 5220 (0.0009) +[2023-10-14 05:13:26,157][100936] Updated weights for policy 0, policy_version 5230 (0.0010) +[2023-10-14 05:13:26,527][100936] Updated weights for policy 0, policy_version 5240 (0.0009) +[2023-10-14 05:13:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 10715136. Throughput: 0: 1650.5, 1: 1654.7. Samples: 2684708. Policy #0 lag: (min: 1.0, avg: 5.2, max: 33.0) +[2023-10-14 05:13:28,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:13:28,886][100917] Updated weights for policy 1, policy_version 5222 (0.0010) +[2023-10-14 05:13:29,270][100917] Updated weights for policy 1, policy_version 5232 (0.0010) +[2023-10-14 05:13:29,651][100917] Updated weights for policy 1, policy_version 5242 (0.0009) +[2023-10-14 05:13:30,415][100936] Updated weights for policy 0, policy_version 5250 (0.0010) +[2023-10-14 05:13:30,793][100936] Updated weights for policy 0, policy_version 5260 (0.0009) +[2023-10-14 05:13:31,169][100936] Updated weights for policy 0, policy_version 5270 (0.0010) +[2023-10-14 05:13:31,528][100936] Updated weights for policy 0, policy_version 5280 (0.0009) +[2023-10-14 05:13:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 10780672. Throughput: 0: 1664.0, 1: 1654.5. Samples: 2704614. Policy #0 lag: (min: 1.0, avg: 5.2, max: 33.0) +[2023-10-14 05:13:33,512][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:13:33,744][100917] Updated weights for policy 1, policy_version 5252 (0.0008) +[2023-10-14 05:13:34,112][100917] Updated weights for policy 1, policy_version 5262 (0.0007) +[2023-10-14 05:13:34,487][100917] Updated weights for policy 1, policy_version 5272 (0.0010) +[2023-10-14 05:13:35,717][100936] Updated weights for policy 0, policy_version 5290 (0.0007) +[2023-10-14 05:13:36,081][100936] Updated weights for policy 0, policy_version 5300 (0.0009) +[2023-10-14 05:13:36,458][100936] Updated weights for policy 0, policy_version 5310 (0.0008) +[2023-10-14 05:13:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 10846208. Throughput: 0: 1666.8, 1: 1655.3. Samples: 2725176. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:13:38,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:13:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000005312_5439488.pth... +[2023-10-14 05:13:38,559][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000003744_3833856.pth +[2023-10-14 05:13:38,560][100917] Updated weights for policy 1, policy_version 5282 (0.0007) +[2023-10-14 05:13:38,934][100917] Updated weights for policy 1, policy_version 5292 (0.0008) +[2023-10-14 05:13:39,306][100917] Updated weights for policy 1, policy_version 5302 (0.0008) +[2023-10-14 05:13:39,682][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000005312_5439488.pth... +[2023-10-14 05:13:39,684][100917] Updated weights for policy 1, policy_version 5312 (0.0008) +[2023-10-14 05:13:39,711][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000003744_3833856.pth +[2023-10-14 05:13:40,551][100936] Updated weights for policy 0, policy_version 5320 (0.0009) +[2023-10-14 05:13:40,920][100936] Updated weights for policy 0, policy_version 5330 (0.0009) +[2023-10-14 05:13:41,296][100936] Updated weights for policy 0, policy_version 5340 (0.0009) +[2023-10-14 05:13:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 10911744. Throughput: 0: 1649.9, 1: 1655.2. Samples: 2734206. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:13:43,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:13:43,815][100917] Updated weights for policy 1, policy_version 5322 (0.0009) +[2023-10-14 05:13:44,196][100917] Updated weights for policy 1, policy_version 5332 (0.0010) +[2023-10-14 05:13:44,573][100917] Updated weights for policy 1, policy_version 5342 (0.0009) +[2023-10-14 05:13:45,552][100936] Updated weights for policy 0, policy_version 5350 (0.0009) +[2023-10-14 05:13:45,926][100936] Updated weights for policy 0, policy_version 5360 (0.0011) +[2023-10-14 05:13:46,289][100936] Updated weights for policy 0, policy_version 5370 (0.0007) +[2023-10-14 05:13:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 10977280. Throughput: 0: 1667.4, 1: 1654.2. Samples: 2754614. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-10-14 05:13:48,513][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:13:48,682][100917] Updated weights for policy 1, policy_version 5352 (0.0009) +[2023-10-14 05:13:49,070][100917] Updated weights for policy 1, policy_version 5362 (0.0008) +[2023-10-14 05:13:49,447][100917] Updated weights for policy 1, policy_version 5372 (0.0007) +[2023-10-14 05:13:50,386][100936] Updated weights for policy 0, policy_version 5380 (0.0008) +[2023-10-14 05:13:50,761][100936] Updated weights for policy 0, policy_version 5390 (0.0010) +[2023-10-14 05:13:51,134][100936] Updated weights for policy 0, policy_version 5400 (0.0010) +[2023-10-14 05:13:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 11042816. Throughput: 0: 1664.6, 1: 1644.1. Samples: 2774934. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-10-14 05:13:53,512][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:13:53,672][100917] Updated weights for policy 1, policy_version 5382 (0.0010) +[2023-10-14 05:13:54,056][100917] Updated weights for policy 1, policy_version 5392 (0.0008) +[2023-10-14 05:13:54,431][100917] Updated weights for policy 1, policy_version 5402 (0.0008) +[2023-10-14 05:13:55,203][100936] Updated weights for policy 0, policy_version 5410 (0.0009) +[2023-10-14 05:13:55,581][100936] Updated weights for policy 0, policy_version 5420 (0.0009) +[2023-10-14 05:13:55,943][100936] Updated weights for policy 0, policy_version 5430 (0.0007) +[2023-10-14 05:13:56,307][100936] Updated weights for policy 0, policy_version 5440 (0.0008) +[2023-10-14 05:13:58,487][100917] Updated weights for policy 1, policy_version 5412 (0.0009) +[2023-10-14 05:13:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 11108352. Throughput: 0: 1655.6, 1: 1646.9. Samples: 2783860. Policy #0 lag: (min: 2.0, avg: 3.4, max: 27.0) +[2023-10-14 05:13:58,512][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:13:58,877][100917] Updated weights for policy 1, policy_version 5422 (0.0010) +[2023-10-14 05:13:59,247][100917] Updated weights for policy 1, policy_version 5432 (0.0009) +[2023-10-14 05:14:00,332][100936] Updated weights for policy 0, policy_version 5450 (0.0007) +[2023-10-14 05:14:00,716][100936] Updated weights for policy 0, policy_version 5460 (0.0007) +[2023-10-14 05:14:01,081][100936] Updated weights for policy 0, policy_version 5470 (0.0009) +[2023-10-14 05:14:03,262][100917] Updated weights for policy 1, policy_version 5442 (0.0009) +[2023-10-14 05:14:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 11173888. Throughput: 0: 1671.6, 1: 1653.4. Samples: 2804438. Policy #0 lag: (min: 2.0, avg: 3.4, max: 27.0) +[2023-10-14 05:14:03,512][99942] Avg episode reward: [(0, '0.580'), (1, '0.970')] +[2023-10-14 05:14:03,624][100917] Updated weights for policy 1, policy_version 5452 (0.0008) +[2023-10-14 05:14:04,003][100917] Updated weights for policy 1, policy_version 5462 (0.0011) +[2023-10-14 05:14:04,377][100917] Updated weights for policy 1, policy_version 5472 (0.0010) +[2023-10-14 05:14:05,298][100936] Updated weights for policy 0, policy_version 5480 (0.0010) +[2023-10-14 05:14:05,669][100936] Updated weights for policy 0, policy_version 5490 (0.0010) +[2023-10-14 05:14:06,038][100936] Updated weights for policy 0, policy_version 5500 (0.0010) +[2023-10-14 05:14:08,364][100917] Updated weights for policy 1, policy_version 5482 (0.0011) +[2023-10-14 05:14:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 11239424. Throughput: 0: 1669.8, 1: 1657.4. Samples: 2824938. Policy #0 lag: (min: 17.0, avg: 22.1, max: 49.0) +[2023-10-14 05:14:08,512][99942] Avg episode reward: [(0, '0.750'), (1, '0.970')] +[2023-10-14 05:14:08,743][100917] Updated weights for policy 1, policy_version 5492 (0.0009) +[2023-10-14 05:14:09,116][100917] Updated weights for policy 1, policy_version 5502 (0.0007) +[2023-10-14 05:14:09,974][100936] Updated weights for policy 0, policy_version 5510 (0.0010) +[2023-10-14 05:14:10,343][100936] Updated weights for policy 0, policy_version 5520 (0.0009) +[2023-10-14 05:14:10,709][100936] Updated weights for policy 0, policy_version 5530 (0.0010) +[2023-10-14 05:14:13,376][100917] Updated weights for policy 1, policy_version 5512 (0.0009) +[2023-10-14 05:14:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 11304960. Throughput: 0: 1655.2, 1: 1657.6. Samples: 2833782. Policy #0 lag: (min: 17.0, avg: 22.1, max: 49.0) +[2023-10-14 05:14:13,513][99942] Avg episode reward: [(0, '0.750'), (1, '0.980')] +[2023-10-14 05:14:13,755][100917] Updated weights for policy 1, policy_version 5522 (0.0009) +[2023-10-14 05:14:14,127][100917] Updated weights for policy 1, policy_version 5532 (0.0009) +[2023-10-14 05:14:14,794][100936] Updated weights for policy 0, policy_version 5540 (0.0009) +[2023-10-14 05:14:15,170][100936] Updated weights for policy 0, policy_version 5550 (0.0011) +[2023-10-14 05:14:15,550][100936] Updated weights for policy 0, policy_version 5560 (0.0009) +[2023-10-14 05:14:18,378][100917] Updated weights for policy 1, policy_version 5542 (0.0009) +[2023-10-14 05:14:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 11370496. Throughput: 0: 1665.5, 1: 1655.3. Samples: 2854050. Policy #0 lag: (min: 17.0, avg: 18.9, max: 49.0) +[2023-10-14 05:14:18,513][99942] Avg episode reward: [(0, '0.750'), (1, '0.980')] +[2023-10-14 05:14:18,754][100917] Updated weights for policy 1, policy_version 5552 (0.0008) +[2023-10-14 05:14:19,127][100917] Updated weights for policy 1, policy_version 5562 (0.0007) +[2023-10-14 05:14:19,662][100936] Updated weights for policy 0, policy_version 5570 (0.0008) +[2023-10-14 05:14:20,034][100936] Updated weights for policy 0, policy_version 5580 (0.0008) +[2023-10-14 05:14:20,409][100936] Updated weights for policy 0, policy_version 5590 (0.0009) +[2023-10-14 05:14:20,779][100936] Updated weights for policy 0, policy_version 5600 (0.0007) +[2023-10-14 05:14:23,311][100917] Updated weights for policy 1, policy_version 5572 (0.0008) +[2023-10-14 05:14:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 11436032. Throughput: 0: 1664.1, 1: 1650.1. Samples: 2874316. Policy #0 lag: (min: 17.0, avg: 18.9, max: 49.0) +[2023-10-14 05:14:23,512][99942] Avg episode reward: [(0, '0.750'), (1, '0.980')] +[2023-10-14 05:14:23,675][100917] Updated weights for policy 1, policy_version 5582 (0.0008) +[2023-10-14 05:14:24,042][100917] Updated weights for policy 1, policy_version 5592 (0.0009) +[2023-10-14 05:14:24,957][100936] Updated weights for policy 0, policy_version 5610 (0.0008) +[2023-10-14 05:14:25,334][100936] Updated weights for policy 0, policy_version 5620 (0.0008) +[2023-10-14 05:14:25,705][100936] Updated weights for policy 0, policy_version 5630 (0.0008) +[2023-10-14 05:14:28,408][100917] Updated weights for policy 1, policy_version 5602 (0.0010) +[2023-10-14 05:14:28,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 11501568. Throughput: 0: 1663.1, 1: 1648.8. Samples: 2883242. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:14:28,513][99942] Avg episode reward: [(0, '0.750'), (1, '0.980')] +[2023-10-14 05:14:28,781][100917] Updated weights for policy 1, policy_version 5612 (0.0007) +[2023-10-14 05:14:29,144][100917] Updated weights for policy 1, policy_version 5622 (0.0009) +[2023-10-14 05:14:29,519][100917] Updated weights for policy 1, policy_version 5632 (0.0009) +[2023-10-14 05:14:30,105][100936] Updated weights for policy 0, policy_version 5640 (0.0008) +[2023-10-14 05:14:30,484][100936] Updated weights for policy 0, policy_version 5650 (0.0009) +[2023-10-14 05:14:30,854][100936] Updated weights for policy 0, policy_version 5660 (0.0009) +[2023-10-14 05:14:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 11567104. Throughput: 0: 1662.9, 1: 1640.4. Samples: 2903264. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:14:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.970')] +[2023-10-14 05:14:33,795][100917] Updated weights for policy 1, policy_version 5642 (0.0009) +[2023-10-14 05:14:34,176][100917] Updated weights for policy 1, policy_version 5652 (0.0009) +[2023-10-14 05:14:34,551][100917] Updated weights for policy 1, policy_version 5662 (0.0011) +[2023-10-14 05:14:35,034][100936] Updated weights for policy 0, policy_version 5670 (0.0008) +[2023-10-14 05:14:35,420][100936] Updated weights for policy 0, policy_version 5680 (0.0010) +[2023-10-14 05:14:35,796][100936] Updated weights for policy 0, policy_version 5690 (0.0008) +[2023-10-14 05:14:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 11632640. Throughput: 0: 1655.6, 1: 1642.2. Samples: 2923336. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 05:14:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.960')] +[2023-10-14 05:14:38,640][100917] Updated weights for policy 1, policy_version 5672 (0.0008) +[2023-10-14 05:14:39,023][100917] Updated weights for policy 1, policy_version 5682 (0.0008) +[2023-10-14 05:14:39,392][100917] Updated weights for policy 1, policy_version 5692 (0.0007) +[2023-10-14 05:14:40,058][100936] Updated weights for policy 0, policy_version 5700 (0.0008) +[2023-10-14 05:14:40,427][100936] Updated weights for policy 0, policy_version 5710 (0.0007) +[2023-10-14 05:14:40,793][100936] Updated weights for policy 0, policy_version 5720 (0.0008) +[2023-10-14 05:14:43,474][100917] Updated weights for policy 1, policy_version 5702 (0.0009) +[2023-10-14 05:14:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 11698176. Throughput: 0: 1652.8, 1: 1646.8. Samples: 2932342. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 05:14:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.960')] +[2023-10-14 05:14:43,845][100917] Updated weights for policy 1, policy_version 5712 (0.0009) +[2023-10-14 05:14:44,209][100917] Updated weights for policy 1, policy_version 5722 (0.0008) +[2023-10-14 05:14:44,938][100936] Updated weights for policy 0, policy_version 5730 (0.0008) +[2023-10-14 05:14:45,303][100936] Updated weights for policy 0, policy_version 5740 (0.0009) +[2023-10-14 05:14:45,679][100936] Updated weights for policy 0, policy_version 5750 (0.0008) +[2023-10-14 05:14:46,051][100936] Updated weights for policy 0, policy_version 5760 (0.0008) +[2023-10-14 05:14:48,260][100917] Updated weights for policy 1, policy_version 5732 (0.0010) +[2023-10-14 05:14:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 11763712. Throughput: 0: 1651.3, 1: 1638.6. Samples: 2952482. Policy #0 lag: (min: 31.0, avg: 31.6, max: 48.0) +[2023-10-14 05:14:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.960')] +[2023-10-14 05:14:48,638][100917] Updated weights for policy 1, policy_version 5742 (0.0010) +[2023-10-14 05:14:49,006][100917] Updated weights for policy 1, policy_version 5752 (0.0009) +[2023-10-14 05:14:50,366][100936] Updated weights for policy 0, policy_version 5770 (0.0007) +[2023-10-14 05:14:50,734][100936] Updated weights for policy 0, policy_version 5780 (0.0008) +[2023-10-14 05:14:51,093][100936] Updated weights for policy 0, policy_version 5790 (0.0009) +[2023-10-14 05:14:53,011][100917] Updated weights for policy 1, policy_version 5762 (0.0010) +[2023-10-14 05:14:53,386][100917] Updated weights for policy 1, policy_version 5772 (0.0010) +[2023-10-14 05:14:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 11829248. Throughput: 0: 1646.0, 1: 1640.4. Samples: 2972828. Policy #0 lag: (min: 31.0, avg: 31.6, max: 48.0) +[2023-10-14 05:14:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.960')] +[2023-10-14 05:14:53,760][100917] Updated weights for policy 1, policy_version 5782 (0.0008) +[2023-10-14 05:14:54,130][100917] Updated weights for policy 1, policy_version 5792 (0.0008) +[2023-10-14 05:14:55,127][100936] Updated weights for policy 0, policy_version 5800 (0.0008) +[2023-10-14 05:14:55,496][100936] Updated weights for policy 0, policy_version 5810 (0.0007) +[2023-10-14 05:14:55,869][100936] Updated weights for policy 0, policy_version 5820 (0.0007) +[2023-10-14 05:14:58,298][100917] Updated weights for policy 1, policy_version 5802 (0.0007) +[2023-10-14 05:14:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 11894784. Throughput: 0: 1647.8, 1: 1639.8. Samples: 2981726. Policy #0 lag: (min: 31.0, avg: 32.0, max: 54.0) +[2023-10-14 05:14:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:14:58,668][100917] Updated weights for policy 1, policy_version 5812 (0.0007) +[2023-10-14 05:14:59,043][100917] Updated weights for policy 1, policy_version 5822 (0.0008) +[2023-10-14 05:14:59,839][100936] Updated weights for policy 0, policy_version 5830 (0.0007) +[2023-10-14 05:15:00,219][100936] Updated weights for policy 0, policy_version 5840 (0.0008) +[2023-10-14 05:15:00,578][100936] Updated weights for policy 0, policy_version 5850 (0.0008) +[2023-10-14 05:15:03,145][100917] Updated weights for policy 1, policy_version 5832 (0.0009) +[2023-10-14 05:15:03,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 11960320. Throughput: 0: 1643.6, 1: 1643.3. Samples: 3001962. Policy #0 lag: (min: 31.0, avg: 32.0, max: 54.0) +[2023-10-14 05:15:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:15:03,526][100917] Updated weights for policy 1, policy_version 5842 (0.0007) +[2023-10-14 05:15:03,887][100917] Updated weights for policy 1, policy_version 5852 (0.0009) +[2023-10-14 05:15:04,565][100936] Updated weights for policy 0, policy_version 5860 (0.0008) +[2023-10-14 05:15:04,940][100936] Updated weights for policy 0, policy_version 5870 (0.0008) +[2023-10-14 05:15:05,307][100936] Updated weights for policy 0, policy_version 5880 (0.0008) +[2023-10-14 05:15:08,197][100917] Updated weights for policy 1, policy_version 5862 (0.0008) +[2023-10-14 05:15:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 12025856. Throughput: 0: 1642.9, 1: 1644.7. Samples: 3022260. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:15:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:15:08,573][100917] Updated weights for policy 1, policy_version 5872 (0.0008) +[2023-10-14 05:15:08,943][100917] Updated weights for policy 1, policy_version 5882 (0.0007) +[2023-10-14 05:15:09,474][100936] Updated weights for policy 0, policy_version 5890 (0.0009) +[2023-10-14 05:15:09,838][100936] Updated weights for policy 0, policy_version 5900 (0.0007) +[2023-10-14 05:15:10,210][100936] Updated weights for policy 0, policy_version 5910 (0.0008) +[2023-10-14 05:15:10,586][100936] Updated weights for policy 0, policy_version 5920 (0.0010) +[2023-10-14 05:15:13,115][100917] Updated weights for policy 1, policy_version 5892 (0.0008) +[2023-10-14 05:15:13,497][100917] Updated weights for policy 1, policy_version 5902 (0.0010) +[2023-10-14 05:15:13,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 12091392. Throughput: 0: 1643.2, 1: 1646.1. Samples: 3031262. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:15:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.910')] +[2023-10-14 05:15:13,862][100917] Updated weights for policy 1, policy_version 5912 (0.0008) +[2023-10-14 05:15:14,795][100936] Updated weights for policy 0, policy_version 5930 (0.0009) +[2023-10-14 05:15:15,173][100936] Updated weights for policy 0, policy_version 5940 (0.0009) +[2023-10-14 05:15:15,550][100936] Updated weights for policy 0, policy_version 5950 (0.0009) +[2023-10-14 05:15:17,857][100917] Updated weights for policy 1, policy_version 5922 (0.0011) +[2023-10-14 05:15:18,237][100917] Updated weights for policy 1, policy_version 5932 (0.0008) +[2023-10-14 05:15:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 12156928. Throughput: 0: 1646.1, 1: 1654.5. Samples: 3051794. Policy #0 lag: (min: 17.0, avg: 21.5, max: 49.0) +[2023-10-14 05:15:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.910')] +[2023-10-14 05:15:18,609][100917] Updated weights for policy 1, policy_version 5942 (0.0010) +[2023-10-14 05:15:18,980][100917] Updated weights for policy 1, policy_version 5952 (0.0008) +[2023-10-14 05:15:19,757][100936] Updated weights for policy 0, policy_version 5960 (0.0009) +[2023-10-14 05:15:20,126][100936] Updated weights for policy 0, policy_version 5970 (0.0007) +[2023-10-14 05:15:20,495][100936] Updated weights for policy 0, policy_version 5980 (0.0009) +[2023-10-14 05:15:23,020][100917] Updated weights for policy 1, policy_version 5962 (0.0009) +[2023-10-14 05:15:23,393][100917] Updated weights for policy 1, policy_version 5972 (0.0011) +[2023-10-14 05:15:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 12222464. Throughput: 0: 1655.6, 1: 1654.8. Samples: 3072300. Policy #0 lag: (min: 17.0, avg: 21.5, max: 49.0) +[2023-10-14 05:15:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.860')] +[2023-10-14 05:15:23,770][100917] Updated weights for policy 1, policy_version 5982 (0.0007) +[2023-10-14 05:15:24,657][100936] Updated weights for policy 0, policy_version 5990 (0.0008) +[2023-10-14 05:15:25,036][100936] Updated weights for policy 0, policy_version 6000 (0.0009) +[2023-10-14 05:15:25,415][100936] Updated weights for policy 0, policy_version 6010 (0.0007) +[2023-10-14 05:15:27,946][100917] Updated weights for policy 1, policy_version 5992 (0.0008) +[2023-10-14 05:15:28,332][100917] Updated weights for policy 1, policy_version 6002 (0.0007) +[2023-10-14 05:15:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 12288000. Throughput: 0: 1652.0, 1: 1660.7. Samples: 3081412. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:15:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.860')] +[2023-10-14 05:15:28,709][100917] Updated weights for policy 1, policy_version 6012 (0.0007) +[2023-10-14 05:15:29,444][100936] Updated weights for policy 0, policy_version 6020 (0.0007) +[2023-10-14 05:15:29,811][100936] Updated weights for policy 0, policy_version 6030 (0.0008) +[2023-10-14 05:15:30,193][100936] Updated weights for policy 0, policy_version 6040 (0.0007) +[2023-10-14 05:15:32,920][100917] Updated weights for policy 1, policy_version 6022 (0.0008) +[2023-10-14 05:15:33,301][100917] Updated weights for policy 1, policy_version 6032 (0.0007) +[2023-10-14 05:15:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 12353536. Throughput: 0: 1655.2, 1: 1656.3. Samples: 3101502. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:15:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.860')] +[2023-10-14 05:15:33,679][100917] Updated weights for policy 1, policy_version 6042 (0.0009) +[2023-10-14 05:15:34,522][100936] Updated weights for policy 0, policy_version 6050 (0.0008) +[2023-10-14 05:15:34,899][100936] Updated weights for policy 0, policy_version 6060 (0.0008) +[2023-10-14 05:15:35,269][100936] Updated weights for policy 0, policy_version 6070 (0.0008) +[2023-10-14 05:15:35,644][100936] Updated weights for policy 0, policy_version 6080 (0.0008) +[2023-10-14 05:15:37,955][100917] Updated weights for policy 1, policy_version 6052 (0.0010) +[2023-10-14 05:15:38,336][100917] Updated weights for policy 1, policy_version 6062 (0.0008) +[2023-10-14 05:15:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 12419072. Throughput: 0: 1658.7, 1: 1647.8. Samples: 3121624. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) +[2023-10-14 05:15:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:15:38,520][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000006080_6225920.pth... +[2023-10-14 05:15:38,549][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000004544_4653056.pth +[2023-10-14 05:15:38,701][100917] Updated weights for policy 1, policy_version 6072 (0.0009) +[2023-10-14 05:15:39,005][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000006080_6225920.pth... +[2023-10-14 05:15:39,035][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000004512_4620288.pth +[2023-10-14 05:15:39,630][100936] Updated weights for policy 0, policy_version 6090 (0.0007) +[2023-10-14 05:15:39,993][100936] Updated weights for policy 0, policy_version 6100 (0.0010) +[2023-10-14 05:15:40,359][100936] Updated weights for policy 0, policy_version 6110 (0.0008) +[2023-10-14 05:15:42,641][100917] Updated weights for policy 1, policy_version 6082 (0.0007) +[2023-10-14 05:15:43,023][100917] Updated weights for policy 1, policy_version 6092 (0.0007) +[2023-10-14 05:15:43,396][100917] Updated weights for policy 1, policy_version 6102 (0.0007) +[2023-10-14 05:15:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 12484608. Throughput: 0: 1659.7, 1: 1655.8. Samples: 3130924. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) +[2023-10-14 05:15:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:15:43,768][100917] Updated weights for policy 1, policy_version 6112 (0.0008) +[2023-10-14 05:15:44,450][100936] Updated weights for policy 0, policy_version 6120 (0.0008) +[2023-10-14 05:15:44,838][100936] Updated weights for policy 0, policy_version 6130 (0.0009) +[2023-10-14 05:15:45,210][100936] Updated weights for policy 0, policy_version 6140 (0.0011) +[2023-10-14 05:15:47,944][100917] Updated weights for policy 1, policy_version 6122 (0.0007) +[2023-10-14 05:15:48,323][100917] Updated weights for policy 1, policy_version 6132 (0.0008) +[2023-10-14 05:15:48,512][99942] Fps is (10 sec: 13107.7, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 12550144. Throughput: 0: 1657.8, 1: 1657.5. Samples: 3151150. Policy #0 lag: (min: 10.0, avg: 10.0, max: 13.0) +[2023-10-14 05:15:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:15:48,699][100917] Updated weights for policy 1, policy_version 6142 (0.0009) +[2023-10-14 05:15:49,447][100936] Updated weights for policy 0, policy_version 6150 (0.0008) +[2023-10-14 05:15:49,819][100936] Updated weights for policy 0, policy_version 6160 (0.0010) +[2023-10-14 05:15:50,188][100936] Updated weights for policy 0, policy_version 6170 (0.0008) +[2023-10-14 05:15:52,883][100917] Updated weights for policy 1, policy_version 6152 (0.0009) +[2023-10-14 05:15:53,257][100917] Updated weights for policy 1, policy_version 6162 (0.0007) +[2023-10-14 05:15:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 12615680. Throughput: 0: 1651.9, 1: 1652.0. Samples: 3170932. Policy #0 lag: (min: 10.0, avg: 10.0, max: 13.0) +[2023-10-14 05:15:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:15:53,632][100917] Updated weights for policy 1, policy_version 6172 (0.0008) +[2023-10-14 05:15:54,407][100936] Updated weights for policy 0, policy_version 6180 (0.0010) +[2023-10-14 05:15:54,776][100936] Updated weights for policy 0, policy_version 6190 (0.0007) +[2023-10-14 05:15:55,149][100936] Updated weights for policy 0, policy_version 6200 (0.0007) +[2023-10-14 05:15:57,695][100917] Updated weights for policy 1, policy_version 6182 (0.0008) +[2023-10-14 05:15:58,067][100917] Updated weights for policy 1, policy_version 6192 (0.0007) +[2023-10-14 05:15:58,441][100917] Updated weights for policy 1, policy_version 6202 (0.0010) +[2023-10-14 05:15:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 12681216. Throughput: 0: 1650.8, 1: 1660.4. Samples: 3180268. Policy #0 lag: (min: 30.0, avg: 34.5, max: 62.0) +[2023-10-14 05:15:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:15:59,356][100936] Updated weights for policy 0, policy_version 6210 (0.0007) +[2023-10-14 05:15:59,731][100936] Updated weights for policy 0, policy_version 6220 (0.0008) +[2023-10-14 05:16:00,099][100936] Updated weights for policy 0, policy_version 6230 (0.0010) +[2023-10-14 05:16:00,476][100936] Updated weights for policy 0, policy_version 6240 (0.0008) +[2023-10-14 05:16:02,556][100917] Updated weights for policy 1, policy_version 6212 (0.0009) +[2023-10-14 05:16:02,932][100917] Updated weights for policy 1, policy_version 6222 (0.0009) +[2023-10-14 05:16:03,300][100917] Updated weights for policy 1, policy_version 6232 (0.0007) +[2023-10-14 05:16:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 12746752. Throughput: 0: 1647.4, 1: 1660.8. Samples: 3200666. Policy #0 lag: (min: 30.0, avg: 34.5, max: 62.0) +[2023-10-14 05:16:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:16:04,666][100936] Updated weights for policy 0, policy_version 6250 (0.0007) +[2023-10-14 05:16:05,035][100936] Updated weights for policy 0, policy_version 6260 (0.0010) +[2023-10-14 05:16:05,408][100936] Updated weights for policy 0, policy_version 6270 (0.0010) +[2023-10-14 05:16:07,440][100917] Updated weights for policy 1, policy_version 6242 (0.0008) +[2023-10-14 05:16:07,811][100917] Updated weights for policy 1, policy_version 6252 (0.0007) +[2023-10-14 05:16:08,186][100917] Updated weights for policy 1, policy_version 6262 (0.0007) +[2023-10-14 05:16:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 12812288. Throughput: 0: 1646.6, 1: 1644.7. Samples: 3220408. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:16:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:16:08,556][100917] Updated weights for policy 1, policy_version 6272 (0.0010) +[2023-10-14 05:16:09,378][100936] Updated weights for policy 0, policy_version 6280 (0.0011) +[2023-10-14 05:16:09,754][100936] Updated weights for policy 0, policy_version 6290 (0.0011) +[2023-10-14 05:16:10,131][100936] Updated weights for policy 0, policy_version 6300 (0.0009) +[2023-10-14 05:16:12,660][100917] Updated weights for policy 1, policy_version 6282 (0.0011) +[2023-10-14 05:16:13,030][100917] Updated weights for policy 1, policy_version 6292 (0.0010) +[2023-10-14 05:16:13,400][100917] Updated weights for policy 1, policy_version 6302 (0.0010) +[2023-10-14 05:16:13,512][99942] Fps is (10 sec: 16384.3, 60 sec: 13653.3, 300 sec: 13218.4). Total num frames: 12910592. Throughput: 0: 1647.6, 1: 1648.3. Samples: 3229726. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:16:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:16:14,375][100936] Updated weights for policy 0, policy_version 6310 (0.0008) +[2023-10-14 05:16:14,739][100936] Updated weights for policy 0, policy_version 6320 (0.0009) +[2023-10-14 05:16:15,121][100936] Updated weights for policy 0, policy_version 6330 (0.0008) +[2023-10-14 05:16:17,614][100917] Updated weights for policy 1, policy_version 6312 (0.0009) +[2023-10-14 05:16:17,993][100917] Updated weights for policy 1, policy_version 6322 (0.0008) +[2023-10-14 05:16:18,369][100917] Updated weights for policy 1, policy_version 6332 (0.0008) +[2023-10-14 05:16:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 12943360. Throughput: 0: 1646.6, 1: 1661.7. Samples: 3250374. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:16:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:16:19,280][100936] Updated weights for policy 0, policy_version 6340 (0.0007) +[2023-10-14 05:16:19,651][100936] Updated weights for policy 0, policy_version 6350 (0.0009) +[2023-10-14 05:16:20,022][100936] Updated weights for policy 0, policy_version 6360 (0.0011) +[2023-10-14 05:16:22,637][100917] Updated weights for policy 1, policy_version 6342 (0.0008) +[2023-10-14 05:16:23,004][100917] Updated weights for policy 1, policy_version 6352 (0.0007) +[2023-10-14 05:16:23,387][100917] Updated weights for policy 1, policy_version 6362 (0.0008) +[2023-10-14 05:16:23,512][99942] Fps is (10 sec: 9830.0, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 13008896. Throughput: 0: 1652.9, 1: 1652.3. Samples: 3270358. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:16:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:16:24,059][100936] Updated weights for policy 0, policy_version 6370 (0.0008) +[2023-10-14 05:16:24,463][100936] Updated weights for policy 0, policy_version 6380 (0.0008) +[2023-10-14 05:16:24,832][100936] Updated weights for policy 0, policy_version 6390 (0.0010) +[2023-10-14 05:16:25,206][100936] Updated weights for policy 0, policy_version 6400 (0.0008) +[2023-10-14 05:16:27,397][100917] Updated weights for policy 1, policy_version 6372 (0.0010) +[2023-10-14 05:16:27,766][100917] Updated weights for policy 1, policy_version 6382 (0.0010) +[2023-10-14 05:16:28,147][100917] Updated weights for policy 1, policy_version 6392 (0.0011) +[2023-10-14 05:16:28,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 13107200. Throughput: 0: 1649.7, 1: 1658.5. Samples: 3279792. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:16:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:16:29,316][100936] Updated weights for policy 0, policy_version 6410 (0.0010) +[2023-10-14 05:16:29,689][100936] Updated weights for policy 0, policy_version 6420 (0.0010) +[2023-10-14 05:16:30,067][100936] Updated weights for policy 0, policy_version 6430 (0.0009) +[2023-10-14 05:16:32,414][100917] Updated weights for policy 1, policy_version 6402 (0.0009) +[2023-10-14 05:16:32,789][100917] Updated weights for policy 1, policy_version 6412 (0.0009) +[2023-10-14 05:16:33,170][100917] Updated weights for policy 1, policy_version 6422 (0.0009) +[2023-10-14 05:16:33,512][99942] Fps is (10 sec: 13107.8, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 13139968. Throughput: 0: 1657.3, 1: 1657.0. Samples: 3300292. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:16:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:16:33,539][100917] Updated weights for policy 1, policy_version 6432 (0.0008) +[2023-10-14 05:16:34,138][100936] Updated weights for policy 0, policy_version 6440 (0.0009) +[2023-10-14 05:16:34,520][100936] Updated weights for policy 0, policy_version 6450 (0.0009) +[2023-10-14 05:16:34,901][100936] Updated weights for policy 0, policy_version 6460 (0.0007) +[2023-10-14 05:16:37,607][100917] Updated weights for policy 1, policy_version 6442 (0.0010) +[2023-10-14 05:16:37,974][100917] Updated weights for policy 1, policy_version 6452 (0.0010) +[2023-10-14 05:16:38,356][100917] Updated weights for policy 1, policy_version 6462 (0.0007) +[2023-10-14 05:16:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 13238272. Throughput: 0: 1666.0, 1: 1649.2. Samples: 3320116. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:16:38,514][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:16:38,883][100936] Updated weights for policy 0, policy_version 6470 (0.0009) +[2023-10-14 05:16:39,257][100936] Updated weights for policy 0, policy_version 6480 (0.0010) +[2023-10-14 05:16:39,636][100936] Updated weights for policy 0, policy_version 6490 (0.0009) +[2023-10-14 05:16:42,407][100917] Updated weights for policy 1, policy_version 6472 (0.0008) +[2023-10-14 05:16:42,790][100917] Updated weights for policy 1, policy_version 6482 (0.0009) +[2023-10-14 05:16:43,161][100917] Updated weights for policy 1, policy_version 6492 (0.0008) +[2023-10-14 05:16:43,512][99942] Fps is (10 sec: 16383.5, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 13303808. Throughput: 0: 1664.3, 1: 1657.1. Samples: 3329730. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:16:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:16:43,801][100936] Updated weights for policy 0, policy_version 6500 (0.0007) +[2023-10-14 05:16:44,167][100936] Updated weights for policy 0, policy_version 6510 (0.0010) +[2023-10-14 05:16:44,540][100936] Updated weights for policy 0, policy_version 6520 (0.0010) +[2023-10-14 05:16:47,444][100917] Updated weights for policy 1, policy_version 6502 (0.0008) +[2023-10-14 05:16:47,805][100917] Updated weights for policy 1, policy_version 6512 (0.0007) +[2023-10-14 05:16:48,173][100917] Updated weights for policy 1, policy_version 6522 (0.0008) +[2023-10-14 05:16:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 13369344. Throughput: 0: 1663.9, 1: 1646.9. Samples: 3349650. Policy #0 lag: (min: 0.0, avg: 24.1, max: 32.0) +[2023-10-14 05:16:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:16:48,724][100936] Updated weights for policy 0, policy_version 6530 (0.0010) +[2023-10-14 05:16:49,101][100936] Updated weights for policy 0, policy_version 6540 (0.0007) +[2023-10-14 05:16:49,471][100936] Updated weights for policy 0, policy_version 6550 (0.0007) +[2023-10-14 05:16:49,850][100936] Updated weights for policy 0, policy_version 6560 (0.0010) +[2023-10-14 05:16:52,307][100917] Updated weights for policy 1, policy_version 6532 (0.0008) +[2023-10-14 05:16:52,686][100917] Updated weights for policy 1, policy_version 6542 (0.0009) +[2023-10-14 05:16:53,068][100917] Updated weights for policy 1, policy_version 6552 (0.0007) +[2023-10-14 05:16:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 13434880. Throughput: 0: 1663.9, 1: 1650.1. Samples: 3369538. Policy #0 lag: (min: 0.0, avg: 24.1, max: 32.0) +[2023-10-14 05:16:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:16:53,985][100936] Updated weights for policy 0, policy_version 6570 (0.0009) +[2023-10-14 05:16:54,357][100936] Updated weights for policy 0, policy_version 6580 (0.0009) +[2023-10-14 05:16:54,731][100936] Updated weights for policy 0, policy_version 6590 (0.0009) +[2023-10-14 05:16:57,020][100917] Updated weights for policy 1, policy_version 6562 (0.0009) +[2023-10-14 05:16:57,399][100917] Updated weights for policy 1, policy_version 6572 (0.0009) +[2023-10-14 05:16:57,785][100917] Updated weights for policy 1, policy_version 6582 (0.0010) +[2023-10-14 05:16:58,155][100917] Updated weights for policy 1, policy_version 6592 (0.0009) +[2023-10-14 05:16:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 13500416. Throughput: 0: 1667.2, 1: 1658.8. Samples: 3379398. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 05:16:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:16:58,695][100936] Updated weights for policy 0, policy_version 6600 (0.0009) +[2023-10-14 05:16:59,061][100936] Updated weights for policy 0, policy_version 6610 (0.0009) +[2023-10-14 05:16:59,440][100936] Updated weights for policy 0, policy_version 6620 (0.0011) +[2023-10-14 05:17:02,447][100917] Updated weights for policy 1, policy_version 6602 (0.0008) +[2023-10-14 05:17:02,820][100917] Updated weights for policy 1, policy_version 6612 (0.0011) +[2023-10-14 05:17:03,199][100917] Updated weights for policy 1, policy_version 6622 (0.0008) +[2023-10-14 05:17:03,398][100936] Updated weights for policy 0, policy_version 6630 (0.0009) +[2023-10-14 05:17:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 13565952. Throughput: 0: 1671.0, 1: 1651.6. Samples: 3399892. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 05:17:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:17:03,778][100936] Updated weights for policy 0, policy_version 6640 (0.0009) +[2023-10-14 05:17:04,139][100936] Updated weights for policy 0, policy_version 6650 (0.0011) +[2023-10-14 05:17:07,300][100917] Updated weights for policy 1, policy_version 6632 (0.0011) +[2023-10-14 05:17:07,665][100917] Updated weights for policy 1, policy_version 6642 (0.0007) +[2023-10-14 05:17:08,047][100917] Updated weights for policy 1, policy_version 6652 (0.0010) +[2023-10-14 05:17:08,250][100936] Updated weights for policy 0, policy_version 6660 (0.0008) +[2023-10-14 05:17:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 13631488. Throughput: 0: 1667.3, 1: 1640.9. Samples: 3419228. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) +[2023-10-14 05:17:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:17:08,603][100936] Updated weights for policy 0, policy_version 6670 (0.0008) +[2023-10-14 05:17:08,981][100936] Updated weights for policy 0, policy_version 6680 (0.0007) +[2023-10-14 05:17:12,252][100917] Updated weights for policy 1, policy_version 6662 (0.0008) +[2023-10-14 05:17:12,627][100917] Updated weights for policy 1, policy_version 6672 (0.0007) +[2023-10-14 05:17:13,012][100917] Updated weights for policy 1, policy_version 6682 (0.0007) +[2023-10-14 05:17:13,184][100936] Updated weights for policy 0, policy_version 6690 (0.0007) +[2023-10-14 05:17:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 13697024. Throughput: 0: 1673.4, 1: 1652.3. Samples: 3429448. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) +[2023-10-14 05:17:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 05:17:13,579][100936] Updated weights for policy 0, policy_version 6700 (0.0009) +[2023-10-14 05:17:13,951][100936] Updated weights for policy 0, policy_version 6710 (0.0008) +[2023-10-14 05:17:14,321][100936] Updated weights for policy 0, policy_version 6720 (0.0008) +[2023-10-14 05:17:17,022][100917] Updated weights for policy 1, policy_version 6692 (0.0008) +[2023-10-14 05:17:17,409][100917] Updated weights for policy 1, policy_version 6702 (0.0008) +[2023-10-14 05:17:17,771][100917] Updated weights for policy 1, policy_version 6712 (0.0009) +[2023-10-14 05:17:18,483][100936] Updated weights for policy 0, policy_version 6730 (0.0011) +[2023-10-14 05:17:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 13762560. Throughput: 0: 1667.1, 1: 1650.4. Samples: 3449576. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:17:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 05:17:18,859][100936] Updated weights for policy 0, policy_version 6740 (0.0008) +[2023-10-14 05:17:19,224][100936] Updated weights for policy 0, policy_version 6750 (0.0009) +[2023-10-14 05:17:21,980][100917] Updated weights for policy 1, policy_version 6722 (0.0010) +[2023-10-14 05:17:22,360][100917] Updated weights for policy 1, policy_version 6732 (0.0008) +[2023-10-14 05:17:22,731][100917] Updated weights for policy 1, policy_version 6742 (0.0007) +[2023-10-14 05:17:23,113][100917] Updated weights for policy 1, policy_version 6752 (0.0007) +[2023-10-14 05:17:23,265][100936] Updated weights for policy 0, policy_version 6760 (0.0009) +[2023-10-14 05:17:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 13828096. Throughput: 0: 1653.5, 1: 1644.4. Samples: 3468518. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:17:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.900')] +[2023-10-14 05:17:23,635][100936] Updated weights for policy 0, policy_version 6770 (0.0008) +[2023-10-14 05:17:24,006][100936] Updated weights for policy 0, policy_version 6780 (0.0009) +[2023-10-14 05:17:27,323][100917] Updated weights for policy 1, policy_version 6762 (0.0008) +[2023-10-14 05:17:27,696][100917] Updated weights for policy 1, policy_version 6772 (0.0009) +[2023-10-14 05:17:27,990][100936] Updated weights for policy 0, policy_version 6790 (0.0008) +[2023-10-14 05:17:28,071][100917] Updated weights for policy 1, policy_version 6782 (0.0009) +[2023-10-14 05:17:28,362][100936] Updated weights for policy 0, policy_version 6800 (0.0009) +[2023-10-14 05:17:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 13893632. Throughput: 0: 1668.1, 1: 1648.0. Samples: 3478954. Policy #0 lag: (min: 31.0, avg: 31.9, max: 47.0) +[2023-10-14 05:17:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.900')] +[2023-10-14 05:17:28,736][100936] Updated weights for policy 0, policy_version 6810 (0.0010) +[2023-10-14 05:17:32,054][100917] Updated weights for policy 1, policy_version 6792 (0.0009) +[2023-10-14 05:17:32,422][100917] Updated weights for policy 1, policy_version 6802 (0.0010) +[2023-10-14 05:17:32,799][100917] Updated weights for policy 1, policy_version 6812 (0.0007) +[2023-10-14 05:17:32,899][100936] Updated weights for policy 0, policy_version 6820 (0.0008) +[2023-10-14 05:17:33,274][100936] Updated weights for policy 0, policy_version 6830 (0.0008) +[2023-10-14 05:17:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 13959168. Throughput: 0: 1673.4, 1: 1650.5. Samples: 3499226. Policy #0 lag: (min: 31.0, avg: 31.9, max: 47.0) +[2023-10-14 05:17:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.900')] +[2023-10-14 05:17:33,650][100936] Updated weights for policy 0, policy_version 6840 (0.0008) +[2023-10-14 05:17:37,013][100917] Updated weights for policy 1, policy_version 6822 (0.0007) +[2023-10-14 05:17:37,379][100917] Updated weights for policy 1, policy_version 6832 (0.0007) +[2023-10-14 05:17:37,753][100917] Updated weights for policy 1, policy_version 6842 (0.0007) +[2023-10-14 05:17:37,842][100936] Updated weights for policy 0, policy_version 6850 (0.0008) +[2023-10-14 05:17:38,217][100936] Updated weights for policy 0, policy_version 6860 (0.0009) +[2023-10-14 05:17:38,512][99942] Fps is (10 sec: 13106.6, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 14024704. Throughput: 0: 1656.0, 1: 1643.0. Samples: 3517992. Policy #0 lag: (min: 14.0, avg: 16.8, max: 46.0) +[2023-10-14 05:17:38,514][99942] Avg episode reward: [(0, '1.000'), (1, '0.910')] +[2023-10-14 05:17:38,524][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000006848_7012352.pth... +[2023-10-14 05:17:38,562][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000005312_5439488.pth +[2023-10-14 05:17:38,594][100936] Updated weights for policy 0, policy_version 6870 (0.0008) +[2023-10-14 05:17:38,962][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000006880_7045120.pth... +[2023-10-14 05:17:38,966][100936] Updated weights for policy 0, policy_version 6880 (0.0009) +[2023-10-14 05:17:38,991][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000005312_5439488.pth +[2023-10-14 05:17:41,492][100917] Updated weights for policy 1, policy_version 6852 (0.0007) +[2023-10-14 05:17:41,864][100917] Updated weights for policy 1, policy_version 6862 (0.0008) +[2023-10-14 05:17:42,242][100917] Updated weights for policy 1, policy_version 6872 (0.0010) +[2023-10-14 05:17:43,074][100936] Updated weights for policy 0, policy_version 6890 (0.0009) +[2023-10-14 05:17:43,453][100936] Updated weights for policy 0, policy_version 6900 (0.0008) +[2023-10-14 05:17:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 14090240. Throughput: 0: 1670.3, 1: 1658.0. Samples: 3529172. Policy #0 lag: (min: 14.0, avg: 16.8, max: 46.0) +[2023-10-14 05:17:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.910')] +[2023-10-14 05:17:43,819][100936] Updated weights for policy 0, policy_version 6910 (0.0008) +[2023-10-14 05:17:46,402][100917] Updated weights for policy 1, policy_version 6882 (0.0010) +[2023-10-14 05:17:46,819][100917] Updated weights for policy 1, policy_version 6892 (0.0009) +[2023-10-14 05:17:47,188][100917] Updated weights for policy 1, policy_version 6902 (0.0011) +[2023-10-14 05:17:47,567][100917] Updated weights for policy 1, policy_version 6912 (0.0009) +[2023-10-14 05:17:47,799][100936] Updated weights for policy 0, policy_version 6920 (0.0009) +[2023-10-14 05:17:48,176][100936] Updated weights for policy 0, policy_version 6930 (0.0008) +[2023-10-14 05:17:48,512][99942] Fps is (10 sec: 13107.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 14155776. Throughput: 0: 1669.4, 1: 1646.8. Samples: 3549120. Policy #0 lag: (min: 24.0, avg: 47.1, max: 56.0) +[2023-10-14 05:17:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.910')] +[2023-10-14 05:17:48,540][100936] Updated weights for policy 0, policy_version 6940 (0.0009) +[2023-10-14 05:17:51,595][100917] Updated weights for policy 1, policy_version 6922 (0.0008) +[2023-10-14 05:17:51,962][100917] Updated weights for policy 1, policy_version 6932 (0.0007) +[2023-10-14 05:17:52,334][100917] Updated weights for policy 1, policy_version 6942 (0.0007) +[2023-10-14 05:17:52,866][100936] Updated weights for policy 0, policy_version 6950 (0.0010) +[2023-10-14 05:17:53,233][100936] Updated weights for policy 0, policy_version 6960 (0.0008) +[2023-10-14 05:17:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 14221312. Throughput: 0: 1648.3, 1: 1666.7. Samples: 3568402. Policy #0 lag: (min: 24.0, avg: 47.1, max: 56.0) +[2023-10-14 05:17:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.910')] +[2023-10-14 05:17:53,600][100936] Updated weights for policy 0, policy_version 6970 (0.0007) +[2023-10-14 05:17:56,355][100917] Updated weights for policy 1, policy_version 6952 (0.0009) +[2023-10-14 05:17:56,732][100917] Updated weights for policy 1, policy_version 6962 (0.0009) +[2023-10-14 05:17:57,097][100917] Updated weights for policy 1, policy_version 6972 (0.0009) +[2023-10-14 05:17:57,648][100936] Updated weights for policy 0, policy_version 6980 (0.0008) +[2023-10-14 05:17:58,038][100936] Updated weights for policy 0, policy_version 6990 (0.0010) +[2023-10-14 05:17:58,406][100936] Updated weights for policy 0, policy_version 7000 (0.0010) +[2023-10-14 05:17:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 14286848. Throughput: 0: 1661.4, 1: 1670.7. Samples: 3579394. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:17:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.910')] +[2023-10-14 05:18:01,109][100917] Updated weights for policy 1, policy_version 6982 (0.0008) +[2023-10-14 05:18:01,483][100917] Updated weights for policy 1, policy_version 6992 (0.0009) +[2023-10-14 05:18:01,864][100917] Updated weights for policy 1, policy_version 7002 (0.0010) +[2023-10-14 05:18:02,580][100936] Updated weights for policy 0, policy_version 7010 (0.0008) +[2023-10-14 05:18:02,946][100936] Updated weights for policy 0, policy_version 7020 (0.0008) +[2023-10-14 05:18:03,313][100936] Updated weights for policy 0, policy_version 7030 (0.0009) +[2023-10-14 05:18:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 14352384. Throughput: 0: 1665.9, 1: 1652.8. Samples: 3598918. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:18:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:18:03,687][100936] Updated weights for policy 0, policy_version 7040 (0.0008) +[2023-10-14 05:18:05,993][100917] Updated weights for policy 1, policy_version 7012 (0.0008) +[2023-10-14 05:18:06,355][100917] Updated weights for policy 1, policy_version 7022 (0.0009) +[2023-10-14 05:18:06,744][100917] Updated weights for policy 1, policy_version 7032 (0.0008) +[2023-10-14 05:18:07,863][100936] Updated weights for policy 0, policy_version 7050 (0.0008) +[2023-10-14 05:18:08,232][100936] Updated weights for policy 0, policy_version 7060 (0.0008) +[2023-10-14 05:18:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 14417920. Throughput: 0: 1657.5, 1: 1672.8. Samples: 3618380. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:18:08,514][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:18:08,596][100936] Updated weights for policy 0, policy_version 7070 (0.0008) +[2023-10-14 05:18:11,012][100917] Updated weights for policy 1, policy_version 7042 (0.0009) +[2023-10-14 05:18:11,391][100917] Updated weights for policy 1, policy_version 7052 (0.0008) +[2023-10-14 05:18:11,770][100917] Updated weights for policy 1, policy_version 7062 (0.0009) +[2023-10-14 05:18:12,145][100917] Updated weights for policy 1, policy_version 7072 (0.0010) +[2023-10-14 05:18:12,593][100936] Updated weights for policy 0, policy_version 7080 (0.0009) +[2023-10-14 05:18:12,961][100936] Updated weights for policy 0, policy_version 7090 (0.0009) +[2023-10-14 05:18:13,332][100936] Updated weights for policy 0, policy_version 7100 (0.0011) +[2023-10-14 05:18:13,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 14516224. Throughput: 0: 1667.2, 1: 1680.1. Samples: 3629580. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:18:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:18:16,125][100917] Updated weights for policy 1, policy_version 7082 (0.0008) +[2023-10-14 05:18:16,502][100917] Updated weights for policy 1, policy_version 7092 (0.0010) +[2023-10-14 05:18:16,885][100917] Updated weights for policy 1, policy_version 7102 (0.0010) +[2023-10-14 05:18:17,498][100936] Updated weights for policy 0, policy_version 7110 (0.0008) +[2023-10-14 05:18:17,855][100936] Updated weights for policy 0, policy_version 7120 (0.0007) +[2023-10-14 05:18:18,234][100936] Updated weights for policy 0, policy_version 7130 (0.0010) +[2023-10-14 05:18:18,512][99942] Fps is (10 sec: 16384.7, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 14581760. Throughput: 0: 1661.8, 1: 1656.5. Samples: 3648548. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:18:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:18:20,988][100917] Updated weights for policy 1, policy_version 7112 (0.0007) +[2023-10-14 05:18:21,361][100917] Updated weights for policy 1, policy_version 7122 (0.0009) +[2023-10-14 05:18:21,730][100917] Updated weights for policy 1, policy_version 7132 (0.0007) +[2023-10-14 05:18:22,382][100936] Updated weights for policy 0, policy_version 7140 (0.0008) +[2023-10-14 05:18:22,752][100936] Updated weights for policy 0, policy_version 7150 (0.0007) +[2023-10-14 05:18:23,123][100936] Updated weights for policy 0, policy_version 7160 (0.0007) +[2023-10-14 05:18:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 14647296. Throughput: 0: 1653.1, 1: 1680.6. Samples: 3668010. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 05:18:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:18:25,840][100917] Updated weights for policy 1, policy_version 7142 (0.0009) +[2023-10-14 05:18:26,213][100917] Updated weights for policy 1, policy_version 7152 (0.0008) +[2023-10-14 05:18:26,584][100917] Updated weights for policy 1, policy_version 7162 (0.0008) +[2023-10-14 05:18:27,337][100936] Updated weights for policy 0, policy_version 7170 (0.0008) +[2023-10-14 05:18:27,705][100936] Updated weights for policy 0, policy_version 7180 (0.0010) +[2023-10-14 05:18:28,082][100936] Updated weights for policy 0, policy_version 7190 (0.0009) +[2023-10-14 05:18:28,445][100936] Updated weights for policy 0, policy_version 7200 (0.0009) +[2023-10-14 05:18:28,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 14712832. Throughput: 0: 1663.9, 1: 1663.5. Samples: 3678906. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 05:18:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:18:30,738][100917] Updated weights for policy 1, policy_version 7172 (0.0010) +[2023-10-14 05:18:31,106][100917] Updated weights for policy 1, policy_version 7182 (0.0008) +[2023-10-14 05:18:31,483][100917] Updated weights for policy 1, policy_version 7192 (0.0007) +[2023-10-14 05:18:32,432][100936] Updated weights for policy 0, policy_version 7210 (0.0007) +[2023-10-14 05:18:32,815][100936] Updated weights for policy 0, policy_version 7220 (0.0008) +[2023-10-14 05:18:33,188][100936] Updated weights for policy 0, policy_version 7230 (0.0011) +[2023-10-14 05:18:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 14778368. Throughput: 0: 1653.3, 1: 1659.5. Samples: 3698194. Policy #0 lag: (min: 29.0, avg: 35.6, max: 61.0) +[2023-10-14 05:18:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:18:35,618][100917] Updated weights for policy 1, policy_version 7202 (0.0008) +[2023-10-14 05:18:36,026][100917] Updated weights for policy 1, policy_version 7212 (0.0007) +[2023-10-14 05:18:36,394][100917] Updated weights for policy 1, policy_version 7222 (0.0007) +[2023-10-14 05:18:36,757][100917] Updated weights for policy 1, policy_version 7232 (0.0009) +[2023-10-14 05:18:37,444][100936] Updated weights for policy 0, policy_version 7240 (0.0008) +[2023-10-14 05:18:37,822][100936] Updated weights for policy 0, policy_version 7250 (0.0009) +[2023-10-14 05:18:38,202][100936] Updated weights for policy 0, policy_version 7260 (0.0010) +[2023-10-14 05:18:38,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.5, 300 sec: 13329.4). Total num frames: 14843904. Throughput: 0: 1652.5, 1: 1663.0. Samples: 3717598. Policy #0 lag: (min: 29.0, avg: 35.6, max: 61.0) +[2023-10-14 05:18:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:18:40,742][100917] Updated weights for policy 1, policy_version 7242 (0.0007) +[2023-10-14 05:18:41,118][100917] Updated weights for policy 1, policy_version 7252 (0.0007) +[2023-10-14 05:18:41,493][100917] Updated weights for policy 1, policy_version 7262 (0.0008) +[2023-10-14 05:18:42,204][100936] Updated weights for policy 0, policy_version 7270 (0.0007) +[2023-10-14 05:18:42,570][100936] Updated weights for policy 0, policy_version 7280 (0.0009) +[2023-10-14 05:18:42,946][100936] Updated weights for policy 0, policy_version 7290 (0.0008) +[2023-10-14 05:18:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 14909440. Throughput: 0: 1662.6, 1: 1650.6. Samples: 3728486. Policy #0 lag: (min: 1.0, avg: 10.6, max: 33.0) +[2023-10-14 05:18:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:18:45,587][100917] Updated weights for policy 1, policy_version 7272 (0.0008) +[2023-10-14 05:18:45,966][100917] Updated weights for policy 1, policy_version 7282 (0.0009) +[2023-10-14 05:18:46,344][100917] Updated weights for policy 1, policy_version 7292 (0.0010) +[2023-10-14 05:18:47,213][100936] Updated weights for policy 0, policy_version 7300 (0.0007) +[2023-10-14 05:18:47,592][100936] Updated weights for policy 0, policy_version 7310 (0.0008) +[2023-10-14 05:18:47,962][100936] Updated weights for policy 0, policy_version 7320 (0.0008) +[2023-10-14 05:18:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 14974976. Throughput: 0: 1653.1, 1: 1657.6. Samples: 3747900. Policy #0 lag: (min: 1.0, avg: 10.6, max: 33.0) +[2023-10-14 05:18:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:18:50,568][100917] Updated weights for policy 1, policy_version 7302 (0.0008) +[2023-10-14 05:18:50,938][100917] Updated weights for policy 1, policy_version 7312 (0.0008) +[2023-10-14 05:18:51,311][100917] Updated weights for policy 1, policy_version 7322 (0.0008) +[2023-10-14 05:18:52,192][100936] Updated weights for policy 0, policy_version 7330 (0.0009) +[2023-10-14 05:18:52,553][100936] Updated weights for policy 0, policy_version 7340 (0.0007) +[2023-10-14 05:18:52,932][100936] Updated weights for policy 0, policy_version 7350 (0.0007) +[2023-10-14 05:18:53,300][100936] Updated weights for policy 0, policy_version 7360 (0.0008) +[2023-10-14 05:18:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13329.3). Total num frames: 15040512. Throughput: 0: 1653.6, 1: 1656.3. Samples: 3767326. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) +[2023-10-14 05:18:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:18:55,478][100917] Updated weights for policy 1, policy_version 7332 (0.0008) +[2023-10-14 05:18:55,851][100917] Updated weights for policy 1, policy_version 7342 (0.0009) +[2023-10-14 05:18:56,227][100917] Updated weights for policy 1, policy_version 7352 (0.0009) +[2023-10-14 05:18:57,174][100936] Updated weights for policy 0, policy_version 7370 (0.0007) +[2023-10-14 05:18:57,548][100936] Updated weights for policy 0, policy_version 7380 (0.0009) +[2023-10-14 05:18:57,908][100936] Updated weights for policy 0, policy_version 7390 (0.0009) +[2023-10-14 05:18:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 15106048. Throughput: 0: 1662.4, 1: 1641.1. Samples: 3778238. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) +[2023-10-14 05:18:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:19:00,448][100917] Updated weights for policy 1, policy_version 7362 (0.0008) +[2023-10-14 05:19:00,819][100917] Updated weights for policy 1, policy_version 7372 (0.0009) +[2023-10-14 05:19:01,204][100917] Updated weights for policy 1, policy_version 7382 (0.0009) +[2023-10-14 05:19:01,573][100917] Updated weights for policy 1, policy_version 7392 (0.0008) +[2023-10-14 05:19:02,013][100936] Updated weights for policy 0, policy_version 7400 (0.0009) +[2023-10-14 05:19:02,381][100936] Updated weights for policy 0, policy_version 7410 (0.0010) +[2023-10-14 05:19:02,749][100936] Updated weights for policy 0, policy_version 7420 (0.0009) +[2023-10-14 05:19:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 15171584. Throughput: 0: 1652.3, 1: 1656.0. Samples: 3797418. Policy #0 lag: (min: 31.0, avg: 44.8, max: 63.0) +[2023-10-14 05:19:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:19:05,621][100917] Updated weights for policy 1, policy_version 7402 (0.0008) +[2023-10-14 05:19:06,000][100917] Updated weights for policy 1, policy_version 7412 (0.0009) +[2023-10-14 05:19:06,380][100917] Updated weights for policy 1, policy_version 7422 (0.0010) +[2023-10-14 05:19:06,839][100936] Updated weights for policy 0, policy_version 7430 (0.0008) +[2023-10-14 05:19:07,214][100936] Updated weights for policy 0, policy_version 7440 (0.0008) +[2023-10-14 05:19:07,590][100936] Updated weights for policy 0, policy_version 7450 (0.0007) +[2023-10-14 05:19:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 15237120. Throughput: 0: 1663.0, 1: 1656.0. Samples: 3817366. Policy #0 lag: (min: 31.0, avg: 44.8, max: 63.0) +[2023-10-14 05:19:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:19:10,380][100917] Updated weights for policy 1, policy_version 7432 (0.0008) +[2023-10-14 05:19:10,759][100917] Updated weights for policy 1, policy_version 7442 (0.0007) +[2023-10-14 05:19:11,138][100917] Updated weights for policy 1, policy_version 7452 (0.0008) +[2023-10-14 05:19:11,669][100936] Updated weights for policy 0, policy_version 7460 (0.0008) +[2023-10-14 05:19:12,040][100936] Updated weights for policy 0, policy_version 7470 (0.0008) +[2023-10-14 05:19:12,407][100936] Updated weights for policy 0, policy_version 7480 (0.0008) +[2023-10-14 05:19:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 15302656. Throughput: 0: 1663.7, 1: 1646.9. Samples: 3827880. Policy #0 lag: (min: 31.0, avg: 32.1, max: 54.0) +[2023-10-14 05:19:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:19:15,181][100917] Updated weights for policy 1, policy_version 7462 (0.0007) +[2023-10-14 05:19:15,557][100917] Updated weights for policy 1, policy_version 7472 (0.0007) +[2023-10-14 05:19:15,925][100917] Updated weights for policy 1, policy_version 7482 (0.0009) +[2023-10-14 05:19:16,560][100936] Updated weights for policy 0, policy_version 7490 (0.0009) +[2023-10-14 05:19:16,938][100936] Updated weights for policy 0, policy_version 7500 (0.0010) +[2023-10-14 05:19:17,312][100936] Updated weights for policy 0, policy_version 7510 (0.0007) +[2023-10-14 05:19:17,683][100936] Updated weights for policy 0, policy_version 7520 (0.0007) +[2023-10-14 05:19:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 15368192. Throughput: 0: 1649.9, 1: 1657.2. Samples: 3847012. Policy #0 lag: (min: 31.0, avg: 32.1, max: 54.0) +[2023-10-14 05:19:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:19:20,141][100917] Updated weights for policy 1, policy_version 7492 (0.0009) +[2023-10-14 05:19:20,518][100917] Updated weights for policy 1, policy_version 7502 (0.0009) +[2023-10-14 05:19:20,902][100917] Updated weights for policy 1, policy_version 7512 (0.0011) +[2023-10-14 05:19:21,813][100936] Updated weights for policy 0, policy_version 7530 (0.0011) +[2023-10-14 05:19:22,180][100936] Updated weights for policy 0, policy_version 7540 (0.0009) +[2023-10-14 05:19:22,552][100936] Updated weights for policy 0, policy_version 7550 (0.0007) +[2023-10-14 05:19:23,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 15433728. Throughput: 0: 1666.4, 1: 1653.2. Samples: 3866982. Policy #0 lag: (min: 31.0, avg: 34.5, max: 63.0) +[2023-10-14 05:19:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:19:25,297][100917] Updated weights for policy 1, policy_version 7522 (0.0009) +[2023-10-14 05:19:25,710][100917] Updated weights for policy 1, policy_version 7532 (0.0008) +[2023-10-14 05:19:26,072][100917] Updated weights for policy 1, policy_version 7542 (0.0010) +[2023-10-14 05:19:26,454][100917] Updated weights for policy 1, policy_version 7552 (0.0010) +[2023-10-14 05:19:26,701][100936] Updated weights for policy 0, policy_version 7560 (0.0008) +[2023-10-14 05:19:27,076][100936] Updated weights for policy 0, policy_version 7570 (0.0010) +[2023-10-14 05:19:27,439][100936] Updated weights for policy 0, policy_version 7580 (0.0009) +[2023-10-14 05:19:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 15499264. Throughput: 0: 1669.9, 1: 1646.6. Samples: 3877728. Policy #0 lag: (min: 31.0, avg: 34.5, max: 63.0) +[2023-10-14 05:19:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:19:30,494][100917] Updated weights for policy 1, policy_version 7562 (0.0009) +[2023-10-14 05:19:30,858][100917] Updated weights for policy 1, policy_version 7572 (0.0009) +[2023-10-14 05:19:31,236][100917] Updated weights for policy 1, policy_version 7582 (0.0009) +[2023-10-14 05:19:31,557][100936] Updated weights for policy 0, policy_version 7590 (0.0010) +[2023-10-14 05:19:31,924][100936] Updated weights for policy 0, policy_version 7600 (0.0007) +[2023-10-14 05:19:32,304][100936] Updated weights for policy 0, policy_version 7610 (0.0008) +[2023-10-14 05:19:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 15564800. Throughput: 0: 1653.7, 1: 1651.9. Samples: 3896650. Policy #0 lag: (min: 28.0, avg: 28.0, max: 28.0) +[2023-10-14 05:19:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:19:35,373][100917] Updated weights for policy 1, policy_version 7592 (0.0010) +[2023-10-14 05:19:35,738][100917] Updated weights for policy 1, policy_version 7602 (0.0009) +[2023-10-14 05:19:36,111][100917] Updated weights for policy 1, policy_version 7612 (0.0009) +[2023-10-14 05:19:36,406][100936] Updated weights for policy 0, policy_version 7620 (0.0008) +[2023-10-14 05:19:36,797][100936] Updated weights for policy 0, policy_version 7630 (0.0009) +[2023-10-14 05:19:37,169][100936] Updated weights for policy 0, policy_version 7640 (0.0008) +[2023-10-14 05:19:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 15630336. Throughput: 0: 1667.0, 1: 1653.7. Samples: 3916758. Policy #0 lag: (min: 28.0, avg: 28.0, max: 28.0) +[2023-10-14 05:19:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:19:38,520][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000007616_7798784.pth... +[2023-10-14 05:19:38,520][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000007648_7831552.pth... +[2023-10-14 05:19:38,554][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000006080_6225920.pth +[2023-10-14 05:19:38,558][100560] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p0/milestones/checkpoint_000007648_7831552.pth +[2023-10-14 05:19:38,561][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000006080_6225920.pth +[2023-10-14 05:19:38,567][100681] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p1/milestones/checkpoint_000007616_7798784.pth +[2023-10-14 05:19:40,261][100917] Updated weights for policy 1, policy_version 7622 (0.0008) +[2023-10-14 05:19:40,642][100917] Updated weights for policy 1, policy_version 7632 (0.0007) +[2023-10-14 05:19:41,015][100917] Updated weights for policy 1, policy_version 7642 (0.0010) +[2023-10-14 05:19:41,278][100936] Updated weights for policy 0, policy_version 7650 (0.0009) +[2023-10-14 05:19:41,659][100936] Updated weights for policy 0, policy_version 7660 (0.0007) +[2023-10-14 05:19:42,029][100936] Updated weights for policy 0, policy_version 7670 (0.0008) +[2023-10-14 05:19:42,402][100936] Updated weights for policy 0, policy_version 7680 (0.0008) +[2023-10-14 05:19:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 15695872. Throughput: 0: 1658.6, 1: 1646.5. Samples: 3926968. Policy #0 lag: (min: 25.0, avg: 26.4, max: 45.0) +[2023-10-14 05:19:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:19:45,109][100917] Updated weights for policy 1, policy_version 7652 (0.0009) +[2023-10-14 05:19:45,493][100917] Updated weights for policy 1, policy_version 7662 (0.0008) +[2023-10-14 05:19:45,875][100917] Updated weights for policy 1, policy_version 7672 (0.0009) +[2023-10-14 05:19:46,614][100936] Updated weights for policy 0, policy_version 7690 (0.0007) +[2023-10-14 05:19:46,997][100936] Updated weights for policy 0, policy_version 7700 (0.0007) +[2023-10-14 05:19:47,369][100936] Updated weights for policy 0, policy_version 7710 (0.0010) +[2023-10-14 05:19:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 15761408. Throughput: 0: 1648.4, 1: 1652.4. Samples: 3945952. Policy #0 lag: (min: 25.0, avg: 26.4, max: 45.0) +[2023-10-14 05:19:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:19:50,161][100917] Updated weights for policy 1, policy_version 7682 (0.0008) +[2023-10-14 05:19:50,538][100917] Updated weights for policy 1, policy_version 7692 (0.0007) +[2023-10-14 05:19:50,901][100917] Updated weights for policy 1, policy_version 7702 (0.0009) +[2023-10-14 05:19:51,281][100917] Updated weights for policy 1, policy_version 7712 (0.0008) +[2023-10-14 05:19:51,560][100936] Updated weights for policy 0, policy_version 7720 (0.0010) +[2023-10-14 05:19:51,937][100936] Updated weights for policy 0, policy_version 7730 (0.0011) +[2023-10-14 05:19:52,313][100936] Updated weights for policy 0, policy_version 7740 (0.0009) +[2023-10-14 05:19:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 15826944. Throughput: 0: 1657.4, 1: 1648.2. Samples: 3966116. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:19:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:19:55,289][100917] Updated weights for policy 1, policy_version 7722 (0.0010) +[2023-10-14 05:19:55,658][100917] Updated weights for policy 1, policy_version 7732 (0.0008) +[2023-10-14 05:19:56,037][100917] Updated weights for policy 1, policy_version 7742 (0.0007) +[2023-10-14 05:19:56,153][100936] Updated weights for policy 0, policy_version 7750 (0.0008) +[2023-10-14 05:19:56,529][100936] Updated weights for policy 0, policy_version 7760 (0.0009) +[2023-10-14 05:19:56,891][100936] Updated weights for policy 0, policy_version 7770 (0.0007) +[2023-10-14 05:19:58,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 15892480. Throughput: 0: 1653.8, 1: 1645.5. Samples: 3976350. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:19:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:20:00,215][100917] Updated weights for policy 1, policy_version 7752 (0.0007) +[2023-10-14 05:20:00,578][100917] Updated weights for policy 1, policy_version 7762 (0.0008) +[2023-10-14 05:20:00,945][100917] Updated weights for policy 1, policy_version 7772 (0.0007) +[2023-10-14 05:20:01,087][100936] Updated weights for policy 0, policy_version 7780 (0.0008) +[2023-10-14 05:20:01,455][100936] Updated weights for policy 0, policy_version 7790 (0.0009) +[2023-10-14 05:20:01,834][100936] Updated weights for policy 0, policy_version 7800 (0.0007) +[2023-10-14 05:20:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 15958016. Throughput: 0: 1658.3, 1: 1651.5. Samples: 3995954. Policy #0 lag: (min: 31.0, avg: 34.6, max: 63.0) +[2023-10-14 05:20:03,515][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:20:05,144][100917] Updated weights for policy 1, policy_version 7782 (0.0008) +[2023-10-14 05:20:05,511][100917] Updated weights for policy 1, policy_version 7792 (0.0009) +[2023-10-14 05:20:05,886][100917] Updated weights for policy 1, policy_version 7802 (0.0009) +[2023-10-14 05:20:06,094][100936] Updated weights for policy 0, policy_version 7810 (0.0008) +[2023-10-14 05:20:06,472][100936] Updated weights for policy 0, policy_version 7820 (0.0009) +[2023-10-14 05:20:06,838][100936] Updated weights for policy 0, policy_version 7830 (0.0007) +[2023-10-14 05:20:07,212][100936] Updated weights for policy 0, policy_version 7840 (0.0010) +[2023-10-14 05:20:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 16023552. Throughput: 0: 1664.0, 1: 1656.2. Samples: 4016390. Policy #0 lag: (min: 31.0, avg: 34.6, max: 63.0) +[2023-10-14 05:20:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:20:10,122][100917] Updated weights for policy 1, policy_version 7812 (0.0009) +[2023-10-14 05:20:10,519][100917] Updated weights for policy 1, policy_version 7822 (0.0008) +[2023-10-14 05:20:10,890][100917] Updated weights for policy 1, policy_version 7832 (0.0008) +[2023-10-14 05:20:11,107][100936] Updated weights for policy 0, policy_version 7850 (0.0007) +[2023-10-14 05:20:11,482][100936] Updated weights for policy 0, policy_version 7860 (0.0008) +[2023-10-14 05:20:11,845][100936] Updated weights for policy 0, policy_version 7870 (0.0009) +[2023-10-14 05:20:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 16089088. Throughput: 0: 1650.8, 1: 1654.9. Samples: 4026486. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:20:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:20:15,025][100917] Updated weights for policy 1, policy_version 7842 (0.0008) +[2023-10-14 05:20:15,398][100917] Updated weights for policy 1, policy_version 7852 (0.0008) +[2023-10-14 05:20:15,773][100917] Updated weights for policy 1, policy_version 7862 (0.0008) +[2023-10-14 05:20:15,951][100936] Updated weights for policy 0, policy_version 7880 (0.0009) +[2023-10-14 05:20:16,145][100917] Updated weights for policy 1, policy_version 7872 (0.0008) +[2023-10-14 05:20:16,322][100936] Updated weights for policy 0, policy_version 7890 (0.0008) +[2023-10-14 05:20:16,701][100936] Updated weights for policy 0, policy_version 7900 (0.0007) +[2023-10-14 05:20:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 16154624. Throughput: 0: 1658.0, 1: 1654.4. Samples: 4045712. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:20:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:20:20,274][100917] Updated weights for policy 1, policy_version 7882 (0.0009) +[2023-10-14 05:20:20,645][100917] Updated weights for policy 1, policy_version 7892 (0.0009) +[2023-10-14 05:20:21,012][100936] Updated weights for policy 0, policy_version 7910 (0.0009) +[2023-10-14 05:20:21,025][100917] Updated weights for policy 1, policy_version 7902 (0.0007) +[2023-10-14 05:20:21,395][100936] Updated weights for policy 0, policy_version 7920 (0.0009) +[2023-10-14 05:20:21,764][100936] Updated weights for policy 0, policy_version 7930 (0.0009) +[2023-10-14 05:20:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 16220160. Throughput: 0: 1665.0, 1: 1657.9. Samples: 4066288. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:20:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:20:25,031][100917] Updated weights for policy 1, policy_version 7912 (0.0008) +[2023-10-14 05:20:25,395][100917] Updated weights for policy 1, policy_version 7922 (0.0007) +[2023-10-14 05:20:25,682][100936] Updated weights for policy 0, policy_version 7940 (0.0007) +[2023-10-14 05:20:25,763][100917] Updated weights for policy 1, policy_version 7932 (0.0008) +[2023-10-14 05:20:26,045][100936] Updated weights for policy 0, policy_version 7950 (0.0007) +[2023-10-14 05:20:26,422][100936] Updated weights for policy 0, policy_version 7960 (0.0008) +[2023-10-14 05:20:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 16285696. Throughput: 0: 1654.2, 1: 1650.8. Samples: 4075694. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:20:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:20:29,784][100917] Updated weights for policy 1, policy_version 7942 (0.0009) +[2023-10-14 05:20:30,165][100917] Updated weights for policy 1, policy_version 7952 (0.0010) +[2023-10-14 05:20:30,519][100936] Updated weights for policy 0, policy_version 7970 (0.0010) +[2023-10-14 05:20:30,527][100917] Updated weights for policy 1, policy_version 7962 (0.0007) +[2023-10-14 05:20:30,892][100936] Updated weights for policy 0, policy_version 7980 (0.0007) +[2023-10-14 05:20:31,269][100936] Updated weights for policy 0, policy_version 7990 (0.0009) +[2023-10-14 05:20:31,631][100936] Updated weights for policy 0, policy_version 8000 (0.0009) +[2023-10-14 05:20:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 16351232. Throughput: 0: 1669.1, 1: 1666.9. Samples: 4096072. Policy #0 lag: (min: 1.0, avg: 3.2, max: 24.0) +[2023-10-14 05:20:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:20:34,602][100917] Updated weights for policy 1, policy_version 7972 (0.0009) +[2023-10-14 05:20:34,969][100917] Updated weights for policy 1, policy_version 7982 (0.0008) +[2023-10-14 05:20:35,352][100917] Updated weights for policy 1, policy_version 7992 (0.0010) +[2023-10-14 05:20:35,806][100936] Updated weights for policy 0, policy_version 8010 (0.0009) +[2023-10-14 05:20:36,184][100936] Updated weights for policy 0, policy_version 8020 (0.0008) +[2023-10-14 05:20:36,560][100936] Updated weights for policy 0, policy_version 8030 (0.0007) +[2023-10-14 05:20:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 16416768. Throughput: 0: 1668.6, 1: 1675.2. Samples: 4116588. Policy #0 lag: (min: 1.0, avg: 3.2, max: 24.0) +[2023-10-14 05:20:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:20:39,343][100917] Updated weights for policy 1, policy_version 8002 (0.0008) +[2023-10-14 05:20:39,722][100917] Updated weights for policy 1, policy_version 8012 (0.0007) +[2023-10-14 05:20:40,095][100917] Updated weights for policy 1, policy_version 8022 (0.0008) +[2023-10-14 05:20:40,457][100917] Updated weights for policy 1, policy_version 8032 (0.0007) +[2023-10-14 05:20:40,724][100936] Updated weights for policy 0, policy_version 8040 (0.0008) +[2023-10-14 05:20:41,094][100936] Updated weights for policy 0, policy_version 8050 (0.0009) +[2023-10-14 05:20:41,465][100936] Updated weights for policy 0, policy_version 8060 (0.0009) +[2023-10-14 05:20:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 16482304. Throughput: 0: 1654.7, 1: 1668.9. Samples: 4125912. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 05:20:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:20:44,524][100917] Updated weights for policy 1, policy_version 8042 (0.0008) +[2023-10-14 05:20:44,900][100917] Updated weights for policy 1, policy_version 8052 (0.0008) +[2023-10-14 05:20:45,265][100917] Updated weights for policy 1, policy_version 8062 (0.0009) +[2023-10-14 05:20:45,635][100936] Updated weights for policy 0, policy_version 8070 (0.0009) +[2023-10-14 05:20:45,999][100936] Updated weights for policy 0, policy_version 8080 (0.0010) +[2023-10-14 05:20:46,369][100936] Updated weights for policy 0, policy_version 8090 (0.0007) +[2023-10-14 05:20:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 16547840. Throughput: 0: 1664.2, 1: 1673.3. Samples: 4146140. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 05:20:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:20:49,414][100917] Updated weights for policy 1, policy_version 8072 (0.0009) +[2023-10-14 05:20:49,779][100917] Updated weights for policy 1, policy_version 8082 (0.0009) +[2023-10-14 05:20:50,159][100917] Updated weights for policy 1, policy_version 8092 (0.0009) +[2023-10-14 05:20:50,486][100936] Updated weights for policy 0, policy_version 8100 (0.0009) +[2023-10-14 05:20:50,854][100936] Updated weights for policy 0, policy_version 8110 (0.0008) +[2023-10-14 05:20:51,222][100936] Updated weights for policy 0, policy_version 8120 (0.0008) +[2023-10-14 05:20:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 16613376. Throughput: 0: 1663.8, 1: 1673.9. Samples: 4166586. Policy #0 lag: (min: 2.0, avg: 12.8, max: 34.0) +[2023-10-14 05:20:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:20:54,343][100917] Updated weights for policy 1, policy_version 8102 (0.0008) +[2023-10-14 05:20:54,712][100917] Updated weights for policy 1, policy_version 8112 (0.0009) +[2023-10-14 05:20:54,988][100936] Updated weights for policy 0, policy_version 8130 (0.0010) +[2023-10-14 05:20:55,086][100917] Updated weights for policy 1, policy_version 8122 (0.0008) +[2023-10-14 05:20:55,356][100936] Updated weights for policy 0, policy_version 8140 (0.0009) +[2023-10-14 05:20:55,730][100936] Updated weights for policy 0, policy_version 8150 (0.0009) +[2023-10-14 05:20:56,088][100936] Updated weights for policy 0, policy_version 8160 (0.0008) +[2023-10-14 05:20:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 16678912. Throughput: 0: 1647.4, 1: 1668.2. Samples: 4175686. Policy #0 lag: (min: 2.0, avg: 12.8, max: 34.0) +[2023-10-14 05:20:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:20:59,063][100917] Updated weights for policy 1, policy_version 8132 (0.0007) +[2023-10-14 05:20:59,471][100917] Updated weights for policy 1, policy_version 8142 (0.0008) +[2023-10-14 05:20:59,847][100917] Updated weights for policy 1, policy_version 8152 (0.0007) +[2023-10-14 05:21:00,326][100936] Updated weights for policy 0, policy_version 8170 (0.0009) +[2023-10-14 05:21:00,695][100936] Updated weights for policy 0, policy_version 8180 (0.0008) +[2023-10-14 05:21:01,062][100936] Updated weights for policy 0, policy_version 8190 (0.0008) +[2023-10-14 05:21:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 16744448. Throughput: 0: 1669.5, 1: 1680.0. Samples: 4196440. Policy #0 lag: (min: 15.0, avg: 20.0, max: 47.0) +[2023-10-14 05:21:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:21:03,826][100917] Updated weights for policy 1, policy_version 8162 (0.0008) +[2023-10-14 05:21:04,202][100917] Updated weights for policy 1, policy_version 8172 (0.0007) +[2023-10-14 05:21:04,579][100917] Updated weights for policy 1, policy_version 8182 (0.0009) +[2023-10-14 05:21:04,957][100917] Updated weights for policy 1, policy_version 8192 (0.0009) +[2023-10-14 05:21:05,215][100936] Updated weights for policy 0, policy_version 8200 (0.0008) +[2023-10-14 05:21:05,583][100936] Updated weights for policy 0, policy_version 8210 (0.0007) +[2023-10-14 05:21:05,958][100936] Updated weights for policy 0, policy_version 8220 (0.0008) +[2023-10-14 05:21:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 16809984. Throughput: 0: 1671.7, 1: 1682.9. Samples: 4217248. Policy #0 lag: (min: 15.0, avg: 20.0, max: 47.0) +[2023-10-14 05:21:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:21:08,824][100917] Updated weights for policy 1, policy_version 8202 (0.0010) +[2023-10-14 05:21:09,210][100917] Updated weights for policy 1, policy_version 8212 (0.0010) +[2023-10-14 05:21:09,580][100917] Updated weights for policy 1, policy_version 8222 (0.0009) +[2023-10-14 05:21:09,955][100936] Updated weights for policy 0, policy_version 8230 (0.0008) +[2023-10-14 05:21:10,340][100936] Updated weights for policy 0, policy_version 8240 (0.0009) +[2023-10-14 05:21:10,708][100936] Updated weights for policy 0, policy_version 8250 (0.0009) +[2023-10-14 05:21:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 16875520. Throughput: 0: 1660.3, 1: 1688.9. Samples: 4226408. Policy #0 lag: (min: 30.0, avg: 32.9, max: 62.0) +[2023-10-14 05:21:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:21:13,634][100917] Updated weights for policy 1, policy_version 8232 (0.0010) +[2023-10-14 05:21:14,001][100917] Updated weights for policy 1, policy_version 8242 (0.0010) +[2023-10-14 05:21:14,377][100917] Updated weights for policy 1, policy_version 8252 (0.0009) +[2023-10-14 05:21:14,678][100936] Updated weights for policy 0, policy_version 8260 (0.0009) +[2023-10-14 05:21:15,047][100936] Updated weights for policy 0, policy_version 8270 (0.0007) +[2023-10-14 05:21:15,414][100936] Updated weights for policy 0, policy_version 8280 (0.0007) +[2023-10-14 05:21:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 16941056. Throughput: 0: 1668.0, 1: 1676.4. Samples: 4246572. Policy #0 lag: (min: 30.0, avg: 32.9, max: 62.0) +[2023-10-14 05:21:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:21:18,739][100917] Updated weights for policy 1, policy_version 8262 (0.0010) +[2023-10-14 05:21:19,118][100917] Updated weights for policy 1, policy_version 8272 (0.0009) +[2023-10-14 05:21:19,494][100917] Updated weights for policy 1, policy_version 8282 (0.0008) +[2023-10-14 05:21:19,537][100936] Updated weights for policy 0, policy_version 8290 (0.0008) +[2023-10-14 05:21:19,907][100936] Updated weights for policy 0, policy_version 8300 (0.0010) +[2023-10-14 05:21:20,284][100936] Updated weights for policy 0, policy_version 8310 (0.0010) +[2023-10-14 05:21:20,647][100936] Updated weights for policy 0, policy_version 8320 (0.0010) +[2023-10-14 05:21:23,480][100917] Updated weights for policy 1, policy_version 8292 (0.0008) +[2023-10-14 05:21:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 17006592. Throughput: 0: 1672.0, 1: 1672.4. Samples: 4267082. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 05:21:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:21:23,858][100917] Updated weights for policy 1, policy_version 8302 (0.0007) +[2023-10-14 05:21:24,230][100917] Updated weights for policy 1, policy_version 8312 (0.0009) +[2023-10-14 05:21:24,796][100936] Updated weights for policy 0, policy_version 8330 (0.0010) +[2023-10-14 05:21:25,165][100936] Updated weights for policy 0, policy_version 8340 (0.0010) +[2023-10-14 05:21:25,538][100936] Updated weights for policy 0, policy_version 8350 (0.0007) +[2023-10-14 05:21:28,466][100917] Updated weights for policy 1, policy_version 8322 (0.0007) +[2023-10-14 05:21:28,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 17072128. Throughput: 0: 1662.9, 1: 1671.5. Samples: 4275964. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 05:21:28,514][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:21:28,837][100917] Updated weights for policy 1, policy_version 8332 (0.0008) +[2023-10-14 05:21:29,218][100917] Updated weights for policy 1, policy_version 8342 (0.0010) +[2023-10-14 05:21:29,584][100917] Updated weights for policy 1, policy_version 8352 (0.0007) +[2023-10-14 05:21:29,593][100936] Updated weights for policy 0, policy_version 8360 (0.0009) +[2023-10-14 05:21:29,971][100936] Updated weights for policy 0, policy_version 8370 (0.0010) +[2023-10-14 05:21:30,331][100936] Updated weights for policy 0, policy_version 8380 (0.0010) +[2023-10-14 05:21:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 17137664. Throughput: 0: 1678.2, 1: 1669.4. Samples: 4296780. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:21:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:21:33,733][100917] Updated weights for policy 1, policy_version 8362 (0.0009) +[2023-10-14 05:21:34,096][100917] Updated weights for policy 1, policy_version 8372 (0.0010) +[2023-10-14 05:21:34,474][100917] Updated weights for policy 1, policy_version 8382 (0.0008) +[2023-10-14 05:21:34,512][100936] Updated weights for policy 0, policy_version 8390 (0.0008) +[2023-10-14 05:21:34,876][100936] Updated weights for policy 0, policy_version 8400 (0.0007) +[2023-10-14 05:21:35,251][100936] Updated weights for policy 0, policy_version 8410 (0.0007) +[2023-10-14 05:21:38,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 17203200. Throughput: 0: 1676.8, 1: 1670.7. Samples: 4317222. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:21:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:21:38,521][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000008416_8617984.pth... +[2023-10-14 05:21:38,550][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000006880_7045120.pth +[2023-10-14 05:21:38,631][100917] Updated weights for policy 1, policy_version 8392 (0.0008) +[2023-10-14 05:21:39,014][100917] Updated weights for policy 1, policy_version 8402 (0.0007) +[2023-10-14 05:21:39,350][100936] Updated weights for policy 0, policy_version 8420 (0.0009) +[2023-10-14 05:21:39,384][100917] Updated weights for policy 1, policy_version 8412 (0.0008) +[2023-10-14 05:21:39,533][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000008416_8617984.pth... +[2023-10-14 05:21:39,568][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000006848_7012352.pth +[2023-10-14 05:21:39,718][100936] Updated weights for policy 0, policy_version 8430 (0.0010) +[2023-10-14 05:21:40,108][100936] Updated weights for policy 0, policy_version 8440 (0.0008) +[2023-10-14 05:21:43,347][100917] Updated weights for policy 1, policy_version 8422 (0.0008) +[2023-10-14 05:21:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 17268736. Throughput: 0: 1671.9, 1: 1668.1. Samples: 4325990. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 05:21:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:21:43,737][100917] Updated weights for policy 1, policy_version 8432 (0.0008) +[2023-10-14 05:21:44,116][100917] Updated weights for policy 1, policy_version 8442 (0.0009) +[2023-10-14 05:21:44,248][100936] Updated weights for policy 0, policy_version 8450 (0.0008) +[2023-10-14 05:21:44,611][100936] Updated weights for policy 0, policy_version 8460 (0.0007) +[2023-10-14 05:21:44,987][100936] Updated weights for policy 0, policy_version 8470 (0.0010) +[2023-10-14 05:21:45,355][100936] Updated weights for policy 0, policy_version 8480 (0.0008) +[2023-10-14 05:21:48,141][100917] Updated weights for policy 1, policy_version 8452 (0.0009) +[2023-10-14 05:21:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 17334272. Throughput: 0: 1669.6, 1: 1662.6. Samples: 4346388. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 05:21:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:21:48,523][100917] Updated weights for policy 1, policy_version 8462 (0.0009) +[2023-10-14 05:21:48,891][100917] Updated weights for policy 1, policy_version 8472 (0.0011) +[2023-10-14 05:21:49,675][100936] Updated weights for policy 0, policy_version 8490 (0.0007) +[2023-10-14 05:21:50,044][100936] Updated weights for policy 0, policy_version 8500 (0.0009) +[2023-10-14 05:21:50,408][100936] Updated weights for policy 0, policy_version 8510 (0.0008) +[2023-10-14 05:21:53,037][100917] Updated weights for policy 1, policy_version 8482 (0.0008) +[2023-10-14 05:21:53,422][100917] Updated weights for policy 1, policy_version 8492 (0.0008) +[2023-10-14 05:21:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 17399808. Throughput: 0: 1666.4, 1: 1656.1. Samples: 4366760. Policy #0 lag: (min: 26.0, avg: 34.3, max: 58.0) +[2023-10-14 05:21:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:21:53,796][100917] Updated weights for policy 1, policy_version 8502 (0.0008) +[2023-10-14 05:21:54,164][100917] Updated weights for policy 1, policy_version 8512 (0.0008) +[2023-10-14 05:21:54,449][100936] Updated weights for policy 0, policy_version 8520 (0.0008) +[2023-10-14 05:21:54,823][100936] Updated weights for policy 0, policy_version 8530 (0.0007) +[2023-10-14 05:21:55,200][100936] Updated weights for policy 0, policy_version 8540 (0.0007) +[2023-10-14 05:21:58,367][100917] Updated weights for policy 1, policy_version 8522 (0.0008) +[2023-10-14 05:21:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 17465344. Throughput: 0: 1664.4, 1: 1654.6. Samples: 4375764. Policy #0 lag: (min: 26.0, avg: 34.3, max: 58.0) +[2023-10-14 05:21:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:21:58,749][100917] Updated weights for policy 1, policy_version 8532 (0.0007) +[2023-10-14 05:21:59,111][100917] Updated weights for policy 1, policy_version 8542 (0.0008) +[2023-10-14 05:21:59,486][100936] Updated weights for policy 0, policy_version 8550 (0.0008) +[2023-10-14 05:21:59,868][100936] Updated weights for policy 0, policy_version 8560 (0.0010) +[2023-10-14 05:22:00,229][100936] Updated weights for policy 0, policy_version 8570 (0.0009) +[2023-10-14 05:22:03,218][100917] Updated weights for policy 1, policy_version 8552 (0.0010) +[2023-10-14 05:22:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 17530880. Throughput: 0: 1666.8, 1: 1656.9. Samples: 4396136. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) +[2023-10-14 05:22:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:22:03,586][100917] Updated weights for policy 1, policy_version 8562 (0.0008) +[2023-10-14 05:22:03,959][100917] Updated weights for policy 1, policy_version 8572 (0.0009) +[2023-10-14 05:22:03,994][100936] Updated weights for policy 0, policy_version 8580 (0.0009) +[2023-10-14 05:22:04,359][100936] Updated weights for policy 0, policy_version 8590 (0.0009) +[2023-10-14 05:22:04,739][100936] Updated weights for policy 0, policy_version 8600 (0.0008) +[2023-10-14 05:22:08,045][100917] Updated weights for policy 1, policy_version 8582 (0.0008) +[2023-10-14 05:22:08,421][100917] Updated weights for policy 1, policy_version 8592 (0.0007) +[2023-10-14 05:22:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 17596416. Throughput: 0: 1671.1, 1: 1654.3. Samples: 4416722. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) +[2023-10-14 05:22:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:22:08,754][100936] Updated weights for policy 0, policy_version 8610 (0.0008) +[2023-10-14 05:22:08,794][100917] Updated weights for policy 1, policy_version 8602 (0.0007) +[2023-10-14 05:22:09,130][100936] Updated weights for policy 0, policy_version 8620 (0.0010) +[2023-10-14 05:22:09,513][100936] Updated weights for policy 0, policy_version 8630 (0.0010) +[2023-10-14 05:22:09,882][100936] Updated weights for policy 0, policy_version 8640 (0.0010) +[2023-10-14 05:22:12,832][100917] Updated weights for policy 1, policy_version 8612 (0.0008) +[2023-10-14 05:22:13,210][100917] Updated weights for policy 1, policy_version 8622 (0.0007) +[2023-10-14 05:22:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 17661952. Throughput: 0: 1668.7, 1: 1662.2. Samples: 4425854. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:22:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:22:13,578][100917] Updated weights for policy 1, policy_version 8632 (0.0009) +[2023-10-14 05:22:14,220][100936] Updated weights for policy 0, policy_version 8650 (0.0010) +[2023-10-14 05:22:14,596][100936] Updated weights for policy 0, policy_version 8660 (0.0008) +[2023-10-14 05:22:14,957][100936] Updated weights for policy 0, policy_version 8670 (0.0010) +[2023-10-14 05:22:17,651][100917] Updated weights for policy 1, policy_version 8642 (0.0010) +[2023-10-14 05:22:18,036][100917] Updated weights for policy 1, policy_version 8652 (0.0011) +[2023-10-14 05:22:18,412][100917] Updated weights for policy 1, policy_version 8662 (0.0008) +[2023-10-14 05:22:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 17727488. Throughput: 0: 1654.9, 1: 1664.8. Samples: 4446170. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:22:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:22:18,787][100917] Updated weights for policy 1, policy_version 8672 (0.0009) +[2023-10-14 05:22:19,226][100936] Updated weights for policy 0, policy_version 8680 (0.0008) +[2023-10-14 05:22:19,604][100936] Updated weights for policy 0, policy_version 8690 (0.0010) +[2023-10-14 05:22:19,975][100936] Updated weights for policy 0, policy_version 8700 (0.0010) +[2023-10-14 05:22:22,948][100917] Updated weights for policy 1, policy_version 8682 (0.0008) +[2023-10-14 05:22:23,320][100917] Updated weights for policy 1, policy_version 8692 (0.0010) +[2023-10-14 05:22:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 17793024. Throughput: 0: 1652.1, 1: 1654.5. Samples: 4466020. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 05:22:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:22:23,709][100917] Updated weights for policy 1, policy_version 8702 (0.0008) +[2023-10-14 05:22:24,222][100936] Updated weights for policy 0, policy_version 8710 (0.0009) +[2023-10-14 05:22:24,588][100936] Updated weights for policy 0, policy_version 8720 (0.0008) +[2023-10-14 05:22:24,955][100936] Updated weights for policy 0, policy_version 8730 (0.0007) +[2023-10-14 05:22:27,766][100917] Updated weights for policy 1, policy_version 8712 (0.0008) +[2023-10-14 05:22:28,137][100917] Updated weights for policy 1, policy_version 8722 (0.0008) +[2023-10-14 05:22:28,506][100917] Updated weights for policy 1, policy_version 8732 (0.0008) +[2023-10-14 05:22:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 17858560. Throughput: 0: 1656.0, 1: 1663.5. Samples: 4475364. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 05:22:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:22:28,947][100936] Updated weights for policy 0, policy_version 8740 (0.0008) +[2023-10-14 05:22:29,322][100936] Updated weights for policy 0, policy_version 8750 (0.0007) +[2023-10-14 05:22:29,690][100936] Updated weights for policy 0, policy_version 8760 (0.0009) +[2023-10-14 05:22:32,802][100917] Updated weights for policy 1, policy_version 8742 (0.0008) +[2023-10-14 05:22:33,185][100917] Updated weights for policy 1, policy_version 8752 (0.0008) +[2023-10-14 05:22:33,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 17924096. Throughput: 0: 1653.6, 1: 1665.9. Samples: 4495768. Policy #0 lag: (min: 9.0, avg: 22.4, max: 41.0) +[2023-10-14 05:22:33,514][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:22:33,564][100917] Updated weights for policy 1, policy_version 8762 (0.0010) +[2023-10-14 05:22:33,879][100936] Updated weights for policy 0, policy_version 8770 (0.0011) +[2023-10-14 05:22:34,253][100936] Updated weights for policy 0, policy_version 8780 (0.0008) +[2023-10-14 05:22:34,625][100936] Updated weights for policy 0, policy_version 8790 (0.0008) +[2023-10-14 05:22:35,000][100936] Updated weights for policy 0, policy_version 8800 (0.0008) +[2023-10-14 05:22:37,794][100917] Updated weights for policy 1, policy_version 8772 (0.0010) +[2023-10-14 05:22:38,176][100917] Updated weights for policy 1, policy_version 8782 (0.0009) +[2023-10-14 05:22:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 17989632. Throughput: 0: 1654.8, 1: 1655.1. Samples: 4515706. Policy #0 lag: (min: 9.0, avg: 22.4, max: 41.0) +[2023-10-14 05:22:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:22:38,537][100917] Updated weights for policy 1, policy_version 8792 (0.0007) +[2023-10-14 05:22:39,219][100936] Updated weights for policy 0, policy_version 8810 (0.0008) +[2023-10-14 05:22:39,597][100936] Updated weights for policy 0, policy_version 8820 (0.0008) +[2023-10-14 05:22:39,972][100936] Updated weights for policy 0, policy_version 8830 (0.0009) +[2023-10-14 05:22:42,608][100917] Updated weights for policy 1, policy_version 8802 (0.0007) +[2023-10-14 05:22:42,977][100917] Updated weights for policy 1, policy_version 8812 (0.0007) +[2023-10-14 05:22:43,358][100917] Updated weights for policy 1, policy_version 8822 (0.0008) +[2023-10-14 05:22:43,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 18055168. Throughput: 0: 1656.2, 1: 1658.2. Samples: 4524910. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:22:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:22:43,727][100917] Updated weights for policy 1, policy_version 8832 (0.0011) +[2023-10-14 05:22:44,091][100936] Updated weights for policy 0, policy_version 8840 (0.0007) +[2023-10-14 05:22:44,462][100936] Updated weights for policy 0, policy_version 8850 (0.0008) +[2023-10-14 05:22:44,838][100936] Updated weights for policy 0, policy_version 8860 (0.0008) +[2023-10-14 05:22:47,812][100917] Updated weights for policy 1, policy_version 8842 (0.0007) +[2023-10-14 05:22:48,192][100917] Updated weights for policy 1, policy_version 8852 (0.0009) +[2023-10-14 05:22:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 18120704. Throughput: 0: 1655.3, 1: 1658.0. Samples: 4545238. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:22:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:22:48,557][100917] Updated weights for policy 1, policy_version 8862 (0.0009) +[2023-10-14 05:22:48,904][100936] Updated weights for policy 0, policy_version 8870 (0.0007) +[2023-10-14 05:22:49,270][100936] Updated weights for policy 0, policy_version 8880 (0.0008) +[2023-10-14 05:22:49,643][100936] Updated weights for policy 0, policy_version 8890 (0.0008) +[2023-10-14 05:22:52,657][100917] Updated weights for policy 1, policy_version 8872 (0.0009) +[2023-10-14 05:22:53,041][100917] Updated weights for policy 1, policy_version 8882 (0.0009) +[2023-10-14 05:22:53,414][100917] Updated weights for policy 1, policy_version 8892 (0.0009) +[2023-10-14 05:22:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 18186240. Throughput: 0: 1654.0, 1: 1645.9. Samples: 4565218. Policy #0 lag: (min: 17.0, avg: 35.6, max: 49.0) +[2023-10-14 05:22:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:22:53,706][100936] Updated weights for policy 0, policy_version 8900 (0.0007) +[2023-10-14 05:22:54,065][100936] Updated weights for policy 0, policy_version 8910 (0.0007) +[2023-10-14 05:22:54,448][100936] Updated weights for policy 0, policy_version 8920 (0.0008) +[2023-10-14 05:22:57,379][100917] Updated weights for policy 1, policy_version 8902 (0.0009) +[2023-10-14 05:22:57,746][100917] Updated weights for policy 1, policy_version 8912 (0.0009) +[2023-10-14 05:22:58,125][100917] Updated weights for policy 1, policy_version 8922 (0.0007) +[2023-10-14 05:22:58,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 18284544. Throughput: 0: 1654.5, 1: 1653.0. Samples: 4574692. Policy #0 lag: (min: 17.0, avg: 35.6, max: 49.0) +[2023-10-14 05:22:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:22:58,598][100936] Updated weights for policy 0, policy_version 8930 (0.0009) +[2023-10-14 05:22:58,967][100936] Updated weights for policy 0, policy_version 8940 (0.0007) +[2023-10-14 05:22:59,336][100936] Updated weights for policy 0, policy_version 8950 (0.0008) +[2023-10-14 05:22:59,709][100936] Updated weights for policy 0, policy_version 8960 (0.0007) +[2023-10-14 05:23:02,156][100917] Updated weights for policy 1, policy_version 8932 (0.0008) +[2023-10-14 05:23:02,532][100917] Updated weights for policy 1, policy_version 8942 (0.0010) +[2023-10-14 05:23:02,897][100917] Updated weights for policy 1, policy_version 8952 (0.0008) +[2023-10-14 05:23:03,512][99942] Fps is (10 sec: 16384.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 18350080. Throughput: 0: 1657.6, 1: 1654.3. Samples: 4595208. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) +[2023-10-14 05:23:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:23:03,824][100936] Updated weights for policy 0, policy_version 8970 (0.0011) +[2023-10-14 05:23:04,199][100936] Updated weights for policy 0, policy_version 8980 (0.0010) +[2023-10-14 05:23:04,573][100936] Updated weights for policy 0, policy_version 8990 (0.0010) +[2023-10-14 05:23:07,246][100917] Updated weights for policy 1, policy_version 8962 (0.0008) +[2023-10-14 05:23:07,618][100917] Updated weights for policy 1, policy_version 8972 (0.0008) +[2023-10-14 05:23:07,986][100917] Updated weights for policy 1, policy_version 8982 (0.0007) +[2023-10-14 05:23:08,365][100917] Updated weights for policy 1, policy_version 8992 (0.0009) +[2023-10-14 05:23:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 18415616. Throughput: 0: 1659.8, 1: 1645.7. Samples: 4614768. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) +[2023-10-14 05:23:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:23:08,659][100936] Updated weights for policy 0, policy_version 9000 (0.0008) +[2023-10-14 05:23:09,034][100936] Updated weights for policy 0, policy_version 9010 (0.0011) +[2023-10-14 05:23:09,396][100936] Updated weights for policy 0, policy_version 9020 (0.0010) +[2023-10-14 05:23:12,564][100917] Updated weights for policy 1, policy_version 9002 (0.0007) +[2023-10-14 05:23:12,947][100917] Updated weights for policy 1, policy_version 9012 (0.0008) +[2023-10-14 05:23:13,326][100917] Updated weights for policy 1, policy_version 9022 (0.0008) +[2023-10-14 05:23:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 18481152. Throughput: 0: 1655.7, 1: 1660.1. Samples: 4624578. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) +[2023-10-14 05:23:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:23:13,721][100936] Updated weights for policy 0, policy_version 9030 (0.0009) +[2023-10-14 05:23:14,094][100936] Updated weights for policy 0, policy_version 9040 (0.0010) +[2023-10-14 05:23:14,472][100936] Updated weights for policy 0, policy_version 9050 (0.0010) +[2023-10-14 05:23:17,455][100917] Updated weights for policy 1, policy_version 9032 (0.0008) +[2023-10-14 05:23:17,834][100917] Updated weights for policy 1, policy_version 9042 (0.0007) +[2023-10-14 05:23:18,220][100917] Updated weights for policy 1, policy_version 9052 (0.0007) +[2023-10-14 05:23:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 18546688. Throughput: 0: 1653.7, 1: 1660.4. Samples: 4644902. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) +[2023-10-14 05:23:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:23:18,554][100936] Updated weights for policy 0, policy_version 9060 (0.0009) +[2023-10-14 05:23:18,923][100936] Updated weights for policy 0, policy_version 9070 (0.0007) +[2023-10-14 05:23:19,287][100936] Updated weights for policy 0, policy_version 9080 (0.0008) +[2023-10-14 05:23:22,182][100917] Updated weights for policy 1, policy_version 9062 (0.0010) +[2023-10-14 05:23:22,558][100917] Updated weights for policy 1, policy_version 9072 (0.0011) +[2023-10-14 05:23:22,930][100917] Updated weights for policy 1, policy_version 9082 (0.0011) +[2023-10-14 05:23:23,458][100936] Updated weights for policy 0, policy_version 9090 (0.0008) +[2023-10-14 05:23:23,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 18612224. Throughput: 0: 1654.1, 1: 1652.9. Samples: 4664524. Policy #0 lag: (min: 26.0, avg: 26.0, max: 30.0) +[2023-10-14 05:23:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:23:23,837][100936] Updated weights for policy 0, policy_version 9100 (0.0009) +[2023-10-14 05:23:24,204][100936] Updated weights for policy 0, policy_version 9110 (0.0008) +[2023-10-14 05:23:24,578][100936] Updated weights for policy 0, policy_version 9120 (0.0007) +[2023-10-14 05:23:26,917][100917] Updated weights for policy 1, policy_version 9092 (0.0011) +[2023-10-14 05:23:27,276][100917] Updated weights for policy 1, policy_version 9102 (0.0010) +[2023-10-14 05:23:27,646][100917] Updated weights for policy 1, policy_version 9112 (0.0008) +[2023-10-14 05:23:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 18677760. Throughput: 0: 1654.4, 1: 1669.2. Samples: 4674474. Policy #0 lag: (min: 26.0, avg: 26.0, max: 30.0) +[2023-10-14 05:23:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:23:28,643][100936] Updated weights for policy 0, policy_version 9130 (0.0008) +[2023-10-14 05:23:29,018][100936] Updated weights for policy 0, policy_version 9140 (0.0008) +[2023-10-14 05:23:29,386][100936] Updated weights for policy 0, policy_version 9150 (0.0008) +[2023-10-14 05:23:31,787][100917] Updated weights for policy 1, policy_version 9122 (0.0007) +[2023-10-14 05:23:32,155][100917] Updated weights for policy 1, policy_version 9132 (0.0007) +[2023-10-14 05:23:32,531][100917] Updated weights for policy 1, policy_version 9142 (0.0007) +[2023-10-14 05:23:32,902][100917] Updated weights for policy 1, policy_version 9152 (0.0008) +[2023-10-14 05:23:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 18743296. Throughput: 0: 1648.7, 1: 1667.5. Samples: 4694468. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) +[2023-10-14 05:23:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:23:33,684][100936] Updated weights for policy 0, policy_version 9160 (0.0007) +[2023-10-14 05:23:34,048][100936] Updated weights for policy 0, policy_version 9170 (0.0009) +[2023-10-14 05:23:34,426][100936] Updated weights for policy 0, policy_version 9180 (0.0007) +[2023-10-14 05:23:37,145][100917] Updated weights for policy 1, policy_version 9162 (0.0008) +[2023-10-14 05:23:37,514][100917] Updated weights for policy 1, policy_version 9172 (0.0009) +[2023-10-14 05:23:37,892][100917] Updated weights for policy 1, policy_version 9182 (0.0011) +[2023-10-14 05:23:38,449][100936] Updated weights for policy 0, policy_version 9190 (0.0008) +[2023-10-14 05:23:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 18808832. Throughput: 0: 1641.8, 1: 1658.9. Samples: 4713750. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) +[2023-10-14 05:23:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:23:38,524][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000009184_9404416.pth... +[2023-10-14 05:23:38,558][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000007616_7798784.pth +[2023-10-14 05:23:38,821][100936] Updated weights for policy 0, policy_version 9200 (0.0007) +[2023-10-14 05:23:39,195][100936] Updated weights for policy 0, policy_version 9210 (0.0007) +[2023-10-14 05:23:39,412][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000009216_9437184.pth... +[2023-10-14 05:23:39,442][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000007648_7831552.pth +[2023-10-14 05:23:42,130][100917] Updated weights for policy 1, policy_version 9192 (0.0011) +[2023-10-14 05:23:42,493][100917] Updated weights for policy 1, policy_version 9202 (0.0011) +[2023-10-14 05:23:42,873][100917] Updated weights for policy 1, policy_version 9212 (0.0010) +[2023-10-14 05:23:43,411][100936] Updated weights for policy 0, policy_version 9220 (0.0009) +[2023-10-14 05:23:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 18874368. Throughput: 0: 1648.4, 1: 1671.7. Samples: 4724094. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:23:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:23:43,778][100936] Updated weights for policy 0, policy_version 9230 (0.0009) +[2023-10-14 05:23:44,144][100936] Updated weights for policy 0, policy_version 9240 (0.0010) +[2023-10-14 05:23:46,970][100917] Updated weights for policy 1, policy_version 9222 (0.0009) +[2023-10-14 05:23:47,345][100917] Updated weights for policy 1, policy_version 9232 (0.0008) +[2023-10-14 05:23:47,720][100917] Updated weights for policy 1, policy_version 9242 (0.0009) +[2023-10-14 05:23:48,205][100936] Updated weights for policy 0, policy_version 9250 (0.0009) +[2023-10-14 05:23:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 18939904. Throughput: 0: 1653.2, 1: 1660.9. Samples: 4744344. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:23:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:23:48,568][100936] Updated weights for policy 0, policy_version 9260 (0.0008) +[2023-10-14 05:23:48,948][100936] Updated weights for policy 0, policy_version 9270 (0.0010) +[2023-10-14 05:23:49,316][100936] Updated weights for policy 0, policy_version 9280 (0.0010) +[2023-10-14 05:23:51,612][100917] Updated weights for policy 1, policy_version 9252 (0.0008) +[2023-10-14 05:23:51,994][100917] Updated weights for policy 1, policy_version 9262 (0.0010) +[2023-10-14 05:23:52,356][100917] Updated weights for policy 1, policy_version 9272 (0.0009) +[2023-10-14 05:23:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 19005440. Throughput: 0: 1644.1, 1: 1658.8. Samples: 4763400. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:23:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:23:53,588][100936] Updated weights for policy 0, policy_version 9290 (0.0009) +[2023-10-14 05:23:53,951][100936] Updated weights for policy 0, policy_version 9300 (0.0010) +[2023-10-14 05:23:54,327][100936] Updated weights for policy 0, policy_version 9310 (0.0011) +[2023-10-14 05:23:56,451][100917] Updated weights for policy 1, policy_version 9282 (0.0007) +[2023-10-14 05:23:56,819][100917] Updated weights for policy 1, policy_version 9292 (0.0010) +[2023-10-14 05:23:57,199][100917] Updated weights for policy 1, policy_version 9302 (0.0007) +[2023-10-14 05:23:57,577][100917] Updated weights for policy 1, policy_version 9312 (0.0009) +[2023-10-14 05:23:58,310][100936] Updated weights for policy 0, policy_version 9320 (0.0008) +[2023-10-14 05:23:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 19070976. Throughput: 0: 1653.3, 1: 1665.3. Samples: 4773918. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:23:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:23:58,686][100936] Updated weights for policy 0, policy_version 9330 (0.0007) +[2023-10-14 05:23:59,058][100936] Updated weights for policy 0, policy_version 9340 (0.0007) +[2023-10-14 05:24:01,510][100917] Updated weights for policy 1, policy_version 9322 (0.0010) +[2023-10-14 05:24:01,885][100917] Updated weights for policy 1, policy_version 9332 (0.0010) +[2023-10-14 05:24:02,265][100917] Updated weights for policy 1, policy_version 9342 (0.0010) +[2023-10-14 05:24:03,158][100936] Updated weights for policy 0, policy_version 9350 (0.0007) +[2023-10-14 05:24:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 19136512. Throughput: 0: 1655.3, 1: 1649.5. Samples: 4793618. Policy #0 lag: (min: 31.0, avg: 34.5, max: 63.0) +[2023-10-14 05:24:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:24:03,530][100936] Updated weights for policy 0, policy_version 9360 (0.0008) +[2023-10-14 05:24:03,899][100936] Updated weights for policy 0, policy_version 9370 (0.0008) +[2023-10-14 05:24:06,414][100917] Updated weights for policy 1, policy_version 9352 (0.0008) +[2023-10-14 05:24:06,798][100917] Updated weights for policy 1, policy_version 9362 (0.0008) +[2023-10-14 05:24:07,173][100917] Updated weights for policy 1, policy_version 9372 (0.0007) +[2023-10-14 05:24:08,058][100936] Updated weights for policy 0, policy_version 9380 (0.0010) +[2023-10-14 05:24:08,432][100936] Updated weights for policy 0, policy_version 9390 (0.0010) +[2023-10-14 05:24:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 19202048. Throughput: 0: 1641.7, 1: 1659.2. Samples: 4813064. Policy #0 lag: (min: 31.0, avg: 34.5, max: 63.0) +[2023-10-14 05:24:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:24:08,810][100936] Updated weights for policy 0, policy_version 9400 (0.0009) +[2023-10-14 05:24:11,306][100917] Updated weights for policy 1, policy_version 9382 (0.0007) +[2023-10-14 05:24:11,679][100917] Updated weights for policy 1, policy_version 9392 (0.0007) +[2023-10-14 05:24:12,061][100917] Updated weights for policy 1, policy_version 9402 (0.0008) +[2023-10-14 05:24:13,189][100936] Updated weights for policy 0, policy_version 9410 (0.0007) +[2023-10-14 05:24:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 19267584. Throughput: 0: 1649.4, 1: 1667.1. Samples: 4823716. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) +[2023-10-14 05:24:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:24:13,557][100936] Updated weights for policy 0, policy_version 9420 (0.0008) +[2023-10-14 05:24:13,935][100936] Updated weights for policy 0, policy_version 9430 (0.0008) +[2023-10-14 05:24:14,310][100936] Updated weights for policy 0, policy_version 9440 (0.0009) +[2023-10-14 05:24:16,238][100917] Updated weights for policy 1, policy_version 9412 (0.0007) +[2023-10-14 05:24:16,615][100917] Updated weights for policy 1, policy_version 9422 (0.0008) +[2023-10-14 05:24:16,989][100917] Updated weights for policy 1, policy_version 9432 (0.0007) +[2023-10-14 05:24:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 19333120. Throughput: 0: 1650.3, 1: 1651.9. Samples: 4843064. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) +[2023-10-14 05:24:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:24:18,558][100936] Updated weights for policy 0, policy_version 9450 (0.0010) +[2023-10-14 05:24:18,925][100936] Updated weights for policy 0, policy_version 9460 (0.0009) +[2023-10-14 05:24:19,299][100936] Updated weights for policy 0, policy_version 9470 (0.0010) +[2023-10-14 05:24:21,057][100917] Updated weights for policy 1, policy_version 9442 (0.0008) +[2023-10-14 05:24:21,433][100917] Updated weights for policy 1, policy_version 9452 (0.0008) +[2023-10-14 05:24:21,806][100917] Updated weights for policy 1, policy_version 9462 (0.0007) +[2023-10-14 05:24:22,184][100917] Updated weights for policy 1, policy_version 9472 (0.0009) +[2023-10-14 05:24:23,400][100936] Updated weights for policy 0, policy_version 9480 (0.0008) +[2023-10-14 05:24:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 19398656. Throughput: 0: 1642.7, 1: 1670.4. Samples: 4862838. Policy #0 lag: (min: 31.0, avg: 39.9, max: 63.0) +[2023-10-14 05:24:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:24:23,763][100936] Updated weights for policy 0, policy_version 9490 (0.0009) +[2023-10-14 05:24:24,145][100936] Updated weights for policy 0, policy_version 9500 (0.0007) +[2023-10-14 05:24:26,231][100917] Updated weights for policy 1, policy_version 9482 (0.0008) +[2023-10-14 05:24:26,599][100917] Updated weights for policy 1, policy_version 9492 (0.0009) +[2023-10-14 05:24:26,967][100917] Updated weights for policy 1, policy_version 9502 (0.0010) +[2023-10-14 05:24:28,263][100936] Updated weights for policy 0, policy_version 9510 (0.0007) +[2023-10-14 05:24:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 19464192. Throughput: 0: 1644.9, 1: 1668.4. Samples: 4873194. Policy #0 lag: (min: 31.0, avg: 39.9, max: 63.0) +[2023-10-14 05:24:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:24:28,638][100936] Updated weights for policy 0, policy_version 9520 (0.0007) +[2023-10-14 05:24:29,006][100936] Updated weights for policy 0, policy_version 9530 (0.0008) +[2023-10-14 05:24:31,089][100917] Updated weights for policy 1, policy_version 9512 (0.0009) +[2023-10-14 05:24:31,463][100917] Updated weights for policy 1, policy_version 9522 (0.0009) +[2023-10-14 05:24:31,833][100917] Updated weights for policy 1, policy_version 9532 (0.0008) +[2023-10-14 05:24:32,969][100936] Updated weights for policy 0, policy_version 9540 (0.0008) +[2023-10-14 05:24:33,345][100936] Updated weights for policy 0, policy_version 9550 (0.0009) +[2023-10-14 05:24:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 19529728. Throughput: 0: 1647.0, 1: 1652.1. Samples: 4892800. Policy #0 lag: (min: 9.0, avg: 22.0, max: 41.0) +[2023-10-14 05:24:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:24:33,718][100936] Updated weights for policy 0, policy_version 9560 (0.0008) +[2023-10-14 05:24:35,818][100917] Updated weights for policy 1, policy_version 9542 (0.0008) +[2023-10-14 05:24:36,192][100917] Updated weights for policy 1, policy_version 9552 (0.0010) +[2023-10-14 05:24:36,562][100917] Updated weights for policy 1, policy_version 9562 (0.0008) +[2023-10-14 05:24:37,809][100936] Updated weights for policy 0, policy_version 9570 (0.0009) +[2023-10-14 05:24:38,174][100936] Updated weights for policy 0, policy_version 9580 (0.0008) +[2023-10-14 05:24:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 19595264. Throughput: 0: 1640.8, 1: 1670.6. Samples: 4912414. Policy #0 lag: (min: 9.0, avg: 22.0, max: 41.0) +[2023-10-14 05:24:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:24:38,560][100936] Updated weights for policy 0, policy_version 9590 (0.0008) +[2023-10-14 05:24:38,922][100936] Updated weights for policy 0, policy_version 9600 (0.0009) +[2023-10-14 05:24:40,576][100917] Updated weights for policy 1, policy_version 9572 (0.0008) +[2023-10-14 05:24:40,933][100917] Updated weights for policy 1, policy_version 9582 (0.0010) +[2023-10-14 05:24:41,302][100917] Updated weights for policy 1, policy_version 9592 (0.0009) +[2023-10-14 05:24:42,847][100936] Updated weights for policy 0, policy_version 9610 (0.0008) +[2023-10-14 05:24:43,216][100936] Updated weights for policy 0, policy_version 9620 (0.0007) +[2023-10-14 05:24:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 19660800. Throughput: 0: 1651.9, 1: 1660.9. Samples: 4922992. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:24:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:24:43,597][100936] Updated weights for policy 0, policy_version 9630 (0.0007) +[2023-10-14 05:24:45,463][100917] Updated weights for policy 1, policy_version 9602 (0.0009) +[2023-10-14 05:24:45,825][100917] Updated weights for policy 1, policy_version 9612 (0.0009) +[2023-10-14 05:24:46,204][100917] Updated weights for policy 1, policy_version 9622 (0.0008) +[2023-10-14 05:24:46,570][100917] Updated weights for policy 1, policy_version 9632 (0.0009) +[2023-10-14 05:24:47,749][100936] Updated weights for policy 0, policy_version 9640 (0.0008) +[2023-10-14 05:24:48,125][100936] Updated weights for policy 0, policy_version 9650 (0.0010) +[2023-10-14 05:24:48,502][100936] Updated weights for policy 0, policy_version 9660 (0.0008) +[2023-10-14 05:24:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 19726336. Throughput: 0: 1656.7, 1: 1660.1. Samples: 4942878. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:24:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:24:50,894][100917] Updated weights for policy 1, policy_version 9642 (0.0007) +[2023-10-14 05:24:51,278][100917] Updated weights for policy 1, policy_version 9652 (0.0009) +[2023-10-14 05:24:51,638][100917] Updated weights for policy 1, policy_version 9662 (0.0007) +[2023-10-14 05:24:52,527][100936] Updated weights for policy 0, policy_version 9670 (0.0010) +[2023-10-14 05:24:52,885][100936] Updated weights for policy 0, policy_version 9680 (0.0010) +[2023-10-14 05:24:53,267][100936] Updated weights for policy 0, policy_version 9690 (0.0010) +[2023-10-14 05:24:53,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 19824640. Throughput: 0: 1642.2, 1: 1666.2. Samples: 4961940. Policy #0 lag: (min: 25.0, avg: 32.8, max: 57.0) +[2023-10-14 05:24:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:24:55,785][100917] Updated weights for policy 1, policy_version 9672 (0.0009) +[2023-10-14 05:24:56,144][100917] Updated weights for policy 1, policy_version 9682 (0.0010) +[2023-10-14 05:24:56,521][100917] Updated weights for policy 1, policy_version 9692 (0.0009) +[2023-10-14 05:24:57,665][100936] Updated weights for policy 0, policy_version 9700 (0.0010) +[2023-10-14 05:24:58,032][100936] Updated weights for policy 0, policy_version 9710 (0.0007) +[2023-10-14 05:24:58,396][100936] Updated weights for policy 0, policy_version 9720 (0.0009) +[2023-10-14 05:24:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 19857408. Throughput: 0: 1659.1, 1: 1654.2. Samples: 4972812. Policy #0 lag: (min: 25.0, avg: 32.8, max: 57.0) +[2023-10-14 05:24:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:25:00,647][100917] Updated weights for policy 1, policy_version 9702 (0.0008) +[2023-10-14 05:25:01,007][100917] Updated weights for policy 1, policy_version 9712 (0.0007) +[2023-10-14 05:25:01,387][100917] Updated weights for policy 1, policy_version 9722 (0.0008) +[2023-10-14 05:25:02,504][100936] Updated weights for policy 0, policy_version 9730 (0.0010) +[2023-10-14 05:25:02,889][100936] Updated weights for policy 0, policy_version 9740 (0.0008) +[2023-10-14 05:25:03,267][100936] Updated weights for policy 0, policy_version 9750 (0.0008) +[2023-10-14 05:25:03,512][99942] Fps is (10 sec: 9830.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 19922944. Throughput: 0: 1658.4, 1: 1656.8. Samples: 4992248. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:25:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:25:03,634][100936] Updated weights for policy 0, policy_version 9760 (0.0009) +[2023-10-14 05:25:05,501][100917] Updated weights for policy 1, policy_version 9732 (0.0010) +[2023-10-14 05:25:05,871][100917] Updated weights for policy 1, policy_version 9742 (0.0007) +[2023-10-14 05:25:06,244][100917] Updated weights for policy 1, policy_version 9752 (0.0007) +[2023-10-14 05:25:07,915][100936] Updated weights for policy 0, policy_version 9770 (0.0010) +[2023-10-14 05:25:08,289][100936] Updated weights for policy 0, policy_version 9780 (0.0009) +[2023-10-14 05:25:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 19988480. Throughput: 0: 1647.2, 1: 1658.8. Samples: 5011606. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:25:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:25:08,674][100936] Updated weights for policy 0, policy_version 9790 (0.0008) +[2023-10-14 05:25:10,397][100917] Updated weights for policy 1, policy_version 9762 (0.0008) +[2023-10-14 05:25:10,770][100917] Updated weights for policy 1, policy_version 9772 (0.0007) +[2023-10-14 05:25:11,139][100917] Updated weights for policy 1, policy_version 9782 (0.0007) +[2023-10-14 05:25:11,508][100917] Updated weights for policy 1, policy_version 9792 (0.0009) +[2023-10-14 05:25:12,625][100936] Updated weights for policy 0, policy_version 9800 (0.0008) +[2023-10-14 05:25:12,986][100936] Updated weights for policy 0, policy_version 9810 (0.0009) +[2023-10-14 05:25:13,359][100936] Updated weights for policy 0, policy_version 9820 (0.0007) +[2023-10-14 05:25:13,512][99942] Fps is (10 sec: 16384.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 20086784. Throughput: 0: 1661.9, 1: 1648.4. Samples: 5022158. Policy #0 lag: (min: 31.0, avg: 34.5, max: 63.0) +[2023-10-14 05:25:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:25:15,724][100917] Updated weights for policy 1, policy_version 9802 (0.0007) +[2023-10-14 05:25:16,105][100917] Updated weights for policy 1, policy_version 9812 (0.0008) +[2023-10-14 05:25:16,483][100917] Updated weights for policy 1, policy_version 9822 (0.0009) +[2023-10-14 05:25:17,642][100936] Updated weights for policy 0, policy_version 9830 (0.0008) +[2023-10-14 05:25:18,011][100936] Updated weights for policy 0, policy_version 9840 (0.0011) +[2023-10-14 05:25:18,388][100936] Updated weights for policy 0, policy_version 9850 (0.0010) +[2023-10-14 05:25:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 20119552. Throughput: 0: 1654.3, 1: 1660.5. Samples: 5041966. Policy #0 lag: (min: 31.0, avg: 34.5, max: 63.0) +[2023-10-14 05:25:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:25:20,496][100917] Updated weights for policy 1, policy_version 9832 (0.0009) +[2023-10-14 05:25:20,881][100917] Updated weights for policy 1, policy_version 9842 (0.0008) +[2023-10-14 05:25:21,254][100917] Updated weights for policy 1, policy_version 9852 (0.0008) +[2023-10-14 05:25:22,483][100936] Updated weights for policy 0, policy_version 9860 (0.0007) +[2023-10-14 05:25:22,859][100936] Updated weights for policy 0, policy_version 9870 (0.0007) +[2023-10-14 05:25:23,226][100936] Updated weights for policy 0, policy_version 9880 (0.0008) +[2023-10-14 05:25:23,512][99942] Fps is (10 sec: 9830.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 20185088. Throughput: 0: 1649.6, 1: 1663.9. Samples: 5061522. Policy #0 lag: (min: 3.0, avg: 6.5, max: 35.0) +[2023-10-14 05:25:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:25:25,256][100917] Updated weights for policy 1, policy_version 9862 (0.0007) +[2023-10-14 05:25:25,629][100917] Updated weights for policy 1, policy_version 9872 (0.0011) +[2023-10-14 05:25:26,015][100917] Updated weights for policy 1, policy_version 9882 (0.0008) +[2023-10-14 05:25:27,404][100936] Updated weights for policy 0, policy_version 9890 (0.0009) +[2023-10-14 05:25:27,774][100936] Updated weights for policy 0, policy_version 9900 (0.0010) +[2023-10-14 05:25:28,142][100936] Updated weights for policy 0, policy_version 9910 (0.0009) +[2023-10-14 05:25:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 20250624. Throughput: 0: 1658.1, 1: 1654.6. Samples: 5072064. Policy #0 lag: (min: 3.0, avg: 6.5, max: 35.0) +[2023-10-14 05:25:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:25:28,519][100936] Updated weights for policy 0, policy_version 9920 (0.0007) +[2023-10-14 05:25:30,153][100917] Updated weights for policy 1, policy_version 9892 (0.0008) +[2023-10-14 05:25:30,532][100917] Updated weights for policy 1, policy_version 9902 (0.0009) +[2023-10-14 05:25:30,919][100917] Updated weights for policy 1, policy_version 9912 (0.0010) +[2023-10-14 05:25:32,662][100936] Updated weights for policy 0, policy_version 9930 (0.0008) +[2023-10-14 05:25:33,030][100936] Updated weights for policy 0, policy_version 9940 (0.0007) +[2023-10-14 05:25:33,403][100936] Updated weights for policy 0, policy_version 9950 (0.0008) +[2023-10-14 05:25:33,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 20348928. Throughput: 0: 1650.5, 1: 1664.6. Samples: 5092058. Policy #0 lag: (min: 24.0, avg: 35.5, max: 56.0) +[2023-10-14 05:25:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:25:35,074][100917] Updated weights for policy 1, policy_version 9922 (0.0010) +[2023-10-14 05:25:35,438][100917] Updated weights for policy 1, policy_version 9932 (0.0007) +[2023-10-14 05:25:35,811][100917] Updated weights for policy 1, policy_version 9942 (0.0007) +[2023-10-14 05:25:36,171][100917] Updated weights for policy 1, policy_version 9952 (0.0010) +[2023-10-14 05:25:37,557][100936] Updated weights for policy 0, policy_version 9960 (0.0011) +[2023-10-14 05:25:37,918][100936] Updated weights for policy 0, policy_version 9970 (0.0011) +[2023-10-14 05:25:38,287][100936] Updated weights for policy 0, policy_version 9980 (0.0011) +[2023-10-14 05:25:38,512][99942] Fps is (10 sec: 16383.4, 60 sec: 13653.2, 300 sec: 13329.3). Total num frames: 20414464. Throughput: 0: 1649.5, 1: 1665.9. Samples: 5111138. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:25:38,514][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:25:38,525][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000009952_10190848.pth... +[2023-10-14 05:25:38,526][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000009984_10223616.pth... +[2023-10-14 05:25:38,555][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000008416_8617984.pth +[2023-10-14 05:25:38,558][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000008416_8617984.pth +[2023-10-14 05:25:40,321][100917] Updated weights for policy 1, policy_version 9962 (0.0008) +[2023-10-14 05:25:40,687][100917] Updated weights for policy 1, policy_version 9972 (0.0007) +[2023-10-14 05:25:41,066][100917] Updated weights for policy 1, policy_version 9982 (0.0010) +[2023-10-14 05:25:42,541][100936] Updated weights for policy 0, policy_version 9990 (0.0007) +[2023-10-14 05:25:42,914][100936] Updated weights for policy 0, policy_version 10000 (0.0007) +[2023-10-14 05:25:43,294][100936] Updated weights for policy 0, policy_version 10010 (0.0008) +[2023-10-14 05:25:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 20480000. Throughput: 0: 1652.1, 1: 1653.4. Samples: 5121560. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:25:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:25:45,266][100917] Updated weights for policy 1, policy_version 9992 (0.0008) +[2023-10-14 05:25:45,627][100917] Updated weights for policy 1, policy_version 10002 (0.0009) +[2023-10-14 05:25:45,999][100917] Updated weights for policy 1, policy_version 10012 (0.0008) +[2023-10-14 05:25:47,544][100936] Updated weights for policy 0, policy_version 10020 (0.0007) +[2023-10-14 05:25:47,912][100936] Updated weights for policy 0, policy_version 10030 (0.0009) +[2023-10-14 05:25:48,285][100936] Updated weights for policy 0, policy_version 10040 (0.0008) +[2023-10-14 05:25:48,512][99942] Fps is (10 sec: 9830.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 20512768. Throughput: 0: 1656.3, 1: 1659.9. Samples: 5141478. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:25:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:25:50,137][100917] Updated weights for policy 1, policy_version 10022 (0.0008) +[2023-10-14 05:25:50,500][100917] Updated weights for policy 1, policy_version 10032 (0.0009) +[2023-10-14 05:25:50,872][100917] Updated weights for policy 1, policy_version 10042 (0.0010) +[2023-10-14 05:25:52,291][100936] Updated weights for policy 0, policy_version 10050 (0.0008) +[2023-10-14 05:25:52,707][100936] Updated weights for policy 0, policy_version 10060 (0.0011) +[2023-10-14 05:25:53,077][100936] Updated weights for policy 0, policy_version 10070 (0.0010) +[2023-10-14 05:25:53,443][100936] Updated weights for policy 0, policy_version 10080 (0.0009) +[2023-10-14 05:25:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 20611072. Throughput: 0: 1650.4, 1: 1661.5. Samples: 5160640. Policy #0 lag: (min: 31.0, avg: 36.1, max: 63.0) +[2023-10-14 05:25:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:25:54,932][100917] Updated weights for policy 1, policy_version 10052 (0.0011) +[2023-10-14 05:25:55,293][100917] Updated weights for policy 1, policy_version 10062 (0.0007) +[2023-10-14 05:25:55,674][100917] Updated weights for policy 1, policy_version 10072 (0.0008) +[2023-10-14 05:25:57,588][100936] Updated weights for policy 0, policy_version 10090 (0.0008) +[2023-10-14 05:25:57,960][100936] Updated weights for policy 0, policy_version 10100 (0.0007) +[2023-10-14 05:25:58,336][100936] Updated weights for policy 0, policy_version 10110 (0.0008) +[2023-10-14 05:25:58,512][99942] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 20676608. Throughput: 0: 1651.9, 1: 1648.0. Samples: 5170656. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:25:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:25:59,784][100917] Updated weights for policy 1, policy_version 10082 (0.0010) +[2023-10-14 05:26:00,162][100917] Updated weights for policy 1, policy_version 10092 (0.0007) +[2023-10-14 05:26:00,538][100917] Updated weights for policy 1, policy_version 10102 (0.0007) +[2023-10-14 05:26:00,911][100917] Updated weights for policy 1, policy_version 10112 (0.0007) +[2023-10-14 05:26:02,448][100936] Updated weights for policy 0, policy_version 10120 (0.0008) +[2023-10-14 05:26:02,818][100936] Updated weights for policy 0, policy_version 10130 (0.0007) +[2023-10-14 05:26:03,192][100936] Updated weights for policy 0, policy_version 10140 (0.0009) +[2023-10-14 05:26:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 20742144. Throughput: 0: 1649.5, 1: 1658.4. Samples: 5190818. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:26:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 05:26:04,950][100917] Updated weights for policy 1, policy_version 10122 (0.0010) +[2023-10-14 05:26:05,333][100917] Updated weights for policy 1, policy_version 10132 (0.0007) +[2023-10-14 05:26:05,709][100917] Updated weights for policy 1, policy_version 10142 (0.0009) +[2023-10-14 05:26:07,399][100936] Updated weights for policy 0, policy_version 10150 (0.0010) +[2023-10-14 05:26:07,772][100936] Updated weights for policy 0, policy_version 10160 (0.0007) +[2023-10-14 05:26:08,137][100936] Updated weights for policy 0, policy_version 10170 (0.0007) +[2023-10-14 05:26:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 20807680. Throughput: 0: 1647.5, 1: 1655.9. Samples: 5210178. Policy #0 lag: (min: 31.0, avg: 31.1, max: 39.0) +[2023-10-14 05:26:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 05:26:09,817][100917] Updated weights for policy 1, policy_version 10152 (0.0007) +[2023-10-14 05:26:10,195][100917] Updated weights for policy 1, policy_version 10162 (0.0008) +[2023-10-14 05:26:10,562][100917] Updated weights for policy 1, policy_version 10172 (0.0007) +[2023-10-14 05:26:12,205][100936] Updated weights for policy 0, policy_version 10180 (0.0008) +[2023-10-14 05:26:12,575][100936] Updated weights for policy 0, policy_version 10190 (0.0007) +[2023-10-14 05:26:12,940][100936] Updated weights for policy 0, policy_version 10200 (0.0007) +[2023-10-14 05:26:13,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 20873216. Throughput: 0: 1649.4, 1: 1646.8. Samples: 5220394. Policy #0 lag: (min: 31.0, avg: 31.1, max: 39.0) +[2023-10-14 05:26:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.970')] +[2023-10-14 05:26:14,725][100917] Updated weights for policy 1, policy_version 10182 (0.0008) +[2023-10-14 05:26:15,094][100917] Updated weights for policy 1, policy_version 10192 (0.0010) +[2023-10-14 05:26:15,465][100917] Updated weights for policy 1, policy_version 10202 (0.0010) +[2023-10-14 05:26:17,178][100936] Updated weights for policy 0, policy_version 10210 (0.0007) +[2023-10-14 05:26:17,546][100936] Updated weights for policy 0, policy_version 10220 (0.0009) +[2023-10-14 05:26:17,928][100936] Updated weights for policy 0, policy_version 10230 (0.0009) +[2023-10-14 05:26:18,298][100936] Updated weights for policy 0, policy_version 10240 (0.0008) +[2023-10-14 05:26:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 20938752. Throughput: 0: 1643.9, 1: 1651.9. Samples: 5240366. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-10-14 05:26:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.970')] +[2023-10-14 05:26:19,610][100917] Updated weights for policy 1, policy_version 10212 (0.0009) +[2023-10-14 05:26:19,992][100917] Updated weights for policy 1, policy_version 10222 (0.0009) +[2023-10-14 05:26:20,364][100917] Updated weights for policy 1, policy_version 10232 (0.0009) +[2023-10-14 05:26:22,327][100936] Updated weights for policy 0, policy_version 10250 (0.0007) +[2023-10-14 05:26:22,705][100936] Updated weights for policy 0, policy_version 10260 (0.0008) +[2023-10-14 05:26:23,067][100936] Updated weights for policy 0, policy_version 10270 (0.0007) +[2023-10-14 05:26:23,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 21004288. Throughput: 0: 1649.7, 1: 1655.6. Samples: 5259878. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-10-14 05:26:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.970')] +[2023-10-14 05:26:24,523][100917] Updated weights for policy 1, policy_version 10242 (0.0010) +[2023-10-14 05:26:24,919][100917] Updated weights for policy 1, policy_version 10252 (0.0008) +[2023-10-14 05:26:25,296][100917] Updated weights for policy 1, policy_version 10262 (0.0008) +[2023-10-14 05:26:25,672][100917] Updated weights for policy 1, policy_version 10272 (0.0008) +[2023-10-14 05:26:27,216][100936] Updated weights for policy 0, policy_version 10280 (0.0010) +[2023-10-14 05:26:27,593][100936] Updated weights for policy 0, policy_version 10290 (0.0009) +[2023-10-14 05:26:27,975][100936] Updated weights for policy 0, policy_version 10300 (0.0008) +[2023-10-14 05:26:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 21069824. Throughput: 0: 1652.7, 1: 1645.9. Samples: 5270002. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) +[2023-10-14 05:26:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.970')] +[2023-10-14 05:26:29,723][100917] Updated weights for policy 1, policy_version 10282 (0.0008) +[2023-10-14 05:26:30,101][100917] Updated weights for policy 1, policy_version 10292 (0.0007) +[2023-10-14 05:26:30,481][100917] Updated weights for policy 1, policy_version 10302 (0.0007) +[2023-10-14 05:26:31,979][100936] Updated weights for policy 0, policy_version 10310 (0.0007) +[2023-10-14 05:26:32,349][100936] Updated weights for policy 0, policy_version 10320 (0.0007) +[2023-10-14 05:26:32,723][100936] Updated weights for policy 0, policy_version 10330 (0.0008) +[2023-10-14 05:26:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 21135360. Throughput: 0: 1640.7, 1: 1657.0. Samples: 5289874. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) +[2023-10-14 05:26:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.970')] +[2023-10-14 05:26:34,706][100917] Updated weights for policy 1, policy_version 10312 (0.0007) +[2023-10-14 05:26:35,086][100917] Updated weights for policy 1, policy_version 10322 (0.0007) +[2023-10-14 05:26:35,467][100917] Updated weights for policy 1, policy_version 10332 (0.0007) +[2023-10-14 05:26:36,896][100936] Updated weights for policy 0, policy_version 10340 (0.0009) +[2023-10-14 05:26:37,281][100936] Updated weights for policy 0, policy_version 10350 (0.0010) +[2023-10-14 05:26:37,658][100936] Updated weights for policy 0, policy_version 10360 (0.0009) +[2023-10-14 05:26:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 21200896. Throughput: 0: 1651.6, 1: 1659.5. Samples: 5309640. Policy #0 lag: (min: 22.0, avg: 28.0, max: 54.0) +[2023-10-14 05:26:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.970')] +[2023-10-14 05:26:39,510][100917] Updated weights for policy 1, policy_version 10342 (0.0008) +[2023-10-14 05:26:39,878][100917] Updated weights for policy 1, policy_version 10352 (0.0008) +[2023-10-14 05:26:40,256][100917] Updated weights for policy 1, policy_version 10362 (0.0008) +[2023-10-14 05:26:41,815][100936] Updated weights for policy 0, policy_version 10370 (0.0007) +[2023-10-14 05:26:42,196][100936] Updated weights for policy 0, policy_version 10380 (0.0007) +[2023-10-14 05:26:42,567][100936] Updated weights for policy 0, policy_version 10390 (0.0007) +[2023-10-14 05:26:42,932][100936] Updated weights for policy 0, policy_version 10400 (0.0007) +[2023-10-14 05:26:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 21266432. Throughput: 0: 1659.0, 1: 1660.7. Samples: 5320044. Policy #0 lag: (min: 22.0, avg: 28.0, max: 54.0) +[2023-10-14 05:26:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.970')] +[2023-10-14 05:26:44,356][100917] Updated weights for policy 1, policy_version 10372 (0.0008) +[2023-10-14 05:26:44,737][100917] Updated weights for policy 1, policy_version 10382 (0.0009) +[2023-10-14 05:26:45,123][100917] Updated weights for policy 1, policy_version 10392 (0.0007) +[2023-10-14 05:26:46,906][100936] Updated weights for policy 0, policy_version 10410 (0.0009) +[2023-10-14 05:26:47,278][100936] Updated weights for policy 0, policy_version 10420 (0.0008) +[2023-10-14 05:26:47,649][100936] Updated weights for policy 0, policy_version 10430 (0.0010) +[2023-10-14 05:26:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 21331968. Throughput: 0: 1644.1, 1: 1663.6. Samples: 5339664. Policy #0 lag: (min: 31.0, avg: 46.3, max: 63.0) +[2023-10-14 05:26:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.970')] +[2023-10-14 05:26:49,314][100917] Updated weights for policy 1, policy_version 10402 (0.0009) +[2023-10-14 05:26:49,682][100917] Updated weights for policy 1, policy_version 10412 (0.0010) +[2023-10-14 05:26:50,058][100917] Updated weights for policy 1, policy_version 10422 (0.0009) +[2023-10-14 05:26:50,444][100917] Updated weights for policy 1, policy_version 10432 (0.0007) +[2023-10-14 05:26:51,570][100936] Updated weights for policy 0, policy_version 10440 (0.0009) +[2023-10-14 05:26:51,939][100936] Updated weights for policy 0, policy_version 10450 (0.0008) +[2023-10-14 05:26:52,317][100936] Updated weights for policy 0, policy_version 10460 (0.0007) +[2023-10-14 05:26:53,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 21397504. Throughput: 0: 1665.6, 1: 1665.5. Samples: 5360078. Policy #0 lag: (min: 31.0, avg: 46.3, max: 63.0) +[2023-10-14 05:26:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.970')] +[2023-10-14 05:26:54,455][100917] Updated weights for policy 1, policy_version 10442 (0.0008) +[2023-10-14 05:26:54,831][100917] Updated weights for policy 1, policy_version 10452 (0.0008) +[2023-10-14 05:26:55,211][100917] Updated weights for policy 1, policy_version 10462 (0.0009) +[2023-10-14 05:26:56,574][100936] Updated weights for policy 0, policy_version 10470 (0.0009) +[2023-10-14 05:26:56,951][100936] Updated weights for policy 0, policy_version 10480 (0.0007) +[2023-10-14 05:26:57,326][100936] Updated weights for policy 0, policy_version 10490 (0.0008) +[2023-10-14 05:26:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 21463040. Throughput: 0: 1666.6, 1: 1664.8. Samples: 5370310. Policy #0 lag: (min: 28.0, avg: 28.1, max: 33.0) +[2023-10-14 05:26:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.970')] +[2023-10-14 05:26:59,371][100917] Updated weights for policy 1, policy_version 10472 (0.0010) +[2023-10-14 05:26:59,740][100917] Updated weights for policy 1, policy_version 10482 (0.0009) +[2023-10-14 05:27:00,115][100917] Updated weights for policy 1, policy_version 10492 (0.0010) +[2023-10-14 05:27:01,408][100936] Updated weights for policy 0, policy_version 10500 (0.0009) +[2023-10-14 05:27:01,786][100936] Updated weights for policy 0, policy_version 10510 (0.0008) +[2023-10-14 05:27:02,155][100936] Updated weights for policy 0, policy_version 10520 (0.0008) +[2023-10-14 05:27:03,512][99942] Fps is (10 sec: 13107.7, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 21528576. Throughput: 0: 1649.3, 1: 1663.9. Samples: 5389460. Policy #0 lag: (min: 28.0, avg: 28.1, max: 33.0) +[2023-10-14 05:27:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.970')] +[2023-10-14 05:27:04,188][100917] Updated weights for policy 1, policy_version 10502 (0.0009) +[2023-10-14 05:27:04,565][100917] Updated weights for policy 1, policy_version 10512 (0.0009) +[2023-10-14 05:27:04,933][100917] Updated weights for policy 1, policy_version 10522 (0.0008) +[2023-10-14 05:27:06,283][100936] Updated weights for policy 0, policy_version 10530 (0.0008) +[2023-10-14 05:27:06,650][100936] Updated weights for policy 0, policy_version 10540 (0.0011) +[2023-10-14 05:27:07,028][100936] Updated weights for policy 0, policy_version 10550 (0.0009) +[2023-10-14 05:27:07,400][100936] Updated weights for policy 0, policy_version 10560 (0.0010) +[2023-10-14 05:27:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 21594112. Throughput: 0: 1664.9, 1: 1667.1. Samples: 5409820. Policy #0 lag: (min: 25.0, avg: 26.9, max: 52.0) +[2023-10-14 05:27:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.970')] +[2023-10-14 05:27:08,923][100917] Updated weights for policy 1, policy_version 10532 (0.0008) +[2023-10-14 05:27:09,295][100917] Updated weights for policy 1, policy_version 10542 (0.0010) +[2023-10-14 05:27:09,666][100917] Updated weights for policy 1, policy_version 10552 (0.0011) +[2023-10-14 05:27:11,579][100936] Updated weights for policy 0, policy_version 10570 (0.0008) +[2023-10-14 05:27:11,944][100936] Updated weights for policy 0, policy_version 10580 (0.0010) +[2023-10-14 05:27:12,315][100936] Updated weights for policy 0, policy_version 10590 (0.0009) +[2023-10-14 05:27:13,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 21659648. Throughput: 0: 1661.6, 1: 1670.3. Samples: 5419934. Policy #0 lag: (min: 25.0, avg: 26.9, max: 52.0) +[2023-10-14 05:27:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:27:13,831][100917] Updated weights for policy 1, policy_version 10562 (0.0009) +[2023-10-14 05:27:14,235][100917] Updated weights for policy 1, policy_version 10572 (0.0007) +[2023-10-14 05:27:14,599][100917] Updated weights for policy 1, policy_version 10582 (0.0009) +[2023-10-14 05:27:14,965][100917] Updated weights for policy 1, policy_version 10592 (0.0010) +[2023-10-14 05:27:16,445][100936] Updated weights for policy 0, policy_version 10600 (0.0010) +[2023-10-14 05:27:16,817][100936] Updated weights for policy 0, policy_version 10610 (0.0010) +[2023-10-14 05:27:17,190][100936] Updated weights for policy 0, policy_version 10620 (0.0009) +[2023-10-14 05:27:18,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 21725184. Throughput: 0: 1647.3, 1: 1668.3. Samples: 5439076. Policy #0 lag: (min: 30.0, avg: 30.1, max: 39.0) +[2023-10-14 05:27:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:27:19,111][100917] Updated weights for policy 1, policy_version 10602 (0.0008) +[2023-10-14 05:27:19,495][100917] Updated weights for policy 1, policy_version 10612 (0.0009) +[2023-10-14 05:27:19,862][100917] Updated weights for policy 1, policy_version 10622 (0.0009) +[2023-10-14 05:27:21,260][100936] Updated weights for policy 0, policy_version 10630 (0.0008) +[2023-10-14 05:27:21,628][100936] Updated weights for policy 0, policy_version 10640 (0.0010) +[2023-10-14 05:27:22,002][100936] Updated weights for policy 0, policy_version 10650 (0.0011) +[2023-10-14 05:27:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 21790720. Throughput: 0: 1673.4, 1: 1666.7. Samples: 5459946. Policy #0 lag: (min: 30.0, avg: 30.1, max: 39.0) +[2023-10-14 05:27:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:27:24,032][100917] Updated weights for policy 1, policy_version 10632 (0.0009) +[2023-10-14 05:27:24,408][100917] Updated weights for policy 1, policy_version 10642 (0.0007) +[2023-10-14 05:27:24,782][100917] Updated weights for policy 1, policy_version 10652 (0.0011) +[2023-10-14 05:27:26,077][100936] Updated weights for policy 0, policy_version 10660 (0.0008) +[2023-10-14 05:27:26,471][100936] Updated weights for policy 0, policy_version 10670 (0.0007) +[2023-10-14 05:27:26,837][100936] Updated weights for policy 0, policy_version 10680 (0.0007) +[2023-10-14 05:27:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 21856256. Throughput: 0: 1658.8, 1: 1663.2. Samples: 5469536. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) +[2023-10-14 05:27:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:27:28,934][100917] Updated weights for policy 1, policy_version 10662 (0.0009) +[2023-10-14 05:27:29,294][100917] Updated weights for policy 1, policy_version 10672 (0.0009) +[2023-10-14 05:27:29,663][100917] Updated weights for policy 1, policy_version 10682 (0.0008) +[2023-10-14 05:27:30,794][100936] Updated weights for policy 0, policy_version 10690 (0.0008) +[2023-10-14 05:27:31,157][100936] Updated weights for policy 0, policy_version 10700 (0.0009) +[2023-10-14 05:27:31,520][100936] Updated weights for policy 0, policy_version 10710 (0.0009) +[2023-10-14 05:27:31,890][100936] Updated weights for policy 0, policy_version 10720 (0.0008) +[2023-10-14 05:27:33,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 21921792. Throughput: 0: 1665.5, 1: 1662.4. Samples: 5489422. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) +[2023-10-14 05:27:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:27:33,764][100917] Updated weights for policy 1, policy_version 10692 (0.0009) +[2023-10-14 05:27:34,139][100917] Updated weights for policy 1, policy_version 10702 (0.0007) +[2023-10-14 05:27:34,516][100917] Updated weights for policy 1, policy_version 10712 (0.0007) +[2023-10-14 05:27:36,221][100936] Updated weights for policy 0, policy_version 10730 (0.0008) +[2023-10-14 05:27:36,581][100936] Updated weights for policy 0, policy_version 10740 (0.0011) +[2023-10-14 05:27:36,951][100936] Updated weights for policy 0, policy_version 10750 (0.0008) +[2023-10-14 05:27:38,468][100917] Updated weights for policy 1, policy_version 10722 (0.0007) +[2023-10-14 05:27:38,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 21987328. Throughput: 0: 1666.1, 1: 1661.7. Samples: 5509826. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-10-14 05:27:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:27:38,519][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000010752_11010048.pth... +[2023-10-14 05:27:38,549][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000009216_9437184.pth +[2023-10-14 05:27:38,848][100917] Updated weights for policy 1, policy_version 10732 (0.0009) +[2023-10-14 05:27:39,231][100917] Updated weights for policy 1, policy_version 10742 (0.0010) +[2023-10-14 05:27:39,604][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000010752_11010048.pth... +[2023-10-14 05:27:39,605][100917] Updated weights for policy 1, policy_version 10752 (0.0010) +[2023-10-14 05:27:39,633][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000009184_9404416.pth +[2023-10-14 05:27:41,175][100936] Updated weights for policy 0, policy_version 10760 (0.0007) +[2023-10-14 05:27:41,546][100936] Updated weights for policy 0, policy_version 10770 (0.0007) +[2023-10-14 05:27:41,910][100936] Updated weights for policy 0, policy_version 10780 (0.0007) +[2023-10-14 05:27:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 22052864. Throughput: 0: 1653.8, 1: 1659.2. Samples: 5519394. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-10-14 05:27:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:27:43,681][100917] Updated weights for policy 1, policy_version 10762 (0.0008) +[2023-10-14 05:27:44,062][100917] Updated weights for policy 1, policy_version 10772 (0.0008) +[2023-10-14 05:27:44,426][100917] Updated weights for policy 1, policy_version 10782 (0.0009) +[2023-10-14 05:27:45,951][100936] Updated weights for policy 0, policy_version 10790 (0.0008) +[2023-10-14 05:27:46,322][100936] Updated weights for policy 0, policy_version 10800 (0.0008) +[2023-10-14 05:27:46,695][100936] Updated weights for policy 0, policy_version 10810 (0.0011) +[2023-10-14 05:27:48,512][100917] Updated weights for policy 1, policy_version 10792 (0.0009) +[2023-10-14 05:27:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 22118400. Throughput: 0: 1661.8, 1: 1666.8. Samples: 5539250. Policy #0 lag: (min: 29.0, avg: 35.9, max: 61.0) +[2023-10-14 05:27:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:27:48,890][100917] Updated weights for policy 1, policy_version 10802 (0.0008) +[2023-10-14 05:27:49,254][100917] Updated weights for policy 1, policy_version 10812 (0.0008) +[2023-10-14 05:27:50,611][100936] Updated weights for policy 0, policy_version 10820 (0.0010) +[2023-10-14 05:27:50,981][100936] Updated weights for policy 0, policy_version 10830 (0.0007) +[2023-10-14 05:27:51,347][100936] Updated weights for policy 0, policy_version 10840 (0.0009) +[2023-10-14 05:27:53,399][100917] Updated weights for policy 1, policy_version 10822 (0.0009) +[2023-10-14 05:27:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 22183936. Throughput: 0: 1678.2, 1: 1659.0. Samples: 5559994. Policy #0 lag: (min: 29.0, avg: 35.9, max: 61.0) +[2023-10-14 05:27:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:27:53,763][100917] Updated weights for policy 1, policy_version 10832 (0.0009) +[2023-10-14 05:27:54,131][100917] Updated weights for policy 1, policy_version 10842 (0.0010) +[2023-10-14 05:27:55,353][100936] Updated weights for policy 0, policy_version 10850 (0.0009) +[2023-10-14 05:27:55,728][100936] Updated weights for policy 0, policy_version 10860 (0.0008) +[2023-10-14 05:27:56,093][100936] Updated weights for policy 0, policy_version 10870 (0.0007) +[2023-10-14 05:27:56,470][100936] Updated weights for policy 0, policy_version 10880 (0.0007) +[2023-10-14 05:27:58,379][100917] Updated weights for policy 1, policy_version 10852 (0.0009) +[2023-10-14 05:27:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 22249472. Throughput: 0: 1655.6, 1: 1657.7. Samples: 5569032. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:27:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:27:58,771][100917] Updated weights for policy 1, policy_version 10862 (0.0007) +[2023-10-14 05:27:59,148][100917] Updated weights for policy 1, policy_version 10872 (0.0007) +[2023-10-14 05:28:00,611][100936] Updated weights for policy 0, policy_version 10890 (0.0008) +[2023-10-14 05:28:00,987][100936] Updated weights for policy 0, policy_version 10900 (0.0008) +[2023-10-14 05:28:01,352][100936] Updated weights for policy 0, policy_version 10910 (0.0008) +[2023-10-14 05:28:03,250][100917] Updated weights for policy 1, policy_version 10882 (0.0009) +[2023-10-14 05:28:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 22315008. Throughput: 0: 1682.7, 1: 1658.6. Samples: 5589434. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:28:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:28:03,624][100917] Updated weights for policy 1, policy_version 10892 (0.0008) +[2023-10-14 05:28:04,009][100917] Updated weights for policy 1, policy_version 10902 (0.0010) +[2023-10-14 05:28:04,373][100917] Updated weights for policy 1, policy_version 10912 (0.0008) +[2023-10-14 05:28:05,290][100936] Updated weights for policy 0, policy_version 10920 (0.0009) +[2023-10-14 05:28:05,663][100936] Updated weights for policy 0, policy_version 10930 (0.0009) +[2023-10-14 05:28:06,044][100936] Updated weights for policy 0, policy_version 10940 (0.0007) +[2023-10-14 05:28:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 22380544. Throughput: 0: 1674.7, 1: 1655.6. Samples: 5609812. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:28:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:28:08,542][100917] Updated weights for policy 1, policy_version 10922 (0.0009) +[2023-10-14 05:28:08,919][100917] Updated weights for policy 1, policy_version 10932 (0.0007) +[2023-10-14 05:28:09,298][100917] Updated weights for policy 1, policy_version 10942 (0.0009) +[2023-10-14 05:28:10,155][100936] Updated weights for policy 0, policy_version 10950 (0.0009) +[2023-10-14 05:28:10,531][100936] Updated weights for policy 0, policy_version 10960 (0.0009) +[2023-10-14 05:28:10,894][100936] Updated weights for policy 0, policy_version 10970 (0.0010) +[2023-10-14 05:28:13,415][100917] Updated weights for policy 1, policy_version 10952 (0.0009) +[2023-10-14 05:28:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 22446080. Throughput: 0: 1657.2, 1: 1657.4. Samples: 5618694. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:28:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:28:13,785][100917] Updated weights for policy 1, policy_version 10962 (0.0011) +[2023-10-14 05:28:14,166][100917] Updated weights for policy 1, policy_version 10972 (0.0010) +[2023-10-14 05:28:15,087][100936] Updated weights for policy 0, policy_version 10980 (0.0009) +[2023-10-14 05:28:15,452][100936] Updated weights for policy 0, policy_version 10990 (0.0007) +[2023-10-14 05:28:15,827][100936] Updated weights for policy 0, policy_version 11000 (0.0007) +[2023-10-14 05:28:18,396][100917] Updated weights for policy 1, policy_version 10982 (0.0009) +[2023-10-14 05:28:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 22511616. Throughput: 0: 1666.4, 1: 1653.8. Samples: 5638830. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:28:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:28:18,774][100917] Updated weights for policy 1, policy_version 10992 (0.0009) +[2023-10-14 05:28:19,143][100917] Updated weights for policy 1, policy_version 11002 (0.0007) +[2023-10-14 05:28:20,114][100936] Updated weights for policy 0, policy_version 11010 (0.0008) +[2023-10-14 05:28:20,531][100936] Updated weights for policy 0, policy_version 11020 (0.0008) +[2023-10-14 05:28:20,890][100936] Updated weights for policy 0, policy_version 11030 (0.0011) +[2023-10-14 05:28:21,262][100936] Updated weights for policy 0, policy_version 11040 (0.0009) +[2023-10-14 05:28:23,169][100917] Updated weights for policy 1, policy_version 11012 (0.0008) +[2023-10-14 05:28:23,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 22577152. Throughput: 0: 1663.5, 1: 1657.6. Samples: 5659278. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:28:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:28:23,540][100917] Updated weights for policy 1, policy_version 11022 (0.0010) +[2023-10-14 05:28:23,922][100917] Updated weights for policy 1, policy_version 11032 (0.0009) +[2023-10-14 05:28:25,413][100936] Updated weights for policy 0, policy_version 11050 (0.0007) +[2023-10-14 05:28:25,785][100936] Updated weights for policy 0, policy_version 11060 (0.0007) +[2023-10-14 05:28:26,155][100936] Updated weights for policy 0, policy_version 11070 (0.0008) +[2023-10-14 05:28:28,003][100917] Updated weights for policy 1, policy_version 11042 (0.0009) +[2023-10-14 05:28:28,383][100917] Updated weights for policy 1, policy_version 11052 (0.0009) +[2023-10-14 05:28:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 22642688. Throughput: 0: 1647.6, 1: 1662.0. Samples: 5668330. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:28:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:28:28,766][100917] Updated weights for policy 1, policy_version 11062 (0.0011) +[2023-10-14 05:28:29,140][100917] Updated weights for policy 1, policy_version 11072 (0.0008) +[2023-10-14 05:28:30,283][100936] Updated weights for policy 0, policy_version 11080 (0.0009) +[2023-10-14 05:28:30,646][100936] Updated weights for policy 0, policy_version 11090 (0.0009) +[2023-10-14 05:28:31,022][100936] Updated weights for policy 0, policy_version 11100 (0.0008) +[2023-10-14 05:28:33,151][100917] Updated weights for policy 1, policy_version 11082 (0.0007) +[2023-10-14 05:28:33,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 22708224. Throughput: 0: 1661.3, 1: 1664.3. Samples: 5688902. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:28:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.950')] +[2023-10-14 05:28:33,523][100917] Updated weights for policy 1, policy_version 11092 (0.0008) +[2023-10-14 05:28:33,908][100917] Updated weights for policy 1, policy_version 11102 (0.0009) +[2023-10-14 05:28:35,266][100936] Updated weights for policy 0, policy_version 11110 (0.0010) +[2023-10-14 05:28:35,631][100936] Updated weights for policy 0, policy_version 11120 (0.0008) +[2023-10-14 05:28:36,006][100936] Updated weights for policy 0, policy_version 11130 (0.0009) +[2023-10-14 05:28:37,948][100917] Updated weights for policy 1, policy_version 11112 (0.0008) +[2023-10-14 05:28:38,311][100917] Updated weights for policy 1, policy_version 11122 (0.0008) +[2023-10-14 05:28:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 22773760. Throughput: 0: 1651.2, 1: 1660.2. Samples: 5709008. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:28:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:28:38,685][100917] Updated weights for policy 1, policy_version 11132 (0.0009) +[2023-10-14 05:28:40,262][100936] Updated weights for policy 0, policy_version 11140 (0.0008) +[2023-10-14 05:28:40,640][100936] Updated weights for policy 0, policy_version 11150 (0.0007) +[2023-10-14 05:28:41,011][100936] Updated weights for policy 0, policy_version 11160 (0.0007) +[2023-10-14 05:28:42,928][100917] Updated weights for policy 1, policy_version 11142 (0.0008) +[2023-10-14 05:28:43,315][100917] Updated weights for policy 1, policy_version 11152 (0.0007) +[2023-10-14 05:28:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 22839296. Throughput: 0: 1647.3, 1: 1669.0. Samples: 5718266. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:28:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:28:43,684][100917] Updated weights for policy 1, policy_version 11162 (0.0007) +[2023-10-14 05:28:44,972][100936] Updated weights for policy 0, policy_version 11170 (0.0009) +[2023-10-14 05:28:45,351][100936] Updated weights for policy 0, policy_version 11180 (0.0008) +[2023-10-14 05:28:45,723][100936] Updated weights for policy 0, policy_version 11190 (0.0008) +[2023-10-14 05:28:46,097][100936] Updated weights for policy 0, policy_version 11200 (0.0008) +[2023-10-14 05:28:47,883][100917] Updated weights for policy 1, policy_version 11172 (0.0008) +[2023-10-14 05:28:48,276][100917] Updated weights for policy 1, policy_version 11182 (0.0007) +[2023-10-14 05:28:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 22904832. Throughput: 0: 1653.5, 1: 1665.2. Samples: 5738776. Policy #0 lag: (min: 10.0, avg: 15.9, max: 42.0) +[2023-10-14 05:28:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:28:48,650][100917] Updated weights for policy 1, policy_version 11192 (0.0009) +[2023-10-14 05:28:50,234][100936] Updated weights for policy 0, policy_version 11210 (0.0008) +[2023-10-14 05:28:50,612][100936] Updated weights for policy 0, policy_version 11220 (0.0009) +[2023-10-14 05:28:50,986][100936] Updated weights for policy 0, policy_version 11230 (0.0007) +[2023-10-14 05:28:52,819][100917] Updated weights for policy 1, policy_version 11202 (0.0008) +[2023-10-14 05:28:53,194][100917] Updated weights for policy 1, policy_version 11212 (0.0009) +[2023-10-14 05:28:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 22970368. Throughput: 0: 1655.7, 1: 1652.8. Samples: 5758698. Policy #0 lag: (min: 10.0, avg: 15.9, max: 42.0) +[2023-10-14 05:28:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:28:53,572][100917] Updated weights for policy 1, policy_version 11222 (0.0008) +[2023-10-14 05:28:53,939][100917] Updated weights for policy 1, policy_version 11232 (0.0008) +[2023-10-14 05:28:54,941][100936] Updated weights for policy 0, policy_version 11240 (0.0008) +[2023-10-14 05:28:55,319][100936] Updated weights for policy 0, policy_version 11250 (0.0008) +[2023-10-14 05:28:55,691][100936] Updated weights for policy 0, policy_version 11260 (0.0008) +[2023-10-14 05:28:58,086][100917] Updated weights for policy 1, policy_version 11242 (0.0011) +[2023-10-14 05:28:58,462][100917] Updated weights for policy 1, policy_version 11252 (0.0008) +[2023-10-14 05:28:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 23035904. Throughput: 0: 1656.1, 1: 1660.8. Samples: 5767956. Policy #0 lag: (min: 28.0, avg: 36.0, max: 60.0) +[2023-10-14 05:28:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:28:58,838][100917] Updated weights for policy 1, policy_version 11262 (0.0009) +[2023-10-14 05:28:59,727][100936] Updated weights for policy 0, policy_version 11270 (0.0010) +[2023-10-14 05:29:00,103][100936] Updated weights for policy 0, policy_version 11280 (0.0009) +[2023-10-14 05:29:00,476][100936] Updated weights for policy 0, policy_version 11290 (0.0008) +[2023-10-14 05:29:03,026][100917] Updated weights for policy 1, policy_version 11272 (0.0009) +[2023-10-14 05:29:03,386][100917] Updated weights for policy 1, policy_version 11282 (0.0009) +[2023-10-14 05:29:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 23101440. Throughput: 0: 1656.4, 1: 1659.5. Samples: 5788044. Policy #0 lag: (min: 28.0, avg: 36.0, max: 60.0) +[2023-10-14 05:29:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:29:03,765][100917] Updated weights for policy 1, policy_version 11292 (0.0009) +[2023-10-14 05:29:04,705][100936] Updated weights for policy 0, policy_version 11300 (0.0008) +[2023-10-14 05:29:05,075][100936] Updated weights for policy 0, policy_version 11310 (0.0011) +[2023-10-14 05:29:05,451][100936] Updated weights for policy 0, policy_version 11320 (0.0008) +[2023-10-14 05:29:07,853][100917] Updated weights for policy 1, policy_version 11302 (0.0008) +[2023-10-14 05:29:08,232][100917] Updated weights for policy 1, policy_version 11312 (0.0007) +[2023-10-14 05:29:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 23166976. Throughput: 0: 1657.6, 1: 1648.1. Samples: 5808030. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:29:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:29:08,608][100917] Updated weights for policy 1, policy_version 11322 (0.0008) +[2023-10-14 05:29:09,594][100936] Updated weights for policy 0, policy_version 11330 (0.0007) +[2023-10-14 05:29:09,993][100936] Updated weights for policy 0, policy_version 11340 (0.0008) +[2023-10-14 05:29:10,363][100936] Updated weights for policy 0, policy_version 11350 (0.0009) +[2023-10-14 05:29:10,729][100936] Updated weights for policy 0, policy_version 11360 (0.0007) +[2023-10-14 05:29:12,529][100917] Updated weights for policy 1, policy_version 11332 (0.0007) +[2023-10-14 05:29:12,901][100917] Updated weights for policy 1, policy_version 11342 (0.0012) +[2023-10-14 05:29:13,275][100917] Updated weights for policy 1, policy_version 11352 (0.0010) +[2023-10-14 05:29:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 23232512. Throughput: 0: 1656.9, 1: 1655.3. Samples: 5817382. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:29:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:29:14,854][100936] Updated weights for policy 0, policy_version 11370 (0.0009) +[2023-10-14 05:29:15,222][100936] Updated weights for policy 0, policy_version 11380 (0.0007) +[2023-10-14 05:29:15,599][100936] Updated weights for policy 0, policy_version 11390 (0.0008) +[2023-10-14 05:29:17,530][100917] Updated weights for policy 1, policy_version 11362 (0.0011) +[2023-10-14 05:29:17,904][100917] Updated weights for policy 1, policy_version 11372 (0.0007) +[2023-10-14 05:29:18,291][100917] Updated weights for policy 1, policy_version 11382 (0.0008) +[2023-10-14 05:29:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 23298048. Throughput: 0: 1655.7, 1: 1645.1. Samples: 5837440. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:29:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:29:18,663][100917] Updated weights for policy 1, policy_version 11392 (0.0010) +[2023-10-14 05:29:19,843][100936] Updated weights for policy 0, policy_version 11400 (0.0007) +[2023-10-14 05:29:20,208][100936] Updated weights for policy 0, policy_version 11410 (0.0008) +[2023-10-14 05:29:20,583][100936] Updated weights for policy 0, policy_version 11420 (0.0008) +[2023-10-14 05:29:22,803][100917] Updated weights for policy 1, policy_version 11402 (0.0011) +[2023-10-14 05:29:23,184][100917] Updated weights for policy 1, policy_version 11412 (0.0007) +[2023-10-14 05:29:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 23363584. Throughput: 0: 1655.8, 1: 1637.1. Samples: 5857186. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:29:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:29:23,554][100917] Updated weights for policy 1, policy_version 11422 (0.0009) +[2023-10-14 05:29:24,852][100936] Updated weights for policy 0, policy_version 11430 (0.0009) +[2023-10-14 05:29:25,222][100936] Updated weights for policy 0, policy_version 11440 (0.0010) +[2023-10-14 05:29:25,599][100936] Updated weights for policy 0, policy_version 11450 (0.0007) +[2023-10-14 05:29:27,818][100917] Updated weights for policy 1, policy_version 11432 (0.0007) +[2023-10-14 05:29:28,195][100917] Updated weights for policy 1, policy_version 11442 (0.0007) +[2023-10-14 05:29:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 23429120. Throughput: 0: 1654.8, 1: 1641.1. Samples: 5866580. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:29:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:29:28,583][100917] Updated weights for policy 1, policy_version 11452 (0.0007) +[2023-10-14 05:29:29,698][100936] Updated weights for policy 0, policy_version 11460 (0.0008) +[2023-10-14 05:29:30,065][100936] Updated weights for policy 0, policy_version 11470 (0.0007) +[2023-10-14 05:29:30,441][100936] Updated weights for policy 0, policy_version 11480 (0.0007) +[2023-10-14 05:29:32,852][100917] Updated weights for policy 1, policy_version 11462 (0.0009) +[2023-10-14 05:29:33,223][100917] Updated weights for policy 1, policy_version 11472 (0.0010) +[2023-10-14 05:29:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 23494656. Throughput: 0: 1647.3, 1: 1641.7. Samples: 5886780. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:29:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:29:33,592][100917] Updated weights for policy 1, policy_version 11482 (0.0009) +[2023-10-14 05:29:34,539][100936] Updated weights for policy 0, policy_version 11490 (0.0007) +[2023-10-14 05:29:34,908][100936] Updated weights for policy 0, policy_version 11500 (0.0007) +[2023-10-14 05:29:35,286][100936] Updated weights for policy 0, policy_version 11510 (0.0011) +[2023-10-14 05:29:35,653][100936] Updated weights for policy 0, policy_version 11520 (0.0007) +[2023-10-14 05:29:37,697][100917] Updated weights for policy 1, policy_version 11492 (0.0008) +[2023-10-14 05:29:38,066][100917] Updated weights for policy 1, policy_version 11502 (0.0007) +[2023-10-14 05:29:38,450][100917] Updated weights for policy 1, policy_version 11512 (0.0008) +[2023-10-14 05:29:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 23560192. Throughput: 0: 1644.8, 1: 1642.0. Samples: 5906602. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-10-14 05:29:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:29:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000011520_11796480.pth... +[2023-10-14 05:29:38,562][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000009984_10223616.pth +[2023-10-14 05:29:38,738][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000011520_11796480.pth... +[2023-10-14 05:29:38,767][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000009952_10190848.pth +[2023-10-14 05:29:39,786][100936] Updated weights for policy 0, policy_version 11530 (0.0008) +[2023-10-14 05:29:40,151][100936] Updated weights for policy 0, policy_version 11540 (0.0008) +[2023-10-14 05:29:40,525][100936] Updated weights for policy 0, policy_version 11550 (0.0007) +[2023-10-14 05:29:42,371][100917] Updated weights for policy 1, policy_version 11522 (0.0008) +[2023-10-14 05:29:42,748][100917] Updated weights for policy 1, policy_version 11532 (0.0009) +[2023-10-14 05:29:43,120][100917] Updated weights for policy 1, policy_version 11542 (0.0008) +[2023-10-14 05:29:43,498][100917] Updated weights for policy 1, policy_version 11552 (0.0009) +[2023-10-14 05:29:43,512][99942] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 23658496. Throughput: 0: 1644.0, 1: 1647.7. Samples: 5916084. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-10-14 05:29:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:29:44,687][100936] Updated weights for policy 0, policy_version 11560 (0.0010) +[2023-10-14 05:29:45,073][100936] Updated weights for policy 0, policy_version 11570 (0.0008) +[2023-10-14 05:29:45,432][100936] Updated weights for policy 0, policy_version 11580 (0.0010) +[2023-10-14 05:29:47,597][100917] Updated weights for policy 1, policy_version 11562 (0.0009) +[2023-10-14 05:29:47,978][100917] Updated weights for policy 1, policy_version 11572 (0.0008) +[2023-10-14 05:29:48,343][100917] Updated weights for policy 1, policy_version 11582 (0.0009) +[2023-10-14 05:29:48,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 23724032. Throughput: 0: 1647.8, 1: 1655.4. Samples: 5936688. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:29:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:29:49,545][100936] Updated weights for policy 0, policy_version 11590 (0.0007) +[2023-10-14 05:29:49,912][100936] Updated weights for policy 0, policy_version 11600 (0.0008) +[2023-10-14 05:29:50,281][100936] Updated weights for policy 0, policy_version 11610 (0.0007) +[2023-10-14 05:29:52,435][100917] Updated weights for policy 1, policy_version 11592 (0.0011) +[2023-10-14 05:29:52,794][100917] Updated weights for policy 1, policy_version 11602 (0.0010) +[2023-10-14 05:29:53,163][100917] Updated weights for policy 1, policy_version 11612 (0.0011) +[2023-10-14 05:29:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 23789568. Throughput: 0: 1649.5, 1: 1643.4. Samples: 5956208. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:29:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:29:54,458][100936] Updated weights for policy 0, policy_version 11620 (0.0009) +[2023-10-14 05:29:54,855][100936] Updated weights for policy 0, policy_version 11630 (0.0010) +[2023-10-14 05:29:55,232][100936] Updated weights for policy 0, policy_version 11640 (0.0009) +[2023-10-14 05:29:57,308][100917] Updated weights for policy 1, policy_version 11622 (0.0010) +[2023-10-14 05:29:57,671][100917] Updated weights for policy 1, policy_version 11632 (0.0009) +[2023-10-14 05:29:58,052][100917] Updated weights for policy 1, policy_version 11642 (0.0009) +[2023-10-14 05:29:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 23855104. Throughput: 0: 1644.5, 1: 1652.5. Samples: 5965748. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:29:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:29:59,373][100936] Updated weights for policy 0, policy_version 11650 (0.0010) +[2023-10-14 05:29:59,745][100936] Updated weights for policy 0, policy_version 11660 (0.0009) +[2023-10-14 05:30:00,123][100936] Updated weights for policy 0, policy_version 11670 (0.0009) +[2023-10-14 05:30:00,490][100936] Updated weights for policy 0, policy_version 11680 (0.0010) +[2023-10-14 05:30:02,050][100917] Updated weights for policy 1, policy_version 11652 (0.0009) +[2023-10-14 05:30:02,424][100917] Updated weights for policy 1, policy_version 11662 (0.0010) +[2023-10-14 05:30:02,804][100917] Updated weights for policy 1, policy_version 11672 (0.0010) +[2023-10-14 05:30:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 23920640. Throughput: 0: 1647.2, 1: 1656.7. Samples: 5986114. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:30:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:30:04,790][100936] Updated weights for policy 0, policy_version 11690 (0.0009) +[2023-10-14 05:30:05,159][100936] Updated weights for policy 0, policy_version 11700 (0.0007) +[2023-10-14 05:30:05,528][100936] Updated weights for policy 0, policy_version 11710 (0.0008) +[2023-10-14 05:30:07,128][100917] Updated weights for policy 1, policy_version 11682 (0.0008) +[2023-10-14 05:30:07,507][100917] Updated weights for policy 1, policy_version 11692 (0.0009) +[2023-10-14 05:30:07,881][100917] Updated weights for policy 1, policy_version 11702 (0.0009) +[2023-10-14 05:30:08,254][100917] Updated weights for policy 1, policy_version 11712 (0.0008) +[2023-10-14 05:30:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 23986176. Throughput: 0: 1651.7, 1: 1646.0. Samples: 6005586. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:30:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:30:09,717][100936] Updated weights for policy 0, policy_version 11720 (0.0007) +[2023-10-14 05:30:10,089][100936] Updated weights for policy 0, policy_version 11730 (0.0009) +[2023-10-14 05:30:10,455][100936] Updated weights for policy 0, policy_version 11740 (0.0008) +[2023-10-14 05:30:12,305][100917] Updated weights for policy 1, policy_version 11722 (0.0009) +[2023-10-14 05:30:12,679][100917] Updated weights for policy 1, policy_version 11732 (0.0009) +[2023-10-14 05:30:13,044][100917] Updated weights for policy 1, policy_version 11742 (0.0009) +[2023-10-14 05:30:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 24051712. Throughput: 0: 1652.0, 1: 1656.1. Samples: 6015446. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 05:30:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:30:14,462][100936] Updated weights for policy 0, policy_version 11750 (0.0009) +[2023-10-14 05:30:14,831][100936] Updated weights for policy 0, policy_version 11760 (0.0009) +[2023-10-14 05:30:15,199][100936] Updated weights for policy 0, policy_version 11770 (0.0008) +[2023-10-14 05:30:17,269][100917] Updated weights for policy 1, policy_version 11752 (0.0010) +[2023-10-14 05:30:17,655][100917] Updated weights for policy 1, policy_version 11762 (0.0008) +[2023-10-14 05:30:18,029][100917] Updated weights for policy 1, policy_version 11772 (0.0010) +[2023-10-14 05:30:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 24117248. Throughput: 0: 1654.8, 1: 1659.7. Samples: 6035932. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 05:30:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:30:19,286][100936] Updated weights for policy 0, policy_version 11780 (0.0008) +[2023-10-14 05:30:19,654][100936] Updated weights for policy 0, policy_version 11790 (0.0007) +[2023-10-14 05:30:20,031][100936] Updated weights for policy 0, policy_version 11800 (0.0008) +[2023-10-14 05:30:22,345][100917] Updated weights for policy 1, policy_version 11782 (0.0008) +[2023-10-14 05:30:22,732][100917] Updated weights for policy 1, policy_version 11792 (0.0008) +[2023-10-14 05:30:23,111][100917] Updated weights for policy 1, policy_version 11802 (0.0009) +[2023-10-14 05:30:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 24182784. Throughput: 0: 1655.4, 1: 1646.2. Samples: 6055174. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 05:30:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:30:24,101][100936] Updated weights for policy 0, policy_version 11810 (0.0007) +[2023-10-14 05:30:24,478][100936] Updated weights for policy 0, policy_version 11820 (0.0010) +[2023-10-14 05:30:24,853][100936] Updated weights for policy 0, policy_version 11830 (0.0010) +[2023-10-14 05:30:25,216][100936] Updated weights for policy 0, policy_version 11840 (0.0008) +[2023-10-14 05:30:27,261][100917] Updated weights for policy 1, policy_version 11812 (0.0008) +[2023-10-14 05:30:27,640][100917] Updated weights for policy 1, policy_version 11822 (0.0009) +[2023-10-14 05:30:28,010][100917] Updated weights for policy 1, policy_version 11832 (0.0009) +[2023-10-14 05:30:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 24248320. Throughput: 0: 1654.4, 1: 1651.9. Samples: 6064866. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:30:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:30:29,341][100936] Updated weights for policy 0, policy_version 11850 (0.0010) +[2023-10-14 05:30:29,710][100936] Updated weights for policy 0, policy_version 11860 (0.0009) +[2023-10-14 05:30:30,080][100936] Updated weights for policy 0, policy_version 11870 (0.0008) +[2023-10-14 05:30:32,127][100917] Updated weights for policy 1, policy_version 11842 (0.0009) +[2023-10-14 05:30:32,503][100917] Updated weights for policy 1, policy_version 11852 (0.0009) +[2023-10-14 05:30:32,872][100917] Updated weights for policy 1, policy_version 11862 (0.0008) +[2023-10-14 05:30:33,251][100917] Updated weights for policy 1, policy_version 11872 (0.0009) +[2023-10-14 05:30:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 24313856. Throughput: 0: 1657.6, 1: 1646.6. Samples: 6085378. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:30:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:30:34,123][100936] Updated weights for policy 0, policy_version 11880 (0.0009) +[2023-10-14 05:30:34,493][100936] Updated weights for policy 0, policy_version 11890 (0.0010) +[2023-10-14 05:30:34,866][100936] Updated weights for policy 0, policy_version 11900 (0.0010) +[2023-10-14 05:30:37,531][100917] Updated weights for policy 1, policy_version 11882 (0.0008) +[2023-10-14 05:30:37,916][100917] Updated weights for policy 1, policy_version 11892 (0.0007) +[2023-10-14 05:30:38,286][100917] Updated weights for policy 1, policy_version 11902 (0.0007) +[2023-10-14 05:30:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 24379392. Throughput: 0: 1661.1, 1: 1646.1. Samples: 6105030. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:30:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:30:38,969][100936] Updated weights for policy 0, policy_version 11910 (0.0009) +[2023-10-14 05:30:39,330][100936] Updated weights for policy 0, policy_version 11920 (0.0008) +[2023-10-14 05:30:39,700][100936] Updated weights for policy 0, policy_version 11930 (0.0008) +[2023-10-14 05:30:42,304][100917] Updated weights for policy 1, policy_version 11912 (0.0007) +[2023-10-14 05:30:42,668][100917] Updated weights for policy 1, policy_version 11922 (0.0009) +[2023-10-14 05:30:43,044][100917] Updated weights for policy 1, policy_version 11932 (0.0008) +[2023-10-14 05:30:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 24444928. Throughput: 0: 1665.8, 1: 1646.8. Samples: 6114814. Policy #0 lag: (min: 1.0, avg: 13.1, max: 33.0) +[2023-10-14 05:30:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:30:43,816][100936] Updated weights for policy 0, policy_version 11940 (0.0008) +[2023-10-14 05:30:44,193][100936] Updated weights for policy 0, policy_version 11950 (0.0009) +[2023-10-14 05:30:44,559][100936] Updated weights for policy 0, policy_version 11960 (0.0008) +[2023-10-14 05:30:47,220][100917] Updated weights for policy 1, policy_version 11942 (0.0008) +[2023-10-14 05:30:47,593][100917] Updated weights for policy 1, policy_version 11952 (0.0009) +[2023-10-14 05:30:47,956][100917] Updated weights for policy 1, policy_version 11962 (0.0008) +[2023-10-14 05:30:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 24510464. Throughput: 0: 1671.0, 1: 1645.6. Samples: 6135360. Policy #0 lag: (min: 1.0, avg: 13.1, max: 33.0) +[2023-10-14 05:30:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:30:48,795][100936] Updated weights for policy 0, policy_version 11970 (0.0009) +[2023-10-14 05:30:49,163][100936] Updated weights for policy 0, policy_version 11980 (0.0008) +[2023-10-14 05:30:49,526][100936] Updated weights for policy 0, policy_version 11990 (0.0007) +[2023-10-14 05:30:49,893][100936] Updated weights for policy 0, policy_version 12000 (0.0008) +[2023-10-14 05:30:52,045][100917] Updated weights for policy 1, policy_version 11972 (0.0009) +[2023-10-14 05:30:52,415][100917] Updated weights for policy 1, policy_version 11982 (0.0007) +[2023-10-14 05:30:52,792][100917] Updated weights for policy 1, policy_version 11992 (0.0007) +[2023-10-14 05:30:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 24576000. Throughput: 0: 1660.5, 1: 1648.9. Samples: 6154510. Policy #0 lag: (min: 31.0, avg: 35.1, max: 63.0) +[2023-10-14 05:30:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.960')] +[2023-10-14 05:30:54,162][100936] Updated weights for policy 0, policy_version 12010 (0.0009) +[2023-10-14 05:30:54,543][100936] Updated weights for policy 0, policy_version 12020 (0.0008) +[2023-10-14 05:30:54,913][100936] Updated weights for policy 0, policy_version 12030 (0.0010) +[2023-10-14 05:30:56,882][100917] Updated weights for policy 1, policy_version 12002 (0.0007) +[2023-10-14 05:30:57,256][100917] Updated weights for policy 1, policy_version 12012 (0.0007) +[2023-10-14 05:30:57,634][100917] Updated weights for policy 1, policy_version 12022 (0.0008) +[2023-10-14 05:30:58,013][100917] Updated weights for policy 1, policy_version 12032 (0.0012) +[2023-10-14 05:30:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 24641536. Throughput: 0: 1659.6, 1: 1655.4. Samples: 6164622. Policy #0 lag: (min: 31.0, avg: 35.1, max: 63.0) +[2023-10-14 05:30:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.960')] +[2023-10-14 05:30:59,214][100936] Updated weights for policy 0, policy_version 12040 (0.0008) +[2023-10-14 05:30:59,588][100936] Updated weights for policy 0, policy_version 12050 (0.0008) +[2023-10-14 05:30:59,960][100936] Updated weights for policy 0, policy_version 12060 (0.0009) +[2023-10-14 05:31:02,122][100917] Updated weights for policy 1, policy_version 12042 (0.0009) +[2023-10-14 05:31:02,491][100917] Updated weights for policy 1, policy_version 12052 (0.0008) +[2023-10-14 05:31:02,867][100917] Updated weights for policy 1, policy_version 12062 (0.0009) +[2023-10-14 05:31:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 24707072. Throughput: 0: 1658.1, 1: 1648.9. Samples: 6184748. Policy #0 lag: (min: 13.0, avg: 14.4, max: 39.0) +[2023-10-14 05:31:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.960')] +[2023-10-14 05:31:04,213][100936] Updated weights for policy 0, policy_version 12070 (0.0008) +[2023-10-14 05:31:04,584][100936] Updated weights for policy 0, policy_version 12080 (0.0008) +[2023-10-14 05:31:04,953][100936] Updated weights for policy 0, policy_version 12090 (0.0007) +[2023-10-14 05:31:07,020][100917] Updated weights for policy 1, policy_version 12072 (0.0008) +[2023-10-14 05:31:07,390][100917] Updated weights for policy 1, policy_version 12082 (0.0008) +[2023-10-14 05:31:07,750][100917] Updated weights for policy 1, policy_version 12092 (0.0009) +[2023-10-14 05:31:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 24772608. Throughput: 0: 1655.2, 1: 1651.4. Samples: 6203972. Policy #0 lag: (min: 13.0, avg: 14.4, max: 39.0) +[2023-10-14 05:31:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.960')] +[2023-10-14 05:31:08,897][100936] Updated weights for policy 0, policy_version 12100 (0.0007) +[2023-10-14 05:31:09,266][100936] Updated weights for policy 0, policy_version 12110 (0.0009) +[2023-10-14 05:31:09,638][100936] Updated weights for policy 0, policy_version 12120 (0.0009) +[2023-10-14 05:31:11,810][100917] Updated weights for policy 1, policy_version 12102 (0.0008) +[2023-10-14 05:31:12,184][100917] Updated weights for policy 1, policy_version 12112 (0.0007) +[2023-10-14 05:31:12,553][100917] Updated weights for policy 1, policy_version 12122 (0.0007) +[2023-10-14 05:31:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 24838144. Throughput: 0: 1657.7, 1: 1662.7. Samples: 6214280. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:31:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.960')] +[2023-10-14 05:31:13,654][100936] Updated weights for policy 0, policy_version 12130 (0.0008) +[2023-10-14 05:31:14,026][100936] Updated weights for policy 0, policy_version 12140 (0.0010) +[2023-10-14 05:31:14,393][100936] Updated weights for policy 0, policy_version 12150 (0.0010) +[2023-10-14 05:31:14,754][100936] Updated weights for policy 0, policy_version 12160 (0.0011) +[2023-10-14 05:31:16,743][100917] Updated weights for policy 1, policy_version 12132 (0.0007) +[2023-10-14 05:31:17,122][100917] Updated weights for policy 1, policy_version 12142 (0.0007) +[2023-10-14 05:31:17,493][100917] Updated weights for policy 1, policy_version 12152 (0.0008) +[2023-10-14 05:31:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 24903680. Throughput: 0: 1657.6, 1: 1651.9. Samples: 6234302. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:31:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.960')] +[2023-10-14 05:31:18,786][100936] Updated weights for policy 0, policy_version 12170 (0.0007) +[2023-10-14 05:31:19,154][100936] Updated weights for policy 0, policy_version 12180 (0.0010) +[2023-10-14 05:31:19,519][100936] Updated weights for policy 0, policy_version 12190 (0.0009) +[2023-10-14 05:31:21,751][100917] Updated weights for policy 1, policy_version 12162 (0.0008) +[2023-10-14 05:31:22,126][100917] Updated weights for policy 1, policy_version 12172 (0.0009) +[2023-10-14 05:31:22,498][100917] Updated weights for policy 1, policy_version 12182 (0.0010) +[2023-10-14 05:31:22,878][100917] Updated weights for policy 1, policy_version 12192 (0.0008) +[2023-10-14 05:31:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 24969216. Throughput: 0: 1655.6, 1: 1650.1. Samples: 6253782. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:31:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 05:31:23,667][100936] Updated weights for policy 0, policy_version 12200 (0.0010) +[2023-10-14 05:31:24,050][100936] Updated weights for policy 0, policy_version 12210 (0.0010) +[2023-10-14 05:31:24,429][100936] Updated weights for policy 0, policy_version 12220 (0.0008) +[2023-10-14 05:31:26,896][100917] Updated weights for policy 1, policy_version 12202 (0.0011) +[2023-10-14 05:31:27,260][100917] Updated weights for policy 1, policy_version 12212 (0.0009) +[2023-10-14 05:31:27,630][100917] Updated weights for policy 1, policy_version 12222 (0.0009) +[2023-10-14 05:31:28,479][100936] Updated weights for policy 0, policy_version 12230 (0.0009) +[2023-10-14 05:31:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 25034752. Throughput: 0: 1654.9, 1: 1657.6. Samples: 6263876. Policy #0 lag: (min: 5.0, avg: 15.1, max: 37.0) +[2023-10-14 05:31:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:31:28,872][100936] Updated weights for policy 0, policy_version 12240 (0.0008) +[2023-10-14 05:31:29,241][100936] Updated weights for policy 0, policy_version 12250 (0.0008) +[2023-10-14 05:31:31,428][100917] Updated weights for policy 1, policy_version 12232 (0.0010) +[2023-10-14 05:31:31,797][100917] Updated weights for policy 1, policy_version 12242 (0.0009) +[2023-10-14 05:31:32,171][100917] Updated weights for policy 1, policy_version 12252 (0.0010) +[2023-10-14 05:31:33,500][100936] Updated weights for policy 0, policy_version 12260 (0.0008) +[2023-10-14 05:31:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 25100288. Throughput: 0: 1654.4, 1: 1643.3. Samples: 6283758. Policy #0 lag: (min: 5.0, avg: 15.1, max: 37.0) +[2023-10-14 05:31:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:31:33,864][100936] Updated weights for policy 0, policy_version 12270 (0.0007) +[2023-10-14 05:31:34,235][100936] Updated weights for policy 0, policy_version 12280 (0.0007) +[2023-10-14 05:31:36,370][100917] Updated weights for policy 1, policy_version 12262 (0.0010) +[2023-10-14 05:31:36,752][100917] Updated weights for policy 1, policy_version 12272 (0.0011) +[2023-10-14 05:31:37,123][100917] Updated weights for policy 1, policy_version 12282 (0.0010) +[2023-10-14 05:31:38,359][100936] Updated weights for policy 0, policy_version 12290 (0.0008) +[2023-10-14 05:31:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 25165824. Throughput: 0: 1658.7, 1: 1658.4. Samples: 6303780. Policy #0 lag: (min: 31.0, avg: 34.6, max: 63.0) +[2023-10-14 05:31:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:31:38,529][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000012288_12582912.pth... +[2023-10-14 05:31:38,563][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000010752_11010048.pth +[2023-10-14 05:31:38,733][100936] Updated weights for policy 0, policy_version 12300 (0.0007) +[2023-10-14 05:31:39,100][100936] Updated weights for policy 0, policy_version 12310 (0.0007) +[2023-10-14 05:31:39,461][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000012320_12615680.pth... +[2023-10-14 05:31:39,461][100936] Updated weights for policy 0, policy_version 12320 (0.0008) +[2023-10-14 05:31:39,500][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000010752_11010048.pth +[2023-10-14 05:31:41,238][100917] Updated weights for policy 1, policy_version 12292 (0.0010) +[2023-10-14 05:31:41,616][100917] Updated weights for policy 1, policy_version 12302 (0.0009) +[2023-10-14 05:31:41,986][100917] Updated weights for policy 1, policy_version 12312 (0.0007) +[2023-10-14 05:31:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 25231360. Throughput: 0: 1660.3, 1: 1660.5. Samples: 6314058. Policy #0 lag: (min: 31.0, avg: 34.6, max: 63.0) +[2023-10-14 05:31:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:31:43,660][100936] Updated weights for policy 0, policy_version 12330 (0.0007) +[2023-10-14 05:31:44,035][100936] Updated weights for policy 0, policy_version 12340 (0.0011) +[2023-10-14 05:31:44,405][100936] Updated weights for policy 0, policy_version 12350 (0.0007) +[2023-10-14 05:31:46,167][100917] Updated weights for policy 1, policy_version 12322 (0.0009) +[2023-10-14 05:31:46,540][100917] Updated weights for policy 1, policy_version 12332 (0.0009) +[2023-10-14 05:31:46,915][100917] Updated weights for policy 1, policy_version 12342 (0.0009) +[2023-10-14 05:31:47,286][100917] Updated weights for policy 1, policy_version 12352 (0.0008) +[2023-10-14 05:31:48,488][100936] Updated weights for policy 0, policy_version 12360 (0.0010) +[2023-10-14 05:31:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 25296896. Throughput: 0: 1667.8, 1: 1646.8. Samples: 6333906. Policy #0 lag: (min: 31.0, avg: 34.6, max: 63.0) +[2023-10-14 05:31:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:31:48,862][100936] Updated weights for policy 0, policy_version 12370 (0.0009) +[2023-10-14 05:31:49,236][100936] Updated weights for policy 0, policy_version 12380 (0.0007) +[2023-10-14 05:31:51,145][100917] Updated weights for policy 1, policy_version 12362 (0.0009) +[2023-10-14 05:31:51,514][100917] Updated weights for policy 1, policy_version 12372 (0.0009) +[2023-10-14 05:31:51,898][100917] Updated weights for policy 1, policy_version 12382 (0.0007) +[2023-10-14 05:31:53,205][100936] Updated weights for policy 0, policy_version 12390 (0.0008) +[2023-10-14 05:31:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 25362432. Throughput: 0: 1662.0, 1: 1666.7. Samples: 6353760. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 05:31:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:31:53,575][100936] Updated weights for policy 0, policy_version 12400 (0.0008) +[2023-10-14 05:31:53,955][100936] Updated weights for policy 0, policy_version 12410 (0.0007) +[2023-10-14 05:31:56,072][100917] Updated weights for policy 1, policy_version 12392 (0.0009) +[2023-10-14 05:31:56,443][100917] Updated weights for policy 1, policy_version 12402 (0.0009) +[2023-10-14 05:31:56,812][100917] Updated weights for policy 1, policy_version 12412 (0.0011) +[2023-10-14 05:31:57,962][100936] Updated weights for policy 0, policy_version 12420 (0.0008) +[2023-10-14 05:31:58,329][100936] Updated weights for policy 0, policy_version 12430 (0.0008) +[2023-10-14 05:31:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 25427968. Throughput: 0: 1672.5, 1: 1660.5. Samples: 6364266. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 05:31:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:31:58,700][100936] Updated weights for policy 0, policy_version 12440 (0.0008) +[2023-10-14 05:32:00,814][100917] Updated weights for policy 1, policy_version 12422 (0.0007) +[2023-10-14 05:32:01,183][100917] Updated weights for policy 1, policy_version 12432 (0.0009) +[2023-10-14 05:32:01,550][100917] Updated weights for policy 1, policy_version 12442 (0.0009) +[2023-10-14 05:32:02,687][100936] Updated weights for policy 0, policy_version 12450 (0.0007) +[2023-10-14 05:32:03,062][100936] Updated weights for policy 0, policy_version 12460 (0.0009) +[2023-10-14 05:32:03,432][100936] Updated weights for policy 0, policy_version 12470 (0.0008) +[2023-10-14 05:32:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 25493504. Throughput: 0: 1674.0, 1: 1651.7. Samples: 6383958. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 05:32:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:32:03,809][100936] Updated weights for policy 0, policy_version 12480 (0.0010) +[2023-10-14 05:32:05,719][100917] Updated weights for policy 1, policy_version 12452 (0.0009) +[2023-10-14 05:32:06,106][100917] Updated weights for policy 1, policy_version 12462 (0.0008) +[2023-10-14 05:32:06,469][100917] Updated weights for policy 1, policy_version 12472 (0.0009) +[2023-10-14 05:32:08,005][100936] Updated weights for policy 0, policy_version 12490 (0.0007) +[2023-10-14 05:32:08,380][100936] Updated weights for policy 0, policy_version 12500 (0.0007) +[2023-10-14 05:32:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 25559040. Throughput: 0: 1655.1, 1: 1671.6. Samples: 6403482. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:32:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:32:08,745][100936] Updated weights for policy 0, policy_version 12510 (0.0009) +[2023-10-14 05:32:10,614][100917] Updated weights for policy 1, policy_version 12482 (0.0010) +[2023-10-14 05:32:10,981][100917] Updated weights for policy 1, policy_version 12492 (0.0008) +[2023-10-14 05:32:11,353][100917] Updated weights for policy 1, policy_version 12502 (0.0008) +[2023-10-14 05:32:11,722][100917] Updated weights for policy 1, policy_version 12512 (0.0007) +[2023-10-14 05:32:12,817][100936] Updated weights for policy 0, policy_version 12520 (0.0008) +[2023-10-14 05:32:13,197][100936] Updated weights for policy 0, policy_version 12530 (0.0010) +[2023-10-14 05:32:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 25624576. Throughput: 0: 1674.6, 1: 1664.8. Samples: 6414148. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:32:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:32:13,569][100936] Updated weights for policy 0, policy_version 12540 (0.0007) +[2023-10-14 05:32:15,885][100917] Updated weights for policy 1, policy_version 12522 (0.0009) +[2023-10-14 05:32:16,267][100917] Updated weights for policy 1, policy_version 12532 (0.0008) +[2023-10-14 05:32:16,640][100917] Updated weights for policy 1, policy_version 12542 (0.0007) +[2023-10-14 05:32:17,693][100936] Updated weights for policy 0, policy_version 12550 (0.0007) +[2023-10-14 05:32:18,060][100936] Updated weights for policy 0, policy_version 12560 (0.0008) +[2023-10-14 05:32:18,440][100936] Updated weights for policy 0, policy_version 12570 (0.0009) +[2023-10-14 05:32:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 25690112. Throughput: 0: 1671.5, 1: 1660.2. Samples: 6433684. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:32:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:32:20,648][100917] Updated weights for policy 1, policy_version 12552 (0.0009) +[2023-10-14 05:32:21,024][100917] Updated weights for policy 1, policy_version 12562 (0.0009) +[2023-10-14 05:32:21,400][100917] Updated weights for policy 1, policy_version 12572 (0.0007) +[2023-10-14 05:32:22,649][100936] Updated weights for policy 0, policy_version 12580 (0.0009) +[2023-10-14 05:32:23,016][100936] Updated weights for policy 0, policy_version 12590 (0.0010) +[2023-10-14 05:32:23,395][100936] Updated weights for policy 0, policy_version 12600 (0.0012) +[2023-10-14 05:32:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 25755648. Throughput: 0: 1650.5, 1: 1668.0. Samples: 6453110. Policy #0 lag: (min: 12.0, avg: 19.9, max: 44.0) +[2023-10-14 05:32:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:32:25,660][100917] Updated weights for policy 1, policy_version 12582 (0.0008) +[2023-10-14 05:32:26,027][100917] Updated weights for policy 1, policy_version 12592 (0.0009) +[2023-10-14 05:32:26,397][100917] Updated weights for policy 1, policy_version 12602 (0.0009) +[2023-10-14 05:32:27,509][100936] Updated weights for policy 0, policy_version 12610 (0.0007) +[2023-10-14 05:32:27,885][100936] Updated weights for policy 0, policy_version 12620 (0.0007) +[2023-10-14 05:32:28,248][100936] Updated weights for policy 0, policy_version 12630 (0.0007) +[2023-10-14 05:32:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 25821184. Throughput: 0: 1672.7, 1: 1650.9. Samples: 6463622. Policy #0 lag: (min: 12.0, avg: 19.9, max: 44.0) +[2023-10-14 05:32:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:32:28,616][100936] Updated weights for policy 0, policy_version 12640 (0.0007) +[2023-10-14 05:32:30,718][100917] Updated weights for policy 1, policy_version 12612 (0.0010) +[2023-10-14 05:32:31,095][100917] Updated weights for policy 1, policy_version 12622 (0.0009) +[2023-10-14 05:32:31,458][100917] Updated weights for policy 1, policy_version 12632 (0.0011) +[2023-10-14 05:32:32,879][100936] Updated weights for policy 0, policy_version 12650 (0.0008) +[2023-10-14 05:32:33,254][100936] Updated weights for policy 0, policy_version 12660 (0.0009) +[2023-10-14 05:32:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 25886720. Throughput: 0: 1662.8, 1: 1657.8. Samples: 6483334. Policy #0 lag: (min: 19.0, avg: 21.8, max: 51.0) +[2023-10-14 05:32:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:32:33,620][100936] Updated weights for policy 0, policy_version 12670 (0.0010) +[2023-10-14 05:32:35,567][100917] Updated weights for policy 1, policy_version 12642 (0.0010) +[2023-10-14 05:32:35,949][100917] Updated weights for policy 1, policy_version 12652 (0.0007) +[2023-10-14 05:32:36,311][100917] Updated weights for policy 1, policy_version 12662 (0.0009) +[2023-10-14 05:32:36,686][100917] Updated weights for policy 1, policy_version 12672 (0.0009) +[2023-10-14 05:32:37,867][100936] Updated weights for policy 0, policy_version 12680 (0.0011) +[2023-10-14 05:32:38,239][100936] Updated weights for policy 0, policy_version 12690 (0.0008) +[2023-10-14 05:32:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 25952256. Throughput: 0: 1651.3, 1: 1663.2. Samples: 6502914. Policy #0 lag: (min: 19.0, avg: 21.8, max: 51.0) +[2023-10-14 05:32:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:32:38,614][100936] Updated weights for policy 0, policy_version 12700 (0.0008) +[2023-10-14 05:32:40,840][100917] Updated weights for policy 1, policy_version 12682 (0.0007) +[2023-10-14 05:32:41,219][100917] Updated weights for policy 1, policy_version 12692 (0.0008) +[2023-10-14 05:32:41,591][100917] Updated weights for policy 1, policy_version 12702 (0.0007) +[2023-10-14 05:32:42,636][100936] Updated weights for policy 0, policy_version 12710 (0.0008) +[2023-10-14 05:32:43,002][100936] Updated weights for policy 0, policy_version 12720 (0.0009) +[2023-10-14 05:32:43,372][100936] Updated weights for policy 0, policy_version 12730 (0.0008) +[2023-10-14 05:32:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 26017792. Throughput: 0: 1663.6, 1: 1652.6. Samples: 6513492. Policy #0 lag: (min: 38.0, avg: 54.8, max: 56.0) +[2023-10-14 05:32:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:32:45,876][100917] Updated weights for policy 1, policy_version 12712 (0.0008) +[2023-10-14 05:32:46,250][100917] Updated weights for policy 1, policy_version 12722 (0.0009) +[2023-10-14 05:32:46,610][100917] Updated weights for policy 1, policy_version 12732 (0.0008) +[2023-10-14 05:32:47,394][100936] Updated weights for policy 0, policy_version 12740 (0.0009) +[2023-10-14 05:32:47,762][100936] Updated weights for policy 0, policy_version 12750 (0.0010) +[2023-10-14 05:32:48,136][100936] Updated weights for policy 0, policy_version 12760 (0.0007) +[2023-10-14 05:32:48,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 26116096. Throughput: 0: 1659.6, 1: 1653.9. Samples: 6533066. Policy #0 lag: (min: 38.0, avg: 54.8, max: 56.0) +[2023-10-14 05:32:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:32:50,796][100917] Updated weights for policy 1, policy_version 12742 (0.0010) +[2023-10-14 05:32:51,163][100917] Updated weights for policy 1, policy_version 12752 (0.0008) +[2023-10-14 05:32:51,544][100917] Updated weights for policy 1, policy_version 12762 (0.0007) +[2023-10-14 05:32:51,893][100936] Updated weights for policy 0, policy_version 12770 (0.0011) +[2023-10-14 05:32:52,268][100936] Updated weights for policy 0, policy_version 12780 (0.0010) +[2023-10-14 05:32:52,635][100936] Updated weights for policy 0, policy_version 12790 (0.0011) +[2023-10-14 05:32:53,010][100936] Updated weights for policy 0, policy_version 12800 (0.0010) +[2023-10-14 05:32:53,512][99942] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 26181632. Throughput: 0: 1656.7, 1: 1655.5. Samples: 6552536. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:32:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:32:55,520][100917] Updated weights for policy 1, policy_version 12772 (0.0009) +[2023-10-14 05:32:55,902][100917] Updated weights for policy 1, policy_version 12782 (0.0009) +[2023-10-14 05:32:56,273][100917] Updated weights for policy 1, policy_version 12792 (0.0008) +[2023-10-14 05:32:57,228][100936] Updated weights for policy 0, policy_version 12810 (0.0009) +[2023-10-14 05:32:57,603][100936] Updated weights for policy 0, policy_version 12820 (0.0010) +[2023-10-14 05:32:57,967][100936] Updated weights for policy 0, policy_version 12830 (0.0010) +[2023-10-14 05:32:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 26247168. Throughput: 0: 1667.9, 1: 1654.4. Samples: 6563648. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:32:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:33:00,347][100917] Updated weights for policy 1, policy_version 12802 (0.0007) +[2023-10-14 05:33:00,736][100917] Updated weights for policy 1, policy_version 12812 (0.0010) +[2023-10-14 05:33:01,111][100917] Updated weights for policy 1, policy_version 12822 (0.0007) +[2023-10-14 05:33:01,478][100917] Updated weights for policy 1, policy_version 12832 (0.0008) +[2023-10-14 05:33:02,432][100936] Updated weights for policy 0, policy_version 12840 (0.0007) +[2023-10-14 05:33:02,810][100936] Updated weights for policy 0, policy_version 12850 (0.0008) +[2023-10-14 05:33:03,188][100936] Updated weights for policy 0, policy_version 12860 (0.0008) +[2023-10-14 05:33:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 26312704. Throughput: 0: 1658.7, 1: 1657.7. Samples: 6582922. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:33:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:33:05,433][100917] Updated weights for policy 1, policy_version 12842 (0.0007) +[2023-10-14 05:33:05,821][100917] Updated weights for policy 1, policy_version 12852 (0.0010) +[2023-10-14 05:33:06,181][100917] Updated weights for policy 1, policy_version 12862 (0.0008) +[2023-10-14 05:33:07,159][100936] Updated weights for policy 0, policy_version 12870 (0.0008) +[2023-10-14 05:33:07,530][100936] Updated weights for policy 0, policy_version 12880 (0.0009) +[2023-10-14 05:33:07,910][100936] Updated weights for policy 0, policy_version 12890 (0.0009) +[2023-10-14 05:33:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 26378240. Throughput: 0: 1662.6, 1: 1661.3. Samples: 6602688. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:33:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:33:10,184][100917] Updated weights for policy 1, policy_version 12872 (0.0009) +[2023-10-14 05:33:10,548][100917] Updated weights for policy 1, policy_version 12882 (0.0008) +[2023-10-14 05:33:10,926][100917] Updated weights for policy 1, policy_version 12892 (0.0009) +[2023-10-14 05:33:11,964][100936] Updated weights for policy 0, policy_version 12900 (0.0009) +[2023-10-14 05:33:12,338][100936] Updated weights for policy 0, policy_version 12910 (0.0008) +[2023-10-14 05:33:12,705][100936] Updated weights for policy 0, policy_version 12920 (0.0008) +[2023-10-14 05:33:13,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 26443776. Throughput: 0: 1666.8, 1: 1654.2. Samples: 6613068. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:33:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:33:15,222][100917] Updated weights for policy 1, policy_version 12902 (0.0009) +[2023-10-14 05:33:15,595][100917] Updated weights for policy 1, policy_version 12912 (0.0008) +[2023-10-14 05:33:15,964][100917] Updated weights for policy 1, policy_version 12922 (0.0008) +[2023-10-14 05:33:16,929][100936] Updated weights for policy 0, policy_version 12930 (0.0007) +[2023-10-14 05:33:17,310][100936] Updated weights for policy 0, policy_version 12940 (0.0007) +[2023-10-14 05:33:17,697][100936] Updated weights for policy 0, policy_version 12950 (0.0008) +[2023-10-14 05:33:18,062][100936] Updated weights for policy 0, policy_version 12960 (0.0007) +[2023-10-14 05:33:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 26509312. Throughput: 0: 1658.0, 1: 1662.7. Samples: 6632762. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:33:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:33:20,132][100917] Updated weights for policy 1, policy_version 12932 (0.0009) +[2023-10-14 05:33:20,512][100917] Updated weights for policy 1, policy_version 12942 (0.0009) +[2023-10-14 05:33:20,885][100917] Updated weights for policy 1, policy_version 12952 (0.0007) +[2023-10-14 05:33:22,037][100936] Updated weights for policy 0, policy_version 12970 (0.0007) +[2023-10-14 05:33:22,408][100936] Updated weights for policy 0, policy_version 12980 (0.0008) +[2023-10-14 05:33:22,787][100936] Updated weights for policy 0, policy_version 12990 (0.0008) +[2023-10-14 05:33:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 26574848. Throughput: 0: 1663.6, 1: 1662.9. Samples: 6652608. Policy #0 lag: (min: 10.0, avg: 12.5, max: 38.0) +[2023-10-14 05:33:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:33:24,884][100917] Updated weights for policy 1, policy_version 12962 (0.0009) +[2023-10-14 05:33:25,267][100917] Updated weights for policy 1, policy_version 12972 (0.0007) +[2023-10-14 05:33:25,648][100917] Updated weights for policy 1, policy_version 12982 (0.0007) +[2023-10-14 05:33:26,019][100917] Updated weights for policy 1, policy_version 12992 (0.0009) +[2023-10-14 05:33:26,920][100936] Updated weights for policy 0, policy_version 13000 (0.0010) +[2023-10-14 05:33:27,289][100936] Updated weights for policy 0, policy_version 13010 (0.0010) +[2023-10-14 05:33:27,662][100936] Updated weights for policy 0, policy_version 13020 (0.0009) +[2023-10-14 05:33:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 26640384. Throughput: 0: 1672.1, 1: 1651.9. Samples: 6663072. Policy #0 lag: (min: 10.0, avg: 12.5, max: 38.0) +[2023-10-14 05:33:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:33:30,079][100917] Updated weights for policy 1, policy_version 13002 (0.0009) +[2023-10-14 05:33:30,455][100917] Updated weights for policy 1, policy_version 13012 (0.0008) +[2023-10-14 05:33:30,833][100917] Updated weights for policy 1, policy_version 13022 (0.0009) +[2023-10-14 05:33:31,823][100936] Updated weights for policy 0, policy_version 13030 (0.0009) +[2023-10-14 05:33:32,186][100936] Updated weights for policy 0, policy_version 13040 (0.0008) +[2023-10-14 05:33:32,562][100936] Updated weights for policy 0, policy_version 13050 (0.0008) +[2023-10-14 05:33:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 26705920. Throughput: 0: 1653.8, 1: 1672.1. Samples: 6682732. Policy #0 lag: (min: 10.0, avg: 12.5, max: 38.0) +[2023-10-14 05:33:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:33:35,021][100917] Updated weights for policy 1, policy_version 13032 (0.0008) +[2023-10-14 05:33:35,403][100917] Updated weights for policy 1, policy_version 13042 (0.0007) +[2023-10-14 05:33:35,785][100917] Updated weights for policy 1, policy_version 13052 (0.0009) +[2023-10-14 05:33:36,452][100936] Updated weights for policy 0, policy_version 13060 (0.0008) +[2023-10-14 05:33:36,822][100936] Updated weights for policy 0, policy_version 13070 (0.0009) +[2023-10-14 05:33:37,189][100936] Updated weights for policy 0, policy_version 13080 (0.0010) +[2023-10-14 05:33:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 26771456. Throughput: 0: 1669.3, 1: 1674.1. Samples: 6702992. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-10-14 05:33:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:33:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000013088_13402112.pth... +[2023-10-14 05:33:38,524][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000013056_13369344.pth... +[2023-10-14 05:33:38,562][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000011520_11796480.pth +[2023-10-14 05:33:38,562][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000011520_11796480.pth +[2023-10-14 05:33:39,761][100917] Updated weights for policy 1, policy_version 13062 (0.0007) +[2023-10-14 05:33:40,134][100917] Updated weights for policy 1, policy_version 13072 (0.0010) +[2023-10-14 05:33:40,513][100917] Updated weights for policy 1, policy_version 13082 (0.0008) +[2023-10-14 05:33:41,201][100936] Updated weights for policy 0, policy_version 13090 (0.0011) +[2023-10-14 05:33:41,576][100936] Updated weights for policy 0, policy_version 13100 (0.0009) +[2023-10-14 05:33:41,948][100936] Updated weights for policy 0, policy_version 13110 (0.0007) +[2023-10-14 05:33:42,314][100936] Updated weights for policy 0, policy_version 13120 (0.0007) +[2023-10-14 05:33:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 26836992. Throughput: 0: 1664.9, 1: 1654.9. Samples: 6713042. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-10-14 05:33:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:33:44,699][100917] Updated weights for policy 1, policy_version 13092 (0.0009) +[2023-10-14 05:33:45,069][100917] Updated weights for policy 1, policy_version 13102 (0.0008) +[2023-10-14 05:33:45,452][100917] Updated weights for policy 1, policy_version 13112 (0.0010) +[2023-10-14 05:33:46,552][100936] Updated weights for policy 0, policy_version 13130 (0.0009) +[2023-10-14 05:33:46,927][100936] Updated weights for policy 0, policy_version 13140 (0.0007) +[2023-10-14 05:33:47,305][100936] Updated weights for policy 0, policy_version 13150 (0.0007) +[2023-10-14 05:33:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 26902528. Throughput: 0: 1655.1, 1: 1670.1. Samples: 6732556. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) +[2023-10-14 05:33:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:33:49,475][100917] Updated weights for policy 1, policy_version 13122 (0.0008) +[2023-10-14 05:33:49,845][100917] Updated weights for policy 1, policy_version 13132 (0.0010) +[2023-10-14 05:33:50,228][100917] Updated weights for policy 1, policy_version 13142 (0.0010) +[2023-10-14 05:33:50,597][100917] Updated weights for policy 1, policy_version 13152 (0.0011) +[2023-10-14 05:33:51,432][100936] Updated weights for policy 0, policy_version 13160 (0.0008) +[2023-10-14 05:33:51,803][100936] Updated weights for policy 0, policy_version 13170 (0.0010) +[2023-10-14 05:33:52,179][100936] Updated weights for policy 0, policy_version 13180 (0.0009) +[2023-10-14 05:33:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 26968064. Throughput: 0: 1668.9, 1: 1666.7. Samples: 6752794. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) +[2023-10-14 05:33:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:33:54,689][100917] Updated weights for policy 1, policy_version 13162 (0.0007) +[2023-10-14 05:33:55,067][100917] Updated weights for policy 1, policy_version 13172 (0.0007) +[2023-10-14 05:33:55,442][100917] Updated weights for policy 1, policy_version 13182 (0.0008) +[2023-10-14 05:33:56,219][100936] Updated weights for policy 0, policy_version 13190 (0.0008) +[2023-10-14 05:33:56,588][100936] Updated weights for policy 0, policy_version 13200 (0.0008) +[2023-10-14 05:33:56,961][100936] Updated weights for policy 0, policy_version 13210 (0.0008) +[2023-10-14 05:33:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 27033600. Throughput: 0: 1663.3, 1: 1660.6. Samples: 6762642. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) +[2023-10-14 05:33:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:33:59,643][100917] Updated weights for policy 1, policy_version 13192 (0.0008) +[2023-10-14 05:34:00,022][100917] Updated weights for policy 1, policy_version 13202 (0.0008) +[2023-10-14 05:34:00,390][100917] Updated weights for policy 1, policy_version 13212 (0.0008) +[2023-10-14 05:34:00,975][100936] Updated weights for policy 0, policy_version 13220 (0.0007) +[2023-10-14 05:34:01,359][100936] Updated weights for policy 0, policy_version 13230 (0.0008) +[2023-10-14 05:34:01,730][100936] Updated weights for policy 0, policy_version 13240 (0.0010) +[2023-10-14 05:34:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 27099136. Throughput: 0: 1658.1, 1: 1665.4. Samples: 6782322. Policy #0 lag: (min: 8.0, avg: 31.0, max: 40.0) +[2023-10-14 05:34:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:34:04,432][100917] Updated weights for policy 1, policy_version 13222 (0.0007) +[2023-10-14 05:34:04,810][100917] Updated weights for policy 1, policy_version 13232 (0.0007) +[2023-10-14 05:34:05,181][100917] Updated weights for policy 1, policy_version 13242 (0.0007) +[2023-10-14 05:34:05,813][100936] Updated weights for policy 0, policy_version 13250 (0.0009) +[2023-10-14 05:34:06,182][100936] Updated weights for policy 0, policy_version 13260 (0.0010) +[2023-10-14 05:34:06,560][100936] Updated weights for policy 0, policy_version 13270 (0.0007) +[2023-10-14 05:34:06,922][100936] Updated weights for policy 0, policy_version 13280 (0.0009) +[2023-10-14 05:34:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 27164672. Throughput: 0: 1675.3, 1: 1662.4. Samples: 6802802. Policy #0 lag: (min: 8.0, avg: 31.0, max: 40.0) +[2023-10-14 05:34:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:34:09,371][100917] Updated weights for policy 1, policy_version 13252 (0.0010) +[2023-10-14 05:34:09,749][100917] Updated weights for policy 1, policy_version 13262 (0.0007) +[2023-10-14 05:34:10,121][100917] Updated weights for policy 1, policy_version 13272 (0.0008) +[2023-10-14 05:34:11,166][100936] Updated weights for policy 0, policy_version 13290 (0.0009) +[2023-10-14 05:34:11,530][100936] Updated weights for policy 0, policy_version 13300 (0.0010) +[2023-10-14 05:34:11,901][100936] Updated weights for policy 0, policy_version 13310 (0.0007) +[2023-10-14 05:34:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 27230208. Throughput: 0: 1658.2, 1: 1657.5. Samples: 6812276. Policy #0 lag: (min: 8.0, avg: 31.0, max: 40.0) +[2023-10-14 05:34:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:34:14,191][100917] Updated weights for policy 1, policy_version 13282 (0.0008) +[2023-10-14 05:34:14,564][100917] Updated weights for policy 1, policy_version 13292 (0.0009) +[2023-10-14 05:34:14,926][100917] Updated weights for policy 1, policy_version 13302 (0.0008) +[2023-10-14 05:34:15,312][100917] Updated weights for policy 1, policy_version 13312 (0.0009) +[2023-10-14 05:34:15,921][100936] Updated weights for policy 0, policy_version 13320 (0.0007) +[2023-10-14 05:34:16,284][100936] Updated weights for policy 0, policy_version 13330 (0.0009) +[2023-10-14 05:34:16,654][100936] Updated weights for policy 0, policy_version 13340 (0.0010) +[2023-10-14 05:34:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 27295744. Throughput: 0: 1662.8, 1: 1658.7. Samples: 6832198. Policy #0 lag: (min: 31.0, avg: 37.6, max: 63.0) +[2023-10-14 05:34:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:34:19,455][100917] Updated weights for policy 1, policy_version 13322 (0.0008) +[2023-10-14 05:34:19,830][100917] Updated weights for policy 1, policy_version 13332 (0.0009) +[2023-10-14 05:34:20,211][100917] Updated weights for policy 1, policy_version 13342 (0.0009) +[2023-10-14 05:34:20,824][100936] Updated weights for policy 0, policy_version 13350 (0.0007) +[2023-10-14 05:34:21,198][100936] Updated weights for policy 0, policy_version 13360 (0.0007) +[2023-10-14 05:34:21,573][100936] Updated weights for policy 0, policy_version 13370 (0.0008) +[2023-10-14 05:34:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 27361280. Throughput: 0: 1668.9, 1: 1654.7. Samples: 6852552. Policy #0 lag: (min: 31.0, avg: 37.6, max: 63.0) +[2023-10-14 05:34:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:34:24,331][100917] Updated weights for policy 1, policy_version 13352 (0.0007) +[2023-10-14 05:34:24,710][100917] Updated weights for policy 1, policy_version 13362 (0.0008) +[2023-10-14 05:34:25,083][100917] Updated weights for policy 1, policy_version 13372 (0.0009) +[2023-10-14 05:34:25,641][100936] Updated weights for policy 0, policy_version 13380 (0.0008) +[2023-10-14 05:34:26,011][100936] Updated weights for policy 0, policy_version 13390 (0.0008) +[2023-10-14 05:34:26,385][100936] Updated weights for policy 0, policy_version 13400 (0.0008) +[2023-10-14 05:34:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 27426816. Throughput: 0: 1655.0, 1: 1655.2. Samples: 6862002. Policy #0 lag: (min: 31.0, avg: 37.6, max: 63.0) +[2023-10-14 05:34:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:34:29,192][100917] Updated weights for policy 1, policy_version 13382 (0.0009) +[2023-10-14 05:34:29,561][100917] Updated weights for policy 1, policy_version 13392 (0.0010) +[2023-10-14 05:34:29,934][100917] Updated weights for policy 1, policy_version 13402 (0.0011) +[2023-10-14 05:34:30,344][100936] Updated weights for policy 0, policy_version 13410 (0.0008) +[2023-10-14 05:34:30,722][100936] Updated weights for policy 0, policy_version 13420 (0.0008) +[2023-10-14 05:34:31,091][100936] Updated weights for policy 0, policy_version 13430 (0.0010) +[2023-10-14 05:34:31,462][100936] Updated weights for policy 0, policy_version 13440 (0.0008) +[2023-10-14 05:34:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 27492352. Throughput: 0: 1670.9, 1: 1659.3. Samples: 6882418. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:34:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:34:34,142][100917] Updated weights for policy 1, policy_version 13412 (0.0010) +[2023-10-14 05:34:34,507][100917] Updated weights for policy 1, policy_version 13422 (0.0007) +[2023-10-14 05:34:34,888][100917] Updated weights for policy 1, policy_version 13432 (0.0008) +[2023-10-14 05:34:35,567][100936] Updated weights for policy 0, policy_version 13450 (0.0008) +[2023-10-14 05:34:35,931][100936] Updated weights for policy 0, policy_version 13460 (0.0011) +[2023-10-14 05:34:36,307][100936] Updated weights for policy 0, policy_version 13470 (0.0008) +[2023-10-14 05:34:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 27557888. Throughput: 0: 1671.0, 1: 1657.8. Samples: 6902590. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:34:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:34:39,073][100917] Updated weights for policy 1, policy_version 13442 (0.0007) +[2023-10-14 05:34:39,446][100917] Updated weights for policy 1, policy_version 13452 (0.0007) +[2023-10-14 05:34:39,824][100917] Updated weights for policy 1, policy_version 13462 (0.0008) +[2023-10-14 05:34:40,208][100917] Updated weights for policy 1, policy_version 13472 (0.0008) +[2023-10-14 05:34:40,576][100936] Updated weights for policy 0, policy_version 13480 (0.0008) +[2023-10-14 05:34:40,944][100936] Updated weights for policy 0, policy_version 13490 (0.0008) +[2023-10-14 05:34:41,314][100936] Updated weights for policy 0, policy_version 13500 (0.0008) +[2023-10-14 05:34:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 27623424. Throughput: 0: 1652.8, 1: 1655.4. Samples: 6911510. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:34:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:34:44,199][100917] Updated weights for policy 1, policy_version 13482 (0.0009) +[2023-10-14 05:34:44,571][100917] Updated weights for policy 1, policy_version 13492 (0.0008) +[2023-10-14 05:34:44,938][100917] Updated weights for policy 1, policy_version 13502 (0.0009) +[2023-10-14 05:34:45,508][100936] Updated weights for policy 0, policy_version 13510 (0.0010) +[2023-10-14 05:34:45,889][100936] Updated weights for policy 0, policy_version 13520 (0.0008) +[2023-10-14 05:34:46,254][100936] Updated weights for policy 0, policy_version 13530 (0.0010) +[2023-10-14 05:34:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 27688960. Throughput: 0: 1663.1, 1: 1656.1. Samples: 6931688. Policy #0 lag: (min: 31.0, avg: 38.3, max: 63.0) +[2023-10-14 05:34:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:34:48,978][100917] Updated weights for policy 1, policy_version 13512 (0.0007) +[2023-10-14 05:34:49,347][100917] Updated weights for policy 1, policy_version 13522 (0.0009) +[2023-10-14 05:34:49,717][100917] Updated weights for policy 1, policy_version 13532 (0.0009) +[2023-10-14 05:34:50,290][100936] Updated weights for policy 0, policy_version 13540 (0.0007) +[2023-10-14 05:34:50,663][100936] Updated weights for policy 0, policy_version 13550 (0.0008) +[2023-10-14 05:34:51,044][100936] Updated weights for policy 0, policy_version 13560 (0.0010) +[2023-10-14 05:34:53,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 27754496. Throughput: 0: 1664.7, 1: 1657.2. Samples: 6952286. Policy #0 lag: (min: 31.0, avg: 38.3, max: 63.0) +[2023-10-14 05:34:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:34:53,920][100917] Updated weights for policy 1, policy_version 13542 (0.0009) +[2023-10-14 05:34:54,282][100917] Updated weights for policy 1, policy_version 13552 (0.0007) +[2023-10-14 05:34:54,662][100917] Updated weights for policy 1, policy_version 13562 (0.0009) +[2023-10-14 05:34:54,998][100936] Updated weights for policy 0, policy_version 13570 (0.0008) +[2023-10-14 05:34:55,364][100936] Updated weights for policy 0, policy_version 13580 (0.0010) +[2023-10-14 05:34:55,733][100936] Updated weights for policy 0, policy_version 13590 (0.0009) +[2023-10-14 05:34:56,108][100936] Updated weights for policy 0, policy_version 13600 (0.0009) +[2023-10-14 05:34:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 27820032. Throughput: 0: 1654.8, 1: 1657.3. Samples: 6961322. Policy #0 lag: (min: 31.0, avg: 38.3, max: 63.0) +[2023-10-14 05:34:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:34:58,716][100917] Updated weights for policy 1, policy_version 13572 (0.0007) +[2023-10-14 05:34:59,097][100917] Updated weights for policy 1, policy_version 13582 (0.0007) +[2023-10-14 05:34:59,468][100917] Updated weights for policy 1, policy_version 13592 (0.0009) +[2023-10-14 05:35:00,224][100936] Updated weights for policy 0, policy_version 13610 (0.0010) +[2023-10-14 05:35:00,605][100936] Updated weights for policy 0, policy_version 13620 (0.0009) +[2023-10-14 05:35:00,979][100936] Updated weights for policy 0, policy_version 13630 (0.0009) +[2023-10-14 05:35:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 27885568. Throughput: 0: 1664.8, 1: 1658.9. Samples: 6981768. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) +[2023-10-14 05:35:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:35:03,579][100917] Updated weights for policy 1, policy_version 13602 (0.0008) +[2023-10-14 05:35:03,998][100917] Updated weights for policy 1, policy_version 13612 (0.0009) +[2023-10-14 05:35:04,364][100917] Updated weights for policy 1, policy_version 13622 (0.0009) +[2023-10-14 05:35:04,742][100917] Updated weights for policy 1, policy_version 13632 (0.0009) +[2023-10-14 05:35:05,054][100936] Updated weights for policy 0, policy_version 13640 (0.0009) +[2023-10-14 05:35:05,425][100936] Updated weights for policy 0, policy_version 13650 (0.0007) +[2023-10-14 05:35:05,801][100936] Updated weights for policy 0, policy_version 13660 (0.0008) +[2023-10-14 05:35:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 27951104. Throughput: 0: 1660.9, 1: 1657.6. Samples: 7001884. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) +[2023-10-14 05:35:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:35:08,795][100917] Updated weights for policy 1, policy_version 13642 (0.0010) +[2023-10-14 05:35:09,174][100917] Updated weights for policy 1, policy_version 13652 (0.0011) +[2023-10-14 05:35:09,546][100917] Updated weights for policy 1, policy_version 13662 (0.0009) +[2023-10-14 05:35:10,185][100936] Updated weights for policy 0, policy_version 13670 (0.0010) +[2023-10-14 05:35:10,560][100936] Updated weights for policy 0, policy_version 13680 (0.0008) +[2023-10-14 05:35:10,928][100936] Updated weights for policy 0, policy_version 13690 (0.0008) +[2023-10-14 05:35:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 28016640. Throughput: 0: 1649.2, 1: 1661.3. Samples: 7010972. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) +[2023-10-14 05:35:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.870')] +[2023-10-14 05:35:13,716][100917] Updated weights for policy 1, policy_version 13672 (0.0010) +[2023-10-14 05:35:14,084][100917] Updated weights for policy 1, policy_version 13682 (0.0008) +[2023-10-14 05:35:14,456][100917] Updated weights for policy 1, policy_version 13692 (0.0010) +[2023-10-14 05:35:15,010][100936] Updated weights for policy 0, policy_version 13700 (0.0009) +[2023-10-14 05:35:15,391][100936] Updated weights for policy 0, policy_version 13710 (0.0008) +[2023-10-14 05:35:15,764][100936] Updated weights for policy 0, policy_version 13720 (0.0008) +[2023-10-14 05:35:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 28082176. Throughput: 0: 1650.7, 1: 1658.1. Samples: 7031312. Policy #0 lag: (min: 2.0, avg: 9.1, max: 34.0) +[2023-10-14 05:35:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.800')] +[2023-10-14 05:35:18,525][100917] Updated weights for policy 1, policy_version 13702 (0.0008) +[2023-10-14 05:35:18,910][100917] Updated weights for policy 1, policy_version 13712 (0.0009) +[2023-10-14 05:35:19,280][100917] Updated weights for policy 1, policy_version 13722 (0.0010) +[2023-10-14 05:35:20,063][100936] Updated weights for policy 0, policy_version 13730 (0.0007) +[2023-10-14 05:35:20,443][100936] Updated weights for policy 0, policy_version 13740 (0.0008) +[2023-10-14 05:35:20,808][100936] Updated weights for policy 0, policy_version 13750 (0.0007) +[2023-10-14 05:35:21,169][100936] Updated weights for policy 0, policy_version 13760 (0.0008) +[2023-10-14 05:35:23,436][100917] Updated weights for policy 1, policy_version 13732 (0.0008) +[2023-10-14 05:35:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 28147712. Throughput: 0: 1654.0, 1: 1665.6. Samples: 7051968. Policy #0 lag: (min: 2.0, avg: 9.1, max: 34.0) +[2023-10-14 05:35:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.800')] +[2023-10-14 05:35:23,810][100917] Updated weights for policy 1, policy_version 13742 (0.0007) +[2023-10-14 05:35:24,193][100917] Updated weights for policy 1, policy_version 13752 (0.0009) +[2023-10-14 05:35:25,408][100936] Updated weights for policy 0, policy_version 13770 (0.0009) +[2023-10-14 05:35:25,782][100936] Updated weights for policy 0, policy_version 13780 (0.0009) +[2023-10-14 05:35:26,156][100936] Updated weights for policy 0, policy_version 13790 (0.0008) +[2023-10-14 05:35:28,274][100917] Updated weights for policy 1, policy_version 13762 (0.0009) +[2023-10-14 05:35:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 28213248. Throughput: 0: 1650.5, 1: 1669.4. Samples: 7060908. Policy #0 lag: (min: 31.0, avg: 41.2, max: 63.0) +[2023-10-14 05:35:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.800')] +[2023-10-14 05:35:28,649][100917] Updated weights for policy 1, policy_version 13772 (0.0009) +[2023-10-14 05:35:29,023][100917] Updated weights for policy 1, policy_version 13782 (0.0009) +[2023-10-14 05:35:29,404][100917] Updated weights for policy 1, policy_version 13792 (0.0010) +[2023-10-14 05:35:30,252][100936] Updated weights for policy 0, policy_version 13800 (0.0008) +[2023-10-14 05:35:30,620][100936] Updated weights for policy 0, policy_version 13810 (0.0010) +[2023-10-14 05:35:30,992][100936] Updated weights for policy 0, policy_version 13820 (0.0009) +[2023-10-14 05:35:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 28278784. Throughput: 0: 1655.6, 1: 1671.9. Samples: 7081424. Policy #0 lag: (min: 31.0, avg: 41.2, max: 63.0) +[2023-10-14 05:35:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.800')] +[2023-10-14 05:35:33,528][100917] Updated weights for policy 1, policy_version 13802 (0.0007) +[2023-10-14 05:35:33,906][100917] Updated weights for policy 1, policy_version 13812 (0.0007) +[2023-10-14 05:35:34,277][100917] Updated weights for policy 1, policy_version 13822 (0.0007) +[2023-10-14 05:35:35,149][100936] Updated weights for policy 0, policy_version 13830 (0.0010) +[2023-10-14 05:35:35,512][100936] Updated weights for policy 0, policy_version 13840 (0.0009) +[2023-10-14 05:35:35,892][100936] Updated weights for policy 0, policy_version 13850 (0.0009) +[2023-10-14 05:35:38,192][100917] Updated weights for policy 1, policy_version 13832 (0.0008) +[2023-10-14 05:35:38,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 28344320. Throughput: 0: 1645.9, 1: 1671.9. Samples: 7101592. Policy #0 lag: (min: 31.0, avg: 41.2, max: 63.0) +[2023-10-14 05:35:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.800')] +[2023-10-14 05:35:38,527][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000013856_14188544.pth... +[2023-10-14 05:35:38,559][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000012320_12615680.pth +[2023-10-14 05:35:38,566][100917] Updated weights for policy 1, policy_version 13842 (0.0007) +[2023-10-14 05:35:38,945][100917] Updated weights for policy 1, policy_version 13852 (0.0008) +[2023-10-14 05:35:39,086][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000013856_14188544.pth... +[2023-10-14 05:35:39,125][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000012288_12582912.pth +[2023-10-14 05:35:40,064][100936] Updated weights for policy 0, policy_version 13860 (0.0008) +[2023-10-14 05:35:40,443][100936] Updated weights for policy 0, policy_version 13870 (0.0009) +[2023-10-14 05:35:40,807][100936] Updated weights for policy 0, policy_version 13880 (0.0010) +[2023-10-14 05:35:43,191][100917] Updated weights for policy 1, policy_version 13862 (0.0009) +[2023-10-14 05:35:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 28409856. Throughput: 0: 1641.1, 1: 1674.5. Samples: 7110524. Policy #0 lag: (min: 18.0, avg: 18.0, max: 18.0) +[2023-10-14 05:35:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.800')] +[2023-10-14 05:35:43,565][100917] Updated weights for policy 1, policy_version 13872 (0.0007) +[2023-10-14 05:35:43,948][100917] Updated weights for policy 1, policy_version 13882 (0.0008) +[2023-10-14 05:35:44,858][100936] Updated weights for policy 0, policy_version 13890 (0.0010) +[2023-10-14 05:35:45,227][100936] Updated weights for policy 0, policy_version 13900 (0.0007) +[2023-10-14 05:35:45,601][100936] Updated weights for policy 0, policy_version 13910 (0.0007) +[2023-10-14 05:35:45,974][100936] Updated weights for policy 0, policy_version 13920 (0.0007) +[2023-10-14 05:35:47,977][100917] Updated weights for policy 1, policy_version 13892 (0.0007) +[2023-10-14 05:35:48,351][100917] Updated weights for policy 1, policy_version 13902 (0.0009) +[2023-10-14 05:35:48,512][99942] Fps is (10 sec: 13107.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 28475392. Throughput: 0: 1644.8, 1: 1674.4. Samples: 7131130. Policy #0 lag: (min: 18.0, avg: 18.0, max: 18.0) +[2023-10-14 05:35:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.800')] +[2023-10-14 05:35:48,731][100917] Updated weights for policy 1, policy_version 13912 (0.0008) +[2023-10-14 05:35:50,051][100936] Updated weights for policy 0, policy_version 13930 (0.0009) +[2023-10-14 05:35:50,421][100936] Updated weights for policy 0, policy_version 13940 (0.0009) +[2023-10-14 05:35:50,796][100936] Updated weights for policy 0, policy_version 13950 (0.0010) +[2023-10-14 05:35:52,781][100917] Updated weights for policy 1, policy_version 13922 (0.0008) +[2023-10-14 05:35:53,202][100917] Updated weights for policy 1, policy_version 13932 (0.0011) +[2023-10-14 05:35:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 28540928. Throughput: 0: 1647.0, 1: 1673.7. Samples: 7151314. Policy #0 lag: (min: 18.0, avg: 18.0, max: 18.0) +[2023-10-14 05:35:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.800')] +[2023-10-14 05:35:53,584][100917] Updated weights for policy 1, policy_version 13942 (0.0008) +[2023-10-14 05:35:53,947][100917] Updated weights for policy 1, policy_version 13952 (0.0007) +[2023-10-14 05:35:54,964][100936] Updated weights for policy 0, policy_version 13960 (0.0009) +[2023-10-14 05:35:55,343][100936] Updated weights for policy 0, policy_version 13970 (0.0008) +[2023-10-14 05:35:55,707][100936] Updated weights for policy 0, policy_version 13980 (0.0009) +[2023-10-14 05:35:58,081][100917] Updated weights for policy 1, policy_version 13962 (0.0007) +[2023-10-14 05:35:58,450][100917] Updated weights for policy 1, policy_version 13972 (0.0007) +[2023-10-14 05:35:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 28606464. Throughput: 0: 1648.5, 1: 1677.3. Samples: 7160634. Policy #0 lag: (min: 24.0, avg: 51.4, max: 56.0) +[2023-10-14 05:35:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.670')] +[2023-10-14 05:35:58,816][100917] Updated weights for policy 1, policy_version 13982 (0.0007) +[2023-10-14 05:35:59,754][100936] Updated weights for policy 0, policy_version 13990 (0.0008) +[2023-10-14 05:36:00,129][100936] Updated weights for policy 0, policy_version 14000 (0.0009) +[2023-10-14 05:36:00,506][100936] Updated weights for policy 0, policy_version 14010 (0.0009) +[2023-10-14 05:36:02,770][100917] Updated weights for policy 1, policy_version 13992 (0.0008) +[2023-10-14 05:36:03,140][100917] Updated weights for policy 1, policy_version 14002 (0.0007) +[2023-10-14 05:36:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 28672000. Throughput: 0: 1653.7, 1: 1679.4. Samples: 7181304. Policy #0 lag: (min: 24.0, avg: 51.4, max: 56.0) +[2023-10-14 05:36:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.670')] +[2023-10-14 05:36:03,524][100917] Updated weights for policy 1, policy_version 14012 (0.0010) +[2023-10-14 05:36:04,650][100936] Updated weights for policy 0, policy_version 14020 (0.0009) +[2023-10-14 05:36:05,017][100936] Updated weights for policy 0, policy_version 14030 (0.0008) +[2023-10-14 05:36:05,397][100936] Updated weights for policy 0, policy_version 14040 (0.0007) +[2023-10-14 05:36:07,809][100917] Updated weights for policy 1, policy_version 14022 (0.0010) +[2023-10-14 05:36:08,175][100917] Updated weights for policy 1, policy_version 14032 (0.0008) +[2023-10-14 05:36:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 28737536. Throughput: 0: 1661.6, 1: 1662.4. Samples: 7201552. Policy #0 lag: (min: 24.0, avg: 51.4, max: 56.0) +[2023-10-14 05:36:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.670')] +[2023-10-14 05:36:08,531][100917] Updated weights for policy 1, policy_version 14042 (0.0008) +[2023-10-14 05:36:09,518][100936] Updated weights for policy 0, policy_version 14050 (0.0009) +[2023-10-14 05:36:09,928][100936] Updated weights for policy 0, policy_version 14060 (0.0010) +[2023-10-14 05:36:10,292][100936] Updated weights for policy 0, policy_version 14070 (0.0009) +[2023-10-14 05:36:10,667][100936] Updated weights for policy 0, policy_version 14080 (0.0007) +[2023-10-14 05:36:12,514][100917] Updated weights for policy 1, policy_version 14052 (0.0009) +[2023-10-14 05:36:12,893][100917] Updated weights for policy 1, policy_version 14062 (0.0011) +[2023-10-14 05:36:13,261][100917] Updated weights for policy 1, policy_version 14072 (0.0010) +[2023-10-14 05:36:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 28803072. Throughput: 0: 1660.1, 1: 1673.4. Samples: 7210916. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-10-14 05:36:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.670')] +[2023-10-14 05:36:14,784][100936] Updated weights for policy 0, policy_version 14090 (0.0008) +[2023-10-14 05:36:15,153][100936] Updated weights for policy 0, policy_version 14100 (0.0008) +[2023-10-14 05:36:15,530][100936] Updated weights for policy 0, policy_version 14110 (0.0007) +[2023-10-14 05:36:17,433][100917] Updated weights for policy 1, policy_version 14082 (0.0011) +[2023-10-14 05:36:17,808][100917] Updated weights for policy 1, policy_version 14092 (0.0008) +[2023-10-14 05:36:18,179][100917] Updated weights for policy 1, policy_version 14102 (0.0007) +[2023-10-14 05:36:18,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 28868608. Throughput: 0: 1664.2, 1: 1666.4. Samples: 7231302. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-10-14 05:36:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.670')] +[2023-10-14 05:36:18,546][100917] Updated weights for policy 1, policy_version 14112 (0.0007) +[2023-10-14 05:36:19,641][100936] Updated weights for policy 0, policy_version 14120 (0.0010) +[2023-10-14 05:36:20,008][100936] Updated weights for policy 0, policy_version 14130 (0.0009) +[2023-10-14 05:36:20,387][100936] Updated weights for policy 0, policy_version 14140 (0.0010) +[2023-10-14 05:36:22,572][100917] Updated weights for policy 1, policy_version 14122 (0.0007) +[2023-10-14 05:36:22,938][100917] Updated weights for policy 1, policy_version 14132 (0.0009) +[2023-10-14 05:36:23,321][100917] Updated weights for policy 1, policy_version 14142 (0.0010) +[2023-10-14 05:36:23,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 28966912. Throughput: 0: 1670.6, 1: 1651.5. Samples: 7251084. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-10-14 05:36:23,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.670')] +[2023-10-14 05:36:24,446][100936] Updated weights for policy 0, policy_version 14150 (0.0008) +[2023-10-14 05:36:24,813][100936] Updated weights for policy 0, policy_version 14160 (0.0007) +[2023-10-14 05:36:25,187][100936] Updated weights for policy 0, policy_version 14170 (0.0007) +[2023-10-14 05:36:27,506][100917] Updated weights for policy 1, policy_version 14152 (0.0007) +[2023-10-14 05:36:27,882][100917] Updated weights for policy 1, policy_version 14162 (0.0007) +[2023-10-14 05:36:28,256][100917] Updated weights for policy 1, policy_version 14172 (0.0009) +[2023-10-14 05:36:28,512][99942] Fps is (10 sec: 16384.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 29032448. Throughput: 0: 1673.4, 1: 1666.6. Samples: 7260822. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 05:36:28,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.670')] +[2023-10-14 05:36:29,377][100936] Updated weights for policy 0, policy_version 14180 (0.0007) +[2023-10-14 05:36:29,741][100936] Updated weights for policy 0, policy_version 14190 (0.0008) +[2023-10-14 05:36:30,120][100936] Updated weights for policy 0, policy_version 14200 (0.0009) +[2023-10-14 05:36:32,314][100917] Updated weights for policy 1, policy_version 14182 (0.0008) +[2023-10-14 05:36:32,687][100917] Updated weights for policy 1, policy_version 14192 (0.0007) +[2023-10-14 05:36:33,063][100917] Updated weights for policy 1, policy_version 14202 (0.0007) +[2023-10-14 05:36:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 29097984. Throughput: 0: 1670.4, 1: 1663.5. Samples: 7281154. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 05:36:33,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.670')] +[2023-10-14 05:36:34,246][100936] Updated weights for policy 0, policy_version 14210 (0.0008) +[2023-10-14 05:36:34,627][100936] Updated weights for policy 0, policy_version 14220 (0.0009) +[2023-10-14 05:36:34,997][100936] Updated weights for policy 0, policy_version 14230 (0.0011) +[2023-10-14 05:36:35,363][100936] Updated weights for policy 0, policy_version 14240 (0.0011) +[2023-10-14 05:36:37,121][100917] Updated weights for policy 1, policy_version 14212 (0.0008) +[2023-10-14 05:36:37,508][100917] Updated weights for policy 1, policy_version 14222 (0.0010) +[2023-10-14 05:36:37,885][100917] Updated weights for policy 1, policy_version 14232 (0.0008) +[2023-10-14 05:36:38,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 29163520. Throughput: 0: 1676.0, 1: 1647.7. Samples: 7300880. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 05:36:38,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.670')] +[2023-10-14 05:36:39,473][100936] Updated weights for policy 0, policy_version 14250 (0.0008) +[2023-10-14 05:36:39,841][100936] Updated weights for policy 0, policy_version 14260 (0.0008) +[2023-10-14 05:36:40,224][100936] Updated weights for policy 0, policy_version 14270 (0.0009) +[2023-10-14 05:36:42,056][100917] Updated weights for policy 1, policy_version 14242 (0.0007) +[2023-10-14 05:36:42,475][100917] Updated weights for policy 1, policy_version 14252 (0.0008) +[2023-10-14 05:36:42,847][100917] Updated weights for policy 1, policy_version 14262 (0.0009) +[2023-10-14 05:36:43,227][100917] Updated weights for policy 1, policy_version 14272 (0.0008) +[2023-10-14 05:36:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 29229056. Throughput: 0: 1675.5, 1: 1666.7. Samples: 7311032. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:36:43,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.670')] +[2023-10-14 05:36:44,318][100936] Updated weights for policy 0, policy_version 14280 (0.0008) +[2023-10-14 05:36:44,675][100936] Updated weights for policy 0, policy_version 14290 (0.0009) +[2023-10-14 05:36:45,045][100936] Updated weights for policy 0, policy_version 14300 (0.0008) +[2023-10-14 05:36:47,332][100917] Updated weights for policy 1, policy_version 14282 (0.0007) +[2023-10-14 05:36:47,705][100917] Updated weights for policy 1, policy_version 14292 (0.0009) +[2023-10-14 05:36:48,075][100917] Updated weights for policy 1, policy_version 14302 (0.0010) +[2023-10-14 05:36:48,512][99942] Fps is (10 sec: 13107.8, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 29294592. Throughput: 0: 1674.4, 1: 1660.5. Samples: 7331374. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:36:48,512][99942] Avg episode reward: [(0, '0.720'), (1, '0.800')] +[2023-10-14 05:36:49,120][100936] Updated weights for policy 0, policy_version 14310 (0.0007) +[2023-10-14 05:36:49,493][100936] Updated weights for policy 0, policy_version 14320 (0.0008) +[2023-10-14 05:36:49,861][100936] Updated weights for policy 0, policy_version 14330 (0.0009) +[2023-10-14 05:36:52,004][100917] Updated weights for policy 1, policy_version 14312 (0.0008) +[2023-10-14 05:36:52,374][100917] Updated weights for policy 1, policy_version 14322 (0.0009) +[2023-10-14 05:36:52,743][100917] Updated weights for policy 1, policy_version 14332 (0.0008) +[2023-10-14 05:36:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 29360128. Throughput: 0: 1664.7, 1: 1648.2. Samples: 7350632. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:36:53,512][99942] Avg episode reward: [(0, '0.720'), (1, '0.800')] +[2023-10-14 05:36:54,126][100936] Updated weights for policy 0, policy_version 14340 (0.0008) +[2023-10-14 05:36:54,520][100936] Updated weights for policy 0, policy_version 14350 (0.0011) +[2023-10-14 05:36:54,890][100936] Updated weights for policy 0, policy_version 14360 (0.0010) +[2023-10-14 05:36:56,965][100917] Updated weights for policy 1, policy_version 14342 (0.0009) +[2023-10-14 05:36:57,337][100917] Updated weights for policy 1, policy_version 14352 (0.0010) +[2023-10-14 05:36:57,708][100917] Updated weights for policy 1, policy_version 14362 (0.0007) +[2023-10-14 05:36:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 29425664. Throughput: 0: 1664.4, 1: 1665.9. Samples: 7360776. Policy #0 lag: (min: 27.0, avg: 29.5, max: 59.0) +[2023-10-14 05:36:58,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.800')] +[2023-10-14 05:36:58,918][100936] Updated weights for policy 0, policy_version 14370 (0.0010) +[2023-10-14 05:36:59,289][100936] Updated weights for policy 0, policy_version 14380 (0.0007) +[2023-10-14 05:36:59,659][100936] Updated weights for policy 0, policy_version 14390 (0.0008) +[2023-10-14 05:37:00,030][100936] Updated weights for policy 0, policy_version 14400 (0.0007) +[2023-10-14 05:37:01,856][100917] Updated weights for policy 1, policy_version 14372 (0.0007) +[2023-10-14 05:37:02,238][100917] Updated weights for policy 1, policy_version 14382 (0.0007) +[2023-10-14 05:37:02,611][100917] Updated weights for policy 1, policy_version 14392 (0.0007) +[2023-10-14 05:37:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 29491200. Throughput: 0: 1661.3, 1: 1663.0. Samples: 7380896. Policy #0 lag: (min: 27.0, avg: 29.5, max: 59.0) +[2023-10-14 05:37:03,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.800')] +[2023-10-14 05:37:04,173][100936] Updated weights for policy 0, policy_version 14410 (0.0009) +[2023-10-14 05:37:04,553][100936] Updated weights for policy 0, policy_version 14420 (0.0007) +[2023-10-14 05:37:04,923][100936] Updated weights for policy 0, policy_version 14430 (0.0008) +[2023-10-14 05:37:06,589][100917] Updated weights for policy 1, policy_version 14402 (0.0008) +[2023-10-14 05:37:06,954][100917] Updated weights for policy 1, policy_version 14412 (0.0008) +[2023-10-14 05:37:07,337][100917] Updated weights for policy 1, policy_version 14422 (0.0011) +[2023-10-14 05:37:07,705][100917] Updated weights for policy 1, policy_version 14432 (0.0008) +[2023-10-14 05:37:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 29556736. Throughput: 0: 1661.1, 1: 1658.4. Samples: 7400460. Policy #0 lag: (min: 27.0, avg: 29.5, max: 59.0) +[2023-10-14 05:37:08,512][99942] Avg episode reward: [(0, '0.720'), (1, '0.790')] +[2023-10-14 05:37:09,041][100936] Updated weights for policy 0, policy_version 14440 (0.0009) +[2023-10-14 05:37:09,427][100936] Updated weights for policy 0, policy_version 14450 (0.0008) +[2023-10-14 05:37:09,797][100936] Updated weights for policy 0, policy_version 14460 (0.0010) +[2023-10-14 05:37:11,777][100917] Updated weights for policy 1, policy_version 14442 (0.0008) +[2023-10-14 05:37:12,152][100917] Updated weights for policy 1, policy_version 14452 (0.0008) +[2023-10-14 05:37:12,534][100917] Updated weights for policy 1, policy_version 14462 (0.0007) +[2023-10-14 05:37:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 29622272. Throughput: 0: 1658.6, 1: 1673.2. Samples: 7410752. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:37:13,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.790')] +[2023-10-14 05:37:13,858][100936] Updated weights for policy 0, policy_version 14470 (0.0008) +[2023-10-14 05:37:14,232][100936] Updated weights for policy 0, policy_version 14480 (0.0008) +[2023-10-14 05:37:14,608][100936] Updated weights for policy 0, policy_version 14490 (0.0007) +[2023-10-14 05:37:16,520][100917] Updated weights for policy 1, policy_version 14472 (0.0010) +[2023-10-14 05:37:16,889][100917] Updated weights for policy 1, policy_version 14482 (0.0010) +[2023-10-14 05:37:17,268][100917] Updated weights for policy 1, policy_version 14492 (0.0008) +[2023-10-14 05:37:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 29687808. Throughput: 0: 1665.8, 1: 1655.2. Samples: 7430602. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:37:18,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.790')] +[2023-10-14 05:37:18,625][100936] Updated weights for policy 0, policy_version 14500 (0.0008) +[2023-10-14 05:37:18,991][100936] Updated weights for policy 0, policy_version 14510 (0.0009) +[2023-10-14 05:37:19,363][100936] Updated weights for policy 0, policy_version 14520 (0.0007) +[2023-10-14 05:37:21,470][100917] Updated weights for policy 1, policy_version 14502 (0.0009) +[2023-10-14 05:37:21,840][100917] Updated weights for policy 1, policy_version 14512 (0.0009) +[2023-10-14 05:37:22,217][100917] Updated weights for policy 1, policy_version 14522 (0.0007) +[2023-10-14 05:37:23,430][100936] Updated weights for policy 0, policy_version 14530 (0.0007) +[2023-10-14 05:37:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 29753344. Throughput: 0: 1660.2, 1: 1662.9. Samples: 7450422. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:37:23,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.790')] +[2023-10-14 05:37:23,800][100936] Updated weights for policy 0, policy_version 14540 (0.0009) +[2023-10-14 05:37:24,170][100936] Updated weights for policy 0, policy_version 14550 (0.0009) +[2023-10-14 05:37:24,545][100936] Updated weights for policy 0, policy_version 14560 (0.0007) +[2023-10-14 05:37:26,220][100917] Updated weights for policy 1, policy_version 14532 (0.0009) +[2023-10-14 05:37:26,591][100917] Updated weights for policy 1, policy_version 14542 (0.0008) +[2023-10-14 05:37:26,971][100917] Updated weights for policy 1, policy_version 14552 (0.0010) +[2023-10-14 05:37:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13329.4). Total num frames: 29818880. Throughput: 0: 1658.7, 1: 1667.2. Samples: 7460702. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:37:28,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.790')] +[2023-10-14 05:37:28,757][100936] Updated weights for policy 0, policy_version 14570 (0.0009) +[2023-10-14 05:37:29,138][100936] Updated weights for policy 0, policy_version 14580 (0.0009) +[2023-10-14 05:37:29,514][100936] Updated weights for policy 0, policy_version 14590 (0.0011) +[2023-10-14 05:37:31,118][100917] Updated weights for policy 1, policy_version 14562 (0.0009) +[2023-10-14 05:37:31,534][100917] Updated weights for policy 1, policy_version 14572 (0.0010) +[2023-10-14 05:37:31,904][100917] Updated weights for policy 1, policy_version 14582 (0.0007) +[2023-10-14 05:37:32,275][100917] Updated weights for policy 1, policy_version 14592 (0.0007) +[2023-10-14 05:37:33,460][100936] Updated weights for policy 0, policy_version 14600 (0.0009) +[2023-10-14 05:37:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 29884416. Throughput: 0: 1659.7, 1: 1648.0. Samples: 7480220. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:37:33,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.790')] +[2023-10-14 05:37:33,834][100936] Updated weights for policy 0, policy_version 14610 (0.0008) +[2023-10-14 05:37:34,201][100936] Updated weights for policy 0, policy_version 14620 (0.0007) +[2023-10-14 05:37:36,503][100917] Updated weights for policy 1, policy_version 14602 (0.0010) +[2023-10-14 05:37:36,879][100917] Updated weights for policy 1, policy_version 14612 (0.0009) +[2023-10-14 05:37:37,263][100917] Updated weights for policy 1, policy_version 14622 (0.0007) +[2023-10-14 05:37:38,492][100936] Updated weights for policy 0, policy_version 14630 (0.0009) +[2023-10-14 05:37:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 29949952. Throughput: 0: 1655.9, 1: 1663.7. Samples: 7500014. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:37:38,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.790')] +[2023-10-14 05:37:38,520][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000014624_14974976.pth... +[2023-10-14 05:37:38,551][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000013056_13369344.pth +[2023-10-14 05:37:38,861][100936] Updated weights for policy 0, policy_version 14640 (0.0007) +[2023-10-14 05:37:39,242][100936] Updated weights for policy 0, policy_version 14650 (0.0008) +[2023-10-14 05:37:39,461][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000014656_15007744.pth... +[2023-10-14 05:37:39,491][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000013088_13402112.pth +[2023-10-14 05:37:41,168][100917] Updated weights for policy 1, policy_version 14632 (0.0009) +[2023-10-14 05:37:41,539][100917] Updated weights for policy 1, policy_version 14642 (0.0008) +[2023-10-14 05:37:41,925][100917] Updated weights for policy 1, policy_version 14652 (0.0007) +[2023-10-14 05:37:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 30015488. Throughput: 0: 1660.8, 1: 1664.8. Samples: 7510424. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:37:43,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.790')] +[2023-10-14 05:37:43,514][100936] Updated weights for policy 0, policy_version 14660 (0.0010) +[2023-10-14 05:37:43,885][100936] Updated weights for policy 0, policy_version 14670 (0.0009) +[2023-10-14 05:37:44,260][100936] Updated weights for policy 0, policy_version 14680 (0.0007) +[2023-10-14 05:37:46,022][100917] Updated weights for policy 1, policy_version 14662 (0.0008) +[2023-10-14 05:37:46,387][100917] Updated weights for policy 1, policy_version 14672 (0.0009) +[2023-10-14 05:37:46,763][100917] Updated weights for policy 1, policy_version 14682 (0.0010) +[2023-10-14 05:37:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 30081024. Throughput: 0: 1658.0, 1: 1646.8. Samples: 7529614. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:37:48,512][99942] Avg episode reward: [(0, '0.720'), (1, '0.790')] +[2023-10-14 05:37:48,549][100936] Updated weights for policy 0, policy_version 14690 (0.0009) +[2023-10-14 05:37:48,918][100936] Updated weights for policy 0, policy_version 14700 (0.0010) +[2023-10-14 05:37:49,300][100936] Updated weights for policy 0, policy_version 14710 (0.0010) +[2023-10-14 05:37:49,672][100936] Updated weights for policy 0, policy_version 14720 (0.0010) +[2023-10-14 05:37:51,056][100917] Updated weights for policy 1, policy_version 14692 (0.0009) +[2023-10-14 05:37:51,427][100917] Updated weights for policy 1, policy_version 14702 (0.0009) +[2023-10-14 05:37:51,792][100917] Updated weights for policy 1, policy_version 14712 (0.0009) +[2023-10-14 05:37:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 30146560. Throughput: 0: 1653.3, 1: 1663.4. Samples: 7549714. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:37:53,512][99942] Avg episode reward: [(0, '0.720'), (1, '0.790')] +[2023-10-14 05:37:53,783][100936] Updated weights for policy 0, policy_version 14730 (0.0009) +[2023-10-14 05:37:54,153][100936] Updated weights for policy 0, policy_version 14740 (0.0008) +[2023-10-14 05:37:54,530][100936] Updated weights for policy 0, policy_version 14750 (0.0008) +[2023-10-14 05:37:55,834][100917] Updated weights for policy 1, policy_version 14722 (0.0008) +[2023-10-14 05:37:56,214][100917] Updated weights for policy 1, policy_version 14732 (0.0009) +[2023-10-14 05:37:56,590][100917] Updated weights for policy 1, policy_version 14742 (0.0010) +[2023-10-14 05:37:56,960][100917] Updated weights for policy 1, policy_version 14752 (0.0009) +[2023-10-14 05:37:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 30212096. Throughput: 0: 1654.0, 1: 1656.7. Samples: 7559732. Policy #0 lag: (min: 5.0, avg: 13.0, max: 37.0) +[2023-10-14 05:37:58,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.790')] +[2023-10-14 05:37:58,696][100936] Updated weights for policy 0, policy_version 14760 (0.0008) +[2023-10-14 05:37:59,059][100936] Updated weights for policy 0, policy_version 14770 (0.0008) +[2023-10-14 05:37:59,432][100936] Updated weights for policy 0, policy_version 14780 (0.0009) +[2023-10-14 05:38:01,036][100917] Updated weights for policy 1, policy_version 14762 (0.0007) +[2023-10-14 05:38:01,400][100917] Updated weights for policy 1, policy_version 14772 (0.0010) +[2023-10-14 05:38:01,779][100917] Updated weights for policy 1, policy_version 14782 (0.0007) +[2023-10-14 05:38:03,481][100936] Updated weights for policy 0, policy_version 14790 (0.0009) +[2023-10-14 05:38:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 30277632. Throughput: 0: 1647.4, 1: 1651.6. Samples: 7579058. Policy #0 lag: (min: 5.0, avg: 13.0, max: 37.0) +[2023-10-14 05:38:03,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.860')] +[2023-10-14 05:38:03,852][100936] Updated weights for policy 0, policy_version 14800 (0.0008) +[2023-10-14 05:38:04,223][100936] Updated weights for policy 0, policy_version 14810 (0.0009) +[2023-10-14 05:38:06,010][100917] Updated weights for policy 1, policy_version 14792 (0.0010) +[2023-10-14 05:38:06,383][100917] Updated weights for policy 1, policy_version 14802 (0.0010) +[2023-10-14 05:38:06,764][100917] Updated weights for policy 1, policy_version 14812 (0.0007) +[2023-10-14 05:38:08,294][100936] Updated weights for policy 0, policy_version 14820 (0.0008) +[2023-10-14 05:38:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 30343168. Throughput: 0: 1645.9, 1: 1660.8. Samples: 7599226. Policy #0 lag: (min: 5.0, avg: 13.0, max: 37.0) +[2023-10-14 05:38:08,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.860')] +[2023-10-14 05:38:08,678][100936] Updated weights for policy 0, policy_version 14830 (0.0008) +[2023-10-14 05:38:09,051][100936] Updated weights for policy 0, policy_version 14840 (0.0007) +[2023-10-14 05:38:10,883][100917] Updated weights for policy 1, policy_version 14822 (0.0008) +[2023-10-14 05:38:11,242][100917] Updated weights for policy 1, policy_version 14832 (0.0012) +[2023-10-14 05:38:11,619][100917] Updated weights for policy 1, policy_version 14842 (0.0011) +[2023-10-14 05:38:13,065][100936] Updated weights for policy 0, policy_version 14850 (0.0007) +[2023-10-14 05:38:13,440][100936] Updated weights for policy 0, policy_version 14860 (0.0008) +[2023-10-14 05:38:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 30408704. Throughput: 0: 1651.9, 1: 1652.0. Samples: 7609378. Policy #0 lag: (min: 31.0, avg: 41.3, max: 63.0) +[2023-10-14 05:38:13,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.790')] +[2023-10-14 05:38:13,812][100936] Updated weights for policy 0, policy_version 14870 (0.0008) +[2023-10-14 05:38:14,195][100936] Updated weights for policy 0, policy_version 14880 (0.0010) +[2023-10-14 05:38:15,867][100917] Updated weights for policy 1, policy_version 14852 (0.0011) +[2023-10-14 05:38:16,247][100917] Updated weights for policy 1, policy_version 14862 (0.0010) +[2023-10-14 05:38:16,623][100917] Updated weights for policy 1, policy_version 14872 (0.0011) +[2023-10-14 05:38:18,272][100936] Updated weights for policy 0, policy_version 14890 (0.0010) +[2023-10-14 05:38:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 30474240. Throughput: 0: 1653.0, 1: 1650.2. Samples: 7628862. Policy #0 lag: (min: 31.0, avg: 41.3, max: 63.0) +[2023-10-14 05:38:18,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.790')] +[2023-10-14 05:38:18,642][100936] Updated weights for policy 0, policy_version 14900 (0.0008) +[2023-10-14 05:38:19,015][100936] Updated weights for policy 0, policy_version 14910 (0.0010) +[2023-10-14 05:38:20,897][100917] Updated weights for policy 1, policy_version 14882 (0.0009) +[2023-10-14 05:38:21,303][100917] Updated weights for policy 1, policy_version 14892 (0.0010) +[2023-10-14 05:38:21,676][100917] Updated weights for policy 1, policy_version 14902 (0.0009) +[2023-10-14 05:38:22,041][100917] Updated weights for policy 1, policy_version 14912 (0.0011) +[2023-10-14 05:38:23,130][100936] Updated weights for policy 0, policy_version 14920 (0.0007) +[2023-10-14 05:38:23,492][100936] Updated weights for policy 0, policy_version 14930 (0.0007) +[2023-10-14 05:38:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 30539776. Throughput: 0: 1651.5, 1: 1652.2. Samples: 7648680. Policy #0 lag: (min: 31.0, avg: 41.3, max: 63.0) +[2023-10-14 05:38:23,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.790')] +[2023-10-14 05:38:23,850][100936] Updated weights for policy 0, policy_version 14940 (0.0007) +[2023-10-14 05:38:26,115][100917] Updated weights for policy 1, policy_version 14922 (0.0010) +[2023-10-14 05:38:26,489][100917] Updated weights for policy 1, policy_version 14932 (0.0012) +[2023-10-14 05:38:26,864][100917] Updated weights for policy 1, policy_version 14942 (0.0010) +[2023-10-14 05:38:28,007][100936] Updated weights for policy 0, policy_version 14950 (0.0008) +[2023-10-14 05:38:28,394][100936] Updated weights for policy 0, policy_version 14960 (0.0010) +[2023-10-14 05:38:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 30605312. Throughput: 0: 1662.3, 1: 1645.6. Samples: 7659276. Policy #0 lag: (min: 5.0, avg: 8.7, max: 37.0) +[2023-10-14 05:38:28,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.790')] +[2023-10-14 05:38:28,773][100936] Updated weights for policy 0, policy_version 14970 (0.0009) +[2023-10-14 05:38:31,020][100917] Updated weights for policy 1, policy_version 14952 (0.0008) +[2023-10-14 05:38:31,390][100917] Updated weights for policy 1, policy_version 14962 (0.0010) +[2023-10-14 05:38:31,754][100917] Updated weights for policy 1, policy_version 14972 (0.0007) +[2023-10-14 05:38:32,679][100936] Updated weights for policy 0, policy_version 14980 (0.0008) +[2023-10-14 05:38:33,049][100936] Updated weights for policy 0, policy_version 14990 (0.0008) +[2023-10-14 05:38:33,423][100936] Updated weights for policy 0, policy_version 15000 (0.0011) +[2023-10-14 05:38:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 30670848. Throughput: 0: 1663.4, 1: 1645.3. Samples: 7678506. Policy #0 lag: (min: 5.0, avg: 8.7, max: 37.0) +[2023-10-14 05:38:33,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.790')] +[2023-10-14 05:38:35,949][100917] Updated weights for policy 1, policy_version 14982 (0.0008) +[2023-10-14 05:38:36,320][100917] Updated weights for policy 1, policy_version 14992 (0.0010) +[2023-10-14 05:38:36,696][100917] Updated weights for policy 1, policy_version 15002 (0.0009) +[2023-10-14 05:38:37,666][100936] Updated weights for policy 0, policy_version 15010 (0.0009) +[2023-10-14 05:38:38,045][100936] Updated weights for policy 0, policy_version 15020 (0.0011) +[2023-10-14 05:38:38,420][100936] Updated weights for policy 0, policy_version 15030 (0.0009) +[2023-10-14 05:38:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 30736384. Throughput: 0: 1646.4, 1: 1652.1. Samples: 7698148. Policy #0 lag: (min: 5.0, avg: 8.7, max: 37.0) +[2023-10-14 05:38:38,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.920')] +[2023-10-14 05:38:38,789][100936] Updated weights for policy 0, policy_version 15040 (0.0009) +[2023-10-14 05:38:40,715][100917] Updated weights for policy 1, policy_version 15012 (0.0009) +[2023-10-14 05:38:41,086][100917] Updated weights for policy 1, policy_version 15022 (0.0007) +[2023-10-14 05:38:41,451][100917] Updated weights for policy 1, policy_version 15032 (0.0008) +[2023-10-14 05:38:42,924][100936] Updated weights for policy 0, policy_version 15050 (0.0007) +[2023-10-14 05:38:43,296][100936] Updated weights for policy 0, policy_version 15060 (0.0008) +[2023-10-14 05:38:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 30801920. Throughput: 0: 1664.7, 1: 1650.1. Samples: 7708898. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 05:38:43,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.920')] +[2023-10-14 05:38:43,667][100936] Updated weights for policy 0, policy_version 15070 (0.0007) +[2023-10-14 05:38:45,631][100917] Updated weights for policy 1, policy_version 15042 (0.0008) +[2023-10-14 05:38:46,005][100917] Updated weights for policy 1, policy_version 15052 (0.0010) +[2023-10-14 05:38:46,380][100917] Updated weights for policy 1, policy_version 15062 (0.0011) +[2023-10-14 05:38:46,754][100917] Updated weights for policy 1, policy_version 15072 (0.0009) +[2023-10-14 05:38:47,594][100936] Updated weights for policy 0, policy_version 15080 (0.0010) +[2023-10-14 05:38:47,972][100936] Updated weights for policy 0, policy_version 15090 (0.0010) +[2023-10-14 05:38:48,329][100936] Updated weights for policy 0, policy_version 15100 (0.0010) +[2023-10-14 05:38:48,512][99942] Fps is (10 sec: 16384.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 30900224. Throughput: 0: 1672.9, 1: 1651.6. Samples: 7728660. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 05:38:48,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.920')] +[2023-10-14 05:38:50,828][100917] Updated weights for policy 1, policy_version 15082 (0.0008) +[2023-10-14 05:38:51,194][100917] Updated weights for policy 1, policy_version 15092 (0.0010) +[2023-10-14 05:38:51,572][100917] Updated weights for policy 1, policy_version 15102 (0.0008) +[2023-10-14 05:38:52,459][100936] Updated weights for policy 0, policy_version 15110 (0.0010) +[2023-10-14 05:38:52,826][100936] Updated weights for policy 0, policy_version 15120 (0.0008) +[2023-10-14 05:38:53,198][100936] Updated weights for policy 0, policy_version 15130 (0.0007) +[2023-10-14 05:38:53,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 30965760. Throughput: 0: 1653.9, 1: 1656.3. Samples: 7748184. Policy #0 lag: (min: 24.0, avg: 42.7, max: 56.0) +[2023-10-14 05:38:53,512][99942] Avg episode reward: [(0, '0.720'), (1, '0.920')] +[2023-10-14 05:38:55,685][100917] Updated weights for policy 1, policy_version 15112 (0.0009) +[2023-10-14 05:38:56,061][100917] Updated weights for policy 1, policy_version 15122 (0.0009) +[2023-10-14 05:38:56,432][100917] Updated weights for policy 1, policy_version 15132 (0.0007) +[2023-10-14 05:38:57,326][100936] Updated weights for policy 0, policy_version 15140 (0.0009) +[2023-10-14 05:38:57,695][100936] Updated weights for policy 0, policy_version 15150 (0.0010) +[2023-10-14 05:38:58,062][100936] Updated weights for policy 0, policy_version 15160 (0.0010) +[2023-10-14 05:38:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 31031296. Throughput: 0: 1674.5, 1: 1648.5. Samples: 7758914. Policy #0 lag: (min: 24.0, avg: 42.7, max: 56.0) +[2023-10-14 05:38:58,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.920')] +[2023-10-14 05:39:00,477][100917] Updated weights for policy 1, policy_version 15142 (0.0009) +[2023-10-14 05:39:00,863][100917] Updated weights for policy 1, policy_version 15152 (0.0007) +[2023-10-14 05:39:01,231][100917] Updated weights for policy 1, policy_version 15162 (0.0010) +[2023-10-14 05:39:02,308][100936] Updated weights for policy 0, policy_version 15170 (0.0007) +[2023-10-14 05:39:02,678][100936] Updated weights for policy 0, policy_version 15180 (0.0008) +[2023-10-14 05:39:03,041][100936] Updated weights for policy 0, policy_version 15190 (0.0010) +[2023-10-14 05:39:03,415][100936] Updated weights for policy 0, policy_version 15200 (0.0009) +[2023-10-14 05:39:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 31096832. Throughput: 0: 1663.3, 1: 1657.9. Samples: 7778316. Policy #0 lag: (min: 24.0, avg: 42.7, max: 56.0) +[2023-10-14 05:39:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:39:05,451][100917] Updated weights for policy 1, policy_version 15172 (0.0009) +[2023-10-14 05:39:05,854][100917] Updated weights for policy 1, policy_version 15182 (0.0008) +[2023-10-14 05:39:06,216][100917] Updated weights for policy 1, policy_version 15192 (0.0011) +[2023-10-14 05:39:07,505][100936] Updated weights for policy 0, policy_version 15210 (0.0008) +[2023-10-14 05:39:07,876][100936] Updated weights for policy 0, policy_version 15220 (0.0009) +[2023-10-14 05:39:08,234][100936] Updated weights for policy 0, policy_version 15230 (0.0010) +[2023-10-14 05:39:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 31162368. Throughput: 0: 1645.4, 1: 1658.0. Samples: 7797334. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:39:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:39:10,310][100917] Updated weights for policy 1, policy_version 15202 (0.0009) +[2023-10-14 05:39:10,684][100917] Updated weights for policy 1, policy_version 15212 (0.0010) +[2023-10-14 05:39:11,053][100917] Updated weights for policy 1, policy_version 15222 (0.0011) +[2023-10-14 05:39:11,425][100917] Updated weights for policy 1, policy_version 15232 (0.0009) +[2023-10-14 05:39:12,560][100936] Updated weights for policy 0, policy_version 15240 (0.0008) +[2023-10-14 05:39:12,927][100936] Updated weights for policy 0, policy_version 15250 (0.0012) +[2023-10-14 05:39:13,300][100936] Updated weights for policy 0, policy_version 15260 (0.0009) +[2023-10-14 05:39:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 31227904. Throughput: 0: 1658.4, 1: 1649.2. Samples: 7808118. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:39:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:39:15,540][100917] Updated weights for policy 1, policy_version 15242 (0.0008) +[2023-10-14 05:39:15,909][100917] Updated weights for policy 1, policy_version 15252 (0.0009) +[2023-10-14 05:39:16,277][100917] Updated weights for policy 1, policy_version 15262 (0.0009) +[2023-10-14 05:39:17,553][100936] Updated weights for policy 0, policy_version 15270 (0.0009) +[2023-10-14 05:39:17,932][100936] Updated weights for policy 0, policy_version 15280 (0.0007) +[2023-10-14 05:39:18,310][100936] Updated weights for policy 0, policy_version 15290 (0.0008) +[2023-10-14 05:39:18,512][99942] Fps is (10 sec: 9830.3, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 31260672. Throughput: 0: 1655.6, 1: 1664.0. Samples: 7827890. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:39:18,514][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:39:20,489][100917] Updated weights for policy 1, policy_version 15272 (0.0007) +[2023-10-14 05:39:20,859][100917] Updated weights for policy 1, policy_version 15282 (0.0010) +[2023-10-14 05:39:21,229][100917] Updated weights for policy 1, policy_version 15292 (0.0010) +[2023-10-14 05:39:22,434][100936] Updated weights for policy 0, policy_version 15300 (0.0009) +[2023-10-14 05:39:22,804][100936] Updated weights for policy 0, policy_version 15310 (0.0008) +[2023-10-14 05:39:23,166][100936] Updated weights for policy 0, policy_version 15320 (0.0008) +[2023-10-14 05:39:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 31358976. Throughput: 0: 1650.0, 1: 1664.9. Samples: 7847320. Policy #0 lag: (min: 31.0, avg: 41.7, max: 63.0) +[2023-10-14 05:39:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:39:25,180][100917] Updated weights for policy 1, policy_version 15302 (0.0009) +[2023-10-14 05:39:25,559][100917] Updated weights for policy 1, policy_version 15312 (0.0007) +[2023-10-14 05:39:25,936][100917] Updated weights for policy 1, policy_version 15322 (0.0007) +[2023-10-14 05:39:27,375][100936] Updated weights for policy 0, policy_version 15330 (0.0009) +[2023-10-14 05:39:27,738][100936] Updated weights for policy 0, policy_version 15340 (0.0007) +[2023-10-14 05:39:28,109][100936] Updated weights for policy 0, policy_version 15350 (0.0008) +[2023-10-14 05:39:28,485][100936] Updated weights for policy 0, policy_version 15360 (0.0007) +[2023-10-14 05:39:28,512][99942] Fps is (10 sec: 16384.6, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 31424512. Throughput: 0: 1655.8, 1: 1650.9. Samples: 7857702. Policy #0 lag: (min: 31.0, avg: 41.7, max: 63.0) +[2023-10-14 05:39:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:39:30,067][100917] Updated weights for policy 1, policy_version 15332 (0.0008) +[2023-10-14 05:39:30,436][100917] Updated weights for policy 1, policy_version 15342 (0.0008) +[2023-10-14 05:39:30,815][100917] Updated weights for policy 1, policy_version 15352 (0.0007) +[2023-10-14 05:39:32,492][100936] Updated weights for policy 0, policy_version 15370 (0.0010) +[2023-10-14 05:39:32,867][100936] Updated weights for policy 0, policy_version 15380 (0.0008) +[2023-10-14 05:39:33,237][100936] Updated weights for policy 0, policy_version 15390 (0.0008) +[2023-10-14 05:39:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 31490048. Throughput: 0: 1643.4, 1: 1661.0. Samples: 7877360. Policy #0 lag: (min: 31.0, avg: 41.7, max: 63.0) +[2023-10-14 05:39:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:39:35,017][100917] Updated weights for policy 1, policy_version 15362 (0.0007) +[2023-10-14 05:39:35,382][100917] Updated weights for policy 1, policy_version 15372 (0.0008) +[2023-10-14 05:39:35,759][100917] Updated weights for policy 1, policy_version 15382 (0.0009) +[2023-10-14 05:39:36,129][100917] Updated weights for policy 1, policy_version 15392 (0.0009) +[2023-10-14 05:39:37,426][100936] Updated weights for policy 0, policy_version 15400 (0.0008) +[2023-10-14 05:39:37,795][100936] Updated weights for policy 0, policy_version 15410 (0.0009) +[2023-10-14 05:39:38,164][100936] Updated weights for policy 0, policy_version 15420 (0.0009) +[2023-10-14 05:39:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13329.3). Total num frames: 31555584. Throughput: 0: 1640.4, 1: 1657.1. Samples: 7896570. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:39:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:39:38,523][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000015392_15761408.pth... +[2023-10-14 05:39:38,523][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000015424_15794176.pth... +[2023-10-14 05:39:38,564][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000013856_14188544.pth +[2023-10-14 05:39:38,564][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000013856_14188544.pth +[2023-10-14 05:39:38,569][100681] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p1/milestones/checkpoint_000015392_15761408.pth +[2023-10-14 05:39:38,569][100560] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p0/milestones/checkpoint_000015424_15794176.pth +[2023-10-14 05:39:40,183][100917] Updated weights for policy 1, policy_version 15402 (0.0009) +[2023-10-14 05:39:40,565][100917] Updated weights for policy 1, policy_version 15412 (0.0010) +[2023-10-14 05:39:40,925][100917] Updated weights for policy 1, policy_version 15422 (0.0010) +[2023-10-14 05:39:42,394][100936] Updated weights for policy 0, policy_version 15430 (0.0007) +[2023-10-14 05:39:42,754][100936] Updated weights for policy 0, policy_version 15440 (0.0007) +[2023-10-14 05:39:43,124][100936] Updated weights for policy 0, policy_version 15450 (0.0007) +[2023-10-14 05:39:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 31621120. Throughput: 0: 1648.1, 1: 1643.5. Samples: 7907034. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:39:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:39:44,984][100917] Updated weights for policy 1, policy_version 15432 (0.0009) +[2023-10-14 05:39:45,356][100917] Updated weights for policy 1, policy_version 15442 (0.0007) +[2023-10-14 05:39:45,737][100917] Updated weights for policy 1, policy_version 15452 (0.0010) +[2023-10-14 05:39:47,039][100936] Updated weights for policy 0, policy_version 15460 (0.0008) +[2023-10-14 05:39:47,417][100936] Updated weights for policy 0, policy_version 15470 (0.0009) +[2023-10-14 05:39:47,788][100936] Updated weights for policy 0, policy_version 15480 (0.0009) +[2023-10-14 05:39:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 31686656. Throughput: 0: 1648.0, 1: 1657.4. Samples: 7927062. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:39:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:39:49,919][100917] Updated weights for policy 1, policy_version 15462 (0.0008) +[2023-10-14 05:39:50,296][100917] Updated weights for policy 1, policy_version 15472 (0.0009) +[2023-10-14 05:39:50,674][100917] Updated weights for policy 1, policy_version 15482 (0.0009) +[2023-10-14 05:39:52,030][100936] Updated weights for policy 0, policy_version 15490 (0.0010) +[2023-10-14 05:39:52,398][100936] Updated weights for policy 0, policy_version 15500 (0.0010) +[2023-10-14 05:39:52,772][100936] Updated weights for policy 0, policy_version 15510 (0.0009) +[2023-10-14 05:39:53,146][100936] Updated weights for policy 0, policy_version 15520 (0.0007) +[2023-10-14 05:39:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 31752192. Throughput: 0: 1650.4, 1: 1663.1. Samples: 7946438. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 05:39:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 05:39:54,955][100917] Updated weights for policy 1, policy_version 15492 (0.0009) +[2023-10-14 05:39:55,356][100917] Updated weights for policy 1, policy_version 15502 (0.0007) +[2023-10-14 05:39:55,727][100917] Updated weights for policy 1, policy_version 15512 (0.0007) +[2023-10-14 05:39:57,316][100936] Updated weights for policy 0, policy_version 15530 (0.0010) +[2023-10-14 05:39:57,690][100936] Updated weights for policy 0, policy_version 15540 (0.0009) +[2023-10-14 05:39:58,057][100936] Updated weights for policy 0, policy_version 15550 (0.0007) +[2023-10-14 05:39:58,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 31817728. Throughput: 0: 1654.2, 1: 1646.4. Samples: 7956644. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 05:39:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:39:59,811][100917] Updated weights for policy 1, policy_version 15522 (0.0009) +[2023-10-14 05:40:00,182][100917] Updated weights for policy 1, policy_version 15532 (0.0007) +[2023-10-14 05:40:00,544][100917] Updated weights for policy 1, policy_version 15542 (0.0007) +[2023-10-14 05:40:00,922][100917] Updated weights for policy 1, policy_version 15552 (0.0009) +[2023-10-14 05:40:02,213][100936] Updated weights for policy 0, policy_version 15560 (0.0010) +[2023-10-14 05:40:02,591][100936] Updated weights for policy 0, policy_version 15570 (0.0010) +[2023-10-14 05:40:02,973][100936] Updated weights for policy 0, policy_version 15580 (0.0007) +[2023-10-14 05:40:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 31883264. Throughput: 0: 1643.4, 1: 1649.6. Samples: 7976074. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 05:40:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:40:05,121][100917] Updated weights for policy 1, policy_version 15562 (0.0008) +[2023-10-14 05:40:05,493][100917] Updated weights for policy 1, policy_version 15572 (0.0010) +[2023-10-14 05:40:05,873][100917] Updated weights for policy 1, policy_version 15582 (0.0009) +[2023-10-14 05:40:06,974][100936] Updated weights for policy 0, policy_version 15590 (0.0009) +[2023-10-14 05:40:07,350][100936] Updated weights for policy 0, policy_version 15600 (0.0009) +[2023-10-14 05:40:07,719][100936] Updated weights for policy 0, policy_version 15610 (0.0007) +[2023-10-14 05:40:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 31948800. Throughput: 0: 1654.6, 1: 1645.2. Samples: 7995814. Policy #0 lag: (min: 21.0, avg: 29.0, max: 53.0) +[2023-10-14 05:40:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:40:09,888][100917] Updated weights for policy 1, policy_version 15592 (0.0010) +[2023-10-14 05:40:10,255][100917] Updated weights for policy 1, policy_version 15602 (0.0009) +[2023-10-14 05:40:10,635][100917] Updated weights for policy 1, policy_version 15612 (0.0010) +[2023-10-14 05:40:11,830][100936] Updated weights for policy 0, policy_version 15620 (0.0007) +[2023-10-14 05:40:12,209][100936] Updated weights for policy 0, policy_version 15630 (0.0008) +[2023-10-14 05:40:12,576][100936] Updated weights for policy 0, policy_version 15640 (0.0007) +[2023-10-14 05:40:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 32014336. Throughput: 0: 1662.4, 1: 1637.8. Samples: 8006208. Policy #0 lag: (min: 21.0, avg: 29.0, max: 53.0) +[2023-10-14 05:40:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:40:14,861][100917] Updated weights for policy 1, policy_version 15622 (0.0008) +[2023-10-14 05:40:15,231][100917] Updated weights for policy 1, policy_version 15632 (0.0011) +[2023-10-14 05:40:15,606][100917] Updated weights for policy 1, policy_version 15642 (0.0010) +[2023-10-14 05:40:16,749][100936] Updated weights for policy 0, policy_version 15650 (0.0008) +[2023-10-14 05:40:17,118][100936] Updated weights for policy 0, policy_version 15660 (0.0012) +[2023-10-14 05:40:17,485][100936] Updated weights for policy 0, policy_version 15670 (0.0011) +[2023-10-14 05:40:17,862][100936] Updated weights for policy 0, policy_version 15680 (0.0007) +[2023-10-14 05:40:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 32079872. Throughput: 0: 1649.9, 1: 1645.6. Samples: 8025654. Policy #0 lag: (min: 21.0, avg: 29.0, max: 53.0) +[2023-10-14 05:40:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:40:19,886][100917] Updated weights for policy 1, policy_version 15652 (0.0010) +[2023-10-14 05:40:20,258][100917] Updated weights for policy 1, policy_version 15662 (0.0008) +[2023-10-14 05:40:20,633][100917] Updated weights for policy 1, policy_version 15672 (0.0010) +[2023-10-14 05:40:22,068][100936] Updated weights for policy 0, policy_version 15690 (0.0008) +[2023-10-14 05:40:22,445][100936] Updated weights for policy 0, policy_version 15700 (0.0010) +[2023-10-14 05:40:22,821][100936] Updated weights for policy 0, policy_version 15710 (0.0008) +[2023-10-14 05:40:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 32145408. Throughput: 0: 1663.9, 1: 1645.5. Samples: 8045492. Policy #0 lag: (min: 1.0, avg: 8.3, max: 33.0) +[2023-10-14 05:40:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:40:24,840][100917] Updated weights for policy 1, policy_version 15682 (0.0008) +[2023-10-14 05:40:25,205][100917] Updated weights for policy 1, policy_version 15692 (0.0008) +[2023-10-14 05:40:25,584][100917] Updated weights for policy 1, policy_version 15702 (0.0010) +[2023-10-14 05:40:25,955][100917] Updated weights for policy 1, policy_version 15712 (0.0007) +[2023-10-14 05:40:26,814][100936] Updated weights for policy 0, policy_version 15720 (0.0009) +[2023-10-14 05:40:27,188][100936] Updated weights for policy 0, policy_version 15730 (0.0009) +[2023-10-14 05:40:27,554][100936] Updated weights for policy 0, policy_version 15740 (0.0008) +[2023-10-14 05:40:28,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 32210944. Throughput: 0: 1663.3, 1: 1644.6. Samples: 8055886. Policy #0 lag: (min: 1.0, avg: 8.3, max: 33.0) +[2023-10-14 05:40:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:40:30,022][100917] Updated weights for policy 1, policy_version 15722 (0.0009) +[2023-10-14 05:40:30,396][100917] Updated weights for policy 1, policy_version 15732 (0.0008) +[2023-10-14 05:40:30,771][100917] Updated weights for policy 1, policy_version 15742 (0.0008) +[2023-10-14 05:40:31,625][100936] Updated weights for policy 0, policy_version 15750 (0.0008) +[2023-10-14 05:40:32,001][100936] Updated weights for policy 0, policy_version 15760 (0.0007) +[2023-10-14 05:40:32,379][100936] Updated weights for policy 0, policy_version 15770 (0.0007) +[2023-10-14 05:40:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 32276480. Throughput: 0: 1651.2, 1: 1650.0. Samples: 8075616. Policy #0 lag: (min: 1.0, avg: 8.3, max: 33.0) +[2023-10-14 05:40:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:40:34,891][100917] Updated weights for policy 1, policy_version 15752 (0.0009) +[2023-10-14 05:40:35,257][100917] Updated weights for policy 1, policy_version 15762 (0.0009) +[2023-10-14 05:40:35,623][100917] Updated weights for policy 1, policy_version 15772 (0.0008) +[2023-10-14 05:40:36,485][100936] Updated weights for policy 0, policy_version 15780 (0.0007) +[2023-10-14 05:40:36,856][100936] Updated weights for policy 0, policy_version 15790 (0.0008) +[2023-10-14 05:40:37,218][100936] Updated weights for policy 0, policy_version 15800 (0.0010) +[2023-10-14 05:40:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 32342016. Throughput: 0: 1666.1, 1: 1650.0. Samples: 8095662. Policy #0 lag: (min: 2.0, avg: 3.5, max: 28.0) +[2023-10-14 05:40:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:40:39,887][100917] Updated weights for policy 1, policy_version 15782 (0.0008) +[2023-10-14 05:40:40,275][100917] Updated weights for policy 1, policy_version 15792 (0.0011) +[2023-10-14 05:40:40,649][100917] Updated weights for policy 1, policy_version 15802 (0.0008) +[2023-10-14 05:40:41,424][100936] Updated weights for policy 0, policy_version 15810 (0.0009) +[2023-10-14 05:40:41,793][100936] Updated weights for policy 0, policy_version 15820 (0.0007) +[2023-10-14 05:40:42,159][100936] Updated weights for policy 0, policy_version 15830 (0.0007) +[2023-10-14 05:40:42,536][100936] Updated weights for policy 0, policy_version 15840 (0.0007) +[2023-10-14 05:40:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 32407552. Throughput: 0: 1661.9, 1: 1650.5. Samples: 8105700. Policy #0 lag: (min: 2.0, avg: 3.5, max: 28.0) +[2023-10-14 05:40:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:40:44,673][100917] Updated weights for policy 1, policy_version 15812 (0.0008) +[2023-10-14 05:40:45,044][100917] Updated weights for policy 1, policy_version 15822 (0.0007) +[2023-10-14 05:40:45,403][100917] Updated weights for policy 1, policy_version 15832 (0.0007) +[2023-10-14 05:40:46,702][100936] Updated weights for policy 0, policy_version 15850 (0.0008) +[2023-10-14 05:40:47,070][100936] Updated weights for policy 0, policy_version 15860 (0.0009) +[2023-10-14 05:40:47,446][100936] Updated weights for policy 0, policy_version 15870 (0.0008) +[2023-10-14 05:40:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 32473088. Throughput: 0: 1651.8, 1: 1662.4. Samples: 8125216. Policy #0 lag: (min: 2.0, avg: 3.5, max: 28.0) +[2023-10-14 05:40:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 05:40:49,536][100917] Updated weights for policy 1, policy_version 15842 (0.0009) +[2023-10-14 05:40:49,904][100917] Updated weights for policy 1, policy_version 15852 (0.0010) +[2023-10-14 05:40:50,274][100917] Updated weights for policy 1, policy_version 15862 (0.0009) +[2023-10-14 05:40:50,649][100917] Updated weights for policy 1, policy_version 15872 (0.0011) +[2023-10-14 05:40:51,410][100936] Updated weights for policy 0, policy_version 15880 (0.0009) +[2023-10-14 05:40:51,779][100936] Updated weights for policy 0, policy_version 15890 (0.0010) +[2023-10-14 05:40:52,151][100936] Updated weights for policy 0, policy_version 15900 (0.0008) +[2023-10-14 05:40:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 32538624. Throughput: 0: 1662.7, 1: 1662.2. Samples: 8145434. Policy #0 lag: (min: 0.0, avg: 25.2, max: 32.0) +[2023-10-14 05:40:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:40:54,628][100917] Updated weights for policy 1, policy_version 15882 (0.0012) +[2023-10-14 05:40:55,009][100917] Updated weights for policy 1, policy_version 15892 (0.0010) +[2023-10-14 05:40:55,376][100917] Updated weights for policy 1, policy_version 15902 (0.0007) +[2023-10-14 05:40:56,361][100936] Updated weights for policy 0, policy_version 15910 (0.0008) +[2023-10-14 05:40:56,723][100936] Updated weights for policy 0, policy_version 15920 (0.0010) +[2023-10-14 05:40:57,095][100936] Updated weights for policy 0, policy_version 15930 (0.0009) +[2023-10-14 05:40:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 32604160. Throughput: 0: 1655.4, 1: 1661.1. Samples: 8155450. Policy #0 lag: (min: 0.0, avg: 25.2, max: 32.0) +[2023-10-14 05:40:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:40:59,341][100917] Updated weights for policy 1, policy_version 15912 (0.0008) +[2023-10-14 05:40:59,705][100917] Updated weights for policy 1, policy_version 15922 (0.0009) +[2023-10-14 05:41:00,086][100917] Updated weights for policy 1, policy_version 15932 (0.0009) +[2023-10-14 05:41:01,216][100936] Updated weights for policy 0, policy_version 15940 (0.0009) +[2023-10-14 05:41:01,585][100936] Updated weights for policy 0, policy_version 15950 (0.0009) +[2023-10-14 05:41:01,961][100936] Updated weights for policy 0, policy_version 15960 (0.0008) +[2023-10-14 05:41:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 32669696. Throughput: 0: 1650.0, 1: 1662.7. Samples: 8174726. Policy #0 lag: (min: 0.0, avg: 25.2, max: 32.0) +[2023-10-14 05:41:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:41:04,094][100917] Updated weights for policy 1, policy_version 15942 (0.0007) +[2023-10-14 05:41:04,466][100917] Updated weights for policy 1, policy_version 15952 (0.0007) +[2023-10-14 05:41:04,833][100917] Updated weights for policy 1, policy_version 15962 (0.0010) +[2023-10-14 05:41:06,102][100936] Updated weights for policy 0, policy_version 15970 (0.0008) +[2023-10-14 05:41:06,473][100936] Updated weights for policy 0, policy_version 15980 (0.0007) +[2023-10-14 05:41:06,838][100936] Updated weights for policy 0, policy_version 15990 (0.0008) +[2023-10-14 05:41:07,210][100936] Updated weights for policy 0, policy_version 16000 (0.0007) +[2023-10-14 05:41:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 32735232. Throughput: 0: 1662.8, 1: 1670.4. Samples: 8195488. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 05:41:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:41:08,987][100917] Updated weights for policy 1, policy_version 15972 (0.0009) +[2023-10-14 05:41:09,370][100917] Updated weights for policy 1, policy_version 15982 (0.0010) +[2023-10-14 05:41:09,740][100917] Updated weights for policy 1, policy_version 15992 (0.0008) +[2023-10-14 05:41:11,258][100936] Updated weights for policy 0, policy_version 16010 (0.0009) +[2023-10-14 05:41:11,626][100936] Updated weights for policy 0, policy_version 16020 (0.0010) +[2023-10-14 05:41:11,991][100936] Updated weights for policy 0, policy_version 16030 (0.0007) +[2023-10-14 05:41:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 32800768. Throughput: 0: 1650.1, 1: 1667.2. Samples: 8205164. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 05:41:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:41:13,891][100917] Updated weights for policy 1, policy_version 16002 (0.0009) +[2023-10-14 05:41:14,269][100917] Updated weights for policy 1, policy_version 16012 (0.0008) +[2023-10-14 05:41:14,644][100917] Updated weights for policy 1, policy_version 16022 (0.0009) +[2023-10-14 05:41:15,012][100917] Updated weights for policy 1, policy_version 16032 (0.0008) +[2023-10-14 05:41:16,197][100936] Updated weights for policy 0, policy_version 16040 (0.0009) +[2023-10-14 05:41:16,558][100936] Updated weights for policy 0, policy_version 16050 (0.0010) +[2023-10-14 05:41:16,927][100936] Updated weights for policy 0, policy_version 16060 (0.0010) +[2023-10-14 05:41:18,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 32866304. Throughput: 0: 1648.3, 1: 1660.7. Samples: 8224520. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 05:41:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:41:19,171][100917] Updated weights for policy 1, policy_version 16042 (0.0009) +[2023-10-14 05:41:19,540][100917] Updated weights for policy 1, policy_version 16052 (0.0008) +[2023-10-14 05:41:19,904][100917] Updated weights for policy 1, policy_version 16062 (0.0007) +[2023-10-14 05:41:20,993][100936] Updated weights for policy 0, policy_version 16070 (0.0008) +[2023-10-14 05:41:21,361][100936] Updated weights for policy 0, policy_version 16080 (0.0010) +[2023-10-14 05:41:21,725][100936] Updated weights for policy 0, policy_version 16090 (0.0007) +[2023-10-14 05:41:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 32931840. Throughput: 0: 1660.4, 1: 1659.5. Samples: 8245058. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:41:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:41:24,006][100917] Updated weights for policy 1, policy_version 16072 (0.0009) +[2023-10-14 05:41:24,380][100917] Updated weights for policy 1, policy_version 16082 (0.0011) +[2023-10-14 05:41:24,761][100917] Updated weights for policy 1, policy_version 16092 (0.0010) +[2023-10-14 05:41:25,805][100936] Updated weights for policy 0, policy_version 16100 (0.0007) +[2023-10-14 05:41:26,175][100936] Updated weights for policy 0, policy_version 16110 (0.0007) +[2023-10-14 05:41:26,552][100936] Updated weights for policy 0, policy_version 16120 (0.0009) +[2023-10-14 05:41:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 32997376. Throughput: 0: 1644.4, 1: 1658.1. Samples: 8254312. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:41:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:41:29,022][100917] Updated weights for policy 1, policy_version 16102 (0.0008) +[2023-10-14 05:41:29,400][100917] Updated weights for policy 1, policy_version 16112 (0.0009) +[2023-10-14 05:41:29,775][100917] Updated weights for policy 1, policy_version 16122 (0.0009) +[2023-10-14 05:41:30,741][100936] Updated weights for policy 0, policy_version 16130 (0.0007) +[2023-10-14 05:41:31,117][100936] Updated weights for policy 0, policy_version 16140 (0.0009) +[2023-10-14 05:41:31,486][100936] Updated weights for policy 0, policy_version 16150 (0.0008) +[2023-10-14 05:41:31,851][100936] Updated weights for policy 0, policy_version 16160 (0.0008) +[2023-10-14 05:41:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 33062912. Throughput: 0: 1661.9, 1: 1648.8. Samples: 8274196. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:41:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:41:33,883][100917] Updated weights for policy 1, policy_version 16132 (0.0007) +[2023-10-14 05:41:34,258][100917] Updated weights for policy 1, policy_version 16142 (0.0010) +[2023-10-14 05:41:34,622][100917] Updated weights for policy 1, policy_version 16152 (0.0009) +[2023-10-14 05:41:35,986][100936] Updated weights for policy 0, policy_version 16170 (0.0008) +[2023-10-14 05:41:36,354][100936] Updated weights for policy 0, policy_version 16180 (0.0009) +[2023-10-14 05:41:36,722][100936] Updated weights for policy 0, policy_version 16190 (0.0008) +[2023-10-14 05:41:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 33128448. Throughput: 0: 1667.2, 1: 1653.2. Samples: 8294854. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:41:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:41:38,522][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000016160_16547840.pth... +[2023-10-14 05:41:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000016192_16580608.pth... +[2023-10-14 05:41:38,551][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000014624_14974976.pth +[2023-10-14 05:41:38,569][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000014656_15007744.pth +[2023-10-14 05:41:38,761][100917] Updated weights for policy 1, policy_version 16162 (0.0009) +[2023-10-14 05:41:39,133][100917] Updated weights for policy 1, policy_version 16172 (0.0007) +[2023-10-14 05:41:39,512][100917] Updated weights for policy 1, policy_version 16182 (0.0010) +[2023-10-14 05:41:39,883][100917] Updated weights for policy 1, policy_version 16192 (0.0009) +[2023-10-14 05:41:40,795][100936] Updated weights for policy 0, policy_version 16200 (0.0009) +[2023-10-14 05:41:41,165][100936] Updated weights for policy 0, policy_version 16210 (0.0011) +[2023-10-14 05:41:41,535][100936] Updated weights for policy 0, policy_version 16220 (0.0010) +[2023-10-14 05:41:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 33193984. Throughput: 0: 1651.5, 1: 1655.4. Samples: 8304260. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:41:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:41:43,884][100917] Updated weights for policy 1, policy_version 16202 (0.0010) +[2023-10-14 05:41:44,258][100917] Updated weights for policy 1, policy_version 16212 (0.0008) +[2023-10-14 05:41:44,614][100917] Updated weights for policy 1, policy_version 16222 (0.0007) +[2023-10-14 05:41:45,582][100936] Updated weights for policy 0, policy_version 16230 (0.0009) +[2023-10-14 05:41:45,942][100936] Updated weights for policy 0, policy_version 16240 (0.0010) +[2023-10-14 05:41:46,316][100936] Updated weights for policy 0, policy_version 16250 (0.0008) +[2023-10-14 05:41:48,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 33259520. Throughput: 0: 1668.0, 1: 1655.7. Samples: 8324294. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:41:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.470')] +[2023-10-14 05:41:48,842][100917] Updated weights for policy 1, policy_version 16232 (0.0009) +[2023-10-14 05:41:49,217][100917] Updated weights for policy 1, policy_version 16242 (0.0008) +[2023-10-14 05:41:49,597][100917] Updated weights for policy 1, policy_version 16252 (0.0010) +[2023-10-14 05:41:50,445][100936] Updated weights for policy 0, policy_version 16260 (0.0009) +[2023-10-14 05:41:50,815][100936] Updated weights for policy 0, policy_version 16270 (0.0010) +[2023-10-14 05:41:51,184][100936] Updated weights for policy 0, policy_version 16280 (0.0010) +[2023-10-14 05:41:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 33325056. Throughput: 0: 1663.7, 1: 1645.4. Samples: 8344398. Policy #0 lag: (min: 27.0, avg: 33.4, max: 59.0) +[2023-10-14 05:41:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.440')] +[2023-10-14 05:41:53,851][100917] Updated weights for policy 1, policy_version 16262 (0.0008) +[2023-10-14 05:41:54,223][100917] Updated weights for policy 1, policy_version 16272 (0.0008) +[2023-10-14 05:41:54,600][100917] Updated weights for policy 1, policy_version 16282 (0.0010) +[2023-10-14 05:41:55,281][100936] Updated weights for policy 0, policy_version 16290 (0.0009) +[2023-10-14 05:41:55,645][100936] Updated weights for policy 0, policy_version 16300 (0.0007) +[2023-10-14 05:41:56,021][100936] Updated weights for policy 0, policy_version 16310 (0.0007) +[2023-10-14 05:41:56,388][100936] Updated weights for policy 0, policy_version 16320 (0.0009) +[2023-10-14 05:41:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 33390592. Throughput: 0: 1648.1, 1: 1646.4. Samples: 8353418. Policy #0 lag: (min: 27.0, avg: 33.4, max: 59.0) +[2023-10-14 05:41:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.440')] +[2023-10-14 05:41:58,867][100917] Updated weights for policy 1, policy_version 16292 (0.0009) +[2023-10-14 05:41:59,238][100917] Updated weights for policy 1, policy_version 16302 (0.0009) +[2023-10-14 05:41:59,618][100917] Updated weights for policy 1, policy_version 16312 (0.0008) +[2023-10-14 05:42:00,435][100936] Updated weights for policy 0, policy_version 16330 (0.0007) +[2023-10-14 05:42:00,813][100936] Updated weights for policy 0, policy_version 16340 (0.0010) +[2023-10-14 05:42:01,186][100936] Updated weights for policy 0, policy_version 16350 (0.0009) +[2023-10-14 05:42:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 33456128. Throughput: 0: 1669.6, 1: 1649.1. Samples: 8373864. Policy #0 lag: (min: 27.0, avg: 33.4, max: 59.0) +[2023-10-14 05:42:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.440')] +[2023-10-14 05:42:03,893][100917] Updated weights for policy 1, policy_version 16322 (0.0009) +[2023-10-14 05:42:04,275][100917] Updated weights for policy 1, policy_version 16332 (0.0009) +[2023-10-14 05:42:04,632][100917] Updated weights for policy 1, policy_version 16342 (0.0007) +[2023-10-14 05:42:05,006][100917] Updated weights for policy 1, policy_version 16352 (0.0008) +[2023-10-14 05:42:05,310][100936] Updated weights for policy 0, policy_version 16360 (0.0009) +[2023-10-14 05:42:05,679][100936] Updated weights for policy 0, policy_version 16370 (0.0009) +[2023-10-14 05:42:06,062][100936] Updated weights for policy 0, policy_version 16380 (0.0008) +[2023-10-14 05:42:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 33521664. Throughput: 0: 1667.8, 1: 1654.0. Samples: 8394540. Policy #0 lag: (min: 8.0, avg: 32.6, max: 40.0) +[2023-10-14 05:42:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.440')] +[2023-10-14 05:42:09,169][100917] Updated weights for policy 1, policy_version 16362 (0.0009) +[2023-10-14 05:42:09,539][100917] Updated weights for policy 1, policy_version 16372 (0.0007) +[2023-10-14 05:42:09,920][100917] Updated weights for policy 1, policy_version 16382 (0.0007) +[2023-10-14 05:42:10,088][100936] Updated weights for policy 0, policy_version 16390 (0.0007) +[2023-10-14 05:42:10,458][100936] Updated weights for policy 0, policy_version 16400 (0.0008) +[2023-10-14 05:42:10,832][100936] Updated weights for policy 0, policy_version 16410 (0.0008) +[2023-10-14 05:42:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 33587200. Throughput: 0: 1659.4, 1: 1657.3. Samples: 8403566. Policy #0 lag: (min: 8.0, avg: 32.6, max: 40.0) +[2023-10-14 05:42:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.440')] +[2023-10-14 05:42:14,039][100917] Updated weights for policy 1, policy_version 16392 (0.0008) +[2023-10-14 05:42:14,412][100917] Updated weights for policy 1, policy_version 16402 (0.0007) +[2023-10-14 05:42:14,772][100936] Updated weights for policy 0, policy_version 16420 (0.0010) +[2023-10-14 05:42:14,784][100917] Updated weights for policy 1, policy_version 16412 (0.0007) +[2023-10-14 05:42:15,149][100936] Updated weights for policy 0, policy_version 16430 (0.0011) +[2023-10-14 05:42:15,527][100936] Updated weights for policy 0, policy_version 16440 (0.0010) +[2023-10-14 05:42:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 33652736. Throughput: 0: 1674.3, 1: 1659.9. Samples: 8424234. Policy #0 lag: (min: 8.0, avg: 32.6, max: 40.0) +[2023-10-14 05:42:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.440')] +[2023-10-14 05:42:18,907][100917] Updated weights for policy 1, policy_version 16422 (0.0008) +[2023-10-14 05:42:19,271][100917] Updated weights for policy 1, policy_version 16432 (0.0009) +[2023-10-14 05:42:19,648][100917] Updated weights for policy 1, policy_version 16442 (0.0008) +[2023-10-14 05:42:19,734][100936] Updated weights for policy 0, policy_version 16450 (0.0008) +[2023-10-14 05:42:20,114][100936] Updated weights for policy 0, policy_version 16460 (0.0008) +[2023-10-14 05:42:20,485][100936] Updated weights for policy 0, policy_version 16470 (0.0008) +[2023-10-14 05:42:20,856][100936] Updated weights for policy 0, policy_version 16480 (0.0008) +[2023-10-14 05:42:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 33718272. Throughput: 0: 1666.7, 1: 1653.0. Samples: 8444238. Policy #0 lag: (min: 6.0, avg: 9.0, max: 38.0) +[2023-10-14 05:42:23,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.440')] +[2023-10-14 05:42:23,861][100917] Updated weights for policy 1, policy_version 16452 (0.0008) +[2023-10-14 05:42:24,225][100917] Updated weights for policy 1, policy_version 16462 (0.0008) +[2023-10-14 05:42:24,593][100917] Updated weights for policy 1, policy_version 16472 (0.0009) +[2023-10-14 05:42:25,155][100936] Updated weights for policy 0, policy_version 16490 (0.0010) +[2023-10-14 05:42:25,520][100936] Updated weights for policy 0, policy_version 16500 (0.0008) +[2023-10-14 05:42:25,890][100936] Updated weights for policy 0, policy_version 16510 (0.0007) +[2023-10-14 05:42:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 33783808. Throughput: 0: 1656.9, 1: 1652.4. Samples: 8453180. Policy #0 lag: (min: 6.0, avg: 9.0, max: 38.0) +[2023-10-14 05:42:28,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.440')] +[2023-10-14 05:42:28,661][100917] Updated weights for policy 1, policy_version 16482 (0.0009) +[2023-10-14 05:42:29,041][100917] Updated weights for policy 1, policy_version 16492 (0.0010) +[2023-10-14 05:42:29,419][100917] Updated weights for policy 1, policy_version 16502 (0.0010) +[2023-10-14 05:42:29,791][100917] Updated weights for policy 1, policy_version 16512 (0.0009) +[2023-10-14 05:42:29,971][100936] Updated weights for policy 0, policy_version 16520 (0.0009) +[2023-10-14 05:42:30,345][100936] Updated weights for policy 0, policy_version 16530 (0.0007) +[2023-10-14 05:42:30,708][100936] Updated weights for policy 0, policy_version 16540 (0.0008) +[2023-10-14 05:42:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 33849344. Throughput: 0: 1664.4, 1: 1656.1. Samples: 8473714. Policy #0 lag: (min: 6.0, avg: 9.0, max: 38.0) +[2023-10-14 05:42:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.440')] +[2023-10-14 05:42:33,835][100917] Updated weights for policy 1, policy_version 16522 (0.0008) +[2023-10-14 05:42:34,215][100917] Updated weights for policy 1, policy_version 16532 (0.0012) +[2023-10-14 05:42:34,588][100917] Updated weights for policy 1, policy_version 16542 (0.0008) +[2023-10-14 05:42:35,024][100936] Updated weights for policy 0, policy_version 16550 (0.0007) +[2023-10-14 05:42:35,393][100936] Updated weights for policy 0, policy_version 16560 (0.0007) +[2023-10-14 05:42:35,770][100936] Updated weights for policy 0, policy_version 16570 (0.0007) +[2023-10-14 05:42:38,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 33914880. Throughput: 0: 1662.0, 1: 1662.1. Samples: 8493980. Policy #0 lag: (min: 9.0, avg: 18.0, max: 41.0) +[2023-10-14 05:42:38,513][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:42:38,646][100917] Updated weights for policy 1, policy_version 16552 (0.0010) +[2023-10-14 05:42:39,028][100917] Updated weights for policy 1, policy_version 16562 (0.0010) +[2023-10-14 05:42:39,397][100917] Updated weights for policy 1, policy_version 16572 (0.0008) +[2023-10-14 05:42:39,920][100936] Updated weights for policy 0, policy_version 16580 (0.0009) +[2023-10-14 05:42:40,293][100936] Updated weights for policy 0, policy_version 16590 (0.0010) +[2023-10-14 05:42:40,663][100936] Updated weights for policy 0, policy_version 16600 (0.0007) +[2023-10-14 05:42:43,460][100917] Updated weights for policy 1, policy_version 16582 (0.0009) +[2023-10-14 05:42:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 33980416. Throughput: 0: 1659.7, 1: 1667.3. Samples: 8503136. Policy #0 lag: (min: 9.0, avg: 18.0, max: 41.0) +[2023-10-14 05:42:43,513][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:42:43,826][100917] Updated weights for policy 1, policy_version 16592 (0.0008) +[2023-10-14 05:42:44,204][100917] Updated weights for policy 1, policy_version 16602 (0.0008) +[2023-10-14 05:42:44,915][100936] Updated weights for policy 0, policy_version 16610 (0.0008) +[2023-10-14 05:42:45,288][100936] Updated weights for policy 0, policy_version 16620 (0.0008) +[2023-10-14 05:42:45,657][100936] Updated weights for policy 0, policy_version 16630 (0.0008) +[2023-10-14 05:42:46,023][100936] Updated weights for policy 0, policy_version 16640 (0.0009) +[2023-10-14 05:42:48,242][100917] Updated weights for policy 1, policy_version 16612 (0.0007) +[2023-10-14 05:42:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 34045952. Throughput: 0: 1656.6, 1: 1667.7. Samples: 8523458. Policy #0 lag: (min: 9.0, avg: 18.0, max: 41.0) +[2023-10-14 05:42:48,512][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:42:48,626][100917] Updated weights for policy 1, policy_version 16622 (0.0009) +[2023-10-14 05:42:49,004][100917] Updated weights for policy 1, policy_version 16632 (0.0007) +[2023-10-14 05:42:50,253][100936] Updated weights for policy 0, policy_version 16650 (0.0008) +[2023-10-14 05:42:50,633][100936] Updated weights for policy 0, policy_version 16660 (0.0007) +[2023-10-14 05:42:51,014][100936] Updated weights for policy 0, policy_version 16670 (0.0007) +[2023-10-14 05:42:53,103][100917] Updated weights for policy 1, policy_version 16642 (0.0007) +[2023-10-14 05:42:53,474][100917] Updated weights for policy 1, policy_version 16652 (0.0007) +[2023-10-14 05:42:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 34111488. Throughput: 0: 1651.9, 1: 1663.7. Samples: 8543744. Policy #0 lag: (min: 23.0, avg: 25.7, max: 55.0) +[2023-10-14 05:42:53,513][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:42:53,845][100917] Updated weights for policy 1, policy_version 16662 (0.0007) +[2023-10-14 05:42:54,218][100917] Updated weights for policy 1, policy_version 16672 (0.0009) +[2023-10-14 05:42:55,150][100936] Updated weights for policy 0, policy_version 16680 (0.0007) +[2023-10-14 05:42:55,521][100936] Updated weights for policy 0, policy_version 16690 (0.0008) +[2023-10-14 05:42:55,882][100936] Updated weights for policy 0, policy_version 16700 (0.0008) +[2023-10-14 05:42:58,148][100917] Updated weights for policy 1, policy_version 16682 (0.0009) +[2023-10-14 05:42:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 34177024. Throughput: 0: 1650.2, 1: 1665.4. Samples: 8552768. Policy #0 lag: (min: 23.0, avg: 25.7, max: 55.0) +[2023-10-14 05:42:58,513][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:42:58,525][100917] Updated weights for policy 1, policy_version 16692 (0.0007) +[2023-10-14 05:42:58,896][100917] Updated weights for policy 1, policy_version 16702 (0.0007) +[2023-10-14 05:42:59,973][100936] Updated weights for policy 0, policy_version 16710 (0.0009) +[2023-10-14 05:43:00,351][100936] Updated weights for policy 0, policy_version 16720 (0.0009) +[2023-10-14 05:43:00,724][100936] Updated weights for policy 0, policy_version 16730 (0.0009) +[2023-10-14 05:43:03,000][100917] Updated weights for policy 1, policy_version 16712 (0.0007) +[2023-10-14 05:43:03,373][100917] Updated weights for policy 1, policy_version 16722 (0.0009) +[2023-10-14 05:43:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 34242560. Throughput: 0: 1642.8, 1: 1669.2. Samples: 8573272. Policy #0 lag: (min: 23.0, avg: 25.7, max: 55.0) +[2023-10-14 05:43:03,513][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:43:03,753][100917] Updated weights for policy 1, policy_version 16732 (0.0007) +[2023-10-14 05:43:04,722][100936] Updated weights for policy 0, policy_version 16740 (0.0008) +[2023-10-14 05:43:05,103][100936] Updated weights for policy 0, policy_version 16750 (0.0008) +[2023-10-14 05:43:05,474][100936] Updated weights for policy 0, policy_version 16760 (0.0009) +[2023-10-14 05:43:07,943][100917] Updated weights for policy 1, policy_version 16742 (0.0009) +[2023-10-14 05:43:08,327][100917] Updated weights for policy 1, policy_version 16752 (0.0008) +[2023-10-14 05:43:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 34308096. Throughput: 0: 1654.3, 1: 1662.4. Samples: 8593492. Policy #0 lag: (min: 31.0, avg: 45.4, max: 63.0) +[2023-10-14 05:43:08,512][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:43:08,707][100917] Updated weights for policy 1, policy_version 16762 (0.0010) +[2023-10-14 05:43:09,419][100936] Updated weights for policy 0, policy_version 16770 (0.0007) +[2023-10-14 05:43:09,787][100936] Updated weights for policy 0, policy_version 16780 (0.0010) +[2023-10-14 05:43:10,165][100936] Updated weights for policy 0, policy_version 16790 (0.0007) +[2023-10-14 05:43:10,531][100936] Updated weights for policy 0, policy_version 16800 (0.0008) +[2023-10-14 05:43:12,791][100917] Updated weights for policy 1, policy_version 16772 (0.0010) +[2023-10-14 05:43:13,156][100917] Updated weights for policy 1, policy_version 16782 (0.0010) +[2023-10-14 05:43:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 34373632. Throughput: 0: 1658.2, 1: 1668.2. Samples: 8602868. Policy #0 lag: (min: 31.0, avg: 45.4, max: 63.0) +[2023-10-14 05:43:13,512][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:43:13,532][100917] Updated weights for policy 1, policy_version 16792 (0.0007) +[2023-10-14 05:43:14,639][100936] Updated weights for policy 0, policy_version 16810 (0.0007) +[2023-10-14 05:43:15,002][100936] Updated weights for policy 0, policy_version 16820 (0.0007) +[2023-10-14 05:43:15,375][100936] Updated weights for policy 0, policy_version 16830 (0.0007) +[2023-10-14 05:43:17,758][100917] Updated weights for policy 1, policy_version 16802 (0.0011) +[2023-10-14 05:43:18,129][100917] Updated weights for policy 1, policy_version 16812 (0.0012) +[2023-10-14 05:43:18,505][100917] Updated weights for policy 1, policy_version 16822 (0.0009) +[2023-10-14 05:43:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 34439168. Throughput: 0: 1662.9, 1: 1666.6. Samples: 8623542. Policy #0 lag: (min: 31.0, avg: 45.4, max: 63.0) +[2023-10-14 05:43:18,512][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:43:18,884][100917] Updated weights for policy 1, policy_version 16832 (0.0008) +[2023-10-14 05:43:19,306][100936] Updated weights for policy 0, policy_version 16840 (0.0009) +[2023-10-14 05:43:19,695][100936] Updated weights for policy 0, policy_version 16850 (0.0009) +[2023-10-14 05:43:20,072][100936] Updated weights for policy 0, policy_version 16860 (0.0009) +[2023-10-14 05:43:22,862][100917] Updated weights for policy 1, policy_version 16842 (0.0011) +[2023-10-14 05:43:23,242][100917] Updated weights for policy 1, policy_version 16852 (0.0011) +[2023-10-14 05:43:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 34504704. Throughput: 0: 1666.9, 1: 1657.0. Samples: 8643558. Policy #0 lag: (min: 31.0, avg: 34.4, max: 63.0) +[2023-10-14 05:43:23,513][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:43:23,606][100917] Updated weights for policy 1, policy_version 16862 (0.0011) +[2023-10-14 05:43:24,171][100936] Updated weights for policy 0, policy_version 16870 (0.0009) +[2023-10-14 05:43:24,546][100936] Updated weights for policy 0, policy_version 16880 (0.0007) +[2023-10-14 05:43:24,910][100936] Updated weights for policy 0, policy_version 16890 (0.0010) +[2023-10-14 05:43:27,521][100917] Updated weights for policy 1, policy_version 16872 (0.0007) +[2023-10-14 05:43:27,901][100917] Updated weights for policy 1, policy_version 16882 (0.0007) +[2023-10-14 05:43:28,268][100917] Updated weights for policy 1, policy_version 16892 (0.0010) +[2023-10-14 05:43:28,512][99942] Fps is (10 sec: 16384.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 34603008. Throughput: 0: 1664.5, 1: 1666.5. Samples: 8653028. Policy #0 lag: (min: 31.0, avg: 34.4, max: 63.0) +[2023-10-14 05:43:28,512][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:43:29,079][100936] Updated weights for policy 0, policy_version 16900 (0.0008) +[2023-10-14 05:43:29,456][100936] Updated weights for policy 0, policy_version 16910 (0.0009) +[2023-10-14 05:43:29,825][100936] Updated weights for policy 0, policy_version 16920 (0.0009) +[2023-10-14 05:43:32,650][100917] Updated weights for policy 1, policy_version 16902 (0.0007) +[2023-10-14 05:43:33,031][100917] Updated weights for policy 1, policy_version 16912 (0.0007) +[2023-10-14 05:43:33,394][100917] Updated weights for policy 1, policy_version 16922 (0.0009) +[2023-10-14 05:43:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 34635776. Throughput: 0: 1665.9, 1: 1668.0. Samples: 8673480. Policy #0 lag: (min: 31.0, avg: 34.4, max: 63.0) +[2023-10-14 05:43:33,512][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:43:33,947][100936] Updated weights for policy 0, policy_version 16930 (0.0009) +[2023-10-14 05:43:34,320][100936] Updated weights for policy 0, policy_version 16940 (0.0007) +[2023-10-14 05:43:34,695][100936] Updated weights for policy 0, policy_version 16950 (0.0008) +[2023-10-14 05:43:35,062][100936] Updated weights for policy 0, policy_version 16960 (0.0008) +[2023-10-14 05:43:37,649][100917] Updated weights for policy 1, policy_version 16932 (0.0010) +[2023-10-14 05:43:38,028][100917] Updated weights for policy 1, policy_version 16942 (0.0009) +[2023-10-14 05:43:38,396][100917] Updated weights for policy 1, policy_version 16952 (0.0009) +[2023-10-14 05:43:38,512][99942] Fps is (10 sec: 9830.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 34701312. Throughput: 0: 1674.5, 1: 1653.9. Samples: 8693522. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 05:43:38,513][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:43:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000016960_17367040.pth... +[2023-10-14 05:43:38,557][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000015424_15794176.pth +[2023-10-14 05:43:38,695][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000016960_17367040.pth... +[2023-10-14 05:43:38,724][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000015392_15761408.pth +[2023-10-14 05:43:39,093][100936] Updated weights for policy 0, policy_version 16970 (0.0007) +[2023-10-14 05:43:39,458][100936] Updated weights for policy 0, policy_version 16980 (0.0008) +[2023-10-14 05:43:39,834][100936] Updated weights for policy 0, policy_version 16990 (0.0009) +[2023-10-14 05:43:42,524][100917] Updated weights for policy 1, policy_version 16962 (0.0009) +[2023-10-14 05:43:42,912][100917] Updated weights for policy 1, policy_version 16972 (0.0007) +[2023-10-14 05:43:43,282][100917] Updated weights for policy 1, policy_version 16982 (0.0008) +[2023-10-14 05:43:43,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 34766848. Throughput: 0: 1674.3, 1: 1664.5. Samples: 8703014. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 05:43:43,513][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:43:43,657][100917] Updated weights for policy 1, policy_version 16992 (0.0008) +[2023-10-14 05:43:43,948][100936] Updated weights for policy 0, policy_version 17000 (0.0008) +[2023-10-14 05:43:44,322][100936] Updated weights for policy 0, policy_version 17010 (0.0009) +[2023-10-14 05:43:44,685][100936] Updated weights for policy 0, policy_version 17020 (0.0008) +[2023-10-14 05:43:47,931][100917] Updated weights for policy 1, policy_version 17002 (0.0010) +[2023-10-14 05:43:48,307][100917] Updated weights for policy 1, policy_version 17012 (0.0009) +[2023-10-14 05:43:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 34832384. Throughput: 0: 1677.1, 1: 1658.5. Samples: 8723370. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 05:43:48,513][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:43:48,685][100917] Updated weights for policy 1, policy_version 17022 (0.0010) +[2023-10-14 05:43:48,959][100936] Updated weights for policy 0, policy_version 17030 (0.0009) +[2023-10-14 05:43:49,332][100936] Updated weights for policy 0, policy_version 17040 (0.0007) +[2023-10-14 05:43:49,703][100936] Updated weights for policy 0, policy_version 17050 (0.0008) +[2023-10-14 05:43:52,808][100917] Updated weights for policy 1, policy_version 17032 (0.0008) +[2023-10-14 05:43:53,186][100917] Updated weights for policy 1, policy_version 17042 (0.0007) +[2023-10-14 05:43:53,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 34897920. Throughput: 0: 1672.2, 1: 1653.5. Samples: 8743148. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) +[2023-10-14 05:43:53,512][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:43:53,566][100917] Updated weights for policy 1, policy_version 17052 (0.0007) +[2023-10-14 05:43:53,815][100936] Updated weights for policy 0, policy_version 17060 (0.0008) +[2023-10-14 05:43:54,191][100936] Updated weights for policy 0, policy_version 17070 (0.0009) +[2023-10-14 05:43:54,555][100936] Updated weights for policy 0, policy_version 17080 (0.0009) +[2023-10-14 05:43:57,587][100917] Updated weights for policy 1, policy_version 17062 (0.0008) +[2023-10-14 05:43:57,963][100917] Updated weights for policy 1, policy_version 17072 (0.0007) +[2023-10-14 05:43:58,347][100917] Updated weights for policy 1, policy_version 17082 (0.0009) +[2023-10-14 05:43:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 34963456. Throughput: 0: 1667.6, 1: 1658.2. Samples: 8752532. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) +[2023-10-14 05:43:58,513][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:43:58,875][100936] Updated weights for policy 0, policy_version 17090 (0.0010) +[2023-10-14 05:43:59,242][100936] Updated weights for policy 0, policy_version 17100 (0.0010) +[2023-10-14 05:43:59,614][100936] Updated weights for policy 0, policy_version 17110 (0.0008) +[2023-10-14 05:43:59,978][100936] Updated weights for policy 0, policy_version 17120 (0.0011) +[2023-10-14 05:44:02,658][100917] Updated weights for policy 1, policy_version 17092 (0.0010) +[2023-10-14 05:44:03,027][100917] Updated weights for policy 1, policy_version 17102 (0.0009) +[2023-10-14 05:44:03,407][100917] Updated weights for policy 1, policy_version 17112 (0.0010) +[2023-10-14 05:44:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 35028992. Throughput: 0: 1660.2, 1: 1653.5. Samples: 8772656. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) +[2023-10-14 05:44:03,513][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:44:04,109][100936] Updated weights for policy 0, policy_version 17130 (0.0009) +[2023-10-14 05:44:04,482][100936] Updated weights for policy 0, policy_version 17140 (0.0009) +[2023-10-14 05:44:04,851][100936] Updated weights for policy 0, policy_version 17150 (0.0008) +[2023-10-14 05:44:07,386][100917] Updated weights for policy 1, policy_version 17122 (0.0009) +[2023-10-14 05:44:07,752][100917] Updated weights for policy 1, policy_version 17132 (0.0007) +[2023-10-14 05:44:08,122][100917] Updated weights for policy 1, policy_version 17142 (0.0010) +[2023-10-14 05:44:08,506][100917] Updated weights for policy 1, policy_version 17152 (0.0010) +[2023-10-14 05:44:08,512][99942] Fps is (10 sec: 16384.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 35127296. Throughput: 0: 1660.6, 1: 1648.9. Samples: 8792486. Policy #0 lag: (min: 31.0, avg: 31.1, max: 37.0) +[2023-10-14 05:44:08,512][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:44:08,859][100936] Updated weights for policy 0, policy_version 17160 (0.0008) +[2023-10-14 05:44:09,236][100936] Updated weights for policy 0, policy_version 17170 (0.0007) +[2023-10-14 05:44:09,597][100936] Updated weights for policy 0, policy_version 17180 (0.0007) +[2023-10-14 05:44:12,402][100917] Updated weights for policy 1, policy_version 17162 (0.0007) +[2023-10-14 05:44:12,784][100917] Updated weights for policy 1, policy_version 17172 (0.0009) +[2023-10-14 05:44:13,173][100917] Updated weights for policy 1, policy_version 17182 (0.0009) +[2023-10-14 05:44:13,512][99942] Fps is (10 sec: 16384.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 35192832. Throughput: 0: 1657.6, 1: 1654.5. Samples: 8802074. Policy #0 lag: (min: 31.0, avg: 31.1, max: 37.0) +[2023-10-14 05:44:13,512][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:44:13,867][100936] Updated weights for policy 0, policy_version 17190 (0.0008) +[2023-10-14 05:44:14,231][100936] Updated weights for policy 0, policy_version 17200 (0.0009) +[2023-10-14 05:44:14,606][100936] Updated weights for policy 0, policy_version 17210 (0.0008) +[2023-10-14 05:44:17,282][100917] Updated weights for policy 1, policy_version 17192 (0.0008) +[2023-10-14 05:44:17,656][100917] Updated weights for policy 1, policy_version 17202 (0.0010) +[2023-10-14 05:44:18,033][100917] Updated weights for policy 1, policy_version 17212 (0.0008) +[2023-10-14 05:44:18,495][100936] Updated weights for policy 0, policy_version 17220 (0.0009) +[2023-10-14 05:44:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 35258368. Throughput: 0: 1664.0, 1: 1651.5. Samples: 8822676. Policy #0 lag: (min: 31.0, avg: 31.1, max: 37.0) +[2023-10-14 05:44:18,512][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:44:18,870][100936] Updated weights for policy 0, policy_version 17230 (0.0009) +[2023-10-14 05:44:19,240][100936] Updated weights for policy 0, policy_version 17240 (0.0010) +[2023-10-14 05:44:22,315][100917] Updated weights for policy 1, policy_version 17222 (0.0010) +[2023-10-14 05:44:22,685][100917] Updated weights for policy 1, policy_version 17232 (0.0009) +[2023-10-14 05:44:23,055][100917] Updated weights for policy 1, policy_version 17242 (0.0009) +[2023-10-14 05:44:23,290][100936] Updated weights for policy 0, policy_version 17250 (0.0008) +[2023-10-14 05:44:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 35323904. Throughput: 0: 1656.5, 1: 1640.5. Samples: 8841888. Policy #0 lag: (min: 26.0, avg: 29.1, max: 58.0) +[2023-10-14 05:44:23,513][99942] Avg episode reward: [(0, '0.630'), (1, '0.440')] +[2023-10-14 05:44:23,663][100936] Updated weights for policy 0, policy_version 17260 (0.0008) +[2023-10-14 05:44:24,043][100936] Updated weights for policy 0, policy_version 17270 (0.0008) +[2023-10-14 05:44:24,405][100936] Updated weights for policy 0, policy_version 17280 (0.0007) +[2023-10-14 05:44:27,291][100917] Updated weights for policy 1, policy_version 17252 (0.0009) +[2023-10-14 05:44:27,656][100917] Updated weights for policy 1, policy_version 17262 (0.0008) +[2023-10-14 05:44:28,037][100917] Updated weights for policy 1, policy_version 17272 (0.0009) +[2023-10-14 05:44:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 35389440. Throughput: 0: 1659.2, 1: 1647.3. Samples: 8851806. Policy #0 lag: (min: 26.0, avg: 29.1, max: 58.0) +[2023-10-14 05:44:28,513][99942] Avg episode reward: [(0, '0.630'), (1, '0.970')] +[2023-10-14 05:44:28,764][100936] Updated weights for policy 0, policy_version 17290 (0.0008) +[2023-10-14 05:44:29,127][100936] Updated weights for policy 0, policy_version 17300 (0.0008) +[2023-10-14 05:44:29,496][100936] Updated weights for policy 0, policy_version 17310 (0.0009) +[2023-10-14 05:44:32,188][100917] Updated weights for policy 1, policy_version 17282 (0.0008) +[2023-10-14 05:44:32,557][100917] Updated weights for policy 1, policy_version 17292 (0.0008) +[2023-10-14 05:44:32,933][100917] Updated weights for policy 1, policy_version 17302 (0.0007) +[2023-10-14 05:44:33,304][100917] Updated weights for policy 1, policy_version 17312 (0.0008) +[2023-10-14 05:44:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 35454976. Throughput: 0: 1654.5, 1: 1649.9. Samples: 8872070. Policy #0 lag: (min: 26.0, avg: 29.1, max: 58.0) +[2023-10-14 05:44:33,513][99942] Avg episode reward: [(0, '0.630'), (1, '0.970')] +[2023-10-14 05:44:33,629][100936] Updated weights for policy 0, policy_version 17320 (0.0008) +[2023-10-14 05:44:33,994][100936] Updated weights for policy 0, policy_version 17330 (0.0010) +[2023-10-14 05:44:34,363][100936] Updated weights for policy 0, policy_version 17340 (0.0010) +[2023-10-14 05:44:37,578][100917] Updated weights for policy 1, policy_version 17322 (0.0008) +[2023-10-14 05:44:37,963][100917] Updated weights for policy 1, policy_version 17332 (0.0007) +[2023-10-14 05:44:38,339][100917] Updated weights for policy 1, policy_version 17342 (0.0007) +[2023-10-14 05:44:38,371][100936] Updated weights for policy 0, policy_version 17350 (0.0009) +[2023-10-14 05:44:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 35520512. Throughput: 0: 1649.9, 1: 1642.2. Samples: 8891294. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 05:44:38,513][99942] Avg episode reward: [(0, '0.630'), (1, '1.000')] +[2023-10-14 05:44:38,742][100936] Updated weights for policy 0, policy_version 17360 (0.0008) +[2023-10-14 05:44:39,122][100936] Updated weights for policy 0, policy_version 17370 (0.0008) +[2023-10-14 05:44:42,407][100917] Updated weights for policy 1, policy_version 17352 (0.0007) +[2023-10-14 05:44:42,787][100917] Updated weights for policy 1, policy_version 17362 (0.0009) +[2023-10-14 05:44:43,142][100936] Updated weights for policy 0, policy_version 17380 (0.0009) +[2023-10-14 05:44:43,163][100917] Updated weights for policy 1, policy_version 17372 (0.0009) +[2023-10-14 05:44:43,505][100936] Updated weights for policy 0, policy_version 17390 (0.0009) +[2023-10-14 05:44:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 35586048. Throughput: 0: 1656.1, 1: 1649.7. Samples: 8901288. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 05:44:43,513][99942] Avg episode reward: [(0, '0.640'), (1, '1.000')] +[2023-10-14 05:44:43,873][100936] Updated weights for policy 0, policy_version 17400 (0.0010) +[2023-10-14 05:44:47,021][100917] Updated weights for policy 1, policy_version 17382 (0.0008) +[2023-10-14 05:44:47,393][100917] Updated weights for policy 1, policy_version 17392 (0.0007) +[2023-10-14 05:44:47,768][100917] Updated weights for policy 1, policy_version 17402 (0.0010) +[2023-10-14 05:44:48,155][100936] Updated weights for policy 0, policy_version 17410 (0.0011) +[2023-10-14 05:44:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 35651584. Throughput: 0: 1655.9, 1: 1653.1. Samples: 8921558. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 05:44:48,513][99942] Avg episode reward: [(0, '0.640'), (1, '1.000')] +[2023-10-14 05:44:48,531][100936] Updated weights for policy 0, policy_version 17420 (0.0009) +[2023-10-14 05:44:48,902][100936] Updated weights for policy 0, policy_version 17430 (0.0007) +[2023-10-14 05:44:49,268][100936] Updated weights for policy 0, policy_version 17440 (0.0007) +[2023-10-14 05:44:52,019][100917] Updated weights for policy 1, policy_version 17412 (0.0010) +[2023-10-14 05:44:52,394][100917] Updated weights for policy 1, policy_version 17422 (0.0011) +[2023-10-14 05:44:52,758][100917] Updated weights for policy 1, policy_version 17432 (0.0007) +[2023-10-14 05:44:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 35717120. Throughput: 0: 1648.8, 1: 1644.1. Samples: 8940670. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 05:44:53,512][99942] Avg episode reward: [(0, '0.640'), (1, '1.000')] +[2023-10-14 05:44:53,608][100936] Updated weights for policy 0, policy_version 17450 (0.0010) +[2023-10-14 05:44:53,978][100936] Updated weights for policy 0, policy_version 17460 (0.0009) +[2023-10-14 05:44:54,353][100936] Updated weights for policy 0, policy_version 17470 (0.0010) +[2023-10-14 05:44:57,014][100917] Updated weights for policy 1, policy_version 17442 (0.0008) +[2023-10-14 05:44:57,381][100917] Updated weights for policy 1, policy_version 17452 (0.0010) +[2023-10-14 05:44:57,760][100917] Updated weights for policy 1, policy_version 17462 (0.0009) +[2023-10-14 05:44:58,120][100917] Updated weights for policy 1, policy_version 17472 (0.0007) +[2023-10-14 05:44:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 35782656. Throughput: 0: 1652.0, 1: 1650.5. Samples: 8950686. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 05:44:58,512][99942] Avg episode reward: [(0, '0.640'), (1, '1.000')] +[2023-10-14 05:44:58,571][100936] Updated weights for policy 0, policy_version 17480 (0.0008) +[2023-10-14 05:44:58,937][100936] Updated weights for policy 0, policy_version 17490 (0.0010) +[2023-10-14 05:44:59,305][100936] Updated weights for policy 0, policy_version 17500 (0.0008) +[2023-10-14 05:45:02,226][100917] Updated weights for policy 1, policy_version 17482 (0.0008) +[2023-10-14 05:45:02,584][100917] Updated weights for policy 1, policy_version 17492 (0.0008) +[2023-10-14 05:45:02,959][100917] Updated weights for policy 1, policy_version 17502 (0.0010) +[2023-10-14 05:45:03,418][100936] Updated weights for policy 0, policy_version 17510 (0.0009) +[2023-10-14 05:45:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 35848192. Throughput: 0: 1646.2, 1: 1651.2. Samples: 8971062. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 05:45:03,513][99942] Avg episode reward: [(0, '0.640'), (1, '1.000')] +[2023-10-14 05:45:03,782][100936] Updated weights for policy 0, policy_version 17520 (0.0007) +[2023-10-14 05:45:04,152][100936] Updated weights for policy 0, policy_version 17530 (0.0008) +[2023-10-14 05:45:06,973][100917] Updated weights for policy 1, policy_version 17512 (0.0008) +[2023-10-14 05:45:07,346][100917] Updated weights for policy 1, policy_version 17522 (0.0008) +[2023-10-14 05:45:07,727][100917] Updated weights for policy 1, policy_version 17532 (0.0007) +[2023-10-14 05:45:08,189][100936] Updated weights for policy 0, policy_version 17540 (0.0010) +[2023-10-14 05:45:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 35913728. Throughput: 0: 1642.9, 1: 1647.1. Samples: 8989938. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:45:08,513][99942] Avg episode reward: [(0, '0.640'), (1, '1.000')] +[2023-10-14 05:45:08,552][100936] Updated weights for policy 0, policy_version 17550 (0.0010) +[2023-10-14 05:45:08,926][100936] Updated weights for policy 0, policy_version 17560 (0.0008) +[2023-10-14 05:45:11,733][100917] Updated weights for policy 1, policy_version 17542 (0.0007) +[2023-10-14 05:45:12,097][100917] Updated weights for policy 1, policy_version 17552 (0.0007) +[2023-10-14 05:45:12,476][100917] Updated weights for policy 1, policy_version 17562 (0.0010) +[2023-10-14 05:45:12,975][100936] Updated weights for policy 0, policy_version 17570 (0.0009) +[2023-10-14 05:45:13,343][100936] Updated weights for policy 0, policy_version 17580 (0.0009) +[2023-10-14 05:45:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 35979264. Throughput: 0: 1650.8, 1: 1660.1. Samples: 9000796. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:45:13,513][99942] Avg episode reward: [(0, '0.640'), (1, '1.000')] +[2023-10-14 05:45:13,709][100936] Updated weights for policy 0, policy_version 17590 (0.0007) +[2023-10-14 05:45:14,086][100936] Updated weights for policy 0, policy_version 17600 (0.0009) +[2023-10-14 05:45:16,896][100917] Updated weights for policy 1, policy_version 17572 (0.0009) +[2023-10-14 05:45:17,268][100917] Updated weights for policy 1, policy_version 17582 (0.0007) +[2023-10-14 05:45:17,638][100917] Updated weights for policy 1, policy_version 17592 (0.0008) +[2023-10-14 05:45:18,456][100936] Updated weights for policy 0, policy_version 17610 (0.0009) +[2023-10-14 05:45:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 36044800. Throughput: 0: 1655.5, 1: 1653.4. Samples: 9020970. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:45:18,512][99942] Avg episode reward: [(0, '0.220'), (1, '1.000')] +[2023-10-14 05:45:18,843][100936] Updated weights for policy 0, policy_version 17620 (0.0007) +[2023-10-14 05:45:19,214][100936] Updated weights for policy 0, policy_version 17630 (0.0009) +[2023-10-14 05:45:21,828][100917] Updated weights for policy 1, policy_version 17602 (0.0008) +[2023-10-14 05:45:22,223][100917] Updated weights for policy 1, policy_version 17612 (0.0009) +[2023-10-14 05:45:22,594][100917] Updated weights for policy 1, policy_version 17622 (0.0007) +[2023-10-14 05:45:22,976][100917] Updated weights for policy 1, policy_version 17632 (0.0007) +[2023-10-14 05:45:23,160][100936] Updated weights for policy 0, policy_version 17640 (0.0010) +[2023-10-14 05:45:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 36110336. Throughput: 0: 1651.2, 1: 1653.2. Samples: 9039992. Policy #0 lag: (min: 24.0, avg: 50.9, max: 56.0) +[2023-10-14 05:45:23,513][99942] Avg episode reward: [(0, '0.220'), (1, '1.000')] +[2023-10-14 05:45:23,535][100936] Updated weights for policy 0, policy_version 17650 (0.0010) +[2023-10-14 05:45:23,903][100936] Updated weights for policy 0, policy_version 17660 (0.0008) +[2023-10-14 05:45:26,859][100917] Updated weights for policy 1, policy_version 17642 (0.0008) +[2023-10-14 05:45:27,237][100917] Updated weights for policy 1, policy_version 17652 (0.0007) +[2023-10-14 05:45:27,607][100917] Updated weights for policy 1, policy_version 17662 (0.0008) +[2023-10-14 05:45:28,259][100936] Updated weights for policy 0, policy_version 17670 (0.0008) +[2023-10-14 05:45:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 36175872. Throughput: 0: 1656.8, 1: 1663.6. Samples: 9050704. Policy #0 lag: (min: 24.0, avg: 50.9, max: 56.0) +[2023-10-14 05:45:28,512][99942] Avg episode reward: [(0, '0.220'), (1, '1.000')] +[2023-10-14 05:45:28,630][100936] Updated weights for policy 0, policy_version 17680 (0.0009) +[2023-10-14 05:45:29,006][100936] Updated weights for policy 0, policy_version 17690 (0.0008) +[2023-10-14 05:45:31,517][100917] Updated weights for policy 1, policy_version 17672 (0.0009) +[2023-10-14 05:45:31,889][100917] Updated weights for policy 1, policy_version 17682 (0.0010) +[2023-10-14 05:45:32,252][100917] Updated weights for policy 1, policy_version 17692 (0.0010) +[2023-10-14 05:45:33,007][100936] Updated weights for policy 0, policy_version 17700 (0.0010) +[2023-10-14 05:45:33,389][100936] Updated weights for policy 0, policy_version 17710 (0.0008) +[2023-10-14 05:45:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 36241408. Throughput: 0: 1659.4, 1: 1649.3. Samples: 9070452. Policy #0 lag: (min: 24.0, avg: 50.9, max: 56.0) +[2023-10-14 05:45:33,513][99942] Avg episode reward: [(0, '0.220'), (1, '1.000')] +[2023-10-14 05:45:33,757][100936] Updated weights for policy 0, policy_version 17720 (0.0008) +[2023-10-14 05:45:36,422][100917] Updated weights for policy 1, policy_version 17702 (0.0009) +[2023-10-14 05:45:36,791][100917] Updated weights for policy 1, policy_version 17712 (0.0010) +[2023-10-14 05:45:37,166][100917] Updated weights for policy 1, policy_version 17722 (0.0008) +[2023-10-14 05:45:37,824][100936] Updated weights for policy 0, policy_version 17730 (0.0008) +[2023-10-14 05:45:38,236][100936] Updated weights for policy 0, policy_version 17740 (0.0007) +[2023-10-14 05:45:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 36306944. Throughput: 0: 1648.5, 1: 1660.1. Samples: 9089556. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:45:38,512][99942] Avg episode reward: [(0, '0.220'), (1, '1.000')] +[2023-10-14 05:45:38,521][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000017728_18153472.pth... +[2023-10-14 05:45:38,553][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000016160_16547840.pth +[2023-10-14 05:45:38,611][100936] Updated weights for policy 0, policy_version 17750 (0.0009) +[2023-10-14 05:45:38,980][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000017760_18186240.pth... +[2023-10-14 05:45:38,980][100936] Updated weights for policy 0, policy_version 17760 (0.0009) +[2023-10-14 05:45:39,009][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000016192_16580608.pth +[2023-10-14 05:45:41,268][100917] Updated weights for policy 1, policy_version 17732 (0.0009) +[2023-10-14 05:45:41,642][100917] Updated weights for policy 1, policy_version 17742 (0.0011) +[2023-10-14 05:45:42,015][100917] Updated weights for policy 1, policy_version 17752 (0.0009) +[2023-10-14 05:45:43,140][100936] Updated weights for policy 0, policy_version 17770 (0.0007) +[2023-10-14 05:45:43,511][100936] Updated weights for policy 0, policy_version 17780 (0.0008) +[2023-10-14 05:45:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 36372480. Throughput: 0: 1661.8, 1: 1663.7. Samples: 9100332. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:45:43,513][99942] Avg episode reward: [(0, '0.220'), (1, '1.000')] +[2023-10-14 05:45:43,891][100936] Updated weights for policy 0, policy_version 17790 (0.0009) +[2023-10-14 05:45:46,176][100917] Updated weights for policy 1, policy_version 17762 (0.0007) +[2023-10-14 05:45:46,555][100917] Updated weights for policy 1, policy_version 17772 (0.0010) +[2023-10-14 05:45:46,934][100917] Updated weights for policy 1, policy_version 17782 (0.0008) +[2023-10-14 05:45:47,301][100917] Updated weights for policy 1, policy_version 17792 (0.0009) +[2023-10-14 05:45:47,884][100936] Updated weights for policy 0, policy_version 17800 (0.0008) +[2023-10-14 05:45:48,253][100936] Updated weights for policy 0, policy_version 17810 (0.0008) +[2023-10-14 05:45:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 36438016. Throughput: 0: 1661.0, 1: 1647.3. Samples: 9119936. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:45:48,513][99942] Avg episode reward: [(0, '0.220'), (1, '1.000')] +[2023-10-14 05:45:48,623][100936] Updated weights for policy 0, policy_version 17820 (0.0010) +[2023-10-14 05:45:51,509][100917] Updated weights for policy 1, policy_version 17802 (0.0009) +[2023-10-14 05:45:51,881][100917] Updated weights for policy 1, policy_version 17812 (0.0009) +[2023-10-14 05:45:52,252][100917] Updated weights for policy 1, policy_version 17822 (0.0010) +[2023-10-14 05:45:53,034][100936] Updated weights for policy 0, policy_version 17830 (0.0009) +[2023-10-14 05:45:53,400][100936] Updated weights for policy 0, policy_version 17840 (0.0009) +[2023-10-14 05:45:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 36503552. Throughput: 0: 1653.0, 1: 1667.6. Samples: 9139364. Policy #0 lag: (min: 18.0, avg: 31.1, max: 50.0) +[2023-10-14 05:45:53,513][99942] Avg episode reward: [(0, '0.220'), (1, '1.000')] +[2023-10-14 05:45:53,777][100936] Updated weights for policy 0, policy_version 17850 (0.0010) +[2023-10-14 05:45:56,249][100917] Updated weights for policy 1, policy_version 17832 (0.0009) +[2023-10-14 05:45:56,632][100917] Updated weights for policy 1, policy_version 17842 (0.0008) +[2023-10-14 05:45:57,002][100917] Updated weights for policy 1, policy_version 17852 (0.0007) +[2023-10-14 05:45:57,675][100936] Updated weights for policy 0, policy_version 17860 (0.0010) +[2023-10-14 05:45:58,039][100936] Updated weights for policy 0, policy_version 17870 (0.0010) +[2023-10-14 05:45:58,415][100936] Updated weights for policy 0, policy_version 17880 (0.0008) +[2023-10-14 05:45:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 36569088. Throughput: 0: 1658.0, 1: 1664.6. Samples: 9150312. Policy #0 lag: (min: 18.0, avg: 31.1, max: 50.0) +[2023-10-14 05:45:58,512][99942] Avg episode reward: [(0, '0.220'), (1, '1.000')] +[2023-10-14 05:46:01,032][100917] Updated weights for policy 1, policy_version 17862 (0.0009) +[2023-10-14 05:46:01,407][100917] Updated weights for policy 1, policy_version 17872 (0.0010) +[2023-10-14 05:46:01,788][100917] Updated weights for policy 1, policy_version 17882 (0.0009) +[2023-10-14 05:46:02,517][100936] Updated weights for policy 0, policy_version 17890 (0.0008) +[2023-10-14 05:46:02,890][100936] Updated weights for policy 0, policy_version 17900 (0.0008) +[2023-10-14 05:46:03,264][100936] Updated weights for policy 0, policy_version 17910 (0.0008) +[2023-10-14 05:46:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 36634624. Throughput: 0: 1659.3, 1: 1648.8. Samples: 9169832. Policy #0 lag: (min: 18.0, avg: 31.1, max: 50.0) +[2023-10-14 05:46:03,512][99942] Avg episode reward: [(0, '0.220'), (1, '1.000')] +[2023-10-14 05:46:03,645][100936] Updated weights for policy 0, policy_version 17920 (0.0008) +[2023-10-14 05:46:05,834][100917] Updated weights for policy 1, policy_version 17892 (0.0010) +[2023-10-14 05:46:06,215][100917] Updated weights for policy 1, policy_version 17902 (0.0011) +[2023-10-14 05:46:06,583][100917] Updated weights for policy 1, policy_version 17912 (0.0009) +[2023-10-14 05:46:07,902][100936] Updated weights for policy 0, policy_version 17930 (0.0008) +[2023-10-14 05:46:08,275][100936] Updated weights for policy 0, policy_version 17940 (0.0009) +[2023-10-14 05:46:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 36700160. Throughput: 0: 1644.4, 1: 1672.4. Samples: 9189244. Policy #0 lag: (min: 22.0, avg: 24.8, max: 54.0) +[2023-10-14 05:46:08,513][99942] Avg episode reward: [(0, '0.220'), (1, '1.000')] +[2023-10-14 05:46:08,652][100936] Updated weights for policy 0, policy_version 17950 (0.0007) +[2023-10-14 05:46:10,776][100917] Updated weights for policy 1, policy_version 17922 (0.0010) +[2023-10-14 05:46:11,199][100917] Updated weights for policy 1, policy_version 17932 (0.0007) +[2023-10-14 05:46:11,576][100917] Updated weights for policy 1, policy_version 17942 (0.0007) +[2023-10-14 05:46:11,949][100917] Updated weights for policy 1, policy_version 17952 (0.0007) +[2023-10-14 05:46:12,862][100936] Updated weights for policy 0, policy_version 17960 (0.0008) +[2023-10-14 05:46:13,246][100936] Updated weights for policy 0, policy_version 17970 (0.0007) +[2023-10-14 05:46:13,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 36765696. Throughput: 0: 1652.8, 1: 1661.0. Samples: 9199824. Policy #0 lag: (min: 22.0, avg: 24.8, max: 54.0) +[2023-10-14 05:46:13,513][99942] Avg episode reward: [(0, '0.220'), (1, '1.000')] +[2023-10-14 05:46:13,620][100936] Updated weights for policy 0, policy_version 17980 (0.0008) +[2023-10-14 05:46:16,002][100917] Updated weights for policy 1, policy_version 17962 (0.0009) +[2023-10-14 05:46:16,381][100917] Updated weights for policy 1, policy_version 17972 (0.0009) +[2023-10-14 05:46:16,751][100917] Updated weights for policy 1, policy_version 17982 (0.0008) +[2023-10-14 05:46:17,729][100936] Updated weights for policy 0, policy_version 17990 (0.0010) +[2023-10-14 05:46:18,105][100936] Updated weights for policy 0, policy_version 18000 (0.0010) +[2023-10-14 05:46:18,476][100936] Updated weights for policy 0, policy_version 18010 (0.0008) +[2023-10-14 05:46:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 36831232. Throughput: 0: 1653.0, 1: 1655.7. Samples: 9219342. Policy #0 lag: (min: 22.0, avg: 24.8, max: 54.0) +[2023-10-14 05:46:18,513][99942] Avg episode reward: [(0, '0.220'), (1, '1.000')] +[2023-10-14 05:46:20,815][100917] Updated weights for policy 1, policy_version 17992 (0.0008) +[2023-10-14 05:46:21,187][100917] Updated weights for policy 1, policy_version 18002 (0.0007) +[2023-10-14 05:46:21,558][100917] Updated weights for policy 1, policy_version 18012 (0.0007) +[2023-10-14 05:46:22,623][100936] Updated weights for policy 0, policy_version 18020 (0.0008) +[2023-10-14 05:46:22,997][100936] Updated weights for policy 0, policy_version 18030 (0.0009) +[2023-10-14 05:46:23,370][100936] Updated weights for policy 0, policy_version 18040 (0.0009) +[2023-10-14 05:46:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 36896768. Throughput: 0: 1646.6, 1: 1668.1. Samples: 9238720. Policy #0 lag: (min: 24.0, avg: 50.2, max: 56.0) +[2023-10-14 05:46:23,513][99942] Avg episode reward: [(0, '0.220'), (1, '1.000')] +[2023-10-14 05:46:25,683][100917] Updated weights for policy 1, policy_version 18022 (0.0007) +[2023-10-14 05:46:26,049][100917] Updated weights for policy 1, policy_version 18032 (0.0010) +[2023-10-14 05:46:26,424][100917] Updated weights for policy 1, policy_version 18042 (0.0011) +[2023-10-14 05:46:27,600][100936] Updated weights for policy 0, policy_version 18050 (0.0008) +[2023-10-14 05:46:27,998][100936] Updated weights for policy 0, policy_version 18060 (0.0007) +[2023-10-14 05:46:28,374][100936] Updated weights for policy 0, policy_version 18070 (0.0007) +[2023-10-14 05:46:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 36962304. Throughput: 0: 1651.3, 1: 1658.5. Samples: 9249272. Policy #0 lag: (min: 24.0, avg: 50.2, max: 56.0) +[2023-10-14 05:46:28,512][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:46:28,745][100936] Updated weights for policy 0, policy_version 18080 (0.0007) +[2023-10-14 05:46:30,557][100917] Updated weights for policy 1, policy_version 18052 (0.0008) +[2023-10-14 05:46:30,926][100917] Updated weights for policy 1, policy_version 18062 (0.0007) +[2023-10-14 05:46:31,296][100917] Updated weights for policy 1, policy_version 18072 (0.0008) +[2023-10-14 05:46:32,743][100936] Updated weights for policy 0, policy_version 18090 (0.0007) +[2023-10-14 05:46:33,127][100936] Updated weights for policy 0, policy_version 18100 (0.0009) +[2023-10-14 05:46:33,499][100936] Updated weights for policy 0, policy_version 18110 (0.0008) +[2023-10-14 05:46:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 37027840. Throughput: 0: 1653.4, 1: 1656.5. Samples: 9268882. Policy #0 lag: (min: 24.0, avg: 50.2, max: 56.0) +[2023-10-14 05:46:33,512][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:46:35,392][100917] Updated weights for policy 1, policy_version 18082 (0.0007) +[2023-10-14 05:46:35,767][100917] Updated weights for policy 1, policy_version 18092 (0.0007) +[2023-10-14 05:46:36,144][100917] Updated weights for policy 1, policy_version 18102 (0.0007) +[2023-10-14 05:46:36,512][100917] Updated weights for policy 1, policy_version 18112 (0.0008) +[2023-10-14 05:46:37,588][100936] Updated weights for policy 0, policy_version 18120 (0.0007) +[2023-10-14 05:46:37,968][100936] Updated weights for policy 0, policy_version 18130 (0.0010) +[2023-10-14 05:46:38,342][100936] Updated weights for policy 0, policy_version 18140 (0.0010) +[2023-10-14 05:46:38,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 37126144. Throughput: 0: 1645.0, 1: 1666.5. Samples: 9288380. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:46:38,512][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:46:40,755][100917] Updated weights for policy 1, policy_version 18122 (0.0010) +[2023-10-14 05:46:41,134][100917] Updated weights for policy 1, policy_version 18132 (0.0010) +[2023-10-14 05:46:41,502][100917] Updated weights for policy 1, policy_version 18142 (0.0009) +[2023-10-14 05:46:42,415][100936] Updated weights for policy 0, policy_version 18150 (0.0008) +[2023-10-14 05:46:42,788][100936] Updated weights for policy 0, policy_version 18160 (0.0007) +[2023-10-14 05:46:43,161][100936] Updated weights for policy 0, policy_version 18170 (0.0008) +[2023-10-14 05:46:43,512][99942] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 37191680. Throughput: 0: 1654.8, 1: 1650.9. Samples: 9299072. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:46:43,513][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:46:45,721][100917] Updated weights for policy 1, policy_version 18152 (0.0008) +[2023-10-14 05:46:46,094][100917] Updated weights for policy 1, policy_version 18162 (0.0008) +[2023-10-14 05:46:46,458][100917] Updated weights for policy 1, policy_version 18172 (0.0009) +[2023-10-14 05:46:47,338][100936] Updated weights for policy 0, policy_version 18180 (0.0009) +[2023-10-14 05:46:47,718][100936] Updated weights for policy 0, policy_version 18190 (0.0007) +[2023-10-14 05:46:48,087][100936] Updated weights for policy 0, policy_version 18200 (0.0007) +[2023-10-14 05:46:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 37257216. Throughput: 0: 1648.5, 1: 1661.5. Samples: 9318784. Policy #0 lag: (min: 32.0, avg: 46.7, max: 48.0) +[2023-10-14 05:46:48,512][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:46:50,479][100917] Updated weights for policy 1, policy_version 18182 (0.0009) +[2023-10-14 05:46:50,847][100917] Updated weights for policy 1, policy_version 18192 (0.0007) +[2023-10-14 05:46:51,220][100917] Updated weights for policy 1, policy_version 18202 (0.0009) +[2023-10-14 05:46:52,123][100936] Updated weights for policy 0, policy_version 18210 (0.0009) +[2023-10-14 05:46:52,491][100936] Updated weights for policy 0, policy_version 18220 (0.0007) +[2023-10-14 05:46:52,862][100936] Updated weights for policy 0, policy_version 18230 (0.0007) +[2023-10-14 05:46:53,227][100936] Updated weights for policy 0, policy_version 18240 (0.0008) +[2023-10-14 05:46:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 37322752. Throughput: 0: 1655.4, 1: 1667.4. Samples: 9338770. Policy #0 lag: (min: 32.0, avg: 46.7, max: 48.0) +[2023-10-14 05:46:53,513][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:46:55,293][100917] Updated weights for policy 1, policy_version 18212 (0.0008) +[2023-10-14 05:46:55,664][100917] Updated weights for policy 1, policy_version 18222 (0.0008) +[2023-10-14 05:46:56,049][100917] Updated weights for policy 1, policy_version 18232 (0.0009) +[2023-10-14 05:46:57,121][100936] Updated weights for policy 0, policy_version 18250 (0.0010) +[2023-10-14 05:46:57,494][100936] Updated weights for policy 0, policy_version 18260 (0.0009) +[2023-10-14 05:46:57,865][100936] Updated weights for policy 0, policy_version 18270 (0.0009) +[2023-10-14 05:46:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 37388288. Throughput: 0: 1667.9, 1: 1659.1. Samples: 9349536. Policy #0 lag: (min: 32.0, avg: 46.7, max: 48.0) +[2023-10-14 05:46:58,513][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:47:00,136][100917] Updated weights for policy 1, policy_version 18242 (0.0010) +[2023-10-14 05:47:00,547][100917] Updated weights for policy 1, policy_version 18252 (0.0007) +[2023-10-14 05:47:00,923][100917] Updated weights for policy 1, policy_version 18262 (0.0007) +[2023-10-14 05:47:01,293][100917] Updated weights for policy 1, policy_version 18272 (0.0008) +[2023-10-14 05:47:01,993][100936] Updated weights for policy 0, policy_version 18280 (0.0010) +[2023-10-14 05:47:02,372][100936] Updated weights for policy 0, policy_version 18290 (0.0008) +[2023-10-14 05:47:02,742][100936] Updated weights for policy 0, policy_version 18300 (0.0008) +[2023-10-14 05:47:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 37453824. Throughput: 0: 1652.2, 1: 1672.2. Samples: 9368940. Policy #0 lag: (min: 31.0, avg: 32.1, max: 53.0) +[2023-10-14 05:47:03,512][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:47:05,306][100917] Updated weights for policy 1, policy_version 18282 (0.0009) +[2023-10-14 05:47:05,663][100917] Updated weights for policy 1, policy_version 18292 (0.0008) +[2023-10-14 05:47:06,040][100917] Updated weights for policy 1, policy_version 18302 (0.0009) +[2023-10-14 05:47:06,766][100936] Updated weights for policy 0, policy_version 18310 (0.0007) +[2023-10-14 05:47:07,135][100936] Updated weights for policy 0, policy_version 18320 (0.0008) +[2023-10-14 05:47:07,506][100936] Updated weights for policy 0, policy_version 18330 (0.0009) +[2023-10-14 05:47:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 37519360. Throughput: 0: 1663.3, 1: 1673.9. Samples: 9388894. Policy #0 lag: (min: 31.0, avg: 32.1, max: 53.0) +[2023-10-14 05:47:08,512][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:47:10,031][100917] Updated weights for policy 1, policy_version 18312 (0.0010) +[2023-10-14 05:47:10,402][100917] Updated weights for policy 1, policy_version 18322 (0.0007) +[2023-10-14 05:47:10,780][100917] Updated weights for policy 1, policy_version 18332 (0.0009) +[2023-10-14 05:47:11,837][100936] Updated weights for policy 0, policy_version 18340 (0.0009) +[2023-10-14 05:47:12,218][100936] Updated weights for policy 0, policy_version 18350 (0.0009) +[2023-10-14 05:47:12,586][100936] Updated weights for policy 0, policy_version 18360 (0.0008) +[2023-10-14 05:47:13,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 37584896. Throughput: 0: 1671.9, 1: 1659.5. Samples: 9399188. Policy #0 lag: (min: 31.0, avg: 32.1, max: 53.0) +[2023-10-14 05:47:13,513][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:47:14,752][100917] Updated weights for policy 1, policy_version 18342 (0.0011) +[2023-10-14 05:47:15,129][100917] Updated weights for policy 1, policy_version 18352 (0.0011) +[2023-10-14 05:47:15,494][100917] Updated weights for policy 1, policy_version 18362 (0.0007) +[2023-10-14 05:47:16,777][100936] Updated weights for policy 0, policy_version 18370 (0.0008) +[2023-10-14 05:47:17,199][100936] Updated weights for policy 0, policy_version 18380 (0.0009) +[2023-10-14 05:47:17,562][100936] Updated weights for policy 0, policy_version 18390 (0.0008) +[2023-10-14 05:47:17,933][100936] Updated weights for policy 0, policy_version 18400 (0.0007) +[2023-10-14 05:47:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 37650432. Throughput: 0: 1647.5, 1: 1684.4. Samples: 9418816. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) +[2023-10-14 05:47:18,513][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:47:19,409][100917] Updated weights for policy 1, policy_version 18372 (0.0008) +[2023-10-14 05:47:19,782][100917] Updated weights for policy 1, policy_version 18382 (0.0009) +[2023-10-14 05:47:20,144][100917] Updated weights for policy 1, policy_version 18392 (0.0009) +[2023-10-14 05:47:21,996][100936] Updated weights for policy 0, policy_version 18410 (0.0009) +[2023-10-14 05:47:22,360][100936] Updated weights for policy 0, policy_version 18420 (0.0008) +[2023-10-14 05:47:22,734][100936] Updated weights for policy 0, policy_version 18430 (0.0009) +[2023-10-14 05:47:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 37715968. Throughput: 0: 1654.2, 1: 1691.5. Samples: 9438940. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) +[2023-10-14 05:47:23,513][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:47:24,257][100917] Updated weights for policy 1, policy_version 18402 (0.0008) +[2023-10-14 05:47:24,625][100917] Updated weights for policy 1, policy_version 18412 (0.0010) +[2023-10-14 05:47:24,998][100917] Updated weights for policy 1, policy_version 18422 (0.0009) +[2023-10-14 05:47:25,375][100917] Updated weights for policy 1, policy_version 18432 (0.0008) +[2023-10-14 05:47:26,968][100936] Updated weights for policy 0, policy_version 18440 (0.0007) +[2023-10-14 05:47:27,335][100936] Updated weights for policy 0, policy_version 18450 (0.0008) +[2023-10-14 05:47:27,716][100936] Updated weights for policy 0, policy_version 18460 (0.0009) +[2023-10-14 05:47:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 37781504. Throughput: 0: 1660.8, 1: 1676.4. Samples: 9449246. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) +[2023-10-14 05:47:28,513][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:47:29,476][100917] Updated weights for policy 1, policy_version 18442 (0.0007) +[2023-10-14 05:47:29,847][100917] Updated weights for policy 1, policy_version 18452 (0.0007) +[2023-10-14 05:47:30,217][100917] Updated weights for policy 1, policy_version 18462 (0.0007) +[2023-10-14 05:47:31,901][100936] Updated weights for policy 0, policy_version 18470 (0.0008) +[2023-10-14 05:47:32,281][100936] Updated weights for policy 0, policy_version 18480 (0.0009) +[2023-10-14 05:47:32,645][100936] Updated weights for policy 0, policy_version 18490 (0.0008) +[2023-10-14 05:47:33,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 37847040. Throughput: 0: 1652.0, 1: 1692.6. Samples: 9469292. Policy #0 lag: (min: 31.0, avg: 35.8, max: 63.0) +[2023-10-14 05:47:33,513][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:47:34,134][100917] Updated weights for policy 1, policy_version 18472 (0.0009) +[2023-10-14 05:47:34,503][100917] Updated weights for policy 1, policy_version 18482 (0.0008) +[2023-10-14 05:47:34,877][100917] Updated weights for policy 1, policy_version 18492 (0.0009) +[2023-10-14 05:47:36,779][100936] Updated weights for policy 0, policy_version 18500 (0.0009) +[2023-10-14 05:47:37,147][100936] Updated weights for policy 0, policy_version 18510 (0.0008) +[2023-10-14 05:47:37,518][100936] Updated weights for policy 0, policy_version 18520 (0.0008) +[2023-10-14 05:47:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13329.4). Total num frames: 37912576. Throughput: 0: 1655.6, 1: 1690.9. Samples: 9489362. Policy #0 lag: (min: 31.0, avg: 35.8, max: 63.0) +[2023-10-14 05:47:38,513][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:47:38,524][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000018496_18939904.pth... +[2023-10-14 05:47:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000018528_18972672.pth... +[2023-10-14 05:47:38,558][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000016960_17367040.pth +[2023-10-14 05:47:38,560][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000016960_17367040.pth +[2023-10-14 05:47:39,007][100917] Updated weights for policy 1, policy_version 18502 (0.0009) +[2023-10-14 05:47:39,381][100917] Updated weights for policy 1, policy_version 18512 (0.0008) +[2023-10-14 05:47:39,758][100917] Updated weights for policy 1, policy_version 18522 (0.0009) +[2023-10-14 05:47:41,668][100936] Updated weights for policy 0, policy_version 18530 (0.0010) +[2023-10-14 05:47:42,039][100936] Updated weights for policy 0, policy_version 18540 (0.0007) +[2023-10-14 05:47:42,410][100936] Updated weights for policy 0, policy_version 18550 (0.0007) +[2023-10-14 05:47:42,771][100936] Updated weights for policy 0, policy_version 18560 (0.0009) +[2023-10-14 05:47:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 37978112. Throughput: 0: 1652.4, 1: 1681.3. Samples: 9499552. Policy #0 lag: (min: 31.0, avg: 35.8, max: 63.0) +[2023-10-14 05:47:43,513][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:47:43,831][100917] Updated weights for policy 1, policy_version 18532 (0.0009) +[2023-10-14 05:47:44,205][100917] Updated weights for policy 1, policy_version 18542 (0.0009) +[2023-10-14 05:47:44,585][100917] Updated weights for policy 1, policy_version 18552 (0.0010) +[2023-10-14 05:47:46,946][100936] Updated weights for policy 0, policy_version 18570 (0.0007) +[2023-10-14 05:47:47,318][100936] Updated weights for policy 0, policy_version 18580 (0.0008) +[2023-10-14 05:47:47,677][100936] Updated weights for policy 0, policy_version 18590 (0.0010) +[2023-10-14 05:47:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 38043648. Throughput: 0: 1642.9, 1: 1690.0. Samples: 9518920. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 05:47:48,513][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:47:48,642][100917] Updated weights for policy 1, policy_version 18562 (0.0009) +[2023-10-14 05:47:49,056][100917] Updated weights for policy 1, policy_version 18572 (0.0008) +[2023-10-14 05:47:49,428][100917] Updated weights for policy 1, policy_version 18582 (0.0009) +[2023-10-14 05:47:49,801][100917] Updated weights for policy 1, policy_version 18592 (0.0009) +[2023-10-14 05:47:51,774][100936] Updated weights for policy 0, policy_version 18600 (0.0007) +[2023-10-14 05:47:52,155][100936] Updated weights for policy 0, policy_version 18610 (0.0007) +[2023-10-14 05:47:52,525][100936] Updated weights for policy 0, policy_version 18620 (0.0008) +[2023-10-14 05:47:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 38109184. Throughput: 0: 1646.5, 1: 1685.9. Samples: 9538850. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 05:47:53,513][99942] Avg episode reward: [(0, '0.220'), (1, '0.990')] +[2023-10-14 05:47:53,997][100917] Updated weights for policy 1, policy_version 18602 (0.0011) +[2023-10-14 05:47:54,365][100917] Updated weights for policy 1, policy_version 18612 (0.0010) +[2023-10-14 05:47:54,735][100917] Updated weights for policy 1, policy_version 18622 (0.0010) +[2023-10-14 05:47:56,544][100936] Updated weights for policy 0, policy_version 18630 (0.0009) +[2023-10-14 05:47:56,902][100936] Updated weights for policy 0, policy_version 18640 (0.0011) +[2023-10-14 05:47:57,283][100936] Updated weights for policy 0, policy_version 18650 (0.0009) +[2023-10-14 05:47:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 38174720. Throughput: 0: 1649.0, 1: 1680.8. Samples: 9549026. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 05:47:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:47:58,868][100917] Updated weights for policy 1, policy_version 18632 (0.0008) +[2023-10-14 05:47:59,233][100917] Updated weights for policy 1, policy_version 18642 (0.0009) +[2023-10-14 05:47:59,615][100917] Updated weights for policy 1, policy_version 18652 (0.0009) +[2023-10-14 05:48:01,492][100936] Updated weights for policy 0, policy_version 18660 (0.0008) +[2023-10-14 05:48:01,868][100936] Updated weights for policy 0, policy_version 18670 (0.0007) +[2023-10-14 05:48:02,239][100936] Updated weights for policy 0, policy_version 18680 (0.0008) +[2023-10-14 05:48:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 38240256. Throughput: 0: 1647.8, 1: 1679.0. Samples: 9568520. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 05:48:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:48:03,567][100917] Updated weights for policy 1, policy_version 18662 (0.0009) +[2023-10-14 05:48:03,933][100917] Updated weights for policy 1, policy_version 18672 (0.0010) +[2023-10-14 05:48:04,303][100917] Updated weights for policy 1, policy_version 18682 (0.0010) +[2023-10-14 05:48:06,355][100936] Updated weights for policy 0, policy_version 18690 (0.0008) +[2023-10-14 05:48:06,765][100936] Updated weights for policy 0, policy_version 18700 (0.0009) +[2023-10-14 05:48:07,132][100936] Updated weights for policy 0, policy_version 18710 (0.0008) +[2023-10-14 05:48:07,502][100936] Updated weights for policy 0, policy_version 18720 (0.0009) +[2023-10-14 05:48:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 38305792. Throughput: 0: 1654.9, 1: 1670.2. Samples: 9588570. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 05:48:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:48:08,573][100917] Updated weights for policy 1, policy_version 18692 (0.0011) +[2023-10-14 05:48:08,953][100917] Updated weights for policy 1, policy_version 18702 (0.0007) +[2023-10-14 05:48:09,326][100917] Updated weights for policy 1, policy_version 18712 (0.0007) +[2023-10-14 05:48:11,592][100936] Updated weights for policy 0, policy_version 18730 (0.0011) +[2023-10-14 05:48:11,957][100936] Updated weights for policy 0, policy_version 18740 (0.0010) +[2023-10-14 05:48:12,321][100936] Updated weights for policy 0, policy_version 18750 (0.0007) +[2023-10-14 05:48:13,306][100917] Updated weights for policy 1, policy_version 18722 (0.0007) +[2023-10-14 05:48:13,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 38371328. Throughput: 0: 1649.0, 1: 1673.1. Samples: 9598740. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 05:48:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:48:13,680][100917] Updated weights for policy 1, policy_version 18732 (0.0010) +[2023-10-14 05:48:14,044][100917] Updated weights for policy 1, policy_version 18742 (0.0008) +[2023-10-14 05:48:14,415][100917] Updated weights for policy 1, policy_version 18752 (0.0007) +[2023-10-14 05:48:16,456][100936] Updated weights for policy 0, policy_version 18760 (0.0009) +[2023-10-14 05:48:16,834][100936] Updated weights for policy 0, policy_version 18770 (0.0009) +[2023-10-14 05:48:17,214][100936] Updated weights for policy 0, policy_version 18780 (0.0008) +[2023-10-14 05:48:18,493][100917] Updated weights for policy 1, policy_version 18762 (0.0008) +[2023-10-14 05:48:18,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 38436864. Throughput: 0: 1634.2, 1: 1670.7. Samples: 9618012. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:48:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:48:18,873][100917] Updated weights for policy 1, policy_version 18772 (0.0010) +[2023-10-14 05:48:19,238][100917] Updated weights for policy 1, policy_version 18782 (0.0008) +[2023-10-14 05:48:21,502][100936] Updated weights for policy 0, policy_version 18790 (0.0009) +[2023-10-14 05:48:21,864][100936] Updated weights for policy 0, policy_version 18800 (0.0010) +[2023-10-14 05:48:22,236][100936] Updated weights for policy 0, policy_version 18810 (0.0010) +[2023-10-14 05:48:23,427][100917] Updated weights for policy 1, policy_version 18792 (0.0008) +[2023-10-14 05:48:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 38502400. Throughput: 0: 1642.2, 1: 1662.0. Samples: 9638052. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:48:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:48:23,797][100917] Updated weights for policy 1, policy_version 18802 (0.0009) +[2023-10-14 05:48:24,168][100917] Updated weights for policy 1, policy_version 18812 (0.0008) +[2023-10-14 05:48:26,269][100936] Updated weights for policy 0, policy_version 18820 (0.0007) +[2023-10-14 05:48:26,637][100936] Updated weights for policy 0, policy_version 18830 (0.0009) +[2023-10-14 05:48:27,015][100936] Updated weights for policy 0, policy_version 18840 (0.0009) +[2023-10-14 05:48:28,217][100917] Updated weights for policy 1, policy_version 18822 (0.0010) +[2023-10-14 05:48:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 38567936. Throughput: 0: 1637.6, 1: 1662.3. Samples: 9648048. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:48:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:48:28,596][100917] Updated weights for policy 1, policy_version 18832 (0.0008) +[2023-10-14 05:48:28,981][100917] Updated weights for policy 1, policy_version 18842 (0.0009) +[2023-10-14 05:48:31,152][100936] Updated weights for policy 0, policy_version 18850 (0.0010) +[2023-10-14 05:48:31,526][100936] Updated weights for policy 0, policy_version 18860 (0.0010) +[2023-10-14 05:48:31,894][100936] Updated weights for policy 0, policy_version 18870 (0.0009) +[2023-10-14 05:48:32,271][100936] Updated weights for policy 0, policy_version 18880 (0.0009) +[2023-10-14 05:48:33,028][100917] Updated weights for policy 1, policy_version 18852 (0.0009) +[2023-10-14 05:48:33,391][100917] Updated weights for policy 1, policy_version 18862 (0.0010) +[2023-10-14 05:48:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 38633472. Throughput: 0: 1639.1, 1: 1664.1. Samples: 9667566. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) +[2023-10-14 05:48:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:48:33,764][100917] Updated weights for policy 1, policy_version 18872 (0.0010) +[2023-10-14 05:48:36,614][100936] Updated weights for policy 0, policy_version 18890 (0.0007) +[2023-10-14 05:48:36,989][100936] Updated weights for policy 0, policy_version 18900 (0.0008) +[2023-10-14 05:48:37,357][100936] Updated weights for policy 0, policy_version 18910 (0.0008) +[2023-10-14 05:48:37,816][100917] Updated weights for policy 1, policy_version 18882 (0.0007) +[2023-10-14 05:48:38,200][100917] Updated weights for policy 1, policy_version 18892 (0.0009) +[2023-10-14 05:48:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 38699008. Throughput: 0: 1648.4, 1: 1660.8. Samples: 9687766. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) +[2023-10-14 05:48:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:48:38,570][100917] Updated weights for policy 1, policy_version 18902 (0.0007) +[2023-10-14 05:48:38,947][100917] Updated weights for policy 1, policy_version 18912 (0.0009) +[2023-10-14 05:48:41,563][100936] Updated weights for policy 0, policy_version 18920 (0.0008) +[2023-10-14 05:48:41,937][100936] Updated weights for policy 0, policy_version 18930 (0.0007) +[2023-10-14 05:48:42,315][100936] Updated weights for policy 0, policy_version 18940 (0.0010) +[2023-10-14 05:48:43,148][100917] Updated weights for policy 1, policy_version 18922 (0.0008) +[2023-10-14 05:48:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 38764544. Throughput: 0: 1646.6, 1: 1665.0. Samples: 9698048. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) +[2023-10-14 05:48:43,513][100917] Updated weights for policy 1, policy_version 18932 (0.0009) +[2023-10-14 05:48:43,514][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:48:43,894][100917] Updated weights for policy 1, policy_version 18942 (0.0010) +[2023-10-14 05:48:46,497][100936] Updated weights for policy 0, policy_version 18950 (0.0010) +[2023-10-14 05:48:46,866][100936] Updated weights for policy 0, policy_version 18960 (0.0010) +[2023-10-14 05:48:47,250][100936] Updated weights for policy 0, policy_version 18970 (0.0010) +[2023-10-14 05:48:48,057][100917] Updated weights for policy 1, policy_version 18952 (0.0008) +[2023-10-14 05:48:48,438][100917] Updated weights for policy 1, policy_version 18962 (0.0009) +[2023-10-14 05:48:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 38830080. Throughput: 0: 1649.5, 1: 1663.5. Samples: 9717610. Policy #0 lag: (min: 31.0, avg: 38.6, max: 63.0) +[2023-10-14 05:48:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:48:48,811][100917] Updated weights for policy 1, policy_version 18972 (0.0008) +[2023-10-14 05:48:51,491][100936] Updated weights for policy 0, policy_version 18980 (0.0009) +[2023-10-14 05:48:51,876][100936] Updated weights for policy 0, policy_version 18990 (0.0009) +[2023-10-14 05:48:52,252][100936] Updated weights for policy 0, policy_version 19000 (0.0008) +[2023-10-14 05:48:52,852][100917] Updated weights for policy 1, policy_version 18982 (0.0009) +[2023-10-14 05:48:53,224][100917] Updated weights for policy 1, policy_version 18992 (0.0008) +[2023-10-14 05:48:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 38895616. Throughput: 0: 1652.1, 1: 1657.3. Samples: 9737492. Policy #0 lag: (min: 31.0, avg: 38.6, max: 63.0) +[2023-10-14 05:48:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:48:53,603][100917] Updated weights for policy 1, policy_version 19002 (0.0007) +[2023-10-14 05:48:56,285][100936] Updated weights for policy 0, policy_version 19010 (0.0008) +[2023-10-14 05:48:56,650][100936] Updated weights for policy 0, policy_version 19020 (0.0009) +[2023-10-14 05:48:57,015][100936] Updated weights for policy 0, policy_version 19030 (0.0009) +[2023-10-14 05:48:57,387][100936] Updated weights for policy 0, policy_version 19040 (0.0008) +[2023-10-14 05:48:57,694][100917] Updated weights for policy 1, policy_version 19012 (0.0010) +[2023-10-14 05:48:58,076][100917] Updated weights for policy 1, policy_version 19022 (0.0009) +[2023-10-14 05:48:58,455][100917] Updated weights for policy 1, policy_version 19032 (0.0009) +[2023-10-14 05:48:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 38961152. Throughput: 0: 1650.8, 1: 1663.3. Samples: 9747874. Policy #0 lag: (min: 31.0, avg: 38.6, max: 63.0) +[2023-10-14 05:48:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:49:01,349][100936] Updated weights for policy 0, policy_version 19050 (0.0011) +[2023-10-14 05:49:01,717][100936] Updated weights for policy 0, policy_version 19060 (0.0010) +[2023-10-14 05:49:02,088][100936] Updated weights for policy 0, policy_version 19070 (0.0009) +[2023-10-14 05:49:02,672][100917] Updated weights for policy 1, policy_version 19042 (0.0010) +[2023-10-14 05:49:03,051][100917] Updated weights for policy 1, policy_version 19052 (0.0009) +[2023-10-14 05:49:03,431][100917] Updated weights for policy 1, policy_version 19062 (0.0009) +[2023-10-14 05:49:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 39026688. Throughput: 0: 1654.4, 1: 1664.3. Samples: 9767354. Policy #0 lag: (min: 31.0, avg: 41.5, max: 63.0) +[2023-10-14 05:49:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:49:03,813][100917] Updated weights for policy 1, policy_version 19072 (0.0009) +[2023-10-14 05:49:06,234][100936] Updated weights for policy 0, policy_version 19080 (0.0009) +[2023-10-14 05:49:06,607][100936] Updated weights for policy 0, policy_version 19090 (0.0008) +[2023-10-14 05:49:06,980][100936] Updated weights for policy 0, policy_version 19100 (0.0008) +[2023-10-14 05:49:07,912][100917] Updated weights for policy 1, policy_version 19082 (0.0008) +[2023-10-14 05:49:08,299][100917] Updated weights for policy 1, policy_version 19092 (0.0010) +[2023-10-14 05:49:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 39092224. Throughput: 0: 1656.9, 1: 1655.7. Samples: 9787122. Policy #0 lag: (min: 31.0, avg: 41.5, max: 63.0) +[2023-10-14 05:49:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:49:08,669][100917] Updated weights for policy 1, policy_version 19102 (0.0009) +[2023-10-14 05:49:11,020][100936] Updated weights for policy 0, policy_version 19110 (0.0008) +[2023-10-14 05:49:11,395][100936] Updated weights for policy 0, policy_version 19120 (0.0009) +[2023-10-14 05:49:11,767][100936] Updated weights for policy 0, policy_version 19130 (0.0009) +[2023-10-14 05:49:12,609][100917] Updated weights for policy 1, policy_version 19112 (0.0009) +[2023-10-14 05:49:12,987][100917] Updated weights for policy 1, policy_version 19122 (0.0010) +[2023-10-14 05:49:13,363][100917] Updated weights for policy 1, policy_version 19132 (0.0007) +[2023-10-14 05:49:13,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 39190528. Throughput: 0: 1650.0, 1: 1663.6. Samples: 9797160. Policy #0 lag: (min: 31.0, avg: 41.5, max: 63.0) +[2023-10-14 05:49:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:49:15,926][100936] Updated weights for policy 0, policy_version 19140 (0.0009) +[2023-10-14 05:49:16,284][100936] Updated weights for policy 0, policy_version 19150 (0.0011) +[2023-10-14 05:49:16,661][100936] Updated weights for policy 0, policy_version 19160 (0.0011) +[2023-10-14 05:49:17,482][100917] Updated weights for policy 1, policy_version 19142 (0.0008) +[2023-10-14 05:49:17,858][100917] Updated weights for policy 1, policy_version 19152 (0.0009) +[2023-10-14 05:49:18,228][100917] Updated weights for policy 1, policy_version 19162 (0.0009) +[2023-10-14 05:49:18,512][99942] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 39256064. Throughput: 0: 1657.1, 1: 1665.5. Samples: 9817084. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:49:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:49:20,776][100936] Updated weights for policy 0, policy_version 19170 (0.0009) +[2023-10-14 05:49:21,143][100936] Updated weights for policy 0, policy_version 19180 (0.0008) +[2023-10-14 05:49:21,505][100936] Updated weights for policy 0, policy_version 19190 (0.0008) +[2023-10-14 05:49:21,880][100936] Updated weights for policy 0, policy_version 19200 (0.0007) +[2023-10-14 05:49:22,406][100917] Updated weights for policy 1, policy_version 19172 (0.0008) +[2023-10-14 05:49:22,818][100917] Updated weights for policy 1, policy_version 19182 (0.0008) +[2023-10-14 05:49:23,184][100917] Updated weights for policy 1, policy_version 19192 (0.0010) +[2023-10-14 05:49:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 39321600. Throughput: 0: 1666.5, 1: 1650.8. Samples: 9837044. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:49:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:49:25,935][100936] Updated weights for policy 0, policy_version 19210 (0.0009) +[2023-10-14 05:49:26,299][100936] Updated weights for policy 0, policy_version 19220 (0.0007) +[2023-10-14 05:49:26,680][100936] Updated weights for policy 0, policy_version 19230 (0.0007) +[2023-10-14 05:49:27,354][100917] Updated weights for policy 1, policy_version 19202 (0.0010) +[2023-10-14 05:49:27,734][100917] Updated weights for policy 1, policy_version 19212 (0.0008) +[2023-10-14 05:49:28,100][100917] Updated weights for policy 1, policy_version 19222 (0.0008) +[2023-10-14 05:49:28,467][100917] Updated weights for policy 1, policy_version 19232 (0.0007) +[2023-10-14 05:49:28,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 39387136. Throughput: 0: 1647.8, 1: 1665.9. Samples: 9847162. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:49:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:49:30,732][100936] Updated weights for policy 0, policy_version 19240 (0.0010) +[2023-10-14 05:49:31,107][100936] Updated weights for policy 0, policy_version 19250 (0.0008) +[2023-10-14 05:49:31,483][100936] Updated weights for policy 0, policy_version 19260 (0.0009) +[2023-10-14 05:49:32,499][100917] Updated weights for policy 1, policy_version 19242 (0.0009) +[2023-10-14 05:49:32,875][100917] Updated weights for policy 1, policy_version 19252 (0.0009) +[2023-10-14 05:49:33,254][100917] Updated weights for policy 1, policy_version 19262 (0.0009) +[2023-10-14 05:49:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 39452672. Throughput: 0: 1662.5, 1: 1663.5. Samples: 9867278. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:49:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:49:35,551][100936] Updated weights for policy 0, policy_version 19270 (0.0008) +[2023-10-14 05:49:35,921][100936] Updated weights for policy 0, policy_version 19280 (0.0007) +[2023-10-14 05:49:36,296][100936] Updated weights for policy 0, policy_version 19290 (0.0009) +[2023-10-14 05:49:37,509][100917] Updated weights for policy 1, policy_version 19272 (0.0009) +[2023-10-14 05:49:37,882][100917] Updated weights for policy 1, policy_version 19282 (0.0009) +[2023-10-14 05:49:38,251][100917] Updated weights for policy 1, policy_version 19292 (0.0009) +[2023-10-14 05:49:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 39518208. Throughput: 0: 1667.3, 1: 1649.5. Samples: 9886746. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:49:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:49:38,525][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000019296_19759104.pth... +[2023-10-14 05:49:38,525][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000019296_19759104.pth... +[2023-10-14 05:49:38,561][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000017728_18153472.pth +[2023-10-14 05:49:38,567][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000017760_18186240.pth +[2023-10-14 05:49:40,404][100936] Updated weights for policy 0, policy_version 19300 (0.0009) +[2023-10-14 05:49:40,784][100936] Updated weights for policy 0, policy_version 19310 (0.0009) +[2023-10-14 05:49:41,157][100936] Updated weights for policy 0, policy_version 19320 (0.0009) +[2023-10-14 05:49:42,274][100917] Updated weights for policy 1, policy_version 19302 (0.0009) +[2023-10-14 05:49:42,652][100917] Updated weights for policy 1, policy_version 19312 (0.0008) +[2023-10-14 05:49:43,020][100917] Updated weights for policy 1, policy_version 19322 (0.0009) +[2023-10-14 05:49:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 39583744. Throughput: 0: 1643.8, 1: 1658.9. Samples: 9896492. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:49:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:49:45,214][100936] Updated weights for policy 0, policy_version 19330 (0.0010) +[2023-10-14 05:49:45,585][100936] Updated weights for policy 0, policy_version 19340 (0.0010) +[2023-10-14 05:49:45,960][100936] Updated weights for policy 0, policy_version 19350 (0.0010) +[2023-10-14 05:49:46,331][100936] Updated weights for policy 0, policy_version 19360 (0.0008) +[2023-10-14 05:49:47,357][100917] Updated weights for policy 1, policy_version 19332 (0.0007) +[2023-10-14 05:49:47,734][100917] Updated weights for policy 1, policy_version 19342 (0.0008) +[2023-10-14 05:49:48,099][100917] Updated weights for policy 1, policy_version 19352 (0.0010) +[2023-10-14 05:49:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 39649280. Throughput: 0: 1665.3, 1: 1652.0. Samples: 9916634. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:49:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:49:50,649][100936] Updated weights for policy 0, policy_version 19370 (0.0007) +[2023-10-14 05:49:51,031][100936] Updated weights for policy 0, policy_version 19380 (0.0007) +[2023-10-14 05:49:51,389][100936] Updated weights for policy 0, policy_version 19390 (0.0011) +[2023-10-14 05:49:52,168][100917] Updated weights for policy 1, policy_version 19362 (0.0010) +[2023-10-14 05:49:52,536][100917] Updated weights for policy 1, policy_version 19372 (0.0008) +[2023-10-14 05:49:52,912][100917] Updated weights for policy 1, policy_version 19382 (0.0007) +[2023-10-14 05:49:53,288][100917] Updated weights for policy 1, policy_version 19392 (0.0009) +[2023-10-14 05:49:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 39714816. Throughput: 0: 1669.3, 1: 1645.8. Samples: 9936302. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:49:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:49:55,511][100936] Updated weights for policy 0, policy_version 19400 (0.0008) +[2023-10-14 05:49:55,893][100936] Updated weights for policy 0, policy_version 19410 (0.0007) +[2023-10-14 05:49:56,260][100936] Updated weights for policy 0, policy_version 19420 (0.0009) +[2023-10-14 05:49:57,478][100917] Updated weights for policy 1, policy_version 19402 (0.0008) +[2023-10-14 05:49:57,856][100917] Updated weights for policy 1, policy_version 19412 (0.0007) +[2023-10-14 05:49:58,224][100917] Updated weights for policy 1, policy_version 19422 (0.0007) +[2023-10-14 05:49:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 39780352. Throughput: 0: 1652.8, 1: 1653.1. Samples: 9945924. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:49:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:50:00,417][100936] Updated weights for policy 0, policy_version 19430 (0.0008) +[2023-10-14 05:50:00,785][100936] Updated weights for policy 0, policy_version 19440 (0.0008) +[2023-10-14 05:50:01,158][100936] Updated weights for policy 0, policy_version 19450 (0.0008) +[2023-10-14 05:50:02,337][100917] Updated weights for policy 1, policy_version 19432 (0.0008) +[2023-10-14 05:50:02,723][100917] Updated weights for policy 1, policy_version 19442 (0.0007) +[2023-10-14 05:50:03,092][100917] Updated weights for policy 1, policy_version 19452 (0.0007) +[2023-10-14 05:50:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 39845888. Throughput: 0: 1662.7, 1: 1650.5. Samples: 9966178. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:50:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:50:05,445][100936] Updated weights for policy 0, policy_version 19460 (0.0008) +[2023-10-14 05:50:05,819][100936] Updated weights for policy 0, policy_version 19470 (0.0007) +[2023-10-14 05:50:06,181][100936] Updated weights for policy 0, policy_version 19480 (0.0011) +[2023-10-14 05:50:07,314][100917] Updated weights for policy 1, policy_version 19462 (0.0012) +[2023-10-14 05:50:07,687][100917] Updated weights for policy 1, policy_version 19472 (0.0008) +[2023-10-14 05:50:08,064][100917] Updated weights for policy 1, policy_version 19482 (0.0009) +[2023-10-14 05:50:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 39911424. Throughput: 0: 1655.8, 1: 1646.6. Samples: 9985654. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:50:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:50:10,220][100936] Updated weights for policy 0, policy_version 19490 (0.0011) +[2023-10-14 05:50:10,591][100936] Updated weights for policy 0, policy_version 19500 (0.0009) +[2023-10-14 05:50:10,956][100936] Updated weights for policy 0, policy_version 19510 (0.0008) +[2023-10-14 05:50:11,330][100936] Updated weights for policy 0, policy_version 19520 (0.0009) +[2023-10-14 05:50:12,181][100917] Updated weights for policy 1, policy_version 19492 (0.0008) +[2023-10-14 05:50:12,569][100917] Updated weights for policy 1, policy_version 19502 (0.0007) +[2023-10-14 05:50:12,949][100917] Updated weights for policy 1, policy_version 19512 (0.0008) +[2023-10-14 05:50:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 39976960. Throughput: 0: 1645.9, 1: 1650.1. Samples: 9995482. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:50:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:50:15,188][100936] Updated weights for policy 0, policy_version 19530 (0.0008) +[2023-10-14 05:50:15,557][100936] Updated weights for policy 0, policy_version 19540 (0.0009) +[2023-10-14 05:50:15,939][100936] Updated weights for policy 0, policy_version 19550 (0.0007) +[2023-10-14 05:50:17,018][100917] Updated weights for policy 1, policy_version 19522 (0.0009) +[2023-10-14 05:50:17,390][100917] Updated weights for policy 1, policy_version 19532 (0.0011) +[2023-10-14 05:50:17,761][100917] Updated weights for policy 1, policy_version 19542 (0.0008) +[2023-10-14 05:50:18,141][100917] Updated weights for policy 1, policy_version 19552 (0.0010) +[2023-10-14 05:50:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 40042496. Throughput: 0: 1661.9, 1: 1647.1. Samples: 10016184. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:50:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:50:19,940][100936] Updated weights for policy 0, policy_version 19560 (0.0007) +[2023-10-14 05:50:20,312][100936] Updated weights for policy 0, policy_version 19570 (0.0007) +[2023-10-14 05:50:20,676][100936] Updated weights for policy 0, policy_version 19580 (0.0008) +[2023-10-14 05:50:22,308][100917] Updated weights for policy 1, policy_version 19562 (0.0009) +[2023-10-14 05:50:22,681][100917] Updated weights for policy 1, policy_version 19572 (0.0009) +[2023-10-14 05:50:23,058][100917] Updated weights for policy 1, policy_version 19582 (0.0008) +[2023-10-14 05:50:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 40108032. Throughput: 0: 1668.3, 1: 1645.9. Samples: 10035886. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:50:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:50:24,903][100936] Updated weights for policy 0, policy_version 19590 (0.0008) +[2023-10-14 05:50:25,279][100936] Updated weights for policy 0, policy_version 19600 (0.0007) +[2023-10-14 05:50:25,655][100936] Updated weights for policy 0, policy_version 19610 (0.0007) +[2023-10-14 05:50:27,195][100917] Updated weights for policy 1, policy_version 19592 (0.0010) +[2023-10-14 05:50:27,575][100917] Updated weights for policy 1, policy_version 19602 (0.0009) +[2023-10-14 05:50:27,953][100917] Updated weights for policy 1, policy_version 19612 (0.0008) +[2023-10-14 05:50:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 40173568. Throughput: 0: 1665.6, 1: 1650.9. Samples: 10045732. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:50:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:50:29,854][100936] Updated weights for policy 0, policy_version 19620 (0.0009) +[2023-10-14 05:50:30,253][100936] Updated weights for policy 0, policy_version 19630 (0.0008) +[2023-10-14 05:50:30,624][100936] Updated weights for policy 0, policy_version 19640 (0.0009) +[2023-10-14 05:50:31,996][100917] Updated weights for policy 1, policy_version 19622 (0.0007) +[2023-10-14 05:50:32,365][100917] Updated weights for policy 1, policy_version 19632 (0.0007) +[2023-10-14 05:50:32,749][100917] Updated weights for policy 1, policy_version 19642 (0.0008) +[2023-10-14 05:50:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 40239104. Throughput: 0: 1662.5, 1: 1654.6. Samples: 10065902. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 05:50:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:50:34,690][100936] Updated weights for policy 0, policy_version 19650 (0.0008) +[2023-10-14 05:50:35,064][100936] Updated weights for policy 0, policy_version 19660 (0.0009) +[2023-10-14 05:50:35,431][100936] Updated weights for policy 0, policy_version 19670 (0.0008) +[2023-10-14 05:50:35,795][100936] Updated weights for policy 0, policy_version 19680 (0.0008) +[2023-10-14 05:50:36,998][100917] Updated weights for policy 1, policy_version 19652 (0.0008) +[2023-10-14 05:50:37,371][100917] Updated weights for policy 1, policy_version 19662 (0.0009) +[2023-10-14 05:50:37,752][100917] Updated weights for policy 1, policy_version 19672 (0.0010) +[2023-10-14 05:50:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 40304640. Throughput: 0: 1659.4, 1: 1648.6. Samples: 10085164. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 05:50:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:50:40,135][100936] Updated weights for policy 0, policy_version 19690 (0.0009) +[2023-10-14 05:50:40,506][100936] Updated weights for policy 0, policy_version 19700 (0.0007) +[2023-10-14 05:50:40,875][100936] Updated weights for policy 0, policy_version 19710 (0.0009) +[2023-10-14 05:50:41,935][100917] Updated weights for policy 1, policy_version 19682 (0.0007) +[2023-10-14 05:50:42,313][100917] Updated weights for policy 1, policy_version 19692 (0.0007) +[2023-10-14 05:50:42,689][100917] Updated weights for policy 1, policy_version 19702 (0.0009) +[2023-10-14 05:50:43,059][100917] Updated weights for policy 1, policy_version 19712 (0.0011) +[2023-10-14 05:50:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 40370176. Throughput: 0: 1656.2, 1: 1659.8. Samples: 10095142. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 05:50:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:50:44,980][100936] Updated weights for policy 0, policy_version 19720 (0.0009) +[2023-10-14 05:50:45,349][100936] Updated weights for policy 0, policy_version 19730 (0.0009) +[2023-10-14 05:50:45,722][100936] Updated weights for policy 0, policy_version 19740 (0.0008) +[2023-10-14 05:50:47,134][100917] Updated weights for policy 1, policy_version 19722 (0.0011) +[2023-10-14 05:50:47,512][100917] Updated weights for policy 1, policy_version 19732 (0.0009) +[2023-10-14 05:50:47,882][100917] Updated weights for policy 1, policy_version 19742 (0.0010) +[2023-10-14 05:50:48,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 40435712. Throughput: 0: 1662.3, 1: 1653.9. Samples: 10115412. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:50:48,514][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:50:49,929][100936] Updated weights for policy 0, policy_version 19750 (0.0008) +[2023-10-14 05:50:50,293][100936] Updated weights for policy 0, policy_version 19760 (0.0009) +[2023-10-14 05:50:50,667][100936] Updated weights for policy 0, policy_version 19770 (0.0009) +[2023-10-14 05:50:52,023][100917] Updated weights for policy 1, policy_version 19752 (0.0009) +[2023-10-14 05:50:52,402][100917] Updated weights for policy 1, policy_version 19762 (0.0007) +[2023-10-14 05:50:52,775][100917] Updated weights for policy 1, policy_version 19772 (0.0009) +[2023-10-14 05:50:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 40501248. Throughput: 0: 1658.3, 1: 1653.7. Samples: 10134696. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:50:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:50:54,773][100936] Updated weights for policy 0, policy_version 19780 (0.0009) +[2023-10-14 05:50:55,147][100936] Updated weights for policy 0, policy_version 19790 (0.0010) +[2023-10-14 05:50:55,524][100936] Updated weights for policy 0, policy_version 19800 (0.0009) +[2023-10-14 05:50:56,899][100917] Updated weights for policy 1, policy_version 19782 (0.0008) +[2023-10-14 05:50:57,288][100917] Updated weights for policy 1, policy_version 19792 (0.0010) +[2023-10-14 05:50:57,667][100917] Updated weights for policy 1, policy_version 19802 (0.0009) +[2023-10-14 05:50:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 40566784. Throughput: 0: 1661.0, 1: 1661.7. Samples: 10145002. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:50:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:50:59,697][100936] Updated weights for policy 0, policy_version 19810 (0.0007) +[2023-10-14 05:51:00,065][100936] Updated weights for policy 0, policy_version 19820 (0.0010) +[2023-10-14 05:51:00,431][100936] Updated weights for policy 0, policy_version 19830 (0.0011) +[2023-10-14 05:51:00,806][100936] Updated weights for policy 0, policy_version 19840 (0.0009) +[2023-10-14 05:51:01,708][100917] Updated weights for policy 1, policy_version 19812 (0.0009) +[2023-10-14 05:51:02,088][100917] Updated weights for policy 1, policy_version 19822 (0.0011) +[2023-10-14 05:51:02,456][100917] Updated weights for policy 1, policy_version 19832 (0.0009) +[2023-10-14 05:51:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 40632320. Throughput: 0: 1652.6, 1: 1653.1. Samples: 10164938. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:51:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:51:04,887][100936] Updated weights for policy 0, policy_version 19850 (0.0011) +[2023-10-14 05:51:05,255][100936] Updated weights for policy 0, policy_version 19860 (0.0011) +[2023-10-14 05:51:05,640][100936] Updated weights for policy 0, policy_version 19870 (0.0009) +[2023-10-14 05:51:06,608][100917] Updated weights for policy 1, policy_version 19842 (0.0009) +[2023-10-14 05:51:06,981][100917] Updated weights for policy 1, policy_version 19852 (0.0009) +[2023-10-14 05:51:07,348][100917] Updated weights for policy 1, policy_version 19862 (0.0010) +[2023-10-14 05:51:07,717][100917] Updated weights for policy 1, policy_version 19872 (0.0008) +[2023-10-14 05:51:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 40697856. Throughput: 0: 1649.4, 1: 1655.5. Samples: 10184606. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:51:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:51:09,660][100936] Updated weights for policy 0, policy_version 19880 (0.0008) +[2023-10-14 05:51:10,030][100936] Updated weights for policy 0, policy_version 19890 (0.0008) +[2023-10-14 05:51:10,402][100936] Updated weights for policy 0, policy_version 19900 (0.0008) +[2023-10-14 05:51:11,806][100917] Updated weights for policy 1, policy_version 19882 (0.0008) +[2023-10-14 05:51:12,173][100917] Updated weights for policy 1, policy_version 19892 (0.0009) +[2023-10-14 05:51:12,548][100917] Updated weights for policy 1, policy_version 19902 (0.0011) +[2023-10-14 05:51:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 40763392. Throughput: 0: 1652.2, 1: 1662.1. Samples: 10194876. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:51:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:51:14,586][100936] Updated weights for policy 0, policy_version 19910 (0.0008) +[2023-10-14 05:51:14,948][100936] Updated weights for policy 0, policy_version 19920 (0.0009) +[2023-10-14 05:51:15,319][100936] Updated weights for policy 0, policy_version 19930 (0.0007) +[2023-10-14 05:51:16,649][100917] Updated weights for policy 1, policy_version 19912 (0.0008) +[2023-10-14 05:51:17,015][100917] Updated weights for policy 1, policy_version 19922 (0.0009) +[2023-10-14 05:51:17,394][100917] Updated weights for policy 1, policy_version 19932 (0.0007) +[2023-10-14 05:51:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 40828928. Throughput: 0: 1658.2, 1: 1648.8. Samples: 10214716. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:51:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:51:19,498][100936] Updated weights for policy 0, policy_version 19940 (0.0008) +[2023-10-14 05:51:19,877][100936] Updated weights for policy 0, policy_version 19950 (0.0008) +[2023-10-14 05:51:20,245][100936] Updated weights for policy 0, policy_version 19960 (0.0009) +[2023-10-14 05:51:21,417][100917] Updated weights for policy 1, policy_version 19942 (0.0009) +[2023-10-14 05:51:21,785][100917] Updated weights for policy 1, policy_version 19952 (0.0009) +[2023-10-14 05:51:22,163][100917] Updated weights for policy 1, policy_version 19962 (0.0009) +[2023-10-14 05:51:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 40894464. Throughput: 0: 1660.5, 1: 1664.0. Samples: 10234764. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:51:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:51:24,242][100936] Updated weights for policy 0, policy_version 19970 (0.0011) +[2023-10-14 05:51:24,625][100936] Updated weights for policy 0, policy_version 19980 (0.0007) +[2023-10-14 05:51:24,985][100936] Updated weights for policy 0, policy_version 19990 (0.0007) +[2023-10-14 05:51:25,357][100936] Updated weights for policy 0, policy_version 20000 (0.0009) +[2023-10-14 05:51:26,417][100917] Updated weights for policy 1, policy_version 19972 (0.0009) +[2023-10-14 05:51:26,783][100917] Updated weights for policy 1, policy_version 19982 (0.0010) +[2023-10-14 05:51:27,152][100917] Updated weights for policy 1, policy_version 19992 (0.0008) +[2023-10-14 05:51:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 40960000. Throughput: 0: 1664.0, 1: 1663.9. Samples: 10244894. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:51:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:51:29,613][100936] Updated weights for policy 0, policy_version 20010 (0.0009) +[2023-10-14 05:51:29,980][100936] Updated weights for policy 0, policy_version 20020 (0.0008) +[2023-10-14 05:51:30,342][100936] Updated weights for policy 0, policy_version 20030 (0.0009) +[2023-10-14 05:51:31,358][100917] Updated weights for policy 1, policy_version 20002 (0.0008) +[2023-10-14 05:51:31,734][100917] Updated weights for policy 1, policy_version 20012 (0.0010) +[2023-10-14 05:51:32,113][100917] Updated weights for policy 1, policy_version 20022 (0.0007) +[2023-10-14 05:51:32,490][100917] Updated weights for policy 1, policy_version 20032 (0.0008) +[2023-10-14 05:51:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 41025536. Throughput: 0: 1662.9, 1: 1650.6. Samples: 10264516. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:51:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:51:34,258][100936] Updated weights for policy 0, policy_version 20040 (0.0010) +[2023-10-14 05:51:34,631][100936] Updated weights for policy 0, policy_version 20050 (0.0007) +[2023-10-14 05:51:35,005][100936] Updated weights for policy 0, policy_version 20060 (0.0009) +[2023-10-14 05:51:36,608][100917] Updated weights for policy 1, policy_version 20042 (0.0009) +[2023-10-14 05:51:36,977][100917] Updated weights for policy 1, policy_version 20052 (0.0009) +[2023-10-14 05:51:37,348][100917] Updated weights for policy 1, policy_version 20062 (0.0010) +[2023-10-14 05:51:38,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 41091072. Throughput: 0: 1669.5, 1: 1657.3. Samples: 10284406. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:51:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:51:38,526][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000020064_20545536.pth... +[2023-10-14 05:51:38,527][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000020064_20545536.pth... +[2023-10-14 05:51:38,560][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000018528_18972672.pth +[2023-10-14 05:51:38,564][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000018496_18939904.pth +[2023-10-14 05:51:39,180][100936] Updated weights for policy 0, policy_version 20070 (0.0007) +[2023-10-14 05:51:39,548][100936] Updated weights for policy 0, policy_version 20080 (0.0008) +[2023-10-14 05:51:39,916][100936] Updated weights for policy 0, policy_version 20090 (0.0008) +[2023-10-14 05:51:41,389][100917] Updated weights for policy 1, policy_version 20072 (0.0009) +[2023-10-14 05:51:41,766][100917] Updated weights for policy 1, policy_version 20082 (0.0007) +[2023-10-14 05:51:42,138][100917] Updated weights for policy 1, policy_version 20092 (0.0010) +[2023-10-14 05:51:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 41156608. Throughput: 0: 1665.6, 1: 1659.0. Samples: 10294606. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:51:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:51:44,041][100936] Updated weights for policy 0, policy_version 20100 (0.0010) +[2023-10-14 05:51:44,409][100936] Updated weights for policy 0, policy_version 20110 (0.0008) +[2023-10-14 05:51:44,775][100936] Updated weights for policy 0, policy_version 20120 (0.0008) +[2023-10-14 05:51:46,240][100917] Updated weights for policy 1, policy_version 20102 (0.0009) +[2023-10-14 05:51:46,606][100917] Updated weights for policy 1, policy_version 20112 (0.0009) +[2023-10-14 05:51:46,973][100917] Updated weights for policy 1, policy_version 20122 (0.0008) +[2023-10-14 05:51:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 41222144. Throughput: 0: 1666.0, 1: 1645.6. Samples: 10313964. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:51:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:51:48,759][100936] Updated weights for policy 0, policy_version 20130 (0.0009) +[2023-10-14 05:51:49,132][100936] Updated weights for policy 0, policy_version 20140 (0.0008) +[2023-10-14 05:51:49,500][100936] Updated weights for policy 0, policy_version 20150 (0.0008) +[2023-10-14 05:51:49,877][100936] Updated weights for policy 0, policy_version 20160 (0.0008) +[2023-10-14 05:51:51,083][100917] Updated weights for policy 1, policy_version 20132 (0.0010) +[2023-10-14 05:51:51,450][100917] Updated weights for policy 1, policy_version 20142 (0.0010) +[2023-10-14 05:51:51,820][100917] Updated weights for policy 1, policy_version 20152 (0.0010) +[2023-10-14 05:51:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 41287680. Throughput: 0: 1663.4, 1: 1658.2. Samples: 10334076. Policy #0 lag: (min: 31.0, avg: 31.3, max: 42.0) +[2023-10-14 05:51:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:51:54,043][100936] Updated weights for policy 0, policy_version 20170 (0.0007) +[2023-10-14 05:51:54,417][100936] Updated weights for policy 0, policy_version 20180 (0.0011) +[2023-10-14 05:51:54,793][100936] Updated weights for policy 0, policy_version 20190 (0.0010) +[2023-10-14 05:51:55,915][100917] Updated weights for policy 1, policy_version 20162 (0.0008) +[2023-10-14 05:51:56,278][100917] Updated weights for policy 1, policy_version 20172 (0.0010) +[2023-10-14 05:51:56,658][100917] Updated weights for policy 1, policy_version 20182 (0.0010) +[2023-10-14 05:51:57,027][100917] Updated weights for policy 1, policy_version 20192 (0.0010) +[2023-10-14 05:51:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 41353216. Throughput: 0: 1663.9, 1: 1654.8. Samples: 10344220. Policy #0 lag: (min: 31.0, avg: 31.3, max: 42.0) +[2023-10-14 05:51:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:51:58,917][100936] Updated weights for policy 0, policy_version 20200 (0.0009) +[2023-10-14 05:51:59,299][100936] Updated weights for policy 0, policy_version 20210 (0.0008) +[2023-10-14 05:51:59,671][100936] Updated weights for policy 0, policy_version 20220 (0.0008) +[2023-10-14 05:52:01,143][100917] Updated weights for policy 1, policy_version 20202 (0.0010) +[2023-10-14 05:52:01,520][100917] Updated weights for policy 1, policy_version 20212 (0.0008) +[2023-10-14 05:52:01,897][100917] Updated weights for policy 1, policy_version 20222 (0.0009) +[2023-10-14 05:52:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 41418752. Throughput: 0: 1665.5, 1: 1646.5. Samples: 10363754. Policy #0 lag: (min: 31.0, avg: 31.3, max: 42.0) +[2023-10-14 05:52:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:52:03,678][100936] Updated weights for policy 0, policy_version 20230 (0.0009) +[2023-10-14 05:52:04,061][100936] Updated weights for policy 0, policy_version 20240 (0.0010) +[2023-10-14 05:52:04,436][100936] Updated weights for policy 0, policy_version 20250 (0.0007) +[2023-10-14 05:52:06,053][100917] Updated weights for policy 1, policy_version 20232 (0.0008) +[2023-10-14 05:52:06,424][100917] Updated weights for policy 1, policy_version 20242 (0.0007) +[2023-10-14 05:52:06,803][100917] Updated weights for policy 1, policy_version 20252 (0.0009) +[2023-10-14 05:52:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 41484288. Throughput: 0: 1666.4, 1: 1651.5. Samples: 10384070. Policy #0 lag: (min: 31.0, avg: 31.3, max: 42.0) +[2023-10-14 05:52:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:52:08,614][100936] Updated weights for policy 0, policy_version 20260 (0.0008) +[2023-10-14 05:52:09,002][100936] Updated weights for policy 0, policy_version 20270 (0.0007) +[2023-10-14 05:52:09,378][100936] Updated weights for policy 0, policy_version 20280 (0.0007) +[2023-10-14 05:52:10,956][100917] Updated weights for policy 1, policy_version 20262 (0.0009) +[2023-10-14 05:52:11,323][100917] Updated weights for policy 1, policy_version 20272 (0.0009) +[2023-10-14 05:52:11,700][100917] Updated weights for policy 1, policy_version 20282 (0.0010) +[2023-10-14 05:52:13,510][100936] Updated weights for policy 0, policy_version 20290 (0.0009) +[2023-10-14 05:52:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 41549824. Throughput: 0: 1663.5, 1: 1646.9. Samples: 10393862. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 05:52:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 05:52:13,876][100936] Updated weights for policy 0, policy_version 20300 (0.0010) +[2023-10-14 05:52:14,252][100936] Updated weights for policy 0, policy_version 20310 (0.0007) +[2023-10-14 05:52:14,628][100936] Updated weights for policy 0, policy_version 20320 (0.0008) +[2023-10-14 05:52:15,871][100917] Updated weights for policy 1, policy_version 20292 (0.0008) +[2023-10-14 05:52:16,245][100917] Updated weights for policy 1, policy_version 20302 (0.0009) +[2023-10-14 05:52:16,620][100917] Updated weights for policy 1, policy_version 20312 (0.0008) +[2023-10-14 05:52:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 41615360. Throughput: 0: 1664.7, 1: 1638.3. Samples: 10413152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 05:52:18,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 05:52:18,768][100936] Updated weights for policy 0, policy_version 20330 (0.0007) +[2023-10-14 05:52:19,133][100936] Updated weights for policy 0, policy_version 20340 (0.0007) +[2023-10-14 05:52:19,510][100936] Updated weights for policy 0, policy_version 20350 (0.0007) +[2023-10-14 05:52:20,577][100917] Updated weights for policy 1, policy_version 20322 (0.0009) +[2023-10-14 05:52:20,947][100917] Updated weights for policy 1, policy_version 20332 (0.0008) +[2023-10-14 05:52:21,320][100917] Updated weights for policy 1, policy_version 20342 (0.0010) +[2023-10-14 05:52:21,693][100917] Updated weights for policy 1, policy_version 20352 (0.0010) +[2023-10-14 05:52:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 41680896. Throughput: 0: 1660.5, 1: 1661.4. Samples: 10433888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 05:52:23,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 05:52:23,607][100936] Updated weights for policy 0, policy_version 20360 (0.0010) +[2023-10-14 05:52:23,980][100936] Updated weights for policy 0, policy_version 20370 (0.0009) +[2023-10-14 05:52:24,354][100936] Updated weights for policy 0, policy_version 20380 (0.0009) +[2023-10-14 05:52:25,699][100917] Updated weights for policy 1, policy_version 20362 (0.0007) +[2023-10-14 05:52:26,076][100917] Updated weights for policy 1, policy_version 20372 (0.0007) +[2023-10-14 05:52:26,443][100917] Updated weights for policy 1, policy_version 20382 (0.0008) +[2023-10-14 05:52:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 41746432. Throughput: 0: 1666.1, 1: 1647.2. Samples: 10443702. Policy #0 lag: (min: 18.0, avg: 20.3, max: 41.0) +[2023-10-14 05:52:28,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 05:52:28,644][100936] Updated weights for policy 0, policy_version 20390 (0.0010) +[2023-10-14 05:52:29,016][100936] Updated weights for policy 0, policy_version 20400 (0.0007) +[2023-10-14 05:52:29,391][100936] Updated weights for policy 0, policy_version 20410 (0.0008) +[2023-10-14 05:52:30,499][100917] Updated weights for policy 1, policy_version 20392 (0.0010) +[2023-10-14 05:52:30,871][100917] Updated weights for policy 1, policy_version 20402 (0.0009) +[2023-10-14 05:52:31,241][100917] Updated weights for policy 1, policy_version 20412 (0.0009) +[2023-10-14 05:52:33,499][100936] Updated weights for policy 0, policy_version 20420 (0.0009) +[2023-10-14 05:52:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 41811968. Throughput: 0: 1667.6, 1: 1657.2. Samples: 10463582. Policy #0 lag: (min: 18.0, avg: 20.3, max: 41.0) +[2023-10-14 05:52:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 05:52:33,874][100936] Updated weights for policy 0, policy_version 20430 (0.0008) +[2023-10-14 05:52:34,258][100936] Updated weights for policy 0, policy_version 20440 (0.0011) +[2023-10-14 05:52:35,601][100917] Updated weights for policy 1, policy_version 20422 (0.0008) +[2023-10-14 05:52:35,987][100917] Updated weights for policy 1, policy_version 20432 (0.0007) +[2023-10-14 05:52:36,363][100917] Updated weights for policy 1, policy_version 20442 (0.0008) +[2023-10-14 05:52:38,410][100936] Updated weights for policy 0, policy_version 20450 (0.0010) +[2023-10-14 05:52:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 41877504. Throughput: 0: 1662.4, 1: 1657.1. Samples: 10483450. Policy #0 lag: (min: 18.0, avg: 20.3, max: 41.0) +[2023-10-14 05:52:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 05:52:38,783][100936] Updated weights for policy 0, policy_version 20460 (0.0011) +[2023-10-14 05:52:39,159][100936] Updated weights for policy 0, policy_version 20470 (0.0009) +[2023-10-14 05:52:39,528][100936] Updated weights for policy 0, policy_version 20480 (0.0009) +[2023-10-14 05:52:40,628][100917] Updated weights for policy 1, policy_version 20452 (0.0009) +[2023-10-14 05:52:41,009][100917] Updated weights for policy 1, policy_version 20462 (0.0010) +[2023-10-14 05:52:41,386][100917] Updated weights for policy 1, policy_version 20472 (0.0007) +[2023-10-14 05:52:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 41943040. Throughput: 0: 1660.4, 1: 1648.5. Samples: 10493120. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:52:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 05:52:43,781][100936] Updated weights for policy 0, policy_version 20490 (0.0007) +[2023-10-14 05:52:44,153][100936] Updated weights for policy 0, policy_version 20500 (0.0007) +[2023-10-14 05:52:44,525][100936] Updated weights for policy 0, policy_version 20510 (0.0009) +[2023-10-14 05:52:45,400][100917] Updated weights for policy 1, policy_version 20482 (0.0008) +[2023-10-14 05:52:45,774][100917] Updated weights for policy 1, policy_version 20492 (0.0007) +[2023-10-14 05:52:46,138][100917] Updated weights for policy 1, policy_version 20502 (0.0009) +[2023-10-14 05:52:46,506][100917] Updated weights for policy 1, policy_version 20512 (0.0009) +[2023-10-14 05:52:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 42008576. Throughput: 0: 1654.6, 1: 1658.0. Samples: 10512820. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:52:48,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 05:52:48,714][100936] Updated weights for policy 0, policy_version 20520 (0.0008) +[2023-10-14 05:52:49,083][100936] Updated weights for policy 0, policy_version 20530 (0.0010) +[2023-10-14 05:52:49,459][100936] Updated weights for policy 0, policy_version 20540 (0.0008) +[2023-10-14 05:52:50,552][100917] Updated weights for policy 1, policy_version 20522 (0.0010) +[2023-10-14 05:52:50,934][100917] Updated weights for policy 1, policy_version 20532 (0.0010) +[2023-10-14 05:52:51,317][100917] Updated weights for policy 1, policy_version 20542 (0.0008) +[2023-10-14 05:52:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 42074112. Throughput: 0: 1650.6, 1: 1667.2. Samples: 10533374. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:52:53,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 05:52:53,586][100936] Updated weights for policy 0, policy_version 20550 (0.0008) +[2023-10-14 05:52:53,968][100936] Updated weights for policy 0, policy_version 20560 (0.0008) +[2023-10-14 05:52:54,334][100936] Updated weights for policy 0, policy_version 20570 (0.0007) +[2023-10-14 05:52:55,474][100917] Updated weights for policy 1, policy_version 20552 (0.0008) +[2023-10-14 05:52:55,844][100917] Updated weights for policy 1, policy_version 20562 (0.0010) +[2023-10-14 05:52:56,223][100917] Updated weights for policy 1, policy_version 20572 (0.0008) +[2023-10-14 05:52:58,243][100936] Updated weights for policy 0, policy_version 20580 (0.0009) +[2023-10-14 05:52:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 42139648. Throughput: 0: 1656.7, 1: 1654.4. Samples: 10542864. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:52:58,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 05:52:58,611][100936] Updated weights for policy 0, policy_version 20590 (0.0007) +[2023-10-14 05:52:58,989][100936] Updated weights for policy 0, policy_version 20600 (0.0007) +[2023-10-14 05:53:00,313][100917] Updated weights for policy 1, policy_version 20582 (0.0008) +[2023-10-14 05:53:00,681][100917] Updated weights for policy 1, policy_version 20592 (0.0009) +[2023-10-14 05:53:01,056][100917] Updated weights for policy 1, policy_version 20602 (0.0008) +[2023-10-14 05:53:03,067][100936] Updated weights for policy 0, policy_version 20610 (0.0009) +[2023-10-14 05:53:03,442][100936] Updated weights for policy 0, policy_version 20620 (0.0009) +[2023-10-14 05:53:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 42205184. Throughput: 0: 1658.4, 1: 1666.9. Samples: 10562794. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:53:03,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 05:53:03,815][100936] Updated weights for policy 0, policy_version 20630 (0.0010) +[2023-10-14 05:53:04,193][100936] Updated weights for policy 0, policy_version 20640 (0.0011) +[2023-10-14 05:53:05,177][100917] Updated weights for policy 1, policy_version 20612 (0.0009) +[2023-10-14 05:53:05,553][100917] Updated weights for policy 1, policy_version 20622 (0.0008) +[2023-10-14 05:53:05,928][100917] Updated weights for policy 1, policy_version 20632 (0.0009) +[2023-10-14 05:53:08,216][100936] Updated weights for policy 0, policy_version 20650 (0.0010) +[2023-10-14 05:53:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 42270720. Throughput: 0: 1645.5, 1: 1655.2. Samples: 10582420. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:53:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 05:53:08,583][100936] Updated weights for policy 0, policy_version 20660 (0.0011) +[2023-10-14 05:53:08,954][100936] Updated weights for policy 0, policy_version 20670 (0.0010) +[2023-10-14 05:53:10,084][100917] Updated weights for policy 1, policy_version 20642 (0.0009) +[2023-10-14 05:53:10,460][100917] Updated weights for policy 1, policy_version 20652 (0.0009) +[2023-10-14 05:53:10,834][100917] Updated weights for policy 1, policy_version 20662 (0.0010) +[2023-10-14 05:53:11,211][100917] Updated weights for policy 1, policy_version 20672 (0.0007) +[2023-10-14 05:53:13,118][100936] Updated weights for policy 0, policy_version 20680 (0.0008) +[2023-10-14 05:53:13,501][100936] Updated weights for policy 0, policy_version 20690 (0.0011) +[2023-10-14 05:53:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 42336256. Throughput: 0: 1655.7, 1: 1646.4. Samples: 10592300. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:53:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 05:53:13,867][100936] Updated weights for policy 0, policy_version 20700 (0.0009) +[2023-10-14 05:53:15,263][100917] Updated weights for policy 1, policy_version 20682 (0.0007) +[2023-10-14 05:53:15,640][100917] Updated weights for policy 1, policy_version 20692 (0.0007) +[2023-10-14 05:53:16,016][100917] Updated weights for policy 1, policy_version 20702 (0.0007) +[2023-10-14 05:53:18,108][100936] Updated weights for policy 0, policy_version 20710 (0.0009) +[2023-10-14 05:53:18,479][100936] Updated weights for policy 0, policy_version 20720 (0.0009) +[2023-10-14 05:53:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 42401792. Throughput: 0: 1652.0, 1: 1655.7. Samples: 10612426. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:53:18,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 05:53:18,854][100936] Updated weights for policy 0, policy_version 20730 (0.0008) +[2023-10-14 05:53:20,183][100917] Updated weights for policy 1, policy_version 20712 (0.0008) +[2023-10-14 05:53:20,566][100917] Updated weights for policy 1, policy_version 20722 (0.0008) +[2023-10-14 05:53:20,942][100917] Updated weights for policy 1, policy_version 20732 (0.0007) +[2023-10-14 05:53:22,730][100936] Updated weights for policy 0, policy_version 20740 (0.0008) +[2023-10-14 05:53:23,106][100936] Updated weights for policy 0, policy_version 20750 (0.0007) +[2023-10-14 05:53:23,479][100936] Updated weights for policy 0, policy_version 20760 (0.0007) +[2023-10-14 05:53:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 42467328. Throughput: 0: 1642.5, 1: 1665.1. Samples: 10632292. Policy #0 lag: (min: 15.0, avg: 30.5, max: 32.0) +[2023-10-14 05:53:23,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 05:53:25,167][100917] Updated weights for policy 1, policy_version 20742 (0.0009) +[2023-10-14 05:53:25,540][100917] Updated weights for policy 1, policy_version 20752 (0.0009) +[2023-10-14 05:53:25,916][100917] Updated weights for policy 1, policy_version 20762 (0.0007) +[2023-10-14 05:53:27,625][100936] Updated weights for policy 0, policy_version 20770 (0.0007) +[2023-10-14 05:53:27,999][100936] Updated weights for policy 0, policy_version 20780 (0.0008) +[2023-10-14 05:53:28,358][100936] Updated weights for policy 0, policy_version 20790 (0.0007) +[2023-10-14 05:53:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 42532864. Throughput: 0: 1661.8, 1: 1651.7. Samples: 10642230. Policy #0 lag: (min: 15.0, avg: 30.5, max: 32.0) +[2023-10-14 05:53:28,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:53:28,734][100936] Updated weights for policy 0, policy_version 20800 (0.0007) +[2023-10-14 05:53:30,113][100917] Updated weights for policy 1, policy_version 20772 (0.0010) +[2023-10-14 05:53:30,486][100917] Updated weights for policy 1, policy_version 20782 (0.0009) +[2023-10-14 05:53:30,862][100917] Updated weights for policy 1, policy_version 20792 (0.0010) +[2023-10-14 05:53:32,775][100936] Updated weights for policy 0, policy_version 20810 (0.0007) +[2023-10-14 05:53:33,138][100936] Updated weights for policy 0, policy_version 20820 (0.0007) +[2023-10-14 05:53:33,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 42598400. Throughput: 0: 1668.4, 1: 1659.6. Samples: 10662582. Policy #0 lag: (min: 15.0, avg: 30.5, max: 32.0) +[2023-10-14 05:53:33,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:53:33,522][100936] Updated weights for policy 0, policy_version 20830 (0.0009) +[2023-10-14 05:53:34,890][100917] Updated weights for policy 1, policy_version 20802 (0.0010) +[2023-10-14 05:53:35,270][100917] Updated weights for policy 1, policy_version 20812 (0.0009) +[2023-10-14 05:53:35,649][100917] Updated weights for policy 1, policy_version 20822 (0.0009) +[2023-10-14 05:53:36,021][100917] Updated weights for policy 1, policy_version 20832 (0.0008) +[2023-10-14 05:53:37,766][100936] Updated weights for policy 0, policy_version 20840 (0.0010) +[2023-10-14 05:53:38,145][100936] Updated weights for policy 0, policy_version 20850 (0.0010) +[2023-10-14 05:53:38,512][99942] Fps is (10 sec: 13106.6, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 42663936. Throughput: 0: 1647.0, 1: 1660.3. Samples: 10682200. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 05:53:38,514][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:53:38,521][100936] Updated weights for policy 0, policy_version 20860 (0.0010) +[2023-10-14 05:53:38,522][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000020832_21331968.pth... +[2023-10-14 05:53:38,553][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000019296_19759104.pth +[2023-10-14 05:53:38,659][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000020864_21364736.pth... +[2023-10-14 05:53:38,689][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000019296_19759104.pth +[2023-10-14 05:53:40,053][100917] Updated weights for policy 1, policy_version 20842 (0.0008) +[2023-10-14 05:53:40,434][100917] Updated weights for policy 1, policy_version 20852 (0.0008) +[2023-10-14 05:53:40,803][100917] Updated weights for policy 1, policy_version 20862 (0.0008) +[2023-10-14 05:53:42,672][100936] Updated weights for policy 0, policy_version 20870 (0.0008) +[2023-10-14 05:53:43,050][100936] Updated weights for policy 0, policy_version 20880 (0.0007) +[2023-10-14 05:53:43,421][100936] Updated weights for policy 0, policy_version 20890 (0.0009) +[2023-10-14 05:53:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 42729472. Throughput: 0: 1667.9, 1: 1652.0. Samples: 10692256. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 05:53:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:53:44,803][100917] Updated weights for policy 1, policy_version 20872 (0.0009) +[2023-10-14 05:53:45,181][100917] Updated weights for policy 1, policy_version 20882 (0.0010) +[2023-10-14 05:53:45,553][100917] Updated weights for policy 1, policy_version 20892 (0.0010) +[2023-10-14 05:53:47,479][100936] Updated weights for policy 0, policy_version 20900 (0.0009) +[2023-10-14 05:53:47,846][100936] Updated weights for policy 0, policy_version 20910 (0.0008) +[2023-10-14 05:53:48,225][100936] Updated weights for policy 0, policy_version 20920 (0.0011) +[2023-10-14 05:53:48,512][99942] Fps is (10 sec: 13107.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 42795008. Throughput: 0: 1662.8, 1: 1666.4. Samples: 10712606. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 05:53:48,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:53:49,749][100917] Updated weights for policy 1, policy_version 20902 (0.0008) +[2023-10-14 05:53:50,136][100917] Updated weights for policy 1, policy_version 20912 (0.0007) +[2023-10-14 05:53:50,507][100917] Updated weights for policy 1, policy_version 20922 (0.0009) +[2023-10-14 05:53:52,424][100936] Updated weights for policy 0, policy_version 20930 (0.0009) +[2023-10-14 05:53:52,803][100936] Updated weights for policy 0, policy_version 20940 (0.0008) +[2023-10-14 05:53:53,168][100936] Updated weights for policy 0, policy_version 20950 (0.0007) +[2023-10-14 05:53:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 42860544. Throughput: 0: 1652.8, 1: 1669.2. Samples: 10731906. Policy #0 lag: (min: 9.0, avg: 18.6, max: 41.0) +[2023-10-14 05:53:53,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:53:53,540][100936] Updated weights for policy 0, policy_version 20960 (0.0008) +[2023-10-14 05:53:54,498][100917] Updated weights for policy 1, policy_version 20932 (0.0009) +[2023-10-14 05:53:54,866][100917] Updated weights for policy 1, policy_version 20942 (0.0010) +[2023-10-14 05:53:55,241][100917] Updated weights for policy 1, policy_version 20952 (0.0011) +[2023-10-14 05:53:57,646][100936] Updated weights for policy 0, policy_version 20970 (0.0009) +[2023-10-14 05:53:58,024][100936] Updated weights for policy 0, policy_version 20980 (0.0008) +[2023-10-14 05:53:58,397][100936] Updated weights for policy 0, policy_version 20990 (0.0008) +[2023-10-14 05:53:58,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 42958848. Throughput: 0: 1662.7, 1: 1662.3. Samples: 10741924. Policy #0 lag: (min: 9.0, avg: 18.6, max: 41.0) +[2023-10-14 05:53:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:53:59,455][100917] Updated weights for policy 1, policy_version 20962 (0.0010) +[2023-10-14 05:53:59,824][100917] Updated weights for policy 1, policy_version 20972 (0.0008) +[2023-10-14 05:54:00,197][100917] Updated weights for policy 1, policy_version 20982 (0.0007) +[2023-10-14 05:54:00,580][100917] Updated weights for policy 1, policy_version 20992 (0.0008) +[2023-10-14 05:54:02,466][100936] Updated weights for policy 0, policy_version 21000 (0.0009) +[2023-10-14 05:54:02,830][100936] Updated weights for policy 0, policy_version 21010 (0.0007) +[2023-10-14 05:54:03,199][100936] Updated weights for policy 0, policy_version 21020 (0.0007) +[2023-10-14 05:54:03,512][99942] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 43024384. Throughput: 0: 1660.0, 1: 1667.1. Samples: 10762146. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:54:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:54:04,624][100917] Updated weights for policy 1, policy_version 21002 (0.0007) +[2023-10-14 05:54:04,995][100917] Updated weights for policy 1, policy_version 21012 (0.0008) +[2023-10-14 05:54:05,364][100917] Updated weights for policy 1, policy_version 21022 (0.0007) +[2023-10-14 05:54:07,285][100936] Updated weights for policy 0, policy_version 21030 (0.0008) +[2023-10-14 05:54:07,651][100936] Updated weights for policy 0, policy_version 21040 (0.0010) +[2023-10-14 05:54:08,014][100936] Updated weights for policy 0, policy_version 21050 (0.0010) +[2023-10-14 05:54:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 43089920. Throughput: 0: 1652.0, 1: 1666.8. Samples: 10781640. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:54:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:54:09,409][100917] Updated weights for policy 1, policy_version 21032 (0.0008) +[2023-10-14 05:54:09,784][100917] Updated weights for policy 1, policy_version 21042 (0.0009) +[2023-10-14 05:54:10,160][100917] Updated weights for policy 1, policy_version 21052 (0.0009) +[2023-10-14 05:54:12,213][100936] Updated weights for policy 0, policy_version 21060 (0.0009) +[2023-10-14 05:54:12,585][100936] Updated weights for policy 0, policy_version 21070 (0.0009) +[2023-10-14 05:54:12,960][100936] Updated weights for policy 0, policy_version 21080 (0.0008) +[2023-10-14 05:54:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 43155456. Throughput: 0: 1662.8, 1: 1665.8. Samples: 10792018. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:54:13,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:54:14,288][100917] Updated weights for policy 1, policy_version 21062 (0.0007) +[2023-10-14 05:54:14,665][100917] Updated weights for policy 1, policy_version 21072 (0.0011) +[2023-10-14 05:54:15,047][100917] Updated weights for policy 1, policy_version 21082 (0.0010) +[2023-10-14 05:54:17,056][100936] Updated weights for policy 0, policy_version 21090 (0.0008) +[2023-10-14 05:54:17,421][100936] Updated weights for policy 0, policy_version 21100 (0.0007) +[2023-10-14 05:54:17,799][100936] Updated weights for policy 0, policy_version 21110 (0.0007) +[2023-10-14 05:54:18,174][100936] Updated weights for policy 0, policy_version 21120 (0.0009) +[2023-10-14 05:54:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 43220992. Throughput: 0: 1648.7, 1: 1670.5. Samples: 10811946. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:54:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:54:19,230][100917] Updated weights for policy 1, policy_version 21092 (0.0008) +[2023-10-14 05:54:19,599][100917] Updated weights for policy 1, policy_version 21102 (0.0007) +[2023-10-14 05:54:19,973][100917] Updated weights for policy 1, policy_version 21112 (0.0009) +[2023-10-14 05:54:22,280][100936] Updated weights for policy 0, policy_version 21130 (0.0008) +[2023-10-14 05:54:22,648][100936] Updated weights for policy 0, policy_version 21140 (0.0007) +[2023-10-14 05:54:23,029][100936] Updated weights for policy 0, policy_version 21150 (0.0007) +[2023-10-14 05:54:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 43286528. Throughput: 0: 1656.1, 1: 1662.8. Samples: 10831548. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:54:23,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:54:24,051][100917] Updated weights for policy 1, policy_version 21122 (0.0008) +[2023-10-14 05:54:24,435][100917] Updated weights for policy 1, policy_version 21132 (0.0010) +[2023-10-14 05:54:24,804][100917] Updated weights for policy 1, policy_version 21142 (0.0009) +[2023-10-14 05:54:25,174][100917] Updated weights for policy 1, policy_version 21152 (0.0009) +[2023-10-14 05:54:27,269][100936] Updated weights for policy 0, policy_version 21160 (0.0010) +[2023-10-14 05:54:27,655][100936] Updated weights for policy 0, policy_version 21170 (0.0012) +[2023-10-14 05:54:28,009][100936] Updated weights for policy 0, policy_version 21180 (0.0011) +[2023-10-14 05:54:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 43352064. Throughput: 0: 1661.6, 1: 1661.6. Samples: 10841800. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:54:28,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:54:29,424][100917] Updated weights for policy 1, policy_version 21162 (0.0010) +[2023-10-14 05:54:29,786][100917] Updated weights for policy 1, policy_version 21172 (0.0010) +[2023-10-14 05:54:30,164][100917] Updated weights for policy 1, policy_version 21182 (0.0009) +[2023-10-14 05:54:32,143][100936] Updated weights for policy 0, policy_version 21190 (0.0008) +[2023-10-14 05:54:32,516][100936] Updated weights for policy 0, policy_version 21200 (0.0007) +[2023-10-14 05:54:32,882][100936] Updated weights for policy 0, policy_version 21210 (0.0010) +[2023-10-14 05:54:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 43417600. Throughput: 0: 1648.7, 1: 1656.6. Samples: 10861342. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:54:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:54:34,182][100917] Updated weights for policy 1, policy_version 21192 (0.0007) +[2023-10-14 05:54:34,548][100917] Updated weights for policy 1, policy_version 21202 (0.0011) +[2023-10-14 05:54:34,933][100917] Updated weights for policy 1, policy_version 21212 (0.0009) +[2023-10-14 05:54:36,728][100936] Updated weights for policy 0, policy_version 21220 (0.0010) +[2023-10-14 05:54:37,109][100936] Updated weights for policy 0, policy_version 21230 (0.0011) +[2023-10-14 05:54:37,476][100936] Updated weights for policy 0, policy_version 21240 (0.0008) +[2023-10-14 05:54:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 43483136. Throughput: 0: 1657.5, 1: 1655.8. Samples: 10881004. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:54:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:54:39,139][100917] Updated weights for policy 1, policy_version 21222 (0.0007) +[2023-10-14 05:54:39,510][100917] Updated weights for policy 1, policy_version 21232 (0.0008) +[2023-10-14 05:54:39,883][100917] Updated weights for policy 1, policy_version 21242 (0.0009) +[2023-10-14 05:54:41,681][100936] Updated weights for policy 0, policy_version 21250 (0.0007) +[2023-10-14 05:54:42,043][100936] Updated weights for policy 0, policy_version 21260 (0.0008) +[2023-10-14 05:54:42,419][100936] Updated weights for policy 0, policy_version 21270 (0.0010) +[2023-10-14 05:54:42,785][100936] Updated weights for policy 0, policy_version 21280 (0.0007) +[2023-10-14 05:54:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 43548672. Throughput: 0: 1665.8, 1: 1653.9. Samples: 10891312. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:54:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:54:43,927][100917] Updated weights for policy 1, policy_version 21252 (0.0009) +[2023-10-14 05:54:44,300][100917] Updated weights for policy 1, policy_version 21262 (0.0007) +[2023-10-14 05:54:44,674][100917] Updated weights for policy 1, policy_version 21272 (0.0009) +[2023-10-14 05:54:47,045][100936] Updated weights for policy 0, policy_version 21290 (0.0009) +[2023-10-14 05:54:47,411][100936] Updated weights for policy 0, policy_version 21300 (0.0008) +[2023-10-14 05:54:47,778][100936] Updated weights for policy 0, policy_version 21310 (0.0009) +[2023-10-14 05:54:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 43614208. Throughput: 0: 1649.3, 1: 1657.1. Samples: 10910932. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:54:48,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:54:48,831][100917] Updated weights for policy 1, policy_version 21282 (0.0007) +[2023-10-14 05:54:49,207][100917] Updated weights for policy 1, policy_version 21292 (0.0007) +[2023-10-14 05:54:49,583][100917] Updated weights for policy 1, policy_version 21302 (0.0009) +[2023-10-14 05:54:49,949][100917] Updated weights for policy 1, policy_version 21312 (0.0011) +[2023-10-14 05:54:51,969][100936] Updated weights for policy 0, policy_version 21320 (0.0007) +[2023-10-14 05:54:52,335][100936] Updated weights for policy 0, policy_version 21330 (0.0007) +[2023-10-14 05:54:52,710][100936] Updated weights for policy 0, policy_version 21340 (0.0008) +[2023-10-14 05:54:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 43679744. Throughput: 0: 1658.1, 1: 1657.8. Samples: 10930858. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:54:53,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:54:54,158][100917] Updated weights for policy 1, policy_version 21322 (0.0007) +[2023-10-14 05:54:54,529][100917] Updated weights for policy 1, policy_version 21332 (0.0007) +[2023-10-14 05:54:54,906][100917] Updated weights for policy 1, policy_version 21342 (0.0011) +[2023-10-14 05:54:56,734][100936] Updated weights for policy 0, policy_version 21350 (0.0008) +[2023-10-14 05:54:57,102][100936] Updated weights for policy 0, policy_version 21360 (0.0011) +[2023-10-14 05:54:57,486][100936] Updated weights for policy 0, policy_version 21370 (0.0009) +[2023-10-14 05:54:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 43745280. Throughput: 0: 1659.2, 1: 1650.4. Samples: 10940950. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:54:58,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 05:54:59,074][100917] Updated weights for policy 1, policy_version 21352 (0.0010) +[2023-10-14 05:54:59,450][100917] Updated weights for policy 1, policy_version 21362 (0.0009) +[2023-10-14 05:54:59,826][100917] Updated weights for policy 1, policy_version 21372 (0.0007) +[2023-10-14 05:55:01,467][100936] Updated weights for policy 0, policy_version 21380 (0.0008) +[2023-10-14 05:55:01,835][100936] Updated weights for policy 0, policy_version 21390 (0.0007) +[2023-10-14 05:55:02,204][100936] Updated weights for policy 0, policy_version 21400 (0.0008) +[2023-10-14 05:55:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 43810816. Throughput: 0: 1648.9, 1: 1651.6. Samples: 10960468. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 05:55:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:55:03,891][100917] Updated weights for policy 1, policy_version 21382 (0.0008) +[2023-10-14 05:55:04,266][100917] Updated weights for policy 1, policy_version 21392 (0.0010) +[2023-10-14 05:55:04,644][100917] Updated weights for policy 1, policy_version 21402 (0.0009) +[2023-10-14 05:55:06,196][100936] Updated weights for policy 0, policy_version 21410 (0.0009) +[2023-10-14 05:55:06,579][100936] Updated weights for policy 0, policy_version 21420 (0.0009) +[2023-10-14 05:55:06,956][100936] Updated weights for policy 0, policy_version 21430 (0.0009) +[2023-10-14 05:55:07,320][100936] Updated weights for policy 0, policy_version 21440 (0.0010) +[2023-10-14 05:55:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 43876352. Throughput: 0: 1666.0, 1: 1653.5. Samples: 10980924. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 05:55:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:55:08,805][100917] Updated weights for policy 1, policy_version 21412 (0.0009) +[2023-10-14 05:55:09,170][100917] Updated weights for policy 1, policy_version 21422 (0.0010) +[2023-10-14 05:55:09,547][100917] Updated weights for policy 1, policy_version 21432 (0.0010) +[2023-10-14 05:55:11,597][100936] Updated weights for policy 0, policy_version 21450 (0.0010) +[2023-10-14 05:55:11,983][100936] Updated weights for policy 0, policy_version 21460 (0.0009) +[2023-10-14 05:55:12,351][100936] Updated weights for policy 0, policy_version 21470 (0.0008) +[2023-10-14 05:55:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 43941888. Throughput: 0: 1662.5, 1: 1650.8. Samples: 10990896. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 05:55:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:55:13,804][100917] Updated weights for policy 1, policy_version 21442 (0.0012) +[2023-10-14 05:55:14,175][100917] Updated weights for policy 1, policy_version 21452 (0.0008) +[2023-10-14 05:55:14,547][100917] Updated weights for policy 1, policy_version 21462 (0.0009) +[2023-10-14 05:55:14,916][100917] Updated weights for policy 1, policy_version 21472 (0.0008) +[2023-10-14 05:55:16,691][100936] Updated weights for policy 0, policy_version 21480 (0.0008) +[2023-10-14 05:55:17,062][100936] Updated weights for policy 0, policy_version 21490 (0.0007) +[2023-10-14 05:55:17,438][100936] Updated weights for policy 0, policy_version 21500 (0.0008) +[2023-10-14 05:55:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 44007424. Throughput: 0: 1654.8, 1: 1654.6. Samples: 11010266. Policy #0 lag: (min: 15.0, avg: 22.3, max: 47.0) +[2023-10-14 05:55:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:55:18,986][100917] Updated weights for policy 1, policy_version 21482 (0.0010) +[2023-10-14 05:55:19,370][100917] Updated weights for policy 1, policy_version 21492 (0.0007) +[2023-10-14 05:55:19,738][100917] Updated weights for policy 1, policy_version 21502 (0.0010) +[2023-10-14 05:55:21,502][100936] Updated weights for policy 0, policy_version 21510 (0.0009) +[2023-10-14 05:55:21,881][100936] Updated weights for policy 0, policy_version 21520 (0.0008) +[2023-10-14 05:55:22,250][100936] Updated weights for policy 0, policy_version 21530 (0.0010) +[2023-10-14 05:55:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 44072960. Throughput: 0: 1665.8, 1: 1658.8. Samples: 11030612. Policy #0 lag: (min: 15.0, avg: 22.3, max: 47.0) +[2023-10-14 05:55:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:55:23,694][100917] Updated weights for policy 1, policy_version 21512 (0.0009) +[2023-10-14 05:55:24,062][100917] Updated weights for policy 1, policy_version 21522 (0.0008) +[2023-10-14 05:55:24,434][100917] Updated weights for policy 1, policy_version 21532 (0.0007) +[2023-10-14 05:55:26,187][100936] Updated weights for policy 0, policy_version 21540 (0.0007) +[2023-10-14 05:55:26,558][100936] Updated weights for policy 0, policy_version 21550 (0.0009) +[2023-10-14 05:55:26,930][100936] Updated weights for policy 0, policy_version 21560 (0.0009) +[2023-10-14 05:55:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 44138496. Throughput: 0: 1658.1, 1: 1657.1. Samples: 11040496. Policy #0 lag: (min: 15.0, avg: 22.3, max: 47.0) +[2023-10-14 05:55:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:55:28,608][100917] Updated weights for policy 1, policy_version 21542 (0.0009) +[2023-10-14 05:55:28,983][100917] Updated weights for policy 1, policy_version 21552 (0.0009) +[2023-10-14 05:55:29,360][100917] Updated weights for policy 1, policy_version 21562 (0.0008) +[2023-10-14 05:55:31,048][100936] Updated weights for policy 0, policy_version 21570 (0.0008) +[2023-10-14 05:55:31,423][100936] Updated weights for policy 0, policy_version 21580 (0.0010) +[2023-10-14 05:55:31,782][100936] Updated weights for policy 0, policy_version 21590 (0.0011) +[2023-10-14 05:55:32,158][100936] Updated weights for policy 0, policy_version 21600 (0.0010) +[2023-10-14 05:55:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 44204032. Throughput: 0: 1657.0, 1: 1657.3. Samples: 11060072. Policy #0 lag: (min: 15.0, avg: 22.3, max: 47.0) +[2023-10-14 05:55:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:55:33,564][100917] Updated weights for policy 1, policy_version 21572 (0.0008) +[2023-10-14 05:55:33,932][100917] Updated weights for policy 1, policy_version 21582 (0.0008) +[2023-10-14 05:55:34,314][100917] Updated weights for policy 1, policy_version 21592 (0.0009) +[2023-10-14 05:55:36,309][100936] Updated weights for policy 0, policy_version 21610 (0.0010) +[2023-10-14 05:55:36,684][100936] Updated weights for policy 0, policy_version 21620 (0.0011) +[2023-10-14 05:55:37,048][100936] Updated weights for policy 0, policy_version 21630 (0.0008) +[2023-10-14 05:55:38,413][100917] Updated weights for policy 1, policy_version 21602 (0.0011) +[2023-10-14 05:55:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 44269568. Throughput: 0: 1669.3, 1: 1655.1. Samples: 11080456. Policy #0 lag: (min: 5.0, avg: 20.9, max: 37.0) +[2023-10-14 05:55:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:55:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000021632_22151168.pth... +[2023-10-14 05:55:38,561][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000020064_20545536.pth +[2023-10-14 05:55:38,790][100917] Updated weights for policy 1, policy_version 21612 (0.0007) +[2023-10-14 05:55:39,160][100917] Updated weights for policy 1, policy_version 21622 (0.0007) +[2023-10-14 05:55:39,536][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000021632_22151168.pth... +[2023-10-14 05:55:39,536][100917] Updated weights for policy 1, policy_version 21632 (0.0007) +[2023-10-14 05:55:39,564][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000020064_20545536.pth +[2023-10-14 05:55:41,171][100936] Updated weights for policy 0, policy_version 21640 (0.0007) +[2023-10-14 05:55:41,538][100936] Updated weights for policy 0, policy_version 21650 (0.0007) +[2023-10-14 05:55:41,908][100936] Updated weights for policy 0, policy_version 21660 (0.0009) +[2023-10-14 05:55:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 44335104. Throughput: 0: 1659.7, 1: 1659.1. Samples: 11090300. Policy #0 lag: (min: 5.0, avg: 20.9, max: 37.0) +[2023-10-14 05:55:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:55:43,743][100917] Updated weights for policy 1, policy_version 21642 (0.0011) +[2023-10-14 05:55:44,113][100917] Updated weights for policy 1, policy_version 21652 (0.0009) +[2023-10-14 05:55:44,497][100917] Updated weights for policy 1, policy_version 21662 (0.0007) +[2023-10-14 05:55:45,971][100936] Updated weights for policy 0, policy_version 21670 (0.0008) +[2023-10-14 05:55:46,340][100936] Updated weights for policy 0, policy_version 21680 (0.0008) +[2023-10-14 05:55:46,702][100936] Updated weights for policy 0, policy_version 21690 (0.0010) +[2023-10-14 05:55:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 44400640. Throughput: 0: 1663.0, 1: 1659.0. Samples: 11109958. Policy #0 lag: (min: 5.0, avg: 20.9, max: 37.0) +[2023-10-14 05:55:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:55:48,552][100917] Updated weights for policy 1, policy_version 21672 (0.0010) +[2023-10-14 05:55:48,923][100917] Updated weights for policy 1, policy_version 21682 (0.0007) +[2023-10-14 05:55:49,297][100917] Updated weights for policy 1, policy_version 21692 (0.0009) +[2023-10-14 05:55:50,681][100936] Updated weights for policy 0, policy_version 21700 (0.0008) +[2023-10-14 05:55:51,046][100936] Updated weights for policy 0, policy_version 21710 (0.0011) +[2023-10-14 05:55:51,417][100936] Updated weights for policy 0, policy_version 21720 (0.0009) +[2023-10-14 05:55:53,424][100917] Updated weights for policy 1, policy_version 21702 (0.0012) +[2023-10-14 05:55:53,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 44466176. Throughput: 0: 1668.7, 1: 1659.2. Samples: 11130680. Policy #0 lag: (min: 5.0, avg: 20.9, max: 37.0) +[2023-10-14 05:55:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:55:53,801][100917] Updated weights for policy 1, policy_version 21712 (0.0008) +[2023-10-14 05:55:54,169][100917] Updated weights for policy 1, policy_version 21722 (0.0009) +[2023-10-14 05:55:55,365][100936] Updated weights for policy 0, policy_version 21730 (0.0008) +[2023-10-14 05:55:55,740][100936] Updated weights for policy 0, policy_version 21740 (0.0007) +[2023-10-14 05:55:56,106][100936] Updated weights for policy 0, policy_version 21750 (0.0007) +[2023-10-14 05:55:56,479][100936] Updated weights for policy 0, policy_version 21760 (0.0008) +[2023-10-14 05:55:58,166][100917] Updated weights for policy 1, policy_version 21732 (0.0007) +[2023-10-14 05:55:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 44531712. Throughput: 0: 1650.9, 1: 1664.0. Samples: 11140064. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:55:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:55:58,539][100917] Updated weights for policy 1, policy_version 21742 (0.0007) +[2023-10-14 05:55:58,913][100917] Updated weights for policy 1, policy_version 21752 (0.0008) +[2023-10-14 05:56:00,519][100936] Updated weights for policy 0, policy_version 21770 (0.0010) +[2023-10-14 05:56:00,890][100936] Updated weights for policy 0, policy_version 21780 (0.0009) +[2023-10-14 05:56:01,269][100936] Updated weights for policy 0, policy_version 21790 (0.0010) +[2023-10-14 05:56:03,110][100917] Updated weights for policy 1, policy_version 21762 (0.0009) +[2023-10-14 05:56:03,475][100917] Updated weights for policy 1, policy_version 21772 (0.0009) +[2023-10-14 05:56:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 44597248. Throughput: 0: 1674.6, 1: 1660.7. Samples: 11160354. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:56:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:56:03,846][100917] Updated weights for policy 1, policy_version 21782 (0.0009) +[2023-10-14 05:56:04,226][100917] Updated weights for policy 1, policy_version 21792 (0.0008) +[2023-10-14 05:56:05,553][100936] Updated weights for policy 0, policy_version 21800 (0.0011) +[2023-10-14 05:56:05,924][100936] Updated weights for policy 0, policy_version 21810 (0.0008) +[2023-10-14 05:56:06,291][100936] Updated weights for policy 0, policy_version 21820 (0.0011) +[2023-10-14 05:56:08,311][100917] Updated weights for policy 1, policy_version 21802 (0.0007) +[2023-10-14 05:56:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 44662784. Throughput: 0: 1673.7, 1: 1660.7. Samples: 11180662. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:56:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:56:08,688][100917] Updated weights for policy 1, policy_version 21812 (0.0009) +[2023-10-14 05:56:09,059][100917] Updated weights for policy 1, policy_version 21822 (0.0009) +[2023-10-14 05:56:10,346][100936] Updated weights for policy 0, policy_version 21830 (0.0008) +[2023-10-14 05:56:10,721][100936] Updated weights for policy 0, policy_version 21840 (0.0009) +[2023-10-14 05:56:11,099][100936] Updated weights for policy 0, policy_version 21850 (0.0008) +[2023-10-14 05:56:13,162][100917] Updated weights for policy 1, policy_version 21832 (0.0009) +[2023-10-14 05:56:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 44728320. Throughput: 0: 1649.7, 1: 1663.2. Samples: 11189576. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 05:56:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:56:13,532][100917] Updated weights for policy 1, policy_version 21842 (0.0009) +[2023-10-14 05:56:13,907][100917] Updated weights for policy 1, policy_version 21852 (0.0010) +[2023-10-14 05:56:15,250][100936] Updated weights for policy 0, policy_version 21860 (0.0008) +[2023-10-14 05:56:15,622][100936] Updated weights for policy 0, policy_version 21870 (0.0008) +[2023-10-14 05:56:15,992][100936] Updated weights for policy 0, policy_version 21880 (0.0008) +[2023-10-14 05:56:18,176][100917] Updated weights for policy 1, policy_version 21862 (0.0010) +[2023-10-14 05:56:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 44793856. Throughput: 0: 1670.3, 1: 1658.2. Samples: 11209854. Policy #0 lag: (min: 1.0, avg: 16.7, max: 33.0) +[2023-10-14 05:56:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:56:18,545][100917] Updated weights for policy 1, policy_version 21872 (0.0012) +[2023-10-14 05:56:18,916][100917] Updated weights for policy 1, policy_version 21882 (0.0010) +[2023-10-14 05:56:20,081][100936] Updated weights for policy 0, policy_version 21890 (0.0009) +[2023-10-14 05:56:20,457][100936] Updated weights for policy 0, policy_version 21900 (0.0007) +[2023-10-14 05:56:20,826][100936] Updated weights for policy 0, policy_version 21910 (0.0009) +[2023-10-14 05:56:21,185][100936] Updated weights for policy 0, policy_version 21920 (0.0011) +[2023-10-14 05:56:23,016][100917] Updated weights for policy 1, policy_version 21892 (0.0011) +[2023-10-14 05:56:23,380][100917] Updated weights for policy 1, policy_version 21902 (0.0009) +[2023-10-14 05:56:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 44859392. Throughput: 0: 1675.6, 1: 1654.0. Samples: 11230286. Policy #0 lag: (min: 1.0, avg: 16.7, max: 33.0) +[2023-10-14 05:56:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:56:23,761][100917] Updated weights for policy 1, policy_version 21912 (0.0009) +[2023-10-14 05:56:25,334][100936] Updated weights for policy 0, policy_version 21930 (0.0009) +[2023-10-14 05:56:25,702][100936] Updated weights for policy 0, policy_version 21940 (0.0010) +[2023-10-14 05:56:26,083][100936] Updated weights for policy 0, policy_version 21950 (0.0010) +[2023-10-14 05:56:27,854][100917] Updated weights for policy 1, policy_version 21922 (0.0009) +[2023-10-14 05:56:28,235][100917] Updated weights for policy 1, policy_version 21932 (0.0009) +[2023-10-14 05:56:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 44924928. Throughput: 0: 1652.5, 1: 1660.2. Samples: 11239368. Policy #0 lag: (min: 1.0, avg: 16.7, max: 33.0) +[2023-10-14 05:56:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:56:28,608][100917] Updated weights for policy 1, policy_version 21942 (0.0007) +[2023-10-14 05:56:28,984][100917] Updated weights for policy 1, policy_version 21952 (0.0008) +[2023-10-14 05:56:30,247][100936] Updated weights for policy 0, policy_version 21960 (0.0007) +[2023-10-14 05:56:30,626][100936] Updated weights for policy 0, policy_version 21970 (0.0008) +[2023-10-14 05:56:30,999][100936] Updated weights for policy 0, policy_version 21980 (0.0009) +[2023-10-14 05:56:33,055][100917] Updated weights for policy 1, policy_version 21962 (0.0011) +[2023-10-14 05:56:33,433][100917] Updated weights for policy 1, policy_version 21972 (0.0010) +[2023-10-14 05:56:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 44990464. Throughput: 0: 1670.7, 1: 1663.6. Samples: 11260000. Policy #0 lag: (min: 18.0, avg: 23.2, max: 50.0) +[2023-10-14 05:56:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:56:33,803][100917] Updated weights for policy 1, policy_version 21982 (0.0009) +[2023-10-14 05:56:34,966][100936] Updated weights for policy 0, policy_version 21990 (0.0008) +[2023-10-14 05:56:35,344][100936] Updated weights for policy 0, policy_version 22000 (0.0007) +[2023-10-14 05:56:35,710][100936] Updated weights for policy 0, policy_version 22010 (0.0007) +[2023-10-14 05:56:37,839][100917] Updated weights for policy 1, policy_version 21992 (0.0008) +[2023-10-14 05:56:38,208][100917] Updated weights for policy 1, policy_version 22002 (0.0010) +[2023-10-14 05:56:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 45056000. Throughput: 0: 1667.1, 1: 1649.0. Samples: 11279904. Policy #0 lag: (min: 18.0, avg: 23.2, max: 50.0) +[2023-10-14 05:56:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:56:38,586][100917] Updated weights for policy 1, policy_version 22012 (0.0010) +[2023-10-14 05:56:39,930][100936] Updated weights for policy 0, policy_version 22020 (0.0007) +[2023-10-14 05:56:40,303][100936] Updated weights for policy 0, policy_version 22030 (0.0007) +[2023-10-14 05:56:40,666][100936] Updated weights for policy 0, policy_version 22040 (0.0007) +[2023-10-14 05:56:42,656][100917] Updated weights for policy 1, policy_version 22022 (0.0007) +[2023-10-14 05:56:43,025][100917] Updated weights for policy 1, policy_version 22032 (0.0008) +[2023-10-14 05:56:43,404][100917] Updated weights for policy 1, policy_version 22042 (0.0007) +[2023-10-14 05:56:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 45121536. Throughput: 0: 1657.6, 1: 1660.9. Samples: 11289400. Policy #0 lag: (min: 18.0, avg: 23.2, max: 50.0) +[2023-10-14 05:56:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:56:44,847][100936] Updated weights for policy 0, policy_version 22050 (0.0009) +[2023-10-14 05:56:45,214][100936] Updated weights for policy 0, policy_version 22060 (0.0007) +[2023-10-14 05:56:45,574][100936] Updated weights for policy 0, policy_version 22070 (0.0009) +[2023-10-14 05:56:45,946][100936] Updated weights for policy 0, policy_version 22080 (0.0008) +[2023-10-14 05:56:47,538][100917] Updated weights for policy 1, policy_version 22052 (0.0009) +[2023-10-14 05:56:47,913][100917] Updated weights for policy 1, policy_version 22062 (0.0011) +[2023-10-14 05:56:48,297][100917] Updated weights for policy 1, policy_version 22072 (0.0010) +[2023-10-14 05:56:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 45187072. Throughput: 0: 1658.4, 1: 1661.8. Samples: 11309766. Policy #0 lag: (min: 18.0, avg: 23.2, max: 50.0) +[2023-10-14 05:56:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:56:50,003][100936] Updated weights for policy 0, policy_version 22090 (0.0008) +[2023-10-14 05:56:50,371][100936] Updated weights for policy 0, policy_version 22100 (0.0009) +[2023-10-14 05:56:50,743][100936] Updated weights for policy 0, policy_version 22110 (0.0008) +[2023-10-14 05:56:52,383][100917] Updated weights for policy 1, policy_version 22082 (0.0009) +[2023-10-14 05:56:52,767][100917] Updated weights for policy 1, policy_version 22092 (0.0010) +[2023-10-14 05:56:53,136][100917] Updated weights for policy 1, policy_version 22102 (0.0010) +[2023-10-14 05:56:53,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 45285376. Throughput: 0: 1663.6, 1: 1646.6. Samples: 11329622. Policy #0 lag: (min: 31.0, avg: 31.1, max: 38.0) +[2023-10-14 05:56:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:56:53,515][100917] Updated weights for policy 1, policy_version 22112 (0.0007) +[2023-10-14 05:56:54,988][100936] Updated weights for policy 0, policy_version 22120 (0.0010) +[2023-10-14 05:56:55,363][100936] Updated weights for policy 0, policy_version 22130 (0.0008) +[2023-10-14 05:56:55,723][100936] Updated weights for policy 0, policy_version 22140 (0.0007) +[2023-10-14 05:56:57,555][100917] Updated weights for policy 1, policy_version 22122 (0.0009) +[2023-10-14 05:56:57,928][100917] Updated weights for policy 1, policy_version 22132 (0.0007) +[2023-10-14 05:56:58,294][100917] Updated weights for policy 1, policy_version 22142 (0.0008) +[2023-10-14 05:56:58,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 45350912. Throughput: 0: 1660.7, 1: 1664.9. Samples: 11339228. Policy #0 lag: (min: 31.0, avg: 31.1, max: 38.0) +[2023-10-14 05:56:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:56:59,932][100936] Updated weights for policy 0, policy_version 22150 (0.0008) +[2023-10-14 05:57:00,306][100936] Updated weights for policy 0, policy_version 22160 (0.0009) +[2023-10-14 05:57:00,687][100936] Updated weights for policy 0, policy_version 22170 (0.0009) +[2023-10-14 05:57:02,148][100917] Updated weights for policy 1, policy_version 22152 (0.0008) +[2023-10-14 05:57:02,525][100917] Updated weights for policy 1, policy_version 22162 (0.0007) +[2023-10-14 05:57:02,904][100917] Updated weights for policy 1, policy_version 22172 (0.0008) +[2023-10-14 05:57:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 45416448. Throughput: 0: 1662.3, 1: 1667.7. Samples: 11359706. Policy #0 lag: (min: 31.0, avg: 31.1, max: 38.0) +[2023-10-14 05:57:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:57:04,737][100936] Updated weights for policy 0, policy_version 22180 (0.0010) +[2023-10-14 05:57:05,102][100936] Updated weights for policy 0, policy_version 22190 (0.0011) +[2023-10-14 05:57:05,478][100936] Updated weights for policy 0, policy_version 22200 (0.0010) +[2023-10-14 05:57:07,239][100917] Updated weights for policy 1, policy_version 22182 (0.0007) +[2023-10-14 05:57:07,610][100917] Updated weights for policy 1, policy_version 22192 (0.0009) +[2023-10-14 05:57:07,986][100917] Updated weights for policy 1, policy_version 22202 (0.0009) +[2023-10-14 05:57:08,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 45481984. Throughput: 0: 1655.2, 1: 1657.2. Samples: 11379346. Policy #0 lag: (min: 31.0, avg: 31.1, max: 38.0) +[2023-10-14 05:57:08,514][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:57:09,715][100936] Updated weights for policy 0, policy_version 22210 (0.0010) +[2023-10-14 05:57:10,084][100936] Updated weights for policy 0, policy_version 22220 (0.0008) +[2023-10-14 05:57:10,460][100936] Updated weights for policy 0, policy_version 22230 (0.0009) +[2023-10-14 05:57:10,825][100936] Updated weights for policy 0, policy_version 22240 (0.0009) +[2023-10-14 05:57:11,986][100917] Updated weights for policy 1, policy_version 22212 (0.0009) +[2023-10-14 05:57:12,356][100917] Updated weights for policy 1, policy_version 22222 (0.0010) +[2023-10-14 05:57:12,725][100917] Updated weights for policy 1, policy_version 22232 (0.0010) +[2023-10-14 05:57:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 45547520. Throughput: 0: 1656.1, 1: 1679.0. Samples: 11389450. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 05:57:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:57:14,891][100936] Updated weights for policy 0, policy_version 22250 (0.0008) +[2023-10-14 05:57:15,261][100936] Updated weights for policy 0, policy_version 22260 (0.0008) +[2023-10-14 05:57:15,640][100936] Updated weights for policy 0, policy_version 22270 (0.0009) +[2023-10-14 05:57:16,916][100917] Updated weights for policy 1, policy_version 22242 (0.0010) +[2023-10-14 05:57:17,297][100917] Updated weights for policy 1, policy_version 22252 (0.0007) +[2023-10-14 05:57:17,669][100917] Updated weights for policy 1, policy_version 22262 (0.0008) +[2023-10-14 05:57:18,035][100917] Updated weights for policy 1, policy_version 22272 (0.0010) +[2023-10-14 05:57:18,512][99942] Fps is (10 sec: 13107.7, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 45613056. Throughput: 0: 1656.8, 1: 1670.0. Samples: 11409704. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 05:57:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:57:19,877][100936] Updated weights for policy 0, policy_version 22280 (0.0009) +[2023-10-14 05:57:20,255][100936] Updated weights for policy 0, policy_version 22290 (0.0007) +[2023-10-14 05:57:20,629][100936] Updated weights for policy 0, policy_version 22300 (0.0007) +[2023-10-14 05:57:22,031][100917] Updated weights for policy 1, policy_version 22282 (0.0009) +[2023-10-14 05:57:22,409][100917] Updated weights for policy 1, policy_version 22292 (0.0008) +[2023-10-14 05:57:22,780][100917] Updated weights for policy 1, policy_version 22302 (0.0009) +[2023-10-14 05:57:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 45678592. Throughput: 0: 1654.3, 1: 1658.8. Samples: 11428992. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 05:57:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:57:24,818][100936] Updated weights for policy 0, policy_version 22310 (0.0010) +[2023-10-14 05:57:25,191][100936] Updated weights for policy 0, policy_version 22320 (0.0010) +[2023-10-14 05:57:25,561][100936] Updated weights for policy 0, policy_version 22330 (0.0011) +[2023-10-14 05:57:26,904][100917] Updated weights for policy 1, policy_version 22312 (0.0009) +[2023-10-14 05:57:27,283][100917] Updated weights for policy 1, policy_version 22322 (0.0010) +[2023-10-14 05:57:27,654][100917] Updated weights for policy 1, policy_version 22332 (0.0008) +[2023-10-14 05:57:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 45744128. Throughput: 0: 1654.8, 1: 1671.5. Samples: 11439084. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 05:57:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:57:29,616][100936] Updated weights for policy 0, policy_version 22340 (0.0008) +[2023-10-14 05:57:29,977][100936] Updated weights for policy 0, policy_version 22350 (0.0008) +[2023-10-14 05:57:30,356][100936] Updated weights for policy 0, policy_version 22360 (0.0007) +[2023-10-14 05:57:31,765][100917] Updated weights for policy 1, policy_version 22342 (0.0008) +[2023-10-14 05:57:32,141][100917] Updated weights for policy 1, policy_version 22352 (0.0008) +[2023-10-14 05:57:32,524][100917] Updated weights for policy 1, policy_version 22362 (0.0007) +[2023-10-14 05:57:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 45809664. Throughput: 0: 1653.8, 1: 1663.2. Samples: 11459032. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) +[2023-10-14 05:57:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:57:34,539][100936] Updated weights for policy 0, policy_version 22370 (0.0009) +[2023-10-14 05:57:34,903][100936] Updated weights for policy 0, policy_version 22380 (0.0010) +[2023-10-14 05:57:35,270][100936] Updated weights for policy 0, policy_version 22390 (0.0009) +[2023-10-14 05:57:35,640][100936] Updated weights for policy 0, policy_version 22400 (0.0008) +[2023-10-14 05:57:36,668][100917] Updated weights for policy 1, policy_version 22372 (0.0009) +[2023-10-14 05:57:37,042][100917] Updated weights for policy 1, policy_version 22382 (0.0011) +[2023-10-14 05:57:37,417][100917] Updated weights for policy 1, policy_version 22392 (0.0007) +[2023-10-14 05:57:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.4, 300 sec: 13329.3). Total num frames: 45875200. Throughput: 0: 1653.0, 1: 1656.7. Samples: 11478558. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) +[2023-10-14 05:57:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:57:38,520][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000022400_22937600.pth... +[2023-10-14 05:57:38,520][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000022400_22937600.pth... +[2023-10-14 05:57:38,553][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000020864_21364736.pth +[2023-10-14 05:57:38,556][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000020832_21331968.pth +[2023-10-14 05:57:39,763][100936] Updated weights for policy 0, policy_version 22410 (0.0007) +[2023-10-14 05:57:40,142][100936] Updated weights for policy 0, policy_version 22420 (0.0007) +[2023-10-14 05:57:40,512][100936] Updated weights for policy 0, policy_version 22430 (0.0011) +[2023-10-14 05:57:41,581][100917] Updated weights for policy 1, policy_version 22402 (0.0008) +[2023-10-14 05:57:41,953][100917] Updated weights for policy 1, policy_version 22412 (0.0009) +[2023-10-14 05:57:42,339][100917] Updated weights for policy 1, policy_version 22422 (0.0008) +[2023-10-14 05:57:42,713][100917] Updated weights for policy 1, policy_version 22432 (0.0007) +[2023-10-14 05:57:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 45940736. Throughput: 0: 1652.2, 1: 1666.6. Samples: 11488574. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) +[2023-10-14 05:57:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:57:44,692][100936] Updated weights for policy 0, policy_version 22440 (0.0009) +[2023-10-14 05:57:45,059][100936] Updated weights for policy 0, policy_version 22450 (0.0010) +[2023-10-14 05:57:45,421][100936] Updated weights for policy 0, policy_version 22460 (0.0010) +[2023-10-14 05:57:46,810][100917] Updated weights for policy 1, policy_version 22442 (0.0008) +[2023-10-14 05:57:47,181][100917] Updated weights for policy 1, policy_version 22452 (0.0009) +[2023-10-14 05:57:47,554][100917] Updated weights for policy 1, policy_version 22462 (0.0009) +[2023-10-14 05:57:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 46006272. Throughput: 0: 1650.3, 1: 1661.0. Samples: 11508714. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) +[2023-10-14 05:57:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:57:49,576][100936] Updated weights for policy 0, policy_version 22470 (0.0011) +[2023-10-14 05:57:49,945][100936] Updated weights for policy 0, policy_version 22480 (0.0007) +[2023-10-14 05:57:50,319][100936] Updated weights for policy 0, policy_version 22490 (0.0009) +[2023-10-14 05:57:51,826][100917] Updated weights for policy 1, policy_version 22472 (0.0007) +[2023-10-14 05:57:52,198][100917] Updated weights for policy 1, policy_version 22482 (0.0008) +[2023-10-14 05:57:52,571][100917] Updated weights for policy 1, policy_version 22492 (0.0007) +[2023-10-14 05:57:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 46071808. Throughput: 0: 1660.7, 1: 1650.5. Samples: 11528350. Policy #0 lag: (min: 29.0, avg: 30.7, max: 57.0) +[2023-10-14 05:57:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:57:54,300][100936] Updated weights for policy 0, policy_version 22500 (0.0008) +[2023-10-14 05:57:54,670][100936] Updated weights for policy 0, policy_version 22510 (0.0008) +[2023-10-14 05:57:55,040][100936] Updated weights for policy 0, policy_version 22520 (0.0010) +[2023-10-14 05:57:56,940][100917] Updated weights for policy 1, policy_version 22502 (0.0007) +[2023-10-14 05:57:57,320][100917] Updated weights for policy 1, policy_version 22512 (0.0007) +[2023-10-14 05:57:57,699][100917] Updated weights for policy 1, policy_version 22522 (0.0008) +[2023-10-14 05:57:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 46137344. Throughput: 0: 1659.3, 1: 1648.6. Samples: 11538304. Policy #0 lag: (min: 29.0, avg: 30.7, max: 57.0) +[2023-10-14 05:57:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:57:59,063][100936] Updated weights for policy 0, policy_version 22530 (0.0009) +[2023-10-14 05:57:59,442][100936] Updated weights for policy 0, policy_version 22540 (0.0008) +[2023-10-14 05:57:59,807][100936] Updated weights for policy 0, policy_version 22550 (0.0009) +[2023-10-14 05:58:00,182][100936] Updated weights for policy 0, policy_version 22560 (0.0009) +[2023-10-14 05:58:01,819][100917] Updated weights for policy 1, policy_version 22532 (0.0009) +[2023-10-14 05:58:02,182][100917] Updated weights for policy 1, policy_version 22542 (0.0008) +[2023-10-14 05:58:02,560][100917] Updated weights for policy 1, policy_version 22552 (0.0007) +[2023-10-14 05:58:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 46202880. Throughput: 0: 1658.1, 1: 1644.7. Samples: 11558332. Policy #0 lag: (min: 29.0, avg: 30.7, max: 57.0) +[2023-10-14 05:58:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:58:04,371][100936] Updated weights for policy 0, policy_version 22570 (0.0008) +[2023-10-14 05:58:04,743][100936] Updated weights for policy 0, policy_version 22580 (0.0010) +[2023-10-14 05:58:05,121][100936] Updated weights for policy 0, policy_version 22590 (0.0009) +[2023-10-14 05:58:06,833][100917] Updated weights for policy 1, policy_version 22562 (0.0010) +[2023-10-14 05:58:07,260][100917] Updated weights for policy 1, policy_version 22572 (0.0009) +[2023-10-14 05:58:07,630][100917] Updated weights for policy 1, policy_version 22582 (0.0010) +[2023-10-14 05:58:07,995][100917] Updated weights for policy 1, policy_version 22592 (0.0010) +[2023-10-14 05:58:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 46268416. Throughput: 0: 1661.8, 1: 1648.6. Samples: 11577960. Policy #0 lag: (min: 29.0, avg: 30.7, max: 57.0) +[2023-10-14 05:58:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:58:09,030][100936] Updated weights for policy 0, policy_version 22600 (0.0009) +[2023-10-14 05:58:09,391][100936] Updated weights for policy 0, policy_version 22610 (0.0010) +[2023-10-14 05:58:09,761][100936] Updated weights for policy 0, policy_version 22620 (0.0010) +[2023-10-14 05:58:11,931][100917] Updated weights for policy 1, policy_version 22602 (0.0007) +[2023-10-14 05:58:12,297][100917] Updated weights for policy 1, policy_version 22612 (0.0007) +[2023-10-14 05:58:12,676][100917] Updated weights for policy 1, policy_version 22622 (0.0008) +[2023-10-14 05:58:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 46333952. Throughput: 0: 1660.4, 1: 1650.7. Samples: 11588080. Policy #0 lag: (min: 31.0, avg: 38.3, max: 63.0) +[2023-10-14 05:58:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:58:14,079][100936] Updated weights for policy 0, policy_version 22630 (0.0009) +[2023-10-14 05:58:14,454][100936] Updated weights for policy 0, policy_version 22640 (0.0007) +[2023-10-14 05:58:14,830][100936] Updated weights for policy 0, policy_version 22650 (0.0008) +[2023-10-14 05:58:16,592][100917] Updated weights for policy 1, policy_version 22632 (0.0008) +[2023-10-14 05:58:16,968][100917] Updated weights for policy 1, policy_version 22642 (0.0008) +[2023-10-14 05:58:17,340][100917] Updated weights for policy 1, policy_version 22652 (0.0011) +[2023-10-14 05:58:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 46399488. Throughput: 0: 1663.2, 1: 1649.2. Samples: 11608088. Policy #0 lag: (min: 31.0, avg: 38.3, max: 63.0) +[2023-10-14 05:58:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:58:18,816][100936] Updated weights for policy 0, policy_version 22660 (0.0009) +[2023-10-14 05:58:19,192][100936] Updated weights for policy 0, policy_version 22670 (0.0008) +[2023-10-14 05:58:19,555][100936] Updated weights for policy 0, policy_version 22680 (0.0009) +[2023-10-14 05:58:21,622][100917] Updated weights for policy 1, policy_version 22662 (0.0011) +[2023-10-14 05:58:21,988][100917] Updated weights for policy 1, policy_version 22672 (0.0008) +[2023-10-14 05:58:22,364][100917] Updated weights for policy 1, policy_version 22682 (0.0011) +[2023-10-14 05:58:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 46465024. Throughput: 0: 1666.9, 1: 1656.7. Samples: 11628120. Policy #0 lag: (min: 31.0, avg: 38.3, max: 63.0) +[2023-10-14 05:58:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:58:23,747][100936] Updated weights for policy 0, policy_version 22690 (0.0011) +[2023-10-14 05:58:24,116][100936] Updated weights for policy 0, policy_version 22700 (0.0010) +[2023-10-14 05:58:24,483][100936] Updated weights for policy 0, policy_version 22710 (0.0008) +[2023-10-14 05:58:24,851][100936] Updated weights for policy 0, policy_version 22720 (0.0008) +[2023-10-14 05:58:26,398][100917] Updated weights for policy 1, policy_version 22692 (0.0008) +[2023-10-14 05:58:26,761][100917] Updated weights for policy 1, policy_version 22702 (0.0009) +[2023-10-14 05:58:27,139][100917] Updated weights for policy 1, policy_version 22712 (0.0009) +[2023-10-14 05:58:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 46530560. Throughput: 0: 1670.5, 1: 1658.3. Samples: 11638372. Policy #0 lag: (min: 31.0, avg: 38.3, max: 63.0) +[2023-10-14 05:58:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:58:29,138][100936] Updated weights for policy 0, policy_version 22730 (0.0007) +[2023-10-14 05:58:29,511][100936] Updated weights for policy 0, policy_version 22740 (0.0009) +[2023-10-14 05:58:29,882][100936] Updated weights for policy 0, policy_version 22750 (0.0008) +[2023-10-14 05:58:31,094][100917] Updated weights for policy 1, policy_version 22722 (0.0010) +[2023-10-14 05:58:31,462][100917] Updated weights for policy 1, policy_version 22732 (0.0010) +[2023-10-14 05:58:31,842][100917] Updated weights for policy 1, policy_version 22742 (0.0011) +[2023-10-14 05:58:32,216][100917] Updated weights for policy 1, policy_version 22752 (0.0008) +[2023-10-14 05:58:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 46596096. Throughput: 0: 1670.4, 1: 1644.0. Samples: 11657866. Policy #0 lag: (min: 31.0, avg: 31.6, max: 48.0) +[2023-10-14 05:58:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:58:33,937][100936] Updated weights for policy 0, policy_version 22760 (0.0007) +[2023-10-14 05:58:34,307][100936] Updated weights for policy 0, policy_version 22770 (0.0008) +[2023-10-14 05:58:34,677][100936] Updated weights for policy 0, policy_version 22780 (0.0008) +[2023-10-14 05:58:36,517][100917] Updated weights for policy 1, policy_version 22762 (0.0008) +[2023-10-14 05:58:36,884][100917] Updated weights for policy 1, policy_version 22772 (0.0009) +[2023-10-14 05:58:37,251][100917] Updated weights for policy 1, policy_version 22782 (0.0009) +[2023-10-14 05:58:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 46661632. Throughput: 0: 1662.0, 1: 1654.9. Samples: 11677614. Policy #0 lag: (min: 31.0, avg: 31.6, max: 48.0) +[2023-10-14 05:58:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:58:38,907][100936] Updated weights for policy 0, policy_version 22790 (0.0009) +[2023-10-14 05:58:39,274][100936] Updated weights for policy 0, policy_version 22800 (0.0009) +[2023-10-14 05:58:39,654][100936] Updated weights for policy 0, policy_version 22810 (0.0010) +[2023-10-14 05:58:41,417][100917] Updated weights for policy 1, policy_version 22792 (0.0010) +[2023-10-14 05:58:41,786][100917] Updated weights for policy 1, policy_version 22802 (0.0009) +[2023-10-14 05:58:42,148][100917] Updated weights for policy 1, policy_version 22812 (0.0010) +[2023-10-14 05:58:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 46727168. Throughput: 0: 1665.7, 1: 1656.9. Samples: 11687820. Policy #0 lag: (min: 31.0, avg: 31.6, max: 48.0) +[2023-10-14 05:58:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:58:43,746][100936] Updated weights for policy 0, policy_version 22820 (0.0010) +[2023-10-14 05:58:44,121][100936] Updated weights for policy 0, policy_version 22830 (0.0008) +[2023-10-14 05:58:44,489][100936] Updated weights for policy 0, policy_version 22840 (0.0007) +[2023-10-14 05:58:46,297][100917] Updated weights for policy 1, policy_version 22822 (0.0009) +[2023-10-14 05:58:46,670][100917] Updated weights for policy 1, policy_version 22832 (0.0009) +[2023-10-14 05:58:47,048][100917] Updated weights for policy 1, policy_version 22842 (0.0009) +[2023-10-14 05:58:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 46792704. Throughput: 0: 1663.5, 1: 1647.4. Samples: 11707320. Policy #0 lag: (min: 31.0, avg: 31.6, max: 48.0) +[2023-10-14 05:58:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:58:48,725][100936] Updated weights for policy 0, policy_version 22850 (0.0010) +[2023-10-14 05:58:49,092][100936] Updated weights for policy 0, policy_version 22860 (0.0008) +[2023-10-14 05:58:49,460][100936] Updated weights for policy 0, policy_version 22870 (0.0007) +[2023-10-14 05:58:49,836][100936] Updated weights for policy 0, policy_version 22880 (0.0010) +[2023-10-14 05:58:51,140][100917] Updated weights for policy 1, policy_version 22852 (0.0010) +[2023-10-14 05:58:51,501][100917] Updated weights for policy 1, policy_version 22862 (0.0008) +[2023-10-14 05:58:51,879][100917] Updated weights for policy 1, policy_version 22872 (0.0009) +[2023-10-14 05:58:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 46858240. Throughput: 0: 1660.1, 1: 1662.7. Samples: 11727486. Policy #0 lag: (min: 16.0, avg: 43.8, max: 48.0) +[2023-10-14 05:58:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:58:53,870][100936] Updated weights for policy 0, policy_version 22890 (0.0008) +[2023-10-14 05:58:54,247][100936] Updated weights for policy 0, policy_version 22900 (0.0010) +[2023-10-14 05:58:54,611][100936] Updated weights for policy 0, policy_version 22910 (0.0010) +[2023-10-14 05:58:56,198][100917] Updated weights for policy 1, policy_version 22882 (0.0010) +[2023-10-14 05:58:56,578][100917] Updated weights for policy 1, policy_version 22892 (0.0007) +[2023-10-14 05:58:56,943][100917] Updated weights for policy 1, policy_version 22902 (0.0007) +[2023-10-14 05:58:57,314][100917] Updated weights for policy 1, policy_version 22912 (0.0008) +[2023-10-14 05:58:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 46923776. Throughput: 0: 1660.8, 1: 1662.7. Samples: 11737636. Policy #0 lag: (min: 16.0, avg: 43.8, max: 48.0) +[2023-10-14 05:58:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:58:58,808][100936] Updated weights for policy 0, policy_version 22920 (0.0009) +[2023-10-14 05:58:59,179][100936] Updated weights for policy 0, policy_version 22930 (0.0008) +[2023-10-14 05:58:59,546][100936] Updated weights for policy 0, policy_version 22940 (0.0009) +[2023-10-14 05:59:01,440][100917] Updated weights for policy 1, policy_version 22922 (0.0008) +[2023-10-14 05:59:01,812][100917] Updated weights for policy 1, policy_version 22932 (0.0009) +[2023-10-14 05:59:02,174][100917] Updated weights for policy 1, policy_version 22942 (0.0008) +[2023-10-14 05:59:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 46989312. Throughput: 0: 1662.1, 1: 1649.9. Samples: 11757128. Policy #0 lag: (min: 16.0, avg: 43.8, max: 48.0) +[2023-10-14 05:59:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:59:03,533][100936] Updated weights for policy 0, policy_version 22950 (0.0009) +[2023-10-14 05:59:03,891][100936] Updated weights for policy 0, policy_version 22960 (0.0008) +[2023-10-14 05:59:04,263][100936] Updated weights for policy 0, policy_version 22970 (0.0010) +[2023-10-14 05:59:06,319][100917] Updated weights for policy 1, policy_version 22952 (0.0008) +[2023-10-14 05:59:06,683][100917] Updated weights for policy 1, policy_version 22962 (0.0007) +[2023-10-14 05:59:07,067][100917] Updated weights for policy 1, policy_version 22972 (0.0007) +[2023-10-14 05:59:08,222][100936] Updated weights for policy 0, policy_version 22980 (0.0008) +[2023-10-14 05:59:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 47054848. Throughput: 0: 1650.1, 1: 1655.3. Samples: 11776864. Policy #0 lag: (min: 16.0, avg: 43.8, max: 48.0) +[2023-10-14 05:59:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:59:08,600][100936] Updated weights for policy 0, policy_version 22990 (0.0008) +[2023-10-14 05:59:08,970][100936] Updated weights for policy 0, policy_version 23000 (0.0007) +[2023-10-14 05:59:11,063][100917] Updated weights for policy 1, policy_version 22982 (0.0007) +[2023-10-14 05:59:11,435][100917] Updated weights for policy 1, policy_version 22992 (0.0011) +[2023-10-14 05:59:11,815][100917] Updated weights for policy 1, policy_version 23002 (0.0011) +[2023-10-14 05:59:13,396][100936] Updated weights for policy 0, policy_version 23010 (0.0009) +[2023-10-14 05:59:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 47120384. Throughput: 0: 1656.4, 1: 1649.1. Samples: 11787118. Policy #0 lag: (min: 6.0, avg: 6.9, max: 27.0) +[2023-10-14 05:59:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:59:13,781][100936] Updated weights for policy 0, policy_version 23020 (0.0008) +[2023-10-14 05:59:14,160][100936] Updated weights for policy 0, policy_version 23030 (0.0007) +[2023-10-14 05:59:14,528][100936] Updated weights for policy 0, policy_version 23040 (0.0007) +[2023-10-14 05:59:15,956][100917] Updated weights for policy 1, policy_version 23012 (0.0010) +[2023-10-14 05:59:16,316][100917] Updated weights for policy 1, policy_version 23022 (0.0009) +[2023-10-14 05:59:16,686][100917] Updated weights for policy 1, policy_version 23032 (0.0009) +[2023-10-14 05:59:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 47185920. Throughput: 0: 1655.2, 1: 1643.6. Samples: 11806314. Policy #0 lag: (min: 6.0, avg: 6.9, max: 27.0) +[2023-10-14 05:59:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:59:18,687][100936] Updated weights for policy 0, policy_version 23050 (0.0008) +[2023-10-14 05:59:19,055][100936] Updated weights for policy 0, policy_version 23060 (0.0007) +[2023-10-14 05:59:19,428][100936] Updated weights for policy 0, policy_version 23070 (0.0007) +[2023-10-14 05:59:20,655][100917] Updated weights for policy 1, policy_version 23042 (0.0008) +[2023-10-14 05:59:21,024][100917] Updated weights for policy 1, policy_version 23052 (0.0010) +[2023-10-14 05:59:21,395][100917] Updated weights for policy 1, policy_version 23062 (0.0011) +[2023-10-14 05:59:21,771][100917] Updated weights for policy 1, policy_version 23072 (0.0007) +[2023-10-14 05:59:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 47251456. Throughput: 0: 1650.8, 1: 1658.1. Samples: 11826514. Policy #0 lag: (min: 6.0, avg: 6.9, max: 27.0) +[2023-10-14 05:59:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:59:23,608][100936] Updated weights for policy 0, policy_version 23080 (0.0009) +[2023-10-14 05:59:23,977][100936] Updated weights for policy 0, policy_version 23090 (0.0011) +[2023-10-14 05:59:24,348][100936] Updated weights for policy 0, policy_version 23100 (0.0008) +[2023-10-14 05:59:26,008][100917] Updated weights for policy 1, policy_version 23082 (0.0008) +[2023-10-14 05:59:26,372][100917] Updated weights for policy 1, policy_version 23092 (0.0009) +[2023-10-14 05:59:26,750][100917] Updated weights for policy 1, policy_version 23102 (0.0007) +[2023-10-14 05:59:28,456][100936] Updated weights for policy 0, policy_version 23110 (0.0009) +[2023-10-14 05:59:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 47316992. Throughput: 0: 1651.7, 1: 1652.2. Samples: 11836496. Policy #0 lag: (min: 6.0, avg: 6.9, max: 27.0) +[2023-10-14 05:59:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:59:28,823][100936] Updated weights for policy 0, policy_version 23120 (0.0009) +[2023-10-14 05:59:29,193][100936] Updated weights for policy 0, policy_version 23130 (0.0008) +[2023-10-14 05:59:30,759][100917] Updated weights for policy 1, policy_version 23112 (0.0009) +[2023-10-14 05:59:31,126][100917] Updated weights for policy 1, policy_version 23122 (0.0011) +[2023-10-14 05:59:31,508][100917] Updated weights for policy 1, policy_version 23132 (0.0010) +[2023-10-14 05:59:33,303][100936] Updated weights for policy 0, policy_version 23140 (0.0009) +[2023-10-14 05:59:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 47382528. Throughput: 0: 1652.5, 1: 1650.4. Samples: 11855952. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 05:59:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:59:33,666][100936] Updated weights for policy 0, policy_version 23150 (0.0009) +[2023-10-14 05:59:34,035][100936] Updated weights for policy 0, policy_version 23160 (0.0008) +[2023-10-14 05:59:35,683][100917] Updated weights for policy 1, policy_version 23142 (0.0008) +[2023-10-14 05:59:36,058][100917] Updated weights for policy 1, policy_version 23152 (0.0008) +[2023-10-14 05:59:36,423][100917] Updated weights for policy 1, policy_version 23162 (0.0009) +[2023-10-14 05:59:38,141][100936] Updated weights for policy 0, policy_version 23170 (0.0010) +[2023-10-14 05:59:38,510][100936] Updated weights for policy 0, policy_version 23180 (0.0011) +[2023-10-14 05:59:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 47448064. Throughput: 0: 1639.4, 1: 1656.9. Samples: 11875820. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 05:59:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:59:38,523][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000023168_23724032.pth... +[2023-10-14 05:59:38,561][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000021632_22151168.pth +[2023-10-14 05:59:38,566][100681] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p1/milestones/checkpoint_000023168_23724032.pth +[2023-10-14 05:59:38,869][100936] Updated weights for policy 0, policy_version 23190 (0.0010) +[2023-10-14 05:59:39,238][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000023200_23756800.pth... +[2023-10-14 05:59:39,241][100936] Updated weights for policy 0, policy_version 23200 (0.0010) +[2023-10-14 05:59:39,267][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000021632_22151168.pth +[2023-10-14 05:59:39,270][100560] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p0/milestones/checkpoint_000023200_23756800.pth +[2023-10-14 05:59:40,460][100917] Updated weights for policy 1, policy_version 23172 (0.0009) +[2023-10-14 05:59:40,871][100917] Updated weights for policy 1, policy_version 23182 (0.0007) +[2023-10-14 05:59:41,234][100917] Updated weights for policy 1, policy_version 23192 (0.0009) +[2023-10-14 05:59:43,429][100936] Updated weights for policy 0, policy_version 23210 (0.0010) +[2023-10-14 05:59:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 47513600. Throughput: 0: 1648.1, 1: 1643.7. Samples: 11885766. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 05:59:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:59:43,800][100936] Updated weights for policy 0, policy_version 23220 (0.0008) +[2023-10-14 05:59:44,170][100936] Updated weights for policy 0, policy_version 23230 (0.0009) +[2023-10-14 05:59:45,133][100917] Updated weights for policy 1, policy_version 23202 (0.0010) +[2023-10-14 05:59:45,502][100917] Updated weights for policy 1, policy_version 23212 (0.0007) +[2023-10-14 05:59:45,868][100917] Updated weights for policy 1, policy_version 23222 (0.0009) +[2023-10-14 05:59:46,242][100917] Updated weights for policy 1, policy_version 23232 (0.0010) +[2023-10-14 05:59:48,257][100936] Updated weights for policy 0, policy_version 23240 (0.0008) +[2023-10-14 05:59:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 47579136. Throughput: 0: 1647.1, 1: 1656.9. Samples: 11905810. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 05:59:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:59:48,627][100936] Updated weights for policy 0, policy_version 23250 (0.0008) +[2023-10-14 05:59:48,990][100936] Updated weights for policy 0, policy_version 23260 (0.0009) +[2023-10-14 05:59:50,383][100917] Updated weights for policy 1, policy_version 23242 (0.0009) +[2023-10-14 05:59:50,760][100917] Updated weights for policy 1, policy_version 23252 (0.0007) +[2023-10-14 05:59:51,135][100917] Updated weights for policy 1, policy_version 23262 (0.0007) +[2023-10-14 05:59:53,222][100936] Updated weights for policy 0, policy_version 23270 (0.0007) +[2023-10-14 05:59:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 47644672. Throughput: 0: 1645.6, 1: 1670.4. Samples: 11926082. Policy #0 lag: (min: 0.0, avg: 28.0, max: 32.0) +[2023-10-14 05:59:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:59:53,591][100936] Updated weights for policy 0, policy_version 23280 (0.0007) +[2023-10-14 05:59:53,959][100936] Updated weights for policy 0, policy_version 23290 (0.0009) +[2023-10-14 05:59:55,301][100917] Updated weights for policy 1, policy_version 23272 (0.0009) +[2023-10-14 05:59:55,670][100917] Updated weights for policy 1, policy_version 23282 (0.0009) +[2023-10-14 05:59:56,043][100917] Updated weights for policy 1, policy_version 23292 (0.0009) +[2023-10-14 05:59:57,982][100936] Updated weights for policy 0, policy_version 23300 (0.0007) +[2023-10-14 05:59:58,377][100936] Updated weights for policy 0, policy_version 23310 (0.0008) +[2023-10-14 05:59:58,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 47710208. Throughput: 0: 1650.8, 1: 1651.9. Samples: 11935740. Policy #0 lag: (min: 0.0, avg: 28.0, max: 32.0) +[2023-10-14 05:59:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 05:59:58,753][100936] Updated weights for policy 0, policy_version 23320 (0.0007) +[2023-10-14 06:00:00,157][100917] Updated weights for policy 1, policy_version 23302 (0.0009) +[2023-10-14 06:00:00,533][100917] Updated weights for policy 1, policy_version 23312 (0.0010) +[2023-10-14 06:00:00,901][100917] Updated weights for policy 1, policy_version 23322 (0.0007) +[2023-10-14 06:00:02,848][100936] Updated weights for policy 0, policy_version 23330 (0.0007) +[2023-10-14 06:00:03,225][100936] Updated weights for policy 0, policy_version 23340 (0.0009) +[2023-10-14 06:00:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 47775744. Throughput: 0: 1654.8, 1: 1667.9. Samples: 11955836. Policy #0 lag: (min: 0.0, avg: 28.0, max: 32.0) +[2023-10-14 06:00:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:00:03,589][100936] Updated weights for policy 0, policy_version 23350 (0.0007) +[2023-10-14 06:00:03,962][100936] Updated weights for policy 0, policy_version 23360 (0.0010) +[2023-10-14 06:00:05,100][100917] Updated weights for policy 1, policy_version 23332 (0.0009) +[2023-10-14 06:00:05,467][100917] Updated weights for policy 1, policy_version 23342 (0.0011) +[2023-10-14 06:00:05,842][100917] Updated weights for policy 1, policy_version 23352 (0.0010) +[2023-10-14 06:00:08,089][100936] Updated weights for policy 0, policy_version 23370 (0.0011) +[2023-10-14 06:00:08,467][100936] Updated weights for policy 0, policy_version 23380 (0.0011) +[2023-10-14 06:00:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 47841280. Throughput: 0: 1647.0, 1: 1666.4. Samples: 11975616. Policy #0 lag: (min: 0.0, avg: 28.0, max: 32.0) +[2023-10-14 06:00:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:00:08,837][100936] Updated weights for policy 0, policy_version 23390 (0.0008) +[2023-10-14 06:00:09,793][100917] Updated weights for policy 1, policy_version 23362 (0.0011) +[2023-10-14 06:00:10,166][100917] Updated weights for policy 1, policy_version 23372 (0.0007) +[2023-10-14 06:00:10,538][100917] Updated weights for policy 1, policy_version 23382 (0.0008) +[2023-10-14 06:00:10,901][100917] Updated weights for policy 1, policy_version 23392 (0.0011) +[2023-10-14 06:00:13,015][100936] Updated weights for policy 0, policy_version 23400 (0.0007) +[2023-10-14 06:00:13,391][100936] Updated weights for policy 0, policy_version 23410 (0.0007) +[2023-10-14 06:00:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 47906816. Throughput: 0: 1663.3, 1: 1647.7. Samples: 11985492. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:00:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:00:13,753][100936] Updated weights for policy 0, policy_version 23420 (0.0007) +[2023-10-14 06:00:15,055][100917] Updated weights for policy 1, policy_version 23402 (0.0008) +[2023-10-14 06:00:15,433][100917] Updated weights for policy 1, policy_version 23412 (0.0008) +[2023-10-14 06:00:15,805][100917] Updated weights for policy 1, policy_version 23422 (0.0008) +[2023-10-14 06:00:17,802][100936] Updated weights for policy 0, policy_version 23430 (0.0008) +[2023-10-14 06:00:18,172][100936] Updated weights for policy 0, policy_version 23440 (0.0008) +[2023-10-14 06:00:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 47972352. Throughput: 0: 1665.6, 1: 1665.1. Samples: 12005832. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:00:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:00:18,538][100936] Updated weights for policy 0, policy_version 23450 (0.0010) +[2023-10-14 06:00:20,158][100917] Updated weights for policy 1, policy_version 23432 (0.0010) +[2023-10-14 06:00:20,535][100917] Updated weights for policy 1, policy_version 23442 (0.0009) +[2023-10-14 06:00:20,903][100917] Updated weights for policy 1, policy_version 23452 (0.0009) +[2023-10-14 06:00:22,488][100936] Updated weights for policy 0, policy_version 23460 (0.0007) +[2023-10-14 06:00:22,851][100936] Updated weights for policy 0, policy_version 23470 (0.0007) +[2023-10-14 06:00:23,221][100936] Updated weights for policy 0, policy_version 23480 (0.0007) +[2023-10-14 06:00:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 48037888. Throughput: 0: 1655.7, 1: 1666.1. Samples: 12025300. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:00:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:00:25,035][100917] Updated weights for policy 1, policy_version 23462 (0.0008) +[2023-10-14 06:00:25,406][100917] Updated weights for policy 1, policy_version 23472 (0.0007) +[2023-10-14 06:00:25,785][100917] Updated weights for policy 1, policy_version 23482 (0.0008) +[2023-10-14 06:00:27,419][100936] Updated weights for policy 0, policy_version 23490 (0.0007) +[2023-10-14 06:00:27,785][100936] Updated weights for policy 0, policy_version 23500 (0.0008) +[2023-10-14 06:00:28,163][100936] Updated weights for policy 0, policy_version 23510 (0.0008) +[2023-10-14 06:00:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 48103424. Throughput: 0: 1672.0, 1: 1652.5. Samples: 12035372. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:00:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:00:28,533][100936] Updated weights for policy 0, policy_version 23520 (0.0009) +[2023-10-14 06:00:29,963][100917] Updated weights for policy 1, policy_version 23492 (0.0007) +[2023-10-14 06:00:30,355][100917] Updated weights for policy 1, policy_version 23502 (0.0010) +[2023-10-14 06:00:30,735][100917] Updated weights for policy 1, policy_version 23512 (0.0010) +[2023-10-14 06:00:32,700][100936] Updated weights for policy 0, policy_version 23530 (0.0008) +[2023-10-14 06:00:33,081][100936] Updated weights for policy 0, policy_version 23540 (0.0007) +[2023-10-14 06:00:33,452][100936] Updated weights for policy 0, policy_version 23550 (0.0007) +[2023-10-14 06:00:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 48168960. Throughput: 0: 1661.0, 1: 1663.9. Samples: 12055430. Policy #0 lag: (min: 8.0, avg: 31.3, max: 40.0) +[2023-10-14 06:00:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:00:34,767][100917] Updated weights for policy 1, policy_version 23522 (0.0009) +[2023-10-14 06:00:35,150][100917] Updated weights for policy 1, policy_version 23532 (0.0007) +[2023-10-14 06:00:35,526][100917] Updated weights for policy 1, policy_version 23542 (0.0008) +[2023-10-14 06:00:35,893][100917] Updated weights for policy 1, policy_version 23552 (0.0008) +[2023-10-14 06:00:37,631][100936] Updated weights for policy 0, policy_version 23560 (0.0010) +[2023-10-14 06:00:38,006][100936] Updated weights for policy 0, policy_version 23570 (0.0008) +[2023-10-14 06:00:38,369][100936] Updated weights for policy 0, policy_version 23580 (0.0007) +[2023-10-14 06:00:38,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 48267264. Throughput: 0: 1645.1, 1: 1658.0. Samples: 12074720. Policy #0 lag: (min: 8.0, avg: 31.3, max: 40.0) +[2023-10-14 06:00:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:00:39,869][100917] Updated weights for policy 1, policy_version 23562 (0.0010) +[2023-10-14 06:00:40,231][100917] Updated weights for policy 1, policy_version 23572 (0.0010) +[2023-10-14 06:00:40,608][100917] Updated weights for policy 1, policy_version 23582 (0.0009) +[2023-10-14 06:00:42,567][100936] Updated weights for policy 0, policy_version 23590 (0.0008) +[2023-10-14 06:00:42,942][100936] Updated weights for policy 0, policy_version 23600 (0.0007) +[2023-10-14 06:00:43,315][100936] Updated weights for policy 0, policy_version 23610 (0.0008) +[2023-10-14 06:00:43,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 48300032. Throughput: 0: 1659.2, 1: 1657.0. Samples: 12084970. Policy #0 lag: (min: 8.0, avg: 31.3, max: 40.0) +[2023-10-14 06:00:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:00:44,774][100917] Updated weights for policy 1, policy_version 23592 (0.0008) +[2023-10-14 06:00:45,151][100917] Updated weights for policy 1, policy_version 23602 (0.0007) +[2023-10-14 06:00:45,520][100917] Updated weights for policy 1, policy_version 23612 (0.0008) +[2023-10-14 06:00:47,561][100936] Updated weights for policy 0, policy_version 23620 (0.0008) +[2023-10-14 06:00:47,954][100936] Updated weights for policy 0, policy_version 23630 (0.0008) +[2023-10-14 06:00:48,328][100936] Updated weights for policy 0, policy_version 23640 (0.0008) +[2023-10-14 06:00:48,512][99942] Fps is (10 sec: 9830.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 48365568. Throughput: 0: 1659.3, 1: 1660.2. Samples: 12105210. Policy #0 lag: (min: 8.0, avg: 31.3, max: 40.0) +[2023-10-14 06:00:48,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:00:49,768][100917] Updated weights for policy 1, policy_version 23622 (0.0008) +[2023-10-14 06:00:50,129][100917] Updated weights for policy 1, policy_version 23632 (0.0010) +[2023-10-14 06:00:50,500][100917] Updated weights for policy 1, policy_version 23642 (0.0011) +[2023-10-14 06:00:52,337][100936] Updated weights for policy 0, policy_version 23650 (0.0007) +[2023-10-14 06:00:52,700][100936] Updated weights for policy 0, policy_version 23660 (0.0007) +[2023-10-14 06:00:53,072][100936] Updated weights for policy 0, policy_version 23670 (0.0007) +[2023-10-14 06:00:53,436][100936] Updated weights for policy 0, policy_version 23680 (0.0008) +[2023-10-14 06:00:53,512][99942] Fps is (10 sec: 16384.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 48463872. Throughput: 0: 1648.9, 1: 1660.3. Samples: 12124528. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 06:00:53,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:00:54,661][100917] Updated weights for policy 1, policy_version 23652 (0.0010) +[2023-10-14 06:00:55,028][100917] Updated weights for policy 1, policy_version 23662 (0.0010) +[2023-10-14 06:00:55,406][100917] Updated weights for policy 1, policy_version 23672 (0.0009) +[2023-10-14 06:00:57,677][100936] Updated weights for policy 0, policy_version 23690 (0.0007) +[2023-10-14 06:00:58,049][100936] Updated weights for policy 0, policy_version 23700 (0.0007) +[2023-10-14 06:00:58,420][100936] Updated weights for policy 0, policy_version 23710 (0.0007) +[2023-10-14 06:00:58,512][99942] Fps is (10 sec: 16383.5, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 48529408. Throughput: 0: 1652.4, 1: 1653.6. Samples: 12134262. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 06:00:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:00:59,493][100917] Updated weights for policy 1, policy_version 23682 (0.0009) +[2023-10-14 06:00:59,863][100917] Updated weights for policy 1, policy_version 23692 (0.0009) +[2023-10-14 06:01:00,235][100917] Updated weights for policy 1, policy_version 23702 (0.0009) +[2023-10-14 06:01:00,610][100917] Updated weights for policy 1, policy_version 23712 (0.0009) +[2023-10-14 06:01:02,466][100936] Updated weights for policy 0, policy_version 23720 (0.0007) +[2023-10-14 06:01:02,837][100936] Updated weights for policy 0, policy_version 23730 (0.0009) +[2023-10-14 06:01:03,202][100936] Updated weights for policy 0, policy_version 23740 (0.0010) +[2023-10-14 06:01:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 48594944. Throughput: 0: 1648.7, 1: 1660.3. Samples: 12154738. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 06:01:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:01:04,605][100917] Updated weights for policy 1, policy_version 23722 (0.0009) +[2023-10-14 06:01:04,986][100917] Updated weights for policy 1, policy_version 23732 (0.0009) +[2023-10-14 06:01:05,358][100917] Updated weights for policy 1, policy_version 23742 (0.0009) +[2023-10-14 06:01:07,409][100936] Updated weights for policy 0, policy_version 23750 (0.0008) +[2023-10-14 06:01:07,783][100936] Updated weights for policy 0, policy_version 23760 (0.0007) +[2023-10-14 06:01:08,146][100936] Updated weights for policy 0, policy_version 23770 (0.0007) +[2023-10-14 06:01:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 48660480. Throughput: 0: 1650.1, 1: 1663.8. Samples: 12174424. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) +[2023-10-14 06:01:08,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:01:09,390][100917] Updated weights for policy 1, policy_version 23752 (0.0009) +[2023-10-14 06:01:09,762][100917] Updated weights for policy 1, policy_version 23762 (0.0009) +[2023-10-14 06:01:10,133][100917] Updated weights for policy 1, policy_version 23772 (0.0008) +[2023-10-14 06:01:12,378][100936] Updated weights for policy 0, policy_version 23780 (0.0008) +[2023-10-14 06:01:12,746][100936] Updated weights for policy 0, policy_version 23790 (0.0007) +[2023-10-14 06:01:13,114][100936] Updated weights for policy 0, policy_version 23800 (0.0007) +[2023-10-14 06:01:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 48726016. Throughput: 0: 1658.2, 1: 1661.3. Samples: 12184750. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) +[2023-10-14 06:01:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:01:14,260][100917] Updated weights for policy 1, policy_version 23782 (0.0010) +[2023-10-14 06:01:14,620][100917] Updated weights for policy 1, policy_version 23792 (0.0010) +[2023-10-14 06:01:14,998][100917] Updated weights for policy 1, policy_version 23802 (0.0008) +[2023-10-14 06:01:17,073][100936] Updated weights for policy 0, policy_version 23810 (0.0007) +[2023-10-14 06:01:17,445][100936] Updated weights for policy 0, policy_version 23820 (0.0008) +[2023-10-14 06:01:17,808][100936] Updated weights for policy 0, policy_version 23830 (0.0010) +[2023-10-14 06:01:18,181][100936] Updated weights for policy 0, policy_version 23840 (0.0009) +[2023-10-14 06:01:18,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 48791552. Throughput: 0: 1656.7, 1: 1662.8. Samples: 12204806. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) +[2023-10-14 06:01:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:01:19,127][100917] Updated weights for policy 1, policy_version 23812 (0.0009) +[2023-10-14 06:01:19,534][100917] Updated weights for policy 1, policy_version 23822 (0.0008) +[2023-10-14 06:01:19,897][100917] Updated weights for policy 1, policy_version 23832 (0.0009) +[2023-10-14 06:01:22,136][100936] Updated weights for policy 0, policy_version 23850 (0.0010) +[2023-10-14 06:01:22,499][100936] Updated weights for policy 0, policy_version 23860 (0.0011) +[2023-10-14 06:01:22,872][100936] Updated weights for policy 0, policy_version 23870 (0.0010) +[2023-10-14 06:01:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 48857088. Throughput: 0: 1663.4, 1: 1662.7. Samples: 12224396. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) +[2023-10-14 06:01:23,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:01:23,834][100917] Updated weights for policy 1, policy_version 23842 (0.0008) +[2023-10-14 06:01:24,216][100917] Updated weights for policy 1, policy_version 23852 (0.0009) +[2023-10-14 06:01:24,577][100917] Updated weights for policy 1, policy_version 23862 (0.0008) +[2023-10-14 06:01:24,954][100917] Updated weights for policy 1, policy_version 23872 (0.0007) +[2023-10-14 06:01:26,893][100936] Updated weights for policy 0, policy_version 23880 (0.0010) +[2023-10-14 06:01:27,273][100936] Updated weights for policy 0, policy_version 23890 (0.0010) +[2023-10-14 06:01:27,658][100936] Updated weights for policy 0, policy_version 23900 (0.0008) +[2023-10-14 06:01:28,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 48922624. Throughput: 0: 1672.1, 1: 1658.6. Samples: 12234850. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) +[2023-10-14 06:01:28,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:01:29,040][100917] Updated weights for policy 1, policy_version 23882 (0.0009) +[2023-10-14 06:01:29,415][100917] Updated weights for policy 1, policy_version 23892 (0.0007) +[2023-10-14 06:01:29,798][100917] Updated weights for policy 1, policy_version 23902 (0.0010) +[2023-10-14 06:01:31,827][100936] Updated weights for policy 0, policy_version 23910 (0.0007) +[2023-10-14 06:01:32,199][100936] Updated weights for policy 0, policy_version 23920 (0.0007) +[2023-10-14 06:01:32,574][100936] Updated weights for policy 0, policy_version 23930 (0.0011) +[2023-10-14 06:01:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 48988160. Throughput: 0: 1654.5, 1: 1663.2. Samples: 12254508. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) +[2023-10-14 06:01:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:01:33,843][100917] Updated weights for policy 1, policy_version 23912 (0.0007) +[2023-10-14 06:01:34,208][100917] Updated weights for policy 1, policy_version 23922 (0.0010) +[2023-10-14 06:01:34,582][100917] Updated weights for policy 1, policy_version 23932 (0.0010) +[2023-10-14 06:01:36,716][100936] Updated weights for policy 0, policy_version 23940 (0.0008) +[2023-10-14 06:01:37,115][100936] Updated weights for policy 0, policy_version 23950 (0.0007) +[2023-10-14 06:01:37,493][100936] Updated weights for policy 0, policy_version 23960 (0.0009) +[2023-10-14 06:01:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 49053696. Throughput: 0: 1664.7, 1: 1663.8. Samples: 12274310. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) +[2023-10-14 06:01:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:01:38,523][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000023968_24543232.pth... +[2023-10-14 06:01:38,563][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000022400_22937600.pth +[2023-10-14 06:01:38,773][100917] Updated weights for policy 1, policy_version 23942 (0.0008) +[2023-10-14 06:01:39,157][100917] Updated weights for policy 1, policy_version 23952 (0.0009) +[2023-10-14 06:01:39,526][100917] Updated weights for policy 1, policy_version 23962 (0.0009) +[2023-10-14 06:01:39,744][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000023968_24543232.pth... +[2023-10-14 06:01:39,772][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000022400_22937600.pth +[2023-10-14 06:01:41,375][100936] Updated weights for policy 0, policy_version 23970 (0.0007) +[2023-10-14 06:01:41,751][100936] Updated weights for policy 0, policy_version 23980 (0.0008) +[2023-10-14 06:01:42,124][100936] Updated weights for policy 0, policy_version 23990 (0.0007) +[2023-10-14 06:01:42,492][100936] Updated weights for policy 0, policy_version 24000 (0.0009) +[2023-10-14 06:01:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 49119232. Throughput: 0: 1669.8, 1: 1665.7. Samples: 12284358. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) +[2023-10-14 06:01:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:01:43,815][100917] Updated weights for policy 1, policy_version 23972 (0.0008) +[2023-10-14 06:01:44,181][100917] Updated weights for policy 1, policy_version 23982 (0.0010) +[2023-10-14 06:01:44,559][100917] Updated weights for policy 1, policy_version 23992 (0.0008) +[2023-10-14 06:01:46,530][100936] Updated weights for policy 0, policy_version 24010 (0.0008) +[2023-10-14 06:01:46,908][100936] Updated weights for policy 0, policy_version 24020 (0.0011) +[2023-10-14 06:01:47,286][100936] Updated weights for policy 0, policy_version 24030 (0.0011) +[2023-10-14 06:01:48,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.2, 300 sec: 13218.3). Total num frames: 49184768. Throughput: 0: 1649.6, 1: 1661.5. Samples: 12303740. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 06:01:48,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:01:48,851][100917] Updated weights for policy 1, policy_version 24002 (0.0008) +[2023-10-14 06:01:49,222][100917] Updated weights for policy 1, policy_version 24012 (0.0007) +[2023-10-14 06:01:49,589][100917] Updated weights for policy 1, policy_version 24022 (0.0010) +[2023-10-14 06:01:49,967][100917] Updated weights for policy 1, policy_version 24032 (0.0008) +[2023-10-14 06:01:51,382][100936] Updated weights for policy 0, policy_version 24040 (0.0010) +[2023-10-14 06:01:51,758][100936] Updated weights for policy 0, policy_version 24050 (0.0007) +[2023-10-14 06:01:52,125][100936] Updated weights for policy 0, policy_version 24060 (0.0008) +[2023-10-14 06:01:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 49250304. Throughput: 0: 1671.2, 1: 1661.3. Samples: 12324388. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 06:01:53,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:01:54,231][100917] Updated weights for policy 1, policy_version 24042 (0.0007) +[2023-10-14 06:01:54,607][100917] Updated weights for policy 1, policy_version 24052 (0.0007) +[2023-10-14 06:01:54,989][100917] Updated weights for policy 1, policy_version 24062 (0.0009) +[2023-10-14 06:01:56,319][100936] Updated weights for policy 0, policy_version 24070 (0.0009) +[2023-10-14 06:01:56,699][100936] Updated weights for policy 0, policy_version 24080 (0.0007) +[2023-10-14 06:01:57,070][100936] Updated weights for policy 0, policy_version 24090 (0.0007) +[2023-10-14 06:01:58,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 49315840. Throughput: 0: 1663.5, 1: 1659.2. Samples: 12334272. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 06:01:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:01:59,140][100917] Updated weights for policy 1, policy_version 24072 (0.0009) +[2023-10-14 06:01:59,516][100917] Updated weights for policy 1, policy_version 24082 (0.0010) +[2023-10-14 06:01:59,885][100917] Updated weights for policy 1, policy_version 24092 (0.0010) +[2023-10-14 06:02:01,097][100936] Updated weights for policy 0, policy_version 24100 (0.0007) +[2023-10-14 06:02:01,479][100936] Updated weights for policy 0, policy_version 24110 (0.0009) +[2023-10-14 06:02:01,850][100936] Updated weights for policy 0, policy_version 24120 (0.0011) +[2023-10-14 06:02:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 49381376. Throughput: 0: 1654.3, 1: 1658.9. Samples: 12353900. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 06:02:03,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:02:04,001][100917] Updated weights for policy 1, policy_version 24102 (0.0009) +[2023-10-14 06:02:04,393][100917] Updated weights for policy 1, policy_version 24112 (0.0007) +[2023-10-14 06:02:04,757][100917] Updated weights for policy 1, policy_version 24122 (0.0009) +[2023-10-14 06:02:06,126][100936] Updated weights for policy 0, policy_version 24130 (0.0009) +[2023-10-14 06:02:06,498][100936] Updated weights for policy 0, policy_version 24140 (0.0007) +[2023-10-14 06:02:06,869][100936] Updated weights for policy 0, policy_version 24150 (0.0010) +[2023-10-14 06:02:07,243][100936] Updated weights for policy 0, policy_version 24160 (0.0009) +[2023-10-14 06:02:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 49446912. Throughput: 0: 1671.7, 1: 1664.4. Samples: 12374518. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-10-14 06:02:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:02:08,661][100917] Updated weights for policy 1, policy_version 24132 (0.0008) +[2023-10-14 06:02:09,034][100917] Updated weights for policy 1, policy_version 24142 (0.0008) +[2023-10-14 06:02:09,412][100917] Updated weights for policy 1, policy_version 24152 (0.0009) +[2023-10-14 06:02:11,276][100936] Updated weights for policy 0, policy_version 24170 (0.0009) +[2023-10-14 06:02:11,650][100936] Updated weights for policy 0, policy_version 24180 (0.0008) +[2023-10-14 06:02:12,015][100936] Updated weights for policy 0, policy_version 24190 (0.0010) +[2023-10-14 06:02:13,477][100917] Updated weights for policy 1, policy_version 24162 (0.0010) +[2023-10-14 06:02:13,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 49512448. Throughput: 0: 1659.0, 1: 1666.2. Samples: 12384484. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-10-14 06:02:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:02:13,859][100917] Updated weights for policy 1, policy_version 24172 (0.0010) +[2023-10-14 06:02:14,222][100917] Updated weights for policy 1, policy_version 24182 (0.0009) +[2023-10-14 06:02:14,596][100917] Updated weights for policy 1, policy_version 24192 (0.0010) +[2023-10-14 06:02:16,070][100936] Updated weights for policy 0, policy_version 24200 (0.0008) +[2023-10-14 06:02:16,440][100936] Updated weights for policy 0, policy_version 24210 (0.0010) +[2023-10-14 06:02:16,805][100936] Updated weights for policy 0, policy_version 24220 (0.0008) +[2023-10-14 06:02:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 49577984. Throughput: 0: 1658.9, 1: 1666.4. Samples: 12404148. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-10-14 06:02:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:02:18,680][100917] Updated weights for policy 1, policy_version 24202 (0.0008) +[2023-10-14 06:02:19,050][100917] Updated weights for policy 1, policy_version 24212 (0.0007) +[2023-10-14 06:02:19,421][100917] Updated weights for policy 1, policy_version 24222 (0.0007) +[2023-10-14 06:02:20,823][100936] Updated weights for policy 0, policy_version 24230 (0.0009) +[2023-10-14 06:02:21,210][100936] Updated weights for policy 0, policy_version 24240 (0.0008) +[2023-10-14 06:02:21,578][100936] Updated weights for policy 0, policy_version 24250 (0.0008) +[2023-10-14 06:02:23,410][100917] Updated weights for policy 1, policy_version 24232 (0.0009) +[2023-10-14 06:02:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 49643520. Throughput: 0: 1679.2, 1: 1667.6. Samples: 12424918. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-10-14 06:02:23,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:02:23,778][100917] Updated weights for policy 1, policy_version 24242 (0.0008) +[2023-10-14 06:02:24,148][100917] Updated weights for policy 1, policy_version 24252 (0.0009) +[2023-10-14 06:02:25,508][100936] Updated weights for policy 0, policy_version 24260 (0.0009) +[2023-10-14 06:02:25,874][100936] Updated weights for policy 0, policy_version 24270 (0.0009) +[2023-10-14 06:02:26,252][100936] Updated weights for policy 0, policy_version 24280 (0.0008) +[2023-10-14 06:02:28,337][100917] Updated weights for policy 1, policy_version 24262 (0.0007) +[2023-10-14 06:02:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 49709056. Throughput: 0: 1658.8, 1: 1668.8. Samples: 12434098. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 06:02:28,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:02:28,707][100917] Updated weights for policy 1, policy_version 24272 (0.0007) +[2023-10-14 06:02:29,089][100917] Updated weights for policy 1, policy_version 24282 (0.0009) +[2023-10-14 06:02:30,372][100936] Updated weights for policy 0, policy_version 24290 (0.0009) +[2023-10-14 06:02:30,739][100936] Updated weights for policy 0, policy_version 24300 (0.0009) +[2023-10-14 06:02:31,107][100936] Updated weights for policy 0, policy_version 24310 (0.0009) +[2023-10-14 06:02:31,476][100936] Updated weights for policy 0, policy_version 24320 (0.0008) +[2023-10-14 06:02:33,142][100917] Updated weights for policy 1, policy_version 24292 (0.0010) +[2023-10-14 06:02:33,507][100917] Updated weights for policy 1, policy_version 24302 (0.0007) +[2023-10-14 06:02:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 49774592. Throughput: 0: 1676.8, 1: 1668.4. Samples: 12454274. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 06:02:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:02:33,873][100917] Updated weights for policy 1, policy_version 24312 (0.0008) +[2023-10-14 06:02:35,722][100936] Updated weights for policy 0, policy_version 24330 (0.0008) +[2023-10-14 06:02:36,091][100936] Updated weights for policy 0, policy_version 24340 (0.0009) +[2023-10-14 06:02:36,452][100936] Updated weights for policy 0, policy_version 24350 (0.0008) +[2023-10-14 06:02:38,238][100917] Updated weights for policy 1, policy_version 24322 (0.0009) +[2023-10-14 06:02:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 49840128. Throughput: 0: 1677.4, 1: 1663.0. Samples: 12474708. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 06:02:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:02:38,612][100917] Updated weights for policy 1, policy_version 24332 (0.0008) +[2023-10-14 06:02:38,999][100917] Updated weights for policy 1, policy_version 24342 (0.0009) +[2023-10-14 06:02:39,366][100917] Updated weights for policy 1, policy_version 24352 (0.0010) +[2023-10-14 06:02:40,491][100936] Updated weights for policy 0, policy_version 24360 (0.0009) +[2023-10-14 06:02:40,850][100936] Updated weights for policy 0, policy_version 24370 (0.0011) +[2023-10-14 06:02:41,220][100936] Updated weights for policy 0, policy_version 24380 (0.0008) +[2023-10-14 06:02:43,266][100917] Updated weights for policy 1, policy_version 24362 (0.0007) +[2023-10-14 06:02:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 49905664. Throughput: 0: 1656.1, 1: 1667.7. Samples: 12483842. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 06:02:43,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:02:43,641][100917] Updated weights for policy 1, policy_version 24372 (0.0009) +[2023-10-14 06:02:44,023][100917] Updated weights for policy 1, policy_version 24382 (0.0008) +[2023-10-14 06:02:45,405][100936] Updated weights for policy 0, policy_version 24390 (0.0009) +[2023-10-14 06:02:45,778][100936] Updated weights for policy 0, policy_version 24400 (0.0009) +[2023-10-14 06:02:46,153][100936] Updated weights for policy 0, policy_version 24410 (0.0008) +[2023-10-14 06:02:48,024][100917] Updated weights for policy 1, policy_version 24392 (0.0009) +[2023-10-14 06:02:48,396][100917] Updated weights for policy 1, policy_version 24402 (0.0010) +[2023-10-14 06:02:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 49971200. Throughput: 0: 1677.6, 1: 1671.1. Samples: 12504590. Policy #0 lag: (min: 22.0, avg: 45.0, max: 48.0) +[2023-10-14 06:02:48,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:02:48,767][100917] Updated weights for policy 1, policy_version 24412 (0.0011) +[2023-10-14 06:02:50,268][100936] Updated weights for policy 0, policy_version 24420 (0.0008) +[2023-10-14 06:02:50,638][100936] Updated weights for policy 0, policy_version 24430 (0.0008) +[2023-10-14 06:02:51,007][100936] Updated weights for policy 0, policy_version 24440 (0.0010) +[2023-10-14 06:02:52,971][100917] Updated weights for policy 1, policy_version 24422 (0.0009) +[2023-10-14 06:02:53,351][100917] Updated weights for policy 1, policy_version 24432 (0.0010) +[2023-10-14 06:02:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 50036736. Throughput: 0: 1681.2, 1: 1654.4. Samples: 12524620. Policy #0 lag: (min: 22.0, avg: 45.0, max: 48.0) +[2023-10-14 06:02:53,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:02:53,726][100917] Updated weights for policy 1, policy_version 24442 (0.0007) +[2023-10-14 06:02:55,274][100936] Updated weights for policy 0, policy_version 24450 (0.0010) +[2023-10-14 06:02:55,644][100936] Updated weights for policy 0, policy_version 24460 (0.0010) +[2023-10-14 06:02:56,010][100936] Updated weights for policy 0, policy_version 24470 (0.0009) +[2023-10-14 06:02:56,388][100936] Updated weights for policy 0, policy_version 24480 (0.0008) +[2023-10-14 06:02:57,745][100917] Updated weights for policy 1, policy_version 24452 (0.0009) +[2023-10-14 06:02:58,120][100917] Updated weights for policy 1, policy_version 24462 (0.0007) +[2023-10-14 06:02:58,493][100917] Updated weights for policy 1, policy_version 24472 (0.0008) +[2023-10-14 06:02:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 50102272. Throughput: 0: 1659.3, 1: 1658.0. Samples: 12533758. Policy #0 lag: (min: 22.0, avg: 45.0, max: 48.0) +[2023-10-14 06:02:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:03:00,381][100936] Updated weights for policy 0, policy_version 24490 (0.0010) +[2023-10-14 06:03:00,744][100936] Updated weights for policy 0, policy_version 24500 (0.0010) +[2023-10-14 06:03:01,120][100936] Updated weights for policy 0, policy_version 24510 (0.0010) +[2023-10-14 06:03:02,600][100917] Updated weights for policy 1, policy_version 24482 (0.0008) +[2023-10-14 06:03:02,976][100917] Updated weights for policy 1, policy_version 24492 (0.0008) +[2023-10-14 06:03:03,346][100917] Updated weights for policy 1, policy_version 24502 (0.0009) +[2023-10-14 06:03:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 50167808. Throughput: 0: 1671.2, 1: 1665.0. Samples: 12554280. Policy #0 lag: (min: 22.0, avg: 45.0, max: 48.0) +[2023-10-14 06:03:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:03:03,736][100917] Updated weights for policy 1, policy_version 24512 (0.0008) +[2023-10-14 06:03:05,203][100936] Updated weights for policy 0, policy_version 24520 (0.0009) +[2023-10-14 06:03:05,576][100936] Updated weights for policy 0, policy_version 24530 (0.0007) +[2023-10-14 06:03:05,942][100936] Updated weights for policy 0, policy_version 24540 (0.0007) +[2023-10-14 06:03:07,912][100917] Updated weights for policy 1, policy_version 24522 (0.0008) +[2023-10-14 06:03:08,295][100917] Updated weights for policy 1, policy_version 24532 (0.0008) +[2023-10-14 06:03:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 50233344. Throughput: 0: 1664.7, 1: 1652.1. Samples: 12574174. Policy #0 lag: (min: 3.0, avg: 13.5, max: 35.0) +[2023-10-14 06:03:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:03:08,669][100917] Updated weights for policy 1, policy_version 24542 (0.0008) +[2023-10-14 06:03:10,208][100936] Updated weights for policy 0, policy_version 24550 (0.0011) +[2023-10-14 06:03:10,600][100936] Updated weights for policy 0, policy_version 24560 (0.0007) +[2023-10-14 06:03:10,964][100936] Updated weights for policy 0, policy_version 24570 (0.0010) +[2023-10-14 06:03:12,825][100917] Updated weights for policy 1, policy_version 24552 (0.0010) +[2023-10-14 06:03:13,199][100917] Updated weights for policy 1, policy_version 24562 (0.0007) +[2023-10-14 06:03:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 50298880. Throughput: 0: 1654.4, 1: 1660.6. Samples: 12583276. Policy #0 lag: (min: 3.0, avg: 13.5, max: 35.0) +[2023-10-14 06:03:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:03:13,582][100917] Updated weights for policy 1, policy_version 24572 (0.0007) +[2023-10-14 06:03:15,057][100936] Updated weights for policy 0, policy_version 24580 (0.0009) +[2023-10-14 06:03:15,446][100936] Updated weights for policy 0, policy_version 24590 (0.0008) +[2023-10-14 06:03:15,812][100936] Updated weights for policy 0, policy_version 24600 (0.0009) +[2023-10-14 06:03:17,843][100917] Updated weights for policy 1, policy_version 24582 (0.0008) +[2023-10-14 06:03:18,215][100917] Updated weights for policy 1, policy_version 24592 (0.0008) +[2023-10-14 06:03:18,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 50364416. Throughput: 0: 1664.4, 1: 1657.7. Samples: 12603766. Policy #0 lag: (min: 3.0, avg: 13.5, max: 35.0) +[2023-10-14 06:03:18,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:03:18,595][100917] Updated weights for policy 1, policy_version 24602 (0.0010) +[2023-10-14 06:03:19,790][100936] Updated weights for policy 0, policy_version 24610 (0.0008) +[2023-10-14 06:03:20,154][100936] Updated weights for policy 0, policy_version 24620 (0.0008) +[2023-10-14 06:03:20,521][100936] Updated weights for policy 0, policy_version 24630 (0.0008) +[2023-10-14 06:03:20,893][100936] Updated weights for policy 0, policy_version 24640 (0.0007) +[2023-10-14 06:03:22,660][100917] Updated weights for policy 1, policy_version 24612 (0.0008) +[2023-10-14 06:03:23,041][100917] Updated weights for policy 1, policy_version 24622 (0.0008) +[2023-10-14 06:03:23,419][100917] Updated weights for policy 1, policy_version 24632 (0.0007) +[2023-10-14 06:03:23,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 50429952. Throughput: 0: 1670.3, 1: 1648.2. Samples: 12624040. Policy #0 lag: (min: 3.0, avg: 13.5, max: 35.0) +[2023-10-14 06:03:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:03:24,711][100936] Updated weights for policy 0, policy_version 24650 (0.0007) +[2023-10-14 06:03:25,078][100936] Updated weights for policy 0, policy_version 24660 (0.0009) +[2023-10-14 06:03:25,441][100936] Updated weights for policy 0, policy_version 24670 (0.0009) +[2023-10-14 06:03:27,586][100917] Updated weights for policy 1, policy_version 24642 (0.0010) +[2023-10-14 06:03:27,960][100917] Updated weights for policy 1, policy_version 24652 (0.0007) +[2023-10-14 06:03:28,334][100917] Updated weights for policy 1, policy_version 24662 (0.0009) +[2023-10-14 06:03:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 50495488. Throughput: 0: 1668.0, 1: 1655.8. Samples: 12633416. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-10-14 06:03:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:03:28,705][100917] Updated weights for policy 1, policy_version 24672 (0.0011) +[2023-10-14 06:03:29,707][100936] Updated weights for policy 0, policy_version 24680 (0.0009) +[2023-10-14 06:03:30,085][100936] Updated weights for policy 0, policy_version 24690 (0.0007) +[2023-10-14 06:03:30,449][100936] Updated weights for policy 0, policy_version 24700 (0.0009) +[2023-10-14 06:03:32,988][100917] Updated weights for policy 1, policy_version 24682 (0.0010) +[2023-10-14 06:03:33,361][100917] Updated weights for policy 1, policy_version 24692 (0.0009) +[2023-10-14 06:03:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 50561024. Throughput: 0: 1671.7, 1: 1647.8. Samples: 12653968. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-10-14 06:03:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:03:33,737][100917] Updated weights for policy 1, policy_version 24702 (0.0011) +[2023-10-14 06:03:34,349][100936] Updated weights for policy 0, policy_version 24710 (0.0011) +[2023-10-14 06:03:34,718][100936] Updated weights for policy 0, policy_version 24720 (0.0008) +[2023-10-14 06:03:35,090][100936] Updated weights for policy 0, policy_version 24730 (0.0008) +[2023-10-14 06:03:37,727][100917] Updated weights for policy 1, policy_version 24712 (0.0007) +[2023-10-14 06:03:38,098][100917] Updated weights for policy 1, policy_version 24722 (0.0008) +[2023-10-14 06:03:38,478][100917] Updated weights for policy 1, policy_version 24732 (0.0007) +[2023-10-14 06:03:38,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 50626560. Throughput: 0: 1677.4, 1: 1647.7. Samples: 12674248. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-10-14 06:03:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:03:38,519][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000024736_25329664.pth... +[2023-10-14 06:03:38,548][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000023200_23756800.pth +[2023-10-14 06:03:38,621][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000024736_25329664.pth... +[2023-10-14 06:03:38,657][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000023168_23724032.pth +[2023-10-14 06:03:39,224][100936] Updated weights for policy 0, policy_version 24740 (0.0008) +[2023-10-14 06:03:39,591][100936] Updated weights for policy 0, policy_version 24750 (0.0008) +[2023-10-14 06:03:39,957][100936] Updated weights for policy 0, policy_version 24760 (0.0009) +[2023-10-14 06:03:42,731][100917] Updated weights for policy 1, policy_version 24742 (0.0008) +[2023-10-14 06:03:43,112][100917] Updated weights for policy 1, policy_version 24752 (0.0007) +[2023-10-14 06:03:43,477][100917] Updated weights for policy 1, policy_version 24762 (0.0010) +[2023-10-14 06:03:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 50692096. Throughput: 0: 1677.5, 1: 1654.1. Samples: 12683680. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-10-14 06:03:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:03:44,211][100936] Updated weights for policy 0, policy_version 24770 (0.0008) +[2023-10-14 06:03:44,588][100936] Updated weights for policy 0, policy_version 24780 (0.0007) +[2023-10-14 06:03:44,953][100936] Updated weights for policy 0, policy_version 24790 (0.0009) +[2023-10-14 06:03:45,331][100936] Updated weights for policy 0, policy_version 24800 (0.0007) +[2023-10-14 06:03:47,600][100917] Updated weights for policy 1, policy_version 24772 (0.0007) +[2023-10-14 06:03:47,968][100917] Updated weights for policy 1, policy_version 24782 (0.0009) +[2023-10-14 06:03:48,335][100917] Updated weights for policy 1, policy_version 24792 (0.0008) +[2023-10-14 06:03:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 50757632. Throughput: 0: 1681.3, 1: 1642.1. Samples: 12703834. Policy #0 lag: (min: 7.0, avg: 15.0, max: 39.0) +[2023-10-14 06:03:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:03:49,416][100936] Updated weights for policy 0, policy_version 24810 (0.0010) +[2023-10-14 06:03:49,796][100936] Updated weights for policy 0, policy_version 24820 (0.0009) +[2023-10-14 06:03:50,166][100936] Updated weights for policy 0, policy_version 24830 (0.0008) +[2023-10-14 06:03:52,466][100917] Updated weights for policy 1, policy_version 24802 (0.0008) +[2023-10-14 06:03:52,839][100917] Updated weights for policy 1, policy_version 24812 (0.0011) +[2023-10-14 06:03:53,214][100917] Updated weights for policy 1, policy_version 24822 (0.0010) +[2023-10-14 06:03:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 50823168. Throughput: 0: 1680.6, 1: 1641.3. Samples: 12723658. Policy #0 lag: (min: 7.0, avg: 15.0, max: 39.0) +[2023-10-14 06:03:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:03:53,588][100917] Updated weights for policy 1, policy_version 24832 (0.0010) +[2023-10-14 06:03:54,289][100936] Updated weights for policy 0, policy_version 24840 (0.0009) +[2023-10-14 06:03:54,667][100936] Updated weights for policy 0, policy_version 24850 (0.0009) +[2023-10-14 06:03:55,033][100936] Updated weights for policy 0, policy_version 24860 (0.0008) +[2023-10-14 06:03:57,847][100917] Updated weights for policy 1, policy_version 24842 (0.0008) +[2023-10-14 06:03:58,217][100917] Updated weights for policy 1, policy_version 24852 (0.0007) +[2023-10-14 06:03:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 50888704. Throughput: 0: 1684.5, 1: 1646.0. Samples: 12733148. Policy #0 lag: (min: 7.0, avg: 15.0, max: 39.0) +[2023-10-14 06:03:58,513][99942] Avg episode reward: [(0, '0.330'), (1, '1.000')] +[2023-10-14 06:03:58,602][100917] Updated weights for policy 1, policy_version 24862 (0.0007) +[2023-10-14 06:03:59,321][100936] Updated weights for policy 0, policy_version 24870 (0.0008) +[2023-10-14 06:03:59,715][100936] Updated weights for policy 0, policy_version 24880 (0.0008) +[2023-10-14 06:04:00,080][100936] Updated weights for policy 0, policy_version 24890 (0.0008) +[2023-10-14 06:04:02,697][100917] Updated weights for policy 1, policy_version 24872 (0.0010) +[2023-10-14 06:04:03,071][100917] Updated weights for policy 1, policy_version 24882 (0.0008) +[2023-10-14 06:04:03,436][100917] Updated weights for policy 1, policy_version 24892 (0.0010) +[2023-10-14 06:04:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 50954240. Throughput: 0: 1676.9, 1: 1650.4. Samples: 12753498. Policy #0 lag: (min: 7.0, avg: 15.0, max: 39.0) +[2023-10-14 06:04:03,513][99942] Avg episode reward: [(0, '0.330'), (1, '1.000')] +[2023-10-14 06:04:03,939][100936] Updated weights for policy 0, policy_version 24900 (0.0008) +[2023-10-14 06:04:04,319][100936] Updated weights for policy 0, policy_version 24910 (0.0008) +[2023-10-14 06:04:04,689][100936] Updated weights for policy 0, policy_version 24920 (0.0007) +[2023-10-14 06:04:07,435][100917] Updated weights for policy 1, policy_version 24902 (0.0009) +[2023-10-14 06:04:07,807][100917] Updated weights for policy 1, policy_version 24912 (0.0008) +[2023-10-14 06:04:08,188][100917] Updated weights for policy 1, policy_version 24922 (0.0008) +[2023-10-14 06:04:08,512][99942] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 51052544. Throughput: 0: 1671.6, 1: 1649.3. Samples: 12773480. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 06:04:08,513][99942] Avg episode reward: [(0, '0.330'), (1, '1.000')] +[2023-10-14 06:04:08,807][100936] Updated weights for policy 0, policy_version 24930 (0.0007) +[2023-10-14 06:04:09,189][100936] Updated weights for policy 0, policy_version 24940 (0.0009) +[2023-10-14 06:04:09,560][100936] Updated weights for policy 0, policy_version 24950 (0.0009) +[2023-10-14 06:04:09,924][100936] Updated weights for policy 0, policy_version 24960 (0.0008) +[2023-10-14 06:04:12,282][100917] Updated weights for policy 1, policy_version 24932 (0.0007) +[2023-10-14 06:04:12,661][100917] Updated weights for policy 1, policy_version 24942 (0.0007) +[2023-10-14 06:04:13,036][100917] Updated weights for policy 1, policy_version 24952 (0.0007) +[2023-10-14 06:04:13,512][99942] Fps is (10 sec: 16384.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 51118080. Throughput: 0: 1673.6, 1: 1659.8. Samples: 12783420. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 06:04:13,512][99942] Avg episode reward: [(0, '0.330'), (1, '1.000')] +[2023-10-14 06:04:13,815][100936] Updated weights for policy 0, policy_version 24970 (0.0009) +[2023-10-14 06:04:14,183][100936] Updated weights for policy 0, policy_version 24980 (0.0008) +[2023-10-14 06:04:14,558][100936] Updated weights for policy 0, policy_version 24990 (0.0008) +[2023-10-14 06:04:17,178][100917] Updated weights for policy 1, policy_version 24962 (0.0009) +[2023-10-14 06:04:17,547][100917] Updated weights for policy 1, policy_version 24972 (0.0008) +[2023-10-14 06:04:17,933][100917] Updated weights for policy 1, policy_version 24982 (0.0007) +[2023-10-14 06:04:18,304][100917] Updated weights for policy 1, policy_version 24992 (0.0008) +[2023-10-14 06:04:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 51183616. Throughput: 0: 1669.7, 1: 1662.1. Samples: 12803900. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 06:04:18,513][99942] Avg episode reward: [(0, '0.330'), (1, '1.000')] +[2023-10-14 06:04:18,696][100936] Updated weights for policy 0, policy_version 25000 (0.0008) +[2023-10-14 06:04:19,061][100936] Updated weights for policy 0, policy_version 25010 (0.0008) +[2023-10-14 06:04:19,424][100936] Updated weights for policy 0, policy_version 25020 (0.0008) +[2023-10-14 06:04:22,360][100917] Updated weights for policy 1, policy_version 25002 (0.0008) +[2023-10-14 06:04:22,745][100917] Updated weights for policy 1, policy_version 25012 (0.0008) +[2023-10-14 06:04:23,116][100917] Updated weights for policy 1, policy_version 25022 (0.0008) +[2023-10-14 06:04:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 51249152. Throughput: 0: 1659.6, 1: 1652.7. Samples: 12823304. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 06:04:23,513][99942] Avg episode reward: [(0, '0.330'), (1, '1.000')] +[2023-10-14 06:04:23,524][100936] Updated weights for policy 0, policy_version 25030 (0.0008) +[2023-10-14 06:04:23,895][100936] Updated weights for policy 0, policy_version 25040 (0.0009) +[2023-10-14 06:04:24,262][100936] Updated weights for policy 0, policy_version 25050 (0.0008) +[2023-10-14 06:04:27,289][100917] Updated weights for policy 1, policy_version 25032 (0.0009) +[2023-10-14 06:04:27,670][100917] Updated weights for policy 1, policy_version 25042 (0.0009) +[2023-10-14 06:04:28,047][100917] Updated weights for policy 1, policy_version 25052 (0.0008) +[2023-10-14 06:04:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 51314688. Throughput: 0: 1661.6, 1: 1662.6. Samples: 12833270. Policy #0 lag: (min: 1.0, avg: 12.0, max: 33.0) +[2023-10-14 06:04:28,513][99942] Avg episode reward: [(0, '0.330'), (1, '1.000')] +[2023-10-14 06:04:28,528][100936] Updated weights for policy 0, policy_version 25060 (0.0007) +[2023-10-14 06:04:28,897][100936] Updated weights for policy 0, policy_version 25070 (0.0007) +[2023-10-14 06:04:29,266][100936] Updated weights for policy 0, policy_version 25080 (0.0007) +[2023-10-14 06:04:32,162][100917] Updated weights for policy 1, policy_version 25062 (0.0009) +[2023-10-14 06:04:32,538][100917] Updated weights for policy 1, policy_version 25072 (0.0009) +[2023-10-14 06:04:32,909][100917] Updated weights for policy 1, policy_version 25082 (0.0009) +[2023-10-14 06:04:33,274][100936] Updated weights for policy 0, policy_version 25090 (0.0008) +[2023-10-14 06:04:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 51380224. Throughput: 0: 1663.6, 1: 1663.7. Samples: 12853562. Policy #0 lag: (min: 1.0, avg: 12.0, max: 33.0) +[2023-10-14 06:04:33,512][99942] Avg episode reward: [(0, '0.310'), (1, '1.000')] +[2023-10-14 06:04:33,641][100936] Updated weights for policy 0, policy_version 25100 (0.0009) +[2023-10-14 06:04:34,006][100936] Updated weights for policy 0, policy_version 25110 (0.0008) +[2023-10-14 06:04:34,380][100936] Updated weights for policy 0, policy_version 25120 (0.0007) +[2023-10-14 06:04:37,043][100917] Updated weights for policy 1, policy_version 25092 (0.0010) +[2023-10-14 06:04:37,428][100917] Updated weights for policy 1, policy_version 25102 (0.0011) +[2023-10-14 06:04:37,797][100917] Updated weights for policy 1, policy_version 25112 (0.0010) +[2023-10-14 06:04:38,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 51445760. Throughput: 0: 1662.1, 1: 1652.7. Samples: 12872826. Policy #0 lag: (min: 1.0, avg: 12.0, max: 33.0) +[2023-10-14 06:04:38,512][99942] Avg episode reward: [(0, '0.310'), (1, '1.000')] +[2023-10-14 06:04:38,608][100936] Updated weights for policy 0, policy_version 25130 (0.0008) +[2023-10-14 06:04:38,979][100936] Updated weights for policy 0, policy_version 25140 (0.0009) +[2023-10-14 06:04:39,356][100936] Updated weights for policy 0, policy_version 25150 (0.0009) +[2023-10-14 06:04:41,931][100917] Updated weights for policy 1, policy_version 25122 (0.0010) +[2023-10-14 06:04:42,309][100917] Updated weights for policy 1, policy_version 25132 (0.0009) +[2023-10-14 06:04:42,682][100917] Updated weights for policy 1, policy_version 25142 (0.0010) +[2023-10-14 06:04:43,053][100917] Updated weights for policy 1, policy_version 25152 (0.0010) +[2023-10-14 06:04:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 51511296. Throughput: 0: 1666.9, 1: 1666.4. Samples: 12883144. Policy #0 lag: (min: 1.0, avg: 12.0, max: 33.0) +[2023-10-14 06:04:43,513][99942] Avg episode reward: [(0, '0.310'), (1, '1.000')] +[2023-10-14 06:04:43,520][100936] Updated weights for policy 0, policy_version 25160 (0.0007) +[2023-10-14 06:04:43,899][100936] Updated weights for policy 0, policy_version 25170 (0.0008) +[2023-10-14 06:04:44,270][100936] Updated weights for policy 0, policy_version 25180 (0.0009) +[2023-10-14 06:04:47,258][100917] Updated weights for policy 1, policy_version 25162 (0.0009) +[2023-10-14 06:04:47,624][100917] Updated weights for policy 1, policy_version 25172 (0.0009) +[2023-10-14 06:04:48,001][100917] Updated weights for policy 1, policy_version 25182 (0.0009) +[2023-10-14 06:04:48,372][100936] Updated weights for policy 0, policy_version 25190 (0.0008) +[2023-10-14 06:04:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 51576832. Throughput: 0: 1672.2, 1: 1662.6. Samples: 12903562. Policy #0 lag: (min: 31.0, avg: 31.9, max: 52.0) +[2023-10-14 06:04:48,513][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:04:48,753][100936] Updated weights for policy 0, policy_version 25200 (0.0007) +[2023-10-14 06:04:49,135][100936] Updated weights for policy 0, policy_version 25210 (0.0008) +[2023-10-14 06:04:52,108][100917] Updated weights for policy 1, policy_version 25192 (0.0008) +[2023-10-14 06:04:52,479][100917] Updated weights for policy 1, policy_version 25202 (0.0007) +[2023-10-14 06:04:52,858][100917] Updated weights for policy 1, policy_version 25212 (0.0007) +[2023-10-14 06:04:53,177][100936] Updated weights for policy 0, policy_version 25220 (0.0009) +[2023-10-14 06:04:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 51642368. Throughput: 0: 1660.8, 1: 1649.7. Samples: 12922450. Policy #0 lag: (min: 31.0, avg: 31.9, max: 52.0) +[2023-10-14 06:04:53,513][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:04:53,548][100936] Updated weights for policy 0, policy_version 25230 (0.0009) +[2023-10-14 06:04:53,923][100936] Updated weights for policy 0, policy_version 25240 (0.0008) +[2023-10-14 06:04:56,999][100917] Updated weights for policy 1, policy_version 25222 (0.0008) +[2023-10-14 06:04:57,376][100917] Updated weights for policy 1, policy_version 25232 (0.0008) +[2023-10-14 06:04:57,744][100917] Updated weights for policy 1, policy_version 25242 (0.0009) +[2023-10-14 06:04:58,025][100936] Updated weights for policy 0, policy_version 25250 (0.0008) +[2023-10-14 06:04:58,391][100936] Updated weights for policy 0, policy_version 25260 (0.0008) +[2023-10-14 06:04:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 51707904. Throughput: 0: 1669.5, 1: 1654.8. Samples: 12933016. Policy #0 lag: (min: 31.0, avg: 31.9, max: 52.0) +[2023-10-14 06:04:58,512][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:04:58,775][100936] Updated weights for policy 0, policy_version 25270 (0.0008) +[2023-10-14 06:04:59,137][100936] Updated weights for policy 0, policy_version 25280 (0.0008) +[2023-10-14 06:05:01,810][100917] Updated weights for policy 1, policy_version 25252 (0.0008) +[2023-10-14 06:05:02,172][100917] Updated weights for policy 1, policy_version 25262 (0.0010) +[2023-10-14 06:05:02,542][100917] Updated weights for policy 1, policy_version 25272 (0.0007) +[2023-10-14 06:05:03,267][100936] Updated weights for policy 0, policy_version 25290 (0.0009) +[2023-10-14 06:05:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 51773440. Throughput: 0: 1668.0, 1: 1647.1. Samples: 12953076. Policy #0 lag: (min: 31.0, avg: 31.9, max: 52.0) +[2023-10-14 06:05:03,513][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:05:03,648][100936] Updated weights for policy 0, policy_version 25300 (0.0008) +[2023-10-14 06:05:04,010][100936] Updated weights for policy 0, policy_version 25310 (0.0007) +[2023-10-14 06:05:06,636][100917] Updated weights for policy 1, policy_version 25282 (0.0008) +[2023-10-14 06:05:07,016][100917] Updated weights for policy 1, policy_version 25292 (0.0007) +[2023-10-14 06:05:07,385][100917] Updated weights for policy 1, policy_version 25302 (0.0008) +[2023-10-14 06:05:07,752][100917] Updated weights for policy 1, policy_version 25312 (0.0009) +[2023-10-14 06:05:08,084][100936] Updated weights for policy 0, policy_version 25320 (0.0010) +[2023-10-14 06:05:08,452][100936] Updated weights for policy 0, policy_version 25330 (0.0010) +[2023-10-14 06:05:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 51838976. Throughput: 0: 1658.0, 1: 1647.5. Samples: 12972052. Policy #0 lag: (min: 3.0, avg: 5.1, max: 35.0) +[2023-10-14 06:05:08,513][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:05:08,820][100936] Updated weights for policy 0, policy_version 25340 (0.0008) +[2023-10-14 06:05:12,040][100917] Updated weights for policy 1, policy_version 25322 (0.0009) +[2023-10-14 06:05:12,410][100917] Updated weights for policy 1, policy_version 25332 (0.0008) +[2023-10-14 06:05:12,791][100917] Updated weights for policy 1, policy_version 25342 (0.0009) +[2023-10-14 06:05:12,999][100936] Updated weights for policy 0, policy_version 25350 (0.0008) +[2023-10-14 06:05:13,364][100936] Updated weights for policy 0, policy_version 25360 (0.0008) +[2023-10-14 06:05:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 51904512. Throughput: 0: 1671.4, 1: 1652.3. Samples: 12982838. Policy #0 lag: (min: 3.0, avg: 5.1, max: 35.0) +[2023-10-14 06:05:13,512][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:05:13,737][100936] Updated weights for policy 0, policy_version 25370 (0.0008) +[2023-10-14 06:05:16,729][100917] Updated weights for policy 1, policy_version 25352 (0.0009) +[2023-10-14 06:05:17,108][100917] Updated weights for policy 1, policy_version 25362 (0.0009) +[2023-10-14 06:05:17,475][100917] Updated weights for policy 1, policy_version 25372 (0.0008) +[2023-10-14 06:05:17,996][100936] Updated weights for policy 0, policy_version 25380 (0.0008) +[2023-10-14 06:05:18,359][100936] Updated weights for policy 0, policy_version 25390 (0.0009) +[2023-10-14 06:05:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 51970048. Throughput: 0: 1664.9, 1: 1641.6. Samples: 13002354. Policy #0 lag: (min: 3.0, avg: 5.1, max: 35.0) +[2023-10-14 06:05:18,513][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:05:18,730][100936] Updated weights for policy 0, policy_version 25400 (0.0009) +[2023-10-14 06:05:21,558][100917] Updated weights for policy 1, policy_version 25382 (0.0008) +[2023-10-14 06:05:21,934][100917] Updated weights for policy 1, policy_version 25392 (0.0009) +[2023-10-14 06:05:22,298][100917] Updated weights for policy 1, policy_version 25402 (0.0009) +[2023-10-14 06:05:22,819][100936] Updated weights for policy 0, policy_version 25410 (0.0008) +[2023-10-14 06:05:23,188][100936] Updated weights for policy 0, policy_version 25420 (0.0007) +[2023-10-14 06:05:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 52035584. Throughput: 0: 1655.8, 1: 1654.5. Samples: 13021790. Policy #0 lag: (min: 3.0, avg: 5.1, max: 35.0) +[2023-10-14 06:05:23,512][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:05:23,571][100936] Updated weights for policy 0, policy_version 25430 (0.0008) +[2023-10-14 06:05:23,935][100936] Updated weights for policy 0, policy_version 25440 (0.0008) +[2023-10-14 06:05:26,505][100917] Updated weights for policy 1, policy_version 25412 (0.0009) +[2023-10-14 06:05:26,864][100917] Updated weights for policy 1, policy_version 25422 (0.0007) +[2023-10-14 06:05:27,242][100917] Updated weights for policy 1, policy_version 25432 (0.0007) +[2023-10-14 06:05:28,023][100936] Updated weights for policy 0, policy_version 25450 (0.0010) +[2023-10-14 06:05:28,385][100936] Updated weights for policy 0, policy_version 25460 (0.0008) +[2023-10-14 06:05:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 52101120. Throughput: 0: 1666.1, 1: 1655.1. Samples: 13032596. Policy #0 lag: (min: 31.0, avg: 36.3, max: 63.0) +[2023-10-14 06:05:28,512][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:05:28,757][100936] Updated weights for policy 0, policy_version 25470 (0.0008) +[2023-10-14 06:05:31,290][100917] Updated weights for policy 1, policy_version 25442 (0.0008) +[2023-10-14 06:05:31,655][100917] Updated weights for policy 1, policy_version 25452 (0.0010) +[2023-10-14 06:05:32,036][100917] Updated weights for policy 1, policy_version 25462 (0.0009) +[2023-10-14 06:05:32,402][100917] Updated weights for policy 1, policy_version 25472 (0.0009) +[2023-10-14 06:05:32,932][100936] Updated weights for policy 0, policy_version 25480 (0.0008) +[2023-10-14 06:05:33,301][100936] Updated weights for policy 0, policy_version 25490 (0.0007) +[2023-10-14 06:05:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 52166656. Throughput: 0: 1661.4, 1: 1645.0. Samples: 13052350. Policy #0 lag: (min: 31.0, avg: 36.3, max: 63.0) +[2023-10-14 06:05:33,513][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:05:33,673][100936] Updated weights for policy 0, policy_version 25500 (0.0007) +[2023-10-14 06:05:36,491][100917] Updated weights for policy 1, policy_version 25482 (0.0009) +[2023-10-14 06:05:36,864][100917] Updated weights for policy 1, policy_version 25492 (0.0007) +[2023-10-14 06:05:37,240][100917] Updated weights for policy 1, policy_version 25502 (0.0011) +[2023-10-14 06:05:37,804][100936] Updated weights for policy 0, policy_version 25510 (0.0007) +[2023-10-14 06:05:38,173][100936] Updated weights for policy 0, policy_version 25520 (0.0007) +[2023-10-14 06:05:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 52232192. Throughput: 0: 1651.4, 1: 1659.8. Samples: 13071454. Policy #0 lag: (min: 31.0, avg: 36.3, max: 63.0) +[2023-10-14 06:05:38,512][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:05:38,521][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000025504_26116096.pth... +[2023-10-14 06:05:38,545][100936] Updated weights for policy 0, policy_version 25530 (0.0007) +[2023-10-14 06:05:38,560][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000023968_24543232.pth +[2023-10-14 06:05:38,765][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000025536_26148864.pth... +[2023-10-14 06:05:38,803][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000023968_24543232.pth +[2023-10-14 06:05:41,275][100917] Updated weights for policy 1, policy_version 25512 (0.0010) +[2023-10-14 06:05:41,656][100917] Updated weights for policy 1, policy_version 25522 (0.0009) +[2023-10-14 06:05:42,031][100917] Updated weights for policy 1, policy_version 25532 (0.0008) +[2023-10-14 06:05:42,612][100936] Updated weights for policy 0, policy_version 25540 (0.0010) +[2023-10-14 06:05:42,975][100936] Updated weights for policy 0, policy_version 25550 (0.0008) +[2023-10-14 06:05:43,349][100936] Updated weights for policy 0, policy_version 25560 (0.0007) +[2023-10-14 06:05:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 52297728. Throughput: 0: 1655.5, 1: 1664.0. Samples: 13082392. Policy #0 lag: (min: 31.0, avg: 36.3, max: 63.0) +[2023-10-14 06:05:43,513][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:05:46,250][100917] Updated weights for policy 1, policy_version 25542 (0.0010) +[2023-10-14 06:05:46,624][100917] Updated weights for policy 1, policy_version 25552 (0.0008) +[2023-10-14 06:05:46,993][100917] Updated weights for policy 1, policy_version 25562 (0.0007) +[2023-10-14 06:05:47,488][100936] Updated weights for policy 0, policy_version 25570 (0.0008) +[2023-10-14 06:05:47,846][100936] Updated weights for policy 0, policy_version 25580 (0.0009) +[2023-10-14 06:05:48,221][100936] Updated weights for policy 0, policy_version 25590 (0.0007) +[2023-10-14 06:05:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 52363264. Throughput: 0: 1654.0, 1: 1651.4. Samples: 13101822. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 06:05:48,513][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:05:48,589][100936] Updated weights for policy 0, policy_version 25600 (0.0009) +[2023-10-14 06:05:51,089][100917] Updated weights for policy 1, policy_version 25572 (0.0007) +[2023-10-14 06:05:51,466][100917] Updated weights for policy 1, policy_version 25582 (0.0009) +[2023-10-14 06:05:51,831][100917] Updated weights for policy 1, policy_version 25592 (0.0010) +[2023-10-14 06:05:52,604][100936] Updated weights for policy 0, policy_version 25610 (0.0007) +[2023-10-14 06:05:52,966][100936] Updated weights for policy 0, policy_version 25620 (0.0009) +[2023-10-14 06:05:53,336][100936] Updated weights for policy 0, policy_version 25630 (0.0009) +[2023-10-14 06:05:53,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 52461568. Throughput: 0: 1643.5, 1: 1660.8. Samples: 13120744. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 06:05:53,513][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:05:56,036][100917] Updated weights for policy 1, policy_version 25602 (0.0010) +[2023-10-14 06:05:56,411][100917] Updated weights for policy 1, policy_version 25612 (0.0007) +[2023-10-14 06:05:56,781][100917] Updated weights for policy 1, policy_version 25622 (0.0007) +[2023-10-14 06:05:57,155][100917] Updated weights for policy 1, policy_version 25632 (0.0007) +[2023-10-14 06:05:57,494][100936] Updated weights for policy 0, policy_version 25640 (0.0007) +[2023-10-14 06:05:57,864][100936] Updated weights for policy 0, policy_version 25650 (0.0008) +[2023-10-14 06:05:58,240][100936] Updated weights for policy 0, policy_version 25660 (0.0008) +[2023-10-14 06:05:58,512][99942] Fps is (10 sec: 16384.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 52527104. Throughput: 0: 1654.2, 1: 1661.6. Samples: 13132050. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 06:05:58,512][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:06:01,256][100917] Updated weights for policy 1, policy_version 25642 (0.0007) +[2023-10-14 06:06:01,632][100917] Updated weights for policy 1, policy_version 25652 (0.0010) +[2023-10-14 06:06:02,011][100917] Updated weights for policy 1, policy_version 25662 (0.0007) +[2023-10-14 06:06:02,388][100936] Updated weights for policy 0, policy_version 25670 (0.0008) +[2023-10-14 06:06:02,759][100936] Updated weights for policy 0, policy_version 25680 (0.0007) +[2023-10-14 06:06:03,135][100936] Updated weights for policy 0, policy_version 25690 (0.0008) +[2023-10-14 06:06:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 52592640. Throughput: 0: 1655.2, 1: 1651.4. Samples: 13151150. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:06:03,513][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:06:06,122][100917] Updated weights for policy 1, policy_version 25672 (0.0009) +[2023-10-14 06:06:06,485][100917] Updated weights for policy 1, policy_version 25682 (0.0008) +[2023-10-14 06:06:06,858][100917] Updated weights for policy 1, policy_version 25692 (0.0009) +[2023-10-14 06:06:07,441][100936] Updated weights for policy 0, policy_version 25700 (0.0007) +[2023-10-14 06:06:07,808][100936] Updated weights for policy 0, policy_version 25710 (0.0009) +[2023-10-14 06:06:08,184][100936] Updated weights for policy 0, policy_version 25720 (0.0008) +[2023-10-14 06:06:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 52658176. Throughput: 0: 1646.6, 1: 1658.6. Samples: 13170522. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:06:08,512][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:06:11,009][100917] Updated weights for policy 1, policy_version 25702 (0.0008) +[2023-10-14 06:06:11,377][100917] Updated weights for policy 1, policy_version 25712 (0.0009) +[2023-10-14 06:06:11,752][100917] Updated weights for policy 1, policy_version 25722 (0.0011) +[2023-10-14 06:06:12,215][100936] Updated weights for policy 0, policy_version 25730 (0.0010) +[2023-10-14 06:06:12,591][100936] Updated weights for policy 0, policy_version 25740 (0.0009) +[2023-10-14 06:06:12,970][100936] Updated weights for policy 0, policy_version 25750 (0.0009) +[2023-10-14 06:06:13,335][100936] Updated weights for policy 0, policy_version 25760 (0.0008) +[2023-10-14 06:06:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 52723712. Throughput: 0: 1655.5, 1: 1654.7. Samples: 13181554. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:06:13,512][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:06:15,917][100917] Updated weights for policy 1, policy_version 25732 (0.0008) +[2023-10-14 06:06:16,281][100917] Updated weights for policy 1, policy_version 25742 (0.0011) +[2023-10-14 06:06:16,662][100917] Updated weights for policy 1, policy_version 25752 (0.0010) +[2023-10-14 06:06:17,684][100936] Updated weights for policy 0, policy_version 25770 (0.0008) +[2023-10-14 06:06:18,058][100936] Updated weights for policy 0, policy_version 25780 (0.0009) +[2023-10-14 06:06:18,421][100936] Updated weights for policy 0, policy_version 25790 (0.0007) +[2023-10-14 06:06:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 52789248. Throughput: 0: 1648.8, 1: 1646.0. Samples: 13200618. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:06:18,513][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:06:20,767][100917] Updated weights for policy 1, policy_version 25762 (0.0008) +[2023-10-14 06:06:21,144][100917] Updated weights for policy 1, policy_version 25772 (0.0008) +[2023-10-14 06:06:21,517][100917] Updated weights for policy 1, policy_version 25782 (0.0011) +[2023-10-14 06:06:21,893][100917] Updated weights for policy 1, policy_version 25792 (0.0007) +[2023-10-14 06:06:22,689][100936] Updated weights for policy 0, policy_version 25800 (0.0008) +[2023-10-14 06:06:23,070][100936] Updated weights for policy 0, policy_version 25810 (0.0008) +[2023-10-14 06:06:23,432][100936] Updated weights for policy 0, policy_version 25820 (0.0008) +[2023-10-14 06:06:23,512][99942] Fps is (10 sec: 9830.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 52822016. Throughput: 0: 1643.2, 1: 1658.6. Samples: 13220034. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:06:23,512][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:06:25,965][100917] Updated weights for policy 1, policy_version 25802 (0.0009) +[2023-10-14 06:06:26,341][100917] Updated weights for policy 1, policy_version 25812 (0.0008) +[2023-10-14 06:06:26,718][100917] Updated weights for policy 1, policy_version 25822 (0.0011) +[2023-10-14 06:06:27,517][100936] Updated weights for policy 0, policy_version 25830 (0.0009) +[2023-10-14 06:06:27,879][100936] Updated weights for policy 0, policy_version 25840 (0.0010) +[2023-10-14 06:06:28,247][100936] Updated weights for policy 0, policy_version 25850 (0.0010) +[2023-10-14 06:06:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 52920320. Throughput: 0: 1653.3, 1: 1648.9. Samples: 13230992. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:06:28,513][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:06:30,906][100917] Updated weights for policy 1, policy_version 25832 (0.0010) +[2023-10-14 06:06:31,276][100917] Updated weights for policy 1, policy_version 25842 (0.0008) +[2023-10-14 06:06:31,641][100917] Updated weights for policy 1, policy_version 25852 (0.0010) +[2023-10-14 06:06:32,476][100936] Updated weights for policy 0, policy_version 25860 (0.0010) +[2023-10-14 06:06:32,850][100936] Updated weights for policy 0, policy_version 25870 (0.0011) +[2023-10-14 06:06:33,225][100936] Updated weights for policy 0, policy_version 25880 (0.0010) +[2023-10-14 06:06:33,512][99942] Fps is (10 sec: 16383.5, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 52985856. Throughput: 0: 1645.0, 1: 1648.9. Samples: 13250050. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:06:33,514][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:06:35,605][100917] Updated weights for policy 1, policy_version 25862 (0.0008) +[2023-10-14 06:06:35,994][100917] Updated weights for policy 1, policy_version 25872 (0.0007) +[2023-10-14 06:06:36,356][100917] Updated weights for policy 1, policy_version 25882 (0.0008) +[2023-10-14 06:06:37,382][100936] Updated weights for policy 0, policy_version 25890 (0.0011) +[2023-10-14 06:06:37,752][100936] Updated weights for policy 0, policy_version 25900 (0.0009) +[2023-10-14 06:06:38,121][100936] Updated weights for policy 0, policy_version 25910 (0.0010) +[2023-10-14 06:06:38,489][100936] Updated weights for policy 0, policy_version 25920 (0.0010) +[2023-10-14 06:06:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 53051392. Throughput: 0: 1643.9, 1: 1664.5. Samples: 13269622. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:06:38,513][99942] Avg episode reward: [(0, '0.310'), (1, '0.800')] +[2023-10-14 06:06:40,584][100917] Updated weights for policy 1, policy_version 25892 (0.0009) +[2023-10-14 06:06:40,958][100917] Updated weights for policy 1, policy_version 25902 (0.0007) +[2023-10-14 06:06:41,322][100917] Updated weights for policy 1, policy_version 25912 (0.0008) +[2023-10-14 06:06:42,595][100936] Updated weights for policy 0, policy_version 25930 (0.0009) +[2023-10-14 06:06:42,970][100936] Updated weights for policy 0, policy_version 25940 (0.0011) +[2023-10-14 06:06:43,333][100936] Updated weights for policy 0, policy_version 25950 (0.0009) +[2023-10-14 06:06:43,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 53116928. Throughput: 0: 1642.8, 1: 1653.3. Samples: 13280374. Policy #0 lag: (min: 31.0, avg: 32.8, max: 53.0) +[2023-10-14 06:06:43,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.800')] +[2023-10-14 06:06:45,543][100917] Updated weights for policy 1, policy_version 25922 (0.0007) +[2023-10-14 06:06:45,916][100917] Updated weights for policy 1, policy_version 25932 (0.0007) +[2023-10-14 06:06:46,288][100917] Updated weights for policy 1, policy_version 25942 (0.0007) +[2023-10-14 06:06:46,654][100917] Updated weights for policy 1, policy_version 25952 (0.0009) +[2023-10-14 06:06:47,558][100936] Updated weights for policy 0, policy_version 25960 (0.0008) +[2023-10-14 06:06:47,927][100936] Updated weights for policy 0, policy_version 25970 (0.0008) +[2023-10-14 06:06:48,293][100936] Updated weights for policy 0, policy_version 25980 (0.0009) +[2023-10-14 06:06:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 53182464. Throughput: 0: 1639.9, 1: 1660.6. Samples: 13299670. Policy #0 lag: (min: 31.0, avg: 32.8, max: 53.0) +[2023-10-14 06:06:48,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.800')] +[2023-10-14 06:06:50,830][100917] Updated weights for policy 1, policy_version 25962 (0.0008) +[2023-10-14 06:06:51,200][100917] Updated weights for policy 1, policy_version 25972 (0.0009) +[2023-10-14 06:06:51,577][100917] Updated weights for policy 1, policy_version 25982 (0.0007) +[2023-10-14 06:06:52,439][100936] Updated weights for policy 0, policy_version 25990 (0.0008) +[2023-10-14 06:06:52,805][100936] Updated weights for policy 0, policy_version 26000 (0.0007) +[2023-10-14 06:06:53,179][100936] Updated weights for policy 0, policy_version 26010 (0.0009) +[2023-10-14 06:06:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 53248000. Throughput: 0: 1640.4, 1: 1661.1. Samples: 13319088. Policy #0 lag: (min: 31.0, avg: 32.8, max: 53.0) +[2023-10-14 06:06:53,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.800')] +[2023-10-14 06:06:55,739][100917] Updated weights for policy 1, policy_version 25992 (0.0010) +[2023-10-14 06:06:56,112][100917] Updated weights for policy 1, policy_version 26002 (0.0009) +[2023-10-14 06:06:56,485][100917] Updated weights for policy 1, policy_version 26012 (0.0009) +[2023-10-14 06:06:57,307][100936] Updated weights for policy 0, policy_version 26020 (0.0007) +[2023-10-14 06:06:57,679][100936] Updated weights for policy 0, policy_version 26030 (0.0008) +[2023-10-14 06:06:58,045][100936] Updated weights for policy 0, policy_version 26040 (0.0008) +[2023-10-14 06:06:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 53313536. Throughput: 0: 1642.7, 1: 1653.4. Samples: 13329882. Policy #0 lag: (min: 31.0, avg: 32.8, max: 53.0) +[2023-10-14 06:06:58,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.800')] +[2023-10-14 06:07:00,614][100917] Updated weights for policy 1, policy_version 26022 (0.0010) +[2023-10-14 06:07:00,980][100917] Updated weights for policy 1, policy_version 26032 (0.0009) +[2023-10-14 06:07:01,342][100917] Updated weights for policy 1, policy_version 26042 (0.0007) +[2023-10-14 06:07:02,171][100936] Updated weights for policy 0, policy_version 26050 (0.0008) +[2023-10-14 06:07:02,536][100936] Updated weights for policy 0, policy_version 26060 (0.0008) +[2023-10-14 06:07:02,911][100936] Updated weights for policy 0, policy_version 26070 (0.0007) +[2023-10-14 06:07:03,284][100936] Updated weights for policy 0, policy_version 26080 (0.0008) +[2023-10-14 06:07:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 53379072. Throughput: 0: 1642.6, 1: 1660.3. Samples: 13349250. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 06:07:03,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.800')] +[2023-10-14 06:07:05,331][100917] Updated weights for policy 1, policy_version 26052 (0.0007) +[2023-10-14 06:07:05,709][100917] Updated weights for policy 1, policy_version 26062 (0.0008) +[2023-10-14 06:07:06,075][100917] Updated weights for policy 1, policy_version 26072 (0.0011) +[2023-10-14 06:07:07,484][100936] Updated weights for policy 0, policy_version 26090 (0.0010) +[2023-10-14 06:07:07,858][100936] Updated weights for policy 0, policy_version 26100 (0.0008) +[2023-10-14 06:07:08,235][100936] Updated weights for policy 0, policy_version 26110 (0.0010) +[2023-10-14 06:07:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 53444608. Throughput: 0: 1641.7, 1: 1662.0. Samples: 13368698. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 06:07:08,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.800')] +[2023-10-14 06:07:10,083][100917] Updated weights for policy 1, policy_version 26082 (0.0009) +[2023-10-14 06:07:10,462][100917] Updated weights for policy 1, policy_version 26092 (0.0007) +[2023-10-14 06:07:10,833][100917] Updated weights for policy 1, policy_version 26102 (0.0008) +[2023-10-14 06:07:11,196][100917] Updated weights for policy 1, policy_version 26112 (0.0008) +[2023-10-14 06:07:12,247][100936] Updated weights for policy 0, policy_version 26120 (0.0009) +[2023-10-14 06:07:12,616][100936] Updated weights for policy 0, policy_version 26130 (0.0008) +[2023-10-14 06:07:12,983][100936] Updated weights for policy 0, policy_version 26140 (0.0007) +[2023-10-14 06:07:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 53510144. Throughput: 0: 1648.4, 1: 1650.6. Samples: 13379448. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 06:07:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.800')] +[2023-10-14 06:07:15,360][100917] Updated weights for policy 1, policy_version 26122 (0.0008) +[2023-10-14 06:07:15,743][100917] Updated weights for policy 1, policy_version 26132 (0.0009) +[2023-10-14 06:07:16,109][100917] Updated weights for policy 1, policy_version 26142 (0.0011) +[2023-10-14 06:07:17,152][100936] Updated weights for policy 0, policy_version 26150 (0.0010) +[2023-10-14 06:07:17,523][100936] Updated weights for policy 0, policy_version 26160 (0.0010) +[2023-10-14 06:07:17,888][100936] Updated weights for policy 0, policy_version 26170 (0.0008) +[2023-10-14 06:07:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 53575680. Throughput: 0: 1644.6, 1: 1661.1. Samples: 13398808. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 06:07:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.800')] +[2023-10-14 06:07:20,193][100917] Updated weights for policy 1, policy_version 26152 (0.0009) +[2023-10-14 06:07:20,565][100917] Updated weights for policy 1, policy_version 26162 (0.0009) +[2023-10-14 06:07:20,941][100917] Updated weights for policy 1, policy_version 26172 (0.0008) +[2023-10-14 06:07:22,043][100936] Updated weights for policy 0, policy_version 26180 (0.0008) +[2023-10-14 06:07:22,421][100936] Updated weights for policy 0, policy_version 26190 (0.0009) +[2023-10-14 06:07:22,791][100936] Updated weights for policy 0, policy_version 26200 (0.0009) +[2023-10-14 06:07:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 53641216. Throughput: 0: 1648.0, 1: 1659.7. Samples: 13418472. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:07:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.800')] +[2023-10-14 06:07:25,080][100917] Updated weights for policy 1, policy_version 26182 (0.0007) +[2023-10-14 06:07:25,443][100917] Updated weights for policy 1, policy_version 26192 (0.0008) +[2023-10-14 06:07:25,810][100917] Updated weights for policy 1, policy_version 26202 (0.0010) +[2023-10-14 06:07:26,904][100936] Updated weights for policy 0, policy_version 26210 (0.0007) +[2023-10-14 06:07:27,278][100936] Updated weights for policy 0, policy_version 26220 (0.0007) +[2023-10-14 06:07:27,647][100936] Updated weights for policy 0, policy_version 26230 (0.0010) +[2023-10-14 06:07:28,012][100936] Updated weights for policy 0, policy_version 26240 (0.0007) +[2023-10-14 06:07:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 53706752. Throughput: 0: 1654.9, 1: 1647.5. Samples: 13428980. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:07:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.800')] +[2023-10-14 06:07:29,949][100917] Updated weights for policy 1, policy_version 26212 (0.0009) +[2023-10-14 06:07:30,309][100917] Updated weights for policy 1, policy_version 26222 (0.0008) +[2023-10-14 06:07:30,680][100917] Updated weights for policy 1, policy_version 26232 (0.0009) +[2023-10-14 06:07:32,213][100936] Updated weights for policy 0, policy_version 26250 (0.0009) +[2023-10-14 06:07:32,590][100936] Updated weights for policy 0, policy_version 26260 (0.0008) +[2023-10-14 06:07:32,969][100936] Updated weights for policy 0, policy_version 26270 (0.0009) +[2023-10-14 06:07:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 53772288. Throughput: 0: 1648.5, 1: 1660.1. Samples: 13448556. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:07:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:07:34,964][100917] Updated weights for policy 1, policy_version 26242 (0.0009) +[2023-10-14 06:07:35,330][100917] Updated weights for policy 1, policy_version 26252 (0.0008) +[2023-10-14 06:07:35,704][100917] Updated weights for policy 1, policy_version 26262 (0.0008) +[2023-10-14 06:07:36,074][100917] Updated weights for policy 1, policy_version 26272 (0.0009) +[2023-10-14 06:07:37,010][100936] Updated weights for policy 0, policy_version 26280 (0.0007) +[2023-10-14 06:07:37,377][100936] Updated weights for policy 0, policy_version 26290 (0.0007) +[2023-10-14 06:07:37,737][100936] Updated weights for policy 0, policy_version 26300 (0.0007) +[2023-10-14 06:07:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 53837824. Throughput: 0: 1654.3, 1: 1664.2. Samples: 13468422. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:07:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:07:38,523][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000026304_26935296.pth... +[2023-10-14 06:07:38,524][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000026272_26902528.pth... +[2023-10-14 06:07:38,554][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000024736_25329664.pth +[2023-10-14 06:07:38,554][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000024736_25329664.pth +[2023-10-14 06:07:40,370][100917] Updated weights for policy 1, policy_version 26282 (0.0007) +[2023-10-14 06:07:40,742][100917] Updated weights for policy 1, policy_version 26292 (0.0007) +[2023-10-14 06:07:41,110][100917] Updated weights for policy 1, policy_version 26302 (0.0009) +[2023-10-14 06:07:41,856][100936] Updated weights for policy 0, policy_version 26310 (0.0007) +[2023-10-14 06:07:42,225][100936] Updated weights for policy 0, policy_version 26320 (0.0007) +[2023-10-14 06:07:42,586][100936] Updated weights for policy 0, policy_version 26330 (0.0008) +[2023-10-14 06:07:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 53903360. Throughput: 0: 1659.2, 1: 1651.8. Samples: 13478876. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) +[2023-10-14 06:07:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:07:45,136][100917] Updated weights for policy 1, policy_version 26312 (0.0008) +[2023-10-14 06:07:45,506][100917] Updated weights for policy 1, policy_version 26322 (0.0010) +[2023-10-14 06:07:45,887][100917] Updated weights for policy 1, policy_version 26332 (0.0008) +[2023-10-14 06:07:46,574][100936] Updated weights for policy 0, policy_version 26340 (0.0008) +[2023-10-14 06:07:46,947][100936] Updated weights for policy 0, policy_version 26350 (0.0008) +[2023-10-14 06:07:47,316][100936] Updated weights for policy 0, policy_version 26360 (0.0007) +[2023-10-14 06:07:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 53968896. Throughput: 0: 1643.3, 1: 1664.3. Samples: 13498092. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) +[2023-10-14 06:07:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:07:49,976][100917] Updated weights for policy 1, policy_version 26342 (0.0010) +[2023-10-14 06:07:50,345][100917] Updated weights for policy 1, policy_version 26352 (0.0008) +[2023-10-14 06:07:50,718][100917] Updated weights for policy 1, policy_version 26362 (0.0008) +[2023-10-14 06:07:51,451][100936] Updated weights for policy 0, policy_version 26370 (0.0008) +[2023-10-14 06:07:51,830][100936] Updated weights for policy 0, policy_version 26380 (0.0009) +[2023-10-14 06:07:52,199][100936] Updated weights for policy 0, policy_version 26390 (0.0009) +[2023-10-14 06:07:52,580][100936] Updated weights for policy 0, policy_version 26400 (0.0008) +[2023-10-14 06:07:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 54034432. Throughput: 0: 1660.3, 1: 1660.0. Samples: 13518110. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) +[2023-10-14 06:07:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:07:54,855][100917] Updated weights for policy 1, policy_version 26372 (0.0008) +[2023-10-14 06:07:55,224][100917] Updated weights for policy 1, policy_version 26382 (0.0008) +[2023-10-14 06:07:55,594][100917] Updated weights for policy 1, policy_version 26392 (0.0008) +[2023-10-14 06:07:56,833][100936] Updated weights for policy 0, policy_version 26410 (0.0011) +[2023-10-14 06:07:57,205][100936] Updated weights for policy 0, policy_version 26420 (0.0007) +[2023-10-14 06:07:57,577][100936] Updated weights for policy 0, policy_version 26430 (0.0010) +[2023-10-14 06:07:58,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 54099968. Throughput: 0: 1657.0, 1: 1650.9. Samples: 13528304. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) +[2023-10-14 06:07:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:07:59,612][100917] Updated weights for policy 1, policy_version 26402 (0.0009) +[2023-10-14 06:07:59,977][100917] Updated weights for policy 1, policy_version 26412 (0.0009) +[2023-10-14 06:08:00,349][100917] Updated weights for policy 1, policy_version 26422 (0.0007) +[2023-10-14 06:08:00,726][100917] Updated weights for policy 1, policy_version 26432 (0.0007) +[2023-10-14 06:08:01,609][100936] Updated weights for policy 0, policy_version 26440 (0.0008) +[2023-10-14 06:08:01,983][100936] Updated weights for policy 0, policy_version 26450 (0.0011) +[2023-10-14 06:08:02,365][100936] Updated weights for policy 0, policy_version 26460 (0.0011) +[2023-10-14 06:08:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 54165504. Throughput: 0: 1643.3, 1: 1660.5. Samples: 13547476. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:08:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:08:04,703][100917] Updated weights for policy 1, policy_version 26442 (0.0009) +[2023-10-14 06:08:05,082][100917] Updated weights for policy 1, policy_version 26452 (0.0009) +[2023-10-14 06:08:05,452][100917] Updated weights for policy 1, policy_version 26462 (0.0010) +[2023-10-14 06:08:06,457][100936] Updated weights for policy 0, policy_version 26470 (0.0009) +[2023-10-14 06:08:06,832][100936] Updated weights for policy 0, policy_version 26480 (0.0009) +[2023-10-14 06:08:07,204][100936] Updated weights for policy 0, policy_version 26490 (0.0007) +[2023-10-14 06:08:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 54231040. Throughput: 0: 1663.7, 1: 1658.0. Samples: 13567948. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:08:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:08:09,740][100917] Updated weights for policy 1, policy_version 26472 (0.0009) +[2023-10-14 06:08:10,113][100917] Updated weights for policy 1, policy_version 26482 (0.0009) +[2023-10-14 06:08:10,491][100917] Updated weights for policy 1, policy_version 26492 (0.0010) +[2023-10-14 06:08:11,211][100936] Updated weights for policy 0, policy_version 26500 (0.0007) +[2023-10-14 06:08:11,578][100936] Updated weights for policy 0, policy_version 26510 (0.0009) +[2023-10-14 06:08:11,958][100936] Updated weights for policy 0, policy_version 26520 (0.0009) +[2023-10-14 06:08:13,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 54296576. Throughput: 0: 1657.1, 1: 1650.9. Samples: 13577838. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:08:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:08:14,666][100917] Updated weights for policy 1, policy_version 26502 (0.0010) +[2023-10-14 06:08:15,041][100917] Updated weights for policy 1, policy_version 26512 (0.0009) +[2023-10-14 06:08:15,419][100917] Updated weights for policy 1, policy_version 26522 (0.0009) +[2023-10-14 06:08:16,056][100936] Updated weights for policy 0, policy_version 26530 (0.0008) +[2023-10-14 06:08:16,419][100936] Updated weights for policy 0, policy_version 26540 (0.0007) +[2023-10-14 06:08:16,801][100936] Updated weights for policy 0, policy_version 26550 (0.0010) +[2023-10-14 06:08:17,163][100936] Updated weights for policy 0, policy_version 26560 (0.0008) +[2023-10-14 06:08:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 54362112. Throughput: 0: 1653.8, 1: 1657.0. Samples: 13597542. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:08:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:08:19,500][100917] Updated weights for policy 1, policy_version 26532 (0.0007) +[2023-10-14 06:08:19,873][100917] Updated weights for policy 1, policy_version 26542 (0.0009) +[2023-10-14 06:08:20,239][100917] Updated weights for policy 1, policy_version 26552 (0.0010) +[2023-10-14 06:08:21,205][100936] Updated weights for policy 0, policy_version 26570 (0.0009) +[2023-10-14 06:08:21,580][100936] Updated weights for policy 0, policy_version 26580 (0.0008) +[2023-10-14 06:08:21,941][100936] Updated weights for policy 0, policy_version 26590 (0.0008) +[2023-10-14 06:08:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 54427648. Throughput: 0: 1669.6, 1: 1658.1. Samples: 13618166. Policy #0 lag: (min: 7.0, avg: 11.1, max: 39.0) +[2023-10-14 06:08:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:08:24,166][100917] Updated weights for policy 1, policy_version 26562 (0.0010) +[2023-10-14 06:08:24,544][100917] Updated weights for policy 1, policy_version 26572 (0.0007) +[2023-10-14 06:08:24,911][100917] Updated weights for policy 1, policy_version 26582 (0.0008) +[2023-10-14 06:08:25,286][100917] Updated weights for policy 1, policy_version 26592 (0.0009) +[2023-10-14 06:08:26,044][100936] Updated weights for policy 0, policy_version 26600 (0.0008) +[2023-10-14 06:08:26,419][100936] Updated weights for policy 0, policy_version 26610 (0.0009) +[2023-10-14 06:08:26,781][100936] Updated weights for policy 0, policy_version 26620 (0.0008) +[2023-10-14 06:08:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 54493184. Throughput: 0: 1654.4, 1: 1654.6. Samples: 13627784. Policy #0 lag: (min: 7.0, avg: 11.1, max: 39.0) +[2023-10-14 06:08:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:08:29,559][100917] Updated weights for policy 1, policy_version 26602 (0.0011) +[2023-10-14 06:08:29,939][100917] Updated weights for policy 1, policy_version 26612 (0.0010) +[2023-10-14 06:08:30,316][100917] Updated weights for policy 1, policy_version 26622 (0.0008) +[2023-10-14 06:08:30,938][100936] Updated weights for policy 0, policy_version 26630 (0.0009) +[2023-10-14 06:08:31,310][100936] Updated weights for policy 0, policy_version 26640 (0.0008) +[2023-10-14 06:08:31,676][100936] Updated weights for policy 0, policy_version 26650 (0.0007) +[2023-10-14 06:08:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 54558720. Throughput: 0: 1665.0, 1: 1654.6. Samples: 13647472. Policy #0 lag: (min: 7.0, avg: 11.1, max: 39.0) +[2023-10-14 06:08:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:08:34,420][100917] Updated weights for policy 1, policy_version 26632 (0.0008) +[2023-10-14 06:08:34,787][100917] Updated weights for policy 1, policy_version 26642 (0.0007) +[2023-10-14 06:08:35,165][100917] Updated weights for policy 1, policy_version 26652 (0.0011) +[2023-10-14 06:08:35,663][100936] Updated weights for policy 0, policy_version 26660 (0.0008) +[2023-10-14 06:08:36,034][100936] Updated weights for policy 0, policy_version 26670 (0.0007) +[2023-10-14 06:08:36,397][100936] Updated weights for policy 0, policy_version 26680 (0.0008) +[2023-10-14 06:08:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 54624256. Throughput: 0: 1679.0, 1: 1653.0. Samples: 13668050. Policy #0 lag: (min: 7.0, avg: 11.1, max: 39.0) +[2023-10-14 06:08:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:08:39,478][100917] Updated weights for policy 1, policy_version 26662 (0.0010) +[2023-10-14 06:08:39,844][100917] Updated weights for policy 1, policy_version 26672 (0.0009) +[2023-10-14 06:08:40,224][100917] Updated weights for policy 1, policy_version 26682 (0.0011) +[2023-10-14 06:08:40,499][100936] Updated weights for policy 0, policy_version 26690 (0.0008) +[2023-10-14 06:08:40,881][100936] Updated weights for policy 0, policy_version 26700 (0.0008) +[2023-10-14 06:08:41,258][100936] Updated weights for policy 0, policy_version 26710 (0.0008) +[2023-10-14 06:08:41,621][100936] Updated weights for policy 0, policy_version 26720 (0.0008) +[2023-10-14 06:08:43,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 54689792. Throughput: 0: 1660.0, 1: 1651.5. Samples: 13677324. Policy #0 lag: (min: 24.0, avg: 40.1, max: 56.0) +[2023-10-14 06:08:43,514][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:08:44,235][100917] Updated weights for policy 1, policy_version 26692 (0.0009) +[2023-10-14 06:08:44,613][100917] Updated weights for policy 1, policy_version 26702 (0.0007) +[2023-10-14 06:08:44,980][100917] Updated weights for policy 1, policy_version 26712 (0.0007) +[2023-10-14 06:08:45,867][100936] Updated weights for policy 0, policy_version 26730 (0.0010) +[2023-10-14 06:08:46,243][100936] Updated weights for policy 0, policy_version 26740 (0.0010) +[2023-10-14 06:08:46,605][100936] Updated weights for policy 0, policy_version 26750 (0.0008) +[2023-10-14 06:08:48,512][99942] Fps is (10 sec: 13107.7, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 54755328. Throughput: 0: 1677.0, 1: 1656.6. Samples: 13697490. Policy #0 lag: (min: 24.0, avg: 40.1, max: 56.0) +[2023-10-14 06:08:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:08:49,227][100917] Updated weights for policy 1, policy_version 26722 (0.0007) +[2023-10-14 06:08:49,597][100917] Updated weights for policy 1, policy_version 26732 (0.0008) +[2023-10-14 06:08:49,978][100917] Updated weights for policy 1, policy_version 26742 (0.0009) +[2023-10-14 06:08:50,354][100917] Updated weights for policy 1, policy_version 26752 (0.0009) +[2023-10-14 06:08:50,744][100936] Updated weights for policy 0, policy_version 26760 (0.0009) +[2023-10-14 06:08:51,111][100936] Updated weights for policy 0, policy_version 26770 (0.0008) +[2023-10-14 06:08:51,487][100936] Updated weights for policy 0, policy_version 26780 (0.0007) +[2023-10-14 06:08:53,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 54820864. Throughput: 0: 1674.7, 1: 1657.2. Samples: 13717882. Policy #0 lag: (min: 24.0, avg: 40.1, max: 56.0) +[2023-10-14 06:08:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:08:54,508][100917] Updated weights for policy 1, policy_version 26762 (0.0009) +[2023-10-14 06:08:54,873][100917] Updated weights for policy 1, policy_version 26772 (0.0009) +[2023-10-14 06:08:55,253][100917] Updated weights for policy 1, policy_version 26782 (0.0007) +[2023-10-14 06:08:55,547][100936] Updated weights for policy 0, policy_version 26790 (0.0007) +[2023-10-14 06:08:55,917][100936] Updated weights for policy 0, policy_version 26800 (0.0009) +[2023-10-14 06:08:56,294][100936] Updated weights for policy 0, policy_version 26810 (0.0009) +[2023-10-14 06:08:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 54886400. Throughput: 0: 1656.8, 1: 1658.9. Samples: 13727044. Policy #0 lag: (min: 24.0, avg: 40.1, max: 56.0) +[2023-10-14 06:08:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:08:59,363][100917] Updated weights for policy 1, policy_version 26792 (0.0010) +[2023-10-14 06:08:59,721][100917] Updated weights for policy 1, policy_version 26802 (0.0010) +[2023-10-14 06:09:00,100][100917] Updated weights for policy 1, policy_version 26812 (0.0010) +[2023-10-14 06:09:00,406][100936] Updated weights for policy 0, policy_version 26820 (0.0009) +[2023-10-14 06:09:00,782][100936] Updated weights for policy 0, policy_version 26830 (0.0010) +[2023-10-14 06:09:01,161][100936] Updated weights for policy 0, policy_version 26840 (0.0008) +[2023-10-14 06:09:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 54951936. Throughput: 0: 1672.9, 1: 1657.6. Samples: 13747416. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:09:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:09:04,280][100917] Updated weights for policy 1, policy_version 26822 (0.0010) +[2023-10-14 06:09:04,652][100917] Updated weights for policy 1, policy_version 26832 (0.0008) +[2023-10-14 06:09:05,025][100917] Updated weights for policy 1, policy_version 26842 (0.0009) +[2023-10-14 06:09:05,264][100936] Updated weights for policy 0, policy_version 26850 (0.0007) +[2023-10-14 06:09:05,634][100936] Updated weights for policy 0, policy_version 26860 (0.0007) +[2023-10-14 06:09:06,008][100936] Updated weights for policy 0, policy_version 26870 (0.0011) +[2023-10-14 06:09:06,377][100936] Updated weights for policy 0, policy_version 26880 (0.0008) +[2023-10-14 06:09:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 55017472. Throughput: 0: 1676.2, 1: 1653.4. Samples: 13768000. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:09:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:09:09,176][100917] Updated weights for policy 1, policy_version 26852 (0.0009) +[2023-10-14 06:09:09,547][100917] Updated weights for policy 1, policy_version 26862 (0.0008) +[2023-10-14 06:09:09,913][100917] Updated weights for policy 1, policy_version 26872 (0.0008) +[2023-10-14 06:09:10,268][100936] Updated weights for policy 0, policy_version 26890 (0.0010) +[2023-10-14 06:09:10,629][100936] Updated weights for policy 0, policy_version 26900 (0.0007) +[2023-10-14 06:09:11,009][100936] Updated weights for policy 0, policy_version 26910 (0.0007) +[2023-10-14 06:09:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 55083008. Throughput: 0: 1661.4, 1: 1656.0. Samples: 13777066. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:09:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:09:13,991][100917] Updated weights for policy 1, policy_version 26882 (0.0008) +[2023-10-14 06:09:14,378][100917] Updated weights for policy 1, policy_version 26892 (0.0011) +[2023-10-14 06:09:14,749][100917] Updated weights for policy 1, policy_version 26902 (0.0009) +[2023-10-14 06:09:15,129][100917] Updated weights for policy 1, policy_version 26912 (0.0009) +[2023-10-14 06:09:15,152][100936] Updated weights for policy 0, policy_version 26920 (0.0007) +[2023-10-14 06:09:15,520][100936] Updated weights for policy 0, policy_version 26930 (0.0008) +[2023-10-14 06:09:15,887][100936] Updated weights for policy 0, policy_version 26940 (0.0009) +[2023-10-14 06:09:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 55148544. Throughput: 0: 1676.1, 1: 1660.2. Samples: 13797608. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:09:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:09:19,085][100917] Updated weights for policy 1, policy_version 26922 (0.0010) +[2023-10-14 06:09:19,449][100917] Updated weights for policy 1, policy_version 26932 (0.0009) +[2023-10-14 06:09:19,827][100917] Updated weights for policy 1, policy_version 26942 (0.0009) +[2023-10-14 06:09:19,986][100936] Updated weights for policy 0, policy_version 26950 (0.0008) +[2023-10-14 06:09:20,357][100936] Updated weights for policy 0, policy_version 26960 (0.0009) +[2023-10-14 06:09:20,734][100936] Updated weights for policy 0, policy_version 26970 (0.0008) +[2023-10-14 06:09:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 55214080. Throughput: 0: 1672.4, 1: 1664.2. Samples: 13818194. Policy #0 lag: (min: 31.0, avg: 34.8, max: 63.0) +[2023-10-14 06:09:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:09:23,969][100917] Updated weights for policy 1, policy_version 26952 (0.0010) +[2023-10-14 06:09:24,338][100917] Updated weights for policy 1, policy_version 26962 (0.0007) +[2023-10-14 06:09:24,717][100917] Updated weights for policy 1, policy_version 26972 (0.0010) +[2023-10-14 06:09:24,886][100936] Updated weights for policy 0, policy_version 26980 (0.0009) +[2023-10-14 06:09:25,253][100936] Updated weights for policy 0, policy_version 26990 (0.0008) +[2023-10-14 06:09:25,625][100936] Updated weights for policy 0, policy_version 27000 (0.0009) +[2023-10-14 06:09:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 55279616. Throughput: 0: 1664.2, 1: 1666.8. Samples: 13827220. Policy #0 lag: (min: 31.0, avg: 34.8, max: 63.0) +[2023-10-14 06:09:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:09:28,868][100917] Updated weights for policy 1, policy_version 26982 (0.0008) +[2023-10-14 06:09:29,241][100917] Updated weights for policy 1, policy_version 26992 (0.0007) +[2023-10-14 06:09:29,618][100917] Updated weights for policy 1, policy_version 27002 (0.0008) +[2023-10-14 06:09:29,710][100936] Updated weights for policy 0, policy_version 27010 (0.0009) +[2023-10-14 06:09:30,079][100936] Updated weights for policy 0, policy_version 27020 (0.0009) +[2023-10-14 06:09:30,447][100936] Updated weights for policy 0, policy_version 27030 (0.0009) +[2023-10-14 06:09:30,825][100936] Updated weights for policy 0, policy_version 27040 (0.0009) +[2023-10-14 06:09:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 55345152. Throughput: 0: 1679.4, 1: 1661.5. Samples: 13847830. Policy #0 lag: (min: 31.0, avg: 34.8, max: 63.0) +[2023-10-14 06:09:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:09:33,787][100917] Updated weights for policy 1, policy_version 27012 (0.0010) +[2023-10-14 06:09:34,166][100917] Updated weights for policy 1, policy_version 27022 (0.0009) +[2023-10-14 06:09:34,527][100917] Updated weights for policy 1, policy_version 27032 (0.0008) +[2023-10-14 06:09:34,801][100936] Updated weights for policy 0, policy_version 27050 (0.0009) +[2023-10-14 06:09:35,174][100936] Updated weights for policy 0, policy_version 27060 (0.0009) +[2023-10-14 06:09:35,537][100936] Updated weights for policy 0, policy_version 27070 (0.0007) +[2023-10-14 06:09:38,512][100917] Updated weights for policy 1, policy_version 27042 (0.0008) +[2023-10-14 06:09:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 55410688. Throughput: 0: 1684.9, 1: 1661.5. Samples: 13868472. Policy #0 lag: (min: 31.0, avg: 34.8, max: 63.0) +[2023-10-14 06:09:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:09:38,521][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000027072_27721728.pth... +[2023-10-14 06:09:38,556][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000025536_26148864.pth +[2023-10-14 06:09:38,874][100917] Updated weights for policy 1, policy_version 27052 (0.0010) +[2023-10-14 06:09:39,251][100917] Updated weights for policy 1, policy_version 27062 (0.0007) +[2023-10-14 06:09:39,615][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000027072_27721728.pth... +[2023-10-14 06:09:39,621][100917] Updated weights for policy 1, policy_version 27072 (0.0007) +[2023-10-14 06:09:39,649][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000025504_26116096.pth +[2023-10-14 06:09:39,739][100936] Updated weights for policy 0, policy_version 27080 (0.0009) +[2023-10-14 06:09:40,111][100936] Updated weights for policy 0, policy_version 27090 (0.0008) +[2023-10-14 06:09:40,481][100936] Updated weights for policy 0, policy_version 27100 (0.0009) +[2023-10-14 06:09:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 55476224. Throughput: 0: 1678.6, 1: 1666.5. Samples: 13877576. Policy #0 lag: (min: 15.0, avg: 38.6, max: 40.0) +[2023-10-14 06:09:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:09:43,690][100917] Updated weights for policy 1, policy_version 27082 (0.0008) +[2023-10-14 06:09:44,066][100917] Updated weights for policy 1, policy_version 27092 (0.0009) +[2023-10-14 06:09:44,430][100917] Updated weights for policy 1, policy_version 27102 (0.0008) +[2023-10-14 06:09:44,554][100936] Updated weights for policy 0, policy_version 27110 (0.0008) +[2023-10-14 06:09:44,916][100936] Updated weights for policy 0, policy_version 27120 (0.0007) +[2023-10-14 06:09:45,281][100936] Updated weights for policy 0, policy_version 27130 (0.0007) +[2023-10-14 06:09:48,477][100917] Updated weights for policy 1, policy_version 27112 (0.0008) +[2023-10-14 06:09:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 55541760. Throughput: 0: 1678.0, 1: 1668.9. Samples: 13898028. Policy #0 lag: (min: 15.0, avg: 38.6, max: 40.0) +[2023-10-14 06:09:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:09:48,848][100917] Updated weights for policy 1, policy_version 27122 (0.0008) +[2023-10-14 06:09:49,233][100917] Updated weights for policy 1, policy_version 27132 (0.0009) +[2023-10-14 06:09:49,474][100936] Updated weights for policy 0, policy_version 27140 (0.0008) +[2023-10-14 06:09:49,838][100936] Updated weights for policy 0, policy_version 27150 (0.0008) +[2023-10-14 06:09:50,212][100936] Updated weights for policy 0, policy_version 27160 (0.0009) +[2023-10-14 06:09:53,395][100917] Updated weights for policy 1, policy_version 27142 (0.0009) +[2023-10-14 06:09:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 55607296. Throughput: 0: 1675.0, 1: 1671.9. Samples: 13918608. Policy #0 lag: (min: 15.0, avg: 38.6, max: 40.0) +[2023-10-14 06:09:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:09:53,768][100917] Updated weights for policy 1, policy_version 27152 (0.0009) +[2023-10-14 06:09:54,148][100917] Updated weights for policy 1, policy_version 27162 (0.0010) +[2023-10-14 06:09:54,262][100936] Updated weights for policy 0, policy_version 27170 (0.0008) +[2023-10-14 06:09:54,640][100936] Updated weights for policy 0, policy_version 27180 (0.0007) +[2023-10-14 06:09:55,011][100936] Updated weights for policy 0, policy_version 27190 (0.0007) +[2023-10-14 06:09:55,382][100936] Updated weights for policy 0, policy_version 27200 (0.0007) +[2023-10-14 06:09:58,253][100917] Updated weights for policy 1, policy_version 27172 (0.0008) +[2023-10-14 06:09:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 55672832. Throughput: 0: 1674.8, 1: 1670.4. Samples: 13927600. Policy #0 lag: (min: 15.0, avg: 38.6, max: 40.0) +[2023-10-14 06:09:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:09:58,640][100917] Updated weights for policy 1, policy_version 27182 (0.0009) +[2023-10-14 06:09:59,011][100917] Updated weights for policy 1, policy_version 27192 (0.0010) +[2023-10-14 06:09:59,479][100936] Updated weights for policy 0, policy_version 27210 (0.0009) +[2023-10-14 06:09:59,847][100936] Updated weights for policy 0, policy_version 27220 (0.0010) +[2023-10-14 06:10:00,218][100936] Updated weights for policy 0, policy_version 27230 (0.0010) +[2023-10-14 06:10:03,195][100917] Updated weights for policy 1, policy_version 27202 (0.0009) +[2023-10-14 06:10:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 55738368. Throughput: 0: 1671.5, 1: 1668.1. Samples: 13947888. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:10:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:10:03,598][100917] Updated weights for policy 1, policy_version 27212 (0.0008) +[2023-10-14 06:10:03,968][100917] Updated weights for policy 1, policy_version 27222 (0.0008) +[2023-10-14 06:10:04,314][100936] Updated weights for policy 0, policy_version 27240 (0.0008) +[2023-10-14 06:10:04,345][100917] Updated weights for policy 1, policy_version 27232 (0.0009) +[2023-10-14 06:10:04,679][100936] Updated weights for policy 0, policy_version 27250 (0.0009) +[2023-10-14 06:10:05,050][100936] Updated weights for policy 0, policy_version 27260 (0.0008) +[2023-10-14 06:10:08,401][100917] Updated weights for policy 1, policy_version 27242 (0.0011) +[2023-10-14 06:10:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 55803904. Throughput: 0: 1672.5, 1: 1662.9. Samples: 13968290. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:10:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:10:08,771][100917] Updated weights for policy 1, policy_version 27252 (0.0011) +[2023-10-14 06:10:09,143][100917] Updated weights for policy 1, policy_version 27262 (0.0007) +[2023-10-14 06:10:09,165][100936] Updated weights for policy 0, policy_version 27270 (0.0009) +[2023-10-14 06:10:09,534][100936] Updated weights for policy 0, policy_version 27280 (0.0011) +[2023-10-14 06:10:09,905][100936] Updated weights for policy 0, policy_version 27290 (0.0007) +[2023-10-14 06:10:13,156][100917] Updated weights for policy 1, policy_version 27272 (0.0008) +[2023-10-14 06:10:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 55869440. Throughput: 0: 1670.3, 1: 1665.3. Samples: 13977318. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:10:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:10:13,528][100917] Updated weights for policy 1, policy_version 27282 (0.0008) +[2023-10-14 06:10:13,903][100936] Updated weights for policy 0, policy_version 27300 (0.0008) +[2023-10-14 06:10:13,905][100917] Updated weights for policy 1, policy_version 27292 (0.0009) +[2023-10-14 06:10:14,286][100936] Updated weights for policy 0, policy_version 27310 (0.0009) +[2023-10-14 06:10:14,655][100936] Updated weights for policy 0, policy_version 27320 (0.0010) +[2023-10-14 06:10:18,044][100917] Updated weights for policy 1, policy_version 27302 (0.0007) +[2023-10-14 06:10:18,424][100917] Updated weights for policy 1, policy_version 27312 (0.0008) +[2023-10-14 06:10:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 55934976. Throughput: 0: 1665.6, 1: 1668.1. Samples: 13997850. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:10:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:10:18,786][100917] Updated weights for policy 1, policy_version 27322 (0.0009) +[2023-10-14 06:10:18,793][100936] Updated weights for policy 0, policy_version 27330 (0.0007) +[2023-10-14 06:10:19,168][100936] Updated weights for policy 0, policy_version 27340 (0.0008) +[2023-10-14 06:10:19,545][100936] Updated weights for policy 0, policy_version 27350 (0.0008) +[2023-10-14 06:10:19,920][100936] Updated weights for policy 0, policy_version 27360 (0.0007) +[2023-10-14 06:10:22,920][100917] Updated weights for policy 1, policy_version 27332 (0.0009) +[2023-10-14 06:10:23,302][100917] Updated weights for policy 1, policy_version 27342 (0.0011) +[2023-10-14 06:10:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 56000512. Throughput: 0: 1664.1, 1: 1663.6. Samples: 14018216. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) +[2023-10-14 06:10:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:10:23,669][100917] Updated weights for policy 1, policy_version 27352 (0.0007) +[2023-10-14 06:10:24,174][100936] Updated weights for policy 0, policy_version 27370 (0.0008) +[2023-10-14 06:10:24,544][100936] Updated weights for policy 0, policy_version 27380 (0.0007) +[2023-10-14 06:10:24,919][100936] Updated weights for policy 0, policy_version 27390 (0.0007) +[2023-10-14 06:10:27,825][100917] Updated weights for policy 1, policy_version 27362 (0.0008) +[2023-10-14 06:10:28,199][100917] Updated weights for policy 1, policy_version 27372 (0.0007) +[2023-10-14 06:10:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 56066048. Throughput: 0: 1667.3, 1: 1663.7. Samples: 14027470. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) +[2023-10-14 06:10:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:10:28,587][100917] Updated weights for policy 1, policy_version 27382 (0.0010) +[2023-10-14 06:10:28,950][100917] Updated weights for policy 1, policy_version 27392 (0.0009) +[2023-10-14 06:10:28,980][100936] Updated weights for policy 0, policy_version 27400 (0.0007) +[2023-10-14 06:10:29,340][100936] Updated weights for policy 0, policy_version 27410 (0.0007) +[2023-10-14 06:10:29,716][100936] Updated weights for policy 0, policy_version 27420 (0.0008) +[2023-10-14 06:10:33,001][100917] Updated weights for policy 1, policy_version 27402 (0.0007) +[2023-10-14 06:10:33,373][100917] Updated weights for policy 1, policy_version 27412 (0.0008) +[2023-10-14 06:10:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 56131584. Throughput: 0: 1664.7, 1: 1663.9. Samples: 14047812. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) +[2023-10-14 06:10:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:10:33,752][100917] Updated weights for policy 1, policy_version 27422 (0.0008) +[2023-10-14 06:10:33,763][100936] Updated weights for policy 0, policy_version 27430 (0.0008) +[2023-10-14 06:10:34,132][100936] Updated weights for policy 0, policy_version 27440 (0.0008) +[2023-10-14 06:10:34,497][100936] Updated weights for policy 0, policy_version 27450 (0.0008) +[2023-10-14 06:10:37,733][100917] Updated weights for policy 1, policy_version 27432 (0.0010) +[2023-10-14 06:10:38,120][100917] Updated weights for policy 1, policy_version 27442 (0.0010) +[2023-10-14 06:10:38,494][100917] Updated weights for policy 1, policy_version 27452 (0.0008) +[2023-10-14 06:10:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 56197120. Throughput: 0: 1658.8, 1: 1653.9. Samples: 14067680. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) +[2023-10-14 06:10:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:10:38,728][100936] Updated weights for policy 0, policy_version 27460 (0.0010) +[2023-10-14 06:10:39,102][100936] Updated weights for policy 0, policy_version 27470 (0.0008) +[2023-10-14 06:10:39,470][100936] Updated weights for policy 0, policy_version 27480 (0.0008) +[2023-10-14 06:10:42,751][100917] Updated weights for policy 1, policy_version 27462 (0.0008) +[2023-10-14 06:10:43,124][100917] Updated weights for policy 1, policy_version 27472 (0.0010) +[2023-10-14 06:10:43,495][100917] Updated weights for policy 1, policy_version 27482 (0.0008) +[2023-10-14 06:10:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 56262656. Throughput: 0: 1660.7, 1: 1664.4. Samples: 14077226. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:10:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:10:43,594][100936] Updated weights for policy 0, policy_version 27490 (0.0008) +[2023-10-14 06:10:43,966][100936] Updated weights for policy 0, policy_version 27500 (0.0009) +[2023-10-14 06:10:44,345][100936] Updated weights for policy 0, policy_version 27510 (0.0010) +[2023-10-14 06:10:44,717][100936] Updated weights for policy 0, policy_version 27520 (0.0011) +[2023-10-14 06:10:47,766][100917] Updated weights for policy 1, policy_version 27492 (0.0008) +[2023-10-14 06:10:48,159][100917] Updated weights for policy 1, policy_version 27502 (0.0007) +[2023-10-14 06:10:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 56328192. Throughput: 0: 1660.0, 1: 1659.5. Samples: 14097264. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:10:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:10:48,538][100917] Updated weights for policy 1, policy_version 27512 (0.0009) +[2023-10-14 06:10:48,965][100936] Updated weights for policy 0, policy_version 27530 (0.0009) +[2023-10-14 06:10:49,347][100936] Updated weights for policy 0, policy_version 27540 (0.0008) +[2023-10-14 06:10:49,725][100936] Updated weights for policy 0, policy_version 27550 (0.0007) +[2023-10-14 06:10:52,604][100917] Updated weights for policy 1, policy_version 27522 (0.0008) +[2023-10-14 06:10:52,981][100917] Updated weights for policy 1, policy_version 27532 (0.0009) +[2023-10-14 06:10:53,354][100917] Updated weights for policy 1, policy_version 27542 (0.0007) +[2023-10-14 06:10:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 56393728. Throughput: 0: 1655.8, 1: 1652.9. Samples: 14117182. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:10:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:10:53,719][100917] Updated weights for policy 1, policy_version 27552 (0.0007) +[2023-10-14 06:10:53,799][100936] Updated weights for policy 0, policy_version 27560 (0.0010) +[2023-10-14 06:10:54,177][100936] Updated weights for policy 0, policy_version 27570 (0.0011) +[2023-10-14 06:10:54,544][100936] Updated weights for policy 0, policy_version 27580 (0.0009) +[2023-10-14 06:10:58,018][100917] Updated weights for policy 1, policy_version 27562 (0.0009) +[2023-10-14 06:10:58,384][100917] Updated weights for policy 1, policy_version 27572 (0.0009) +[2023-10-14 06:10:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 56459264. Throughput: 0: 1657.1, 1: 1657.6. Samples: 14126482. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:10:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:10:58,735][100936] Updated weights for policy 0, policy_version 27590 (0.0008) +[2023-10-14 06:10:58,754][100917] Updated weights for policy 1, policy_version 27582 (0.0008) +[2023-10-14 06:10:59,097][100936] Updated weights for policy 0, policy_version 27600 (0.0010) +[2023-10-14 06:10:59,463][100936] Updated weights for policy 0, policy_version 27610 (0.0009) +[2023-10-14 06:11:02,814][100917] Updated weights for policy 1, policy_version 27592 (0.0008) +[2023-10-14 06:11:03,179][100917] Updated weights for policy 1, policy_version 27602 (0.0010) +[2023-10-14 06:11:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 56524800. Throughput: 0: 1649.9, 1: 1658.1. Samples: 14146712. Policy #0 lag: (min: 10.0, avg: 10.9, max: 31.0) +[2023-10-14 06:11:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:11:03,555][100917] Updated weights for policy 1, policy_version 27612 (0.0010) +[2023-10-14 06:11:03,568][100936] Updated weights for policy 0, policy_version 27620 (0.0010) +[2023-10-14 06:11:03,934][100936] Updated weights for policy 0, policy_version 27630 (0.0008) +[2023-10-14 06:11:04,314][100936] Updated weights for policy 0, policy_version 27640 (0.0011) +[2023-10-14 06:11:07,644][100917] Updated weights for policy 1, policy_version 27622 (0.0010) +[2023-10-14 06:11:08,019][100917] Updated weights for policy 1, policy_version 27632 (0.0009) +[2023-10-14 06:11:08,392][100917] Updated weights for policy 1, policy_version 27642 (0.0010) +[2023-10-14 06:11:08,494][100936] Updated weights for policy 0, policy_version 27650 (0.0010) +[2023-10-14 06:11:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 56590336. Throughput: 0: 1647.0, 1: 1648.7. Samples: 14166524. Policy #0 lag: (min: 10.0, avg: 10.9, max: 31.0) +[2023-10-14 06:11:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:11:08,865][100936] Updated weights for policy 0, policy_version 27660 (0.0009) +[2023-10-14 06:11:09,246][100936] Updated weights for policy 0, policy_version 27670 (0.0008) +[2023-10-14 06:11:09,613][100936] Updated weights for policy 0, policy_version 27680 (0.0009) +[2023-10-14 06:11:12,417][100917] Updated weights for policy 1, policy_version 27652 (0.0007) +[2023-10-14 06:11:12,796][100917] Updated weights for policy 1, policy_version 27662 (0.0007) +[2023-10-14 06:11:13,173][100917] Updated weights for policy 1, policy_version 27672 (0.0009) +[2023-10-14 06:11:13,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 56688640. Throughput: 0: 1641.5, 1: 1658.4. Samples: 14175966. Policy #0 lag: (min: 10.0, avg: 10.9, max: 31.0) +[2023-10-14 06:11:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:11:13,876][100936] Updated weights for policy 0, policy_version 27690 (0.0008) +[2023-10-14 06:11:14,243][100936] Updated weights for policy 0, policy_version 27700 (0.0009) +[2023-10-14 06:11:14,617][100936] Updated weights for policy 0, policy_version 27710 (0.0007) +[2023-10-14 06:11:17,331][100917] Updated weights for policy 1, policy_version 27682 (0.0007) +[2023-10-14 06:11:17,710][100917] Updated weights for policy 1, policy_version 27692 (0.0009) +[2023-10-14 06:11:18,077][100917] Updated weights for policy 1, policy_version 27702 (0.0007) +[2023-10-14 06:11:18,443][100917] Updated weights for policy 1, policy_version 27712 (0.0008) +[2023-10-14 06:11:18,512][99942] Fps is (10 sec: 16384.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 56754176. Throughput: 0: 1644.9, 1: 1656.5. Samples: 14196376. Policy #0 lag: (min: 10.0, avg: 10.9, max: 31.0) +[2023-10-14 06:11:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:11:18,740][100936] Updated weights for policy 0, policy_version 27720 (0.0009) +[2023-10-14 06:11:19,114][100936] Updated weights for policy 0, policy_version 27730 (0.0007) +[2023-10-14 06:11:19,483][100936] Updated weights for policy 0, policy_version 27740 (0.0007) +[2023-10-14 06:11:22,352][100917] Updated weights for policy 1, policy_version 27722 (0.0010) +[2023-10-14 06:11:22,714][100917] Updated weights for policy 1, policy_version 27732 (0.0007) +[2023-10-14 06:11:23,084][100917] Updated weights for policy 1, policy_version 27742 (0.0009) +[2023-10-14 06:11:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 56819712. Throughput: 0: 1655.4, 1: 1646.3. Samples: 14216258. Policy #0 lag: (min: 10.0, avg: 10.9, max: 30.0) +[2023-10-14 06:11:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:11:23,626][100936] Updated weights for policy 0, policy_version 27750 (0.0010) +[2023-10-14 06:11:23,986][100936] Updated weights for policy 0, policy_version 27760 (0.0010) +[2023-10-14 06:11:24,349][100936] Updated weights for policy 0, policy_version 27770 (0.0007) +[2023-10-14 06:11:27,113][100917] Updated weights for policy 1, policy_version 27752 (0.0007) +[2023-10-14 06:11:27,477][100917] Updated weights for policy 1, policy_version 27762 (0.0008) +[2023-10-14 06:11:27,852][100917] Updated weights for policy 1, policy_version 27772 (0.0010) +[2023-10-14 06:11:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 56885248. Throughput: 0: 1651.0, 1: 1662.0. Samples: 14226308. Policy #0 lag: (min: 10.0, avg: 10.9, max: 30.0) +[2023-10-14 06:11:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:11:28,529][100936] Updated weights for policy 0, policy_version 27780 (0.0010) +[2023-10-14 06:11:28,906][100936] Updated weights for policy 0, policy_version 27790 (0.0010) +[2023-10-14 06:11:29,273][100936] Updated weights for policy 0, policy_version 27800 (0.0009) +[2023-10-14 06:11:31,930][100917] Updated weights for policy 1, policy_version 27782 (0.0008) +[2023-10-14 06:11:32,300][100917] Updated weights for policy 1, policy_version 27792 (0.0008) +[2023-10-14 06:11:32,679][100917] Updated weights for policy 1, policy_version 27802 (0.0009) +[2023-10-14 06:11:33,245][100936] Updated weights for policy 0, policy_version 27810 (0.0008) +[2023-10-14 06:11:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 56950784. Throughput: 0: 1657.6, 1: 1667.2. Samples: 14246880. Policy #0 lag: (min: 10.0, avg: 10.9, max: 30.0) +[2023-10-14 06:11:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:11:33,624][100936] Updated weights for policy 0, policy_version 27820 (0.0009) +[2023-10-14 06:11:33,989][100936] Updated weights for policy 0, policy_version 27830 (0.0008) +[2023-10-14 06:11:34,357][100936] Updated weights for policy 0, policy_version 27840 (0.0011) +[2023-10-14 06:11:36,749][100917] Updated weights for policy 1, policy_version 27812 (0.0009) +[2023-10-14 06:11:37,156][100917] Updated weights for policy 1, policy_version 27822 (0.0007) +[2023-10-14 06:11:37,539][100917] Updated weights for policy 1, policy_version 27832 (0.0009) +[2023-10-14 06:11:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 57016320. Throughput: 0: 1651.6, 1: 1652.9. Samples: 14265886. Policy #0 lag: (min: 10.0, avg: 10.9, max: 30.0) +[2023-10-14 06:11:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:11:38,524][100936] Updated weights for policy 0, policy_version 27850 (0.0009) +[2023-10-14 06:11:38,525][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000027840_28508160.pth... +[2023-10-14 06:11:38,554][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000026272_26902528.pth +[2023-10-14 06:11:38,897][100936] Updated weights for policy 0, policy_version 27860 (0.0009) +[2023-10-14 06:11:39,263][100936] Updated weights for policy 0, policy_version 27870 (0.0007) +[2023-10-14 06:11:39,340][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000027872_28540928.pth... +[2023-10-14 06:11:39,380][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000026304_26935296.pth +[2023-10-14 06:11:41,674][100917] Updated weights for policy 1, policy_version 27842 (0.0008) +[2023-10-14 06:11:42,034][100917] Updated weights for policy 1, policy_version 27852 (0.0008) +[2023-10-14 06:11:42,407][100917] Updated weights for policy 1, policy_version 27862 (0.0008) +[2023-10-14 06:11:42,773][100917] Updated weights for policy 1, policy_version 27872 (0.0009) +[2023-10-14 06:11:43,413][100936] Updated weights for policy 0, policy_version 27880 (0.0009) +[2023-10-14 06:11:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 57081856. Throughput: 0: 1653.3, 1: 1677.8. Samples: 14276384. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:11:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:11:43,781][100936] Updated weights for policy 0, policy_version 27890 (0.0009) +[2023-10-14 06:11:44,156][100936] Updated weights for policy 0, policy_version 27900 (0.0009) +[2023-10-14 06:11:46,819][100917] Updated weights for policy 1, policy_version 27882 (0.0007) +[2023-10-14 06:11:47,186][100917] Updated weights for policy 1, policy_version 27892 (0.0007) +[2023-10-14 06:11:47,567][100917] Updated weights for policy 1, policy_version 27902 (0.0007) +[2023-10-14 06:11:48,225][100936] Updated weights for policy 0, policy_version 27910 (0.0009) +[2023-10-14 06:11:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 57147392. Throughput: 0: 1659.3, 1: 1659.8. Samples: 14296070. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:11:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:11:48,592][100936] Updated weights for policy 0, policy_version 27920 (0.0009) +[2023-10-14 06:11:48,962][100936] Updated weights for policy 0, policy_version 27930 (0.0007) +[2023-10-14 06:11:51,559][100917] Updated weights for policy 1, policy_version 27912 (0.0009) +[2023-10-14 06:11:51,935][100917] Updated weights for policy 1, policy_version 27922 (0.0010) +[2023-10-14 06:11:52,308][100917] Updated weights for policy 1, policy_version 27932 (0.0010) +[2023-10-14 06:11:52,994][100936] Updated weights for policy 0, policy_version 27940 (0.0007) +[2023-10-14 06:11:53,362][100936] Updated weights for policy 0, policy_version 27950 (0.0007) +[2023-10-14 06:11:53,512][99942] Fps is (10 sec: 13107.7, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 57212928. Throughput: 0: 1649.5, 1: 1658.3. Samples: 14315374. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:11:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:11:53,735][100936] Updated weights for policy 0, policy_version 27960 (0.0008) +[2023-10-14 06:11:56,459][100917] Updated weights for policy 1, policy_version 27942 (0.0008) +[2023-10-14 06:11:56,827][100917] Updated weights for policy 1, policy_version 27952 (0.0008) +[2023-10-14 06:11:57,215][100917] Updated weights for policy 1, policy_version 27962 (0.0008) +[2023-10-14 06:11:57,995][100936] Updated weights for policy 0, policy_version 27970 (0.0008) +[2023-10-14 06:11:58,406][100936] Updated weights for policy 0, policy_version 27980 (0.0007) +[2023-10-14 06:11:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 57278464. Throughput: 0: 1666.7, 1: 1673.3. Samples: 14326268. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:11:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:11:58,787][100936] Updated weights for policy 0, policy_version 27990 (0.0009) +[2023-10-14 06:11:59,158][100936] Updated weights for policy 0, policy_version 28000 (0.0008) +[2023-10-14 06:12:01,350][100917] Updated weights for policy 1, policy_version 27972 (0.0009) +[2023-10-14 06:12:01,719][100917] Updated weights for policy 1, policy_version 27982 (0.0007) +[2023-10-14 06:12:02,098][100917] Updated weights for policy 1, policy_version 27992 (0.0007) +[2023-10-14 06:12:03,283][100936] Updated weights for policy 0, policy_version 28010 (0.0010) +[2023-10-14 06:12:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 57344000. Throughput: 0: 1661.7, 1: 1657.8. Samples: 14345754. Policy #0 lag: (min: 2.0, avg: 10.4, max: 34.0) +[2023-10-14 06:12:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:12:03,655][100936] Updated weights for policy 0, policy_version 28020 (0.0010) +[2023-10-14 06:12:04,028][100936] Updated weights for policy 0, policy_version 28030 (0.0007) +[2023-10-14 06:12:06,190][100917] Updated weights for policy 1, policy_version 28002 (0.0008) +[2023-10-14 06:12:06,574][100917] Updated weights for policy 1, policy_version 28012 (0.0010) +[2023-10-14 06:12:06,953][100917] Updated weights for policy 1, policy_version 28022 (0.0009) +[2023-10-14 06:12:07,324][100917] Updated weights for policy 1, policy_version 28032 (0.0009) +[2023-10-14 06:12:08,151][100936] Updated weights for policy 0, policy_version 28040 (0.0009) +[2023-10-14 06:12:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 57409536. Throughput: 0: 1638.2, 1: 1669.2. Samples: 14365092. Policy #0 lag: (min: 2.0, avg: 10.4, max: 34.0) +[2023-10-14 06:12:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:12:08,519][100936] Updated weights for policy 0, policy_version 28050 (0.0009) +[2023-10-14 06:12:08,888][100936] Updated weights for policy 0, policy_version 28060 (0.0009) +[2023-10-14 06:12:11,453][100917] Updated weights for policy 1, policy_version 28042 (0.0007) +[2023-10-14 06:12:11,824][100917] Updated weights for policy 1, policy_version 28052 (0.0008) +[2023-10-14 06:12:12,203][100917] Updated weights for policy 1, policy_version 28062 (0.0008) +[2023-10-14 06:12:13,063][100936] Updated weights for policy 0, policy_version 28070 (0.0009) +[2023-10-14 06:12:13,424][100936] Updated weights for policy 0, policy_version 28080 (0.0008) +[2023-10-14 06:12:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 57475072. Throughput: 0: 1651.3, 1: 1673.5. Samples: 14375922. Policy #0 lag: (min: 2.0, avg: 10.4, max: 34.0) +[2023-10-14 06:12:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:12:13,802][100936] Updated weights for policy 0, policy_version 28090 (0.0008) +[2023-10-14 06:12:16,300][100917] Updated weights for policy 1, policy_version 28072 (0.0009) +[2023-10-14 06:12:16,666][100917] Updated weights for policy 1, policy_version 28082 (0.0010) +[2023-10-14 06:12:17,037][100917] Updated weights for policy 1, policy_version 28092 (0.0009) +[2023-10-14 06:12:17,935][100936] Updated weights for policy 0, policy_version 28100 (0.0009) +[2023-10-14 06:12:18,301][100936] Updated weights for policy 0, policy_version 28110 (0.0009) +[2023-10-14 06:12:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 57540608. Throughput: 0: 1646.5, 1: 1655.8. Samples: 14395482. Policy #0 lag: (min: 2.0, avg: 10.4, max: 34.0) +[2023-10-14 06:12:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:12:18,668][100936] Updated weights for policy 0, policy_version 28120 (0.0007) +[2023-10-14 06:12:21,086][100917] Updated weights for policy 1, policy_version 28102 (0.0008) +[2023-10-14 06:12:21,457][100917] Updated weights for policy 1, policy_version 28112 (0.0009) +[2023-10-14 06:12:21,823][100917] Updated weights for policy 1, policy_version 28122 (0.0009) +[2023-10-14 06:12:22,861][100936] Updated weights for policy 0, policy_version 28130 (0.0008) +[2023-10-14 06:12:23,238][100936] Updated weights for policy 0, policy_version 28140 (0.0009) +[2023-10-14 06:12:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 57606144. Throughput: 0: 1636.8, 1: 1678.1. Samples: 14415058. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:12:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:12:23,617][100936] Updated weights for policy 0, policy_version 28150 (0.0007) +[2023-10-14 06:12:23,989][100936] Updated weights for policy 0, policy_version 28160 (0.0008) +[2023-10-14 06:12:26,003][100917] Updated weights for policy 1, policy_version 28132 (0.0009) +[2023-10-14 06:12:26,409][100917] Updated weights for policy 1, policy_version 28142 (0.0008) +[2023-10-14 06:12:26,791][100917] Updated weights for policy 1, policy_version 28152 (0.0007) +[2023-10-14 06:12:28,101][100936] Updated weights for policy 0, policy_version 28170 (0.0009) +[2023-10-14 06:12:28,470][100936] Updated weights for policy 0, policy_version 28180 (0.0007) +[2023-10-14 06:12:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 57671680. Throughput: 0: 1652.7, 1: 1667.7. Samples: 14425802. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:12:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:12:28,840][100936] Updated weights for policy 0, policy_version 28190 (0.0009) +[2023-10-14 06:12:30,948][100917] Updated weights for policy 1, policy_version 28162 (0.0007) +[2023-10-14 06:12:31,314][100917] Updated weights for policy 1, policy_version 28172 (0.0008) +[2023-10-14 06:12:31,683][100917] Updated weights for policy 1, policy_version 28182 (0.0009) +[2023-10-14 06:12:32,065][100917] Updated weights for policy 1, policy_version 28192 (0.0009) +[2023-10-14 06:12:33,009][100936] Updated weights for policy 0, policy_version 28200 (0.0009) +[2023-10-14 06:12:33,384][100936] Updated weights for policy 0, policy_version 28210 (0.0009) +[2023-10-14 06:12:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 57737216. Throughput: 0: 1650.8, 1: 1662.5. Samples: 14445172. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:12:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:12:33,756][100936] Updated weights for policy 0, policy_version 28220 (0.0007) +[2023-10-14 06:12:36,206][100917] Updated weights for policy 1, policy_version 28202 (0.0008) +[2023-10-14 06:12:36,579][100917] Updated weights for policy 1, policy_version 28212 (0.0007) +[2023-10-14 06:12:36,957][100917] Updated weights for policy 1, policy_version 28222 (0.0007) +[2023-10-14 06:12:37,832][100936] Updated weights for policy 0, policy_version 28230 (0.0007) +[2023-10-14 06:12:38,202][100936] Updated weights for policy 0, policy_version 28240 (0.0009) +[2023-10-14 06:12:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 57802752. Throughput: 0: 1645.4, 1: 1673.2. Samples: 14464710. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:12:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:12:38,580][100936] Updated weights for policy 0, policy_version 28250 (0.0009) +[2023-10-14 06:12:41,109][100917] Updated weights for policy 1, policy_version 28232 (0.0009) +[2023-10-14 06:12:41,477][100917] Updated weights for policy 1, policy_version 28242 (0.0008) +[2023-10-14 06:12:41,848][100917] Updated weights for policy 1, policy_version 28252 (0.0009) +[2023-10-14 06:12:42,836][100936] Updated weights for policy 0, policy_version 28260 (0.0010) +[2023-10-14 06:12:43,200][100936] Updated weights for policy 0, policy_version 28270 (0.0008) +[2023-10-14 06:12:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 57868288. Throughput: 0: 1645.1, 1: 1669.4. Samples: 14475418. Policy #0 lag: (min: 26.0, avg: 26.4, max: 38.0) +[2023-10-14 06:12:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:12:43,563][100936] Updated weights for policy 0, policy_version 28280 (0.0007) +[2023-10-14 06:12:45,776][100917] Updated weights for policy 1, policy_version 28262 (0.0009) +[2023-10-14 06:12:46,148][100917] Updated weights for policy 1, policy_version 28272 (0.0010) +[2023-10-14 06:12:46,531][100917] Updated weights for policy 1, policy_version 28282 (0.0008) +[2023-10-14 06:12:47,809][100936] Updated weights for policy 0, policy_version 28290 (0.0007) +[2023-10-14 06:12:48,214][100936] Updated weights for policy 0, policy_version 28300 (0.0007) +[2023-10-14 06:12:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 57933824. Throughput: 0: 1651.5, 1: 1662.2. Samples: 14494870. Policy #0 lag: (min: 26.0, avg: 26.4, max: 38.0) +[2023-10-14 06:12:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:12:48,579][100936] Updated weights for policy 0, policy_version 28310 (0.0007) +[2023-10-14 06:12:48,942][100936] Updated weights for policy 0, policy_version 28320 (0.0008) +[2023-10-14 06:12:50,542][100917] Updated weights for policy 1, policy_version 28292 (0.0009) +[2023-10-14 06:12:50,921][100917] Updated weights for policy 1, policy_version 28302 (0.0010) +[2023-10-14 06:12:51,294][100917] Updated weights for policy 1, policy_version 28312 (0.0008) +[2023-10-14 06:12:52,828][100936] Updated weights for policy 0, policy_version 28330 (0.0008) +[2023-10-14 06:12:53,196][100936] Updated weights for policy 0, policy_version 28340 (0.0008) +[2023-10-14 06:12:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 57999360. Throughput: 0: 1646.6, 1: 1672.3. Samples: 14514440. Policy #0 lag: (min: 26.0, avg: 26.4, max: 38.0) +[2023-10-14 06:12:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:12:53,566][100936] Updated weights for policy 0, policy_version 28350 (0.0007) +[2023-10-14 06:12:55,485][100917] Updated weights for policy 1, policy_version 28322 (0.0008) +[2023-10-14 06:12:55,856][100917] Updated weights for policy 1, policy_version 28332 (0.0009) +[2023-10-14 06:12:56,229][100917] Updated weights for policy 1, policy_version 28342 (0.0007) +[2023-10-14 06:12:56,603][100917] Updated weights for policy 1, policy_version 28352 (0.0008) +[2023-10-14 06:12:57,758][100936] Updated weights for policy 0, policy_version 28360 (0.0008) +[2023-10-14 06:12:58,120][100936] Updated weights for policy 0, policy_version 28370 (0.0007) +[2023-10-14 06:12:58,492][100936] Updated weights for policy 0, policy_version 28380 (0.0007) +[2023-10-14 06:12:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 58064896. Throughput: 0: 1654.5, 1: 1656.8. Samples: 14524934. Policy #0 lag: (min: 26.0, avg: 26.4, max: 38.0) +[2023-10-14 06:12:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:13:00,832][100917] Updated weights for policy 1, policy_version 28362 (0.0009) +[2023-10-14 06:13:01,206][100917] Updated weights for policy 1, policy_version 28372 (0.0010) +[2023-10-14 06:13:01,577][100917] Updated weights for policy 1, policy_version 28382 (0.0010) +[2023-10-14 06:13:02,469][100936] Updated weights for policy 0, policy_version 28390 (0.0010) +[2023-10-14 06:13:02,843][100936] Updated weights for policy 0, policy_version 28400 (0.0007) +[2023-10-14 06:13:03,220][100936] Updated weights for policy 0, policy_version 28410 (0.0010) +[2023-10-14 06:13:03,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 58163200. Throughput: 0: 1655.1, 1: 1657.3. Samples: 14544542. Policy #0 lag: (min: 15.0, avg: 15.8, max: 34.0) +[2023-10-14 06:13:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:13:05,755][100917] Updated weights for policy 1, policy_version 28392 (0.0007) +[2023-10-14 06:13:06,131][100917] Updated weights for policy 1, policy_version 28402 (0.0009) +[2023-10-14 06:13:06,504][100917] Updated weights for policy 1, policy_version 28412 (0.0009) +[2023-10-14 06:13:07,466][100936] Updated weights for policy 0, policy_version 28420 (0.0009) +[2023-10-14 06:13:07,835][100936] Updated weights for policy 0, policy_version 28430 (0.0007) +[2023-10-14 06:13:08,214][100936] Updated weights for policy 0, policy_version 28440 (0.0009) +[2023-10-14 06:13:08,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 58228736. Throughput: 0: 1652.8, 1: 1658.8. Samples: 14564080. Policy #0 lag: (min: 15.0, avg: 15.8, max: 34.0) +[2023-10-14 06:13:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:13:10,788][100917] Updated weights for policy 1, policy_version 28422 (0.0009) +[2023-10-14 06:13:11,178][100917] Updated weights for policy 1, policy_version 28432 (0.0010) +[2023-10-14 06:13:11,557][100917] Updated weights for policy 1, policy_version 28442 (0.0010) +[2023-10-14 06:13:12,349][100936] Updated weights for policy 0, policy_version 28450 (0.0010) +[2023-10-14 06:13:12,722][100936] Updated weights for policy 0, policy_version 28460 (0.0010) +[2023-10-14 06:13:13,101][100936] Updated weights for policy 0, policy_version 28470 (0.0010) +[2023-10-14 06:13:13,466][100936] Updated weights for policy 0, policy_version 28480 (0.0007) +[2023-10-14 06:13:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 58294272. Throughput: 0: 1657.7, 1: 1652.9. Samples: 14574778. Policy #0 lag: (min: 15.0, avg: 15.8, max: 34.0) +[2023-10-14 06:13:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:13:15,420][100917] Updated weights for policy 1, policy_version 28452 (0.0009) +[2023-10-14 06:13:15,798][100917] Updated weights for policy 1, policy_version 28462 (0.0008) +[2023-10-14 06:13:16,167][100917] Updated weights for policy 1, policy_version 28472 (0.0009) +[2023-10-14 06:13:17,674][100936] Updated weights for policy 0, policy_version 28490 (0.0010) +[2023-10-14 06:13:18,052][100936] Updated weights for policy 0, policy_version 28500 (0.0008) +[2023-10-14 06:13:18,420][100936] Updated weights for policy 0, policy_version 28510 (0.0011) +[2023-10-14 06:13:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 58359808. Throughput: 0: 1655.6, 1: 1654.8. Samples: 14594144. Policy #0 lag: (min: 0.0, avg: 25.3, max: 32.0) +[2023-10-14 06:13:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:13:20,235][100917] Updated weights for policy 1, policy_version 28482 (0.0008) +[2023-10-14 06:13:20,608][100917] Updated weights for policy 1, policy_version 28492 (0.0007) +[2023-10-14 06:13:20,987][100917] Updated weights for policy 1, policy_version 28502 (0.0009) +[2023-10-14 06:13:21,362][100917] Updated weights for policy 1, policy_version 28512 (0.0011) +[2023-10-14 06:13:22,653][100936] Updated weights for policy 0, policy_version 28520 (0.0008) +[2023-10-14 06:13:23,031][100936] Updated weights for policy 0, policy_version 28530 (0.0008) +[2023-10-14 06:13:23,396][100936] Updated weights for policy 0, policy_version 28540 (0.0009) +[2023-10-14 06:13:23,512][99942] Fps is (10 sec: 9830.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 58392576. Throughput: 0: 1651.2, 1: 1664.7. Samples: 14613926. Policy #0 lag: (min: 0.0, avg: 25.3, max: 32.0) +[2023-10-14 06:13:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:13:25,142][100917] Updated weights for policy 1, policy_version 28522 (0.0008) +[2023-10-14 06:13:25,506][100917] Updated weights for policy 1, policy_version 28532 (0.0009) +[2023-10-14 06:13:25,889][100917] Updated weights for policy 1, policy_version 28542 (0.0010) +[2023-10-14 06:13:27,405][100936] Updated weights for policy 0, policy_version 28550 (0.0008) +[2023-10-14 06:13:27,773][100936] Updated weights for policy 0, policy_version 28560 (0.0011) +[2023-10-14 06:13:28,143][100936] Updated weights for policy 0, policy_version 28570 (0.0009) +[2023-10-14 06:13:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 58490880. Throughput: 0: 1660.8, 1: 1644.2. Samples: 14624140. Policy #0 lag: (min: 0.0, avg: 25.3, max: 32.0) +[2023-10-14 06:13:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:13:30,142][100917] Updated weights for policy 1, policy_version 28552 (0.0009) +[2023-10-14 06:13:30,523][100917] Updated weights for policy 1, policy_version 28562 (0.0009) +[2023-10-14 06:13:30,892][100917] Updated weights for policy 1, policy_version 28572 (0.0008) +[2023-10-14 06:13:32,302][100936] Updated weights for policy 0, policy_version 28580 (0.0008) +[2023-10-14 06:13:32,702][100936] Updated weights for policy 0, policy_version 28590 (0.0012) +[2023-10-14 06:13:33,070][100936] Updated weights for policy 0, policy_version 28600 (0.0007) +[2023-10-14 06:13:33,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 58556416. Throughput: 0: 1652.1, 1: 1666.2. Samples: 14644194. Policy #0 lag: (min: 0.0, avg: 25.3, max: 32.0) +[2023-10-14 06:13:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:13:34,969][100917] Updated weights for policy 1, policy_version 28582 (0.0009) +[2023-10-14 06:13:35,337][100917] Updated weights for policy 1, policy_version 28592 (0.0007) +[2023-10-14 06:13:35,725][100917] Updated weights for policy 1, policy_version 28602 (0.0007) +[2023-10-14 06:13:37,236][100936] Updated weights for policy 0, policy_version 28610 (0.0007) +[2023-10-14 06:13:37,600][100936] Updated weights for policy 0, policy_version 28620 (0.0007) +[2023-10-14 06:13:37,966][100936] Updated weights for policy 0, policy_version 28630 (0.0007) +[2023-10-14 06:13:38,335][100936] Updated weights for policy 0, policy_version 28640 (0.0007) +[2023-10-14 06:13:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 58621952. Throughput: 0: 1648.9, 1: 1667.2. Samples: 14663666. Policy #0 lag: (min: 6.0, avg: 18.9, max: 38.0) +[2023-10-14 06:13:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:13:38,524][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000028608_29294592.pth... +[2023-10-14 06:13:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000028640_29327360.pth... +[2023-10-14 06:13:38,559][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000027072_27721728.pth +[2023-10-14 06:13:38,563][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000027072_27721728.pth +[2023-10-14 06:13:39,850][100917] Updated weights for policy 1, policy_version 28612 (0.0007) +[2023-10-14 06:13:40,218][100917] Updated weights for policy 1, policy_version 28622 (0.0007) +[2023-10-14 06:13:40,595][100917] Updated weights for policy 1, policy_version 28632 (0.0007) +[2023-10-14 06:13:42,395][100936] Updated weights for policy 0, policy_version 28650 (0.0009) +[2023-10-14 06:13:42,776][100936] Updated weights for policy 0, policy_version 28660 (0.0007) +[2023-10-14 06:13:43,143][100936] Updated weights for policy 0, policy_version 28670 (0.0008) +[2023-10-14 06:13:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 58687488. Throughput: 0: 1659.1, 1: 1653.3. Samples: 14673994. Policy #0 lag: (min: 6.0, avg: 18.9, max: 38.0) +[2023-10-14 06:13:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:13:44,750][100917] Updated weights for policy 1, policy_version 28642 (0.0008) +[2023-10-14 06:13:45,115][100917] Updated weights for policy 1, policy_version 28652 (0.0009) +[2023-10-14 06:13:45,484][100917] Updated weights for policy 1, policy_version 28662 (0.0007) +[2023-10-14 06:13:45,856][100917] Updated weights for policy 1, policy_version 28672 (0.0007) +[2023-10-14 06:13:47,016][100936] Updated weights for policy 0, policy_version 28680 (0.0009) +[2023-10-14 06:13:47,384][100936] Updated weights for policy 0, policy_version 28690 (0.0007) +[2023-10-14 06:13:47,766][100936] Updated weights for policy 0, policy_version 28700 (0.0011) +[2023-10-14 06:13:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 58753024. Throughput: 0: 1648.7, 1: 1669.5. Samples: 14693858. Policy #0 lag: (min: 6.0, avg: 18.9, max: 38.0) +[2023-10-14 06:13:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:13:49,947][100917] Updated weights for policy 1, policy_version 28682 (0.0008) +[2023-10-14 06:13:50,318][100917] Updated weights for policy 1, policy_version 28692 (0.0008) +[2023-10-14 06:13:50,692][100917] Updated weights for policy 1, policy_version 28702 (0.0008) +[2023-10-14 06:13:52,006][100936] Updated weights for policy 0, policy_version 28710 (0.0008) +[2023-10-14 06:13:52,374][100936] Updated weights for policy 0, policy_version 28720 (0.0008) +[2023-10-14 06:13:52,752][100936] Updated weights for policy 0, policy_version 28730 (0.0008) +[2023-10-14 06:13:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 58818560. Throughput: 0: 1653.8, 1: 1668.5. Samples: 14713584. Policy #0 lag: (min: 6.0, avg: 18.9, max: 38.0) +[2023-10-14 06:13:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:13:54,855][100917] Updated weights for policy 1, policy_version 28712 (0.0008) +[2023-10-14 06:13:55,227][100917] Updated weights for policy 1, policy_version 28722 (0.0010) +[2023-10-14 06:13:55,604][100917] Updated weights for policy 1, policy_version 28732 (0.0007) +[2023-10-14 06:13:56,986][100936] Updated weights for policy 0, policy_version 28740 (0.0009) +[2023-10-14 06:13:57,356][100936] Updated weights for policy 0, policy_version 28750 (0.0009) +[2023-10-14 06:13:57,734][100936] Updated weights for policy 0, policy_version 28760 (0.0008) +[2023-10-14 06:13:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 58884096. Throughput: 0: 1658.0, 1: 1648.9. Samples: 14723592. Policy #0 lag: (min: 3.0, avg: 6.9, max: 35.0) +[2023-10-14 06:13:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:13:59,856][100917] Updated weights for policy 1, policy_version 28742 (0.0007) +[2023-10-14 06:14:00,245][100917] Updated weights for policy 1, policy_version 28752 (0.0008) +[2023-10-14 06:14:00,606][100917] Updated weights for policy 1, policy_version 28762 (0.0009) +[2023-10-14 06:14:01,889][100936] Updated weights for policy 0, policy_version 28770 (0.0010) +[2023-10-14 06:14:02,259][100936] Updated weights for policy 0, policy_version 28780 (0.0009) +[2023-10-14 06:14:02,638][100936] Updated weights for policy 0, policy_version 28790 (0.0008) +[2023-10-14 06:14:03,008][100936] Updated weights for policy 0, policy_version 28800 (0.0007) +[2023-10-14 06:14:03,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 58949632. Throughput: 0: 1648.3, 1: 1663.3. Samples: 14743168. Policy #0 lag: (min: 3.0, avg: 6.9, max: 35.0) +[2023-10-14 06:14:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:14:04,597][100917] Updated weights for policy 1, policy_version 28772 (0.0007) +[2023-10-14 06:14:04,968][100917] Updated weights for policy 1, policy_version 28782 (0.0008) +[2023-10-14 06:14:05,342][100917] Updated weights for policy 1, policy_version 28792 (0.0009) +[2023-10-14 06:14:07,088][100936] Updated weights for policy 0, policy_version 28810 (0.0009) +[2023-10-14 06:14:07,451][100936] Updated weights for policy 0, policy_version 28820 (0.0009) +[2023-10-14 06:14:07,819][100936] Updated weights for policy 0, policy_version 28830 (0.0010) +[2023-10-14 06:14:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 59015168. Throughput: 0: 1651.7, 1: 1663.0. Samples: 14763088. Policy #0 lag: (min: 3.0, avg: 6.9, max: 35.0) +[2023-10-14 06:14:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:14:09,421][100917] Updated weights for policy 1, policy_version 28802 (0.0008) +[2023-10-14 06:14:09,794][100917] Updated weights for policy 1, policy_version 28812 (0.0010) +[2023-10-14 06:14:10,170][100917] Updated weights for policy 1, policy_version 28822 (0.0010) +[2023-10-14 06:14:10,546][100917] Updated weights for policy 1, policy_version 28832 (0.0009) +[2023-10-14 06:14:11,942][100936] Updated weights for policy 0, policy_version 28840 (0.0009) +[2023-10-14 06:14:12,317][100936] Updated weights for policy 0, policy_version 28850 (0.0008) +[2023-10-14 06:14:12,691][100936] Updated weights for policy 0, policy_version 28860 (0.0007) +[2023-10-14 06:14:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 59080704. Throughput: 0: 1658.6, 1: 1654.9. Samples: 14773244. Policy #0 lag: (min: 3.0, avg: 6.9, max: 35.0) +[2023-10-14 06:14:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:14:14,730][100917] Updated weights for policy 1, policy_version 28842 (0.0010) +[2023-10-14 06:14:15,105][100917] Updated weights for policy 1, policy_version 28852 (0.0008) +[2023-10-14 06:14:15,476][100917] Updated weights for policy 1, policy_version 28862 (0.0008) +[2023-10-14 06:14:16,775][100936] Updated weights for policy 0, policy_version 28870 (0.0009) +[2023-10-14 06:14:17,164][100936] Updated weights for policy 0, policy_version 28880 (0.0008) +[2023-10-14 06:14:17,531][100936] Updated weights for policy 0, policy_version 28890 (0.0010) +[2023-10-14 06:14:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 59146240. Throughput: 0: 1643.2, 1: 1655.0. Samples: 14792614. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 06:14:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:14:19,701][100917] Updated weights for policy 1, policy_version 28872 (0.0008) +[2023-10-14 06:14:20,070][100917] Updated weights for policy 1, policy_version 28882 (0.0009) +[2023-10-14 06:14:20,441][100917] Updated weights for policy 1, policy_version 28892 (0.0007) +[2023-10-14 06:14:21,659][100936] Updated weights for policy 0, policy_version 28900 (0.0010) +[2023-10-14 06:14:22,032][100936] Updated weights for policy 0, policy_version 28910 (0.0008) +[2023-10-14 06:14:22,407][100936] Updated weights for policy 0, policy_version 28920 (0.0007) +[2023-10-14 06:14:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 59211776. Throughput: 0: 1658.6, 1: 1654.0. Samples: 14812732. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 06:14:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:14:24,585][100917] Updated weights for policy 1, policy_version 28902 (0.0008) +[2023-10-14 06:14:24,961][100917] Updated weights for policy 1, policy_version 28912 (0.0010) +[2023-10-14 06:14:25,328][100917] Updated weights for policy 1, policy_version 28922 (0.0010) +[2023-10-14 06:14:26,548][100936] Updated weights for policy 0, policy_version 28930 (0.0008) +[2023-10-14 06:14:26,920][100936] Updated weights for policy 0, policy_version 28940 (0.0011) +[2023-10-14 06:14:27,296][100936] Updated weights for policy 0, policy_version 28950 (0.0009) +[2023-10-14 06:14:27,656][100936] Updated weights for policy 0, policy_version 28960 (0.0008) +[2023-10-14 06:14:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 59277312. Throughput: 0: 1658.9, 1: 1649.0. Samples: 14822848. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 06:14:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:14:29,361][100917] Updated weights for policy 1, policy_version 28932 (0.0009) +[2023-10-14 06:14:29,725][100917] Updated weights for policy 1, policy_version 28942 (0.0011) +[2023-10-14 06:14:30,100][100917] Updated weights for policy 1, policy_version 28952 (0.0011) +[2023-10-14 06:14:31,724][100936] Updated weights for policy 0, policy_version 28970 (0.0008) +[2023-10-14 06:14:32,099][100936] Updated weights for policy 0, policy_version 28980 (0.0008) +[2023-10-14 06:14:32,467][100936] Updated weights for policy 0, policy_version 28990 (0.0009) +[2023-10-14 06:14:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 59342848. Throughput: 0: 1646.7, 1: 1652.3. Samples: 14842314. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 06:14:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:14:34,340][100917] Updated weights for policy 1, policy_version 28962 (0.0009) +[2023-10-14 06:14:34,719][100917] Updated weights for policy 1, policy_version 28972 (0.0007) +[2023-10-14 06:14:35,095][100917] Updated weights for policy 1, policy_version 28982 (0.0009) +[2023-10-14 06:14:35,462][100917] Updated weights for policy 1, policy_version 28992 (0.0008) +[2023-10-14 06:14:36,556][100936] Updated weights for policy 0, policy_version 29000 (0.0008) +[2023-10-14 06:14:36,942][100936] Updated weights for policy 0, policy_version 29010 (0.0010) +[2023-10-14 06:14:37,307][100936] Updated weights for policy 0, policy_version 29020 (0.0009) +[2023-10-14 06:14:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 59408384. Throughput: 0: 1659.2, 1: 1650.9. Samples: 14862542. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 06:14:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:14:39,482][100917] Updated weights for policy 1, policy_version 29002 (0.0010) +[2023-10-14 06:14:39,848][100917] Updated weights for policy 1, policy_version 29012 (0.0009) +[2023-10-14 06:14:40,219][100917] Updated weights for policy 1, policy_version 29022 (0.0008) +[2023-10-14 06:14:41,349][100936] Updated weights for policy 0, policy_version 29030 (0.0008) +[2023-10-14 06:14:41,726][100936] Updated weights for policy 0, policy_version 29040 (0.0008) +[2023-10-14 06:14:42,093][100936] Updated weights for policy 0, policy_version 29050 (0.0010) +[2023-10-14 06:14:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 59473920. Throughput: 0: 1659.4, 1: 1655.3. Samples: 14872752. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 06:14:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:14:44,392][100917] Updated weights for policy 1, policy_version 29032 (0.0011) +[2023-10-14 06:14:44,766][100917] Updated weights for policy 1, policy_version 29042 (0.0009) +[2023-10-14 06:14:45,131][100917] Updated weights for policy 1, policy_version 29052 (0.0010) +[2023-10-14 06:14:46,168][100936] Updated weights for policy 0, policy_version 29060 (0.0011) +[2023-10-14 06:14:46,538][100936] Updated weights for policy 0, policy_version 29070 (0.0009) +[2023-10-14 06:14:46,909][100936] Updated weights for policy 0, policy_version 29080 (0.0008) +[2023-10-14 06:14:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 59539456. Throughput: 0: 1648.0, 1: 1656.1. Samples: 14891856. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 06:14:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:14:49,570][100917] Updated weights for policy 1, policy_version 29062 (0.0010) +[2023-10-14 06:14:49,959][100917] Updated weights for policy 1, policy_version 29072 (0.0010) +[2023-10-14 06:14:50,336][100917] Updated weights for policy 1, policy_version 29082 (0.0007) +[2023-10-14 06:14:51,103][100936] Updated weights for policy 0, policy_version 29090 (0.0009) +[2023-10-14 06:14:51,474][100936] Updated weights for policy 0, policy_version 29100 (0.0008) +[2023-10-14 06:14:51,843][100936] Updated weights for policy 0, policy_version 29110 (0.0008) +[2023-10-14 06:14:52,207][100936] Updated weights for policy 0, policy_version 29120 (0.0008) +[2023-10-14 06:14:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 59604992. Throughput: 0: 1660.3, 1: 1646.3. Samples: 14911884. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 06:14:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:14:54,365][100917] Updated weights for policy 1, policy_version 29092 (0.0009) +[2023-10-14 06:14:54,731][100917] Updated weights for policy 1, policy_version 29102 (0.0007) +[2023-10-14 06:14:55,100][100917] Updated weights for policy 1, policy_version 29112 (0.0009) +[2023-10-14 06:14:56,389][100936] Updated weights for policy 0, policy_version 29130 (0.0008) +[2023-10-14 06:14:56,762][100936] Updated weights for policy 0, policy_version 29140 (0.0008) +[2023-10-14 06:14:57,145][100936] Updated weights for policy 0, policy_version 29150 (0.0008) +[2023-10-14 06:14:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 59670528. Throughput: 0: 1651.5, 1: 1649.9. Samples: 14921810. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 06:14:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:14:59,199][100917] Updated weights for policy 1, policy_version 29122 (0.0009) +[2023-10-14 06:14:59,569][100917] Updated weights for policy 1, policy_version 29132 (0.0008) +[2023-10-14 06:14:59,940][100917] Updated weights for policy 1, policy_version 29142 (0.0009) +[2023-10-14 06:15:00,315][100917] Updated weights for policy 1, policy_version 29152 (0.0009) +[2023-10-14 06:15:01,252][100936] Updated weights for policy 0, policy_version 29160 (0.0009) +[2023-10-14 06:15:01,613][100936] Updated weights for policy 0, policy_version 29170 (0.0011) +[2023-10-14 06:15:01,989][100936] Updated weights for policy 0, policy_version 29180 (0.0008) +[2023-10-14 06:15:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 59736064. Throughput: 0: 1653.1, 1: 1648.5. Samples: 14941188. Policy #0 lag: (min: 31.0, avg: 34.4, max: 63.0) +[2023-10-14 06:15:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:15:04,405][100917] Updated weights for policy 1, policy_version 29162 (0.0009) +[2023-10-14 06:15:04,773][100917] Updated weights for policy 1, policy_version 29172 (0.0007) +[2023-10-14 06:15:05,154][100917] Updated weights for policy 1, policy_version 29182 (0.0007) +[2023-10-14 06:15:06,293][100936] Updated weights for policy 0, policy_version 29190 (0.0009) +[2023-10-14 06:15:06,663][100936] Updated weights for policy 0, policy_version 29200 (0.0010) +[2023-10-14 06:15:07,034][100936] Updated weights for policy 0, policy_version 29210 (0.0011) +[2023-10-14 06:15:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 59801600. Throughput: 0: 1654.3, 1: 1649.7. Samples: 14961412. Policy #0 lag: (min: 31.0, avg: 34.4, max: 63.0) +[2023-10-14 06:15:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:15:09,282][100917] Updated weights for policy 1, policy_version 29192 (0.0009) +[2023-10-14 06:15:09,649][100917] Updated weights for policy 1, policy_version 29202 (0.0008) +[2023-10-14 06:15:10,026][100917] Updated weights for policy 1, policy_version 29212 (0.0008) +[2023-10-14 06:15:11,222][100936] Updated weights for policy 0, policy_version 29220 (0.0008) +[2023-10-14 06:15:11,599][100936] Updated weights for policy 0, policy_version 29230 (0.0009) +[2023-10-14 06:15:11,974][100936] Updated weights for policy 0, policy_version 29240 (0.0008) +[2023-10-14 06:15:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 59867136. Throughput: 0: 1646.4, 1: 1653.5. Samples: 14971342. Policy #0 lag: (min: 31.0, avg: 34.4, max: 63.0) +[2023-10-14 06:15:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:15:14,129][100917] Updated weights for policy 1, policy_version 29222 (0.0009) +[2023-10-14 06:15:14,515][100917] Updated weights for policy 1, policy_version 29232 (0.0011) +[2023-10-14 06:15:14,880][100917] Updated weights for policy 1, policy_version 29242 (0.0010) +[2023-10-14 06:15:16,215][100936] Updated weights for policy 0, policy_version 29250 (0.0009) +[2023-10-14 06:15:16,577][100936] Updated weights for policy 0, policy_version 29260 (0.0009) +[2023-10-14 06:15:16,946][100936] Updated weights for policy 0, policy_version 29270 (0.0007) +[2023-10-14 06:15:17,312][100936] Updated weights for policy 0, policy_version 29280 (0.0010) +[2023-10-14 06:15:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13329.4). Total num frames: 59932672. Throughput: 0: 1646.7, 1: 1652.9. Samples: 14990796. Policy #0 lag: (min: 31.0, avg: 34.4, max: 63.0) +[2023-10-14 06:15:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:15:18,980][100917] Updated weights for policy 1, policy_version 29252 (0.0008) +[2023-10-14 06:15:19,353][100917] Updated weights for policy 1, policy_version 29262 (0.0008) +[2023-10-14 06:15:19,727][100917] Updated weights for policy 1, policy_version 29272 (0.0008) +[2023-10-14 06:15:21,431][100936] Updated weights for policy 0, policy_version 29290 (0.0009) +[2023-10-14 06:15:21,798][100936] Updated weights for policy 0, policy_version 29300 (0.0010) +[2023-10-14 06:15:22,173][100936] Updated weights for policy 0, policy_version 29310 (0.0010) +[2023-10-14 06:15:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 59998208. Throughput: 0: 1651.4, 1: 1655.7. Samples: 15011360. Policy #0 lag: (min: 31.0, avg: 34.4, max: 63.0) +[2023-10-14 06:15:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:15:23,763][100917] Updated weights for policy 1, policy_version 29282 (0.0009) +[2023-10-14 06:15:24,134][100917] Updated weights for policy 1, policy_version 29292 (0.0007) +[2023-10-14 06:15:24,506][100917] Updated weights for policy 1, policy_version 29302 (0.0008) +[2023-10-14 06:15:24,871][100917] Updated weights for policy 1, policy_version 29312 (0.0007) +[2023-10-14 06:15:26,170][100936] Updated weights for policy 0, policy_version 29320 (0.0009) +[2023-10-14 06:15:26,539][100936] Updated weights for policy 0, policy_version 29330 (0.0007) +[2023-10-14 06:15:26,904][100936] Updated weights for policy 0, policy_version 29340 (0.0011) +[2023-10-14 06:15:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 60063744. Throughput: 0: 1640.6, 1: 1653.2. Samples: 15020972. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:15:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:15:29,111][100917] Updated weights for policy 1, policy_version 29322 (0.0007) +[2023-10-14 06:15:29,482][100917] Updated weights for policy 1, policy_version 29332 (0.0009) +[2023-10-14 06:15:29,857][100917] Updated weights for policy 1, policy_version 29342 (0.0009) +[2023-10-14 06:15:31,110][100936] Updated weights for policy 0, policy_version 29350 (0.0010) +[2023-10-14 06:15:31,477][100936] Updated weights for policy 0, policy_version 29360 (0.0010) +[2023-10-14 06:15:31,853][100936] Updated weights for policy 0, policy_version 29370 (0.0007) +[2023-10-14 06:15:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 60129280. Throughput: 0: 1653.2, 1: 1665.7. Samples: 15041206. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:15:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:15:34,061][100917] Updated weights for policy 1, policy_version 29352 (0.0009) +[2023-10-14 06:15:34,430][100917] Updated weights for policy 1, policy_version 29362 (0.0007) +[2023-10-14 06:15:34,812][100917] Updated weights for policy 1, policy_version 29372 (0.0007) +[2023-10-14 06:15:35,964][100936] Updated weights for policy 0, policy_version 29380 (0.0010) +[2023-10-14 06:15:36,323][100936] Updated weights for policy 0, policy_version 29390 (0.0010) +[2023-10-14 06:15:36,701][100936] Updated weights for policy 0, policy_version 29400 (0.0009) +[2023-10-14 06:15:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 60194816. Throughput: 0: 1661.2, 1: 1669.1. Samples: 15061752. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:15:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:15:38,526][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000029408_30113792.pth... +[2023-10-14 06:15:38,526][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000029376_30081024.pth... +[2023-10-14 06:15:38,561][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000027840_28508160.pth +[2023-10-14 06:15:38,566][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000027872_28540928.pth +[2023-10-14 06:15:38,920][100917] Updated weights for policy 1, policy_version 29382 (0.0007) +[2023-10-14 06:15:39,303][100917] Updated weights for policy 1, policy_version 29392 (0.0007) +[2023-10-14 06:15:39,666][100917] Updated weights for policy 1, policy_version 29402 (0.0008) +[2023-10-14 06:15:40,779][100936] Updated weights for policy 0, policy_version 29410 (0.0007) +[2023-10-14 06:15:41,149][100936] Updated weights for policy 0, policy_version 29420 (0.0008) +[2023-10-14 06:15:41,503][100936] Updated weights for policy 0, policy_version 29430 (0.0009) +[2023-10-14 06:15:41,872][100936] Updated weights for policy 0, policy_version 29440 (0.0009) +[2023-10-14 06:15:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 60260352. Throughput: 0: 1652.4, 1: 1669.0. Samples: 15071272. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:15:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:15:43,703][100917] Updated weights for policy 1, policy_version 29412 (0.0008) +[2023-10-14 06:15:44,069][100917] Updated weights for policy 1, policy_version 29422 (0.0007) +[2023-10-14 06:15:44,432][100917] Updated weights for policy 1, policy_version 29432 (0.0007) +[2023-10-14 06:15:45,891][100936] Updated weights for policy 0, policy_version 29450 (0.0009) +[2023-10-14 06:15:46,260][100936] Updated weights for policy 0, policy_version 29460 (0.0009) +[2023-10-14 06:15:46,629][100936] Updated weights for policy 0, policy_version 29470 (0.0010) +[2023-10-14 06:15:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 60325888. Throughput: 0: 1660.4, 1: 1671.8. Samples: 15091138. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:15:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:15:48,619][100917] Updated weights for policy 1, policy_version 29442 (0.0009) +[2023-10-14 06:15:48,993][100917] Updated weights for policy 1, policy_version 29452 (0.0009) +[2023-10-14 06:15:49,362][100917] Updated weights for policy 1, policy_version 29462 (0.0008) +[2023-10-14 06:15:49,728][100917] Updated weights for policy 1, policy_version 29472 (0.0008) +[2023-10-14 06:15:50,633][100936] Updated weights for policy 0, policy_version 29480 (0.0011) +[2023-10-14 06:15:51,000][100936] Updated weights for policy 0, policy_version 29490 (0.0011) +[2023-10-14 06:15:51,368][100936] Updated weights for policy 0, policy_version 29500 (0.0009) +[2023-10-14 06:15:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 60391424. Throughput: 0: 1666.1, 1: 1666.8. Samples: 15111396. Policy #0 lag: (min: 31.0, avg: 36.7, max: 63.0) +[2023-10-14 06:15:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:15:53,833][100917] Updated weights for policy 1, policy_version 29482 (0.0009) +[2023-10-14 06:15:54,202][100917] Updated weights for policy 1, policy_version 29492 (0.0010) +[2023-10-14 06:15:54,580][100917] Updated weights for policy 1, policy_version 29502 (0.0007) +[2023-10-14 06:15:55,624][100936] Updated weights for policy 0, policy_version 29510 (0.0012) +[2023-10-14 06:15:55,999][100936] Updated weights for policy 0, policy_version 29520 (0.0010) +[2023-10-14 06:15:56,368][100936] Updated weights for policy 0, policy_version 29530 (0.0007) +[2023-10-14 06:15:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 60456960. Throughput: 0: 1650.9, 1: 1667.5. Samples: 15120672. Policy #0 lag: (min: 31.0, avg: 36.7, max: 63.0) +[2023-10-14 06:15:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:15:58,557][100917] Updated weights for policy 1, policy_version 29512 (0.0007) +[2023-10-14 06:15:58,935][100917] Updated weights for policy 1, policy_version 29522 (0.0007) +[2023-10-14 06:15:59,318][100917] Updated weights for policy 1, policy_version 29532 (0.0007) +[2023-10-14 06:16:00,750][100936] Updated weights for policy 0, policy_version 29540 (0.0008) +[2023-10-14 06:16:01,121][100936] Updated weights for policy 0, policy_version 29550 (0.0009) +[2023-10-14 06:16:01,484][100936] Updated weights for policy 0, policy_version 29560 (0.0009) +[2023-10-14 06:16:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 60522496. Throughput: 0: 1660.8, 1: 1669.2. Samples: 15140644. Policy #0 lag: (min: 31.0, avg: 36.7, max: 63.0) +[2023-10-14 06:16:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:16:03,585][100917] Updated weights for policy 1, policy_version 29542 (0.0008) +[2023-10-14 06:16:03,954][100917] Updated weights for policy 1, policy_version 29552 (0.0008) +[2023-10-14 06:16:04,323][100917] Updated weights for policy 1, policy_version 29562 (0.0007) +[2023-10-14 06:16:05,731][100936] Updated weights for policy 0, policy_version 29570 (0.0009) +[2023-10-14 06:16:06,098][100936] Updated weights for policy 0, policy_version 29580 (0.0007) +[2023-10-14 06:16:06,476][100936] Updated weights for policy 0, policy_version 29590 (0.0009) +[2023-10-14 06:16:06,840][100936] Updated weights for policy 0, policy_version 29600 (0.0009) +[2023-10-14 06:16:08,403][100917] Updated weights for policy 1, policy_version 29572 (0.0008) +[2023-10-14 06:16:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 60588032. Throughput: 0: 1661.6, 1: 1663.4. Samples: 15160988. Policy #0 lag: (min: 31.0, avg: 36.7, max: 63.0) +[2023-10-14 06:16:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:16:08,777][100917] Updated weights for policy 1, policy_version 29582 (0.0011) +[2023-10-14 06:16:09,156][100917] Updated weights for policy 1, policy_version 29592 (0.0007) +[2023-10-14 06:16:10,863][100936] Updated weights for policy 0, policy_version 29610 (0.0010) +[2023-10-14 06:16:11,236][100936] Updated weights for policy 0, policy_version 29620 (0.0008) +[2023-10-14 06:16:11,605][100936] Updated weights for policy 0, policy_version 29630 (0.0007) +[2023-10-14 06:16:13,215][100917] Updated weights for policy 1, policy_version 29602 (0.0007) +[2023-10-14 06:16:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 60653568. Throughput: 0: 1655.6, 1: 1663.8. Samples: 15170344. Policy #0 lag: (min: 31.0, avg: 36.7, max: 63.0) +[2023-10-14 06:16:13,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:16:13,583][100917] Updated weights for policy 1, policy_version 29612 (0.0008) +[2023-10-14 06:16:13,959][100917] Updated weights for policy 1, policy_version 29622 (0.0010) +[2023-10-14 06:16:14,339][100917] Updated weights for policy 1, policy_version 29632 (0.0010) +[2023-10-14 06:16:15,695][100936] Updated weights for policy 0, policy_version 29640 (0.0010) +[2023-10-14 06:16:16,062][100936] Updated weights for policy 0, policy_version 29650 (0.0010) +[2023-10-14 06:16:16,442][100936] Updated weights for policy 0, policy_version 29660 (0.0008) +[2023-10-14 06:16:18,502][100917] Updated weights for policy 1, policy_version 29642 (0.0011) +[2023-10-14 06:16:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 60719104. Throughput: 0: 1660.8, 1: 1655.8. Samples: 15190450. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:16:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:16:18,871][100917] Updated weights for policy 1, policy_version 29652 (0.0008) +[2023-10-14 06:16:19,247][100917] Updated weights for policy 1, policy_version 29662 (0.0007) +[2023-10-14 06:16:20,283][100936] Updated weights for policy 0, policy_version 29670 (0.0008) +[2023-10-14 06:16:20,648][100936] Updated weights for policy 0, policy_version 29680 (0.0008) +[2023-10-14 06:16:21,013][100936] Updated weights for policy 0, policy_version 29690 (0.0010) +[2023-10-14 06:16:23,444][100917] Updated weights for policy 1, policy_version 29672 (0.0010) +[2023-10-14 06:16:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 60784640. Throughput: 0: 1665.7, 1: 1660.5. Samples: 15211432. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:16:23,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:16:23,825][100917] Updated weights for policy 1, policy_version 29682 (0.0008) +[2023-10-14 06:16:24,182][100917] Updated weights for policy 1, policy_version 29692 (0.0009) +[2023-10-14 06:16:24,999][100936] Updated weights for policy 0, policy_version 29700 (0.0008) +[2023-10-14 06:16:25,367][100936] Updated weights for policy 0, policy_version 29710 (0.0009) +[2023-10-14 06:16:25,740][100936] Updated weights for policy 0, policy_version 29720 (0.0010) +[2023-10-14 06:16:28,229][100917] Updated weights for policy 1, policy_version 29702 (0.0009) +[2023-10-14 06:16:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 60850176. Throughput: 0: 1654.9, 1: 1659.0. Samples: 15220400. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:16:28,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:16:28,616][100917] Updated weights for policy 1, policy_version 29712 (0.0010) +[2023-10-14 06:16:28,992][100917] Updated weights for policy 1, policy_version 29722 (0.0009) +[2023-10-14 06:16:29,987][100936] Updated weights for policy 0, policy_version 29730 (0.0007) +[2023-10-14 06:16:30,352][100936] Updated weights for policy 0, policy_version 29740 (0.0007) +[2023-10-14 06:16:30,727][100936] Updated weights for policy 0, policy_version 29750 (0.0007) +[2023-10-14 06:16:31,096][100936] Updated weights for policy 0, policy_version 29760 (0.0007) +[2023-10-14 06:16:32,836][100917] Updated weights for policy 1, policy_version 29732 (0.0008) +[2023-10-14 06:16:33,206][100917] Updated weights for policy 1, policy_version 29742 (0.0008) +[2023-10-14 06:16:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 60915712. Throughput: 0: 1669.9, 1: 1659.9. Samples: 15240980. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:16:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:16:33,568][100917] Updated weights for policy 1, policy_version 29752 (0.0008) +[2023-10-14 06:16:35,193][100936] Updated weights for policy 0, policy_version 29770 (0.0009) +[2023-10-14 06:16:35,572][100936] Updated weights for policy 0, policy_version 29780 (0.0009) +[2023-10-14 06:16:35,936][100936] Updated weights for policy 0, policy_version 29790 (0.0008) +[2023-10-14 06:16:37,766][100917] Updated weights for policy 1, policy_version 29762 (0.0009) +[2023-10-14 06:16:38,130][100917] Updated weights for policy 1, policy_version 29772 (0.0007) +[2023-10-14 06:16:38,502][100917] Updated weights for policy 1, policy_version 29782 (0.0010) +[2023-10-14 06:16:38,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 60981248. Throughput: 0: 1670.1, 1: 1653.6. Samples: 15260960. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:16:38,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:16:38,880][100917] Updated weights for policy 1, policy_version 29792 (0.0009) +[2023-10-14 06:16:40,080][100936] Updated weights for policy 0, policy_version 29800 (0.0011) +[2023-10-14 06:16:40,442][100936] Updated weights for policy 0, policy_version 29810 (0.0011) +[2023-10-14 06:16:40,808][100936] Updated weights for policy 0, policy_version 29820 (0.0011) +[2023-10-14 06:16:42,967][100917] Updated weights for policy 1, policy_version 29802 (0.0010) +[2023-10-14 06:16:43,334][100917] Updated weights for policy 1, policy_version 29812 (0.0008) +[2023-10-14 06:16:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 61046784. Throughput: 0: 1661.8, 1: 1663.6. Samples: 15270318. Policy #0 lag: (min: 1.0, avg: 5.7, max: 33.0) +[2023-10-14 06:16:43,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:16:43,713][100917] Updated weights for policy 1, policy_version 29822 (0.0009) +[2023-10-14 06:16:45,035][100936] Updated weights for policy 0, policy_version 29830 (0.0010) +[2023-10-14 06:16:45,413][100936] Updated weights for policy 0, policy_version 29840 (0.0008) +[2023-10-14 06:16:45,785][100936] Updated weights for policy 0, policy_version 29850 (0.0008) +[2023-10-14 06:16:47,722][100917] Updated weights for policy 1, policy_version 29832 (0.0009) +[2023-10-14 06:16:48,091][100917] Updated weights for policy 1, policy_version 29842 (0.0009) +[2023-10-14 06:16:48,461][100917] Updated weights for policy 1, policy_version 29852 (0.0008) +[2023-10-14 06:16:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 61112320. Throughput: 0: 1670.7, 1: 1660.4. Samples: 15290542. Policy #0 lag: (min: 1.0, avg: 5.7, max: 33.0) +[2023-10-14 06:16:48,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:16:50,097][100936] Updated weights for policy 0, policy_version 29860 (0.0008) +[2023-10-14 06:16:50,494][100936] Updated weights for policy 0, policy_version 29870 (0.0008) +[2023-10-14 06:16:50,868][100936] Updated weights for policy 0, policy_version 29880 (0.0009) +[2023-10-14 06:16:52,525][100917] Updated weights for policy 1, policy_version 29862 (0.0011) +[2023-10-14 06:16:52,900][100917] Updated weights for policy 1, policy_version 29872 (0.0008) +[2023-10-14 06:16:53,276][100917] Updated weights for policy 1, policy_version 29882 (0.0008) +[2023-10-14 06:16:53,512][99942] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 61210624. Throughput: 0: 1661.9, 1: 1655.5. Samples: 15310270. Policy #0 lag: (min: 1.0, avg: 5.7, max: 33.0) +[2023-10-14 06:16:53,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:16:54,860][100936] Updated weights for policy 0, policy_version 29890 (0.0010) +[2023-10-14 06:16:55,233][100936] Updated weights for policy 0, policy_version 29900 (0.0009) +[2023-10-14 06:16:55,600][100936] Updated weights for policy 0, policy_version 29910 (0.0011) +[2023-10-14 06:16:55,965][100936] Updated weights for policy 0, policy_version 29920 (0.0010) +[2023-10-14 06:16:57,636][100917] Updated weights for policy 1, policy_version 29892 (0.0011) +[2023-10-14 06:16:58,014][100917] Updated weights for policy 1, policy_version 29902 (0.0007) +[2023-10-14 06:16:58,385][100917] Updated weights for policy 1, policy_version 29912 (0.0008) +[2023-10-14 06:16:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 61243392. Throughput: 0: 1652.7, 1: 1671.4. Samples: 15319930. Policy #0 lag: (min: 1.0, avg: 5.7, max: 33.0) +[2023-10-14 06:16:58,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:17:00,116][100936] Updated weights for policy 0, policy_version 29930 (0.0008) +[2023-10-14 06:17:00,494][100936] Updated weights for policy 0, policy_version 29940 (0.0009) +[2023-10-14 06:17:00,861][100936] Updated weights for policy 0, policy_version 29950 (0.0007) +[2023-10-14 06:17:02,639][100917] Updated weights for policy 1, policy_version 29922 (0.0009) +[2023-10-14 06:17:03,022][100917] Updated weights for policy 1, policy_version 29932 (0.0011) +[2023-10-14 06:17:03,387][100917] Updated weights for policy 1, policy_version 29942 (0.0008) +[2023-10-14 06:17:03,512][99942] Fps is (10 sec: 9830.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 61308928. Throughput: 0: 1663.6, 1: 1672.4. Samples: 15340572. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 06:17:03,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:17:03,768][100917] Updated weights for policy 1, policy_version 29952 (0.0007) +[2023-10-14 06:17:05,040][100936] Updated weights for policy 0, policy_version 29960 (0.0011) +[2023-10-14 06:17:05,410][100936] Updated weights for policy 0, policy_version 29970 (0.0008) +[2023-10-14 06:17:05,787][100936] Updated weights for policy 0, policy_version 29980 (0.0008) +[2023-10-14 06:17:07,698][100917] Updated weights for policy 1, policy_version 29962 (0.0009) +[2023-10-14 06:17:08,075][100917] Updated weights for policy 1, policy_version 29972 (0.0009) +[2023-10-14 06:17:08,452][100917] Updated weights for policy 1, policy_version 29982 (0.0007) +[2023-10-14 06:17:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 61374464. Throughput: 0: 1649.8, 1: 1655.4. Samples: 15360164. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 06:17:08,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:17:09,865][100936] Updated weights for policy 0, policy_version 29990 (0.0009) +[2023-10-14 06:17:10,221][100936] Updated weights for policy 0, policy_version 30000 (0.0009) +[2023-10-14 06:17:10,594][100936] Updated weights for policy 0, policy_version 30010 (0.0010) +[2023-10-14 06:17:12,453][100917] Updated weights for policy 1, policy_version 29992 (0.0010) +[2023-10-14 06:17:12,824][100917] Updated weights for policy 1, policy_version 30002 (0.0008) +[2023-10-14 06:17:13,207][100917] Updated weights for policy 1, policy_version 30012 (0.0007) +[2023-10-14 06:17:13,512][99942] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 61472768. Throughput: 0: 1646.1, 1: 1673.7. Samples: 15369794. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 06:17:13,512][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:17:14,722][100936] Updated weights for policy 0, policy_version 30020 (0.0009) +[2023-10-14 06:17:15,102][100936] Updated weights for policy 0, policy_version 30030 (0.0007) +[2023-10-14 06:17:15,476][100936] Updated weights for policy 0, policy_version 30040 (0.0008) +[2023-10-14 06:17:17,293][100917] Updated weights for policy 1, policy_version 30022 (0.0007) +[2023-10-14 06:17:17,664][100917] Updated weights for policy 1, policy_version 30032 (0.0009) +[2023-10-14 06:17:18,035][100917] Updated weights for policy 1, policy_version 30042 (0.0010) +[2023-10-14 06:17:18,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 61538304. Throughput: 0: 1642.1, 1: 1671.0. Samples: 15390072. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 06:17:18,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:17:19,614][100936] Updated weights for policy 0, policy_version 30050 (0.0007) +[2023-10-14 06:17:19,984][100936] Updated weights for policy 0, policy_version 30060 (0.0007) +[2023-10-14 06:17:20,358][100936] Updated weights for policy 0, policy_version 30070 (0.0008) +[2023-10-14 06:17:20,724][100936] Updated weights for policy 0, policy_version 30080 (0.0008) +[2023-10-14 06:17:22,069][100917] Updated weights for policy 1, policy_version 30052 (0.0008) +[2023-10-14 06:17:22,451][100917] Updated weights for policy 1, policy_version 30062 (0.0008) +[2023-10-14 06:17:22,822][100917] Updated weights for policy 1, policy_version 30072 (0.0010) +[2023-10-14 06:17:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 61603840. Throughput: 0: 1652.7, 1: 1657.3. Samples: 15409912. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 06:17:23,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:17:24,806][100936] Updated weights for policy 0, policy_version 30090 (0.0011) +[2023-10-14 06:17:25,171][100936] Updated weights for policy 0, policy_version 30100 (0.0011) +[2023-10-14 06:17:25,546][100936] Updated weights for policy 0, policy_version 30110 (0.0009) +[2023-10-14 06:17:26,821][100917] Updated weights for policy 1, policy_version 30082 (0.0008) +[2023-10-14 06:17:27,195][100917] Updated weights for policy 1, policy_version 30092 (0.0010) +[2023-10-14 06:17:27,572][100917] Updated weights for policy 1, policy_version 30102 (0.0010) +[2023-10-14 06:17:27,940][100917] Updated weights for policy 1, policy_version 30112 (0.0009) +[2023-10-14 06:17:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 61669376. Throughput: 0: 1653.0, 1: 1671.2. Samples: 15419908. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) +[2023-10-14 06:17:28,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:17:29,659][100936] Updated weights for policy 0, policy_version 30120 (0.0009) +[2023-10-14 06:17:30,035][100936] Updated weights for policy 0, policy_version 30130 (0.0007) +[2023-10-14 06:17:30,410][100936] Updated weights for policy 0, policy_version 30140 (0.0008) +[2023-10-14 06:17:32,121][100917] Updated weights for policy 1, policy_version 30122 (0.0010) +[2023-10-14 06:17:32,490][100917] Updated weights for policy 1, policy_version 30132 (0.0008) +[2023-10-14 06:17:32,863][100917] Updated weights for policy 1, policy_version 30142 (0.0007) +[2023-10-14 06:17:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 61734912. Throughput: 0: 1658.1, 1: 1670.2. Samples: 15440316. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) +[2023-10-14 06:17:33,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:17:34,599][100936] Updated weights for policy 0, policy_version 30150 (0.0010) +[2023-10-14 06:17:34,977][100936] Updated weights for policy 0, policy_version 30160 (0.0010) +[2023-10-14 06:17:35,343][100936] Updated weights for policy 0, policy_version 30170 (0.0010) +[2023-10-14 06:17:37,178][100917] Updated weights for policy 1, policy_version 30152 (0.0007) +[2023-10-14 06:17:37,563][100917] Updated weights for policy 1, policy_version 30162 (0.0009) +[2023-10-14 06:17:37,926][100917] Updated weights for policy 1, policy_version 30172 (0.0009) +[2023-10-14 06:17:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 61800448. Throughput: 0: 1659.1, 1: 1660.0. Samples: 15459630. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) +[2023-10-14 06:17:38,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:17:38,525][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000030176_30900224.pth... +[2023-10-14 06:17:38,525][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000030176_30900224.pth... +[2023-10-14 06:17:38,562][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000028640_29327360.pth +[2023-10-14 06:17:38,562][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000028608_29294592.pth +[2023-10-14 06:17:39,568][100936] Updated weights for policy 0, policy_version 30180 (0.0010) +[2023-10-14 06:17:39,934][100936] Updated weights for policy 0, policy_version 30190 (0.0007) +[2023-10-14 06:17:40,310][100936] Updated weights for policy 0, policy_version 30200 (0.0008) +[2023-10-14 06:17:42,021][100917] Updated weights for policy 1, policy_version 30182 (0.0007) +[2023-10-14 06:17:42,411][100917] Updated weights for policy 1, policy_version 30192 (0.0009) +[2023-10-14 06:17:42,781][100917] Updated weights for policy 1, policy_version 30202 (0.0011) +[2023-10-14 06:17:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 61865984. Throughput: 0: 1655.4, 1: 1668.2. Samples: 15469492. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) +[2023-10-14 06:17:43,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:17:44,506][100936] Updated weights for policy 0, policy_version 30210 (0.0009) +[2023-10-14 06:17:44,875][100936] Updated weights for policy 0, policy_version 30220 (0.0007) +[2023-10-14 06:17:45,244][100936] Updated weights for policy 0, policy_version 30230 (0.0007) +[2023-10-14 06:17:45,611][100936] Updated weights for policy 0, policy_version 30240 (0.0007) +[2023-10-14 06:17:46,932][100917] Updated weights for policy 1, policy_version 30212 (0.0010) +[2023-10-14 06:17:47,309][100917] Updated weights for policy 1, policy_version 30222 (0.0010) +[2023-10-14 06:17:47,689][100917] Updated weights for policy 1, policy_version 30232 (0.0008) +[2023-10-14 06:17:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 61931520. Throughput: 0: 1650.4, 1: 1661.9. Samples: 15489624. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) +[2023-10-14 06:17:48,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:17:50,030][100936] Updated weights for policy 0, policy_version 30250 (0.0009) +[2023-10-14 06:17:50,407][100936] Updated weights for policy 0, policy_version 30260 (0.0008) +[2023-10-14 06:17:50,777][100936] Updated weights for policy 0, policy_version 30270 (0.0009) +[2023-10-14 06:17:51,794][100917] Updated weights for policy 1, policy_version 30242 (0.0008) +[2023-10-14 06:17:52,158][100917] Updated weights for policy 1, policy_version 30252 (0.0009) +[2023-10-14 06:17:52,533][100917] Updated weights for policy 1, policy_version 30262 (0.0008) +[2023-10-14 06:17:52,919][100917] Updated weights for policy 1, policy_version 30272 (0.0008) +[2023-10-14 06:17:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 61997056. Throughput: 0: 1652.0, 1: 1652.8. Samples: 15508884. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) +[2023-10-14 06:17:53,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:17:54,831][100936] Updated weights for policy 0, policy_version 30280 (0.0008) +[2023-10-14 06:17:55,201][100936] Updated weights for policy 0, policy_version 30290 (0.0007) +[2023-10-14 06:17:55,565][100936] Updated weights for policy 0, policy_version 30300 (0.0009) +[2023-10-14 06:17:57,047][100917] Updated weights for policy 1, policy_version 30282 (0.0009) +[2023-10-14 06:17:57,422][100917] Updated weights for policy 1, policy_version 30292 (0.0009) +[2023-10-14 06:17:57,797][100917] Updated weights for policy 1, policy_version 30302 (0.0007) +[2023-10-14 06:17:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 62062592. Throughput: 0: 1654.3, 1: 1666.9. Samples: 15519246. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) +[2023-10-14 06:17:58,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:17:59,562][100936] Updated weights for policy 0, policy_version 30310 (0.0009) +[2023-10-14 06:17:59,929][100936] Updated weights for policy 0, policy_version 30320 (0.0011) +[2023-10-14 06:18:00,299][100936] Updated weights for policy 0, policy_version 30330 (0.0010) +[2023-10-14 06:18:01,717][100917] Updated weights for policy 1, policy_version 30312 (0.0007) +[2023-10-14 06:18:02,087][100917] Updated weights for policy 1, policy_version 30322 (0.0007) +[2023-10-14 06:18:02,455][100917] Updated weights for policy 1, policy_version 30332 (0.0008) +[2023-10-14 06:18:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 62128128. Throughput: 0: 1662.7, 1: 1659.1. Samples: 15539550. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) +[2023-10-14 06:18:03,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:18:04,358][100936] Updated weights for policy 0, policy_version 30340 (0.0010) +[2023-10-14 06:18:04,733][100936] Updated weights for policy 0, policy_version 30350 (0.0010) +[2023-10-14 06:18:05,103][100936] Updated weights for policy 0, policy_version 30360 (0.0011) +[2023-10-14 06:18:06,567][100917] Updated weights for policy 1, policy_version 30342 (0.0009) +[2023-10-14 06:18:06,943][100917] Updated weights for policy 1, policy_version 30352 (0.0009) +[2023-10-14 06:18:07,305][100917] Updated weights for policy 1, policy_version 30362 (0.0009) +[2023-10-14 06:18:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 62193664. Throughput: 0: 1658.0, 1: 1666.1. Samples: 15559496. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) +[2023-10-14 06:18:08,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:18:09,055][100936] Updated weights for policy 0, policy_version 30370 (0.0009) +[2023-10-14 06:18:09,425][100936] Updated weights for policy 0, policy_version 30380 (0.0008) +[2023-10-14 06:18:09,793][100936] Updated weights for policy 0, policy_version 30390 (0.0011) +[2023-10-14 06:18:10,171][100936] Updated weights for policy 0, policy_version 30400 (0.0009) +[2023-10-14 06:18:11,264][100917] Updated weights for policy 1, policy_version 30372 (0.0009) +[2023-10-14 06:18:11,647][100917] Updated weights for policy 1, policy_version 30382 (0.0010) +[2023-10-14 06:18:12,018][100917] Updated weights for policy 1, policy_version 30392 (0.0007) +[2023-10-14 06:18:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 62259200. Throughput: 0: 1656.1, 1: 1676.6. Samples: 15569880. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) +[2023-10-14 06:18:13,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:18:14,387][100936] Updated weights for policy 0, policy_version 30410 (0.0007) +[2023-10-14 06:18:14,760][100936] Updated weights for policy 0, policy_version 30420 (0.0007) +[2023-10-14 06:18:15,121][100936] Updated weights for policy 0, policy_version 30430 (0.0009) +[2023-10-14 06:18:16,144][100917] Updated weights for policy 1, policy_version 30402 (0.0009) +[2023-10-14 06:18:16,514][100917] Updated weights for policy 1, policy_version 30412 (0.0010) +[2023-10-14 06:18:16,889][100917] Updated weights for policy 1, policy_version 30422 (0.0009) +[2023-10-14 06:18:17,255][100917] Updated weights for policy 1, policy_version 30432 (0.0007) +[2023-10-14 06:18:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 62324736. Throughput: 0: 1651.6, 1: 1657.4. Samples: 15589224. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 06:18:18,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:18:19,331][100936] Updated weights for policy 0, policy_version 30440 (0.0009) +[2023-10-14 06:18:19,700][100936] Updated weights for policy 0, policy_version 30450 (0.0011) +[2023-10-14 06:18:20,069][100936] Updated weights for policy 0, policy_version 30460 (0.0009) +[2023-10-14 06:18:21,356][100917] Updated weights for policy 1, policy_version 30442 (0.0008) +[2023-10-14 06:18:21,733][100917] Updated weights for policy 1, policy_version 30452 (0.0011) +[2023-10-14 06:18:22,102][100917] Updated weights for policy 1, policy_version 30462 (0.0007) +[2023-10-14 06:18:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 62390272. Throughput: 0: 1657.3, 1: 1672.7. Samples: 15609478. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 06:18:23,512][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:18:24,172][100936] Updated weights for policy 0, policy_version 30470 (0.0010) +[2023-10-14 06:18:24,538][100936] Updated weights for policy 0, policy_version 30480 (0.0008) +[2023-10-14 06:18:24,921][100936] Updated weights for policy 0, policy_version 30490 (0.0008) +[2023-10-14 06:18:26,230][100917] Updated weights for policy 1, policy_version 30472 (0.0010) +[2023-10-14 06:18:26,594][100917] Updated weights for policy 1, policy_version 30482 (0.0008) +[2023-10-14 06:18:26,961][100917] Updated weights for policy 1, policy_version 30492 (0.0008) +[2023-10-14 06:18:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 62455808. Throughput: 0: 1657.6, 1: 1676.7. Samples: 15619536. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 06:18:28,513][99942] Avg episode reward: [(0, '-0.110'), (1, '1.000')] +[2023-10-14 06:18:29,102][100936] Updated weights for policy 0, policy_version 30500 (0.0008) +[2023-10-14 06:18:29,478][100936] Updated weights for policy 0, policy_version 30510 (0.0008) +[2023-10-14 06:18:29,855][100936] Updated weights for policy 0, policy_version 30520 (0.0008) +[2023-10-14 06:18:31,063][100917] Updated weights for policy 1, policy_version 30502 (0.0008) +[2023-10-14 06:18:31,435][100917] Updated weights for policy 1, policy_version 30512 (0.0008) +[2023-10-14 06:18:31,824][100917] Updated weights for policy 1, policy_version 30522 (0.0009) +[2023-10-14 06:18:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 62521344. Throughput: 0: 1661.8, 1: 1660.2. Samples: 15639116. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 06:18:33,512][99942] Avg episode reward: [(0, '-0.520'), (1, '1.000')] +[2023-10-14 06:18:33,793][100936] Updated weights for policy 0, policy_version 30530 (0.0011) +[2023-10-14 06:18:34,173][100936] Updated weights for policy 0, policy_version 30540 (0.0009) +[2023-10-14 06:18:34,538][100936] Updated weights for policy 0, policy_version 30550 (0.0010) +[2023-10-14 06:18:34,906][100936] Updated weights for policy 0, policy_version 30560 (0.0009) +[2023-10-14 06:18:35,857][100917] Updated weights for policy 1, policy_version 30532 (0.0007) +[2023-10-14 06:18:36,237][100917] Updated weights for policy 1, policy_version 30542 (0.0008) +[2023-10-14 06:18:36,602][100917] Updated weights for policy 1, policy_version 30552 (0.0007) +[2023-10-14 06:18:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 62586880. Throughput: 0: 1664.8, 1: 1677.2. Samples: 15659276. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 06:18:38,513][99942] Avg episode reward: [(0, '-0.520'), (1, '1.000')] +[2023-10-14 06:18:39,307][100936] Updated weights for policy 0, policy_version 30570 (0.0007) +[2023-10-14 06:18:39,671][100936] Updated weights for policy 0, policy_version 30580 (0.0009) +[2023-10-14 06:18:40,040][100936] Updated weights for policy 0, policy_version 30590 (0.0011) +[2023-10-14 06:18:40,437][100917] Updated weights for policy 1, policy_version 30562 (0.0008) +[2023-10-14 06:18:40,816][100917] Updated weights for policy 1, policy_version 30572 (0.0007) +[2023-10-14 06:18:41,192][100917] Updated weights for policy 1, policy_version 30582 (0.0008) +[2023-10-14 06:18:41,560][100917] Updated weights for policy 1, policy_version 30592 (0.0009) +[2023-10-14 06:18:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 62652416. Throughput: 0: 1662.8, 1: 1665.0. Samples: 15668996. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:18:43,513][99942] Avg episode reward: [(0, '-0.520'), (1, '1.000')] +[2023-10-14 06:18:44,173][100936] Updated weights for policy 0, policy_version 30600 (0.0009) +[2023-10-14 06:18:44,556][100936] Updated weights for policy 0, policy_version 30610 (0.0008) +[2023-10-14 06:18:44,926][100936] Updated weights for policy 0, policy_version 30620 (0.0010) +[2023-10-14 06:18:45,662][100917] Updated weights for policy 1, policy_version 30602 (0.0008) +[2023-10-14 06:18:46,032][100917] Updated weights for policy 1, policy_version 30612 (0.0009) +[2023-10-14 06:18:46,404][100917] Updated weights for policy 1, policy_version 30622 (0.0008) +[2023-10-14 06:18:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 62717952. Throughput: 0: 1654.4, 1: 1658.7. Samples: 15688638. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:18:48,513][99942] Avg episode reward: [(0, '-0.520'), (1, '1.000')] +[2023-10-14 06:18:49,092][100936] Updated weights for policy 0, policy_version 30630 (0.0009) +[2023-10-14 06:18:49,463][100936] Updated weights for policy 0, policy_version 30640 (0.0012) +[2023-10-14 06:18:49,830][100936] Updated weights for policy 0, policy_version 30650 (0.0010) +[2023-10-14 06:18:50,740][100917] Updated weights for policy 1, policy_version 30632 (0.0008) +[2023-10-14 06:18:51,133][100917] Updated weights for policy 1, policy_version 30642 (0.0008) +[2023-10-14 06:18:51,497][100917] Updated weights for policy 1, policy_version 30652 (0.0011) +[2023-10-14 06:18:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 62783488. Throughput: 0: 1648.3, 1: 1669.4. Samples: 15708792. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:18:53,513][99942] Avg episode reward: [(0, '-0.520'), (1, '1.000')] +[2023-10-14 06:18:53,856][100936] Updated weights for policy 0, policy_version 30660 (0.0009) +[2023-10-14 06:18:54,230][100936] Updated weights for policy 0, policy_version 30670 (0.0007) +[2023-10-14 06:18:54,599][100936] Updated weights for policy 0, policy_version 30680 (0.0008) +[2023-10-14 06:18:55,752][100917] Updated weights for policy 1, policy_version 30662 (0.0007) +[2023-10-14 06:18:56,121][100917] Updated weights for policy 1, policy_version 30672 (0.0008) +[2023-10-14 06:18:56,493][100917] Updated weights for policy 1, policy_version 30682 (0.0009) +[2023-10-14 06:18:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 62849024. Throughput: 0: 1653.5, 1: 1654.0. Samples: 15718716. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:18:58,513][99942] Avg episode reward: [(0, '-0.510'), (1, '1.000')] +[2023-10-14 06:18:58,712][100936] Updated weights for policy 0, policy_version 30690 (0.0008) +[2023-10-14 06:18:59,077][100936] Updated weights for policy 0, policy_version 30700 (0.0010) +[2023-10-14 06:18:59,449][100936] Updated weights for policy 0, policy_version 30710 (0.0010) +[2023-10-14 06:18:59,811][100936] Updated weights for policy 0, policy_version 30720 (0.0009) +[2023-10-14 06:19:00,562][100917] Updated weights for policy 1, policy_version 30692 (0.0009) +[2023-10-14 06:19:00,926][100917] Updated weights for policy 1, policy_version 30702 (0.0009) +[2023-10-14 06:19:01,296][100917] Updated weights for policy 1, policy_version 30712 (0.0010) +[2023-10-14 06:19:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 62914560. Throughput: 0: 1655.0, 1: 1657.1. Samples: 15738266. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:19:03,513][99942] Avg episode reward: [(0, '-0.510'), (1, '1.000')] +[2023-10-14 06:19:03,922][100936] Updated weights for policy 0, policy_version 30730 (0.0008) +[2023-10-14 06:19:04,294][100936] Updated weights for policy 0, policy_version 30740 (0.0007) +[2023-10-14 06:19:04,663][100936] Updated weights for policy 0, policy_version 30750 (0.0009) +[2023-10-14 06:19:05,494][100917] Updated weights for policy 1, policy_version 30722 (0.0010) +[2023-10-14 06:19:05,866][100917] Updated weights for policy 1, policy_version 30732 (0.0009) +[2023-10-14 06:19:06,238][100917] Updated weights for policy 1, policy_version 30742 (0.0009) +[2023-10-14 06:19:06,607][100917] Updated weights for policy 1, policy_version 30752 (0.0008) +[2023-10-14 06:19:08,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 62980096. Throughput: 0: 1650.6, 1: 1660.3. Samples: 15758470. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:19:08,514][99942] Avg episode reward: [(0, '-0.510'), (1, '1.000')] +[2023-10-14 06:19:08,935][100936] Updated weights for policy 0, policy_version 30760 (0.0007) +[2023-10-14 06:19:09,303][100936] Updated weights for policy 0, policy_version 30770 (0.0007) +[2023-10-14 06:19:09,679][100936] Updated weights for policy 0, policy_version 30780 (0.0007) +[2023-10-14 06:19:10,696][100917] Updated weights for policy 1, policy_version 30762 (0.0009) +[2023-10-14 06:19:11,066][100917] Updated weights for policy 1, policy_version 30772 (0.0009) +[2023-10-14 06:19:11,446][100917] Updated weights for policy 1, policy_version 30782 (0.0009) +[2023-10-14 06:19:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 63045632. Throughput: 0: 1652.3, 1: 1648.7. Samples: 15768080. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:19:13,513][99942] Avg episode reward: [(0, '-0.510'), (1, '1.000')] +[2023-10-14 06:19:13,665][100936] Updated weights for policy 0, policy_version 30790 (0.0008) +[2023-10-14 06:19:14,036][100936] Updated weights for policy 0, policy_version 30800 (0.0007) +[2023-10-14 06:19:14,413][100936] Updated weights for policy 0, policy_version 30810 (0.0008) +[2023-10-14 06:19:15,443][100917] Updated weights for policy 1, policy_version 30792 (0.0011) +[2023-10-14 06:19:15,816][100917] Updated weights for policy 1, policy_version 30802 (0.0009) +[2023-10-14 06:19:16,188][100917] Updated weights for policy 1, policy_version 30812 (0.0010) +[2023-10-14 06:19:18,476][100936] Updated weights for policy 0, policy_version 30820 (0.0007) +[2023-10-14 06:19:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 63111168. Throughput: 0: 1653.5, 1: 1657.3. Samples: 15788102. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:19:18,513][99942] Avg episode reward: [(0, '-0.510'), (1, '1.000')] +[2023-10-14 06:19:18,848][100936] Updated weights for policy 0, policy_version 30830 (0.0007) +[2023-10-14 06:19:19,216][100936] Updated weights for policy 0, policy_version 30840 (0.0007) +[2023-10-14 06:19:20,260][100917] Updated weights for policy 1, policy_version 30822 (0.0008) +[2023-10-14 06:19:20,631][100917] Updated weights for policy 1, policy_version 30832 (0.0007) +[2023-10-14 06:19:21,001][100917] Updated weights for policy 1, policy_version 30842 (0.0007) +[2023-10-14 06:19:23,355][100936] Updated weights for policy 0, policy_version 30850 (0.0007) +[2023-10-14 06:19:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 63176704. Throughput: 0: 1654.2, 1: 1662.0. Samples: 15808502. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:19:23,513][99942] Avg episode reward: [(0, '-0.510'), (1, '1.000')] +[2023-10-14 06:19:23,739][100936] Updated weights for policy 0, policy_version 30860 (0.0008) +[2023-10-14 06:19:24,108][100936] Updated weights for policy 0, policy_version 30870 (0.0010) +[2023-10-14 06:19:24,473][100936] Updated weights for policy 0, policy_version 30880 (0.0008) +[2023-10-14 06:19:25,170][100917] Updated weights for policy 1, policy_version 30852 (0.0011) +[2023-10-14 06:19:25,550][100917] Updated weights for policy 1, policy_version 30862 (0.0010) +[2023-10-14 06:19:25,917][100917] Updated weights for policy 1, policy_version 30872 (0.0008) +[2023-10-14 06:19:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 63242240. Throughput: 0: 1660.9, 1: 1650.4. Samples: 15818004. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:19:28,513][99942] Avg episode reward: [(0, '-0.510'), (1, '1.000')] +[2023-10-14 06:19:28,694][100936] Updated weights for policy 0, policy_version 30890 (0.0007) +[2023-10-14 06:19:29,058][100936] Updated weights for policy 0, policy_version 30900 (0.0008) +[2023-10-14 06:19:29,437][100936] Updated weights for policy 0, policy_version 30910 (0.0008) +[2023-10-14 06:19:29,984][100917] Updated weights for policy 1, policy_version 30882 (0.0007) +[2023-10-14 06:19:30,366][100917] Updated weights for policy 1, policy_version 30892 (0.0010) +[2023-10-14 06:19:30,725][100917] Updated weights for policy 1, policy_version 30902 (0.0009) +[2023-10-14 06:19:31,102][100917] Updated weights for policy 1, policy_version 30912 (0.0009) +[2023-10-14 06:19:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 63307776. Throughput: 0: 1664.7, 1: 1659.2. Samples: 15838214. Policy #0 lag: (min: 26.0, avg: 31.4, max: 58.0) +[2023-10-14 06:19:33,513][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:19:33,524][100936] Updated weights for policy 0, policy_version 30920 (0.0007) +[2023-10-14 06:19:33,901][100936] Updated weights for policy 0, policy_version 30930 (0.0007) +[2023-10-14 06:19:34,269][100936] Updated weights for policy 0, policy_version 30940 (0.0008) +[2023-10-14 06:19:35,267][100917] Updated weights for policy 1, policy_version 30922 (0.0008) +[2023-10-14 06:19:35,650][100917] Updated weights for policy 1, policy_version 30932 (0.0008) +[2023-10-14 06:19:36,027][100917] Updated weights for policy 1, policy_version 30942 (0.0009) +[2023-10-14 06:19:38,327][100936] Updated weights for policy 0, policy_version 30950 (0.0009) +[2023-10-14 06:19:38,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 63373312. Throughput: 0: 1660.0, 1: 1656.3. Samples: 15858026. Policy #0 lag: (min: 26.0, avg: 31.4, max: 58.0) +[2023-10-14 06:19:38,512][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:19:38,520][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000030944_31686656.pth... +[2023-10-14 06:19:38,560][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000029376_30081024.pth +[2023-10-14 06:19:38,566][100681] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p1/milestones/checkpoint_000030944_31686656.pth +[2023-10-14 06:19:38,702][100936] Updated weights for policy 0, policy_version 30960 (0.0009) +[2023-10-14 06:19:39,070][100936] Updated weights for policy 0, policy_version 30970 (0.0009) +[2023-10-14 06:19:39,291][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000030976_31719424.pth... +[2023-10-14 06:19:39,330][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000029408_30113792.pth +[2023-10-14 06:19:39,335][100560] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p0/milestones/checkpoint_000030976_31719424.pth +[2023-10-14 06:19:40,278][100917] Updated weights for policy 1, policy_version 30952 (0.0007) +[2023-10-14 06:19:40,644][100917] Updated weights for policy 1, policy_version 30962 (0.0007) +[2023-10-14 06:19:41,021][100917] Updated weights for policy 1, policy_version 30972 (0.0007) +[2023-10-14 06:19:43,137][100936] Updated weights for policy 0, policy_version 30980 (0.0008) +[2023-10-14 06:19:43,507][100936] Updated weights for policy 0, policy_version 30990 (0.0009) +[2023-10-14 06:19:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 63438848. Throughput: 0: 1663.3, 1: 1645.2. Samples: 15867600. Policy #0 lag: (min: 26.0, avg: 31.4, max: 58.0) +[2023-10-14 06:19:43,513][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:19:43,869][100936] Updated weights for policy 0, policy_version 31000 (0.0009) +[2023-10-14 06:19:45,189][100917] Updated weights for policy 1, policy_version 30982 (0.0010) +[2023-10-14 06:19:45,569][100917] Updated weights for policy 1, policy_version 30992 (0.0008) +[2023-10-14 06:19:45,949][100917] Updated weights for policy 1, policy_version 31002 (0.0008) +[2023-10-14 06:19:48,052][100936] Updated weights for policy 0, policy_version 31010 (0.0007) +[2023-10-14 06:19:48,419][100936] Updated weights for policy 0, policy_version 31020 (0.0009) +[2023-10-14 06:19:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 63504384. Throughput: 0: 1666.2, 1: 1655.7. Samples: 15887754. Policy #0 lag: (min: 26.0, avg: 31.4, max: 58.0) +[2023-10-14 06:19:48,513][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:19:48,784][100936] Updated weights for policy 0, policy_version 31030 (0.0010) +[2023-10-14 06:19:49,151][100936] Updated weights for policy 0, policy_version 31040 (0.0010) +[2023-10-14 06:19:50,116][100917] Updated weights for policy 1, policy_version 31012 (0.0008) +[2023-10-14 06:19:50,487][100917] Updated weights for policy 1, policy_version 31022 (0.0011) +[2023-10-14 06:19:50,861][100917] Updated weights for policy 1, policy_version 31032 (0.0011) +[2023-10-14 06:19:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 63569920. Throughput: 0: 1660.4, 1: 1655.6. Samples: 15907688. Policy #0 lag: (min: 26.0, avg: 31.4, max: 58.0) +[2023-10-14 06:19:53,513][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:19:53,518][100936] Updated weights for policy 0, policy_version 31050 (0.0010) +[2023-10-14 06:19:53,896][100936] Updated weights for policy 0, policy_version 31060 (0.0012) +[2023-10-14 06:19:54,261][100936] Updated weights for policy 0, policy_version 31070 (0.0009) +[2023-10-14 06:19:54,932][100917] Updated weights for policy 1, policy_version 31042 (0.0008) +[2023-10-14 06:19:55,294][100917] Updated weights for policy 1, policy_version 31052 (0.0009) +[2023-10-14 06:19:55,665][100917] Updated weights for policy 1, policy_version 31062 (0.0009) +[2023-10-14 06:19:56,036][100917] Updated weights for policy 1, policy_version 31072 (0.0009) +[2023-10-14 06:19:58,483][100936] Updated weights for policy 0, policy_version 31080 (0.0008) +[2023-10-14 06:19:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 63635456. Throughput: 0: 1665.3, 1: 1644.1. Samples: 15917002. Policy #0 lag: (min: 0.0, avg: 24.2, max: 32.0) +[2023-10-14 06:19:58,512][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:19:58,854][100936] Updated weights for policy 0, policy_version 31090 (0.0008) +[2023-10-14 06:19:59,228][100936] Updated weights for policy 0, policy_version 31100 (0.0009) +[2023-10-14 06:20:00,299][100917] Updated weights for policy 1, policy_version 31082 (0.0008) +[2023-10-14 06:20:00,670][100917] Updated weights for policy 1, policy_version 31092 (0.0009) +[2023-10-14 06:20:01,045][100917] Updated weights for policy 1, policy_version 31102 (0.0008) +[2023-10-14 06:20:03,334][100936] Updated weights for policy 0, policy_version 31110 (0.0010) +[2023-10-14 06:20:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 63700992. Throughput: 0: 1661.0, 1: 1651.5. Samples: 15937164. Policy #0 lag: (min: 0.0, avg: 24.2, max: 32.0) +[2023-10-14 06:20:03,513][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:20:03,703][100936] Updated weights for policy 0, policy_version 31120 (0.0011) +[2023-10-14 06:20:04,080][100936] Updated weights for policy 0, policy_version 31130 (0.0009) +[2023-10-14 06:20:05,160][100917] Updated weights for policy 1, policy_version 31112 (0.0007) +[2023-10-14 06:20:05,524][100917] Updated weights for policy 1, policy_version 31122 (0.0008) +[2023-10-14 06:20:05,908][100917] Updated weights for policy 1, policy_version 31132 (0.0010) +[2023-10-14 06:20:08,322][100936] Updated weights for policy 0, policy_version 31140 (0.0008) +[2023-10-14 06:20:08,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 63766528. Throughput: 0: 1651.9, 1: 1653.1. Samples: 15957228. Policy #0 lag: (min: 0.0, avg: 24.2, max: 32.0) +[2023-10-14 06:20:08,513][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:20:08,684][100936] Updated weights for policy 0, policy_version 31150 (0.0007) +[2023-10-14 06:20:09,058][100936] Updated weights for policy 0, policy_version 31160 (0.0008) +[2023-10-14 06:20:09,872][100917] Updated weights for policy 1, policy_version 31142 (0.0008) +[2023-10-14 06:20:10,234][100917] Updated weights for policy 1, policy_version 31152 (0.0008) +[2023-10-14 06:20:10,611][100917] Updated weights for policy 1, policy_version 31162 (0.0011) +[2023-10-14 06:20:13,117][100936] Updated weights for policy 0, policy_version 31170 (0.0007) +[2023-10-14 06:20:13,496][100936] Updated weights for policy 0, policy_version 31180 (0.0007) +[2023-10-14 06:20:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 63832064. Throughput: 0: 1655.1, 1: 1653.0. Samples: 15966868. Policy #0 lag: (min: 0.0, avg: 24.2, max: 32.0) +[2023-10-14 06:20:13,512][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:20:13,855][100936] Updated weights for policy 0, policy_version 31190 (0.0008) +[2023-10-14 06:20:14,228][100936] Updated weights for policy 0, policy_version 31200 (0.0012) +[2023-10-14 06:20:14,741][100917] Updated weights for policy 1, policy_version 31172 (0.0009) +[2023-10-14 06:20:15,109][100917] Updated weights for policy 1, policy_version 31182 (0.0010) +[2023-10-14 06:20:15,480][100917] Updated weights for policy 1, policy_version 31192 (0.0008) +[2023-10-14 06:20:18,292][100936] Updated weights for policy 0, policy_version 31210 (0.0009) +[2023-10-14 06:20:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 63897600. Throughput: 0: 1654.3, 1: 1657.7. Samples: 15987254. Policy #0 lag: (min: 0.0, avg: 24.2, max: 32.0) +[2023-10-14 06:20:18,513][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:20:18,659][100936] Updated weights for policy 0, policy_version 31220 (0.0009) +[2023-10-14 06:20:19,032][100936] Updated weights for policy 0, policy_version 31230 (0.0008) +[2023-10-14 06:20:19,731][100917] Updated weights for policy 1, policy_version 31202 (0.0009) +[2023-10-14 06:20:20,141][100917] Updated weights for policy 1, policy_version 31212 (0.0007) +[2023-10-14 06:20:20,501][100917] Updated weights for policy 1, policy_version 31222 (0.0008) +[2023-10-14 06:20:20,880][100917] Updated weights for policy 1, policy_version 31232 (0.0008) +[2023-10-14 06:20:23,138][100936] Updated weights for policy 0, policy_version 31240 (0.0008) +[2023-10-14 06:20:23,509][100936] Updated weights for policy 0, policy_version 31250 (0.0007) +[2023-10-14 06:20:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 63963136. Throughput: 0: 1645.5, 1: 1662.3. Samples: 16006878. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 06:20:23,513][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:20:23,884][100936] Updated weights for policy 0, policy_version 31260 (0.0008) +[2023-10-14 06:20:24,879][100917] Updated weights for policy 1, policy_version 31242 (0.0010) +[2023-10-14 06:20:25,259][100917] Updated weights for policy 1, policy_version 31252 (0.0008) +[2023-10-14 06:20:25,624][100917] Updated weights for policy 1, policy_version 31262 (0.0008) +[2023-10-14 06:20:28,099][100936] Updated weights for policy 0, policy_version 31270 (0.0007) +[2023-10-14 06:20:28,474][100936] Updated weights for policy 0, policy_version 31280 (0.0008) +[2023-10-14 06:20:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 64028672. Throughput: 0: 1654.0, 1: 1653.3. Samples: 16016426. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 06:20:28,513][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:20:28,838][100936] Updated weights for policy 0, policy_version 31290 (0.0007) +[2023-10-14 06:20:30,050][100917] Updated weights for policy 1, policy_version 31272 (0.0008) +[2023-10-14 06:20:30,417][100917] Updated weights for policy 1, policy_version 31282 (0.0007) +[2023-10-14 06:20:30,783][100917] Updated weights for policy 1, policy_version 31292 (0.0009) +[2023-10-14 06:20:32,838][100936] Updated weights for policy 0, policy_version 31300 (0.0008) +[2023-10-14 06:20:33,211][100936] Updated weights for policy 0, policy_version 31310 (0.0008) +[2023-10-14 06:20:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 64094208. Throughput: 0: 1654.7, 1: 1654.5. Samples: 16036670. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 06:20:33,513][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:20:33,582][100936] Updated weights for policy 0, policy_version 31320 (0.0009) +[2023-10-14 06:20:34,798][100917] Updated weights for policy 1, policy_version 31302 (0.0009) +[2023-10-14 06:20:35,173][100917] Updated weights for policy 1, policy_version 31312 (0.0010) +[2023-10-14 06:20:35,545][100917] Updated weights for policy 1, policy_version 31322 (0.0008) +[2023-10-14 06:20:37,725][100936] Updated weights for policy 0, policy_version 31330 (0.0009) +[2023-10-14 06:20:38,091][100936] Updated weights for policy 0, policy_version 31340 (0.0010) +[2023-10-14 06:20:38,460][100936] Updated weights for policy 0, policy_version 31350 (0.0011) +[2023-10-14 06:20:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 64159744. Throughput: 0: 1646.3, 1: 1655.8. Samples: 16056280. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 06:20:38,513][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:20:38,833][100936] Updated weights for policy 0, policy_version 31360 (0.0008) +[2023-10-14 06:20:39,563][100917] Updated weights for policy 1, policy_version 31332 (0.0008) +[2023-10-14 06:20:39,929][100917] Updated weights for policy 1, policy_version 31342 (0.0009) +[2023-10-14 06:20:40,315][100917] Updated weights for policy 1, policy_version 31352 (0.0009) +[2023-10-14 06:20:43,060][100936] Updated weights for policy 0, policy_version 31370 (0.0008) +[2023-10-14 06:20:43,429][100936] Updated weights for policy 0, policy_version 31380 (0.0010) +[2023-10-14 06:20:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 64225280. Throughput: 0: 1656.6, 1: 1651.9. Samples: 16065884. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 06:20:43,512][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:20:43,806][100936] Updated weights for policy 0, policy_version 31390 (0.0010) +[2023-10-14 06:20:44,537][100917] Updated weights for policy 1, policy_version 31362 (0.0007) +[2023-10-14 06:20:44,909][100917] Updated weights for policy 1, policy_version 31372 (0.0008) +[2023-10-14 06:20:45,280][100917] Updated weights for policy 1, policy_version 31382 (0.0007) +[2023-10-14 06:20:45,657][100917] Updated weights for policy 1, policy_version 31392 (0.0007) +[2023-10-14 06:20:47,955][100936] Updated weights for policy 0, policy_version 31400 (0.0011) +[2023-10-14 06:20:48,329][100936] Updated weights for policy 0, policy_version 31410 (0.0009) +[2023-10-14 06:20:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 64290816. Throughput: 0: 1652.0, 1: 1656.0. Samples: 16086026. Policy #0 lag: (min: 18.0, avg: 19.7, max: 46.0) +[2023-10-14 06:20:48,513][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:20:48,700][100936] Updated weights for policy 0, policy_version 31420 (0.0007) +[2023-10-14 06:20:49,746][100917] Updated weights for policy 1, policy_version 31402 (0.0008) +[2023-10-14 06:20:50,121][100917] Updated weights for policy 1, policy_version 31412 (0.0009) +[2023-10-14 06:20:50,501][100917] Updated weights for policy 1, policy_version 31422 (0.0010) +[2023-10-14 06:20:52,710][100936] Updated weights for policy 0, policy_version 31430 (0.0009) +[2023-10-14 06:20:53,088][100936] Updated weights for policy 0, policy_version 31440 (0.0007) +[2023-10-14 06:20:53,455][100936] Updated weights for policy 0, policy_version 31450 (0.0007) +[2023-10-14 06:20:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 64356352. Throughput: 0: 1638.9, 1: 1654.3. Samples: 16105424. Policy #0 lag: (min: 18.0, avg: 19.7, max: 46.0) +[2023-10-14 06:20:53,512][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:20:54,634][100917] Updated weights for policy 1, policy_version 31432 (0.0009) +[2023-10-14 06:20:55,013][100917] Updated weights for policy 1, policy_version 31442 (0.0007) +[2023-10-14 06:20:55,392][100917] Updated weights for policy 1, policy_version 31452 (0.0008) +[2023-10-14 06:20:57,745][100936] Updated weights for policy 0, policy_version 31460 (0.0008) +[2023-10-14 06:20:58,113][100936] Updated weights for policy 0, policy_version 31470 (0.0007) +[2023-10-14 06:20:58,477][100936] Updated weights for policy 0, policy_version 31480 (0.0009) +[2023-10-14 06:20:58,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 64421888. Throughput: 0: 1650.5, 1: 1650.9. Samples: 16115430. Policy #0 lag: (min: 18.0, avg: 19.7, max: 46.0) +[2023-10-14 06:20:58,512][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:20:59,249][100917] Updated weights for policy 1, policy_version 31462 (0.0007) +[2023-10-14 06:20:59,609][100917] Updated weights for policy 1, policy_version 31472 (0.0008) +[2023-10-14 06:20:59,982][100917] Updated weights for policy 1, policy_version 31482 (0.0009) +[2023-10-14 06:21:02,692][100936] Updated weights for policy 0, policy_version 31490 (0.0009) +[2023-10-14 06:21:03,066][100936] Updated weights for policy 0, policy_version 31500 (0.0010) +[2023-10-14 06:21:03,436][100936] Updated weights for policy 0, policy_version 31510 (0.0009) +[2023-10-14 06:21:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 64487424. Throughput: 0: 1651.8, 1: 1655.4. Samples: 16136078. Policy #0 lag: (min: 18.0, avg: 19.7, max: 46.0) +[2023-10-14 06:21:03,513][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:21:03,805][100936] Updated weights for policy 0, policy_version 31520 (0.0009) +[2023-10-14 06:21:04,084][100917] Updated weights for policy 1, policy_version 31492 (0.0010) +[2023-10-14 06:21:04,469][100917] Updated weights for policy 1, policy_version 31502 (0.0007) +[2023-10-14 06:21:04,835][100917] Updated weights for policy 1, policy_version 31512 (0.0008) +[2023-10-14 06:21:07,908][100936] Updated weights for policy 0, policy_version 31530 (0.0007) +[2023-10-14 06:21:08,275][100936] Updated weights for policy 0, policy_version 31540 (0.0007) +[2023-10-14 06:21:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 64552960. Throughput: 0: 1646.9, 1: 1664.8. Samples: 16155904. Policy #0 lag: (min: 18.0, avg: 19.7, max: 46.0) +[2023-10-14 06:21:08,513][99942] Avg episode reward: [(0, '0.590'), (1, '1.000')] +[2023-10-14 06:21:08,648][100936] Updated weights for policy 0, policy_version 31550 (0.0007) +[2023-10-14 06:21:09,003][100917] Updated weights for policy 1, policy_version 31522 (0.0009) +[2023-10-14 06:21:09,373][100917] Updated weights for policy 1, policy_version 31532 (0.0010) +[2023-10-14 06:21:09,744][100917] Updated weights for policy 1, policy_version 31542 (0.0010) +[2023-10-14 06:21:10,123][100917] Updated weights for policy 1, policy_version 31552 (0.0007) +[2023-10-14 06:21:12,826][100936] Updated weights for policy 0, policy_version 31560 (0.0007) +[2023-10-14 06:21:13,194][100936] Updated weights for policy 0, policy_version 31570 (0.0007) +[2023-10-14 06:21:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 64618496. Throughput: 0: 1652.2, 1: 1662.1. Samples: 16165568. Policy #0 lag: (min: 14.0, avg: 22.4, max: 46.0) +[2023-10-14 06:21:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:21:13,556][100936] Updated weights for policy 0, policy_version 31580 (0.0009) +[2023-10-14 06:21:14,345][100917] Updated weights for policy 1, policy_version 31562 (0.0008) +[2023-10-14 06:21:14,718][100917] Updated weights for policy 1, policy_version 31572 (0.0009) +[2023-10-14 06:21:15,096][100917] Updated weights for policy 1, policy_version 31582 (0.0008) +[2023-10-14 06:21:17,610][100936] Updated weights for policy 0, policy_version 31590 (0.0009) +[2023-10-14 06:21:17,979][100936] Updated weights for policy 0, policy_version 31600 (0.0009) +[2023-10-14 06:21:18,357][100936] Updated weights for policy 0, policy_version 31610 (0.0008) +[2023-10-14 06:21:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 64684032. Throughput: 0: 1647.6, 1: 1662.1. Samples: 16185604. Policy #0 lag: (min: 14.0, avg: 22.4, max: 46.0) +[2023-10-14 06:21:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:21:19,178][100917] Updated weights for policy 1, policy_version 31592 (0.0008) +[2023-10-14 06:21:19,552][100917] Updated weights for policy 1, policy_version 31602 (0.0009) +[2023-10-14 06:21:19,920][100917] Updated weights for policy 1, policy_version 31612 (0.0009) +[2023-10-14 06:21:22,450][100936] Updated weights for policy 0, policy_version 31620 (0.0009) +[2023-10-14 06:21:22,819][100936] Updated weights for policy 0, policy_version 31630 (0.0008) +[2023-10-14 06:21:23,186][100936] Updated weights for policy 0, policy_version 31640 (0.0007) +[2023-10-14 06:21:23,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 64782336. Throughput: 0: 1644.1, 1: 1662.8. Samples: 16205090. Policy #0 lag: (min: 14.0, avg: 22.4, max: 46.0) +[2023-10-14 06:21:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:21:23,847][100917] Updated weights for policy 1, policy_version 31622 (0.0010) +[2023-10-14 06:21:24,221][100917] Updated weights for policy 1, policy_version 31632 (0.0007) +[2023-10-14 06:21:24,589][100917] Updated weights for policy 1, policy_version 31642 (0.0008) +[2023-10-14 06:21:27,179][100936] Updated weights for policy 0, policy_version 31650 (0.0008) +[2023-10-14 06:21:27,569][100936] Updated weights for policy 0, policy_version 31660 (0.0007) +[2023-10-14 06:21:27,935][100936] Updated weights for policy 0, policy_version 31670 (0.0008) +[2023-10-14 06:21:28,303][100936] Updated weights for policy 0, policy_version 31680 (0.0008) +[2023-10-14 06:21:28,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 64847872. Throughput: 0: 1654.7, 1: 1663.9. Samples: 16215220. Policy #0 lag: (min: 14.0, avg: 22.4, max: 46.0) +[2023-10-14 06:21:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:21:28,703][100917] Updated weights for policy 1, policy_version 31652 (0.0009) +[2023-10-14 06:21:29,074][100917] Updated weights for policy 1, policy_version 31662 (0.0007) +[2023-10-14 06:21:29,445][100917] Updated weights for policy 1, policy_version 31672 (0.0008) +[2023-10-14 06:21:32,407][100936] Updated weights for policy 0, policy_version 31690 (0.0007) +[2023-10-14 06:21:32,789][100936] Updated weights for policy 0, policy_version 31700 (0.0009) +[2023-10-14 06:21:33,153][100936] Updated weights for policy 0, policy_version 31710 (0.0008) +[2023-10-14 06:21:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 64913408. Throughput: 0: 1653.5, 1: 1665.7. Samples: 16235388. Policy #0 lag: (min: 31.0, avg: 32.2, max: 55.0) +[2023-10-14 06:21:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:21:33,528][100917] Updated weights for policy 1, policy_version 31682 (0.0010) +[2023-10-14 06:21:33,893][100917] Updated weights for policy 1, policy_version 31692 (0.0008) +[2023-10-14 06:21:34,273][100917] Updated weights for policy 1, policy_version 31702 (0.0009) +[2023-10-14 06:21:34,652][100917] Updated weights for policy 1, policy_version 31712 (0.0010) +[2023-10-14 06:21:37,223][100936] Updated weights for policy 0, policy_version 31720 (0.0009) +[2023-10-14 06:21:37,586][100936] Updated weights for policy 0, policy_version 31730 (0.0010) +[2023-10-14 06:21:37,957][100936] Updated weights for policy 0, policy_version 31740 (0.0007) +[2023-10-14 06:21:38,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 64978944. Throughput: 0: 1654.8, 1: 1667.1. Samples: 16254914. Policy #0 lag: (min: 31.0, avg: 32.2, max: 55.0) +[2023-10-14 06:21:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:21:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000031744_32505856.pth... +[2023-10-14 06:21:38,562][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000030176_30900224.pth +[2023-10-14 06:21:38,822][100917] Updated weights for policy 1, policy_version 31722 (0.0007) +[2023-10-14 06:21:39,192][100917] Updated weights for policy 1, policy_version 31732 (0.0007) +[2023-10-14 06:21:39,573][100917] Updated weights for policy 1, policy_version 31742 (0.0008) +[2023-10-14 06:21:39,636][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000031744_32505856.pth... +[2023-10-14 06:21:39,665][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000030176_30900224.pth +[2023-10-14 06:21:42,167][100936] Updated weights for policy 0, policy_version 31750 (0.0007) +[2023-10-14 06:21:42,528][100936] Updated weights for policy 0, policy_version 31760 (0.0007) +[2023-10-14 06:21:42,897][100936] Updated weights for policy 0, policy_version 31770 (0.0007) +[2023-10-14 06:21:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 65044480. Throughput: 0: 1662.8, 1: 1666.0. Samples: 16265222. Policy #0 lag: (min: 31.0, avg: 32.2, max: 55.0) +[2023-10-14 06:21:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:21:43,790][100917] Updated weights for policy 1, policy_version 31752 (0.0009) +[2023-10-14 06:21:44,168][100917] Updated weights for policy 1, policy_version 31762 (0.0011) +[2023-10-14 06:21:44,541][100917] Updated weights for policy 1, policy_version 31772 (0.0007) +[2023-10-14 06:21:47,179][100936] Updated weights for policy 0, policy_version 31780 (0.0008) +[2023-10-14 06:21:47,556][100936] Updated weights for policy 0, policy_version 31790 (0.0009) +[2023-10-14 06:21:47,930][100936] Updated weights for policy 0, policy_version 31800 (0.0010) +[2023-10-14 06:21:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 65110016. Throughput: 0: 1650.5, 1: 1661.6. Samples: 16285120. Policy #0 lag: (min: 31.0, avg: 32.2, max: 55.0) +[2023-10-14 06:21:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:21:48,783][100917] Updated weights for policy 1, policy_version 31782 (0.0007) +[2023-10-14 06:21:49,148][100917] Updated weights for policy 1, policy_version 31792 (0.0007) +[2023-10-14 06:21:49,518][100917] Updated weights for policy 1, policy_version 31802 (0.0010) +[2023-10-14 06:21:52,068][100936] Updated weights for policy 0, policy_version 31810 (0.0009) +[2023-10-14 06:21:52,426][100936] Updated weights for policy 0, policy_version 31820 (0.0009) +[2023-10-14 06:21:52,795][100936] Updated weights for policy 0, policy_version 31830 (0.0008) +[2023-10-14 06:21:53,160][100936] Updated weights for policy 0, policy_version 31840 (0.0008) +[2023-10-14 06:21:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 65175552. Throughput: 0: 1649.7, 1: 1654.2. Samples: 16304580. Policy #0 lag: (min: 31.0, avg: 32.2, max: 55.0) +[2023-10-14 06:21:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:21:53,873][100917] Updated weights for policy 1, policy_version 31812 (0.0008) +[2023-10-14 06:21:54,270][100917] Updated weights for policy 1, policy_version 31822 (0.0008) +[2023-10-14 06:21:54,643][100917] Updated weights for policy 1, policy_version 31832 (0.0010) +[2023-10-14 06:21:57,244][100936] Updated weights for policy 0, policy_version 31850 (0.0007) +[2023-10-14 06:21:57,619][100936] Updated weights for policy 0, policy_version 31860 (0.0009) +[2023-10-14 06:21:58,000][100936] Updated weights for policy 0, policy_version 31870 (0.0007) +[2023-10-14 06:21:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 65241088. Throughput: 0: 1661.2, 1: 1652.7. Samples: 16314692. Policy #0 lag: (min: 28.0, avg: 29.8, max: 57.0) +[2023-10-14 06:21:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:21:58,723][100917] Updated weights for policy 1, policy_version 31842 (0.0009) +[2023-10-14 06:21:59,103][100917] Updated weights for policy 1, policy_version 31852 (0.0007) +[2023-10-14 06:21:59,466][100917] Updated weights for policy 1, policy_version 31862 (0.0008) +[2023-10-14 06:21:59,841][100917] Updated weights for policy 1, policy_version 31872 (0.0007) +[2023-10-14 06:22:02,046][100936] Updated weights for policy 0, policy_version 31880 (0.0008) +[2023-10-14 06:22:02,425][100936] Updated weights for policy 0, policy_version 31890 (0.0008) +[2023-10-14 06:22:02,805][100936] Updated weights for policy 0, policy_version 31900 (0.0008) +[2023-10-14 06:22:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 65306624. Throughput: 0: 1648.7, 1: 1661.4. Samples: 16334556. Policy #0 lag: (min: 28.0, avg: 29.8, max: 57.0) +[2023-10-14 06:22:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:22:03,832][100917] Updated weights for policy 1, policy_version 31882 (0.0010) +[2023-10-14 06:22:04,201][100917] Updated weights for policy 1, policy_version 31892 (0.0008) +[2023-10-14 06:22:04,578][100917] Updated weights for policy 1, policy_version 31902 (0.0010) +[2023-10-14 06:22:07,120][100936] Updated weights for policy 0, policy_version 31910 (0.0010) +[2023-10-14 06:22:07,500][100936] Updated weights for policy 0, policy_version 31920 (0.0008) +[2023-10-14 06:22:07,864][100936] Updated weights for policy 0, policy_version 31930 (0.0007) +[2023-10-14 06:22:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 65372160. Throughput: 0: 1652.2, 1: 1664.7. Samples: 16354350. Policy #0 lag: (min: 28.0, avg: 29.8, max: 57.0) +[2023-10-14 06:22:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:22:08,708][100917] Updated weights for policy 1, policy_version 31912 (0.0007) +[2023-10-14 06:22:09,078][100917] Updated weights for policy 1, policy_version 31922 (0.0008) +[2023-10-14 06:22:09,451][100917] Updated weights for policy 1, policy_version 31932 (0.0007) +[2023-10-14 06:22:12,093][100936] Updated weights for policy 0, policy_version 31940 (0.0008) +[2023-10-14 06:22:12,481][100936] Updated weights for policy 0, policy_version 31950 (0.0007) +[2023-10-14 06:22:12,856][100936] Updated weights for policy 0, policy_version 31960 (0.0007) +[2023-10-14 06:22:13,478][100917] Updated weights for policy 1, policy_version 31942 (0.0008) +[2023-10-14 06:22:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 65437696. Throughput: 0: 1654.3, 1: 1666.7. Samples: 16364664. Policy #0 lag: (min: 28.0, avg: 29.8, max: 57.0) +[2023-10-14 06:22:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:22:13,850][100917] Updated weights for policy 1, policy_version 31952 (0.0008) +[2023-10-14 06:22:14,211][100917] Updated weights for policy 1, policy_version 31962 (0.0007) +[2023-10-14 06:22:16,774][100936] Updated weights for policy 0, policy_version 31970 (0.0008) +[2023-10-14 06:22:17,156][100936] Updated weights for policy 0, policy_version 31980 (0.0011) +[2023-10-14 06:22:17,523][100936] Updated weights for policy 0, policy_version 31990 (0.0009) +[2023-10-14 06:22:17,888][100936] Updated weights for policy 0, policy_version 32000 (0.0010) +[2023-10-14 06:22:18,247][100917] Updated weights for policy 1, policy_version 31972 (0.0008) +[2023-10-14 06:22:18,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 65503232. Throughput: 0: 1646.0, 1: 1666.6. Samples: 16384454. Policy #0 lag: (min: 28.0, avg: 29.8, max: 57.0) +[2023-10-14 06:22:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:22:18,609][100917] Updated weights for policy 1, policy_version 31982 (0.0007) +[2023-10-14 06:22:18,989][100917] Updated weights for policy 1, policy_version 31992 (0.0009) +[2023-10-14 06:22:22,032][100936] Updated weights for policy 0, policy_version 32010 (0.0007) +[2023-10-14 06:22:22,395][100936] Updated weights for policy 0, policy_version 32020 (0.0008) +[2023-10-14 06:22:22,770][100936] Updated weights for policy 0, policy_version 32030 (0.0009) +[2023-10-14 06:22:22,933][100917] Updated weights for policy 1, policy_version 32002 (0.0010) +[2023-10-14 06:22:23,307][100917] Updated weights for policy 1, policy_version 32012 (0.0008) +[2023-10-14 06:22:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 65568768. Throughput: 0: 1651.2, 1: 1670.5. Samples: 16404390. Policy #0 lag: (min: 24.0, avg: 48.8, max: 56.0) +[2023-10-14 06:22:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:22:23,682][100917] Updated weights for policy 1, policy_version 32022 (0.0009) +[2023-10-14 06:22:24,062][100917] Updated weights for policy 1, policy_version 32032 (0.0010) +[2023-10-14 06:22:26,933][100936] Updated weights for policy 0, policy_version 32040 (0.0008) +[2023-10-14 06:22:27,300][100936] Updated weights for policy 0, policy_version 32050 (0.0009) +[2023-10-14 06:22:27,673][100936] Updated weights for policy 0, policy_version 32060 (0.0008) +[2023-10-14 06:22:28,173][100917] Updated weights for policy 1, policy_version 32042 (0.0010) +[2023-10-14 06:22:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 65634304. Throughput: 0: 1651.0, 1: 1670.9. Samples: 16414708. Policy #0 lag: (min: 24.0, avg: 48.8, max: 56.0) +[2023-10-14 06:22:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:22:28,546][100917] Updated weights for policy 1, policy_version 32052 (0.0010) +[2023-10-14 06:22:28,927][100917] Updated weights for policy 1, policy_version 32062 (0.0012) +[2023-10-14 06:22:31,789][100936] Updated weights for policy 0, policy_version 32070 (0.0009) +[2023-10-14 06:22:32,162][100936] Updated weights for policy 0, policy_version 32080 (0.0010) +[2023-10-14 06:22:32,532][100936] Updated weights for policy 0, policy_version 32090 (0.0011) +[2023-10-14 06:22:33,128][100917] Updated weights for policy 1, policy_version 32072 (0.0009) +[2023-10-14 06:22:33,506][100917] Updated weights for policy 1, policy_version 32082 (0.0007) +[2023-10-14 06:22:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 65699840. Throughput: 0: 1642.9, 1: 1671.1. Samples: 16434250. Policy #0 lag: (min: 24.0, avg: 48.8, max: 56.0) +[2023-10-14 06:22:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:22:33,884][100917] Updated weights for policy 1, policy_version 32092 (0.0010) +[2023-10-14 06:22:36,708][100936] Updated weights for policy 0, policy_version 32100 (0.0008) +[2023-10-14 06:22:37,090][100936] Updated weights for policy 0, policy_version 32110 (0.0008) +[2023-10-14 06:22:37,465][100936] Updated weights for policy 0, policy_version 32120 (0.0007) +[2023-10-14 06:22:37,919][100917] Updated weights for policy 1, policy_version 32102 (0.0008) +[2023-10-14 06:22:38,300][100917] Updated weights for policy 1, policy_version 32112 (0.0007) +[2023-10-14 06:22:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 65765376. Throughput: 0: 1656.9, 1: 1665.2. Samples: 16454072. Policy #0 lag: (min: 24.0, avg: 48.8, max: 56.0) +[2023-10-14 06:22:38,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:22:38,666][100917] Updated weights for policy 1, policy_version 32122 (0.0007) +[2023-10-14 06:22:41,553][100936] Updated weights for policy 0, policy_version 32130 (0.0007) +[2023-10-14 06:22:41,917][100936] Updated weights for policy 0, policy_version 32140 (0.0009) +[2023-10-14 06:22:42,297][100936] Updated weights for policy 0, policy_version 32150 (0.0008) +[2023-10-14 06:22:42,661][100936] Updated weights for policy 0, policy_version 32160 (0.0007) +[2023-10-14 06:22:42,912][100917] Updated weights for policy 1, policy_version 32132 (0.0009) +[2023-10-14 06:22:43,293][100917] Updated weights for policy 1, policy_version 32142 (0.0008) +[2023-10-14 06:22:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 65830912. Throughput: 0: 1654.3, 1: 1675.1. Samples: 16464516. Policy #0 lag: (min: 24.0, avg: 48.8, max: 56.0) +[2023-10-14 06:22:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:22:43,667][100917] Updated weights for policy 1, policy_version 32152 (0.0008) +[2023-10-14 06:22:46,747][100936] Updated weights for policy 0, policy_version 32170 (0.0010) +[2023-10-14 06:22:47,114][100936] Updated weights for policy 0, policy_version 32180 (0.0007) +[2023-10-14 06:22:47,477][100936] Updated weights for policy 0, policy_version 32190 (0.0010) +[2023-10-14 06:22:47,732][100917] Updated weights for policy 1, policy_version 32162 (0.0008) +[2023-10-14 06:22:48,105][100917] Updated weights for policy 1, policy_version 32172 (0.0008) +[2023-10-14 06:22:48,479][100917] Updated weights for policy 1, policy_version 32182 (0.0010) +[2023-10-14 06:22:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 65896448. Throughput: 0: 1645.1, 1: 1667.4. Samples: 16483618. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:22:48,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:22:48,843][100917] Updated weights for policy 1, policy_version 32192 (0.0007) +[2023-10-14 06:22:51,628][100936] Updated weights for policy 0, policy_version 32200 (0.0008) +[2023-10-14 06:22:52,002][100936] Updated weights for policy 0, policy_version 32210 (0.0009) +[2023-10-14 06:22:52,371][100936] Updated weights for policy 0, policy_version 32220 (0.0007) +[2023-10-14 06:22:52,862][100917] Updated weights for policy 1, policy_version 32202 (0.0009) +[2023-10-14 06:22:53,230][100917] Updated weights for policy 1, policy_version 32212 (0.0008) +[2023-10-14 06:22:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 65961984. Throughput: 0: 1655.9, 1: 1660.3. Samples: 16503580. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:22:53,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:22:53,604][100917] Updated weights for policy 1, policy_version 32222 (0.0010) +[2023-10-14 06:22:56,540][100936] Updated weights for policy 0, policy_version 32230 (0.0008) +[2023-10-14 06:22:56,906][100936] Updated weights for policy 0, policy_version 32240 (0.0011) +[2023-10-14 06:22:57,276][100936] Updated weights for policy 0, policy_version 32250 (0.0010) +[2023-10-14 06:22:57,870][100917] Updated weights for policy 1, policy_version 32232 (0.0008) +[2023-10-14 06:22:58,244][100917] Updated weights for policy 1, policy_version 32242 (0.0007) +[2023-10-14 06:22:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 66027520. Throughput: 0: 1652.0, 1: 1665.8. Samples: 16513966. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:22:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:22:58,608][100917] Updated weights for policy 1, policy_version 32252 (0.0007) +[2023-10-14 06:23:01,469][100936] Updated weights for policy 0, policy_version 32260 (0.0010) +[2023-10-14 06:23:01,858][100936] Updated weights for policy 0, policy_version 32270 (0.0007) +[2023-10-14 06:23:02,228][100936] Updated weights for policy 0, policy_version 32280 (0.0009) +[2023-10-14 06:23:02,690][100917] Updated weights for policy 1, policy_version 32262 (0.0010) +[2023-10-14 06:23:03,061][100917] Updated weights for policy 1, policy_version 32272 (0.0010) +[2023-10-14 06:23:03,428][100917] Updated weights for policy 1, policy_version 32282 (0.0010) +[2023-10-14 06:23:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 66093056. Throughput: 0: 1642.7, 1: 1666.4. Samples: 16533366. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:23:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:23:06,229][100936] Updated weights for policy 0, policy_version 32290 (0.0010) +[2023-10-14 06:23:06,597][100936] Updated weights for policy 0, policy_version 32300 (0.0010) +[2023-10-14 06:23:06,967][100936] Updated weights for policy 0, policy_version 32310 (0.0010) +[2023-10-14 06:23:07,335][100936] Updated weights for policy 0, policy_version 32320 (0.0010) +[2023-10-14 06:23:07,564][100917] Updated weights for policy 1, policy_version 32292 (0.0010) +[2023-10-14 06:23:07,931][100917] Updated weights for policy 1, policy_version 32302 (0.0007) +[2023-10-14 06:23:08,291][100917] Updated weights for policy 1, policy_version 32312 (0.0008) +[2023-10-14 06:23:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 66158592. Throughput: 0: 1657.0, 1: 1646.6. Samples: 16553052. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:23:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:23:11,569][100936] Updated weights for policy 0, policy_version 32330 (0.0010) +[2023-10-14 06:23:11,939][100936] Updated weights for policy 0, policy_version 32340 (0.0009) +[2023-10-14 06:23:12,307][100936] Updated weights for policy 0, policy_version 32350 (0.0009) +[2023-10-14 06:23:12,416][100917] Updated weights for policy 1, policy_version 32322 (0.0008) +[2023-10-14 06:23:12,801][100917] Updated weights for policy 1, policy_version 32332 (0.0007) +[2023-10-14 06:23:13,174][100917] Updated weights for policy 1, policy_version 32342 (0.0008) +[2023-10-14 06:23:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 66224128. Throughput: 0: 1649.3, 1: 1661.3. Samples: 16563686. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:23:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:23:13,542][100917] Updated weights for policy 1, policy_version 32352 (0.0010) +[2023-10-14 06:23:16,448][100936] Updated weights for policy 0, policy_version 32360 (0.0008) +[2023-10-14 06:23:16,818][100936] Updated weights for policy 0, policy_version 32370 (0.0009) +[2023-10-14 06:23:17,189][100936] Updated weights for policy 0, policy_version 32380 (0.0009) +[2023-10-14 06:23:17,644][100917] Updated weights for policy 1, policy_version 32362 (0.0009) +[2023-10-14 06:23:18,018][100917] Updated weights for policy 1, policy_version 32372 (0.0007) +[2023-10-14 06:23:18,382][100917] Updated weights for policy 1, policy_version 32382 (0.0007) +[2023-10-14 06:23:18,512][99942] Fps is (10 sec: 16384.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 66322432. Throughput: 0: 1645.5, 1: 1664.1. Samples: 16583180. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:23:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:23:21,294][100936] Updated weights for policy 0, policy_version 32390 (0.0007) +[2023-10-14 06:23:21,658][100936] Updated weights for policy 0, policy_version 32400 (0.0007) +[2023-10-14 06:23:22,029][100936] Updated weights for policy 0, policy_version 32410 (0.0007) +[2023-10-14 06:23:22,514][100917] Updated weights for policy 1, policy_version 32392 (0.0010) +[2023-10-14 06:23:22,877][100917] Updated weights for policy 1, policy_version 32402 (0.0010) +[2023-10-14 06:23:23,252][100917] Updated weights for policy 1, policy_version 32412 (0.0011) +[2023-10-14 06:23:23,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 66387968. Throughput: 0: 1654.2, 1: 1656.0. Samples: 16603034. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:23:23,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:23:26,194][100936] Updated weights for policy 0, policy_version 32420 (0.0008) +[2023-10-14 06:23:26,563][100936] Updated weights for policy 0, policy_version 32430 (0.0007) +[2023-10-14 06:23:26,943][100936] Updated weights for policy 0, policy_version 32440 (0.0008) +[2023-10-14 06:23:27,495][100917] Updated weights for policy 1, policy_version 32422 (0.0010) +[2023-10-14 06:23:27,856][100917] Updated weights for policy 1, policy_version 32432 (0.0009) +[2023-10-14 06:23:28,229][100917] Updated weights for policy 1, policy_version 32442 (0.0007) +[2023-10-14 06:23:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 66453504. Throughput: 0: 1642.4, 1: 1663.3. Samples: 16613276. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:23:28,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:23:30,992][100936] Updated weights for policy 0, policy_version 32450 (0.0010) +[2023-10-14 06:23:31,367][100936] Updated weights for policy 0, policy_version 32460 (0.0010) +[2023-10-14 06:23:31,737][100936] Updated weights for policy 0, policy_version 32470 (0.0008) +[2023-10-14 06:23:32,105][100936] Updated weights for policy 0, policy_version 32480 (0.0007) +[2023-10-14 06:23:32,474][100917] Updated weights for policy 1, policy_version 32452 (0.0008) +[2023-10-14 06:23:32,869][100917] Updated weights for policy 1, policy_version 32462 (0.0010) +[2023-10-14 06:23:33,238][100917] Updated weights for policy 1, policy_version 32472 (0.0010) +[2023-10-14 06:23:33,512][99942] Fps is (10 sec: 9830.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 66486272. Throughput: 0: 1647.6, 1: 1668.1. Samples: 16632828. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:23:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:23:36,252][100936] Updated weights for policy 0, policy_version 32490 (0.0007) +[2023-10-14 06:23:36,619][100936] Updated weights for policy 0, policy_version 32500 (0.0009) +[2023-10-14 06:23:36,994][100936] Updated weights for policy 0, policy_version 32510 (0.0007) +[2023-10-14 06:23:37,135][100917] Updated weights for policy 1, policy_version 32482 (0.0009) +[2023-10-14 06:23:37,497][100917] Updated weights for policy 1, policy_version 32492 (0.0010) +[2023-10-14 06:23:37,874][100917] Updated weights for policy 1, policy_version 32502 (0.0009) +[2023-10-14 06:23:38,243][100917] Updated weights for policy 1, policy_version 32512 (0.0010) +[2023-10-14 06:23:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 66584576. Throughput: 0: 1656.6, 1: 1648.8. Samples: 16652324. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:23:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:23:38,522][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000032512_33292288.pth... +[2023-10-14 06:23:38,523][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000032512_33292288.pth... +[2023-10-14 06:23:38,552][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000030944_31686656.pth +[2023-10-14 06:23:38,558][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000030976_31719424.pth +[2023-10-14 06:23:41,073][100936] Updated weights for policy 0, policy_version 32520 (0.0009) +[2023-10-14 06:23:41,448][100936] Updated weights for policy 0, policy_version 32530 (0.0008) +[2023-10-14 06:23:41,805][100936] Updated weights for policy 0, policy_version 32540 (0.0008) +[2023-10-14 06:23:42,483][100917] Updated weights for policy 1, policy_version 32522 (0.0007) +[2023-10-14 06:23:42,861][100917] Updated weights for policy 1, policy_version 32532 (0.0007) +[2023-10-14 06:23:43,238][100917] Updated weights for policy 1, policy_version 32542 (0.0008) +[2023-10-14 06:23:43,512][99942] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 66650112. Throughput: 0: 1649.4, 1: 1658.4. Samples: 16662816. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:23:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:23:46,115][100936] Updated weights for policy 0, policy_version 32550 (0.0010) +[2023-10-14 06:23:46,488][100936] Updated weights for policy 0, policy_version 32560 (0.0008) +[2023-10-14 06:23:46,859][100936] Updated weights for policy 0, policy_version 32570 (0.0009) +[2023-10-14 06:23:47,144][100917] Updated weights for policy 1, policy_version 32552 (0.0009) +[2023-10-14 06:23:47,517][100917] Updated weights for policy 1, policy_version 32562 (0.0008) +[2023-10-14 06:23:47,887][100917] Updated weights for policy 1, policy_version 32572 (0.0008) +[2023-10-14 06:23:48,512][99942] Fps is (10 sec: 13107.7, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 66715648. Throughput: 0: 1657.7, 1: 1655.0. Samples: 16682440. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:23:48,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:23:51,079][100936] Updated weights for policy 0, policy_version 32580 (0.0007) +[2023-10-14 06:23:51,479][100936] Updated weights for policy 0, policy_version 32590 (0.0008) +[2023-10-14 06:23:51,857][100936] Updated weights for policy 0, policy_version 32600 (0.0008) +[2023-10-14 06:23:52,116][100917] Updated weights for policy 1, policy_version 32582 (0.0007) +[2023-10-14 06:23:52,486][100917] Updated weights for policy 1, policy_version 32592 (0.0007) +[2023-10-14 06:23:52,860][100917] Updated weights for policy 1, policy_version 32602 (0.0007) +[2023-10-14 06:23:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 66781184. Throughput: 0: 1660.5, 1: 1647.3. Samples: 16701902. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:23:53,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:23:55,770][100936] Updated weights for policy 0, policy_version 32610 (0.0008) +[2023-10-14 06:23:56,131][100936] Updated weights for policy 0, policy_version 32620 (0.0008) +[2023-10-14 06:23:56,500][100936] Updated weights for policy 0, policy_version 32630 (0.0009) +[2023-10-14 06:23:56,868][100936] Updated weights for policy 0, policy_version 32640 (0.0010) +[2023-10-14 06:23:56,934][100917] Updated weights for policy 1, policy_version 32612 (0.0009) +[2023-10-14 06:23:57,312][100917] Updated weights for policy 1, policy_version 32622 (0.0008) +[2023-10-14 06:23:57,684][100917] Updated weights for policy 1, policy_version 32632 (0.0008) +[2023-10-14 06:23:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 66846720. Throughput: 0: 1656.6, 1: 1655.1. Samples: 16712712. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:23:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:24:00,889][100936] Updated weights for policy 0, policy_version 32650 (0.0008) +[2023-10-14 06:24:01,253][100936] Updated weights for policy 0, policy_version 32660 (0.0009) +[2023-10-14 06:24:01,628][100936] Updated weights for policy 0, policy_version 32670 (0.0009) +[2023-10-14 06:24:01,842][100917] Updated weights for policy 1, policy_version 32642 (0.0010) +[2023-10-14 06:24:02,213][100917] Updated weights for policy 1, policy_version 32652 (0.0007) +[2023-10-14 06:24:02,592][100917] Updated weights for policy 1, policy_version 32662 (0.0008) +[2023-10-14 06:24:02,968][100917] Updated weights for policy 1, policy_version 32672 (0.0008) +[2023-10-14 06:24:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 66912256. Throughput: 0: 1667.2, 1: 1647.5. Samples: 16732338. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 06:24:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:24:05,724][100936] Updated weights for policy 0, policy_version 32680 (0.0009) +[2023-10-14 06:24:06,086][100936] Updated weights for policy 0, policy_version 32690 (0.0007) +[2023-10-14 06:24:06,454][100936] Updated weights for policy 0, policy_version 32700 (0.0007) +[2023-10-14 06:24:07,251][100917] Updated weights for policy 1, policy_version 32682 (0.0008) +[2023-10-14 06:24:07,623][100917] Updated weights for policy 1, policy_version 32692 (0.0008) +[2023-10-14 06:24:07,994][100917] Updated weights for policy 1, policy_version 32702 (0.0010) +[2023-10-14 06:24:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 66977792. Throughput: 0: 1670.3, 1: 1638.7. Samples: 16751936. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 06:24:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:24:10,796][100936] Updated weights for policy 0, policy_version 32710 (0.0007) +[2023-10-14 06:24:11,155][100936] Updated weights for policy 0, policy_version 32720 (0.0008) +[2023-10-14 06:24:11,535][100936] Updated weights for policy 0, policy_version 32730 (0.0007) +[2023-10-14 06:24:12,387][100917] Updated weights for policy 1, policy_version 32712 (0.0008) +[2023-10-14 06:24:12,756][100917] Updated weights for policy 1, policy_version 32722 (0.0007) +[2023-10-14 06:24:13,131][100917] Updated weights for policy 1, policy_version 32732 (0.0007) +[2023-10-14 06:24:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 67043328. Throughput: 0: 1659.9, 1: 1646.9. Samples: 16762084. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 06:24:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:24:15,418][100936] Updated weights for policy 0, policy_version 32740 (0.0008) +[2023-10-14 06:24:15,783][100936] Updated weights for policy 0, policy_version 32750 (0.0008) +[2023-10-14 06:24:16,158][100936] Updated weights for policy 0, policy_version 32760 (0.0010) +[2023-10-14 06:24:17,381][100917] Updated weights for policy 1, policy_version 32742 (0.0007) +[2023-10-14 06:24:17,765][100917] Updated weights for policy 1, policy_version 32752 (0.0007) +[2023-10-14 06:24:18,143][100917] Updated weights for policy 1, policy_version 32762 (0.0007) +[2023-10-14 06:24:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 67108864. Throughput: 0: 1671.7, 1: 1644.8. Samples: 16782070. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 06:24:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:24:20,357][100936] Updated weights for policy 0, policy_version 32770 (0.0007) +[2023-10-14 06:24:20,722][100936] Updated weights for policy 0, policy_version 32780 (0.0008) +[2023-10-14 06:24:21,098][100936] Updated weights for policy 0, policy_version 32790 (0.0008) +[2023-10-14 06:24:21,462][100936] Updated weights for policy 0, policy_version 32800 (0.0010) +[2023-10-14 06:24:22,132][100917] Updated weights for policy 1, policy_version 32772 (0.0008) +[2023-10-14 06:24:22,511][100917] Updated weights for policy 1, policy_version 32782 (0.0008) +[2023-10-14 06:24:22,883][100917] Updated weights for policy 1, policy_version 32792 (0.0007) +[2023-10-14 06:24:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 67174400. Throughput: 0: 1671.3, 1: 1648.1. Samples: 16801698. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 06:24:23,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:24:25,411][100936] Updated weights for policy 0, policy_version 32810 (0.0007) +[2023-10-14 06:24:25,780][100936] Updated weights for policy 0, policy_version 32820 (0.0007) +[2023-10-14 06:24:26,153][100936] Updated weights for policy 0, policy_version 32830 (0.0009) +[2023-10-14 06:24:27,091][100917] Updated weights for policy 1, policy_version 32802 (0.0008) +[2023-10-14 06:24:27,466][100917] Updated weights for policy 1, policy_version 32812 (0.0010) +[2023-10-14 06:24:27,840][100917] Updated weights for policy 1, policy_version 32822 (0.0010) +[2023-10-14 06:24:28,214][100917] Updated weights for policy 1, policy_version 32832 (0.0009) +[2023-10-14 06:24:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 67239936. Throughput: 0: 1656.2, 1: 1650.7. Samples: 16811624. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) +[2023-10-14 06:24:28,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:24:30,323][100936] Updated weights for policy 0, policy_version 32840 (0.0008) +[2023-10-14 06:24:30,699][100936] Updated weights for policy 0, policy_version 32850 (0.0007) +[2023-10-14 06:24:31,064][100936] Updated weights for policy 0, policy_version 32860 (0.0007) +[2023-10-14 06:24:32,088][100917] Updated weights for policy 1, policy_version 32842 (0.0010) +[2023-10-14 06:24:32,462][100917] Updated weights for policy 1, policy_version 32852 (0.0008) +[2023-10-14 06:24:32,845][100917] Updated weights for policy 1, policy_version 32862 (0.0009) +[2023-10-14 06:24:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 67305472. Throughput: 0: 1672.6, 1: 1650.4. Samples: 16831978. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) +[2023-10-14 06:24:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:24:35,367][100936] Updated weights for policy 0, policy_version 32870 (0.0008) +[2023-10-14 06:24:35,760][100936] Updated weights for policy 0, policy_version 32880 (0.0007) +[2023-10-14 06:24:36,133][100936] Updated weights for policy 0, policy_version 32890 (0.0008) +[2023-10-14 06:24:36,849][100917] Updated weights for policy 1, policy_version 32872 (0.0010) +[2023-10-14 06:24:37,229][100917] Updated weights for policy 1, policy_version 32882 (0.0009) +[2023-10-14 06:24:37,603][100917] Updated weights for policy 1, policy_version 32892 (0.0010) +[2023-10-14 06:24:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 67371008. Throughput: 0: 1668.8, 1: 1650.3. Samples: 16851262. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) +[2023-10-14 06:24:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:24:40,315][100936] Updated weights for policy 0, policy_version 32900 (0.0009) +[2023-10-14 06:24:40,685][100936] Updated weights for policy 0, policy_version 32910 (0.0010) +[2023-10-14 06:24:41,056][100936] Updated weights for policy 0, policy_version 32920 (0.0010) +[2023-10-14 06:24:41,660][100917] Updated weights for policy 1, policy_version 32902 (0.0008) +[2023-10-14 06:24:42,034][100917] Updated weights for policy 1, policy_version 32912 (0.0009) +[2023-10-14 06:24:42,396][100917] Updated weights for policy 1, policy_version 32922 (0.0011) +[2023-10-14 06:24:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 67436544. Throughput: 0: 1650.9, 1: 1654.1. Samples: 16861436. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) +[2023-10-14 06:24:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:24:45,163][100936] Updated weights for policy 0, policy_version 32930 (0.0010) +[2023-10-14 06:24:45,534][100936] Updated weights for policy 0, policy_version 32940 (0.0008) +[2023-10-14 06:24:45,904][100936] Updated weights for policy 0, policy_version 32950 (0.0007) +[2023-10-14 06:24:46,277][100936] Updated weights for policy 0, policy_version 32960 (0.0008) +[2023-10-14 06:24:46,800][100917] Updated weights for policy 1, policy_version 32932 (0.0009) +[2023-10-14 06:24:47,181][100917] Updated weights for policy 1, policy_version 32942 (0.0008) +[2023-10-14 06:24:47,548][100917] Updated weights for policy 1, policy_version 32952 (0.0009) +[2023-10-14 06:24:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 67502080. Throughput: 0: 1662.7, 1: 1647.5. Samples: 16881296. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) +[2023-10-14 06:24:48,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:24:50,418][100936] Updated weights for policy 0, policy_version 32970 (0.0007) +[2023-10-14 06:24:50,791][100936] Updated weights for policy 0, policy_version 32980 (0.0009) +[2023-10-14 06:24:51,166][100936] Updated weights for policy 0, policy_version 32990 (0.0007) +[2023-10-14 06:24:51,774][100917] Updated weights for policy 1, policy_version 32962 (0.0008) +[2023-10-14 06:24:52,155][100917] Updated weights for policy 1, policy_version 32972 (0.0008) +[2023-10-14 06:24:52,524][100917] Updated weights for policy 1, policy_version 32982 (0.0010) +[2023-10-14 06:24:52,899][100917] Updated weights for policy 1, policy_version 32992 (0.0007) +[2023-10-14 06:24:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 67567616. Throughput: 0: 1659.3, 1: 1648.8. Samples: 16900802. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 06:24:53,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:24:55,456][100936] Updated weights for policy 0, policy_version 33000 (0.0009) +[2023-10-14 06:24:55,843][100936] Updated weights for policy 0, policy_version 33010 (0.0009) +[2023-10-14 06:24:56,211][100936] Updated weights for policy 0, policy_version 33020 (0.0009) +[2023-10-14 06:24:56,825][100917] Updated weights for policy 1, policy_version 33002 (0.0008) +[2023-10-14 06:24:57,197][100917] Updated weights for policy 1, policy_version 33012 (0.0009) +[2023-10-14 06:24:57,567][100917] Updated weights for policy 1, policy_version 33022 (0.0010) +[2023-10-14 06:24:58,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 67633152. Throughput: 0: 1651.8, 1: 1663.3. Samples: 16911266. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 06:24:58,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:25:00,284][100936] Updated weights for policy 0, policy_version 33030 (0.0008) +[2023-10-14 06:25:00,665][100936] Updated weights for policy 0, policy_version 33040 (0.0007) +[2023-10-14 06:25:01,032][100936] Updated weights for policy 0, policy_version 33050 (0.0009) +[2023-10-14 06:25:01,672][100917] Updated weights for policy 1, policy_version 33032 (0.0007) +[2023-10-14 06:25:02,054][100917] Updated weights for policy 1, policy_version 33042 (0.0008) +[2023-10-14 06:25:02,428][100917] Updated weights for policy 1, policy_version 33052 (0.0008) +[2023-10-14 06:25:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 67698688. Throughput: 0: 1657.9, 1: 1652.5. Samples: 16931036. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 06:25:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:25:05,204][100936] Updated weights for policy 0, policy_version 33060 (0.0008) +[2023-10-14 06:25:05,586][100936] Updated weights for policy 0, policy_version 33070 (0.0007) +[2023-10-14 06:25:05,951][100936] Updated weights for policy 0, policy_version 33080 (0.0009) +[2023-10-14 06:25:06,610][100917] Updated weights for policy 1, policy_version 33062 (0.0008) +[2023-10-14 06:25:06,989][100917] Updated weights for policy 1, policy_version 33072 (0.0011) +[2023-10-14 06:25:07,368][100917] Updated weights for policy 1, policy_version 33082 (0.0010) +[2023-10-14 06:25:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 67764224. Throughput: 0: 1654.4, 1: 1655.2. Samples: 16950632. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 06:25:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:25:09,901][100936] Updated weights for policy 0, policy_version 33090 (0.0008) +[2023-10-14 06:25:10,276][100936] Updated weights for policy 0, policy_version 33100 (0.0008) +[2023-10-14 06:25:10,656][100936] Updated weights for policy 0, policy_version 33110 (0.0008) +[2023-10-14 06:25:11,028][100936] Updated weights for policy 0, policy_version 33120 (0.0011) +[2023-10-14 06:25:11,358][100917] Updated weights for policy 1, policy_version 33092 (0.0010) +[2023-10-14 06:25:11,729][100917] Updated weights for policy 1, policy_version 33102 (0.0010) +[2023-10-14 06:25:12,106][100917] Updated weights for policy 1, policy_version 33112 (0.0007) +[2023-10-14 06:25:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 67829760. Throughput: 0: 1651.1, 1: 1664.7. Samples: 16960834. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 06:25:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:25:15,139][100936] Updated weights for policy 0, policy_version 33130 (0.0010) +[2023-10-14 06:25:15,508][100936] Updated weights for policy 0, policy_version 33140 (0.0008) +[2023-10-14 06:25:15,890][100936] Updated weights for policy 0, policy_version 33150 (0.0007) +[2023-10-14 06:25:16,213][100917] Updated weights for policy 1, policy_version 33122 (0.0009) +[2023-10-14 06:25:16,599][100917] Updated weights for policy 1, policy_version 33132 (0.0008) +[2023-10-14 06:25:16,983][100917] Updated weights for policy 1, policy_version 33142 (0.0008) +[2023-10-14 06:25:17,360][100917] Updated weights for policy 1, policy_version 33152 (0.0008) +[2023-10-14 06:25:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 67895296. Throughput: 0: 1651.2, 1: 1650.2. Samples: 16980544. Policy #0 lag: (min: 9.0, avg: 22.3, max: 41.0) +[2023-10-14 06:25:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:25:19,977][100936] Updated weights for policy 0, policy_version 33160 (0.0008) +[2023-10-14 06:25:20,357][100936] Updated weights for policy 0, policy_version 33170 (0.0007) +[2023-10-14 06:25:20,737][100936] Updated weights for policy 0, policy_version 33180 (0.0010) +[2023-10-14 06:25:21,462][100917] Updated weights for policy 1, policy_version 33162 (0.0009) +[2023-10-14 06:25:21,831][100917] Updated weights for policy 1, policy_version 33172 (0.0007) +[2023-10-14 06:25:22,197][100917] Updated weights for policy 1, policy_version 33182 (0.0008) +[2023-10-14 06:25:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 67960832. Throughput: 0: 1655.2, 1: 1659.4. Samples: 17000418. Policy #0 lag: (min: 9.0, avg: 22.3, max: 41.0) +[2023-10-14 06:25:23,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:25:24,845][100936] Updated weights for policy 0, policy_version 33190 (0.0010) +[2023-10-14 06:25:25,214][100936] Updated weights for policy 0, policy_version 33200 (0.0011) +[2023-10-14 06:25:25,584][100936] Updated weights for policy 0, policy_version 33210 (0.0008) +[2023-10-14 06:25:26,330][100917] Updated weights for policy 1, policy_version 33192 (0.0008) +[2023-10-14 06:25:26,703][100917] Updated weights for policy 1, policy_version 33202 (0.0008) +[2023-10-14 06:25:27,086][100917] Updated weights for policy 1, policy_version 33212 (0.0007) +[2023-10-14 06:25:28,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 68026368. Throughput: 0: 1657.7, 1: 1657.8. Samples: 17010634. Policy #0 lag: (min: 9.0, avg: 22.3, max: 41.0) +[2023-10-14 06:25:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:25:29,575][100936] Updated weights for policy 0, policy_version 33220 (0.0008) +[2023-10-14 06:25:29,941][100936] Updated weights for policy 0, policy_version 33230 (0.0008) +[2023-10-14 06:25:30,317][100936] Updated weights for policy 0, policy_version 33240 (0.0007) +[2023-10-14 06:25:31,160][100917] Updated weights for policy 1, policy_version 33222 (0.0010) +[2023-10-14 06:25:31,532][100917] Updated weights for policy 1, policy_version 33232 (0.0010) +[2023-10-14 06:25:31,901][100917] Updated weights for policy 1, policy_version 33242 (0.0008) +[2023-10-14 06:25:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 68091904. Throughput: 0: 1659.2, 1: 1647.5. Samples: 17030096. Policy #0 lag: (min: 9.0, avg: 22.3, max: 41.0) +[2023-10-14 06:25:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:25:34,458][100936] Updated weights for policy 0, policy_version 33250 (0.0007) +[2023-10-14 06:25:34,829][100936] Updated weights for policy 0, policy_version 33260 (0.0009) +[2023-10-14 06:25:35,205][100936] Updated weights for policy 0, policy_version 33270 (0.0007) +[2023-10-14 06:25:35,573][100936] Updated weights for policy 0, policy_version 33280 (0.0007) +[2023-10-14 06:25:36,103][100917] Updated weights for policy 1, policy_version 33252 (0.0009) +[2023-10-14 06:25:36,469][100917] Updated weights for policy 1, policy_version 33262 (0.0010) +[2023-10-14 06:25:36,847][100917] Updated weights for policy 1, policy_version 33272 (0.0009) +[2023-10-14 06:25:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 68157440. Throughput: 0: 1657.3, 1: 1661.8. Samples: 17050160. Policy #0 lag: (min: 9.0, avg: 22.3, max: 41.0) +[2023-10-14 06:25:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:25:38,523][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000033280_34078720.pth... +[2023-10-14 06:25:38,523][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000033280_34078720.pth... +[2023-10-14 06:25:38,555][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000031744_32505856.pth +[2023-10-14 06:25:38,563][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000031744_32505856.pth +[2023-10-14 06:25:39,781][100936] Updated weights for policy 0, policy_version 33290 (0.0009) +[2023-10-14 06:25:40,146][100936] Updated weights for policy 0, policy_version 33300 (0.0008) +[2023-10-14 06:25:40,515][100936] Updated weights for policy 0, policy_version 33310 (0.0009) +[2023-10-14 06:25:40,897][100917] Updated weights for policy 1, policy_version 33282 (0.0007) +[2023-10-14 06:25:41,268][100917] Updated weights for policy 1, policy_version 33292 (0.0008) +[2023-10-14 06:25:41,636][100917] Updated weights for policy 1, policy_version 33302 (0.0010) +[2023-10-14 06:25:42,015][100917] Updated weights for policy 1, policy_version 33312 (0.0009) +[2023-10-14 06:25:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 68222976. Throughput: 0: 1657.5, 1: 1654.7. Samples: 17060320. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:25:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:25:44,594][100936] Updated weights for policy 0, policy_version 33320 (0.0008) +[2023-10-14 06:25:44,962][100936] Updated weights for policy 0, policy_version 33330 (0.0009) +[2023-10-14 06:25:45,329][100936] Updated weights for policy 0, policy_version 33340 (0.0009) +[2023-10-14 06:25:46,221][100917] Updated weights for policy 1, policy_version 33322 (0.0011) +[2023-10-14 06:25:46,598][100917] Updated weights for policy 1, policy_version 33332 (0.0008) +[2023-10-14 06:25:46,978][100917] Updated weights for policy 1, policy_version 33342 (0.0008) +[2023-10-14 06:25:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 68288512. Throughput: 0: 1657.1, 1: 1646.4. Samples: 17079692. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:25:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:25:49,502][100936] Updated weights for policy 0, policy_version 33350 (0.0008) +[2023-10-14 06:25:49,864][100936] Updated weights for policy 0, policy_version 33360 (0.0009) +[2023-10-14 06:25:50,236][100936] Updated weights for policy 0, policy_version 33370 (0.0008) +[2023-10-14 06:25:50,972][100917] Updated weights for policy 1, policy_version 33352 (0.0010) +[2023-10-14 06:25:51,334][100917] Updated weights for policy 1, policy_version 33362 (0.0009) +[2023-10-14 06:25:51,703][100917] Updated weights for policy 1, policy_version 33372 (0.0009) +[2023-10-14 06:25:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 68354048. Throughput: 0: 1656.1, 1: 1662.0. Samples: 17099950. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:25:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:25:54,346][100936] Updated weights for policy 0, policy_version 33380 (0.0008) +[2023-10-14 06:25:54,725][100936] Updated weights for policy 0, policy_version 33390 (0.0008) +[2023-10-14 06:25:55,099][100936] Updated weights for policy 0, policy_version 33400 (0.0007) +[2023-10-14 06:25:55,694][100917] Updated weights for policy 1, policy_version 33382 (0.0010) +[2023-10-14 06:25:56,070][100917] Updated weights for policy 1, policy_version 33392 (0.0007) +[2023-10-14 06:25:56,444][100917] Updated weights for policy 1, policy_version 33402 (0.0009) +[2023-10-14 06:25:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 68419584. Throughput: 0: 1661.0, 1: 1652.3. Samples: 17109934. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:25:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:25:59,068][100936] Updated weights for policy 0, policy_version 33410 (0.0007) +[2023-10-14 06:25:59,447][100936] Updated weights for policy 0, policy_version 33420 (0.0007) +[2023-10-14 06:25:59,812][100936] Updated weights for policy 0, policy_version 33430 (0.0007) +[2023-10-14 06:26:00,185][100936] Updated weights for policy 0, policy_version 33440 (0.0007) +[2023-10-14 06:26:00,722][100917] Updated weights for policy 1, policy_version 33412 (0.0008) +[2023-10-14 06:26:01,103][100917] Updated weights for policy 1, policy_version 33422 (0.0009) +[2023-10-14 06:26:01,468][100917] Updated weights for policy 1, policy_version 33432 (0.0010) +[2023-10-14 06:26:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 68485120. Throughput: 0: 1665.9, 1: 1648.0. Samples: 17129668. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:26:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:26:04,303][100936] Updated weights for policy 0, policy_version 33450 (0.0011) +[2023-10-14 06:26:04,684][100936] Updated weights for policy 0, policy_version 33460 (0.0008) +[2023-10-14 06:26:05,053][100936] Updated weights for policy 0, policy_version 33470 (0.0007) +[2023-10-14 06:26:05,594][100917] Updated weights for policy 1, policy_version 33442 (0.0009) +[2023-10-14 06:26:05,967][100917] Updated weights for policy 1, policy_version 33452 (0.0008) +[2023-10-14 06:26:06,337][100917] Updated weights for policy 1, policy_version 33462 (0.0010) +[2023-10-14 06:26:06,715][100917] Updated weights for policy 1, policy_version 33472 (0.0009) +[2023-10-14 06:26:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 68550656. Throughput: 0: 1662.4, 1: 1663.7. Samples: 17150090. Policy #0 lag: (min: 14.0, avg: 21.9, max: 46.0) +[2023-10-14 06:26:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:26:09,393][100936] Updated weights for policy 0, policy_version 33480 (0.0010) +[2023-10-14 06:26:09,762][100936] Updated weights for policy 0, policy_version 33490 (0.0008) +[2023-10-14 06:26:10,130][100936] Updated weights for policy 0, policy_version 33500 (0.0008) +[2023-10-14 06:26:10,789][100917] Updated weights for policy 1, policy_version 33482 (0.0010) +[2023-10-14 06:26:11,168][100917] Updated weights for policy 1, policy_version 33492 (0.0010) +[2023-10-14 06:26:11,547][100917] Updated weights for policy 1, policy_version 33502 (0.0010) +[2023-10-14 06:26:13,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 68616192. Throughput: 0: 1657.6, 1: 1651.9. Samples: 17159566. Policy #0 lag: (min: 14.0, avg: 21.9, max: 46.0) +[2023-10-14 06:26:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:26:14,279][100936] Updated weights for policy 0, policy_version 33510 (0.0010) +[2023-10-14 06:26:14,649][100936] Updated weights for policy 0, policy_version 33520 (0.0008) +[2023-10-14 06:26:15,018][100936] Updated weights for policy 0, policy_version 33530 (0.0008) +[2023-10-14 06:26:15,607][100917] Updated weights for policy 1, policy_version 33512 (0.0007) +[2023-10-14 06:26:15,985][100917] Updated weights for policy 1, policy_version 33522 (0.0010) +[2023-10-14 06:26:16,355][100917] Updated weights for policy 1, policy_version 33532 (0.0007) +[2023-10-14 06:26:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 68681728. Throughput: 0: 1662.3, 1: 1662.9. Samples: 17179726. Policy #0 lag: (min: 14.0, avg: 21.9, max: 46.0) +[2023-10-14 06:26:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:26:18,940][100936] Updated weights for policy 0, policy_version 33540 (0.0010) +[2023-10-14 06:26:19,319][100936] Updated weights for policy 0, policy_version 33550 (0.0010) +[2023-10-14 06:26:19,678][100936] Updated weights for policy 0, policy_version 33560 (0.0008) +[2023-10-14 06:26:20,406][100917] Updated weights for policy 1, policy_version 33542 (0.0007) +[2023-10-14 06:26:20,786][100917] Updated weights for policy 1, policy_version 33552 (0.0007) +[2023-10-14 06:26:21,157][100917] Updated weights for policy 1, policy_version 33562 (0.0010) +[2023-10-14 06:26:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 68747264. Throughput: 0: 1666.8, 1: 1673.8. Samples: 17200488. Policy #0 lag: (min: 14.0, avg: 21.9, max: 46.0) +[2023-10-14 06:26:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:26:23,799][100936] Updated weights for policy 0, policy_version 33570 (0.0009) +[2023-10-14 06:26:24,166][100936] Updated weights for policy 0, policy_version 33580 (0.0007) +[2023-10-14 06:26:24,538][100936] Updated weights for policy 0, policy_version 33590 (0.0010) +[2023-10-14 06:26:24,907][100936] Updated weights for policy 0, policy_version 33600 (0.0011) +[2023-10-14 06:26:25,333][100917] Updated weights for policy 1, policy_version 33572 (0.0008) +[2023-10-14 06:26:25,703][100917] Updated weights for policy 1, policy_version 33582 (0.0009) +[2023-10-14 06:26:26,071][100917] Updated weights for policy 1, policy_version 33592 (0.0010) +[2023-10-14 06:26:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 68812800. Throughput: 0: 1670.0, 1: 1656.9. Samples: 17210032. Policy #0 lag: (min: 14.0, avg: 21.9, max: 46.0) +[2023-10-14 06:26:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:26:29,165][100936] Updated weights for policy 0, policy_version 33610 (0.0008) +[2023-10-14 06:26:29,541][100936] Updated weights for policy 0, policy_version 33620 (0.0009) +[2023-10-14 06:26:29,901][100936] Updated weights for policy 0, policy_version 33630 (0.0011) +[2023-10-14 06:26:30,256][100917] Updated weights for policy 1, policy_version 33602 (0.0011) +[2023-10-14 06:26:30,629][100917] Updated weights for policy 1, policy_version 33612 (0.0011) +[2023-10-14 06:26:31,002][100917] Updated weights for policy 1, policy_version 33622 (0.0008) +[2023-10-14 06:26:31,376][100917] Updated weights for policy 1, policy_version 33632 (0.0010) +[2023-10-14 06:26:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 68878336. Throughput: 0: 1668.0, 1: 1671.1. Samples: 17229950. Policy #0 lag: (min: 17.0, avg: 27.2, max: 49.0) +[2023-10-14 06:26:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:26:34,071][100936] Updated weights for policy 0, policy_version 33640 (0.0007) +[2023-10-14 06:26:34,435][100936] Updated weights for policy 0, policy_version 33650 (0.0007) +[2023-10-14 06:26:34,803][100936] Updated weights for policy 0, policy_version 33660 (0.0009) +[2023-10-14 06:26:35,246][100917] Updated weights for policy 1, policy_version 33642 (0.0008) +[2023-10-14 06:26:35,621][100917] Updated weights for policy 1, policy_version 33652 (0.0007) +[2023-10-14 06:26:36,010][100917] Updated weights for policy 1, policy_version 33662 (0.0008) +[2023-10-14 06:26:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 68943872. Throughput: 0: 1669.2, 1: 1677.6. Samples: 17250556. Policy #0 lag: (min: 17.0, avg: 27.2, max: 49.0) +[2023-10-14 06:26:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:26:39,023][100936] Updated weights for policy 0, policy_version 33670 (0.0009) +[2023-10-14 06:26:39,386][100936] Updated weights for policy 0, policy_version 33680 (0.0010) +[2023-10-14 06:26:39,768][100936] Updated weights for policy 0, policy_version 33690 (0.0010) +[2023-10-14 06:26:40,154][100917] Updated weights for policy 1, policy_version 33672 (0.0010) +[2023-10-14 06:26:40,533][100917] Updated weights for policy 1, policy_version 33682 (0.0007) +[2023-10-14 06:26:40,913][100917] Updated weights for policy 1, policy_version 33692 (0.0008) +[2023-10-14 06:26:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 69009408. Throughput: 0: 1665.3, 1: 1660.9. Samples: 17259616. Policy #0 lag: (min: 17.0, avg: 27.2, max: 49.0) +[2023-10-14 06:26:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:26:43,817][100936] Updated weights for policy 0, policy_version 33700 (0.0008) +[2023-10-14 06:26:44,188][100936] Updated weights for policy 0, policy_version 33710 (0.0010) +[2023-10-14 06:26:44,556][100936] Updated weights for policy 0, policy_version 33720 (0.0010) +[2023-10-14 06:26:44,963][100917] Updated weights for policy 1, policy_version 33702 (0.0009) +[2023-10-14 06:26:45,343][100917] Updated weights for policy 1, policy_version 33712 (0.0010) +[2023-10-14 06:26:45,724][100917] Updated weights for policy 1, policy_version 33722 (0.0009) +[2023-10-14 06:26:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 69074944. Throughput: 0: 1655.5, 1: 1684.1. Samples: 17279948. Policy #0 lag: (min: 17.0, avg: 27.2, max: 49.0) +[2023-10-14 06:26:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:26:48,702][100936] Updated weights for policy 0, policy_version 33730 (0.0010) +[2023-10-14 06:26:49,083][100936] Updated weights for policy 0, policy_version 33740 (0.0007) +[2023-10-14 06:26:49,456][100936] Updated weights for policy 0, policy_version 33750 (0.0009) +[2023-10-14 06:26:49,718][100917] Updated weights for policy 1, policy_version 33732 (0.0009) +[2023-10-14 06:26:49,819][100936] Updated weights for policy 0, policy_version 33760 (0.0009) +[2023-10-14 06:26:50,101][100917] Updated weights for policy 1, policy_version 33742 (0.0007) +[2023-10-14 06:26:50,476][100917] Updated weights for policy 1, policy_version 33752 (0.0007) +[2023-10-14 06:26:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 69140480. Throughput: 0: 1654.5, 1: 1687.4. Samples: 17300478. Policy #0 lag: (min: 17.0, avg: 27.2, max: 49.0) +[2023-10-14 06:26:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:26:53,994][100936] Updated weights for policy 0, policy_version 33770 (0.0008) +[2023-10-14 06:26:54,360][100936] Updated weights for policy 0, policy_version 33780 (0.0009) +[2023-10-14 06:26:54,491][100917] Updated weights for policy 1, policy_version 33762 (0.0007) +[2023-10-14 06:26:54,743][100936] Updated weights for policy 0, policy_version 33790 (0.0008) +[2023-10-14 06:26:54,848][100917] Updated weights for policy 1, policy_version 33772 (0.0009) +[2023-10-14 06:26:55,228][100917] Updated weights for policy 1, policy_version 33782 (0.0009) +[2023-10-14 06:26:55,603][100917] Updated weights for policy 1, policy_version 33792 (0.0008) +[2023-10-14 06:26:58,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 69206016. Throughput: 0: 1658.3, 1: 1669.7. Samples: 17309326. Policy #0 lag: (min: 21.0, avg: 21.1, max: 27.0) +[2023-10-14 06:26:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:26:58,880][100936] Updated weights for policy 0, policy_version 33800 (0.0007) +[2023-10-14 06:26:59,254][100936] Updated weights for policy 0, policy_version 33810 (0.0008) +[2023-10-14 06:26:59,617][100936] Updated weights for policy 0, policy_version 33820 (0.0008) +[2023-10-14 06:26:59,731][100917] Updated weights for policy 1, policy_version 33802 (0.0009) +[2023-10-14 06:27:00,103][100917] Updated weights for policy 1, policy_version 33812 (0.0009) +[2023-10-14 06:27:00,474][100917] Updated weights for policy 1, policy_version 33822 (0.0009) +[2023-10-14 06:27:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 69271552. Throughput: 0: 1656.3, 1: 1678.7. Samples: 17329800. Policy #0 lag: (min: 21.0, avg: 21.1, max: 27.0) +[2023-10-14 06:27:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:27:03,795][100936] Updated weights for policy 0, policy_version 33830 (0.0007) +[2023-10-14 06:27:04,159][100936] Updated weights for policy 0, policy_version 33840 (0.0011) +[2023-10-14 06:27:04,518][100917] Updated weights for policy 1, policy_version 33832 (0.0010) +[2023-10-14 06:27:04,528][100936] Updated weights for policy 0, policy_version 33850 (0.0007) +[2023-10-14 06:27:04,888][100917] Updated weights for policy 1, policy_version 33842 (0.0007) +[2023-10-14 06:27:05,255][100917] Updated weights for policy 1, policy_version 33852 (0.0007) +[2023-10-14 06:27:08,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 69337088. Throughput: 0: 1650.2, 1: 1676.4. Samples: 17350186. Policy #0 lag: (min: 21.0, avg: 21.1, max: 27.0) +[2023-10-14 06:27:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:27:08,628][100936] Updated weights for policy 0, policy_version 33860 (0.0008) +[2023-10-14 06:27:08,999][100936] Updated weights for policy 0, policy_version 33870 (0.0008) +[2023-10-14 06:27:09,364][100936] Updated weights for policy 0, policy_version 33880 (0.0008) +[2023-10-14 06:27:09,479][100917] Updated weights for policy 1, policy_version 33862 (0.0007) +[2023-10-14 06:27:09,841][100917] Updated weights for policy 1, policy_version 33872 (0.0008) +[2023-10-14 06:27:10,220][100917] Updated weights for policy 1, policy_version 33882 (0.0007) +[2023-10-14 06:27:13,399][100936] Updated weights for policy 0, policy_version 33890 (0.0009) +[2023-10-14 06:27:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 69402624. Throughput: 0: 1648.6, 1: 1663.8. Samples: 17359090. Policy #0 lag: (min: 21.0, avg: 21.1, max: 27.0) +[2023-10-14 06:27:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:27:13,766][100936] Updated weights for policy 0, policy_version 33900 (0.0007) +[2023-10-14 06:27:14,134][100936] Updated weights for policy 0, policy_version 33910 (0.0007) +[2023-10-14 06:27:14,385][100917] Updated weights for policy 1, policy_version 33892 (0.0009) +[2023-10-14 06:27:14,500][100936] Updated weights for policy 0, policy_version 33920 (0.0008) +[2023-10-14 06:27:14,762][100917] Updated weights for policy 1, policy_version 33902 (0.0008) +[2023-10-14 06:27:15,120][100917] Updated weights for policy 1, policy_version 33912 (0.0009) +[2023-10-14 06:27:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 69468160. Throughput: 0: 1654.0, 1: 1670.7. Samples: 17379562. Policy #0 lag: (min: 21.0, avg: 21.1, max: 27.0) +[2023-10-14 06:27:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:27:18,620][100936] Updated weights for policy 0, policy_version 33930 (0.0007) +[2023-10-14 06:27:18,995][100936] Updated weights for policy 0, policy_version 33940 (0.0007) +[2023-10-14 06:27:19,223][100917] Updated weights for policy 1, policy_version 33922 (0.0007) +[2023-10-14 06:27:19,371][100936] Updated weights for policy 0, policy_version 33950 (0.0007) +[2023-10-14 06:27:19,593][100917] Updated weights for policy 1, policy_version 33932 (0.0009) +[2023-10-14 06:27:19,959][100917] Updated weights for policy 1, policy_version 33942 (0.0011) +[2023-10-14 06:27:20,330][100917] Updated weights for policy 1, policy_version 33952 (0.0009) +[2023-10-14 06:27:23,459][100936] Updated weights for policy 0, policy_version 33960 (0.0008) +[2023-10-14 06:27:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 69533696. Throughput: 0: 1652.3, 1: 1663.0. Samples: 17399744. Policy #0 lag: (min: 0.0, avg: 23.0, max: 32.0) +[2023-10-14 06:27:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:27:23,832][100936] Updated weights for policy 0, policy_version 33970 (0.0010) +[2023-10-14 06:27:24,205][100936] Updated weights for policy 0, policy_version 33980 (0.0008) +[2023-10-14 06:27:24,538][100917] Updated weights for policy 1, policy_version 33962 (0.0007) +[2023-10-14 06:27:24,916][100917] Updated weights for policy 1, policy_version 33972 (0.0008) +[2023-10-14 06:27:25,286][100917] Updated weights for policy 1, policy_version 33982 (0.0009) +[2023-10-14 06:27:28,282][100936] Updated weights for policy 0, policy_version 33990 (0.0010) +[2023-10-14 06:27:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 69599232. Throughput: 0: 1660.8, 1: 1659.4. Samples: 17409028. Policy #0 lag: (min: 0.0, avg: 23.0, max: 32.0) +[2023-10-14 06:27:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:27:28,647][100936] Updated weights for policy 0, policy_version 34000 (0.0007) +[2023-10-14 06:27:29,023][100936] Updated weights for policy 0, policy_version 34010 (0.0009) +[2023-10-14 06:27:29,499][100917] Updated weights for policy 1, policy_version 33992 (0.0009) +[2023-10-14 06:27:29,875][100917] Updated weights for policy 1, policy_version 34002 (0.0009) +[2023-10-14 06:27:30,241][100917] Updated weights for policy 1, policy_version 34012 (0.0009) +[2023-10-14 06:27:33,142][100936] Updated weights for policy 0, policy_version 34020 (0.0008) +[2023-10-14 06:27:33,509][100936] Updated weights for policy 0, policy_version 34030 (0.0007) +[2023-10-14 06:27:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 69664768. Throughput: 0: 1664.7, 1: 1655.3. Samples: 17429350. Policy #0 lag: (min: 0.0, avg: 23.0, max: 32.0) +[2023-10-14 06:27:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:27:33,878][100936] Updated weights for policy 0, policy_version 34040 (0.0009) +[2023-10-14 06:27:34,305][100917] Updated weights for policy 1, policy_version 34022 (0.0008) +[2023-10-14 06:27:34,682][100917] Updated weights for policy 1, policy_version 34032 (0.0009) +[2023-10-14 06:27:35,054][100917] Updated weights for policy 1, policy_version 34042 (0.0009) +[2023-10-14 06:27:38,097][100936] Updated weights for policy 0, policy_version 34050 (0.0008) +[2023-10-14 06:27:38,467][100936] Updated weights for policy 0, policy_version 34060 (0.0007) +[2023-10-14 06:27:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 69730304. Throughput: 0: 1659.1, 1: 1651.1. Samples: 17449440. Policy #0 lag: (min: 0.0, avg: 23.0, max: 32.0) +[2023-10-14 06:27:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:27:38,523][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000034048_34865152.pth... +[2023-10-14 06:27:38,559][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000032512_33292288.pth +[2023-10-14 06:27:38,840][100936] Updated weights for policy 0, policy_version 34070 (0.0008) +[2023-10-14 06:27:39,071][100917] Updated weights for policy 1, policy_version 34052 (0.0008) +[2023-10-14 06:27:39,199][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000034080_34897920.pth... +[2023-10-14 06:27:39,202][100936] Updated weights for policy 0, policy_version 34080 (0.0008) +[2023-10-14 06:27:39,233][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000032512_33292288.pth +[2023-10-14 06:27:39,442][100917] Updated weights for policy 1, policy_version 34062 (0.0007) +[2023-10-14 06:27:39,813][100917] Updated weights for policy 1, policy_version 34072 (0.0008) +[2023-10-14 06:27:43,447][100936] Updated weights for policy 0, policy_version 34090 (0.0007) +[2023-10-14 06:27:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 69795840. Throughput: 0: 1667.9, 1: 1655.3. Samples: 17458870. Policy #0 lag: (min: 0.0, avg: 23.0, max: 32.0) +[2023-10-14 06:27:43,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:27:43,822][100936] Updated weights for policy 0, policy_version 34100 (0.0007) +[2023-10-14 06:27:43,882][100917] Updated weights for policy 1, policy_version 34082 (0.0007) +[2023-10-14 06:27:44,198][100936] Updated weights for policy 0, policy_version 34110 (0.0007) +[2023-10-14 06:27:44,258][100917] Updated weights for policy 1, policy_version 34092 (0.0008) +[2023-10-14 06:27:44,633][100917] Updated weights for policy 1, policy_version 34102 (0.0008) +[2023-10-14 06:27:45,009][100917] Updated weights for policy 1, policy_version 34112 (0.0009) +[2023-10-14 06:27:48,324][100936] Updated weights for policy 0, policy_version 34120 (0.0010) +[2023-10-14 06:27:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 69861376. Throughput: 0: 1659.4, 1: 1657.9. Samples: 17479080. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) +[2023-10-14 06:27:48,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:27:48,694][100936] Updated weights for policy 0, policy_version 34130 (0.0008) +[2023-10-14 06:27:49,059][100936] Updated weights for policy 0, policy_version 34140 (0.0010) +[2023-10-14 06:27:49,238][100917] Updated weights for policy 1, policy_version 34122 (0.0007) +[2023-10-14 06:27:49,601][100917] Updated weights for policy 1, policy_version 34132 (0.0007) +[2023-10-14 06:27:49,978][100917] Updated weights for policy 1, policy_version 34142 (0.0008) +[2023-10-14 06:27:53,178][100936] Updated weights for policy 0, policy_version 34150 (0.0009) +[2023-10-14 06:27:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 69926912. Throughput: 0: 1648.6, 1: 1660.6. Samples: 17499098. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) +[2023-10-14 06:27:53,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:27:53,540][100936] Updated weights for policy 0, policy_version 34160 (0.0009) +[2023-10-14 06:27:53,901][100936] Updated weights for policy 0, policy_version 34170 (0.0007) +[2023-10-14 06:27:54,069][100917] Updated weights for policy 1, policy_version 34152 (0.0007) +[2023-10-14 06:27:54,437][100917] Updated weights for policy 1, policy_version 34162 (0.0009) +[2023-10-14 06:27:54,806][100917] Updated weights for policy 1, policy_version 34172 (0.0009) +[2023-10-14 06:27:57,970][100936] Updated weights for policy 0, policy_version 34180 (0.0008) +[2023-10-14 06:27:58,338][100936] Updated weights for policy 0, policy_version 34190 (0.0010) +[2023-10-14 06:27:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 69992448. Throughput: 0: 1663.2, 1: 1660.2. Samples: 17508640. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) +[2023-10-14 06:27:58,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:27:58,720][100936] Updated weights for policy 0, policy_version 34200 (0.0008) +[2023-10-14 06:27:58,731][100917] Updated weights for policy 1, policy_version 34182 (0.0008) +[2023-10-14 06:27:59,108][100917] Updated weights for policy 1, policy_version 34192 (0.0008) +[2023-10-14 06:27:59,483][100917] Updated weights for policy 1, policy_version 34202 (0.0009) +[2023-10-14 06:28:02,875][100936] Updated weights for policy 0, policy_version 34210 (0.0008) +[2023-10-14 06:28:03,242][100936] Updated weights for policy 0, policy_version 34220 (0.0008) +[2023-10-14 06:28:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 70057984. Throughput: 0: 1660.3, 1: 1662.6. Samples: 17529092. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) +[2023-10-14 06:28:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:28:03,604][100936] Updated weights for policy 0, policy_version 34230 (0.0010) +[2023-10-14 06:28:03,719][100917] Updated weights for policy 1, policy_version 34212 (0.0009) +[2023-10-14 06:28:03,979][100936] Updated weights for policy 0, policy_version 34240 (0.0008) +[2023-10-14 06:28:04,084][100917] Updated weights for policy 1, policy_version 34222 (0.0009) +[2023-10-14 06:28:04,458][100917] Updated weights for policy 1, policy_version 34232 (0.0009) +[2023-10-14 06:28:08,332][100936] Updated weights for policy 0, policy_version 34250 (0.0010) +[2023-10-14 06:28:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 70123520. Throughput: 0: 1650.4, 1: 1666.0. Samples: 17548978. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) +[2023-10-14 06:28:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:28:08,666][100917] Updated weights for policy 1, policy_version 34242 (0.0009) +[2023-10-14 06:28:08,700][100936] Updated weights for policy 0, policy_version 34260 (0.0007) +[2023-10-14 06:28:09,042][100917] Updated weights for policy 1, policy_version 34252 (0.0008) +[2023-10-14 06:28:09,064][100936] Updated weights for policy 0, policy_version 34270 (0.0007) +[2023-10-14 06:28:09,408][100917] Updated weights for policy 1, policy_version 34262 (0.0008) +[2023-10-14 06:28:09,785][100917] Updated weights for policy 1, policy_version 34272 (0.0007) +[2023-10-14 06:28:13,147][100936] Updated weights for policy 0, policy_version 34280 (0.0008) +[2023-10-14 06:28:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 70189056. Throughput: 0: 1651.6, 1: 1665.2. Samples: 17558282. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:28:13,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:28:13,519][100936] Updated weights for policy 0, policy_version 34290 (0.0007) +[2023-10-14 06:28:13,885][100936] Updated weights for policy 0, policy_version 34300 (0.0007) +[2023-10-14 06:28:14,034][100917] Updated weights for policy 1, policy_version 34282 (0.0007) +[2023-10-14 06:28:14,407][100917] Updated weights for policy 1, policy_version 34292 (0.0007) +[2023-10-14 06:28:14,779][100917] Updated weights for policy 1, policy_version 34302 (0.0010) +[2023-10-14 06:28:18,031][100936] Updated weights for policy 0, policy_version 34310 (0.0008) +[2023-10-14 06:28:18,400][100936] Updated weights for policy 0, policy_version 34320 (0.0009) +[2023-10-14 06:28:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 70254592. Throughput: 0: 1649.9, 1: 1668.0. Samples: 17578654. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:28:18,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:28:18,769][100936] Updated weights for policy 0, policy_version 34330 (0.0009) +[2023-10-14 06:28:18,813][100917] Updated weights for policy 1, policy_version 34312 (0.0008) +[2023-10-14 06:28:19,184][100917] Updated weights for policy 1, policy_version 34322 (0.0008) +[2023-10-14 06:28:19,557][100917] Updated weights for policy 1, policy_version 34332 (0.0009) +[2023-10-14 06:28:22,784][100936] Updated weights for policy 0, policy_version 34340 (0.0008) +[2023-10-14 06:28:23,153][100936] Updated weights for policy 0, policy_version 34350 (0.0008) +[2023-10-14 06:28:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 70320128. Throughput: 0: 1643.6, 1: 1671.9. Samples: 17598638. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:28:23,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:28:23,517][100936] Updated weights for policy 0, policy_version 34360 (0.0008) +[2023-10-14 06:28:23,625][100917] Updated weights for policy 1, policy_version 34342 (0.0009) +[2023-10-14 06:28:23,993][100917] Updated weights for policy 1, policy_version 34352 (0.0009) +[2023-10-14 06:28:24,364][100917] Updated weights for policy 1, policy_version 34362 (0.0009) +[2023-10-14 06:28:27,597][100936] Updated weights for policy 0, policy_version 34370 (0.0008) +[2023-10-14 06:28:27,970][100936] Updated weights for policy 0, policy_version 34380 (0.0008) +[2023-10-14 06:28:28,341][100936] Updated weights for policy 0, policy_version 34390 (0.0007) +[2023-10-14 06:28:28,494][100917] Updated weights for policy 1, policy_version 34372 (0.0009) +[2023-10-14 06:28:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 70385664. Throughput: 0: 1651.6, 1: 1667.7. Samples: 17608242. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:28:28,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:28:28,707][100936] Updated weights for policy 0, policy_version 34400 (0.0007) +[2023-10-14 06:28:28,876][100917] Updated weights for policy 1, policy_version 34382 (0.0009) +[2023-10-14 06:28:29,236][100917] Updated weights for policy 1, policy_version 34392 (0.0011) +[2023-10-14 06:28:32,798][100936] Updated weights for policy 0, policy_version 34410 (0.0007) +[2023-10-14 06:28:33,165][100936] Updated weights for policy 0, policy_version 34420 (0.0007) +[2023-10-14 06:28:33,318][100917] Updated weights for policy 1, policy_version 34402 (0.0010) +[2023-10-14 06:28:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13107.2). Total num frames: 70451200. Throughput: 0: 1660.1, 1: 1665.5. Samples: 17628732. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:28:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:28:33,535][100936] Updated weights for policy 0, policy_version 34430 (0.0008) +[2023-10-14 06:28:33,688][100917] Updated weights for policy 1, policy_version 34412 (0.0009) +[2023-10-14 06:28:34,079][100917] Updated weights for policy 1, policy_version 34422 (0.0010) +[2023-10-14 06:28:34,441][100917] Updated weights for policy 1, policy_version 34432 (0.0009) +[2023-10-14 06:28:37,717][100936] Updated weights for policy 0, policy_version 34440 (0.0010) +[2023-10-14 06:28:38,099][100936] Updated weights for policy 0, policy_version 34450 (0.0011) +[2023-10-14 06:28:38,472][100936] Updated weights for policy 0, policy_version 34460 (0.0009) +[2023-10-14 06:28:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 70516736. Throughput: 0: 1649.5, 1: 1662.4. Samples: 17648136. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:28:38,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:28:38,547][100917] Updated weights for policy 1, policy_version 34442 (0.0009) +[2023-10-14 06:28:38,923][100917] Updated weights for policy 1, policy_version 34452 (0.0009) +[2023-10-14 06:28:39,293][100917] Updated weights for policy 1, policy_version 34462 (0.0007) +[2023-10-14 06:28:42,471][100936] Updated weights for policy 0, policy_version 34470 (0.0008) +[2023-10-14 06:28:42,839][100936] Updated weights for policy 0, policy_version 34480 (0.0007) +[2023-10-14 06:28:43,139][100917] Updated weights for policy 1, policy_version 34472 (0.0007) +[2023-10-14 06:28:43,211][100936] Updated weights for policy 0, policy_version 34490 (0.0007) +[2023-10-14 06:28:43,509][100917] Updated weights for policy 1, policy_version 34482 (0.0007) +[2023-10-14 06:28:43,512][99942] Fps is (10 sec: 16384.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 70615040. Throughput: 0: 1659.8, 1: 1664.4. Samples: 17658230. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:28:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:28:43,874][100917] Updated weights for policy 1, policy_version 34492 (0.0009) +[2023-10-14 06:28:47,366][100936] Updated weights for policy 0, policy_version 34500 (0.0007) +[2023-10-14 06:28:47,740][100936] Updated weights for policy 0, policy_version 34510 (0.0008) +[2023-10-14 06:28:48,089][100917] Updated weights for policy 1, policy_version 34502 (0.0009) +[2023-10-14 06:28:48,119][100936] Updated weights for policy 0, policy_version 34520 (0.0008) +[2023-10-14 06:28:48,457][100917] Updated weights for policy 1, policy_version 34512 (0.0008) +[2023-10-14 06:28:48,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 70680576. Throughput: 0: 1653.3, 1: 1662.0. Samples: 17678282. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:28:48,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:28:48,839][100917] Updated weights for policy 1, policy_version 34522 (0.0007) +[2023-10-14 06:28:52,258][100936] Updated weights for policy 0, policy_version 34530 (0.0008) +[2023-10-14 06:28:52,630][100936] Updated weights for policy 0, policy_version 34540 (0.0007) +[2023-10-14 06:28:52,998][100936] Updated weights for policy 0, policy_version 34550 (0.0007) +[2023-10-14 06:28:53,148][100917] Updated weights for policy 1, policy_version 34532 (0.0007) +[2023-10-14 06:28:53,364][100936] Updated weights for policy 0, policy_version 34560 (0.0007) +[2023-10-14 06:28:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 70746112. Throughput: 0: 1644.6, 1: 1661.0. Samples: 17697730. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:28:53,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:28:53,528][100917] Updated weights for policy 1, policy_version 34542 (0.0010) +[2023-10-14 06:28:53,904][100917] Updated weights for policy 1, policy_version 34552 (0.0009) +[2023-10-14 06:28:57,522][100936] Updated weights for policy 0, policy_version 34570 (0.0009) +[2023-10-14 06:28:57,885][100936] Updated weights for policy 0, policy_version 34580 (0.0007) +[2023-10-14 06:28:58,186][100917] Updated weights for policy 1, policy_version 34562 (0.0009) +[2023-10-14 06:28:58,256][100936] Updated weights for policy 0, policy_version 34590 (0.0008) +[2023-10-14 06:28:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 70811648. Throughput: 0: 1665.0, 1: 1661.0. Samples: 17707954. Policy #0 lag: (min: 31.0, avg: 33.6, max: 63.0) +[2023-10-14 06:28:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:28:58,550][100917] Updated weights for policy 1, policy_version 34572 (0.0009) +[2023-10-14 06:28:58,942][100917] Updated weights for policy 1, policy_version 34582 (0.0010) +[2023-10-14 06:28:59,317][100917] Updated weights for policy 1, policy_version 34592 (0.0010) +[2023-10-14 06:29:02,474][100936] Updated weights for policy 0, policy_version 34600 (0.0008) +[2023-10-14 06:29:02,849][100936] Updated weights for policy 0, policy_version 34610 (0.0010) +[2023-10-14 06:29:03,217][100936] Updated weights for policy 0, policy_version 34620 (0.0007) +[2023-10-14 06:29:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 70877184. Throughput: 0: 1659.7, 1: 1659.0. Samples: 17727996. Policy #0 lag: (min: 31.0, avg: 33.6, max: 63.0) +[2023-10-14 06:29:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:29:03,527][100917] Updated weights for policy 1, policy_version 34602 (0.0009) +[2023-10-14 06:29:03,909][100917] Updated weights for policy 1, policy_version 34612 (0.0007) +[2023-10-14 06:29:04,274][100917] Updated weights for policy 1, policy_version 34622 (0.0008) +[2023-10-14 06:29:07,323][100936] Updated weights for policy 0, policy_version 34630 (0.0009) +[2023-10-14 06:29:07,684][100936] Updated weights for policy 0, policy_version 34640 (0.0011) +[2023-10-14 06:29:08,062][100936] Updated weights for policy 0, policy_version 34650 (0.0009) +[2023-10-14 06:29:08,288][100917] Updated weights for policy 1, policy_version 34632 (0.0009) +[2023-10-14 06:29:08,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 70942720. Throughput: 0: 1651.3, 1: 1648.5. Samples: 17747130. Policy #0 lag: (min: 31.0, avg: 33.6, max: 63.0) +[2023-10-14 06:29:08,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:29:08,657][100917] Updated weights for policy 1, policy_version 34642 (0.0007) +[2023-10-14 06:29:09,031][100917] Updated weights for policy 1, policy_version 34652 (0.0007) +[2023-10-14 06:29:12,174][100936] Updated weights for policy 0, policy_version 34660 (0.0008) +[2023-10-14 06:29:12,539][100936] Updated weights for policy 0, policy_version 34670 (0.0011) +[2023-10-14 06:29:12,916][100936] Updated weights for policy 0, policy_version 34680 (0.0009) +[2023-10-14 06:29:13,224][100917] Updated weights for policy 1, policy_version 34662 (0.0007) +[2023-10-14 06:29:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 71008256. Throughput: 0: 1662.8, 1: 1651.0. Samples: 17757366. Policy #0 lag: (min: 31.0, avg: 33.6, max: 63.0) +[2023-10-14 06:29:13,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:29:13,590][100917] Updated weights for policy 1, policy_version 34672 (0.0009) +[2023-10-14 06:29:13,959][100917] Updated weights for policy 1, policy_version 34682 (0.0011) +[2023-10-14 06:29:17,093][100936] Updated weights for policy 0, policy_version 34690 (0.0008) +[2023-10-14 06:29:17,462][100936] Updated weights for policy 0, policy_version 34700 (0.0007) +[2023-10-14 06:29:17,835][100936] Updated weights for policy 0, policy_version 34710 (0.0008) +[2023-10-14 06:29:17,981][100917] Updated weights for policy 1, policy_version 34692 (0.0010) +[2023-10-14 06:29:18,210][100936] Updated weights for policy 0, policy_version 34720 (0.0008) +[2023-10-14 06:29:18,348][100917] Updated weights for policy 1, policy_version 34702 (0.0010) +[2023-10-14 06:29:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 71073792. Throughput: 0: 1649.9, 1: 1656.9. Samples: 17777536. Policy #0 lag: (min: 31.0, avg: 33.6, max: 63.0) +[2023-10-14 06:29:18,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:29:18,721][100917] Updated weights for policy 1, policy_version 34712 (0.0010) +[2023-10-14 06:29:22,119][100936] Updated weights for policy 0, policy_version 34730 (0.0008) +[2023-10-14 06:29:22,487][100936] Updated weights for policy 0, policy_version 34740 (0.0009) +[2023-10-14 06:29:22,549][100917] Updated weights for policy 1, policy_version 34722 (0.0009) +[2023-10-14 06:29:22,850][100936] Updated weights for policy 0, policy_version 34750 (0.0008) +[2023-10-14 06:29:22,927][100917] Updated weights for policy 1, policy_version 34732 (0.0009) +[2023-10-14 06:29:23,302][100917] Updated weights for policy 1, policy_version 34742 (0.0008) +[2023-10-14 06:29:23,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 71139328. Throughput: 0: 1658.7, 1: 1656.1. Samples: 17797304. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:29:23,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:29:23,672][100917] Updated weights for policy 1, policy_version 34752 (0.0008) +[2023-10-14 06:29:27,052][100936] Updated weights for policy 0, policy_version 34760 (0.0008) +[2023-10-14 06:29:27,423][100936] Updated weights for policy 0, policy_version 34770 (0.0009) +[2023-10-14 06:29:27,781][100936] Updated weights for policy 0, policy_version 34780 (0.0007) +[2023-10-14 06:29:27,835][100917] Updated weights for policy 1, policy_version 34762 (0.0008) +[2023-10-14 06:29:28,222][100917] Updated weights for policy 1, policy_version 34772 (0.0010) +[2023-10-14 06:29:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 71204864. Throughput: 0: 1661.3, 1: 1665.0. Samples: 17807912. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:29:28,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:29:28,589][100917] Updated weights for policy 1, policy_version 34782 (0.0007) +[2023-10-14 06:29:31,823][100936] Updated weights for policy 0, policy_version 34790 (0.0009) +[2023-10-14 06:29:32,196][100936] Updated weights for policy 0, policy_version 34800 (0.0007) +[2023-10-14 06:29:32,572][100936] Updated weights for policy 0, policy_version 34810 (0.0009) +[2023-10-14 06:29:32,600][100917] Updated weights for policy 1, policy_version 34792 (0.0007) +[2023-10-14 06:29:32,960][100917] Updated weights for policy 1, policy_version 34802 (0.0007) +[2023-10-14 06:29:33,342][100917] Updated weights for policy 1, policy_version 34812 (0.0007) +[2023-10-14 06:29:33,512][99942] Fps is (10 sec: 16384.4, 60 sec: 14199.5, 300 sec: 13329.4). Total num frames: 71303168. Throughput: 0: 1651.4, 1: 1665.9. Samples: 17827560. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:29:33,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:29:36,659][100936] Updated weights for policy 0, policy_version 34820 (0.0010) +[2023-10-14 06:29:37,025][100936] Updated weights for policy 0, policy_version 34830 (0.0008) +[2023-10-14 06:29:37,398][100936] Updated weights for policy 0, policy_version 34840 (0.0008) +[2023-10-14 06:29:37,576][100917] Updated weights for policy 1, policy_version 34822 (0.0009) +[2023-10-14 06:29:37,951][100917] Updated weights for policy 1, policy_version 34832 (0.0009) +[2023-10-14 06:29:38,323][100917] Updated weights for policy 1, policy_version 34842 (0.0008) +[2023-10-14 06:29:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 71335936. Throughput: 0: 1662.2, 1: 1647.6. Samples: 17846668. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:29:38,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:29:38,520][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000034848_35684352.pth... +[2023-10-14 06:29:38,548][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000034848_35684352.pth... +[2023-10-14 06:29:38,551][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000033280_34078720.pth +[2023-10-14 06:29:38,586][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000033280_34078720.pth +[2023-10-14 06:29:41,553][100936] Updated weights for policy 0, policy_version 34850 (0.0009) +[2023-10-14 06:29:41,914][100936] Updated weights for policy 0, policy_version 34860 (0.0008) +[2023-10-14 06:29:42,296][100936] Updated weights for policy 0, policy_version 34870 (0.0008) +[2023-10-14 06:29:42,508][100917] Updated weights for policy 1, policy_version 34852 (0.0009) +[2023-10-14 06:29:42,664][100936] Updated weights for policy 0, policy_version 34880 (0.0008) +[2023-10-14 06:29:42,879][100917] Updated weights for policy 1, policy_version 34862 (0.0007) +[2023-10-14 06:29:43,256][100917] Updated weights for policy 1, policy_version 34872 (0.0007) +[2023-10-14 06:29:43,512][99942] Fps is (10 sec: 9830.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 71401472. Throughput: 0: 1660.1, 1: 1663.6. Samples: 17857518. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:29:43,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:29:47,091][100936] Updated weights for policy 0, policy_version 34890 (0.0008) +[2023-10-14 06:29:47,335][100917] Updated weights for policy 1, policy_version 34882 (0.0008) +[2023-10-14 06:29:47,456][100936] Updated weights for policy 0, policy_version 34900 (0.0009) +[2023-10-14 06:29:47,703][100917] Updated weights for policy 1, policy_version 34892 (0.0009) +[2023-10-14 06:29:47,818][100936] Updated weights for policy 0, policy_version 34910 (0.0009) +[2023-10-14 06:29:48,081][100917] Updated weights for policy 1, policy_version 34902 (0.0010) +[2023-10-14 06:29:48,449][100917] Updated weights for policy 1, policy_version 34912 (0.0008) +[2023-10-14 06:29:48,512][99942] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 71499776. Throughput: 0: 1648.2, 1: 1662.8. Samples: 17876992. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:29:48,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:29:51,936][100936] Updated weights for policy 0, policy_version 34920 (0.0010) +[2023-10-14 06:29:52,298][100936] Updated weights for policy 0, policy_version 34930 (0.0010) +[2023-10-14 06:29:52,594][100917] Updated weights for policy 1, policy_version 34922 (0.0009) +[2023-10-14 06:29:52,664][100936] Updated weights for policy 0, policy_version 34940 (0.0008) +[2023-10-14 06:29:52,956][100917] Updated weights for policy 1, policy_version 34932 (0.0008) +[2023-10-14 06:29:53,333][100917] Updated weights for policy 1, policy_version 34942 (0.0008) +[2023-10-14 06:29:53,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 71565312. Throughput: 0: 1658.0, 1: 1652.4. Samples: 17896096. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:29:53,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:29:56,796][100936] Updated weights for policy 0, policy_version 34950 (0.0009) +[2023-10-14 06:29:57,171][100936] Updated weights for policy 0, policy_version 34960 (0.0011) +[2023-10-14 06:29:57,518][100917] Updated weights for policy 1, policy_version 34952 (0.0007) +[2023-10-14 06:29:57,528][100936] Updated weights for policy 0, policy_version 34970 (0.0008) +[2023-10-14 06:29:57,888][100917] Updated weights for policy 1, policy_version 34962 (0.0008) +[2023-10-14 06:29:58,255][100917] Updated weights for policy 1, policy_version 34972 (0.0007) +[2023-10-14 06:29:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 71630848. Throughput: 0: 1658.5, 1: 1669.9. Samples: 17907142. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:29:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:30:01,637][100936] Updated weights for policy 0, policy_version 34980 (0.0008) +[2023-10-14 06:30:02,006][100936] Updated weights for policy 0, policy_version 34990 (0.0010) +[2023-10-14 06:30:02,254][100917] Updated weights for policy 1, policy_version 34982 (0.0008) +[2023-10-14 06:30:02,382][100936] Updated weights for policy 0, policy_version 35000 (0.0010) +[2023-10-14 06:30:02,629][100917] Updated weights for policy 1, policy_version 34992 (0.0009) +[2023-10-14 06:30:03,008][100917] Updated weights for policy 1, policy_version 35002 (0.0008) +[2023-10-14 06:30:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 71696384. Throughput: 0: 1645.1, 1: 1671.5. Samples: 17926782. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:30:03,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:30:06,566][100936] Updated weights for policy 0, policy_version 35010 (0.0009) +[2023-10-14 06:30:06,963][100936] Updated weights for policy 0, policy_version 35020 (0.0008) +[2023-10-14 06:30:07,141][100917] Updated weights for policy 1, policy_version 35012 (0.0011) +[2023-10-14 06:30:07,328][100936] Updated weights for policy 0, policy_version 35030 (0.0009) +[2023-10-14 06:30:07,517][100917] Updated weights for policy 1, policy_version 35022 (0.0010) +[2023-10-14 06:30:07,700][100936] Updated weights for policy 0, policy_version 35040 (0.0009) +[2023-10-14 06:30:07,884][100917] Updated weights for policy 1, policy_version 35032 (0.0010) +[2023-10-14 06:30:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 71761920. Throughput: 0: 1649.0, 1: 1649.0. Samples: 17945714. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:30:08,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:30:11,891][100917] Updated weights for policy 1, policy_version 35042 (0.0008) +[2023-10-14 06:30:11,934][100936] Updated weights for policy 0, policy_version 35050 (0.0008) +[2023-10-14 06:30:12,270][100917] Updated weights for policy 1, policy_version 35052 (0.0009) +[2023-10-14 06:30:12,302][100936] Updated weights for policy 0, policy_version 35060 (0.0009) +[2023-10-14 06:30:12,635][100917] Updated weights for policy 1, policy_version 35062 (0.0008) +[2023-10-14 06:30:12,678][100936] Updated weights for policy 0, policy_version 35070 (0.0008) +[2023-10-14 06:30:13,001][100917] Updated weights for policy 1, policy_version 35072 (0.0008) +[2023-10-14 06:30:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 71827456. Throughput: 0: 1646.7, 1: 1661.3. Samples: 17956772. Policy #0 lag: (min: 12.0, avg: 16.4, max: 44.0) +[2023-10-14 06:30:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:30:16,972][100936] Updated weights for policy 0, policy_version 35080 (0.0009) +[2023-10-14 06:30:17,197][100917] Updated weights for policy 1, policy_version 35082 (0.0009) +[2023-10-14 06:30:17,340][100936] Updated weights for policy 0, policy_version 35090 (0.0008) +[2023-10-14 06:30:17,570][100917] Updated weights for policy 1, policy_version 35092 (0.0008) +[2023-10-14 06:30:17,713][100936] Updated weights for policy 0, policy_version 35100 (0.0007) +[2023-10-14 06:30:17,945][100917] Updated weights for policy 1, policy_version 35102 (0.0009) +[2023-10-14 06:30:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 71892992. Throughput: 0: 1646.4, 1: 1653.2. Samples: 17976042. Policy #0 lag: (min: 12.0, avg: 16.4, max: 44.0) +[2023-10-14 06:30:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:30:21,797][100936] Updated weights for policy 0, policy_version 35110 (0.0008) +[2023-10-14 06:30:21,986][100917] Updated weights for policy 1, policy_version 35112 (0.0009) +[2023-10-14 06:30:22,164][100936] Updated weights for policy 0, policy_version 35120 (0.0009) +[2023-10-14 06:30:22,356][100917] Updated weights for policy 1, policy_version 35122 (0.0008) +[2023-10-14 06:30:22,539][100936] Updated weights for policy 0, policy_version 35130 (0.0009) +[2023-10-14 06:30:22,728][100917] Updated weights for policy 1, policy_version 35132 (0.0009) +[2023-10-14 06:30:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 71958528. Throughput: 0: 1644.5, 1: 1646.1. Samples: 17994748. Policy #0 lag: (min: 12.0, avg: 16.4, max: 44.0) +[2023-10-14 06:30:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:30:26,613][100936] Updated weights for policy 0, policy_version 35140 (0.0009) +[2023-10-14 06:30:26,804][100917] Updated weights for policy 1, policy_version 35142 (0.0009) +[2023-10-14 06:30:26,980][100936] Updated weights for policy 0, policy_version 35150 (0.0008) +[2023-10-14 06:30:27,171][100917] Updated weights for policy 1, policy_version 35152 (0.0009) +[2023-10-14 06:30:27,350][100936] Updated weights for policy 0, policy_version 35160 (0.0008) +[2023-10-14 06:30:27,543][100917] Updated weights for policy 1, policy_version 35162 (0.0009) +[2023-10-14 06:30:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 72024064. Throughput: 0: 1646.0, 1: 1661.6. Samples: 18006360. Policy #0 lag: (min: 12.0, avg: 16.4, max: 44.0) +[2023-10-14 06:30:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:30:31,530][100917] Updated weights for policy 1, policy_version 35172 (0.0009) +[2023-10-14 06:30:31,684][100936] Updated weights for policy 0, policy_version 35170 (0.0007) +[2023-10-14 06:30:31,897][100917] Updated weights for policy 1, policy_version 35182 (0.0010) +[2023-10-14 06:30:32,058][100936] Updated weights for policy 0, policy_version 35180 (0.0007) +[2023-10-14 06:30:32,271][100917] Updated weights for policy 1, policy_version 35192 (0.0010) +[2023-10-14 06:30:32,437][100936] Updated weights for policy 0, policy_version 35190 (0.0008) +[2023-10-14 06:30:32,796][100936] Updated weights for policy 0, policy_version 35200 (0.0009) +[2023-10-14 06:30:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 72089600. Throughput: 0: 1643.1, 1: 1654.0. Samples: 18025364. Policy #0 lag: (min: 12.0, avg: 16.4, max: 44.0) +[2023-10-14 06:30:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:30:36,451][100917] Updated weights for policy 1, policy_version 35202 (0.0009) +[2023-10-14 06:30:36,823][100917] Updated weights for policy 1, policy_version 35212 (0.0007) +[2023-10-14 06:30:36,852][100936] Updated weights for policy 0, policy_version 35210 (0.0009) +[2023-10-14 06:30:37,185][100917] Updated weights for policy 1, policy_version 35222 (0.0008) +[2023-10-14 06:30:37,224][100936] Updated weights for policy 0, policy_version 35220 (0.0008) +[2023-10-14 06:30:37,562][100917] Updated weights for policy 1, policy_version 35232 (0.0008) +[2023-10-14 06:30:37,587][100936] Updated weights for policy 0, policy_version 35230 (0.0008) +[2023-10-14 06:30:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 72155136. Throughput: 0: 1647.4, 1: 1657.2. Samples: 18044802. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 06:30:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:30:41,635][100936] Updated weights for policy 0, policy_version 35240 (0.0008) +[2023-10-14 06:30:41,886][100917] Updated weights for policy 1, policy_version 35242 (0.0010) +[2023-10-14 06:30:42,001][100936] Updated weights for policy 0, policy_version 35250 (0.0008) +[2023-10-14 06:30:42,265][100917] Updated weights for policy 1, policy_version 35252 (0.0007) +[2023-10-14 06:30:42,378][100936] Updated weights for policy 0, policy_version 35260 (0.0009) +[2023-10-14 06:30:42,631][100917] Updated weights for policy 1, policy_version 35262 (0.0007) +[2023-10-14 06:30:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 72220672. Throughput: 0: 1646.6, 1: 1668.1. Samples: 18056300. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 06:30:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:30:46,576][100936] Updated weights for policy 0, policy_version 35270 (0.0009) +[2023-10-14 06:30:46,748][100917] Updated weights for policy 1, policy_version 35272 (0.0009) +[2023-10-14 06:30:46,943][100936] Updated weights for policy 0, policy_version 35280 (0.0007) +[2023-10-14 06:30:47,112][100917] Updated weights for policy 1, policy_version 35282 (0.0008) +[2023-10-14 06:30:47,311][100936] Updated weights for policy 0, policy_version 35290 (0.0007) +[2023-10-14 06:30:47,481][100917] Updated weights for policy 1, policy_version 35292 (0.0008) +[2023-10-14 06:30:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 72286208. Throughput: 0: 1644.0, 1: 1646.6. Samples: 18074860. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 06:30:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:30:51,564][100936] Updated weights for policy 0, policy_version 35300 (0.0007) +[2023-10-14 06:30:51,745][100917] Updated weights for policy 1, policy_version 35302 (0.0008) +[2023-10-14 06:30:51,943][100936] Updated weights for policy 0, policy_version 35310 (0.0008) +[2023-10-14 06:30:52,113][100917] Updated weights for policy 1, policy_version 35312 (0.0007) +[2023-10-14 06:30:52,318][100936] Updated weights for policy 0, policy_version 35320 (0.0009) +[2023-10-14 06:30:52,491][100917] Updated weights for policy 1, policy_version 35322 (0.0008) +[2023-10-14 06:30:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 72351744. Throughput: 0: 1643.0, 1: 1648.7. Samples: 18093840. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 06:30:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:30:56,386][100936] Updated weights for policy 0, policy_version 35330 (0.0010) +[2023-10-14 06:30:56,763][100936] Updated weights for policy 0, policy_version 35340 (0.0009) +[2023-10-14 06:30:56,886][100917] Updated weights for policy 1, policy_version 35332 (0.0007) +[2023-10-14 06:30:57,122][100936] Updated weights for policy 0, policy_version 35350 (0.0008) +[2023-10-14 06:30:57,250][100917] Updated weights for policy 1, policy_version 35342 (0.0007) +[2023-10-14 06:30:57,491][100936] Updated weights for policy 0, policy_version 35360 (0.0007) +[2023-10-14 06:30:57,622][100917] Updated weights for policy 1, policy_version 35352 (0.0007) +[2023-10-14 06:30:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 72417280. Throughput: 0: 1644.1, 1: 1650.2. Samples: 18105018. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 06:30:58,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:31:01,604][100936] Updated weights for policy 0, policy_version 35370 (0.0010) +[2023-10-14 06:31:01,793][100917] Updated weights for policy 1, policy_version 35362 (0.0009) +[2023-10-14 06:31:01,971][100936] Updated weights for policy 0, policy_version 35380 (0.0009) +[2023-10-14 06:31:02,168][100917] Updated weights for policy 1, policy_version 35372 (0.0008) +[2023-10-14 06:31:02,339][100936] Updated weights for policy 0, policy_version 35390 (0.0009) +[2023-10-14 06:31:02,554][100917] Updated weights for policy 1, policy_version 35382 (0.0007) +[2023-10-14 06:31:02,928][100917] Updated weights for policy 1, policy_version 35392 (0.0007) +[2023-10-14 06:31:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13329.4). Total num frames: 72482816. Throughput: 0: 1641.8, 1: 1648.0. Samples: 18124084. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:31:03,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:31:06,498][100936] Updated weights for policy 0, policy_version 35400 (0.0008) +[2023-10-14 06:31:06,870][100936] Updated weights for policy 0, policy_version 35410 (0.0007) +[2023-10-14 06:31:06,997][100917] Updated weights for policy 1, policy_version 35402 (0.0009) +[2023-10-14 06:31:07,238][100936] Updated weights for policy 0, policy_version 35420 (0.0007) +[2023-10-14 06:31:07,371][100917] Updated weights for policy 1, policy_version 35412 (0.0008) +[2023-10-14 06:31:07,735][100917] Updated weights for policy 1, policy_version 35422 (0.0010) +[2023-10-14 06:31:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13329.4). Total num frames: 72548352. Throughput: 0: 1650.4, 1: 1648.2. Samples: 18143184. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:31:08,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:31:11,347][100936] Updated weights for policy 0, policy_version 35430 (0.0009) +[2023-10-14 06:31:11,713][100936] Updated weights for policy 0, policy_version 35440 (0.0008) +[2023-10-14 06:31:11,851][100917] Updated weights for policy 1, policy_version 35432 (0.0008) +[2023-10-14 06:31:12,090][100936] Updated weights for policy 0, policy_version 35450 (0.0009) +[2023-10-14 06:31:12,229][100917] Updated weights for policy 1, policy_version 35442 (0.0010) +[2023-10-14 06:31:12,592][100917] Updated weights for policy 1, policy_version 35452 (0.0007) +[2023-10-14 06:31:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 72613888. Throughput: 0: 1645.9, 1: 1643.4. Samples: 18154378. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:31:13,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:31:16,244][100936] Updated weights for policy 0, policy_version 35460 (0.0008) +[2023-10-14 06:31:16,607][100936] Updated weights for policy 0, policy_version 35470 (0.0008) +[2023-10-14 06:31:16,824][100917] Updated weights for policy 1, policy_version 35462 (0.0007) +[2023-10-14 06:31:16,983][100936] Updated weights for policy 0, policy_version 35480 (0.0008) +[2023-10-14 06:31:17,205][100917] Updated weights for policy 1, policy_version 35472 (0.0008) +[2023-10-14 06:31:17,574][100917] Updated weights for policy 1, policy_version 35482 (0.0009) +[2023-10-14 06:31:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 72679424. Throughput: 0: 1642.5, 1: 1644.1. Samples: 18173264. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:31:18,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:31:21,092][100936] Updated weights for policy 0, policy_version 35490 (0.0009) +[2023-10-14 06:31:21,463][100936] Updated weights for policy 0, policy_version 35500 (0.0009) +[2023-10-14 06:31:21,707][100917] Updated weights for policy 1, policy_version 35492 (0.0007) +[2023-10-14 06:31:21,841][100936] Updated weights for policy 0, policy_version 35510 (0.0007) +[2023-10-14 06:31:22,072][100917] Updated weights for policy 1, policy_version 35502 (0.0008) +[2023-10-14 06:31:22,209][100936] Updated weights for policy 0, policy_version 35520 (0.0007) +[2023-10-14 06:31:22,443][100917] Updated weights for policy 1, policy_version 35512 (0.0007) +[2023-10-14 06:31:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 72744960. Throughput: 0: 1657.9, 1: 1636.5. Samples: 18193054. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:31:23,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:31:26,330][100936] Updated weights for policy 0, policy_version 35530 (0.0010) +[2023-10-14 06:31:26,577][100917] Updated weights for policy 1, policy_version 35522 (0.0007) +[2023-10-14 06:31:26,709][100936] Updated weights for policy 0, policy_version 35540 (0.0007) +[2023-10-14 06:31:26,986][100917] Updated weights for policy 1, policy_version 35532 (0.0008) +[2023-10-14 06:31:27,078][100936] Updated weights for policy 0, policy_version 35550 (0.0008) +[2023-10-14 06:31:27,364][100917] Updated weights for policy 1, policy_version 35542 (0.0007) +[2023-10-14 06:31:27,742][100917] Updated weights for policy 1, policy_version 35552 (0.0007) +[2023-10-14 06:31:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 72810496. Throughput: 0: 1647.0, 1: 1638.0. Samples: 18204126. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 06:31:28,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:31:31,360][100936] Updated weights for policy 0, policy_version 35560 (0.0008) +[2023-10-14 06:31:31,726][100936] Updated weights for policy 0, policy_version 35570 (0.0008) +[2023-10-14 06:31:31,758][100917] Updated weights for policy 1, policy_version 35562 (0.0007) +[2023-10-14 06:31:32,102][100936] Updated weights for policy 0, policy_version 35580 (0.0008) +[2023-10-14 06:31:32,137][100917] Updated weights for policy 1, policy_version 35572 (0.0007) +[2023-10-14 06:31:32,506][100917] Updated weights for policy 1, policy_version 35582 (0.0008) +[2023-10-14 06:31:33,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 72876032. Throughput: 0: 1653.0, 1: 1642.8. Samples: 18223172. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 06:31:33,512][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:31:36,283][100936] Updated weights for policy 0, policy_version 35590 (0.0009) +[2023-10-14 06:31:36,542][100917] Updated weights for policy 1, policy_version 35592 (0.0008) +[2023-10-14 06:31:36,666][100936] Updated weights for policy 0, policy_version 35600 (0.0009) +[2023-10-14 06:31:36,926][100917] Updated weights for policy 1, policy_version 35602 (0.0009) +[2023-10-14 06:31:37,042][100936] Updated weights for policy 0, policy_version 35610 (0.0007) +[2023-10-14 06:31:37,296][100917] Updated weights for policy 1, policy_version 35612 (0.0008) +[2023-10-14 06:31:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 72941568. Throughput: 0: 1664.4, 1: 1647.9. Samples: 18242892. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 06:31:38,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:31:38,522][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000035616_36470784.pth... +[2023-10-14 06:31:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000035616_36470784.pth... +[2023-10-14 06:31:38,551][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000034048_34865152.pth +[2023-10-14 06:31:38,561][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000034080_34897920.pth +[2023-10-14 06:31:41,041][100936] Updated weights for policy 0, policy_version 35620 (0.0008) +[2023-10-14 06:31:41,410][100936] Updated weights for policy 0, policy_version 35630 (0.0008) +[2023-10-14 06:31:41,611][100917] Updated weights for policy 1, policy_version 35622 (0.0008) +[2023-10-14 06:31:41,769][100936] Updated weights for policy 0, policy_version 35640 (0.0009) +[2023-10-14 06:31:41,968][100917] Updated weights for policy 1, policy_version 35632 (0.0010) +[2023-10-14 06:31:42,338][100917] Updated weights for policy 1, policy_version 35642 (0.0009) +[2023-10-14 06:31:43,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13329.4). Total num frames: 73007104. Throughput: 0: 1654.8, 1: 1650.8. Samples: 18253772. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 06:31:43,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:31:45,850][100936] Updated weights for policy 0, policy_version 35650 (0.0007) +[2023-10-14 06:31:46,234][100936] Updated weights for policy 0, policy_version 35660 (0.0008) +[2023-10-14 06:31:46,495][100917] Updated weights for policy 1, policy_version 35652 (0.0009) +[2023-10-14 06:31:46,600][100936] Updated weights for policy 0, policy_version 35670 (0.0008) +[2023-10-14 06:31:46,863][100917] Updated weights for policy 1, policy_version 35662 (0.0007) +[2023-10-14 06:31:46,968][100936] Updated weights for policy 0, policy_version 35680 (0.0009) +[2023-10-14 06:31:47,244][100917] Updated weights for policy 1, policy_version 35672 (0.0007) +[2023-10-14 06:31:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 73072640. Throughput: 0: 1660.5, 1: 1644.1. Samples: 18272792. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 06:31:48,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:31:51,032][100936] Updated weights for policy 0, policy_version 35690 (0.0008) +[2023-10-14 06:31:51,392][100936] Updated weights for policy 0, policy_version 35700 (0.0008) +[2023-10-14 06:31:51,426][100917] Updated weights for policy 1, policy_version 35682 (0.0009) +[2023-10-14 06:31:51,766][100936] Updated weights for policy 0, policy_version 35710 (0.0007) +[2023-10-14 06:31:51,801][100917] Updated weights for policy 1, policy_version 35692 (0.0010) +[2023-10-14 06:31:52,173][100917] Updated weights for policy 1, policy_version 35702 (0.0009) +[2023-10-14 06:31:52,553][100917] Updated weights for policy 1, policy_version 35712 (0.0007) +[2023-10-14 06:31:53,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 73138176. Throughput: 0: 1665.4, 1: 1653.8. Samples: 18292548. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:31:53,512][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:31:55,904][100936] Updated weights for policy 0, policy_version 35720 (0.0011) +[2023-10-14 06:31:56,268][100936] Updated weights for policy 0, policy_version 35730 (0.0009) +[2023-10-14 06:31:56,568][100917] Updated weights for policy 1, policy_version 35722 (0.0008) +[2023-10-14 06:31:56,640][100936] Updated weights for policy 0, policy_version 35740 (0.0008) +[2023-10-14 06:31:56,928][100917] Updated weights for policy 1, policy_version 35732 (0.0007) +[2023-10-14 06:31:57,308][100917] Updated weights for policy 1, policy_version 35742 (0.0008) +[2023-10-14 06:31:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 73203712. Throughput: 0: 1649.1, 1: 1657.4. Samples: 18303170. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:31:58,512][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:32:00,801][100936] Updated weights for policy 0, policy_version 35750 (0.0010) +[2023-10-14 06:32:01,169][100936] Updated weights for policy 0, policy_version 35760 (0.0010) +[2023-10-14 06:32:01,481][100917] Updated weights for policy 1, policy_version 35752 (0.0009) +[2023-10-14 06:32:01,541][100936] Updated weights for policy 0, policy_version 35770 (0.0007) +[2023-10-14 06:32:01,853][100917] Updated weights for policy 1, policy_version 35762 (0.0009) +[2023-10-14 06:32:02,220][100917] Updated weights for policy 1, policy_version 35772 (0.0010) +[2023-10-14 06:32:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 73269248. Throughput: 0: 1660.1, 1: 1650.8. Samples: 18322254. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:32:03,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:32:05,672][100936] Updated weights for policy 0, policy_version 35780 (0.0008) +[2023-10-14 06:32:06,035][100936] Updated weights for policy 0, policy_version 35790 (0.0009) +[2023-10-14 06:32:06,220][100917] Updated weights for policy 1, policy_version 35782 (0.0009) +[2023-10-14 06:32:06,415][100936] Updated weights for policy 0, policy_version 35800 (0.0007) +[2023-10-14 06:32:06,594][100917] Updated weights for policy 1, policy_version 35792 (0.0011) +[2023-10-14 06:32:06,962][100917] Updated weights for policy 1, policy_version 35802 (0.0010) +[2023-10-14 06:32:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 73334784. Throughput: 0: 1658.4, 1: 1662.1. Samples: 18342478. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:32:08,512][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:32:10,478][100936] Updated weights for policy 0, policy_version 35810 (0.0008) +[2023-10-14 06:32:10,855][100936] Updated weights for policy 0, policy_version 35820 (0.0009) +[2023-10-14 06:32:11,000][100917] Updated weights for policy 1, policy_version 35812 (0.0010) +[2023-10-14 06:32:11,232][100936] Updated weights for policy 0, policy_version 35830 (0.0007) +[2023-10-14 06:32:11,377][100917] Updated weights for policy 1, policy_version 35822 (0.0009) +[2023-10-14 06:32:11,604][100936] Updated weights for policy 0, policy_version 35840 (0.0009) +[2023-10-14 06:32:11,747][100917] Updated weights for policy 1, policy_version 35832 (0.0010) +[2023-10-14 06:32:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 73400320. Throughput: 0: 1646.6, 1: 1656.8. Samples: 18352780. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:32:13,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:32:15,655][100936] Updated weights for policy 0, policy_version 35850 (0.0010) +[2023-10-14 06:32:15,789][100917] Updated weights for policy 1, policy_version 35842 (0.0010) +[2023-10-14 06:32:16,012][100936] Updated weights for policy 0, policy_version 35860 (0.0008) +[2023-10-14 06:32:16,165][100917] Updated weights for policy 1, policy_version 35852 (0.0009) +[2023-10-14 06:32:16,377][100936] Updated weights for policy 0, policy_version 35870 (0.0008) +[2023-10-14 06:32:16,544][100917] Updated weights for policy 1, policy_version 35862 (0.0008) +[2023-10-14 06:32:16,910][100917] Updated weights for policy 1, policy_version 35872 (0.0007) +[2023-10-14 06:32:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 73465856. Throughput: 0: 1661.2, 1: 1642.8. Samples: 18371852. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:32:18,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:32:20,662][100936] Updated weights for policy 0, policy_version 35880 (0.0009) +[2023-10-14 06:32:21,044][100936] Updated weights for policy 0, policy_version 35890 (0.0008) +[2023-10-14 06:32:21,176][100917] Updated weights for policy 1, policy_version 35882 (0.0008) +[2023-10-14 06:32:21,405][100936] Updated weights for policy 0, policy_version 35900 (0.0009) +[2023-10-14 06:32:21,547][100917] Updated weights for policy 1, policy_version 35892 (0.0010) +[2023-10-14 06:32:21,922][100917] Updated weights for policy 1, policy_version 35902 (0.0008) +[2023-10-14 06:32:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 73531392. Throughput: 0: 1666.1, 1: 1653.6. Samples: 18392280. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:32:23,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:32:25,586][100936] Updated weights for policy 0, policy_version 35910 (0.0008) +[2023-10-14 06:32:25,968][100936] Updated weights for policy 0, policy_version 35920 (0.0008) +[2023-10-14 06:32:26,213][100917] Updated weights for policy 1, policy_version 35912 (0.0007) +[2023-10-14 06:32:26,345][100936] Updated weights for policy 0, policy_version 35930 (0.0007) +[2023-10-14 06:32:26,577][100917] Updated weights for policy 1, policy_version 35922 (0.0009) +[2023-10-14 06:32:26,950][100917] Updated weights for policy 1, policy_version 35932 (0.0010) +[2023-10-14 06:32:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 73596928. Throughput: 0: 1649.9, 1: 1654.6. Samples: 18402474. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:32:28,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:32:30,392][100936] Updated weights for policy 0, policy_version 35940 (0.0008) +[2023-10-14 06:32:30,763][100936] Updated weights for policy 0, policy_version 35950 (0.0007) +[2023-10-14 06:32:31,123][100917] Updated weights for policy 1, policy_version 35942 (0.0010) +[2023-10-14 06:32:31,138][100936] Updated weights for policy 0, policy_version 35960 (0.0007) +[2023-10-14 06:32:31,481][100917] Updated weights for policy 1, policy_version 35952 (0.0009) +[2023-10-14 06:32:31,853][100917] Updated weights for policy 1, policy_version 35962 (0.0008) +[2023-10-14 06:32:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 73662464. Throughput: 0: 1660.4, 1: 1649.5. Samples: 18421734. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:32:33,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:32:35,227][100936] Updated weights for policy 0, policy_version 35970 (0.0009) +[2023-10-14 06:32:35,608][100936] Updated weights for policy 0, policy_version 35980 (0.0007) +[2023-10-14 06:32:35,922][100917] Updated weights for policy 1, policy_version 35972 (0.0007) +[2023-10-14 06:32:35,981][100936] Updated weights for policy 0, policy_version 35990 (0.0008) +[2023-10-14 06:32:36,300][100917] Updated weights for policy 1, policy_version 35982 (0.0008) +[2023-10-14 06:32:36,348][100936] Updated weights for policy 0, policy_version 36000 (0.0008) +[2023-10-14 06:32:36,663][100917] Updated weights for policy 1, policy_version 35992 (0.0010) +[2023-10-14 06:32:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 73728000. Throughput: 0: 1661.9, 1: 1664.2. Samples: 18442222. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:32:38,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:32:40,579][100936] Updated weights for policy 0, policy_version 36010 (0.0007) +[2023-10-14 06:32:40,743][100917] Updated weights for policy 1, policy_version 36002 (0.0008) +[2023-10-14 06:32:40,946][100936] Updated weights for policy 0, policy_version 36020 (0.0008) +[2023-10-14 06:32:41,122][100917] Updated weights for policy 1, policy_version 36012 (0.0009) +[2023-10-14 06:32:41,321][100936] Updated weights for policy 0, policy_version 36030 (0.0008) +[2023-10-14 06:32:41,488][100917] Updated weights for policy 1, policy_version 36022 (0.0009) +[2023-10-14 06:32:41,862][100917] Updated weights for policy 1, policy_version 36032 (0.0011) +[2023-10-14 06:32:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 73793536. Throughput: 0: 1653.6, 1: 1656.9. Samples: 18452142. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) +[2023-10-14 06:32:43,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:32:45,303][100936] Updated weights for policy 0, policy_version 36040 (0.0009) +[2023-10-14 06:32:45,676][100936] Updated weights for policy 0, policy_version 36050 (0.0009) +[2023-10-14 06:32:45,921][100917] Updated weights for policy 1, policy_version 36042 (0.0009) +[2023-10-14 06:32:46,043][100936] Updated weights for policy 0, policy_version 36060 (0.0008) +[2023-10-14 06:32:46,286][100917] Updated weights for policy 1, policy_version 36052 (0.0009) +[2023-10-14 06:32:46,663][100917] Updated weights for policy 1, policy_version 36062 (0.0010) +[2023-10-14 06:32:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 73859072. Throughput: 0: 1668.8, 1: 1650.5. Samples: 18471620. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) +[2023-10-14 06:32:48,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:32:50,284][100936] Updated weights for policy 0, policy_version 36070 (0.0009) +[2023-10-14 06:32:50,665][100936] Updated weights for policy 0, policy_version 36080 (0.0007) +[2023-10-14 06:32:50,861][100917] Updated weights for policy 1, policy_version 36072 (0.0009) +[2023-10-14 06:32:51,023][100936] Updated weights for policy 0, policy_version 36090 (0.0008) +[2023-10-14 06:32:51,238][100917] Updated weights for policy 1, policy_version 36082 (0.0008) +[2023-10-14 06:32:51,612][100917] Updated weights for policy 1, policy_version 36092 (0.0007) +[2023-10-14 06:32:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 73924608. Throughput: 0: 1667.1, 1: 1659.3. Samples: 18492164. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) +[2023-10-14 06:32:53,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:32:54,933][100936] Updated weights for policy 0, policy_version 36100 (0.0008) +[2023-10-14 06:32:55,306][100936] Updated weights for policy 0, policy_version 36110 (0.0007) +[2023-10-14 06:32:55,683][100936] Updated weights for policy 0, policy_version 36120 (0.0007) +[2023-10-14 06:32:55,768][100917] Updated weights for policy 1, policy_version 36102 (0.0008) +[2023-10-14 06:32:56,136][100917] Updated weights for policy 1, policy_version 36112 (0.0010) +[2023-10-14 06:32:56,517][100917] Updated weights for policy 1, policy_version 36122 (0.0008) +[2023-10-14 06:32:58,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 73990144. Throughput: 0: 1660.5, 1: 1652.1. Samples: 18501846. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) +[2023-10-14 06:32:58,512][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:32:59,775][100936] Updated weights for policy 0, policy_version 36130 (0.0008) +[2023-10-14 06:33:00,152][100936] Updated weights for policy 0, policy_version 36140 (0.0009) +[2023-10-14 06:33:00,519][100936] Updated weights for policy 0, policy_version 36150 (0.0008) +[2023-10-14 06:33:00,535][100917] Updated weights for policy 1, policy_version 36132 (0.0008) +[2023-10-14 06:33:00,885][100936] Updated weights for policy 0, policy_version 36160 (0.0008) +[2023-10-14 06:33:00,907][100917] Updated weights for policy 1, policy_version 36142 (0.0007) +[2023-10-14 06:33:01,283][100917] Updated weights for policy 1, policy_version 36152 (0.0010) +[2023-10-14 06:33:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 74055680. Throughput: 0: 1662.5, 1: 1659.9. Samples: 18521358. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) +[2023-10-14 06:33:03,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:33:05,071][100936] Updated weights for policy 0, policy_version 36170 (0.0008) +[2023-10-14 06:33:05,384][100917] Updated weights for policy 1, policy_version 36162 (0.0008) +[2023-10-14 06:33:05,443][100936] Updated weights for policy 0, policy_version 36180 (0.0007) +[2023-10-14 06:33:05,751][100917] Updated weights for policy 1, policy_version 36172 (0.0009) +[2023-10-14 06:33:05,820][100936] Updated weights for policy 0, policy_version 36190 (0.0007) +[2023-10-14 06:33:06,122][100917] Updated weights for policy 1, policy_version 36182 (0.0010) +[2023-10-14 06:33:06,502][100917] Updated weights for policy 1, policy_version 36192 (0.0011) +[2023-10-14 06:33:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 74121216. Throughput: 0: 1657.4, 1: 1664.8. Samples: 18541776. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:33:08,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:33:09,944][100936] Updated weights for policy 0, policy_version 36200 (0.0007) +[2023-10-14 06:33:10,313][100936] Updated weights for policy 0, policy_version 36210 (0.0007) +[2023-10-14 06:33:10,682][100936] Updated weights for policy 0, policy_version 36220 (0.0008) +[2023-10-14 06:33:10,728][100917] Updated weights for policy 1, policy_version 36202 (0.0008) +[2023-10-14 06:33:11,098][100917] Updated weights for policy 1, policy_version 36212 (0.0008) +[2023-10-14 06:33:11,460][100917] Updated weights for policy 1, policy_version 36222 (0.0007) +[2023-10-14 06:33:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 74186752. Throughput: 0: 1657.3, 1: 1648.4. Samples: 18551230. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:33:13,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:33:14,875][100936] Updated weights for policy 0, policy_version 36230 (0.0007) +[2023-10-14 06:33:15,252][100936] Updated weights for policy 0, policy_version 36240 (0.0008) +[2023-10-14 06:33:15,578][100917] Updated weights for policy 1, policy_version 36232 (0.0008) +[2023-10-14 06:33:15,630][100936] Updated weights for policy 0, policy_version 36250 (0.0009) +[2023-10-14 06:33:15,941][100917] Updated weights for policy 1, policy_version 36242 (0.0008) +[2023-10-14 06:33:16,312][100917] Updated weights for policy 1, policy_version 36252 (0.0007) +[2023-10-14 06:33:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 74252288. Throughput: 0: 1659.4, 1: 1655.8. Samples: 18570920. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:33:18,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:33:19,646][100936] Updated weights for policy 0, policy_version 36260 (0.0008) +[2023-10-14 06:33:20,027][100936] Updated weights for policy 0, policy_version 36270 (0.0007) +[2023-10-14 06:33:20,408][100936] Updated weights for policy 0, policy_version 36280 (0.0007) +[2023-10-14 06:33:20,448][100917] Updated weights for policy 1, policy_version 36262 (0.0008) +[2023-10-14 06:33:20,821][100917] Updated weights for policy 1, policy_version 36272 (0.0008) +[2023-10-14 06:33:21,201][100917] Updated weights for policy 1, policy_version 36282 (0.0009) +[2023-10-14 06:33:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 74317824. Throughput: 0: 1660.6, 1: 1655.7. Samples: 18591454. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:33:23,512][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:33:24,460][100936] Updated weights for policy 0, policy_version 36290 (0.0007) +[2023-10-14 06:33:24,835][100936] Updated weights for policy 0, policy_version 36300 (0.0007) +[2023-10-14 06:33:25,211][100936] Updated weights for policy 0, policy_version 36310 (0.0009) +[2023-10-14 06:33:25,426][100917] Updated weights for policy 1, policy_version 36292 (0.0009) +[2023-10-14 06:33:25,579][100936] Updated weights for policy 0, policy_version 36320 (0.0007) +[2023-10-14 06:33:25,811][100917] Updated weights for policy 1, policy_version 36302 (0.0010) +[2023-10-14 06:33:26,185][100917] Updated weights for policy 1, policy_version 36312 (0.0010) +[2023-10-14 06:33:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 74383360. Throughput: 0: 1661.8, 1: 1644.1. Samples: 18600908. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:33:28,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:33:29,751][100936] Updated weights for policy 0, policy_version 36330 (0.0007) +[2023-10-14 06:33:30,112][100936] Updated weights for policy 0, policy_version 36340 (0.0010) +[2023-10-14 06:33:30,334][100917] Updated weights for policy 1, policy_version 36322 (0.0010) +[2023-10-14 06:33:30,489][100936] Updated weights for policy 0, policy_version 36350 (0.0009) +[2023-10-14 06:33:30,696][100917] Updated weights for policy 1, policy_version 36332 (0.0008) +[2023-10-14 06:33:31,065][100917] Updated weights for policy 1, policy_version 36342 (0.0009) +[2023-10-14 06:33:31,444][100917] Updated weights for policy 1, policy_version 36352 (0.0011) +[2023-10-14 06:33:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 74448896. Throughput: 0: 1658.8, 1: 1653.6. Samples: 18620674. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:33:33,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 06:33:34,681][100936] Updated weights for policy 0, policy_version 36360 (0.0010) +[2023-10-14 06:33:35,058][100936] Updated weights for policy 0, policy_version 36370 (0.0008) +[2023-10-14 06:33:35,426][100936] Updated weights for policy 0, policy_version 36380 (0.0008) +[2023-10-14 06:33:35,583][100917] Updated weights for policy 1, policy_version 36362 (0.0009) +[2023-10-14 06:33:35,951][100917] Updated weights for policy 1, policy_version 36372 (0.0008) +[2023-10-14 06:33:36,326][100917] Updated weights for policy 1, policy_version 36382 (0.0010) +[2023-10-14 06:33:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 74514432. Throughput: 0: 1651.8, 1: 1653.7. Samples: 18640912. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:33:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:33:38,525][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000036384_37257216.pth... +[2023-10-14 06:33:38,525][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000036384_37257216.pth... +[2023-10-14 06:33:38,565][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000034848_35684352.pth +[2023-10-14 06:33:38,566][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000034848_35684352.pth +[2023-10-14 06:33:39,889][100936] Updated weights for policy 0, policy_version 36390 (0.0009) +[2023-10-14 06:33:40,261][100936] Updated weights for policy 0, policy_version 36400 (0.0010) +[2023-10-14 06:33:40,573][100917] Updated weights for policy 1, policy_version 36392 (0.0007) +[2023-10-14 06:33:40,625][100936] Updated weights for policy 0, policy_version 36410 (0.0008) +[2023-10-14 06:33:40,953][100917] Updated weights for policy 1, policy_version 36402 (0.0008) +[2023-10-14 06:33:41,316][100917] Updated weights for policy 1, policy_version 36412 (0.0009) +[2023-10-14 06:33:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 74579968. Throughput: 0: 1651.8, 1: 1646.5. Samples: 18650272. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:33:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:33:44,945][100936] Updated weights for policy 0, policy_version 36420 (0.0007) +[2023-10-14 06:33:45,306][100936] Updated weights for policy 0, policy_version 36430 (0.0007) +[2023-10-14 06:33:45,464][100917] Updated weights for policy 1, policy_version 36422 (0.0009) +[2023-10-14 06:33:45,672][100936] Updated weights for policy 0, policy_version 36440 (0.0007) +[2023-10-14 06:33:45,831][100917] Updated weights for policy 1, policy_version 36432 (0.0010) +[2023-10-14 06:33:46,201][100917] Updated weights for policy 1, policy_version 36442 (0.0009) +[2023-10-14 06:33:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 74645504. Throughput: 0: 1652.4, 1: 1649.8. Samples: 18669958. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:33:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:33:49,775][100936] Updated weights for policy 0, policy_version 36450 (0.0008) +[2023-10-14 06:33:50,135][100936] Updated weights for policy 0, policy_version 36460 (0.0009) +[2023-10-14 06:33:50,386][100917] Updated weights for policy 1, policy_version 36452 (0.0008) +[2023-10-14 06:33:50,502][100936] Updated weights for policy 0, policy_version 36470 (0.0007) +[2023-10-14 06:33:50,762][100917] Updated weights for policy 1, policy_version 36462 (0.0009) +[2023-10-14 06:33:50,871][100936] Updated weights for policy 0, policy_version 36480 (0.0007) +[2023-10-14 06:33:51,138][100917] Updated weights for policy 1, policy_version 36472 (0.0010) +[2023-10-14 06:33:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 74711040. Throughput: 0: 1655.8, 1: 1649.6. Samples: 18690522. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:33:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:33:54,793][100936] Updated weights for policy 0, policy_version 36490 (0.0010) +[2023-10-14 06:33:55,166][100936] Updated weights for policy 0, policy_version 36500 (0.0010) +[2023-10-14 06:33:55,304][100917] Updated weights for policy 1, policy_version 36482 (0.0008) +[2023-10-14 06:33:55,538][100936] Updated weights for policy 0, policy_version 36510 (0.0008) +[2023-10-14 06:33:55,703][100917] Updated weights for policy 1, policy_version 36492 (0.0010) +[2023-10-14 06:33:56,076][100917] Updated weights for policy 1, policy_version 36502 (0.0010) +[2023-10-14 06:33:56,451][100917] Updated weights for policy 1, policy_version 36512 (0.0008) +[2023-10-14 06:33:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 74776576. Throughput: 0: 1657.3, 1: 1645.8. Samples: 18699868. Policy #0 lag: (min: 31.0, avg: 31.7, max: 50.0) +[2023-10-14 06:33:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:33:59,579][100936] Updated weights for policy 0, policy_version 36520 (0.0007) +[2023-10-14 06:33:59,955][100936] Updated weights for policy 0, policy_version 36530 (0.0007) +[2023-10-14 06:34:00,321][100936] Updated weights for policy 0, policy_version 36540 (0.0007) +[2023-10-14 06:34:00,505][100917] Updated weights for policy 1, policy_version 36522 (0.0008) +[2023-10-14 06:34:00,884][100917] Updated weights for policy 1, policy_version 36532 (0.0010) +[2023-10-14 06:34:01,258][100917] Updated weights for policy 1, policy_version 36542 (0.0008) +[2023-10-14 06:34:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 74842112. Throughput: 0: 1653.6, 1: 1651.2. Samples: 18719638. Policy #0 lag: (min: 31.0, avg: 31.7, max: 50.0) +[2023-10-14 06:34:03,513][99942] Avg episode reward: [(0, '0.850'), (1, '1.000')] +[2023-10-14 06:34:04,756][100936] Updated weights for policy 0, policy_version 36550 (0.0009) +[2023-10-14 06:34:05,145][100936] Updated weights for policy 0, policy_version 36560 (0.0010) +[2023-10-14 06:34:05,412][100917] Updated weights for policy 1, policy_version 36552 (0.0009) +[2023-10-14 06:34:05,503][100936] Updated weights for policy 0, policy_version 36570 (0.0008) +[2023-10-14 06:34:05,781][100917] Updated weights for policy 1, policy_version 36562 (0.0009) +[2023-10-14 06:34:06,160][100917] Updated weights for policy 1, policy_version 36572 (0.0010) +[2023-10-14 06:34:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 74907648. Throughput: 0: 1647.2, 1: 1649.8. Samples: 18739820. Policy #0 lag: (min: 31.0, avg: 31.7, max: 50.0) +[2023-10-14 06:34:08,513][99942] Avg episode reward: [(0, '0.850'), (1, '1.000')] +[2023-10-14 06:34:09,568][100936] Updated weights for policy 0, policy_version 36580 (0.0009) +[2023-10-14 06:34:09,942][100936] Updated weights for policy 0, policy_version 36590 (0.0007) +[2023-10-14 06:34:10,310][100936] Updated weights for policy 0, policy_version 36600 (0.0009) +[2023-10-14 06:34:10,355][100917] Updated weights for policy 1, policy_version 36582 (0.0009) +[2023-10-14 06:34:10,722][100917] Updated weights for policy 1, policy_version 36592 (0.0007) +[2023-10-14 06:34:11,094][100917] Updated weights for policy 1, policy_version 36602 (0.0008) +[2023-10-14 06:34:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 74973184. Throughput: 0: 1645.2, 1: 1646.7. Samples: 18749042. Policy #0 lag: (min: 31.0, avg: 31.7, max: 50.0) +[2023-10-14 06:34:13,513][99942] Avg episode reward: [(0, '0.850'), (1, '1.000')] +[2023-10-14 06:34:14,516][100936] Updated weights for policy 0, policy_version 36610 (0.0009) +[2023-10-14 06:34:14,878][100936] Updated weights for policy 0, policy_version 36620 (0.0009) +[2023-10-14 06:34:15,246][100917] Updated weights for policy 1, policy_version 36612 (0.0009) +[2023-10-14 06:34:15,257][100936] Updated weights for policy 0, policy_version 36630 (0.0008) +[2023-10-14 06:34:15,626][100936] Updated weights for policy 0, policy_version 36640 (0.0008) +[2023-10-14 06:34:15,626][100917] Updated weights for policy 1, policy_version 36622 (0.0009) +[2023-10-14 06:34:15,998][100917] Updated weights for policy 1, policy_version 36632 (0.0011) +[2023-10-14 06:34:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 75038720. Throughput: 0: 1642.4, 1: 1646.8. Samples: 18768684. Policy #0 lag: (min: 31.0, avg: 31.7, max: 50.0) +[2023-10-14 06:34:18,513][99942] Avg episode reward: [(0, '0.850'), (1, '1.000')] +[2023-10-14 06:34:19,642][100936] Updated weights for policy 0, policy_version 36650 (0.0008) +[2023-10-14 06:34:20,020][100936] Updated weights for policy 0, policy_version 36660 (0.0008) +[2023-10-14 06:34:20,032][100917] Updated weights for policy 1, policy_version 36642 (0.0009) +[2023-10-14 06:34:20,395][100936] Updated weights for policy 0, policy_version 36670 (0.0008) +[2023-10-14 06:34:20,405][100917] Updated weights for policy 1, policy_version 36652 (0.0008) +[2023-10-14 06:34:20,778][100917] Updated weights for policy 1, policy_version 36662 (0.0008) +[2023-10-14 06:34:21,152][100917] Updated weights for policy 1, policy_version 36672 (0.0007) +[2023-10-14 06:34:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 75104256. Throughput: 0: 1645.4, 1: 1648.5. Samples: 18789136. Policy #0 lag: (min: 1.0, avg: 18.2, max: 33.0) +[2023-10-14 06:34:23,513][99942] Avg episode reward: [(0, '0.850'), (1, '1.000')] +[2023-10-14 06:34:24,584][100936] Updated weights for policy 0, policy_version 36680 (0.0008) +[2023-10-14 06:34:24,943][100936] Updated weights for policy 0, policy_version 36690 (0.0007) +[2023-10-14 06:34:25,117][100917] Updated weights for policy 1, policy_version 36682 (0.0007) +[2023-10-14 06:34:25,309][100936] Updated weights for policy 0, policy_version 36700 (0.0008) +[2023-10-14 06:34:25,495][100917] Updated weights for policy 1, policy_version 36692 (0.0007) +[2023-10-14 06:34:25,861][100917] Updated weights for policy 1, policy_version 36702 (0.0010) +[2023-10-14 06:34:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 75169792. Throughput: 0: 1648.1, 1: 1639.1. Samples: 18798194. Policy #0 lag: (min: 1.0, avg: 18.2, max: 33.0) +[2023-10-14 06:34:28,513][99942] Avg episode reward: [(0, '0.850'), (1, '1.000')] +[2023-10-14 06:34:29,588][100936] Updated weights for policy 0, policy_version 36710 (0.0008) +[2023-10-14 06:34:29,966][100936] Updated weights for policy 0, policy_version 36720 (0.0007) +[2023-10-14 06:34:30,087][100917] Updated weights for policy 1, policy_version 36712 (0.0009) +[2023-10-14 06:34:30,332][100936] Updated weights for policy 0, policy_version 36730 (0.0007) +[2023-10-14 06:34:30,462][100917] Updated weights for policy 1, policy_version 36722 (0.0008) +[2023-10-14 06:34:30,839][100917] Updated weights for policy 1, policy_version 36732 (0.0010) +[2023-10-14 06:34:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 75235328. Throughput: 0: 1648.0, 1: 1646.8. Samples: 18818224. Policy #0 lag: (min: 1.0, avg: 18.2, max: 33.0) +[2023-10-14 06:34:33,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 06:34:34,522][100936] Updated weights for policy 0, policy_version 36740 (0.0009) +[2023-10-14 06:34:34,885][100936] Updated weights for policy 0, policy_version 36750 (0.0011) +[2023-10-14 06:34:35,098][100917] Updated weights for policy 1, policy_version 36742 (0.0010) +[2023-10-14 06:34:35,254][100936] Updated weights for policy 0, policy_version 36760 (0.0009) +[2023-10-14 06:34:35,477][100917] Updated weights for policy 1, policy_version 36752 (0.0011) +[2023-10-14 06:34:35,863][100917] Updated weights for policy 1, policy_version 36762 (0.0011) +[2023-10-14 06:34:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 75300864. Throughput: 0: 1644.5, 1: 1645.3. Samples: 18838562. Policy #0 lag: (min: 1.0, avg: 18.2, max: 33.0) +[2023-10-14 06:34:38,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 06:34:39,214][100936] Updated weights for policy 0, policy_version 36770 (0.0008) +[2023-10-14 06:34:39,582][100936] Updated weights for policy 0, policy_version 36780 (0.0011) +[2023-10-14 06:34:39,958][100936] Updated weights for policy 0, policy_version 36790 (0.0008) +[2023-10-14 06:34:40,014][100917] Updated weights for policy 1, policy_version 36772 (0.0009) +[2023-10-14 06:34:40,328][100936] Updated weights for policy 0, policy_version 36800 (0.0007) +[2023-10-14 06:34:40,414][100917] Updated weights for policy 1, policy_version 36782 (0.0008) +[2023-10-14 06:34:40,795][100917] Updated weights for policy 1, policy_version 36792 (0.0007) +[2023-10-14 06:34:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 75366400. Throughput: 0: 1643.9, 1: 1639.9. Samples: 18847640. Policy #0 lag: (min: 1.0, avg: 18.2, max: 33.0) +[2023-10-14 06:34:43,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 06:34:44,566][100936] Updated weights for policy 0, policy_version 36810 (0.0009) +[2023-10-14 06:34:44,840][100917] Updated weights for policy 1, policy_version 36802 (0.0008) +[2023-10-14 06:34:44,930][100936] Updated weights for policy 0, policy_version 36820 (0.0011) +[2023-10-14 06:34:45,220][100917] Updated weights for policy 1, policy_version 36812 (0.0007) +[2023-10-14 06:34:45,296][100936] Updated weights for policy 0, policy_version 36830 (0.0010) +[2023-10-14 06:34:45,585][100917] Updated weights for policy 1, policy_version 36822 (0.0008) +[2023-10-14 06:34:45,961][100917] Updated weights for policy 1, policy_version 36832 (0.0008) +[2023-10-14 06:34:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 75431936. Throughput: 0: 1645.5, 1: 1646.7. Samples: 18867784. Policy #0 lag: (min: 31.0, avg: 32.7, max: 60.0) +[2023-10-14 06:34:48,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 06:34:49,315][100936] Updated weights for policy 0, policy_version 36840 (0.0007) +[2023-10-14 06:34:49,696][100936] Updated weights for policy 0, policy_version 36850 (0.0007) +[2023-10-14 06:34:50,065][100936] Updated weights for policy 0, policy_version 36860 (0.0007) +[2023-10-14 06:34:50,150][100917] Updated weights for policy 1, policy_version 36842 (0.0007) +[2023-10-14 06:34:50,530][100917] Updated weights for policy 1, policy_version 36852 (0.0007) +[2023-10-14 06:34:50,897][100917] Updated weights for policy 1, policy_version 36862 (0.0009) +[2023-10-14 06:34:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 75497472. Throughput: 0: 1650.4, 1: 1648.4. Samples: 18888266. Policy #0 lag: (min: 31.0, avg: 32.7, max: 60.0) +[2023-10-14 06:34:53,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 06:34:54,151][100936] Updated weights for policy 0, policy_version 36870 (0.0007) +[2023-10-14 06:34:54,529][100936] Updated weights for policy 0, policy_version 36880 (0.0008) +[2023-10-14 06:34:54,890][100936] Updated weights for policy 0, policy_version 36890 (0.0008) +[2023-10-14 06:34:55,001][100917] Updated weights for policy 1, policy_version 36872 (0.0007) +[2023-10-14 06:34:55,384][100917] Updated weights for policy 1, policy_version 36882 (0.0009) +[2023-10-14 06:34:55,748][100917] Updated weights for policy 1, policy_version 36892 (0.0011) +[2023-10-14 06:34:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 75563008. Throughput: 0: 1652.0, 1: 1642.1. Samples: 18897280. Policy #0 lag: (min: 31.0, avg: 32.7, max: 60.0) +[2023-10-14 06:34:58,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 06:34:59,198][100936] Updated weights for policy 0, policy_version 36900 (0.0009) +[2023-10-14 06:34:59,571][100936] Updated weights for policy 0, policy_version 36910 (0.0011) +[2023-10-14 06:34:59,831][100917] Updated weights for policy 1, policy_version 36902 (0.0007) +[2023-10-14 06:34:59,937][100936] Updated weights for policy 0, policy_version 36920 (0.0008) +[2023-10-14 06:35:00,192][100917] Updated weights for policy 1, policy_version 36912 (0.0009) +[2023-10-14 06:35:00,562][100917] Updated weights for policy 1, policy_version 36922 (0.0011) +[2023-10-14 06:35:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 75628544. Throughput: 0: 1654.1, 1: 1656.4. Samples: 18917660. Policy #0 lag: (min: 31.0, avg: 32.7, max: 60.0) +[2023-10-14 06:35:03,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 06:35:04,100][100936] Updated weights for policy 0, policy_version 36930 (0.0008) +[2023-10-14 06:35:04,469][100936] Updated weights for policy 0, policy_version 36940 (0.0008) +[2023-10-14 06:35:04,752][100917] Updated weights for policy 1, policy_version 36932 (0.0009) +[2023-10-14 06:35:04,846][100936] Updated weights for policy 0, policy_version 36950 (0.0007) +[2023-10-14 06:35:05,122][100917] Updated weights for policy 1, policy_version 36942 (0.0007) +[2023-10-14 06:35:05,202][100936] Updated weights for policy 0, policy_version 36960 (0.0008) +[2023-10-14 06:35:05,496][100917] Updated weights for policy 1, policy_version 36952 (0.0008) +[2023-10-14 06:35:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 75694080. Throughput: 0: 1658.7, 1: 1652.7. Samples: 18938146. Policy #0 lag: (min: 31.0, avg: 32.7, max: 60.0) +[2023-10-14 06:35:08,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 06:35:09,231][100936] Updated weights for policy 0, policy_version 36970 (0.0010) +[2023-10-14 06:35:09,547][100917] Updated weights for policy 1, policy_version 36962 (0.0008) +[2023-10-14 06:35:09,602][100936] Updated weights for policy 0, policy_version 36980 (0.0009) +[2023-10-14 06:35:09,932][100917] Updated weights for policy 1, policy_version 36972 (0.0011) +[2023-10-14 06:35:09,969][100936] Updated weights for policy 0, policy_version 36990 (0.0010) +[2023-10-14 06:35:10,295][100917] Updated weights for policy 1, policy_version 36982 (0.0009) +[2023-10-14 06:35:10,680][100917] Updated weights for policy 1, policy_version 36992 (0.0010) +[2023-10-14 06:35:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 75759616. Throughput: 0: 1657.4, 1: 1650.7. Samples: 18947060. Policy #0 lag: (min: 7.0, avg: 7.5, max: 22.0) +[2023-10-14 06:35:13,512][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 06:35:14,375][100936] Updated weights for policy 0, policy_version 37000 (0.0007) +[2023-10-14 06:35:14,741][100936] Updated weights for policy 0, policy_version 37010 (0.0008) +[2023-10-14 06:35:14,852][100917] Updated weights for policy 1, policy_version 37002 (0.0009) +[2023-10-14 06:35:15,111][100936] Updated weights for policy 0, policy_version 37020 (0.0008) +[2023-10-14 06:35:15,219][100917] Updated weights for policy 1, policy_version 37012 (0.0009) +[2023-10-14 06:35:15,594][100917] Updated weights for policy 1, policy_version 37022 (0.0008) +[2023-10-14 06:35:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 75825152. Throughput: 0: 1653.6, 1: 1657.8. Samples: 18967236. Policy #0 lag: (min: 7.0, avg: 7.5, max: 22.0) +[2023-10-14 06:35:18,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 06:35:19,248][100936] Updated weights for policy 0, policy_version 37030 (0.0007) +[2023-10-14 06:35:19,616][100917] Updated weights for policy 1, policy_version 37032 (0.0007) +[2023-10-14 06:35:19,619][100936] Updated weights for policy 0, policy_version 37040 (0.0008) +[2023-10-14 06:35:19,987][100936] Updated weights for policy 0, policy_version 37050 (0.0010) +[2023-10-14 06:35:19,993][100917] Updated weights for policy 1, policy_version 37042 (0.0007) +[2023-10-14 06:35:20,357][100917] Updated weights for policy 1, policy_version 37052 (0.0007) +[2023-10-14 06:35:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 75890688. Throughput: 0: 1661.0, 1: 1660.0. Samples: 18988010. Policy #0 lag: (min: 7.0, avg: 7.5, max: 22.0) +[2023-10-14 06:35:23,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 06:35:23,979][100936] Updated weights for policy 0, policy_version 37060 (0.0011) +[2023-10-14 06:35:24,346][100936] Updated weights for policy 0, policy_version 37070 (0.0009) +[2023-10-14 06:35:24,412][100917] Updated weights for policy 1, policy_version 37062 (0.0009) +[2023-10-14 06:35:24,706][100936] Updated weights for policy 0, policy_version 37080 (0.0009) +[2023-10-14 06:35:24,781][100917] Updated weights for policy 1, policy_version 37072 (0.0008) +[2023-10-14 06:35:25,146][100917] Updated weights for policy 1, policy_version 37082 (0.0009) +[2023-10-14 06:35:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 75956224. Throughput: 0: 1661.3, 1: 1659.2. Samples: 18997066. Policy #0 lag: (min: 7.0, avg: 7.5, max: 22.0) +[2023-10-14 06:35:28,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 06:35:28,668][100936] Updated weights for policy 0, policy_version 37090 (0.0009) +[2023-10-14 06:35:29,037][100936] Updated weights for policy 0, policy_version 37100 (0.0007) +[2023-10-14 06:35:29,403][100936] Updated weights for policy 0, policy_version 37110 (0.0008) +[2023-10-14 06:35:29,466][100917] Updated weights for policy 1, policy_version 37092 (0.0007) +[2023-10-14 06:35:29,768][100936] Updated weights for policy 0, policy_version 37120 (0.0008) +[2023-10-14 06:35:29,857][100917] Updated weights for policy 1, policy_version 37102 (0.0008) +[2023-10-14 06:35:30,231][100917] Updated weights for policy 1, policy_version 37112 (0.0007) +[2023-10-14 06:35:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 76021760. Throughput: 0: 1668.8, 1: 1656.7. Samples: 19017432. Policy #0 lag: (min: 7.0, avg: 7.5, max: 22.0) +[2023-10-14 06:35:33,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 06:35:33,902][100936] Updated weights for policy 0, policy_version 37130 (0.0009) +[2023-10-14 06:35:34,271][100936] Updated weights for policy 0, policy_version 37140 (0.0007) +[2023-10-14 06:35:34,409][100917] Updated weights for policy 1, policy_version 37122 (0.0009) +[2023-10-14 06:35:34,627][100936] Updated weights for policy 0, policy_version 37150 (0.0009) +[2023-10-14 06:35:34,771][100917] Updated weights for policy 1, policy_version 37132 (0.0009) +[2023-10-14 06:35:35,148][100917] Updated weights for policy 1, policy_version 37142 (0.0011) +[2023-10-14 06:35:35,522][100917] Updated weights for policy 1, policy_version 37152 (0.0009) +[2023-10-14 06:35:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 76087296. Throughput: 0: 1669.6, 1: 1655.7. Samples: 19037902. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:35:38,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 06:35:38,522][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000037152_38043648.pth... +[2023-10-14 06:35:38,557][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000035616_36470784.pth +[2023-10-14 06:35:38,713][100936] Updated weights for policy 0, policy_version 37160 (0.0009) +[2023-10-14 06:35:39,085][100936] Updated weights for policy 0, policy_version 37170 (0.0007) +[2023-10-14 06:35:39,467][100936] Updated weights for policy 0, policy_version 37180 (0.0007) +[2023-10-14 06:35:39,603][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000037184_38076416.pth... +[2023-10-14 06:35:39,638][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000035616_36470784.pth +[2023-10-14 06:35:39,722][100917] Updated weights for policy 1, policy_version 37162 (0.0009) +[2023-10-14 06:35:40,100][100917] Updated weights for policy 1, policy_version 37172 (0.0007) +[2023-10-14 06:35:40,472][100917] Updated weights for policy 1, policy_version 37182 (0.0008) +[2023-10-14 06:35:43,511][100936] Updated weights for policy 0, policy_version 37190 (0.0008) +[2023-10-14 06:35:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 76152832. Throughput: 0: 1669.1, 1: 1656.2. Samples: 19046918. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:35:43,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 06:35:43,884][100936] Updated weights for policy 0, policy_version 37200 (0.0007) +[2023-10-14 06:35:44,265][100936] Updated weights for policy 0, policy_version 37210 (0.0008) +[2023-10-14 06:35:44,568][100917] Updated weights for policy 1, policy_version 37192 (0.0009) +[2023-10-14 06:35:44,946][100917] Updated weights for policy 1, policy_version 37202 (0.0009) +[2023-10-14 06:35:45,316][100917] Updated weights for policy 1, policy_version 37212 (0.0010) +[2023-10-14 06:35:48,327][100936] Updated weights for policy 0, policy_version 37220 (0.0009) +[2023-10-14 06:35:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 76218368. Throughput: 0: 1673.6, 1: 1655.2. Samples: 19067452. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:35:48,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 06:35:48,693][100936] Updated weights for policy 0, policy_version 37230 (0.0008) +[2023-10-14 06:35:49,062][100936] Updated weights for policy 0, policy_version 37240 (0.0010) +[2023-10-14 06:35:49,363][100917] Updated weights for policy 1, policy_version 37222 (0.0010) +[2023-10-14 06:35:49,744][100917] Updated weights for policy 1, policy_version 37232 (0.0011) +[2023-10-14 06:35:50,107][100917] Updated weights for policy 1, policy_version 37242 (0.0009) +[2023-10-14 06:35:53,059][100936] Updated weights for policy 0, policy_version 37250 (0.0007) +[2023-10-14 06:35:53,430][100936] Updated weights for policy 0, policy_version 37260 (0.0007) +[2023-10-14 06:35:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 76283904. Throughput: 0: 1658.2, 1: 1662.3. Samples: 19087568. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:35:53,512][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 06:35:53,803][100936] Updated weights for policy 0, policy_version 37270 (0.0008) +[2023-10-14 06:35:54,033][100917] Updated weights for policy 1, policy_version 37252 (0.0007) +[2023-10-14 06:35:54,164][100936] Updated weights for policy 0, policy_version 37280 (0.0008) +[2023-10-14 06:35:54,408][100917] Updated weights for policy 1, policy_version 37262 (0.0009) +[2023-10-14 06:35:54,791][100917] Updated weights for policy 1, policy_version 37272 (0.0009) +[2023-10-14 06:35:58,415][100936] Updated weights for policy 0, policy_version 37290 (0.0008) +[2023-10-14 06:35:58,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 76349440. Throughput: 0: 1669.5, 1: 1667.8. Samples: 19097240. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:35:58,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 06:35:58,781][100936] Updated weights for policy 0, policy_version 37300 (0.0007) +[2023-10-14 06:35:58,905][100917] Updated weights for policy 1, policy_version 37282 (0.0007) +[2023-10-14 06:35:59,151][100936] Updated weights for policy 0, policy_version 37310 (0.0007) +[2023-10-14 06:35:59,268][100917] Updated weights for policy 1, policy_version 37292 (0.0008) +[2023-10-14 06:35:59,644][100917] Updated weights for policy 1, policy_version 37302 (0.0009) +[2023-10-14 06:36:00,011][100917] Updated weights for policy 1, policy_version 37312 (0.0007) +[2023-10-14 06:36:03,272][100936] Updated weights for policy 0, policy_version 37320 (0.0008) +[2023-10-14 06:36:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 76414976. Throughput: 0: 1670.1, 1: 1668.8. Samples: 19117488. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:36:03,512][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 06:36:03,640][100936] Updated weights for policy 0, policy_version 37330 (0.0010) +[2023-10-14 06:36:04,005][100936] Updated weights for policy 0, policy_version 37340 (0.0007) +[2023-10-14 06:36:04,056][100917] Updated weights for policy 1, policy_version 37322 (0.0008) +[2023-10-14 06:36:04,430][100917] Updated weights for policy 1, policy_version 37332 (0.0007) +[2023-10-14 06:36:04,799][100917] Updated weights for policy 1, policy_version 37342 (0.0007) +[2023-10-14 06:36:08,144][100936] Updated weights for policy 0, policy_version 37350 (0.0008) +[2023-10-14 06:36:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 76480512. Throughput: 0: 1654.2, 1: 1669.8. Samples: 19137592. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 06:36:08,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 06:36:08,517][100936] Updated weights for policy 0, policy_version 37360 (0.0008) +[2023-10-14 06:36:08,881][100917] Updated weights for policy 1, policy_version 37352 (0.0007) +[2023-10-14 06:36:08,890][100936] Updated weights for policy 0, policy_version 37370 (0.0009) +[2023-10-14 06:36:09,253][100917] Updated weights for policy 1, policy_version 37362 (0.0007) +[2023-10-14 06:36:09,631][100917] Updated weights for policy 1, policy_version 37372 (0.0008) +[2023-10-14 06:36:13,035][100936] Updated weights for policy 0, policy_version 37380 (0.0007) +[2023-10-14 06:36:13,404][100936] Updated weights for policy 0, policy_version 37390 (0.0007) +[2023-10-14 06:36:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 76546048. Throughput: 0: 1661.5, 1: 1672.0. Samples: 19147070. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 06:36:13,512][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 06:36:13,552][100917] Updated weights for policy 1, policy_version 37382 (0.0008) +[2023-10-14 06:36:13,780][100936] Updated weights for policy 0, policy_version 37400 (0.0007) +[2023-10-14 06:36:13,922][100917] Updated weights for policy 1, policy_version 37392 (0.0008) +[2023-10-14 06:36:14,286][100917] Updated weights for policy 1, policy_version 37402 (0.0009) +[2023-10-14 06:36:18,052][100936] Updated weights for policy 0, policy_version 37410 (0.0007) +[2023-10-14 06:36:18,426][100936] Updated weights for policy 0, policy_version 37420 (0.0008) +[2023-10-14 06:36:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 76611584. Throughput: 0: 1655.5, 1: 1675.2. Samples: 19167312. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 06:36:18,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 06:36:18,585][100917] Updated weights for policy 1, policy_version 37412 (0.0008) +[2023-10-14 06:36:18,797][100936] Updated weights for policy 0, policy_version 37430 (0.0008) +[2023-10-14 06:36:18,980][100917] Updated weights for policy 1, policy_version 37422 (0.0008) +[2023-10-14 06:36:19,156][100936] Updated weights for policy 0, policy_version 37440 (0.0008) +[2023-10-14 06:36:19,357][100917] Updated weights for policy 1, policy_version 37432 (0.0008) +[2023-10-14 06:36:23,325][100936] Updated weights for policy 0, policy_version 37450 (0.0007) +[2023-10-14 06:36:23,500][100917] Updated weights for policy 1, policy_version 37442 (0.0007) +[2023-10-14 06:36:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 76677120. Throughput: 0: 1642.9, 1: 1674.6. Samples: 19187188. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 06:36:23,512][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 06:36:23,697][100936] Updated weights for policy 0, policy_version 37460 (0.0007) +[2023-10-14 06:36:23,885][100917] Updated weights for policy 1, policy_version 37452 (0.0007) +[2023-10-14 06:36:24,059][100936] Updated weights for policy 0, policy_version 37470 (0.0007) +[2023-10-14 06:36:24,265][100917] Updated weights for policy 1, policy_version 37462 (0.0009) +[2023-10-14 06:36:24,644][100917] Updated weights for policy 1, policy_version 37472 (0.0007) +[2023-10-14 06:36:28,309][100936] Updated weights for policy 0, policy_version 37480 (0.0009) +[2023-10-14 06:36:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 76742656. Throughput: 0: 1653.6, 1: 1669.2. Samples: 19196442. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 06:36:28,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 06:36:28,691][100936] Updated weights for policy 0, policy_version 37490 (0.0008) +[2023-10-14 06:36:28,962][100917] Updated weights for policy 1, policy_version 37482 (0.0007) +[2023-10-14 06:36:29,055][100936] Updated weights for policy 0, policy_version 37500 (0.0008) +[2023-10-14 06:36:29,338][100917] Updated weights for policy 1, policy_version 37492 (0.0008) +[2023-10-14 06:36:29,715][100917] Updated weights for policy 1, policy_version 37502 (0.0008) +[2023-10-14 06:36:33,042][100936] Updated weights for policy 0, policy_version 37510 (0.0007) +[2023-10-14 06:36:33,407][100936] Updated weights for policy 0, policy_version 37520 (0.0007) +[2023-10-14 06:36:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 76808192. Throughput: 0: 1648.2, 1: 1669.6. Samples: 19216752. Policy #0 lag: (min: 13.0, avg: 35.8, max: 40.0) +[2023-10-14 06:36:33,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 06:36:33,788][100936] Updated weights for policy 0, policy_version 37530 (0.0008) +[2023-10-14 06:36:33,897][100917] Updated weights for policy 1, policy_version 37512 (0.0009) +[2023-10-14 06:36:34,280][100917] Updated weights for policy 1, policy_version 37522 (0.0007) +[2023-10-14 06:36:34,649][100917] Updated weights for policy 1, policy_version 37532 (0.0008) +[2023-10-14 06:36:37,934][100936] Updated weights for policy 0, policy_version 37540 (0.0009) +[2023-10-14 06:36:38,308][100936] Updated weights for policy 0, policy_version 37550 (0.0009) +[2023-10-14 06:36:38,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 76873728. Throughput: 0: 1650.3, 1: 1663.6. Samples: 19236696. Policy #0 lag: (min: 13.0, avg: 35.8, max: 40.0) +[2023-10-14 06:36:38,512][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 06:36:38,613][100917] Updated weights for policy 1, policy_version 37542 (0.0009) +[2023-10-14 06:36:38,680][100936] Updated weights for policy 0, policy_version 37560 (0.0009) +[2023-10-14 06:36:38,985][100917] Updated weights for policy 1, policy_version 37552 (0.0009) +[2023-10-14 06:36:39,367][100917] Updated weights for policy 1, policy_version 37562 (0.0008) +[2023-10-14 06:36:42,676][100936] Updated weights for policy 0, policy_version 37570 (0.0007) +[2023-10-14 06:36:43,053][100936] Updated weights for policy 0, policy_version 37580 (0.0008) +[2023-10-14 06:36:43,429][100936] Updated weights for policy 0, policy_version 37590 (0.0009) +[2023-10-14 06:36:43,434][100917] Updated weights for policy 1, policy_version 37572 (0.0007) +[2023-10-14 06:36:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 76939264. Throughput: 0: 1654.8, 1: 1660.1. Samples: 19246412. Policy #0 lag: (min: 13.0, avg: 35.8, max: 40.0) +[2023-10-14 06:36:43,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:36:43,793][100936] Updated weights for policy 0, policy_version 37600 (0.0008) +[2023-10-14 06:36:43,806][100917] Updated weights for policy 1, policy_version 37582 (0.0007) +[2023-10-14 06:36:44,178][100917] Updated weights for policy 1, policy_version 37592 (0.0010) +[2023-10-14 06:36:47,981][100936] Updated weights for policy 0, policy_version 37610 (0.0009) +[2023-10-14 06:36:48,200][100917] Updated weights for policy 1, policy_version 37602 (0.0009) +[2023-10-14 06:36:48,347][100936] Updated weights for policy 0, policy_version 37620 (0.0009) +[2023-10-14 06:36:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 77004800. Throughput: 0: 1656.8, 1: 1662.0. Samples: 19266834. Policy #0 lag: (min: 13.0, avg: 35.8, max: 40.0) +[2023-10-14 06:36:48,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:36:48,563][100917] Updated weights for policy 1, policy_version 37612 (0.0009) +[2023-10-14 06:36:48,724][100936] Updated weights for policy 0, policy_version 37630 (0.0009) +[2023-10-14 06:36:48,942][100917] Updated weights for policy 1, policy_version 37622 (0.0008) +[2023-10-14 06:36:49,308][100917] Updated weights for policy 1, policy_version 37632 (0.0008) +[2023-10-14 06:36:52,821][100936] Updated weights for policy 0, policy_version 37640 (0.0007) +[2023-10-14 06:36:53,187][100917] Updated weights for policy 1, policy_version 37642 (0.0008) +[2023-10-14 06:36:53,198][100936] Updated weights for policy 0, policy_version 37650 (0.0009) +[2023-10-14 06:36:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 77070336. Throughput: 0: 1645.3, 1: 1661.0. Samples: 19286374. Policy #0 lag: (min: 13.0, avg: 35.8, max: 40.0) +[2023-10-14 06:36:53,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:36:53,569][100917] Updated weights for policy 1, policy_version 37652 (0.0009) +[2023-10-14 06:36:53,570][100936] Updated weights for policy 0, policy_version 37660 (0.0009) +[2023-10-14 06:36:53,947][100917] Updated weights for policy 1, policy_version 37662 (0.0008) +[2023-10-14 06:36:57,881][100936] Updated weights for policy 0, policy_version 37670 (0.0009) +[2023-10-14 06:36:58,243][100936] Updated weights for policy 0, policy_version 37680 (0.0008) +[2023-10-14 06:36:58,249][100917] Updated weights for policy 1, policy_version 37672 (0.0010) +[2023-10-14 06:36:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 77135872. Throughput: 0: 1652.1, 1: 1656.7. Samples: 19295970. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:36:58,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:36:58,611][100936] Updated weights for policy 0, policy_version 37690 (0.0007) +[2023-10-14 06:36:58,624][100917] Updated weights for policy 1, policy_version 37682 (0.0008) +[2023-10-14 06:36:58,985][100917] Updated weights for policy 1, policy_version 37692 (0.0009) +[2023-10-14 06:37:02,791][100936] Updated weights for policy 0, policy_version 37700 (0.0008) +[2023-10-14 06:37:03,151][100917] Updated weights for policy 1, policy_version 37702 (0.0008) +[2023-10-14 06:37:03,167][100936] Updated weights for policy 0, policy_version 37710 (0.0007) +[2023-10-14 06:37:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 77201408. Throughput: 0: 1653.4, 1: 1659.5. Samples: 19316390. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:37:03,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:37:03,529][100917] Updated weights for policy 1, policy_version 37712 (0.0009) +[2023-10-14 06:37:03,538][100936] Updated weights for policy 0, policy_version 37720 (0.0008) +[2023-10-14 06:37:03,901][100917] Updated weights for policy 1, policy_version 37722 (0.0010) +[2023-10-14 06:37:07,683][100936] Updated weights for policy 0, policy_version 37730 (0.0009) +[2023-10-14 06:37:08,047][100936] Updated weights for policy 0, policy_version 37740 (0.0008) +[2023-10-14 06:37:08,082][100917] Updated weights for policy 1, policy_version 37732 (0.0008) +[2023-10-14 06:37:08,425][100936] Updated weights for policy 0, policy_version 37750 (0.0008) +[2023-10-14 06:37:08,448][100917] Updated weights for policy 1, policy_version 37742 (0.0010) +[2023-10-14 06:37:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 77266944. Throughput: 0: 1642.6, 1: 1659.9. Samples: 19335798. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:37:08,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:37:08,794][100936] Updated weights for policy 0, policy_version 37760 (0.0008) +[2023-10-14 06:37:08,827][100917] Updated weights for policy 1, policy_version 37752 (0.0008) +[2023-10-14 06:37:13,047][100917] Updated weights for policy 1, policy_version 37762 (0.0008) +[2023-10-14 06:37:13,123][100936] Updated weights for policy 0, policy_version 37770 (0.0008) +[2023-10-14 06:37:13,412][100917] Updated weights for policy 1, policy_version 37772 (0.0007) +[2023-10-14 06:37:13,503][100936] Updated weights for policy 0, policy_version 37780 (0.0009) +[2023-10-14 06:37:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13107.2). Total num frames: 77332480. Throughput: 0: 1646.1, 1: 1668.1. Samples: 19345582. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:37:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:37:13,781][100917] Updated weights for policy 1, policy_version 37782 (0.0008) +[2023-10-14 06:37:13,860][100936] Updated weights for policy 0, policy_version 37790 (0.0008) +[2023-10-14 06:37:14,152][100917] Updated weights for policy 1, policy_version 37792 (0.0007) +[2023-10-14 06:37:17,867][100936] Updated weights for policy 0, policy_version 37800 (0.0009) +[2023-10-14 06:37:18,226][100936] Updated weights for policy 0, policy_version 37810 (0.0008) +[2023-10-14 06:37:18,250][100917] Updated weights for policy 1, policy_version 37802 (0.0007) +[2023-10-14 06:37:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 77398016. Throughput: 0: 1650.4, 1: 1667.0. Samples: 19366032. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:37:18,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:37:18,597][100936] Updated weights for policy 0, policy_version 37820 (0.0008) +[2023-10-14 06:37:18,619][100917] Updated weights for policy 1, policy_version 37812 (0.0007) +[2023-10-14 06:37:19,007][100917] Updated weights for policy 1, policy_version 37822 (0.0009) +[2023-10-14 06:37:22,626][100936] Updated weights for policy 0, policy_version 37830 (0.0007) +[2023-10-14 06:37:22,989][100917] Updated weights for policy 1, policy_version 37832 (0.0008) +[2023-10-14 06:37:22,998][100936] Updated weights for policy 0, policy_version 37840 (0.0007) +[2023-10-14 06:37:23,364][100936] Updated weights for policy 0, policy_version 37850 (0.0008) +[2023-10-14 06:37:23,370][100917] Updated weights for policy 1, policy_version 37842 (0.0008) +[2023-10-14 06:37:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 77463552. Throughput: 0: 1639.2, 1: 1661.9. Samples: 19385246. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:37:23,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:37:23,731][100917] Updated weights for policy 1, policy_version 37852 (0.0008) +[2023-10-14 06:37:27,700][100917] Updated weights for policy 1, policy_version 37862 (0.0008) +[2023-10-14 06:37:27,712][100936] Updated weights for policy 0, policy_version 37860 (0.0009) +[2023-10-14 06:37:28,067][100917] Updated weights for policy 1, policy_version 37872 (0.0008) +[2023-10-14 06:37:28,082][100936] Updated weights for policy 0, policy_version 37870 (0.0008) +[2023-10-14 06:37:28,436][100917] Updated weights for policy 1, policy_version 37882 (0.0007) +[2023-10-14 06:37:28,445][100936] Updated weights for policy 0, policy_version 37880 (0.0009) +[2023-10-14 06:37:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 77529088. Throughput: 0: 1643.5, 1: 1669.4. Samples: 19395490. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:37:28,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:37:32,550][100917] Updated weights for policy 1, policy_version 37892 (0.0007) +[2023-10-14 06:37:32,635][100936] Updated weights for policy 0, policy_version 37890 (0.0008) +[2023-10-14 06:37:32,920][100917] Updated weights for policy 1, policy_version 37902 (0.0007) +[2023-10-14 06:37:33,011][100936] Updated weights for policy 0, policy_version 37900 (0.0009) +[2023-10-14 06:37:33,288][100917] Updated weights for policy 1, policy_version 37912 (0.0007) +[2023-10-14 06:37:33,380][100936] Updated weights for policy 0, policy_version 37910 (0.0009) +[2023-10-14 06:37:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 77594624. Throughput: 0: 1644.8, 1: 1665.4. Samples: 19415790. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:37:33,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:37:33,754][100936] Updated weights for policy 0, policy_version 37920 (0.0009) +[2023-10-14 06:37:37,340][100917] Updated weights for policy 1, policy_version 37922 (0.0008) +[2023-10-14 06:37:37,715][100917] Updated weights for policy 1, policy_version 37932 (0.0008) +[2023-10-14 06:37:37,904][100936] Updated weights for policy 0, policy_version 37930 (0.0009) +[2023-10-14 06:37:38,090][100917] Updated weights for policy 1, policy_version 37942 (0.0008) +[2023-10-14 06:37:38,271][100936] Updated weights for policy 0, policy_version 37940 (0.0007) +[2023-10-14 06:37:38,454][100917] Updated weights for policy 1, policy_version 37952 (0.0007) +[2023-10-14 06:37:38,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 77692928. Throughput: 0: 1644.5, 1: 1650.6. Samples: 19434654. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:37:38,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:37:38,518][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000037952_38862848.pth... +[2023-10-14 06:37:38,552][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000036384_37257216.pth +[2023-10-14 06:37:38,644][100936] Updated weights for policy 0, policy_version 37950 (0.0008) +[2023-10-14 06:37:38,716][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000037952_38862848.pth... +[2023-10-14 06:37:38,758][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000036384_37257216.pth +[2023-10-14 06:37:42,667][100917] Updated weights for policy 1, policy_version 37962 (0.0009) +[2023-10-14 06:37:42,811][100936] Updated weights for policy 0, policy_version 37960 (0.0008) +[2023-10-14 06:37:43,033][100917] Updated weights for policy 1, policy_version 37972 (0.0008) +[2023-10-14 06:37:43,174][100936] Updated weights for policy 0, policy_version 37970 (0.0008) +[2023-10-14 06:37:43,397][100917] Updated weights for policy 1, policy_version 37982 (0.0008) +[2023-10-14 06:37:43,512][99942] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 77758464. Throughput: 0: 1646.5, 1: 1667.8. Samples: 19445112. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:37:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:37:43,550][100936] Updated weights for policy 0, policy_version 37980 (0.0007) +[2023-10-14 06:37:47,643][100917] Updated weights for policy 1, policy_version 37992 (0.0009) +[2023-10-14 06:37:47,657][100936] Updated weights for policy 0, policy_version 37990 (0.0008) +[2023-10-14 06:37:48,016][100917] Updated weights for policy 1, policy_version 38002 (0.0009) +[2023-10-14 06:37:48,028][100936] Updated weights for policy 0, policy_version 38000 (0.0010) +[2023-10-14 06:37:48,384][100917] Updated weights for policy 1, policy_version 38012 (0.0008) +[2023-10-14 06:37:48,396][100936] Updated weights for policy 0, policy_version 38010 (0.0010) +[2023-10-14 06:37:48,512][99942] Fps is (10 sec: 9830.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 77791232. Throughput: 0: 1644.8, 1: 1663.0. Samples: 19465238. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:37:48,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:37:52,495][100936] Updated weights for policy 0, policy_version 38020 (0.0007) +[2023-10-14 06:37:52,754][100917] Updated weights for policy 1, policy_version 38022 (0.0008) +[2023-10-14 06:37:52,872][100936] Updated weights for policy 0, policy_version 38030 (0.0009) +[2023-10-14 06:37:53,144][100917] Updated weights for policy 1, policy_version 38032 (0.0008) +[2023-10-14 06:37:53,233][100936] Updated weights for policy 0, policy_version 38040 (0.0008) +[2023-10-14 06:37:53,512][99942] Fps is (10 sec: 9830.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 77856768. Throughput: 0: 1641.4, 1: 1651.6. Samples: 19483980. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:37:53,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:37:53,516][100917] Updated weights for policy 1, policy_version 38042 (0.0008) +[2023-10-14 06:37:57,521][100936] Updated weights for policy 0, policy_version 38050 (0.0007) +[2023-10-14 06:37:57,653][100917] Updated weights for policy 1, policy_version 38052 (0.0008) +[2023-10-14 06:37:57,923][100936] Updated weights for policy 0, policy_version 38060 (0.0008) +[2023-10-14 06:37:58,031][100917] Updated weights for policy 1, policy_version 38062 (0.0007) +[2023-10-14 06:37:58,293][100936] Updated weights for policy 0, policy_version 38070 (0.0009) +[2023-10-14 06:37:58,391][100917] Updated weights for policy 1, policy_version 38072 (0.0009) +[2023-10-14 06:37:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 77922304. Throughput: 0: 1651.9, 1: 1656.0. Samples: 19494436. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:37:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:37:58,667][100936] Updated weights for policy 0, policy_version 38080 (0.0008) +[2023-10-14 06:38:02,468][100917] Updated weights for policy 1, policy_version 38082 (0.0008) +[2023-10-14 06:38:02,835][100917] Updated weights for policy 1, policy_version 38092 (0.0007) +[2023-10-14 06:38:02,865][100936] Updated weights for policy 0, policy_version 38090 (0.0009) +[2023-10-14 06:38:03,213][100917] Updated weights for policy 1, policy_version 38102 (0.0007) +[2023-10-14 06:38:03,245][100936] Updated weights for policy 0, policy_version 38100 (0.0008) +[2023-10-14 06:38:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 77987840. Throughput: 0: 1646.0, 1: 1653.6. Samples: 19514514. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:38:03,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:38:03,583][100917] Updated weights for policy 1, policy_version 38112 (0.0009) +[2023-10-14 06:38:03,618][100936] Updated weights for policy 0, policy_version 38110 (0.0009) +[2023-10-14 06:38:07,634][100936] Updated weights for policy 0, policy_version 38120 (0.0007) +[2023-10-14 06:38:07,722][100917] Updated weights for policy 1, policy_version 38122 (0.0010) +[2023-10-14 06:38:07,997][100936] Updated weights for policy 0, policy_version 38130 (0.0007) +[2023-10-14 06:38:08,090][100917] Updated weights for policy 1, policy_version 38132 (0.0007) +[2023-10-14 06:38:08,373][100936] Updated weights for policy 0, policy_version 38140 (0.0007) +[2023-10-14 06:38:08,462][100917] Updated weights for policy 1, policy_version 38142 (0.0008) +[2023-10-14 06:38:08,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 78086144. Throughput: 0: 1645.1, 1: 1645.6. Samples: 19533326. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:38:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:38:12,530][100936] Updated weights for policy 0, policy_version 38150 (0.0008) +[2023-10-14 06:38:12,688][100917] Updated weights for policy 1, policy_version 38152 (0.0008) +[2023-10-14 06:38:12,900][100936] Updated weights for policy 0, policy_version 38160 (0.0009) +[2023-10-14 06:38:13,052][100917] Updated weights for policy 1, policy_version 38162 (0.0009) +[2023-10-14 06:38:13,275][100936] Updated weights for policy 0, policy_version 38170 (0.0008) +[2023-10-14 06:38:13,426][100917] Updated weights for policy 1, policy_version 38172 (0.0010) +[2023-10-14 06:38:13,512][99942] Fps is (10 sec: 16383.8, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 78151680. Throughput: 0: 1648.4, 1: 1648.8. Samples: 19543866. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:38:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:38:17,398][100936] Updated weights for policy 0, policy_version 38180 (0.0008) +[2023-10-14 06:38:17,727][100917] Updated weights for policy 1, policy_version 38182 (0.0009) +[2023-10-14 06:38:17,766][100936] Updated weights for policy 0, policy_version 38190 (0.0007) +[2023-10-14 06:38:18,100][100917] Updated weights for policy 1, policy_version 38192 (0.0008) +[2023-10-14 06:38:18,139][100936] Updated weights for policy 0, policy_version 38200 (0.0009) +[2023-10-14 06:38:18,465][100917] Updated weights for policy 1, policy_version 38202 (0.0010) +[2023-10-14 06:38:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 78217216. Throughput: 0: 1646.7, 1: 1647.5. Samples: 19564026. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:38:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:38:22,300][100936] Updated weights for policy 0, policy_version 38210 (0.0009) +[2023-10-14 06:38:22,576][100917] Updated weights for policy 1, policy_version 38212 (0.0009) +[2023-10-14 06:38:22,674][100936] Updated weights for policy 0, policy_version 38220 (0.0007) +[2023-10-14 06:38:22,939][100917] Updated weights for policy 1, policy_version 38222 (0.0008) +[2023-10-14 06:38:23,035][100936] Updated weights for policy 0, policy_version 38230 (0.0007) +[2023-10-14 06:38:23,317][100917] Updated weights for policy 1, policy_version 38232 (0.0008) +[2023-10-14 06:38:23,405][100936] Updated weights for policy 0, policy_version 38240 (0.0007) +[2023-10-14 06:38:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 78282752. Throughput: 0: 1646.8, 1: 1648.1. Samples: 19582924. Policy #0 lag: (min: 26.0, avg: 26.0, max: 29.0) +[2023-10-14 06:38:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:38:27,314][100917] Updated weights for policy 1, policy_version 38242 (0.0009) +[2023-10-14 06:38:27,608][100936] Updated weights for policy 0, policy_version 38250 (0.0008) +[2023-10-14 06:38:27,687][100917] Updated weights for policy 1, policy_version 38252 (0.0009) +[2023-10-14 06:38:27,984][100936] Updated weights for policy 0, policy_version 38260 (0.0007) +[2023-10-14 06:38:28,068][100917] Updated weights for policy 1, policy_version 38262 (0.0010) +[2023-10-14 06:38:28,352][100936] Updated weights for policy 0, policy_version 38270 (0.0008) +[2023-10-14 06:38:28,437][100917] Updated weights for policy 1, policy_version 38272 (0.0008) +[2023-10-14 06:38:28,512][99942] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13329.4). Total num frames: 78381056. Throughput: 0: 1654.0, 1: 1645.2. Samples: 19593576. Policy #0 lag: (min: 26.0, avg: 26.0, max: 29.0) +[2023-10-14 06:38:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:38:32,463][100936] Updated weights for policy 0, policy_version 38280 (0.0008) +[2023-10-14 06:38:32,720][100917] Updated weights for policy 1, policy_version 38282 (0.0007) +[2023-10-14 06:38:32,831][100936] Updated weights for policy 0, policy_version 38290 (0.0010) +[2023-10-14 06:38:33,092][100917] Updated weights for policy 1, policy_version 38292 (0.0008) +[2023-10-14 06:38:33,214][100936] Updated weights for policy 0, policy_version 38300 (0.0009) +[2023-10-14 06:38:33,469][100917] Updated weights for policy 1, policy_version 38302 (0.0009) +[2023-10-14 06:38:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 78413824. Throughput: 0: 1650.2, 1: 1645.8. Samples: 19613560. Policy #0 lag: (min: 26.0, avg: 26.0, max: 29.0) +[2023-10-14 06:38:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:38:37,338][100936] Updated weights for policy 0, policy_version 38310 (0.0008) +[2023-10-14 06:38:37,495][100917] Updated weights for policy 1, policy_version 38312 (0.0008) +[2023-10-14 06:38:37,699][100936] Updated weights for policy 0, policy_version 38320 (0.0007) +[2023-10-14 06:38:37,865][100917] Updated weights for policy 1, policy_version 38322 (0.0010) +[2023-10-14 06:38:38,072][100936] Updated weights for policy 0, policy_version 38330 (0.0007) +[2023-10-14 06:38:38,234][100917] Updated weights for policy 1, policy_version 38332 (0.0009) +[2023-10-14 06:38:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 78512128. Throughput: 0: 1650.1, 1: 1641.3. Samples: 19632094. Policy #0 lag: (min: 26.0, avg: 26.0, max: 29.0) +[2023-10-14 06:38:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:38:42,298][100936] Updated weights for policy 0, policy_version 38340 (0.0008) +[2023-10-14 06:38:42,458][100917] Updated weights for policy 1, policy_version 38342 (0.0009) +[2023-10-14 06:38:42,679][100936] Updated weights for policy 0, policy_version 38350 (0.0007) +[2023-10-14 06:38:42,827][100917] Updated weights for policy 1, policy_version 38352 (0.0009) +[2023-10-14 06:38:43,039][100936] Updated weights for policy 0, policy_version 38360 (0.0007) +[2023-10-14 06:38:43,201][100917] Updated weights for policy 1, policy_version 38362 (0.0009) +[2023-10-14 06:38:43,512][99942] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 78577664. Throughput: 0: 1653.7, 1: 1647.5. Samples: 19642988. Policy #0 lag: (min: 26.0, avg: 26.0, max: 29.0) +[2023-10-14 06:38:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:38:47,168][100936] Updated weights for policy 0, policy_version 38370 (0.0008) +[2023-10-14 06:38:47,318][100917] Updated weights for policy 1, policy_version 38372 (0.0008) +[2023-10-14 06:38:47,540][100936] Updated weights for policy 0, policy_version 38380 (0.0008) +[2023-10-14 06:38:47,688][100917] Updated weights for policy 1, policy_version 38382 (0.0008) +[2023-10-14 06:38:47,903][100936] Updated weights for policy 0, policy_version 38390 (0.0010) +[2023-10-14 06:38:48,062][100917] Updated weights for policy 1, policy_version 38392 (0.0008) +[2023-10-14 06:38:48,273][100936] Updated weights for policy 0, policy_version 38400 (0.0008) +[2023-10-14 06:38:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13329.4). Total num frames: 78643200. Throughput: 0: 1646.4, 1: 1641.5. Samples: 19662468. Policy #0 lag: (min: 26.0, avg: 26.0, max: 29.0) +[2023-10-14 06:38:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:38:52,086][100917] Updated weights for policy 1, policy_version 38402 (0.0009) +[2023-10-14 06:38:52,399][100936] Updated weights for policy 0, policy_version 38410 (0.0008) +[2023-10-14 06:38:52,451][100917] Updated weights for policy 1, policy_version 38412 (0.0007) +[2023-10-14 06:38:52,763][100936] Updated weights for policy 0, policy_version 38420 (0.0009) +[2023-10-14 06:38:52,824][100917] Updated weights for policy 1, policy_version 38422 (0.0010) +[2023-10-14 06:38:53,131][100936] Updated weights for policy 0, policy_version 38430 (0.0008) +[2023-10-14 06:38:53,184][100917] Updated weights for policy 1, policy_version 38432 (0.0007) +[2023-10-14 06:38:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13329.4). Total num frames: 78708736. Throughput: 0: 1644.4, 1: 1635.0. Samples: 19680900. Policy #0 lag: (min: 20.0, avg: 27.5, max: 52.0) +[2023-10-14 06:38:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:38:57,304][100936] Updated weights for policy 0, policy_version 38440 (0.0008) +[2023-10-14 06:38:57,494][100917] Updated weights for policy 1, policy_version 38442 (0.0008) +[2023-10-14 06:38:57,673][100936] Updated weights for policy 0, policy_version 38450 (0.0009) +[2023-10-14 06:38:57,862][100917] Updated weights for policy 1, policy_version 38452 (0.0009) +[2023-10-14 06:38:58,039][100936] Updated weights for policy 0, policy_version 38460 (0.0008) +[2023-10-14 06:38:58,240][100917] Updated weights for policy 1, policy_version 38462 (0.0009) +[2023-10-14 06:38:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13329.4). Total num frames: 78774272. Throughput: 0: 1647.1, 1: 1642.8. Samples: 19691910. Policy #0 lag: (min: 20.0, avg: 27.5, max: 52.0) +[2023-10-14 06:38:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:39:02,154][100936] Updated weights for policy 0, policy_version 38470 (0.0007) +[2023-10-14 06:39:02,276][100917] Updated weights for policy 1, policy_version 38472 (0.0008) +[2023-10-14 06:39:02,529][100936] Updated weights for policy 0, policy_version 38480 (0.0009) +[2023-10-14 06:39:02,651][100917] Updated weights for policy 1, policy_version 38482 (0.0008) +[2023-10-14 06:39:02,891][100936] Updated weights for policy 0, policy_version 38490 (0.0009) +[2023-10-14 06:39:03,018][100917] Updated weights for policy 1, policy_version 38492 (0.0007) +[2023-10-14 06:39:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13329.4). Total num frames: 78839808. Throughput: 0: 1638.2, 1: 1646.3. Samples: 19711828. Policy #0 lag: (min: 20.0, avg: 27.5, max: 52.0) +[2023-10-14 06:39:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:39:07,085][100936] Updated weights for policy 0, policy_version 38500 (0.0007) +[2023-10-14 06:39:07,135][100917] Updated weights for policy 1, policy_version 38502 (0.0010) +[2023-10-14 06:39:07,443][100936] Updated weights for policy 0, policy_version 38510 (0.0010) +[2023-10-14 06:39:07,501][100917] Updated weights for policy 1, policy_version 38512 (0.0008) +[2023-10-14 06:39:07,820][100936] Updated weights for policy 0, policy_version 38520 (0.0009) +[2023-10-14 06:39:07,879][100917] Updated weights for policy 1, policy_version 38522 (0.0009) +[2023-10-14 06:39:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 78905344. Throughput: 0: 1637.6, 1: 1638.0. Samples: 19730328. Policy #0 lag: (min: 20.0, avg: 27.5, max: 52.0) +[2023-10-14 06:39:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:39:11,930][100936] Updated weights for policy 0, policy_version 38530 (0.0008) +[2023-10-14 06:39:11,967][100917] Updated weights for policy 1, policy_version 38532 (0.0009) +[2023-10-14 06:39:12,301][100936] Updated weights for policy 0, policy_version 38540 (0.0008) +[2023-10-14 06:39:12,336][100917] Updated weights for policy 1, policy_version 38542 (0.0007) +[2023-10-14 06:39:12,667][100936] Updated weights for policy 0, policy_version 38550 (0.0009) +[2023-10-14 06:39:12,702][100917] Updated weights for policy 1, policy_version 38552 (0.0007) +[2023-10-14 06:39:13,038][100936] Updated weights for policy 0, policy_version 38560 (0.0008) +[2023-10-14 06:39:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 78970880. Throughput: 0: 1643.2, 1: 1651.0. Samples: 19741814. Policy #0 lag: (min: 20.0, avg: 27.5, max: 52.0) +[2023-10-14 06:39:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:39:16,992][100917] Updated weights for policy 1, policy_version 38562 (0.0009) +[2023-10-14 06:39:17,197][100936] Updated weights for policy 0, policy_version 38570 (0.0007) +[2023-10-14 06:39:17,364][100917] Updated weights for policy 1, policy_version 38572 (0.0008) +[2023-10-14 06:39:17,564][100936] Updated weights for policy 0, policy_version 38580 (0.0007) +[2023-10-14 06:39:17,731][100917] Updated weights for policy 1, policy_version 38582 (0.0008) +[2023-10-14 06:39:17,928][100936] Updated weights for policy 0, policy_version 38590 (0.0007) +[2023-10-14 06:39:18,101][100917] Updated weights for policy 1, policy_version 38592 (0.0009) +[2023-10-14 06:39:18,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 79036416. Throughput: 0: 1635.5, 1: 1648.9. Samples: 19761358. Policy #0 lag: (min: 20.0, avg: 27.5, max: 52.0) +[2023-10-14 06:39:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:39:22,077][100936] Updated weights for policy 0, policy_version 38600 (0.0007) +[2023-10-14 06:39:22,451][100917] Updated weights for policy 1, policy_version 38602 (0.0008) +[2023-10-14 06:39:22,457][100936] Updated weights for policy 0, policy_version 38610 (0.0007) +[2023-10-14 06:39:22,838][100936] Updated weights for policy 0, policy_version 38620 (0.0008) +[2023-10-14 06:39:22,838][100917] Updated weights for policy 1, policy_version 38612 (0.0008) +[2023-10-14 06:39:23,211][100917] Updated weights for policy 1, policy_version 38622 (0.0011) +[2023-10-14 06:39:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 79101952. Throughput: 0: 1647.5, 1: 1641.5. Samples: 19780100. Policy #0 lag: (min: 27.0, avg: 27.0, max: 31.0) +[2023-10-14 06:39:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:39:26,912][100936] Updated weights for policy 0, policy_version 38630 (0.0009) +[2023-10-14 06:39:27,285][100936] Updated weights for policy 0, policy_version 38640 (0.0008) +[2023-10-14 06:39:27,287][100917] Updated weights for policy 1, policy_version 38632 (0.0008) +[2023-10-14 06:39:27,653][100936] Updated weights for policy 0, policy_version 38650 (0.0008) +[2023-10-14 06:39:27,658][100917] Updated weights for policy 1, policy_version 38642 (0.0008) +[2023-10-14 06:39:28,037][100917] Updated weights for policy 1, policy_version 38652 (0.0010) +[2023-10-14 06:39:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 79167488. Throughput: 0: 1650.9, 1: 1645.2. Samples: 19791310. Policy #0 lag: (min: 27.0, avg: 27.0, max: 31.0) +[2023-10-14 06:39:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:39:31,846][100936] Updated weights for policy 0, policy_version 38660 (0.0008) +[2023-10-14 06:39:32,119][100917] Updated weights for policy 1, policy_version 38662 (0.0010) +[2023-10-14 06:39:32,240][100936] Updated weights for policy 0, policy_version 38670 (0.0007) +[2023-10-14 06:39:32,501][100917] Updated weights for policy 1, policy_version 38672 (0.0008) +[2023-10-14 06:39:32,621][100936] Updated weights for policy 0, policy_version 38680 (0.0008) +[2023-10-14 06:39:32,872][100917] Updated weights for policy 1, policy_version 38682 (0.0008) +[2023-10-14 06:39:33,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 79233024. Throughput: 0: 1640.4, 1: 1655.5. Samples: 19810784. Policy #0 lag: (min: 27.0, avg: 27.0, max: 31.0) +[2023-10-14 06:39:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:39:36,736][100936] Updated weights for policy 0, policy_version 38690 (0.0008) +[2023-10-14 06:39:36,994][100917] Updated weights for policy 1, policy_version 38692 (0.0008) +[2023-10-14 06:39:37,106][100936] Updated weights for policy 0, policy_version 38700 (0.0009) +[2023-10-14 06:39:37,376][100917] Updated weights for policy 1, policy_version 38702 (0.0008) +[2023-10-14 06:39:37,484][100936] Updated weights for policy 0, policy_version 38710 (0.0009) +[2023-10-14 06:39:37,735][100917] Updated weights for policy 1, policy_version 38712 (0.0008) +[2023-10-14 06:39:37,845][100936] Updated weights for policy 0, policy_version 38720 (0.0007) +[2023-10-14 06:39:38,512][99942] Fps is (10 sec: 13106.6, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 79298560. Throughput: 0: 1650.2, 1: 1650.7. Samples: 19829442. Policy #0 lag: (min: 27.0, avg: 27.0, max: 31.0) +[2023-10-14 06:39:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:39:38,526][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000038720_39649280.pth... +[2023-10-14 06:39:38,526][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000038720_39649280.pth... +[2023-10-14 06:39:38,563][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000037184_38076416.pth +[2023-10-14 06:39:38,567][100560] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p0/milestones/checkpoint_000038720_39649280.pth +[2023-10-14 06:39:38,567][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000037152_38043648.pth +[2023-10-14 06:39:38,572][100681] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p1/milestones/checkpoint_000038720_39649280.pth +[2023-10-14 06:39:41,783][100917] Updated weights for policy 1, policy_version 38722 (0.0009) +[2023-10-14 06:39:42,082][100936] Updated weights for policy 0, policy_version 38730 (0.0008) +[2023-10-14 06:39:42,143][100917] Updated weights for policy 1, policy_version 38732 (0.0008) +[2023-10-14 06:39:42,451][100936] Updated weights for policy 0, policy_version 38740 (0.0010) +[2023-10-14 06:39:42,513][100917] Updated weights for policy 1, policy_version 38742 (0.0009) +[2023-10-14 06:39:42,818][100936] Updated weights for policy 0, policy_version 38750 (0.0007) +[2023-10-14 06:39:42,886][100917] Updated weights for policy 1, policy_version 38752 (0.0008) +[2023-10-14 06:39:43,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 79364096. Throughput: 0: 1650.4, 1: 1656.0. Samples: 19840700. Policy #0 lag: (min: 27.0, avg: 27.0, max: 31.0) +[2023-10-14 06:39:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:39:46,908][100917] Updated weights for policy 1, policy_version 38762 (0.0009) +[2023-10-14 06:39:47,027][100936] Updated weights for policy 0, policy_version 38760 (0.0008) +[2023-10-14 06:39:47,274][100917] Updated weights for policy 1, policy_version 38772 (0.0008) +[2023-10-14 06:39:47,393][100936] Updated weights for policy 0, policy_version 38770 (0.0009) +[2023-10-14 06:39:47,652][100917] Updated weights for policy 1, policy_version 38782 (0.0007) +[2023-10-14 06:39:47,773][100936] Updated weights for policy 0, policy_version 38780 (0.0007) +[2023-10-14 06:39:48,512][99942] Fps is (10 sec: 13107.7, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 79429632. Throughput: 0: 1644.2, 1: 1648.8. Samples: 19860014. Policy #0 lag: (min: 27.0, avg: 27.0, max: 31.0) +[2023-10-14 06:39:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:39:51,727][100917] Updated weights for policy 1, policy_version 38792 (0.0007) +[2023-10-14 06:39:52,037][100936] Updated weights for policy 0, policy_version 38790 (0.0009) +[2023-10-14 06:39:52,087][100917] Updated weights for policy 1, policy_version 38802 (0.0008) +[2023-10-14 06:39:52,415][100936] Updated weights for policy 0, policy_version 38800 (0.0007) +[2023-10-14 06:39:52,467][100917] Updated weights for policy 1, policy_version 38812 (0.0008) +[2023-10-14 06:39:52,782][100936] Updated weights for policy 0, policy_version 38810 (0.0007) +[2023-10-14 06:39:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 79495168. Throughput: 0: 1648.9, 1: 1652.5. Samples: 19878890. Policy #0 lag: (min: 3.0, avg: 6.6, max: 35.0) +[2023-10-14 06:39:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:39:56,588][100917] Updated weights for policy 1, policy_version 38822 (0.0007) +[2023-10-14 06:39:56,715][100936] Updated weights for policy 0, policy_version 38820 (0.0009) +[2023-10-14 06:39:56,959][100917] Updated weights for policy 1, policy_version 38832 (0.0011) +[2023-10-14 06:39:57,097][100936] Updated weights for policy 0, policy_version 38830 (0.0009) +[2023-10-14 06:39:57,342][100917] Updated weights for policy 1, policy_version 38842 (0.0008) +[2023-10-14 06:39:57,459][100936] Updated weights for policy 0, policy_version 38840 (0.0009) +[2023-10-14 06:39:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 79560704. Throughput: 0: 1648.8, 1: 1656.9. Samples: 19890572. Policy #0 lag: (min: 3.0, avg: 6.6, max: 35.0) +[2023-10-14 06:39:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:40:01,505][100917] Updated weights for policy 1, policy_version 38852 (0.0008) +[2023-10-14 06:40:01,700][100936] Updated weights for policy 0, policy_version 38850 (0.0009) +[2023-10-14 06:40:01,875][100917] Updated weights for policy 1, policy_version 38862 (0.0008) +[2023-10-14 06:40:02,075][100936] Updated weights for policy 0, policy_version 38860 (0.0007) +[2023-10-14 06:40:02,234][100917] Updated weights for policy 1, policy_version 38872 (0.0009) +[2023-10-14 06:40:02,441][100936] Updated weights for policy 0, policy_version 38870 (0.0008) +[2023-10-14 06:40:02,813][100936] Updated weights for policy 0, policy_version 38880 (0.0007) +[2023-10-14 06:40:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 79626240. Throughput: 0: 1643.8, 1: 1651.6. Samples: 19909648. Policy #0 lag: (min: 3.0, avg: 6.6, max: 35.0) +[2023-10-14 06:40:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:40:06,327][100917] Updated weights for policy 1, policy_version 38882 (0.0009) +[2023-10-14 06:40:06,699][100917] Updated weights for policy 1, policy_version 38892 (0.0009) +[2023-10-14 06:40:07,017][100936] Updated weights for policy 0, policy_version 38890 (0.0008) +[2023-10-14 06:40:07,075][100917] Updated weights for policy 1, policy_version 38902 (0.0009) +[2023-10-14 06:40:07,375][100936] Updated weights for policy 0, policy_version 38900 (0.0007) +[2023-10-14 06:40:07,441][100917] Updated weights for policy 1, policy_version 38912 (0.0009) +[2023-10-14 06:40:07,748][100936] Updated weights for policy 0, policy_version 38910 (0.0008) +[2023-10-14 06:40:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 79691776. Throughput: 0: 1643.5, 1: 1666.2. Samples: 19929036. Policy #0 lag: (min: 3.0, avg: 6.6, max: 35.0) +[2023-10-14 06:40:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:40:11,600][100917] Updated weights for policy 1, policy_version 38922 (0.0008) +[2023-10-14 06:40:11,877][100936] Updated weights for policy 0, policy_version 38920 (0.0009) +[2023-10-14 06:40:11,979][100917] Updated weights for policy 1, policy_version 38932 (0.0009) +[2023-10-14 06:40:12,248][100936] Updated weights for policy 0, policy_version 38930 (0.0008) +[2023-10-14 06:40:12,359][100917] Updated weights for policy 1, policy_version 38942 (0.0008) +[2023-10-14 06:40:12,611][100936] Updated weights for policy 0, policy_version 38940 (0.0008) +[2023-10-14 06:40:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 79757312. Throughput: 0: 1639.4, 1: 1676.8. Samples: 19940540. Policy #0 lag: (min: 3.0, avg: 6.6, max: 35.0) +[2023-10-14 06:40:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:40:16,453][100917] Updated weights for policy 1, policy_version 38952 (0.0009) +[2023-10-14 06:40:16,835][100917] Updated weights for policy 1, policy_version 38962 (0.0008) +[2023-10-14 06:40:16,991][100936] Updated weights for policy 0, policy_version 38950 (0.0009) +[2023-10-14 06:40:17,194][100917] Updated weights for policy 1, policy_version 38972 (0.0007) +[2023-10-14 06:40:17,376][100936] Updated weights for policy 0, policy_version 38960 (0.0009) +[2023-10-14 06:40:17,745][100936] Updated weights for policy 0, policy_version 38970 (0.0009) +[2023-10-14 06:40:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 79822848. Throughput: 0: 1641.1, 1: 1658.8. Samples: 19959278. Policy #0 lag: (min: 16.0, avg: 41.1, max: 48.0) +[2023-10-14 06:40:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:40:21,383][100917] Updated weights for policy 1, policy_version 38982 (0.0009) +[2023-10-14 06:40:21,766][100917] Updated weights for policy 1, policy_version 38992 (0.0009) +[2023-10-14 06:40:21,867][100936] Updated weights for policy 0, policy_version 38980 (0.0009) +[2023-10-14 06:40:22,129][100917] Updated weights for policy 1, policy_version 39002 (0.0007) +[2023-10-14 06:40:22,243][100936] Updated weights for policy 0, policy_version 38990 (0.0008) +[2023-10-14 06:40:22,612][100936] Updated weights for policy 0, policy_version 39000 (0.0007) +[2023-10-14 06:40:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 79888384. Throughput: 0: 1640.5, 1: 1677.1. Samples: 19978734. Policy #0 lag: (min: 16.0, avg: 41.1, max: 48.0) +[2023-10-14 06:40:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:40:26,035][100917] Updated weights for policy 1, policy_version 39012 (0.0007) +[2023-10-14 06:40:26,396][100917] Updated weights for policy 1, policy_version 39022 (0.0008) +[2023-10-14 06:40:26,695][100936] Updated weights for policy 0, policy_version 39010 (0.0008) +[2023-10-14 06:40:26,776][100917] Updated weights for policy 1, policy_version 39032 (0.0009) +[2023-10-14 06:40:27,067][100936] Updated weights for policy 0, policy_version 39020 (0.0007) +[2023-10-14 06:40:27,438][100936] Updated weights for policy 0, policy_version 39030 (0.0008) +[2023-10-14 06:40:27,803][100936] Updated weights for policy 0, policy_version 39040 (0.0010) +[2023-10-14 06:40:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 79953920. Throughput: 0: 1642.8, 1: 1680.6. Samples: 19990254. Policy #0 lag: (min: 16.0, avg: 41.1, max: 48.0) +[2023-10-14 06:40:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:40:30,680][100917] Updated weights for policy 1, policy_version 39042 (0.0008) +[2023-10-14 06:40:31,048][100917] Updated weights for policy 1, policy_version 39052 (0.0009) +[2023-10-14 06:40:31,422][100917] Updated weights for policy 1, policy_version 39062 (0.0009) +[2023-10-14 06:40:31,802][100917] Updated weights for policy 1, policy_version 39072 (0.0010) +[2023-10-14 06:40:32,081][100936] Updated weights for policy 0, policy_version 39050 (0.0010) +[2023-10-14 06:40:32,446][100936] Updated weights for policy 0, policy_version 39060 (0.0008) +[2023-10-14 06:40:32,820][100936] Updated weights for policy 0, policy_version 39070 (0.0011) +[2023-10-14 06:40:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 80019456. Throughput: 0: 1642.8, 1: 1664.2. Samples: 20008830. Policy #0 lag: (min: 16.0, avg: 41.1, max: 48.0) +[2023-10-14 06:40:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:40:35,880][100917] Updated weights for policy 1, policy_version 39082 (0.0010) +[2023-10-14 06:40:36,242][100917] Updated weights for policy 1, policy_version 39092 (0.0009) +[2023-10-14 06:40:36,615][100917] Updated weights for policy 1, policy_version 39102 (0.0009) +[2023-10-14 06:40:36,935][100936] Updated weights for policy 0, policy_version 39080 (0.0008) +[2023-10-14 06:40:37,299][100936] Updated weights for policy 0, policy_version 39090 (0.0008) +[2023-10-14 06:40:37,669][100936] Updated weights for policy 0, policy_version 39100 (0.0007) +[2023-10-14 06:40:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 80084992. Throughput: 0: 1648.2, 1: 1683.2. Samples: 20028804. Policy #0 lag: (min: 16.0, avg: 41.1, max: 48.0) +[2023-10-14 06:40:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:40:40,727][100917] Updated weights for policy 1, policy_version 39112 (0.0010) +[2023-10-14 06:40:41,091][100917] Updated weights for policy 1, policy_version 39122 (0.0011) +[2023-10-14 06:40:41,465][100917] Updated weights for policy 1, policy_version 39132 (0.0009) +[2023-10-14 06:40:41,934][100936] Updated weights for policy 0, policy_version 39110 (0.0008) +[2023-10-14 06:40:42,293][100936] Updated weights for policy 0, policy_version 39120 (0.0008) +[2023-10-14 06:40:42,665][100936] Updated weights for policy 0, policy_version 39130 (0.0007) +[2023-10-14 06:40:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 80150528. Throughput: 0: 1645.7, 1: 1671.6. Samples: 20039848. Policy #0 lag: (min: 16.0, avg: 41.1, max: 48.0) +[2023-10-14 06:40:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:40:45,668][100917] Updated weights for policy 1, policy_version 39142 (0.0011) +[2023-10-14 06:40:46,044][100917] Updated weights for policy 1, policy_version 39152 (0.0009) +[2023-10-14 06:40:46,416][100917] Updated weights for policy 1, policy_version 39162 (0.0011) +[2023-10-14 06:40:46,758][100936] Updated weights for policy 0, policy_version 39140 (0.0009) +[2023-10-14 06:40:47,123][100936] Updated weights for policy 0, policy_version 39150 (0.0009) +[2023-10-14 06:40:47,496][100936] Updated weights for policy 0, policy_version 39160 (0.0007) +[2023-10-14 06:40:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 80216064. Throughput: 0: 1646.0, 1: 1664.4. Samples: 20058616. Policy #0 lag: (min: 21.0, avg: 45.7, max: 48.0) +[2023-10-14 06:40:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:40:50,406][100917] Updated weights for policy 1, policy_version 39172 (0.0007) +[2023-10-14 06:40:50,782][100917] Updated weights for policy 1, policy_version 39182 (0.0007) +[2023-10-14 06:40:51,155][100917] Updated weights for policy 1, policy_version 39192 (0.0009) +[2023-10-14 06:40:51,638][100936] Updated weights for policy 0, policy_version 39170 (0.0007) +[2023-10-14 06:40:52,002][100936] Updated weights for policy 0, policy_version 39180 (0.0007) +[2023-10-14 06:40:52,365][100936] Updated weights for policy 0, policy_version 39190 (0.0008) +[2023-10-14 06:40:52,738][100936] Updated weights for policy 0, policy_version 39200 (0.0008) +[2023-10-14 06:40:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 80281600. Throughput: 0: 1645.2, 1: 1675.4. Samples: 20078462. Policy #0 lag: (min: 21.0, avg: 45.7, max: 48.0) +[2023-10-14 06:40:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:40:55,216][100917] Updated weights for policy 1, policy_version 39202 (0.0010) +[2023-10-14 06:40:55,581][100917] Updated weights for policy 1, policy_version 39212 (0.0007) +[2023-10-14 06:40:55,959][100917] Updated weights for policy 1, policy_version 39222 (0.0008) +[2023-10-14 06:40:56,340][100917] Updated weights for policy 1, policy_version 39232 (0.0009) +[2023-10-14 06:40:57,029][100936] Updated weights for policy 0, policy_version 39210 (0.0008) +[2023-10-14 06:40:57,404][100936] Updated weights for policy 0, policy_version 39220 (0.0007) +[2023-10-14 06:40:57,768][100936] Updated weights for policy 0, policy_version 39230 (0.0007) +[2023-10-14 06:40:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 80347136. Throughput: 0: 1644.8, 1: 1657.7. Samples: 20089152. Policy #0 lag: (min: 21.0, avg: 45.7, max: 48.0) +[2023-10-14 06:40:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:41:00,614][100917] Updated weights for policy 1, policy_version 39242 (0.0007) +[2023-10-14 06:41:00,990][100917] Updated weights for policy 1, policy_version 39252 (0.0009) +[2023-10-14 06:41:01,361][100917] Updated weights for policy 1, policy_version 39262 (0.0008) +[2023-10-14 06:41:01,726][100936] Updated weights for policy 0, policy_version 39240 (0.0009) +[2023-10-14 06:41:02,100][100936] Updated weights for policy 0, policy_version 39250 (0.0010) +[2023-10-14 06:41:02,476][100936] Updated weights for policy 0, policy_version 39260 (0.0010) +[2023-10-14 06:41:03,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 80412672. Throughput: 0: 1650.7, 1: 1661.0. Samples: 20108304. Policy #0 lag: (min: 21.0, avg: 45.7, max: 48.0) +[2023-10-14 06:41:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:41:05,513][100917] Updated weights for policy 1, policy_version 39272 (0.0008) +[2023-10-14 06:41:05,889][100917] Updated weights for policy 1, policy_version 39282 (0.0007) +[2023-10-14 06:41:06,270][100917] Updated weights for policy 1, policy_version 39292 (0.0011) +[2023-10-14 06:41:06,634][100936] Updated weights for policy 0, policy_version 39270 (0.0009) +[2023-10-14 06:41:07,009][100936] Updated weights for policy 0, policy_version 39280 (0.0010) +[2023-10-14 06:41:07,363][100936] Updated weights for policy 0, policy_version 39290 (0.0007) +[2023-10-14 06:41:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 80478208. Throughput: 0: 1655.9, 1: 1666.0. Samples: 20128220. Policy #0 lag: (min: 21.0, avg: 45.7, max: 48.0) +[2023-10-14 06:41:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:41:10,384][100917] Updated weights for policy 1, policy_version 39302 (0.0008) +[2023-10-14 06:41:10,753][100917] Updated weights for policy 1, policy_version 39312 (0.0011) +[2023-10-14 06:41:11,131][100917] Updated weights for policy 1, policy_version 39322 (0.0010) +[2023-10-14 06:41:11,464][100936] Updated weights for policy 0, policy_version 39300 (0.0007) +[2023-10-14 06:41:11,826][100936] Updated weights for policy 0, policy_version 39310 (0.0011) +[2023-10-14 06:41:12,200][100936] Updated weights for policy 0, policy_version 39320 (0.0009) +[2023-10-14 06:41:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 80543744. Throughput: 0: 1657.8, 1: 1645.1. Samples: 20138882. Policy #0 lag: (min: 21.0, avg: 45.7, max: 48.0) +[2023-10-14 06:41:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:41:15,279][100917] Updated weights for policy 1, policy_version 39332 (0.0010) +[2023-10-14 06:41:15,660][100917] Updated weights for policy 1, policy_version 39342 (0.0009) +[2023-10-14 06:41:16,035][100917] Updated weights for policy 1, policy_version 39352 (0.0010) +[2023-10-14 06:41:16,250][100936] Updated weights for policy 0, policy_version 39330 (0.0007) +[2023-10-14 06:41:16,618][100936] Updated weights for policy 0, policy_version 39340 (0.0010) +[2023-10-14 06:41:16,988][100936] Updated weights for policy 0, policy_version 39350 (0.0011) +[2023-10-14 06:41:17,362][100936] Updated weights for policy 0, policy_version 39360 (0.0009) +[2023-10-14 06:41:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 80609280. Throughput: 0: 1650.1, 1: 1656.2. Samples: 20157616. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:41:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:41:19,987][100917] Updated weights for policy 1, policy_version 39362 (0.0008) +[2023-10-14 06:41:20,357][100917] Updated weights for policy 1, policy_version 39372 (0.0009) +[2023-10-14 06:41:20,730][100917] Updated weights for policy 1, policy_version 39382 (0.0010) +[2023-10-14 06:41:21,111][100917] Updated weights for policy 1, policy_version 39392 (0.0010) +[2023-10-14 06:41:21,517][100936] Updated weights for policy 0, policy_version 39370 (0.0010) +[2023-10-14 06:41:21,890][100936] Updated weights for policy 0, policy_version 39380 (0.0008) +[2023-10-14 06:41:22,263][100936] Updated weights for policy 0, policy_version 39390 (0.0007) +[2023-10-14 06:41:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 80674816. Throughput: 0: 1662.0, 1: 1657.7. Samples: 20178190. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:41:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:41:25,208][100917] Updated weights for policy 1, policy_version 39402 (0.0010) +[2023-10-14 06:41:25,578][100917] Updated weights for policy 1, policy_version 39412 (0.0010) +[2023-10-14 06:41:25,963][100917] Updated weights for policy 1, policy_version 39422 (0.0010) +[2023-10-14 06:41:26,377][100936] Updated weights for policy 0, policy_version 39400 (0.0008) +[2023-10-14 06:41:26,754][100936] Updated weights for policy 0, policy_version 39410 (0.0007) +[2023-10-14 06:41:27,138][100936] Updated weights for policy 0, policy_version 39420 (0.0008) +[2023-10-14 06:41:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 80740352. Throughput: 0: 1655.1, 1: 1639.6. Samples: 20188110. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:41:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:41:30,045][100917] Updated weights for policy 1, policy_version 39432 (0.0010) +[2023-10-14 06:41:30,424][100917] Updated weights for policy 1, policy_version 39442 (0.0009) +[2023-10-14 06:41:30,792][100917] Updated weights for policy 1, policy_version 39452 (0.0009) +[2023-10-14 06:41:31,071][100936] Updated weights for policy 0, policy_version 39430 (0.0010) +[2023-10-14 06:41:31,441][100936] Updated weights for policy 0, policy_version 39440 (0.0010) +[2023-10-14 06:41:31,813][100936] Updated weights for policy 0, policy_version 39450 (0.0007) +[2023-10-14 06:41:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 80805888. Throughput: 0: 1654.5, 1: 1660.3. Samples: 20207784. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:41:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:41:34,945][100917] Updated weights for policy 1, policy_version 39462 (0.0008) +[2023-10-14 06:41:35,327][100917] Updated weights for policy 1, policy_version 39472 (0.0009) +[2023-10-14 06:41:35,705][100917] Updated weights for policy 1, policy_version 39482 (0.0008) +[2023-10-14 06:41:36,065][100936] Updated weights for policy 0, policy_version 39460 (0.0007) +[2023-10-14 06:41:36,442][100936] Updated weights for policy 0, policy_version 39470 (0.0011) +[2023-10-14 06:41:36,815][100936] Updated weights for policy 0, policy_version 39480 (0.0007) +[2023-10-14 06:41:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 80871424. Throughput: 0: 1667.8, 1: 1656.7. Samples: 20228066. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:41:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:41:38,526][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000039488_40435712.pth... +[2023-10-14 06:41:38,526][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000039488_40435712.pth... +[2023-10-14 06:41:38,559][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000037952_38862848.pth +[2023-10-14 06:41:38,562][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000037952_38862848.pth +[2023-10-14 06:41:39,807][100917] Updated weights for policy 1, policy_version 39492 (0.0011) +[2023-10-14 06:41:40,182][100917] Updated weights for policy 1, policy_version 39502 (0.0008) +[2023-10-14 06:41:40,553][100917] Updated weights for policy 1, policy_version 39512 (0.0009) +[2023-10-14 06:41:40,949][100936] Updated weights for policy 0, policy_version 39490 (0.0007) +[2023-10-14 06:41:41,320][100936] Updated weights for policy 0, policy_version 39500 (0.0009) +[2023-10-14 06:41:41,699][100936] Updated weights for policy 0, policy_version 39510 (0.0008) +[2023-10-14 06:41:42,068][100936] Updated weights for policy 0, policy_version 39520 (0.0007) +[2023-10-14 06:41:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 80936960. Throughput: 0: 1659.6, 1: 1645.3. Samples: 20237872. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:41:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:41:44,641][100917] Updated weights for policy 1, policy_version 39522 (0.0008) +[2023-10-14 06:41:45,015][100917] Updated weights for policy 1, policy_version 39532 (0.0009) +[2023-10-14 06:41:45,383][100917] Updated weights for policy 1, policy_version 39542 (0.0007) +[2023-10-14 06:41:45,752][100917] Updated weights for policy 1, policy_version 39552 (0.0008) +[2023-10-14 06:41:46,186][100936] Updated weights for policy 0, policy_version 39530 (0.0008) +[2023-10-14 06:41:46,561][100936] Updated weights for policy 0, policy_version 39540 (0.0009) +[2023-10-14 06:41:46,928][100936] Updated weights for policy 0, policy_version 39550 (0.0009) +[2023-10-14 06:41:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 81002496. Throughput: 0: 1658.4, 1: 1659.8. Samples: 20257624. Policy #0 lag: (min: 14.0, avg: 15.6, max: 43.0) +[2023-10-14 06:41:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:41:49,902][100917] Updated weights for policy 1, policy_version 39562 (0.0009) +[2023-10-14 06:41:50,282][100917] Updated weights for policy 1, policy_version 39572 (0.0010) +[2023-10-14 06:41:50,659][100917] Updated weights for policy 1, policy_version 39582 (0.0008) +[2023-10-14 06:41:51,173][100936] Updated weights for policy 0, policy_version 39560 (0.0011) +[2023-10-14 06:41:51,548][100936] Updated weights for policy 0, policy_version 39570 (0.0008) +[2023-10-14 06:41:51,920][100936] Updated weights for policy 0, policy_version 39580 (0.0008) +[2023-10-14 06:41:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 81068032. Throughput: 0: 1664.9, 1: 1658.0. Samples: 20277752. Policy #0 lag: (min: 14.0, avg: 15.6, max: 43.0) +[2023-10-14 06:41:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:41:54,833][100917] Updated weights for policy 1, policy_version 39592 (0.0009) +[2023-10-14 06:41:55,200][100917] Updated weights for policy 1, policy_version 39602 (0.0007) +[2023-10-14 06:41:55,570][100917] Updated weights for policy 1, policy_version 39612 (0.0010) +[2023-10-14 06:41:55,867][100936] Updated weights for policy 0, policy_version 39590 (0.0010) +[2023-10-14 06:41:56,238][100936] Updated weights for policy 0, policy_version 39600 (0.0008) +[2023-10-14 06:41:56,606][100936] Updated weights for policy 0, policy_version 39610 (0.0009) +[2023-10-14 06:41:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 81133568. Throughput: 0: 1650.2, 1: 1649.6. Samples: 20287376. Policy #0 lag: (min: 14.0, avg: 15.6, max: 43.0) +[2023-10-14 06:41:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:41:59,584][100917] Updated weights for policy 1, policy_version 39622 (0.0010) +[2023-10-14 06:41:59,958][100917] Updated weights for policy 1, policy_version 39632 (0.0008) +[2023-10-14 06:42:00,331][100917] Updated weights for policy 1, policy_version 39642 (0.0008) +[2023-10-14 06:42:00,722][100936] Updated weights for policy 0, policy_version 39620 (0.0008) +[2023-10-14 06:42:01,091][100936] Updated weights for policy 0, policy_version 39630 (0.0009) +[2023-10-14 06:42:01,469][100936] Updated weights for policy 0, policy_version 39640 (0.0008) +[2023-10-14 06:42:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 81199104. Throughput: 0: 1666.4, 1: 1664.8. Samples: 20307518. Policy #0 lag: (min: 14.0, avg: 15.6, max: 43.0) +[2023-10-14 06:42:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:42:04,523][100917] Updated weights for policy 1, policy_version 39652 (0.0010) +[2023-10-14 06:42:04,901][100917] Updated weights for policy 1, policy_version 39662 (0.0007) +[2023-10-14 06:42:05,277][100917] Updated weights for policy 1, policy_version 39672 (0.0007) +[2023-10-14 06:42:05,704][100936] Updated weights for policy 0, policy_version 39650 (0.0009) +[2023-10-14 06:42:06,080][100936] Updated weights for policy 0, policy_version 39660 (0.0007) +[2023-10-14 06:42:06,454][100936] Updated weights for policy 0, policy_version 39670 (0.0008) +[2023-10-14 06:42:06,819][100936] Updated weights for policy 0, policy_version 39680 (0.0009) +[2023-10-14 06:42:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 81264640. Throughput: 0: 1666.2, 1: 1664.2. Samples: 20328060. Policy #0 lag: (min: 14.0, avg: 15.6, max: 43.0) +[2023-10-14 06:42:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:42:09,386][100917] Updated weights for policy 1, policy_version 39682 (0.0008) +[2023-10-14 06:42:09,773][100917] Updated weights for policy 1, policy_version 39692 (0.0010) +[2023-10-14 06:42:10,147][100917] Updated weights for policy 1, policy_version 39702 (0.0009) +[2023-10-14 06:42:10,522][100917] Updated weights for policy 1, policy_version 39712 (0.0010) +[2023-10-14 06:42:10,864][100936] Updated weights for policy 0, policy_version 39690 (0.0007) +[2023-10-14 06:42:11,229][100936] Updated weights for policy 0, policy_version 39700 (0.0008) +[2023-10-14 06:42:11,598][100936] Updated weights for policy 0, policy_version 39710 (0.0010) +[2023-10-14 06:42:13,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 81330176. Throughput: 0: 1652.6, 1: 1661.1. Samples: 20337224. Policy #0 lag: (min: 14.0, avg: 15.6, max: 43.0) +[2023-10-14 06:42:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:42:14,670][100917] Updated weights for policy 1, policy_version 39722 (0.0008) +[2023-10-14 06:42:15,054][100917] Updated weights for policy 1, policy_version 39732 (0.0008) +[2023-10-14 06:42:15,430][100917] Updated weights for policy 1, policy_version 39742 (0.0007) +[2023-10-14 06:42:15,641][100936] Updated weights for policy 0, policy_version 39720 (0.0008) +[2023-10-14 06:42:16,014][100936] Updated weights for policy 0, policy_version 39730 (0.0009) +[2023-10-14 06:42:16,381][100936] Updated weights for policy 0, policy_version 39740 (0.0008) +[2023-10-14 06:42:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 81395712. Throughput: 0: 1668.5, 1: 1655.9. Samples: 20357382. Policy #0 lag: (min: 12.0, avg: 24.7, max: 44.0) +[2023-10-14 06:42:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:42:19,489][100917] Updated weights for policy 1, policy_version 39752 (0.0009) +[2023-10-14 06:42:19,859][100917] Updated weights for policy 1, policy_version 39762 (0.0008) +[2023-10-14 06:42:20,223][100917] Updated weights for policy 1, policy_version 39772 (0.0008) +[2023-10-14 06:42:20,325][100936] Updated weights for policy 0, policy_version 39750 (0.0010) +[2023-10-14 06:42:20,695][100936] Updated weights for policy 0, policy_version 39760 (0.0008) +[2023-10-14 06:42:21,061][100936] Updated weights for policy 0, policy_version 39770 (0.0008) +[2023-10-14 06:42:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 81461248. Throughput: 0: 1668.1, 1: 1658.9. Samples: 20377778. Policy #0 lag: (min: 12.0, avg: 24.7, max: 44.0) +[2023-10-14 06:42:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:42:24,490][100917] Updated weights for policy 1, policy_version 39782 (0.0010) +[2023-10-14 06:42:24,874][100917] Updated weights for policy 1, policy_version 39792 (0.0011) +[2023-10-14 06:42:25,239][100917] Updated weights for policy 1, policy_version 39802 (0.0008) +[2023-10-14 06:42:25,263][100936] Updated weights for policy 0, policy_version 39780 (0.0009) +[2023-10-14 06:42:25,630][100936] Updated weights for policy 0, policy_version 39790 (0.0007) +[2023-10-14 06:42:25,997][100936] Updated weights for policy 0, policy_version 39800 (0.0007) +[2023-10-14 06:42:28,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 81526784. Throughput: 0: 1645.1, 1: 1657.4. Samples: 20386482. Policy #0 lag: (min: 12.0, avg: 24.7, max: 44.0) +[2023-10-14 06:42:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:42:29,335][100917] Updated weights for policy 1, policy_version 39812 (0.0007) +[2023-10-14 06:42:29,696][100917] Updated weights for policy 1, policy_version 39822 (0.0009) +[2023-10-14 06:42:30,075][100917] Updated weights for policy 1, policy_version 39832 (0.0007) +[2023-10-14 06:42:30,212][100936] Updated weights for policy 0, policy_version 39810 (0.0007) +[2023-10-14 06:42:30,579][100936] Updated weights for policy 0, policy_version 39820 (0.0007) +[2023-10-14 06:42:30,945][100936] Updated weights for policy 0, policy_version 39830 (0.0007) +[2023-10-14 06:42:31,313][100936] Updated weights for policy 0, policy_version 39840 (0.0010) +[2023-10-14 06:42:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 81592320. Throughput: 0: 1656.6, 1: 1662.9. Samples: 20407002. Policy #0 lag: (min: 12.0, avg: 24.7, max: 44.0) +[2023-10-14 06:42:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:42:34,154][100917] Updated weights for policy 1, policy_version 39842 (0.0008) +[2023-10-14 06:42:34,527][100917] Updated weights for policy 1, policy_version 39852 (0.0010) +[2023-10-14 06:42:34,902][100917] Updated weights for policy 1, policy_version 39862 (0.0007) +[2023-10-14 06:42:35,276][100917] Updated weights for policy 1, policy_version 39872 (0.0009) +[2023-10-14 06:42:35,678][100936] Updated weights for policy 0, policy_version 39850 (0.0007) +[2023-10-14 06:42:36,041][100936] Updated weights for policy 0, policy_version 39860 (0.0007) +[2023-10-14 06:42:36,415][100936] Updated weights for policy 0, policy_version 39870 (0.0008) +[2023-10-14 06:42:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 81657856. Throughput: 0: 1656.3, 1: 1670.2. Samples: 20427446. Policy #0 lag: (min: 12.0, avg: 24.7, max: 44.0) +[2023-10-14 06:42:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:42:39,394][100917] Updated weights for policy 1, policy_version 39882 (0.0009) +[2023-10-14 06:42:39,768][100917] Updated weights for policy 1, policy_version 39892 (0.0008) +[2023-10-14 06:42:40,138][100917] Updated weights for policy 1, policy_version 39902 (0.0007) +[2023-10-14 06:42:40,466][100936] Updated weights for policy 0, policy_version 39880 (0.0009) +[2023-10-14 06:42:40,831][100936] Updated weights for policy 0, policy_version 39890 (0.0008) +[2023-10-14 06:42:41,204][100936] Updated weights for policy 0, policy_version 39900 (0.0008) +[2023-10-14 06:42:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 81723392. Throughput: 0: 1641.2, 1: 1670.1. Samples: 20436382. Policy #0 lag: (min: 12.0, avg: 24.7, max: 44.0) +[2023-10-14 06:42:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:42:44,382][100917] Updated weights for policy 1, policy_version 39912 (0.0009) +[2023-10-14 06:42:44,755][100917] Updated weights for policy 1, policy_version 39922 (0.0010) +[2023-10-14 06:42:45,139][100917] Updated weights for policy 1, policy_version 39932 (0.0011) +[2023-10-14 06:42:45,422][100936] Updated weights for policy 0, policy_version 39910 (0.0008) +[2023-10-14 06:42:45,784][100936] Updated weights for policy 0, policy_version 39920 (0.0007) +[2023-10-14 06:42:46,153][100936] Updated weights for policy 0, policy_version 39930 (0.0007) +[2023-10-14 06:42:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 81788928. Throughput: 0: 1654.2, 1: 1667.7. Samples: 20457004. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 06:42:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:42:49,345][100917] Updated weights for policy 1, policy_version 39942 (0.0009) +[2023-10-14 06:42:49,724][100917] Updated weights for policy 1, policy_version 39952 (0.0011) +[2023-10-14 06:42:50,096][100917] Updated weights for policy 1, policy_version 39962 (0.0010) +[2023-10-14 06:42:50,414][100936] Updated weights for policy 0, policy_version 39940 (0.0007) +[2023-10-14 06:42:50,790][100936] Updated weights for policy 0, policy_version 39950 (0.0008) +[2023-10-14 06:42:51,157][100936] Updated weights for policy 0, policy_version 39960 (0.0007) +[2023-10-14 06:42:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 81854464. Throughput: 0: 1653.6, 1: 1667.2. Samples: 20477500. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 06:42:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:42:54,177][100917] Updated weights for policy 1, policy_version 39972 (0.0009) +[2023-10-14 06:42:54,548][100917] Updated weights for policy 1, policy_version 39982 (0.0010) +[2023-10-14 06:42:54,920][100917] Updated weights for policy 1, policy_version 39992 (0.0008) +[2023-10-14 06:42:54,936][100936] Updated weights for policy 0, policy_version 39970 (0.0009) +[2023-10-14 06:42:55,305][100936] Updated weights for policy 0, policy_version 39980 (0.0008) +[2023-10-14 06:42:55,678][100936] Updated weights for policy 0, policy_version 39990 (0.0007) +[2023-10-14 06:42:56,047][100936] Updated weights for policy 0, policy_version 40000 (0.0009) +[2023-10-14 06:42:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 81920000. Throughput: 0: 1646.6, 1: 1668.1. Samples: 20486386. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 06:42:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:42:58,976][100917] Updated weights for policy 1, policy_version 40002 (0.0008) +[2023-10-14 06:42:59,346][100917] Updated weights for policy 1, policy_version 40012 (0.0011) +[2023-10-14 06:42:59,721][100917] Updated weights for policy 1, policy_version 40022 (0.0009) +[2023-10-14 06:43:00,093][100917] Updated weights for policy 1, policy_version 40032 (0.0010) +[2023-10-14 06:43:00,211][100936] Updated weights for policy 0, policy_version 40010 (0.0008) +[2023-10-14 06:43:00,580][100936] Updated weights for policy 0, policy_version 40020 (0.0008) +[2023-10-14 06:43:00,954][100936] Updated weights for policy 0, policy_version 40030 (0.0009) +[2023-10-14 06:43:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 81985536. Throughput: 0: 1648.6, 1: 1669.4. Samples: 20506692. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 06:43:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:43:04,078][100917] Updated weights for policy 1, policy_version 40042 (0.0011) +[2023-10-14 06:43:04,458][100917] Updated weights for policy 1, policy_version 40052 (0.0011) +[2023-10-14 06:43:04,822][100917] Updated weights for policy 1, policy_version 40062 (0.0010) +[2023-10-14 06:43:05,074][100936] Updated weights for policy 0, policy_version 40040 (0.0010) +[2023-10-14 06:43:05,433][100936] Updated weights for policy 0, policy_version 40050 (0.0007) +[2023-10-14 06:43:05,811][100936] Updated weights for policy 0, policy_version 40060 (0.0007) +[2023-10-14 06:43:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 82051072. Throughput: 0: 1649.6, 1: 1669.2. Samples: 20527122. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 06:43:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:43:08,950][100917] Updated weights for policy 1, policy_version 40072 (0.0008) +[2023-10-14 06:43:09,338][100917] Updated weights for policy 1, policy_version 40082 (0.0009) +[2023-10-14 06:43:09,702][100917] Updated weights for policy 1, policy_version 40092 (0.0008) +[2023-10-14 06:43:09,985][100936] Updated weights for policy 0, policy_version 40070 (0.0008) +[2023-10-14 06:43:10,337][100936] Updated weights for policy 0, policy_version 40080 (0.0009) +[2023-10-14 06:43:10,717][100936] Updated weights for policy 0, policy_version 40090 (0.0008) +[2023-10-14 06:43:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 82116608. Throughput: 0: 1653.9, 1: 1675.1. Samples: 20536288. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 06:43:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:43:13,647][100917] Updated weights for policy 1, policy_version 40102 (0.0010) +[2023-10-14 06:43:14,030][100917] Updated weights for policy 1, policy_version 40112 (0.0008) +[2023-10-14 06:43:14,399][100917] Updated weights for policy 1, policy_version 40122 (0.0007) +[2023-10-14 06:43:14,974][100936] Updated weights for policy 0, policy_version 40100 (0.0008) +[2023-10-14 06:43:15,341][100936] Updated weights for policy 0, policy_version 40110 (0.0008) +[2023-10-14 06:43:15,724][100936] Updated weights for policy 0, policy_version 40120 (0.0007) +[2023-10-14 06:43:18,355][100917] Updated weights for policy 1, policy_version 40132 (0.0009) +[2023-10-14 06:43:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 82182144. Throughput: 0: 1659.2, 1: 1666.1. Samples: 20556638. Policy #0 lag: (min: 23.0, avg: 23.3, max: 35.0) +[2023-10-14 06:43:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:43:18,726][100917] Updated weights for policy 1, policy_version 40142 (0.0009) +[2023-10-14 06:43:19,107][100917] Updated weights for policy 1, policy_version 40152 (0.0010) +[2023-10-14 06:43:19,831][100936] Updated weights for policy 0, policy_version 40130 (0.0009) +[2023-10-14 06:43:20,206][100936] Updated weights for policy 0, policy_version 40140 (0.0009) +[2023-10-14 06:43:20,578][100936] Updated weights for policy 0, policy_version 40150 (0.0010) +[2023-10-14 06:43:20,940][100936] Updated weights for policy 0, policy_version 40160 (0.0008) +[2023-10-14 06:43:23,093][100917] Updated weights for policy 1, policy_version 40162 (0.0009) +[2023-10-14 06:43:23,470][100917] Updated weights for policy 1, policy_version 40172 (0.0008) +[2023-10-14 06:43:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 82247680. Throughput: 0: 1664.1, 1: 1665.5. Samples: 20577278. Policy #0 lag: (min: 23.0, avg: 23.3, max: 35.0) +[2023-10-14 06:43:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:43:23,857][100917] Updated weights for policy 1, policy_version 40182 (0.0007) +[2023-10-14 06:43:24,221][100917] Updated weights for policy 1, policy_version 40192 (0.0009) +[2023-10-14 06:43:25,230][100936] Updated weights for policy 0, policy_version 40170 (0.0007) +[2023-10-14 06:43:25,613][100936] Updated weights for policy 0, policy_version 40180 (0.0009) +[2023-10-14 06:43:25,976][100936] Updated weights for policy 0, policy_version 40190 (0.0007) +[2023-10-14 06:43:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 82313216. Throughput: 0: 1657.9, 1: 1667.4. Samples: 20586022. Policy #0 lag: (min: 23.0, avg: 23.3, max: 35.0) +[2023-10-14 06:43:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:43:28,603][100917] Updated weights for policy 1, policy_version 40202 (0.0009) +[2023-10-14 06:43:28,972][100917] Updated weights for policy 1, policy_version 40212 (0.0007) +[2023-10-14 06:43:29,349][100917] Updated weights for policy 1, policy_version 40222 (0.0008) +[2023-10-14 06:43:30,196][100936] Updated weights for policy 0, policy_version 40200 (0.0010) +[2023-10-14 06:43:30,572][100936] Updated weights for policy 0, policy_version 40210 (0.0007) +[2023-10-14 06:43:30,946][100936] Updated weights for policy 0, policy_version 40220 (0.0009) +[2023-10-14 06:43:33,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 82378752. Throughput: 0: 1652.2, 1: 1661.1. Samples: 20606104. Policy #0 lag: (min: 23.0, avg: 23.3, max: 35.0) +[2023-10-14 06:43:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:43:33,552][100917] Updated weights for policy 1, policy_version 40232 (0.0009) +[2023-10-14 06:43:33,917][100917] Updated weights for policy 1, policy_version 40242 (0.0008) +[2023-10-14 06:43:34,295][100917] Updated weights for policy 1, policy_version 40252 (0.0008) +[2023-10-14 06:43:35,006][100936] Updated weights for policy 0, policy_version 40230 (0.0008) +[2023-10-14 06:43:35,377][100936] Updated weights for policy 0, policy_version 40240 (0.0008) +[2023-10-14 06:43:35,746][100936] Updated weights for policy 0, policy_version 40250 (0.0008) +[2023-10-14 06:43:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 82444288. Throughput: 0: 1654.4, 1: 1655.7. Samples: 20626458. Policy #0 lag: (min: 23.0, avg: 23.3, max: 35.0) +[2023-10-14 06:43:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:43:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000040256_41222144.pth... +[2023-10-14 06:43:38,552][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000038720_39649280.pth +[2023-10-14 06:43:38,600][100917] Updated weights for policy 1, policy_version 40262 (0.0008) +[2023-10-14 06:43:38,978][100917] Updated weights for policy 1, policy_version 40272 (0.0009) +[2023-10-14 06:43:39,353][100917] Updated weights for policy 1, policy_version 40282 (0.0008) +[2023-10-14 06:43:39,571][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000040288_41254912.pth... +[2023-10-14 06:43:39,610][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000038720_39649280.pth +[2023-10-14 06:43:39,623][100936] Updated weights for policy 0, policy_version 40260 (0.0008) +[2023-10-14 06:43:39,991][100936] Updated weights for policy 0, policy_version 40270 (0.0008) +[2023-10-14 06:43:40,365][100936] Updated weights for policy 0, policy_version 40280 (0.0008) +[2023-10-14 06:43:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 82509824. Throughput: 0: 1654.2, 1: 1658.2. Samples: 20635444. Policy #0 lag: (min: 23.0, avg: 23.3, max: 35.0) +[2023-10-14 06:43:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:43:43,535][100917] Updated weights for policy 1, policy_version 40292 (0.0009) +[2023-10-14 06:43:43,904][100917] Updated weights for policy 1, policy_version 40302 (0.0007) +[2023-10-14 06:43:44,276][100917] Updated weights for policy 1, policy_version 40312 (0.0007) +[2023-10-14 06:43:44,525][100936] Updated weights for policy 0, policy_version 40290 (0.0008) +[2023-10-14 06:43:44,883][100936] Updated weights for policy 0, policy_version 40300 (0.0007) +[2023-10-14 06:43:45,257][100936] Updated weights for policy 0, policy_version 40310 (0.0008) +[2023-10-14 06:43:45,618][100936] Updated weights for policy 0, policy_version 40320 (0.0007) +[2023-10-14 06:43:48,405][100917] Updated weights for policy 1, policy_version 40322 (0.0010) +[2023-10-14 06:43:48,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 82575360. Throughput: 0: 1651.6, 1: 1656.1. Samples: 20655540. Policy #0 lag: (min: 0.0, avg: 22.5, max: 32.0) +[2023-10-14 06:43:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:43:48,773][100917] Updated weights for policy 1, policy_version 40332 (0.0009) +[2023-10-14 06:43:49,143][100917] Updated weights for policy 1, policy_version 40342 (0.0008) +[2023-10-14 06:43:49,529][100917] Updated weights for policy 1, policy_version 40352 (0.0009) +[2023-10-14 06:43:50,000][100936] Updated weights for policy 0, policy_version 40330 (0.0008) +[2023-10-14 06:43:50,366][100936] Updated weights for policy 0, policy_version 40340 (0.0008) +[2023-10-14 06:43:50,739][100936] Updated weights for policy 0, policy_version 40350 (0.0008) +[2023-10-14 06:43:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 82640896. Throughput: 0: 1650.4, 1: 1658.7. Samples: 20676030. Policy #0 lag: (min: 0.0, avg: 22.5, max: 32.0) +[2023-10-14 06:43:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:43:53,571][100917] Updated weights for policy 1, policy_version 40362 (0.0009) +[2023-10-14 06:43:53,951][100917] Updated weights for policy 1, policy_version 40372 (0.0009) +[2023-10-14 06:43:54,325][100917] Updated weights for policy 1, policy_version 40382 (0.0008) +[2023-10-14 06:43:54,825][100936] Updated weights for policy 0, policy_version 40360 (0.0008) +[2023-10-14 06:43:55,194][100936] Updated weights for policy 0, policy_version 40370 (0.0009) +[2023-10-14 06:43:55,564][100936] Updated weights for policy 0, policy_version 40380 (0.0007) +[2023-10-14 06:43:58,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 82706432. Throughput: 0: 1652.5, 1: 1658.2. Samples: 20685268. Policy #0 lag: (min: 0.0, avg: 22.5, max: 32.0) +[2023-10-14 06:43:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:43:58,550][100917] Updated weights for policy 1, policy_version 40392 (0.0008) +[2023-10-14 06:43:58,929][100917] Updated weights for policy 1, policy_version 40402 (0.0009) +[2023-10-14 06:43:59,287][100917] Updated weights for policy 1, policy_version 40412 (0.0009) +[2023-10-14 06:43:59,617][100936] Updated weights for policy 0, policy_version 40390 (0.0008) +[2023-10-14 06:43:59,995][100936] Updated weights for policy 0, policy_version 40400 (0.0009) +[2023-10-14 06:44:00,365][100936] Updated weights for policy 0, policy_version 40410 (0.0008) +[2023-10-14 06:44:03,172][100917] Updated weights for policy 1, policy_version 40422 (0.0009) +[2023-10-14 06:44:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 82771968. Throughput: 0: 1654.5, 1: 1658.4. Samples: 20705718. Policy #0 lag: (min: 0.0, avg: 22.5, max: 32.0) +[2023-10-14 06:44:03,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:44:03,546][100917] Updated weights for policy 1, policy_version 40432 (0.0010) +[2023-10-14 06:44:03,927][100917] Updated weights for policy 1, policy_version 40442 (0.0008) +[2023-10-14 06:44:04,537][100936] Updated weights for policy 0, policy_version 40420 (0.0009) +[2023-10-14 06:44:04,914][100936] Updated weights for policy 0, policy_version 40430 (0.0010) +[2023-10-14 06:44:05,283][100936] Updated weights for policy 0, policy_version 40440 (0.0007) +[2023-10-14 06:44:08,037][100917] Updated weights for policy 1, policy_version 40452 (0.0009) +[2023-10-14 06:44:08,420][100917] Updated weights for policy 1, policy_version 40462 (0.0009) +[2023-10-14 06:44:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 82837504. Throughput: 0: 1656.4, 1: 1657.3. Samples: 20726394. Policy #0 lag: (min: 0.0, avg: 22.5, max: 32.0) +[2023-10-14 06:44:08,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:44:08,782][100917] Updated weights for policy 1, policy_version 40472 (0.0009) +[2023-10-14 06:44:09,357][100936] Updated weights for policy 0, policy_version 40450 (0.0008) +[2023-10-14 06:44:09,749][100936] Updated weights for policy 0, policy_version 40460 (0.0009) +[2023-10-14 06:44:10,115][100936] Updated weights for policy 0, policy_version 40470 (0.0007) +[2023-10-14 06:44:10,480][100936] Updated weights for policy 0, policy_version 40480 (0.0007) +[2023-10-14 06:44:13,024][100917] Updated weights for policy 1, policy_version 40482 (0.0009) +[2023-10-14 06:44:13,420][100917] Updated weights for policy 1, policy_version 40492 (0.0008) +[2023-10-14 06:44:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 82903040. Throughput: 0: 1662.0, 1: 1656.8. Samples: 20735368. Policy #0 lag: (min: 0.0, avg: 22.5, max: 32.0) +[2023-10-14 06:44:13,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:44:13,797][100917] Updated weights for policy 1, policy_version 40502 (0.0009) +[2023-10-14 06:44:14,164][100917] Updated weights for policy 1, policy_version 40512 (0.0011) +[2023-10-14 06:44:14,670][100936] Updated weights for policy 0, policy_version 40490 (0.0009) +[2023-10-14 06:44:15,036][100936] Updated weights for policy 0, policy_version 40500 (0.0009) +[2023-10-14 06:44:15,405][100936] Updated weights for policy 0, policy_version 40510 (0.0010) +[2023-10-14 06:44:18,323][100917] Updated weights for policy 1, policy_version 40522 (0.0007) +[2023-10-14 06:44:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 82968576. Throughput: 0: 1662.5, 1: 1658.3. Samples: 20755540. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) +[2023-10-14 06:44:18,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:44:18,693][100917] Updated weights for policy 1, policy_version 40532 (0.0007) +[2023-10-14 06:44:19,064][100917] Updated weights for policy 1, policy_version 40542 (0.0010) +[2023-10-14 06:44:19,458][100936] Updated weights for policy 0, policy_version 40520 (0.0010) +[2023-10-14 06:44:19,833][100936] Updated weights for policy 0, policy_version 40530 (0.0008) +[2023-10-14 06:44:20,210][100936] Updated weights for policy 0, policy_version 40540 (0.0010) +[2023-10-14 06:44:23,230][100917] Updated weights for policy 1, policy_version 40552 (0.0007) +[2023-10-14 06:44:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 83034112. Throughput: 0: 1663.3, 1: 1659.4. Samples: 20775980. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) +[2023-10-14 06:44:23,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:44:23,601][100917] Updated weights for policy 1, policy_version 40562 (0.0010) +[2023-10-14 06:44:23,970][100917] Updated weights for policy 1, policy_version 40572 (0.0008) +[2023-10-14 06:44:24,398][100936] Updated weights for policy 0, policy_version 40550 (0.0010) +[2023-10-14 06:44:24,766][100936] Updated weights for policy 0, policy_version 40560 (0.0008) +[2023-10-14 06:44:25,146][100936] Updated weights for policy 0, policy_version 40570 (0.0009) +[2023-10-14 06:44:28,010][100917] Updated weights for policy 1, policy_version 40582 (0.0010) +[2023-10-14 06:44:28,384][100917] Updated weights for policy 1, policy_version 40592 (0.0009) +[2023-10-14 06:44:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 83099648. Throughput: 0: 1662.9, 1: 1660.9. Samples: 20785014. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) +[2023-10-14 06:44:28,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:44:28,759][100917] Updated weights for policy 1, policy_version 40602 (0.0009) +[2023-10-14 06:44:29,340][100936] Updated weights for policy 0, policy_version 40580 (0.0008) +[2023-10-14 06:44:29,719][100936] Updated weights for policy 0, policy_version 40590 (0.0009) +[2023-10-14 06:44:30,085][100936] Updated weights for policy 0, policy_version 40600 (0.0010) +[2023-10-14 06:44:32,904][100917] Updated weights for policy 1, policy_version 40612 (0.0008) +[2023-10-14 06:44:33,272][100917] Updated weights for policy 1, policy_version 40622 (0.0008) +[2023-10-14 06:44:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 83165184. Throughput: 0: 1668.6, 1: 1660.9. Samples: 20805366. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) +[2023-10-14 06:44:33,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:44:33,646][100917] Updated weights for policy 1, policy_version 40632 (0.0007) +[2023-10-14 06:44:34,105][100936] Updated weights for policy 0, policy_version 40610 (0.0010) +[2023-10-14 06:44:34,478][100936] Updated weights for policy 0, policy_version 40620 (0.0007) +[2023-10-14 06:44:34,844][100936] Updated weights for policy 0, policy_version 40630 (0.0009) +[2023-10-14 06:44:35,207][100936] Updated weights for policy 0, policy_version 40640 (0.0008) +[2023-10-14 06:44:37,713][100917] Updated weights for policy 1, policy_version 40642 (0.0007) +[2023-10-14 06:44:38,079][100917] Updated weights for policy 1, policy_version 40652 (0.0009) +[2023-10-14 06:44:38,449][100917] Updated weights for policy 1, policy_version 40662 (0.0007) +[2023-10-14 06:44:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 83230720. Throughput: 0: 1665.7, 1: 1655.8. Samples: 20825498. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) +[2023-10-14 06:44:38,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:44:38,824][100917] Updated weights for policy 1, policy_version 40672 (0.0008) +[2023-10-14 06:44:39,351][100936] Updated weights for policy 0, policy_version 40650 (0.0009) +[2023-10-14 06:44:39,715][100936] Updated weights for policy 0, policy_version 40660 (0.0010) +[2023-10-14 06:44:40,088][100936] Updated weights for policy 0, policy_version 40670 (0.0010) +[2023-10-14 06:44:42,707][100917] Updated weights for policy 1, policy_version 40682 (0.0009) +[2023-10-14 06:44:43,088][100917] Updated weights for policy 1, policy_version 40692 (0.0010) +[2023-10-14 06:44:43,455][100917] Updated weights for policy 1, policy_version 40702 (0.0010) +[2023-10-14 06:44:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 83296256. Throughput: 0: 1662.0, 1: 1664.3. Samples: 20834950. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) +[2023-10-14 06:44:43,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:44:44,328][100936] Updated weights for policy 0, policy_version 40680 (0.0009) +[2023-10-14 06:44:44,695][100936] Updated weights for policy 0, policy_version 40690 (0.0008) +[2023-10-14 06:44:45,076][100936] Updated weights for policy 0, policy_version 40700 (0.0007) +[2023-10-14 06:44:47,516][100917] Updated weights for policy 1, policy_version 40712 (0.0008) +[2023-10-14 06:44:47,887][100917] Updated weights for policy 1, policy_version 40722 (0.0008) +[2023-10-14 06:44:48,252][100917] Updated weights for policy 1, policy_version 40732 (0.0008) +[2023-10-14 06:44:48,512][99942] Fps is (10 sec: 16384.2, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 83394560. Throughput: 0: 1660.7, 1: 1662.9. Samples: 20855280. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 06:44:48,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:44:49,173][100936] Updated weights for policy 0, policy_version 40710 (0.0009) +[2023-10-14 06:44:49,544][100936] Updated weights for policy 0, policy_version 40720 (0.0007) +[2023-10-14 06:44:49,920][100936] Updated weights for policy 0, policy_version 40730 (0.0007) +[2023-10-14 06:44:52,333][100917] Updated weights for policy 1, policy_version 40742 (0.0009) +[2023-10-14 06:44:52,709][100917] Updated weights for policy 1, policy_version 40752 (0.0008) +[2023-10-14 06:44:53,091][100917] Updated weights for policy 1, policy_version 40762 (0.0009) +[2023-10-14 06:44:53,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 83460096. Throughput: 0: 1654.4, 1: 1642.1. Samples: 20874740. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 06:44:53,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:44:54,060][100936] Updated weights for policy 0, policy_version 40740 (0.0009) +[2023-10-14 06:44:54,424][100936] Updated weights for policy 0, policy_version 40750 (0.0007) +[2023-10-14 06:44:54,800][100936] Updated weights for policy 0, policy_version 40760 (0.0008) +[2023-10-14 06:44:57,148][100917] Updated weights for policy 1, policy_version 40772 (0.0010) +[2023-10-14 06:44:57,511][100917] Updated weights for policy 1, policy_version 40782 (0.0009) +[2023-10-14 06:44:57,880][100917] Updated weights for policy 1, policy_version 40792 (0.0010) +[2023-10-14 06:44:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 83525632. Throughput: 0: 1652.4, 1: 1662.7. Samples: 20884544. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 06:44:58,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:44:59,124][100936] Updated weights for policy 0, policy_version 40770 (0.0008) +[2023-10-14 06:44:59,518][100936] Updated weights for policy 0, policy_version 40780 (0.0009) +[2023-10-14 06:44:59,892][100936] Updated weights for policy 0, policy_version 40790 (0.0009) +[2023-10-14 06:45:00,256][100936] Updated weights for policy 0, policy_version 40800 (0.0009) +[2023-10-14 06:45:02,157][100917] Updated weights for policy 1, policy_version 40802 (0.0008) +[2023-10-14 06:45:02,565][100917] Updated weights for policy 1, policy_version 40812 (0.0008) +[2023-10-14 06:45:02,946][100917] Updated weights for policy 1, policy_version 40822 (0.0008) +[2023-10-14 06:45:03,317][100917] Updated weights for policy 1, policy_version 40832 (0.0007) +[2023-10-14 06:45:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 83591168. Throughput: 0: 1650.2, 1: 1667.1. Samples: 20904818. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 06:45:03,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:45:04,286][100936] Updated weights for policy 0, policy_version 40810 (0.0007) +[2023-10-14 06:45:04,650][100936] Updated weights for policy 0, policy_version 40820 (0.0007) +[2023-10-14 06:45:05,027][100936] Updated weights for policy 0, policy_version 40830 (0.0007) +[2023-10-14 06:45:07,552][100917] Updated weights for policy 1, policy_version 40842 (0.0009) +[2023-10-14 06:45:07,922][100917] Updated weights for policy 1, policy_version 40852 (0.0010) +[2023-10-14 06:45:08,300][100917] Updated weights for policy 1, policy_version 40862 (0.0010) +[2023-10-14 06:45:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 83656704. Throughput: 0: 1650.2, 1: 1647.9. Samples: 20924396. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 06:45:08,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:45:09,132][100936] Updated weights for policy 0, policy_version 40840 (0.0007) +[2023-10-14 06:45:09,495][100936] Updated weights for policy 0, policy_version 40850 (0.0008) +[2023-10-14 06:45:09,868][100936] Updated weights for policy 0, policy_version 40860 (0.0007) +[2023-10-14 06:45:12,468][100917] Updated weights for policy 1, policy_version 40872 (0.0010) +[2023-10-14 06:45:12,837][100917] Updated weights for policy 1, policy_version 40882 (0.0008) +[2023-10-14 06:45:13,205][100917] Updated weights for policy 1, policy_version 40892 (0.0007) +[2023-10-14 06:45:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 83722240. Throughput: 0: 1650.0, 1: 1663.2. Samples: 20934110. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 06:45:13,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:45:14,034][100936] Updated weights for policy 0, policy_version 40870 (0.0009) +[2023-10-14 06:45:14,400][100936] Updated weights for policy 0, policy_version 40880 (0.0009) +[2023-10-14 06:45:14,774][100936] Updated weights for policy 0, policy_version 40890 (0.0008) +[2023-10-14 06:45:17,396][100917] Updated weights for policy 1, policy_version 40902 (0.0009) +[2023-10-14 06:45:17,769][100917] Updated weights for policy 1, policy_version 40912 (0.0008) +[2023-10-14 06:45:18,151][100917] Updated weights for policy 1, policy_version 40922 (0.0009) +[2023-10-14 06:45:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 83787776. Throughput: 0: 1648.7, 1: 1663.2. Samples: 20954400. Policy #0 lag: (min: 1.0, avg: 8.5, max: 33.0) +[2023-10-14 06:45:18,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:45:18,864][100936] Updated weights for policy 0, policy_version 40900 (0.0008) +[2023-10-14 06:45:19,231][100936] Updated weights for policy 0, policy_version 40910 (0.0009) +[2023-10-14 06:45:19,589][100936] Updated weights for policy 0, policy_version 40920 (0.0007) +[2023-10-14 06:45:22,400][100917] Updated weights for policy 1, policy_version 40932 (0.0008) +[2023-10-14 06:45:22,767][100917] Updated weights for policy 1, policy_version 40942 (0.0010) +[2023-10-14 06:45:23,140][100917] Updated weights for policy 1, policy_version 40952 (0.0011) +[2023-10-14 06:45:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 83853312. Throughput: 0: 1653.3, 1: 1646.1. Samples: 20973972. Policy #0 lag: (min: 1.0, avg: 8.5, max: 33.0) +[2023-10-14 06:45:23,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:45:23,762][100936] Updated weights for policy 0, policy_version 40930 (0.0010) +[2023-10-14 06:45:24,137][100936] Updated weights for policy 0, policy_version 40940 (0.0008) +[2023-10-14 06:45:24,504][100936] Updated weights for policy 0, policy_version 40950 (0.0009) +[2023-10-14 06:45:24,877][100936] Updated weights for policy 0, policy_version 40960 (0.0009) +[2023-10-14 06:45:27,321][100917] Updated weights for policy 1, policy_version 40962 (0.0007) +[2023-10-14 06:45:27,687][100917] Updated weights for policy 1, policy_version 40972 (0.0008) +[2023-10-14 06:45:28,055][100917] Updated weights for policy 1, policy_version 40982 (0.0009) +[2023-10-14 06:45:28,425][100917] Updated weights for policy 1, policy_version 40992 (0.0009) +[2023-10-14 06:45:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 83918848. Throughput: 0: 1654.8, 1: 1648.6. Samples: 20983604. Policy #0 lag: (min: 1.0, avg: 8.5, max: 33.0) +[2023-10-14 06:45:28,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:45:28,853][100936] Updated weights for policy 0, policy_version 40970 (0.0008) +[2023-10-14 06:45:29,210][100936] Updated weights for policy 0, policy_version 40980 (0.0009) +[2023-10-14 06:45:29,581][100936] Updated weights for policy 0, policy_version 40990 (0.0010) +[2023-10-14 06:45:32,653][100917] Updated weights for policy 1, policy_version 41002 (0.0008) +[2023-10-14 06:45:33,025][100917] Updated weights for policy 1, policy_version 41012 (0.0010) +[2023-10-14 06:45:33,399][100917] Updated weights for policy 1, policy_version 41022 (0.0009) +[2023-10-14 06:45:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 83984384. Throughput: 0: 1654.2, 1: 1648.5. Samples: 21003904. Policy #0 lag: (min: 1.0, avg: 8.5, max: 33.0) +[2023-10-14 06:45:33,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:45:33,791][100936] Updated weights for policy 0, policy_version 41000 (0.0010) +[2023-10-14 06:45:34,173][100936] Updated weights for policy 0, policy_version 41010 (0.0009) +[2023-10-14 06:45:34,530][100936] Updated weights for policy 0, policy_version 41020 (0.0008) +[2023-10-14 06:45:37,446][100917] Updated weights for policy 1, policy_version 41032 (0.0010) +[2023-10-14 06:45:37,816][100917] Updated weights for policy 1, policy_version 41042 (0.0011) +[2023-10-14 06:45:38,187][100917] Updated weights for policy 1, policy_version 41052 (0.0011) +[2023-10-14 06:45:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 84049920. Throughput: 0: 1661.0, 1: 1648.0. Samples: 21023646. Policy #0 lag: (min: 1.0, avg: 8.5, max: 33.0) +[2023-10-14 06:45:38,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:45:38,521][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000041056_42041344.pth... +[2023-10-14 06:45:38,549][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000039488_40435712.pth +[2023-10-14 06:45:38,555][100936] Updated weights for policy 0, policy_version 41030 (0.0008) +[2023-10-14 06:45:38,918][100936] Updated weights for policy 0, policy_version 41040 (0.0008) +[2023-10-14 06:45:39,297][100936] Updated weights for policy 0, policy_version 41050 (0.0010) +[2023-10-14 06:45:39,517][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000041056_42041344.pth... +[2023-10-14 06:45:39,554][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000039488_40435712.pth +[2023-10-14 06:45:42,341][100917] Updated weights for policy 1, policy_version 41062 (0.0009) +[2023-10-14 06:45:42,720][100917] Updated weights for policy 1, policy_version 41072 (0.0009) +[2023-10-14 06:45:43,095][100917] Updated weights for policy 1, policy_version 41082 (0.0009) +[2023-10-14 06:45:43,472][100936] Updated weights for policy 0, policy_version 41060 (0.0010) +[2023-10-14 06:45:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 84115456. Throughput: 0: 1665.0, 1: 1645.8. Samples: 21033528. Policy #0 lag: (min: 1.0, avg: 8.5, max: 33.0) +[2023-10-14 06:45:43,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:45:43,852][100936] Updated weights for policy 0, policy_version 41070 (0.0008) +[2023-10-14 06:45:44,234][100936] Updated weights for policy 0, policy_version 41080 (0.0007) +[2023-10-14 06:45:47,410][100917] Updated weights for policy 1, policy_version 41092 (0.0010) +[2023-10-14 06:45:47,814][100917] Updated weights for policy 1, policy_version 41102 (0.0010) +[2023-10-14 06:45:48,191][100917] Updated weights for policy 1, policy_version 41112 (0.0008) +[2023-10-14 06:45:48,360][100936] Updated weights for policy 0, policy_version 41090 (0.0010) +[2023-10-14 06:45:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 84180992. Throughput: 0: 1666.3, 1: 1642.8. Samples: 21053728. Policy #0 lag: (min: 28.0, avg: 40.9, max: 60.0) +[2023-10-14 06:45:48,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:45:48,736][100936] Updated weights for policy 0, policy_version 41100 (0.0008) +[2023-10-14 06:45:49,111][100936] Updated weights for policy 0, policy_version 41110 (0.0010) +[2023-10-14 06:45:49,481][100936] Updated weights for policy 0, policy_version 41120 (0.0010) +[2023-10-14 06:45:52,153][100917] Updated weights for policy 1, policy_version 41122 (0.0010) +[2023-10-14 06:45:52,522][100917] Updated weights for policy 1, policy_version 41132 (0.0011) +[2023-10-14 06:45:52,910][100917] Updated weights for policy 1, policy_version 41142 (0.0010) +[2023-10-14 06:45:53,275][100917] Updated weights for policy 1, policy_version 41152 (0.0010) +[2023-10-14 06:45:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 84246528. Throughput: 0: 1660.6, 1: 1642.0. Samples: 21073010. Policy #0 lag: (min: 28.0, avg: 40.9, max: 60.0) +[2023-10-14 06:45:53,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:45:53,620][100936] Updated weights for policy 0, policy_version 41130 (0.0011) +[2023-10-14 06:45:53,982][100936] Updated weights for policy 0, policy_version 41140 (0.0008) +[2023-10-14 06:45:54,357][100936] Updated weights for policy 0, policy_version 41150 (0.0009) +[2023-10-14 06:45:57,298][100917] Updated weights for policy 1, policy_version 41162 (0.0008) +[2023-10-14 06:45:57,678][100917] Updated weights for policy 1, policy_version 41172 (0.0010) +[2023-10-14 06:45:58,046][100917] Updated weights for policy 1, policy_version 41182 (0.0007) +[2023-10-14 06:45:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 84312064. Throughput: 0: 1661.3, 1: 1644.5. Samples: 21082870. Policy #0 lag: (min: 28.0, avg: 40.9, max: 60.0) +[2023-10-14 06:45:58,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:45:58,518][100936] Updated weights for policy 0, policy_version 41160 (0.0007) +[2023-10-14 06:45:58,882][100936] Updated weights for policy 0, policy_version 41170 (0.0009) +[2023-10-14 06:45:59,249][100936] Updated weights for policy 0, policy_version 41180 (0.0009) +[2023-10-14 06:46:02,051][100917] Updated weights for policy 1, policy_version 41192 (0.0007) +[2023-10-14 06:46:02,418][100917] Updated weights for policy 1, policy_version 41202 (0.0008) +[2023-10-14 06:46:02,788][100917] Updated weights for policy 1, policy_version 41212 (0.0007) +[2023-10-14 06:46:03,401][100936] Updated weights for policy 0, policy_version 41190 (0.0010) +[2023-10-14 06:46:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 84377600. Throughput: 0: 1661.7, 1: 1645.0. Samples: 21103200. Policy #0 lag: (min: 28.0, avg: 40.9, max: 60.0) +[2023-10-14 06:46:03,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:46:03,771][100936] Updated weights for policy 0, policy_version 41200 (0.0011) +[2023-10-14 06:46:04,141][100936] Updated weights for policy 0, policy_version 41210 (0.0010) +[2023-10-14 06:46:07,006][100917] Updated weights for policy 1, policy_version 41222 (0.0009) +[2023-10-14 06:46:07,381][100917] Updated weights for policy 1, policy_version 41232 (0.0008) +[2023-10-14 06:46:07,760][100917] Updated weights for policy 1, policy_version 41242 (0.0009) +[2023-10-14 06:46:08,338][100936] Updated weights for policy 0, policy_version 41220 (0.0010) +[2023-10-14 06:46:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 84443136. Throughput: 0: 1655.2, 1: 1642.9. Samples: 21122388. Policy #0 lag: (min: 28.0, avg: 40.9, max: 60.0) +[2023-10-14 06:46:08,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:46:08,706][100936] Updated weights for policy 0, policy_version 41230 (0.0009) +[2023-10-14 06:46:09,070][100936] Updated weights for policy 0, policy_version 41240 (0.0010) +[2023-10-14 06:46:11,879][100917] Updated weights for policy 1, policy_version 41252 (0.0010) +[2023-10-14 06:46:12,254][100917] Updated weights for policy 1, policy_version 41262 (0.0007) +[2023-10-14 06:46:12,629][100917] Updated weights for policy 1, policy_version 41272 (0.0008) +[2023-10-14 06:46:13,145][100936] Updated weights for policy 0, policy_version 41250 (0.0009) +[2023-10-14 06:46:13,511][100936] Updated weights for policy 0, policy_version 41260 (0.0007) +[2023-10-14 06:46:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 84508672. Throughput: 0: 1660.0, 1: 1658.4. Samples: 21132928. Policy #0 lag: (min: 28.0, avg: 40.9, max: 60.0) +[2023-10-14 06:46:13,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:46:13,882][100936] Updated weights for policy 0, policy_version 41270 (0.0008) +[2023-10-14 06:46:14,251][100936] Updated weights for policy 0, policy_version 41280 (0.0009) +[2023-10-14 06:46:16,764][100917] Updated weights for policy 1, policy_version 41282 (0.0008) +[2023-10-14 06:46:17,129][100917] Updated weights for policy 1, policy_version 41292 (0.0011) +[2023-10-14 06:46:17,508][100917] Updated weights for policy 1, policy_version 41302 (0.0010) +[2023-10-14 06:46:17,890][100917] Updated weights for policy 1, policy_version 41312 (0.0008) +[2023-10-14 06:46:18,304][100936] Updated weights for policy 0, policy_version 41290 (0.0008) +[2023-10-14 06:46:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 84574208. Throughput: 0: 1660.1, 1: 1655.0. Samples: 21153084. Policy #0 lag: (min: 10.0, avg: 12.1, max: 39.0) +[2023-10-14 06:46:18,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:46:18,674][100936] Updated weights for policy 0, policy_version 41300 (0.0009) +[2023-10-14 06:46:19,058][100936] Updated weights for policy 0, policy_version 41310 (0.0010) +[2023-10-14 06:46:21,972][100917] Updated weights for policy 1, policy_version 41322 (0.0009) +[2023-10-14 06:46:22,349][100917] Updated weights for policy 1, policy_version 41332 (0.0009) +[2023-10-14 06:46:22,722][100917] Updated weights for policy 1, policy_version 41342 (0.0010) +[2023-10-14 06:46:23,274][100936] Updated weights for policy 0, policy_version 41320 (0.0010) +[2023-10-14 06:46:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 84639744. Throughput: 0: 1644.6, 1: 1654.6. Samples: 21172108. Policy #0 lag: (min: 10.0, avg: 12.1, max: 39.0) +[2023-10-14 06:46:23,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:46:23,655][100936] Updated weights for policy 0, policy_version 41330 (0.0009) +[2023-10-14 06:46:24,025][100936] Updated weights for policy 0, policy_version 41340 (0.0010) +[2023-10-14 06:46:26,712][100917] Updated weights for policy 1, policy_version 41352 (0.0009) +[2023-10-14 06:46:27,086][100917] Updated weights for policy 1, policy_version 41362 (0.0007) +[2023-10-14 06:46:27,459][100917] Updated weights for policy 1, policy_version 41372 (0.0009) +[2023-10-14 06:46:28,213][100936] Updated weights for policy 0, policy_version 41350 (0.0009) +[2023-10-14 06:46:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 84705280. Throughput: 0: 1651.7, 1: 1668.0. Samples: 21182918. Policy #0 lag: (min: 10.0, avg: 12.1, max: 39.0) +[2023-10-14 06:46:28,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:46:28,597][100936] Updated weights for policy 0, policy_version 41360 (0.0009) +[2023-10-14 06:46:28,963][100936] Updated weights for policy 0, policy_version 41370 (0.0008) +[2023-10-14 06:46:31,498][100917] Updated weights for policy 1, policy_version 41382 (0.0008) +[2023-10-14 06:46:31,875][100917] Updated weights for policy 1, policy_version 41392 (0.0007) +[2023-10-14 06:46:32,260][100917] Updated weights for policy 1, policy_version 41402 (0.0008) +[2023-10-14 06:46:33,150][100936] Updated weights for policy 0, policy_version 41380 (0.0007) +[2023-10-14 06:46:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 84770816. Throughput: 0: 1651.6, 1: 1657.9. Samples: 21202654. Policy #0 lag: (min: 10.0, avg: 12.1, max: 39.0) +[2023-10-14 06:46:33,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:46:33,519][100936] Updated weights for policy 0, policy_version 41390 (0.0008) +[2023-10-14 06:46:33,894][100936] Updated weights for policy 0, policy_version 41400 (0.0008) +[2023-10-14 06:46:36,373][100917] Updated weights for policy 1, policy_version 41412 (0.0009) +[2023-10-14 06:46:36,762][100917] Updated weights for policy 1, policy_version 41422 (0.0011) +[2023-10-14 06:46:37,138][100917] Updated weights for policy 1, policy_version 41432 (0.0009) +[2023-10-14 06:46:38,126][100936] Updated weights for policy 0, policy_version 41410 (0.0008) +[2023-10-14 06:46:38,493][100936] Updated weights for policy 0, policy_version 41420 (0.0009) +[2023-10-14 06:46:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 84836352. Throughput: 0: 1648.1, 1: 1666.4. Samples: 21222164. Policy #0 lag: (min: 10.0, avg: 12.1, max: 39.0) +[2023-10-14 06:46:38,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:46:38,869][100936] Updated weights for policy 0, policy_version 41430 (0.0008) +[2023-10-14 06:46:39,236][100936] Updated weights for policy 0, policy_version 41440 (0.0008) +[2023-10-14 06:46:41,201][100917] Updated weights for policy 1, policy_version 41442 (0.0009) +[2023-10-14 06:46:41,577][100917] Updated weights for policy 1, policy_version 41452 (0.0008) +[2023-10-14 06:46:41,943][100917] Updated weights for policy 1, policy_version 41462 (0.0007) +[2023-10-14 06:46:42,312][100917] Updated weights for policy 1, policy_version 41472 (0.0008) +[2023-10-14 06:46:43,205][100936] Updated weights for policy 0, policy_version 41450 (0.0009) +[2023-10-14 06:46:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 84901888. Throughput: 0: 1655.1, 1: 1678.1. Samples: 21232866. Policy #0 lag: (min: 10.0, avg: 12.1, max: 39.0) +[2023-10-14 06:46:43,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:46:43,584][100936] Updated weights for policy 0, policy_version 41460 (0.0008) +[2023-10-14 06:46:43,959][100936] Updated weights for policy 0, policy_version 41470 (0.0007) +[2023-10-14 06:46:46,377][100917] Updated weights for policy 1, policy_version 41482 (0.0009) +[2023-10-14 06:46:46,743][100917] Updated weights for policy 1, policy_version 41492 (0.0007) +[2023-10-14 06:46:47,117][100917] Updated weights for policy 1, policy_version 41502 (0.0007) +[2023-10-14 06:46:48,042][100936] Updated weights for policy 0, policy_version 41480 (0.0007) +[2023-10-14 06:46:48,400][100936] Updated weights for policy 0, policy_version 41490 (0.0009) +[2023-10-14 06:46:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 84967424. Throughput: 0: 1656.1, 1: 1659.8. Samples: 21252416. Policy #0 lag: (min: 31.0, avg: 33.0, max: 63.0) +[2023-10-14 06:46:48,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:46:48,776][100936] Updated weights for policy 0, policy_version 41500 (0.0010) +[2023-10-14 06:46:51,301][100917] Updated weights for policy 1, policy_version 41512 (0.0009) +[2023-10-14 06:46:51,665][100917] Updated weights for policy 1, policy_version 41522 (0.0009) +[2023-10-14 06:46:52,051][100917] Updated weights for policy 1, policy_version 41532 (0.0008) +[2023-10-14 06:46:52,793][100936] Updated weights for policy 0, policy_version 41510 (0.0009) +[2023-10-14 06:46:53,175][100936] Updated weights for policy 0, policy_version 41520 (0.0008) +[2023-10-14 06:46:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 85032960. Throughput: 0: 1644.1, 1: 1671.1. Samples: 21271572. Policy #0 lag: (min: 31.0, avg: 33.0, max: 63.0) +[2023-10-14 06:46:53,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:46:53,546][100936] Updated weights for policy 0, policy_version 41530 (0.0008) +[2023-10-14 06:46:56,131][100917] Updated weights for policy 1, policy_version 41542 (0.0008) +[2023-10-14 06:46:56,489][100917] Updated weights for policy 1, policy_version 41552 (0.0009) +[2023-10-14 06:46:56,868][100917] Updated weights for policy 1, policy_version 41562 (0.0009) +[2023-10-14 06:46:57,776][100936] Updated weights for policy 0, policy_version 41540 (0.0010) +[2023-10-14 06:46:58,146][100936] Updated weights for policy 0, policy_version 41550 (0.0010) +[2023-10-14 06:46:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 85098496. Throughput: 0: 1656.0, 1: 1667.8. Samples: 21282500. Policy #0 lag: (min: 31.0, avg: 33.0, max: 63.0) +[2023-10-14 06:46:58,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:46:58,523][100936] Updated weights for policy 0, policy_version 41560 (0.0008) +[2023-10-14 06:47:00,922][100917] Updated weights for policy 1, policy_version 41572 (0.0009) +[2023-10-14 06:47:01,298][100917] Updated weights for policy 1, policy_version 41582 (0.0010) +[2023-10-14 06:47:01,674][100917] Updated weights for policy 1, policy_version 41592 (0.0009) +[2023-10-14 06:47:02,604][100936] Updated weights for policy 0, policy_version 41570 (0.0009) +[2023-10-14 06:47:02,971][100936] Updated weights for policy 0, policy_version 41580 (0.0012) +[2023-10-14 06:47:03,342][100936] Updated weights for policy 0, policy_version 41590 (0.0007) +[2023-10-14 06:47:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 85164032. Throughput: 0: 1653.2, 1: 1652.3. Samples: 21301828. Policy #0 lag: (min: 31.0, avg: 33.0, max: 63.0) +[2023-10-14 06:47:03,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:47:03,712][100936] Updated weights for policy 0, policy_version 41600 (0.0007) +[2023-10-14 06:47:05,726][100917] Updated weights for policy 1, policy_version 41602 (0.0010) +[2023-10-14 06:47:06,097][100917] Updated weights for policy 1, policy_version 41612 (0.0009) +[2023-10-14 06:47:06,471][100917] Updated weights for policy 1, policy_version 41622 (0.0008) +[2023-10-14 06:47:06,844][100917] Updated weights for policy 1, policy_version 41632 (0.0009) +[2023-10-14 06:47:07,834][100936] Updated weights for policy 0, policy_version 41610 (0.0007) +[2023-10-14 06:47:08,216][100936] Updated weights for policy 0, policy_version 41620 (0.0009) +[2023-10-14 06:47:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 85229568. Throughput: 0: 1644.4, 1: 1670.3. Samples: 21321272. Policy #0 lag: (min: 31.0, avg: 33.0, max: 63.0) +[2023-10-14 06:47:08,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:47:08,584][100936] Updated weights for policy 0, policy_version 41630 (0.0010) +[2023-10-14 06:47:10,933][100917] Updated weights for policy 1, policy_version 41642 (0.0009) +[2023-10-14 06:47:11,306][100917] Updated weights for policy 1, policy_version 41652 (0.0009) +[2023-10-14 06:47:11,680][100917] Updated weights for policy 1, policy_version 41662 (0.0008) +[2023-10-14 06:47:12,711][100936] Updated weights for policy 0, policy_version 41640 (0.0009) +[2023-10-14 06:47:13,088][100936] Updated weights for policy 0, policy_version 41650 (0.0008) +[2023-10-14 06:47:13,452][100936] Updated weights for policy 0, policy_version 41660 (0.0007) +[2023-10-14 06:47:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 85295104. Throughput: 0: 1652.9, 1: 1659.5. Samples: 21331974. Policy #0 lag: (min: 31.0, avg: 33.0, max: 63.0) +[2023-10-14 06:47:13,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:47:15,819][100917] Updated weights for policy 1, policy_version 41672 (0.0011) +[2023-10-14 06:47:16,184][100917] Updated weights for policy 1, policy_version 41682 (0.0010) +[2023-10-14 06:47:16,553][100917] Updated weights for policy 1, policy_version 41692 (0.0010) +[2023-10-14 06:47:17,680][100936] Updated weights for policy 0, policy_version 41670 (0.0007) +[2023-10-14 06:47:18,049][100936] Updated weights for policy 0, policy_version 41680 (0.0011) +[2023-10-14 06:47:18,418][100936] Updated weights for policy 0, policy_version 41690 (0.0008) +[2023-10-14 06:47:18,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 85360640. Throughput: 0: 1654.4, 1: 1653.2. Samples: 21351494. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:47:18,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:47:20,899][100917] Updated weights for policy 1, policy_version 41702 (0.0009) +[2023-10-14 06:47:21,268][100917] Updated weights for policy 1, policy_version 41712 (0.0008) +[2023-10-14 06:47:21,642][100917] Updated weights for policy 1, policy_version 41722 (0.0010) +[2023-10-14 06:47:22,565][100936] Updated weights for policy 0, policy_version 41700 (0.0008) +[2023-10-14 06:47:22,937][100936] Updated weights for policy 0, policy_version 41710 (0.0009) +[2023-10-14 06:47:23,309][100936] Updated weights for policy 0, policy_version 41720 (0.0007) +[2023-10-14 06:47:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 85426176. Throughput: 0: 1634.9, 1: 1668.6. Samples: 21370822. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:47:23,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:47:25,718][100917] Updated weights for policy 1, policy_version 41732 (0.0009) +[2023-10-14 06:47:26,116][100917] Updated weights for policy 1, policy_version 41742 (0.0010) +[2023-10-14 06:47:26,488][100917] Updated weights for policy 1, policy_version 41752 (0.0010) +[2023-10-14 06:47:27,582][100936] Updated weights for policy 0, policy_version 41730 (0.0008) +[2023-10-14 06:47:27,959][100936] Updated weights for policy 0, policy_version 41740 (0.0009) +[2023-10-14 06:47:28,333][100936] Updated weights for policy 0, policy_version 41750 (0.0008) +[2023-10-14 06:47:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 85491712. Throughput: 0: 1647.4, 1: 1653.5. Samples: 21381406. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:47:28,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 06:47:28,715][100936] Updated weights for policy 0, policy_version 41760 (0.0010) +[2023-10-14 06:47:30,586][100917] Updated weights for policy 1, policy_version 41762 (0.0009) +[2023-10-14 06:47:30,961][100917] Updated weights for policy 1, policy_version 41772 (0.0009) +[2023-10-14 06:47:31,322][100917] Updated weights for policy 1, policy_version 41782 (0.0008) +[2023-10-14 06:47:31,706][100917] Updated weights for policy 1, policy_version 41792 (0.0007) +[2023-10-14 06:47:32,840][100936] Updated weights for policy 0, policy_version 41770 (0.0009) +[2023-10-14 06:47:33,221][100936] Updated weights for policy 0, policy_version 41780 (0.0009) +[2023-10-14 06:47:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 85557248. Throughput: 0: 1649.1, 1: 1653.5. Samples: 21401030. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:47:33,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 06:47:33,580][100936] Updated weights for policy 0, policy_version 41790 (0.0009) +[2023-10-14 06:47:35,655][100917] Updated weights for policy 1, policy_version 41802 (0.0009) +[2023-10-14 06:47:36,030][100917] Updated weights for policy 1, policy_version 41812 (0.0011) +[2023-10-14 06:47:36,392][100917] Updated weights for policy 1, policy_version 41822 (0.0011) +[2023-10-14 06:47:37,875][100936] Updated weights for policy 0, policy_version 41800 (0.0010) +[2023-10-14 06:47:38,238][100936] Updated weights for policy 0, policy_version 41810 (0.0010) +[2023-10-14 06:47:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 85622784. Throughput: 0: 1645.8, 1: 1663.7. Samples: 21420500. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:47:38,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 06:47:38,525][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000041824_42827776.pth... +[2023-10-14 06:47:38,559][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000040288_41254912.pth +[2023-10-14 06:47:38,616][100936] Updated weights for policy 0, policy_version 41820 (0.0009) +[2023-10-14 06:47:38,761][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000041824_42827776.pth... +[2023-10-14 06:47:38,800][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000040256_41222144.pth +[2023-10-14 06:47:40,431][100917] Updated weights for policy 1, policy_version 41832 (0.0010) +[2023-10-14 06:47:40,809][100917] Updated weights for policy 1, policy_version 41842 (0.0009) +[2023-10-14 06:47:41,173][100917] Updated weights for policy 1, policy_version 41852 (0.0008) +[2023-10-14 06:47:42,841][100936] Updated weights for policy 0, policy_version 41830 (0.0011) +[2023-10-14 06:47:43,207][100936] Updated weights for policy 0, policy_version 41840 (0.0008) +[2023-10-14 06:47:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 85688320. Throughput: 0: 1648.9, 1: 1647.3. Samples: 21430832. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:47:43,512][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 06:47:43,581][100936] Updated weights for policy 0, policy_version 41850 (0.0009) +[2023-10-14 06:47:45,559][100917] Updated weights for policy 1, policy_version 41862 (0.0008) +[2023-10-14 06:47:45,936][100917] Updated weights for policy 1, policy_version 41872 (0.0010) +[2023-10-14 06:47:46,294][100917] Updated weights for policy 1, policy_version 41882 (0.0008) +[2023-10-14 06:47:47,492][100936] Updated weights for policy 0, policy_version 41860 (0.0008) +[2023-10-14 06:47:47,857][100936] Updated weights for policy 0, policy_version 41870 (0.0010) +[2023-10-14 06:47:48,224][100936] Updated weights for policy 0, policy_version 41880 (0.0009) +[2023-10-14 06:47:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 85753856. Throughput: 0: 1649.7, 1: 1652.9. Samples: 21450442. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 06:47:48,512][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 06:47:50,452][100917] Updated weights for policy 1, policy_version 41892 (0.0007) +[2023-10-14 06:47:50,824][100917] Updated weights for policy 1, policy_version 41902 (0.0008) +[2023-10-14 06:47:51,196][100917] Updated weights for policy 1, policy_version 41912 (0.0008) +[2023-10-14 06:47:52,381][100936] Updated weights for policy 0, policy_version 41890 (0.0009) +[2023-10-14 06:47:52,751][100936] Updated weights for policy 0, policy_version 41900 (0.0007) +[2023-10-14 06:47:53,110][100936] Updated weights for policy 0, policy_version 41910 (0.0008) +[2023-10-14 06:47:53,485][100936] Updated weights for policy 0, policy_version 41920 (0.0007) +[2023-10-14 06:47:53,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 85852160. Throughput: 0: 1645.9, 1: 1655.9. Samples: 21469850. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 06:47:53,513][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 06:47:55,323][100917] Updated weights for policy 1, policy_version 41922 (0.0009) +[2023-10-14 06:47:55,688][100917] Updated weights for policy 1, policy_version 41932 (0.0009) +[2023-10-14 06:47:56,067][100917] Updated weights for policy 1, policy_version 41942 (0.0011) +[2023-10-14 06:47:56,441][100917] Updated weights for policy 1, policy_version 41952 (0.0009) +[2023-10-14 06:47:57,632][100936] Updated weights for policy 0, policy_version 41930 (0.0009) +[2023-10-14 06:47:58,007][100936] Updated weights for policy 0, policy_version 41940 (0.0007) +[2023-10-14 06:47:58,379][100936] Updated weights for policy 0, policy_version 41950 (0.0008) +[2023-10-14 06:47:58,512][99942] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 85917696. Throughput: 0: 1654.7, 1: 1645.1. Samples: 21480468. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 06:47:58,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:48:00,623][100917] Updated weights for policy 1, policy_version 41962 (0.0007) +[2023-10-14 06:48:00,992][100917] Updated weights for policy 1, policy_version 41972 (0.0008) +[2023-10-14 06:48:01,365][100917] Updated weights for policy 1, policy_version 41982 (0.0009) +[2023-10-14 06:48:02,392][100936] Updated weights for policy 0, policy_version 41960 (0.0007) +[2023-10-14 06:48:02,773][100936] Updated weights for policy 0, policy_version 41970 (0.0007) +[2023-10-14 06:48:03,145][100936] Updated weights for policy 0, policy_version 41980 (0.0007) +[2023-10-14 06:48:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 85983232. Throughput: 0: 1646.9, 1: 1652.8. Samples: 21499984. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 06:48:03,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:48:05,436][100917] Updated weights for policy 1, policy_version 41992 (0.0008) +[2023-10-14 06:48:05,814][100917] Updated weights for policy 1, policy_version 42002 (0.0008) +[2023-10-14 06:48:06,193][100917] Updated weights for policy 1, policy_version 42012 (0.0009) +[2023-10-14 06:48:07,268][100936] Updated weights for policy 0, policy_version 41990 (0.0010) +[2023-10-14 06:48:07,639][100936] Updated weights for policy 0, policy_version 42000 (0.0010) +[2023-10-14 06:48:08,017][100936] Updated weights for policy 0, policy_version 42010 (0.0007) +[2023-10-14 06:48:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 86048768. Throughput: 0: 1652.4, 1: 1653.2. Samples: 21519578. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 06:48:08,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:48:10,158][100917] Updated weights for policy 1, policy_version 42022 (0.0010) +[2023-10-14 06:48:10,523][100917] Updated weights for policy 1, policy_version 42032 (0.0008) +[2023-10-14 06:48:10,907][100917] Updated weights for policy 1, policy_version 42042 (0.0007) +[2023-10-14 06:48:11,980][100936] Updated weights for policy 0, policy_version 42020 (0.0009) +[2023-10-14 06:48:12,357][100936] Updated weights for policy 0, policy_version 42030 (0.0007) +[2023-10-14 06:48:12,732][100936] Updated weights for policy 0, policy_version 42040 (0.0009) +[2023-10-14 06:48:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 86114304. Throughput: 0: 1665.2, 1: 1642.6. Samples: 21530256. Policy #0 lag: (min: 31.0, avg: 35.7, max: 63.0) +[2023-10-14 06:48:13,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:48:15,248][100917] Updated weights for policy 1, policy_version 42052 (0.0008) +[2023-10-14 06:48:15,623][100917] Updated weights for policy 1, policy_version 42062 (0.0007) +[2023-10-14 06:48:16,010][100917] Updated weights for policy 1, policy_version 42072 (0.0011) +[2023-10-14 06:48:16,909][100936] Updated weights for policy 0, policy_version 42050 (0.0008) +[2023-10-14 06:48:17,292][100936] Updated weights for policy 0, policy_version 42060 (0.0010) +[2023-10-14 06:48:17,650][100936] Updated weights for policy 0, policy_version 42070 (0.0008) +[2023-10-14 06:48:18,026][100936] Updated weights for policy 0, policy_version 42080 (0.0007) +[2023-10-14 06:48:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 86179840. Throughput: 0: 1649.4, 1: 1659.7. Samples: 21549938. Policy #0 lag: (min: 31.0, avg: 35.7, max: 63.0) +[2023-10-14 06:48:18,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:48:20,164][100917] Updated weights for policy 1, policy_version 42082 (0.0007) +[2023-10-14 06:48:20,580][100917] Updated weights for policy 1, policy_version 42092 (0.0007) +[2023-10-14 06:48:20,945][100917] Updated weights for policy 1, policy_version 42102 (0.0007) +[2023-10-14 06:48:21,321][100917] Updated weights for policy 1, policy_version 42112 (0.0010) +[2023-10-14 06:48:22,009][100936] Updated weights for policy 0, policy_version 42090 (0.0008) +[2023-10-14 06:48:22,386][100936] Updated weights for policy 0, policy_version 42100 (0.0008) +[2023-10-14 06:48:22,758][100936] Updated weights for policy 0, policy_version 42110 (0.0008) +[2023-10-14 06:48:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 86245376. Throughput: 0: 1658.3, 1: 1659.7. Samples: 21569812. Policy #0 lag: (min: 31.0, avg: 35.7, max: 63.0) +[2023-10-14 06:48:23,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:48:25,315][100917] Updated weights for policy 1, policy_version 42122 (0.0011) +[2023-10-14 06:48:25,679][100917] Updated weights for policy 1, policy_version 42132 (0.0010) +[2023-10-14 06:48:26,050][100917] Updated weights for policy 1, policy_version 42142 (0.0010) +[2023-10-14 06:48:26,904][100936] Updated weights for policy 0, policy_version 42120 (0.0007) +[2023-10-14 06:48:27,266][100936] Updated weights for policy 0, policy_version 42130 (0.0009) +[2023-10-14 06:48:27,642][100936] Updated weights for policy 0, policy_version 42140 (0.0008) +[2023-10-14 06:48:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 86310912. Throughput: 0: 1666.3, 1: 1652.0. Samples: 21580156. Policy #0 lag: (min: 31.0, avg: 35.7, max: 63.0) +[2023-10-14 06:48:28,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:48:30,131][100917] Updated weights for policy 1, policy_version 42152 (0.0008) +[2023-10-14 06:48:30,513][100917] Updated weights for policy 1, policy_version 42162 (0.0007) +[2023-10-14 06:48:30,883][100917] Updated weights for policy 1, policy_version 42172 (0.0007) +[2023-10-14 06:48:31,799][100936] Updated weights for policy 0, policy_version 42150 (0.0008) +[2023-10-14 06:48:32,178][100936] Updated weights for policy 0, policy_version 42160 (0.0007) +[2023-10-14 06:48:32,557][100936] Updated weights for policy 0, policy_version 42170 (0.0007) +[2023-10-14 06:48:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 86376448. Throughput: 0: 1651.1, 1: 1661.9. Samples: 21599528. Policy #0 lag: (min: 31.0, avg: 35.7, max: 63.0) +[2023-10-14 06:48:33,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:48:35,040][100917] Updated weights for policy 1, policy_version 42182 (0.0009) +[2023-10-14 06:48:35,417][100917] Updated weights for policy 1, policy_version 42192 (0.0007) +[2023-10-14 06:48:35,796][100917] Updated weights for policy 1, policy_version 42202 (0.0010) +[2023-10-14 06:48:36,846][100936] Updated weights for policy 0, policy_version 42180 (0.0009) +[2023-10-14 06:48:37,219][100936] Updated weights for policy 0, policy_version 42190 (0.0008) +[2023-10-14 06:48:37,592][100936] Updated weights for policy 0, policy_version 42200 (0.0009) +[2023-10-14 06:48:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 86441984. Throughput: 0: 1662.8, 1: 1660.9. Samples: 21619414. Policy #0 lag: (min: 31.0, avg: 35.7, max: 63.0) +[2023-10-14 06:48:38,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:48:39,918][100917] Updated weights for policy 1, policy_version 42212 (0.0010) +[2023-10-14 06:48:40,297][100917] Updated weights for policy 1, policy_version 42222 (0.0007) +[2023-10-14 06:48:40,671][100917] Updated weights for policy 1, policy_version 42232 (0.0008) +[2023-10-14 06:48:41,608][100936] Updated weights for policy 0, policy_version 42210 (0.0008) +[2023-10-14 06:48:41,968][100936] Updated weights for policy 0, policy_version 42220 (0.0007) +[2023-10-14 06:48:42,339][100936] Updated weights for policy 0, policy_version 42230 (0.0007) +[2023-10-14 06:48:42,709][100936] Updated weights for policy 0, policy_version 42240 (0.0007) +[2023-10-14 06:48:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 86507520. Throughput: 0: 1664.6, 1: 1654.8. Samples: 21629838. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:48:43,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:48:44,769][100917] Updated weights for policy 1, policy_version 42242 (0.0007) +[2023-10-14 06:48:45,143][100917] Updated weights for policy 1, policy_version 42252 (0.0007) +[2023-10-14 06:48:45,510][100917] Updated weights for policy 1, policy_version 42262 (0.0007) +[2023-10-14 06:48:45,887][100917] Updated weights for policy 1, policy_version 42272 (0.0007) +[2023-10-14 06:48:46,987][100936] Updated weights for policy 0, policy_version 42250 (0.0009) +[2023-10-14 06:48:47,363][100936] Updated weights for policy 0, policy_version 42260 (0.0009) +[2023-10-14 06:48:47,735][100936] Updated weights for policy 0, policy_version 42270 (0.0009) +[2023-10-14 06:48:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 86573056. Throughput: 0: 1651.5, 1: 1663.8. Samples: 21649172. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:48:48,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:48:49,966][100917] Updated weights for policy 1, policy_version 42282 (0.0009) +[2023-10-14 06:48:50,336][100917] Updated weights for policy 1, policy_version 42292 (0.0010) +[2023-10-14 06:48:50,714][100917] Updated weights for policy 1, policy_version 42302 (0.0009) +[2023-10-14 06:48:51,757][100936] Updated weights for policy 0, policy_version 42280 (0.0010) +[2023-10-14 06:48:52,122][100936] Updated weights for policy 0, policy_version 42290 (0.0009) +[2023-10-14 06:48:52,495][100936] Updated weights for policy 0, policy_version 42300 (0.0007) +[2023-10-14 06:48:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 86638592. Throughput: 0: 1662.0, 1: 1659.2. Samples: 21669034. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:48:53,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:48:54,862][100917] Updated weights for policy 1, policy_version 42312 (0.0008) +[2023-10-14 06:48:55,228][100917] Updated weights for policy 1, policy_version 42322 (0.0011) +[2023-10-14 06:48:55,616][100917] Updated weights for policy 1, policy_version 42332 (0.0011) +[2023-10-14 06:48:56,474][100936] Updated weights for policy 0, policy_version 42310 (0.0008) +[2023-10-14 06:48:56,841][100936] Updated weights for policy 0, policy_version 42320 (0.0008) +[2023-10-14 06:48:57,217][100936] Updated weights for policy 0, policy_version 42330 (0.0009) +[2023-10-14 06:48:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 86704128. Throughput: 0: 1657.5, 1: 1654.0. Samples: 21679272. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:48:58,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:48:59,625][100917] Updated weights for policy 1, policy_version 42342 (0.0008) +[2023-10-14 06:49:00,004][100917] Updated weights for policy 1, policy_version 42352 (0.0009) +[2023-10-14 06:49:00,374][100917] Updated weights for policy 1, policy_version 42362 (0.0008) +[2023-10-14 06:49:01,454][100936] Updated weights for policy 0, policy_version 42340 (0.0010) +[2023-10-14 06:49:01,822][100936] Updated weights for policy 0, policy_version 42350 (0.0010) +[2023-10-14 06:49:02,203][100936] Updated weights for policy 0, policy_version 42360 (0.0008) +[2023-10-14 06:49:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 86769664. Throughput: 0: 1654.4, 1: 1659.6. Samples: 21699070. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:49:03,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:49:04,377][100917] Updated weights for policy 1, policy_version 42372 (0.0010) +[2023-10-14 06:49:04,747][100917] Updated weights for policy 1, policy_version 42382 (0.0010) +[2023-10-14 06:49:05,120][100917] Updated weights for policy 1, policy_version 42392 (0.0009) +[2023-10-14 06:49:06,305][100936] Updated weights for policy 0, policy_version 42370 (0.0008) +[2023-10-14 06:49:06,678][100936] Updated weights for policy 0, policy_version 42380 (0.0008) +[2023-10-14 06:49:07,058][100936] Updated weights for policy 0, policy_version 42390 (0.0009) +[2023-10-14 06:49:07,422][100936] Updated weights for policy 0, policy_version 42400 (0.0008) +[2023-10-14 06:49:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 86835200. Throughput: 0: 1663.1, 1: 1657.2. Samples: 21719224. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:49:08,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:49:09,407][100917] Updated weights for policy 1, policy_version 42402 (0.0009) +[2023-10-14 06:49:09,814][100917] Updated weights for policy 1, policy_version 42412 (0.0009) +[2023-10-14 06:49:10,183][100917] Updated weights for policy 1, policy_version 42422 (0.0008) +[2023-10-14 06:49:10,561][100917] Updated weights for policy 1, policy_version 42432 (0.0007) +[2023-10-14 06:49:11,367][100936] Updated weights for policy 0, policy_version 42410 (0.0010) +[2023-10-14 06:49:11,745][100936] Updated weights for policy 0, policy_version 42420 (0.0009) +[2023-10-14 06:49:12,108][100936] Updated weights for policy 0, policy_version 42430 (0.0008) +[2023-10-14 06:49:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 86900736. Throughput: 0: 1658.0, 1: 1652.1. Samples: 21729112. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) +[2023-10-14 06:49:13,514][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:49:14,643][100917] Updated weights for policy 1, policy_version 42442 (0.0010) +[2023-10-14 06:49:15,019][100917] Updated weights for policy 1, policy_version 42452 (0.0008) +[2023-10-14 06:49:15,400][100917] Updated weights for policy 1, policy_version 42462 (0.0010) +[2023-10-14 06:49:16,019][100936] Updated weights for policy 0, policy_version 42440 (0.0009) +[2023-10-14 06:49:16,391][100936] Updated weights for policy 0, policy_version 42450 (0.0010) +[2023-10-14 06:49:16,754][100936] Updated weights for policy 0, policy_version 42460 (0.0007) +[2023-10-14 06:49:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 86966272. Throughput: 0: 1659.9, 1: 1659.9. Samples: 21748920. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) +[2023-10-14 06:49:18,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:49:19,554][100917] Updated weights for policy 1, policy_version 42472 (0.0011) +[2023-10-14 06:49:19,927][100917] Updated weights for policy 1, policy_version 42482 (0.0011) +[2023-10-14 06:49:20,300][100917] Updated weights for policy 1, policy_version 42492 (0.0010) +[2023-10-14 06:49:20,914][100936] Updated weights for policy 0, policy_version 42470 (0.0007) +[2023-10-14 06:49:21,272][100936] Updated weights for policy 0, policy_version 42480 (0.0008) +[2023-10-14 06:49:21,641][100936] Updated weights for policy 0, policy_version 42490 (0.0007) +[2023-10-14 06:49:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 87031808. Throughput: 0: 1671.4, 1: 1664.3. Samples: 21769524. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) +[2023-10-14 06:49:23,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:49:24,327][100917] Updated weights for policy 1, policy_version 42502 (0.0008) +[2023-10-14 06:49:24,701][100917] Updated weights for policy 1, policy_version 42512 (0.0008) +[2023-10-14 06:49:25,064][100917] Updated weights for policy 1, policy_version 42522 (0.0008) +[2023-10-14 06:49:25,746][100936] Updated weights for policy 0, policy_version 42500 (0.0009) +[2023-10-14 06:49:26,125][100936] Updated weights for policy 0, policy_version 42510 (0.0009) +[2023-10-14 06:49:26,492][100936] Updated weights for policy 0, policy_version 42520 (0.0007) +[2023-10-14 06:49:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 87097344. Throughput: 0: 1659.2, 1: 1660.0. Samples: 21779204. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) +[2023-10-14 06:49:28,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:49:29,168][100917] Updated weights for policy 1, policy_version 42532 (0.0009) +[2023-10-14 06:49:29,546][100917] Updated weights for policy 1, policy_version 42542 (0.0007) +[2023-10-14 06:49:29,914][100917] Updated weights for policy 1, policy_version 42552 (0.0008) +[2023-10-14 06:49:30,666][100936] Updated weights for policy 0, policy_version 42530 (0.0008) +[2023-10-14 06:49:31,046][100936] Updated weights for policy 0, policy_version 42540 (0.0009) +[2023-10-14 06:49:31,412][100936] Updated weights for policy 0, policy_version 42550 (0.0010) +[2023-10-14 06:49:31,791][100936] Updated weights for policy 0, policy_version 42560 (0.0008) +[2023-10-14 06:49:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 87162880. Throughput: 0: 1668.3, 1: 1666.6. Samples: 21799242. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) +[2023-10-14 06:49:33,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 06:49:33,861][100917] Updated weights for policy 1, policy_version 42562 (0.0008) +[2023-10-14 06:49:34,229][100917] Updated weights for policy 1, policy_version 42572 (0.0009) +[2023-10-14 06:49:34,607][100917] Updated weights for policy 1, policy_version 42582 (0.0009) +[2023-10-14 06:49:34,975][100917] Updated weights for policy 1, policy_version 42592 (0.0007) +[2023-10-14 06:49:36,026][100936] Updated weights for policy 0, policy_version 42570 (0.0008) +[2023-10-14 06:49:36,409][100936] Updated weights for policy 0, policy_version 42580 (0.0010) +[2023-10-14 06:49:36,783][100936] Updated weights for policy 0, policy_version 42590 (0.0008) +[2023-10-14 06:49:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 87228416. Throughput: 0: 1676.0, 1: 1672.4. Samples: 21819714. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) +[2023-10-14 06:49:38,513][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 06:49:38,521][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000042592_43614208.pth... +[2023-10-14 06:49:38,521][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000042592_43614208.pth... +[2023-10-14 06:49:38,553][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000041056_42041344.pth +[2023-10-14 06:49:38,561][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000041056_42041344.pth +[2023-10-14 06:49:39,062][100917] Updated weights for policy 1, policy_version 42602 (0.0007) +[2023-10-14 06:49:39,445][100917] Updated weights for policy 1, policy_version 42612 (0.0008) +[2023-10-14 06:49:39,812][100917] Updated weights for policy 1, policy_version 42622 (0.0010) +[2023-10-14 06:49:40,849][100936] Updated weights for policy 0, policy_version 42600 (0.0008) +[2023-10-14 06:49:41,223][100936] Updated weights for policy 0, policy_version 42610 (0.0009) +[2023-10-14 06:49:41,589][100936] Updated weights for policy 0, policy_version 42620 (0.0007) +[2023-10-14 06:49:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 87293952. Throughput: 0: 1654.4, 1: 1671.0. Samples: 21828918. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:49:43,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 06:49:43,954][100917] Updated weights for policy 1, policy_version 42632 (0.0009) +[2023-10-14 06:49:44,323][100917] Updated weights for policy 1, policy_version 42642 (0.0009) +[2023-10-14 06:49:44,685][100917] Updated weights for policy 1, policy_version 42652 (0.0010) +[2023-10-14 06:49:45,770][100936] Updated weights for policy 0, policy_version 42630 (0.0009) +[2023-10-14 06:49:46,135][100936] Updated weights for policy 0, policy_version 42640 (0.0009) +[2023-10-14 06:49:46,508][100936] Updated weights for policy 0, policy_version 42650 (0.0007) +[2023-10-14 06:49:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 87359488. Throughput: 0: 1661.5, 1: 1670.4. Samples: 21849004. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:49:48,512][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 06:49:48,881][100917] Updated weights for policy 1, policy_version 42662 (0.0011) +[2023-10-14 06:49:49,250][100917] Updated weights for policy 1, policy_version 42672 (0.0009) +[2023-10-14 06:49:49,629][100917] Updated weights for policy 1, policy_version 42682 (0.0008) +[2023-10-14 06:49:50,675][100936] Updated weights for policy 0, policy_version 42660 (0.0009) +[2023-10-14 06:49:51,055][100936] Updated weights for policy 0, policy_version 42670 (0.0008) +[2023-10-14 06:49:51,425][100936] Updated weights for policy 0, policy_version 42680 (0.0009) +[2023-10-14 06:49:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 87425024. Throughput: 0: 1659.2, 1: 1676.4. Samples: 21869326. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:49:53,512][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 06:49:53,685][100917] Updated weights for policy 1, policy_version 42692 (0.0009) +[2023-10-14 06:49:54,079][100917] Updated weights for policy 1, policy_version 42702 (0.0008) +[2023-10-14 06:49:54,461][100917] Updated weights for policy 1, policy_version 42712 (0.0008) +[2023-10-14 06:49:55,540][100936] Updated weights for policy 0, policy_version 42690 (0.0009) +[2023-10-14 06:49:55,907][100936] Updated weights for policy 0, policy_version 42700 (0.0007) +[2023-10-14 06:49:56,274][100936] Updated weights for policy 0, policy_version 42710 (0.0010) +[2023-10-14 06:49:56,645][100936] Updated weights for policy 0, policy_version 42720 (0.0011) +[2023-10-14 06:49:58,501][100917] Updated weights for policy 1, policy_version 42722 (0.0010) +[2023-10-14 06:49:58,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 87490560. Throughput: 0: 1643.1, 1: 1674.6. Samples: 21878410. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:49:58,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 06:49:58,883][100917] Updated weights for policy 1, policy_version 42732 (0.0009) +[2023-10-14 06:49:59,251][100917] Updated weights for policy 1, policy_version 42742 (0.0009) +[2023-10-14 06:49:59,630][100917] Updated weights for policy 1, policy_version 42752 (0.0009) +[2023-10-14 06:50:00,840][100936] Updated weights for policy 0, policy_version 42730 (0.0008) +[2023-10-14 06:50:01,212][100936] Updated weights for policy 0, policy_version 42740 (0.0009) +[2023-10-14 06:50:01,581][100936] Updated weights for policy 0, policy_version 42750 (0.0009) +[2023-10-14 06:50:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 87556096. Throughput: 0: 1653.7, 1: 1676.7. Samples: 21898788. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:50:03,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 06:50:03,686][100917] Updated weights for policy 1, policy_version 42762 (0.0009) +[2023-10-14 06:50:04,070][100917] Updated weights for policy 1, policy_version 42772 (0.0007) +[2023-10-14 06:50:04,438][100917] Updated weights for policy 1, policy_version 42782 (0.0008) +[2023-10-14 06:50:05,748][100936] Updated weights for policy 0, policy_version 42760 (0.0009) +[2023-10-14 06:50:06,122][100936] Updated weights for policy 0, policy_version 42770 (0.0010) +[2023-10-14 06:50:06,499][100936] Updated weights for policy 0, policy_version 42780 (0.0010) +[2023-10-14 06:50:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 87621632. Throughput: 0: 1653.5, 1: 1676.3. Samples: 21919362. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:50:08,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:50:08,543][100917] Updated weights for policy 1, policy_version 42792 (0.0010) +[2023-10-14 06:50:08,906][100917] Updated weights for policy 1, policy_version 42802 (0.0008) +[2023-10-14 06:50:09,282][100917] Updated weights for policy 1, policy_version 42812 (0.0011) +[2023-10-14 06:50:10,661][100936] Updated weights for policy 0, policy_version 42790 (0.0007) +[2023-10-14 06:50:11,034][100936] Updated weights for policy 0, policy_version 42800 (0.0007) +[2023-10-14 06:50:11,397][100936] Updated weights for policy 0, policy_version 42810 (0.0010) +[2023-10-14 06:50:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 87687168. Throughput: 0: 1640.8, 1: 1676.0. Samples: 21928460. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 06:50:13,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:50:13,536][100917] Updated weights for policy 1, policy_version 42822 (0.0008) +[2023-10-14 06:50:13,921][100917] Updated weights for policy 1, policy_version 42832 (0.0008) +[2023-10-14 06:50:14,291][100917] Updated weights for policy 1, policy_version 42842 (0.0010) +[2023-10-14 06:50:15,499][100936] Updated weights for policy 0, policy_version 42820 (0.0009) +[2023-10-14 06:50:15,862][100936] Updated weights for policy 0, policy_version 42830 (0.0010) +[2023-10-14 06:50:16,237][100936] Updated weights for policy 0, policy_version 42840 (0.0009) +[2023-10-14 06:50:18,485][100917] Updated weights for policy 1, policy_version 42852 (0.0008) +[2023-10-14 06:50:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 87752704. Throughput: 0: 1650.9, 1: 1665.5. Samples: 21948476. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 06:50:18,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:50:18,871][100917] Updated weights for policy 1, policy_version 42862 (0.0007) +[2023-10-14 06:50:19,243][100917] Updated weights for policy 1, policy_version 42872 (0.0007) +[2023-10-14 06:50:20,484][100936] Updated weights for policy 0, policy_version 42850 (0.0007) +[2023-10-14 06:50:20,880][100936] Updated weights for policy 0, policy_version 42860 (0.0009) +[2023-10-14 06:50:21,239][100936] Updated weights for policy 0, policy_version 42870 (0.0009) +[2023-10-14 06:50:21,608][100936] Updated weights for policy 0, policy_version 42880 (0.0010) +[2023-10-14 06:50:23,425][100917] Updated weights for policy 1, policy_version 42882 (0.0008) +[2023-10-14 06:50:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 87818240. Throughput: 0: 1653.9, 1: 1661.6. Samples: 21968912. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 06:50:23,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:50:23,794][100917] Updated weights for policy 1, policy_version 42892 (0.0008) +[2023-10-14 06:50:24,163][100917] Updated weights for policy 1, policy_version 42902 (0.0011) +[2023-10-14 06:50:24,545][100917] Updated weights for policy 1, policy_version 42912 (0.0010) +[2023-10-14 06:50:25,537][100936] Updated weights for policy 0, policy_version 42890 (0.0007) +[2023-10-14 06:50:25,910][100936] Updated weights for policy 0, policy_version 42900 (0.0009) +[2023-10-14 06:50:26,270][100936] Updated weights for policy 0, policy_version 42910 (0.0008) +[2023-10-14 06:50:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 87883776. Throughput: 0: 1649.9, 1: 1658.0. Samples: 21977772. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 06:50:28,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:50:28,726][100917] Updated weights for policy 1, policy_version 42922 (0.0011) +[2023-10-14 06:50:29,092][100917] Updated weights for policy 1, policy_version 42932 (0.0010) +[2023-10-14 06:50:29,466][100917] Updated weights for policy 1, policy_version 42942 (0.0010) +[2023-10-14 06:50:30,293][100936] Updated weights for policy 0, policy_version 42920 (0.0010) +[2023-10-14 06:50:30,656][100936] Updated weights for policy 0, policy_version 42930 (0.0011) +[2023-10-14 06:50:31,032][100936] Updated weights for policy 0, policy_version 42940 (0.0009) +[2023-10-14 06:50:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 87949312. Throughput: 0: 1660.5, 1: 1657.2. Samples: 21998302. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 06:50:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:50:33,569][100917] Updated weights for policy 1, policy_version 42952 (0.0008) +[2023-10-14 06:50:33,947][100917] Updated weights for policy 1, policy_version 42962 (0.0007) +[2023-10-14 06:50:34,320][100917] Updated weights for policy 1, policy_version 42972 (0.0007) +[2023-10-14 06:50:35,257][100936] Updated weights for policy 0, policy_version 42950 (0.0009) +[2023-10-14 06:50:35,623][100936] Updated weights for policy 0, policy_version 42960 (0.0010) +[2023-10-14 06:50:35,990][100936] Updated weights for policy 0, policy_version 42970 (0.0008) +[2023-10-14 06:50:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 88014848. Throughput: 0: 1660.5, 1: 1654.1. Samples: 22018484. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 06:50:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:50:38,547][100917] Updated weights for policy 1, policy_version 42982 (0.0007) +[2023-10-14 06:50:38,919][100917] Updated weights for policy 1, policy_version 42992 (0.0007) +[2023-10-14 06:50:39,289][100917] Updated weights for policy 1, policy_version 43002 (0.0009) +[2023-10-14 06:50:40,307][100936] Updated weights for policy 0, policy_version 42980 (0.0009) +[2023-10-14 06:50:40,682][100936] Updated weights for policy 0, policy_version 42990 (0.0008) +[2023-10-14 06:50:41,058][100936] Updated weights for policy 0, policy_version 43000 (0.0007) +[2023-10-14 06:50:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 88080384. Throughput: 0: 1654.3, 1: 1659.6. Samples: 22027534. Policy #0 lag: (min: 31.0, avg: 31.2, max: 40.0) +[2023-10-14 06:50:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:50:43,519][100917] Updated weights for policy 1, policy_version 43012 (0.0008) +[2023-10-14 06:50:43,910][100917] Updated weights for policy 1, policy_version 43022 (0.0009) +[2023-10-14 06:50:44,271][100917] Updated weights for policy 1, policy_version 43032 (0.0007) +[2023-10-14 06:50:45,137][100936] Updated weights for policy 0, policy_version 43010 (0.0009) +[2023-10-14 06:50:45,511][100936] Updated weights for policy 0, policy_version 43020 (0.0007) +[2023-10-14 06:50:45,883][100936] Updated weights for policy 0, policy_version 43030 (0.0007) +[2023-10-14 06:50:46,250][100936] Updated weights for policy 0, policy_version 43040 (0.0009) +[2023-10-14 06:50:48,349][100917] Updated weights for policy 1, policy_version 43042 (0.0008) +[2023-10-14 06:50:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 88145920. Throughput: 0: 1659.8, 1: 1653.2. Samples: 22047872. Policy #0 lag: (min: 31.0, avg: 31.2, max: 40.0) +[2023-10-14 06:50:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:50:48,724][100917] Updated weights for policy 1, policy_version 43052 (0.0007) +[2023-10-14 06:50:49,089][100917] Updated weights for policy 1, policy_version 43062 (0.0007) +[2023-10-14 06:50:49,454][100917] Updated weights for policy 1, policy_version 43072 (0.0009) +[2023-10-14 06:50:50,221][100936] Updated weights for policy 0, policy_version 43050 (0.0007) +[2023-10-14 06:50:50,599][100936] Updated weights for policy 0, policy_version 43060 (0.0007) +[2023-10-14 06:50:50,965][100936] Updated weights for policy 0, policy_version 43070 (0.0009) +[2023-10-14 06:50:53,464][100917] Updated weights for policy 1, policy_version 43082 (0.0009) +[2023-10-14 06:50:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 88211456. Throughput: 0: 1658.3, 1: 1649.3. Samples: 22068206. Policy #0 lag: (min: 31.0, avg: 31.2, max: 40.0) +[2023-10-14 06:50:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:50:53,836][100917] Updated weights for policy 1, policy_version 43092 (0.0007) +[2023-10-14 06:50:54,210][100917] Updated weights for policy 1, policy_version 43102 (0.0009) +[2023-10-14 06:50:55,158][100936] Updated weights for policy 0, policy_version 43080 (0.0009) +[2023-10-14 06:50:55,525][100936] Updated weights for policy 0, policy_version 43090 (0.0009) +[2023-10-14 06:50:55,888][100936] Updated weights for policy 0, policy_version 43100 (0.0010) +[2023-10-14 06:50:58,195][100917] Updated weights for policy 1, policy_version 43112 (0.0008) +[2023-10-14 06:50:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 88276992. Throughput: 0: 1655.2, 1: 1652.2. Samples: 22077296. Policy #0 lag: (min: 31.0, avg: 31.2, max: 40.0) +[2023-10-14 06:50:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:50:58,575][100917] Updated weights for policy 1, policy_version 43122 (0.0007) +[2023-10-14 06:50:58,946][100917] Updated weights for policy 1, policy_version 43132 (0.0007) +[2023-10-14 06:51:00,276][100936] Updated weights for policy 0, policy_version 43110 (0.0009) +[2023-10-14 06:51:00,647][100936] Updated weights for policy 0, policy_version 43120 (0.0007) +[2023-10-14 06:51:01,024][100936] Updated weights for policy 0, policy_version 43130 (0.0007) +[2023-10-14 06:51:03,015][100917] Updated weights for policy 1, policy_version 43142 (0.0008) +[2023-10-14 06:51:03,398][100917] Updated weights for policy 1, policy_version 43152 (0.0008) +[2023-10-14 06:51:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 88342528. Throughput: 0: 1655.2, 1: 1662.4. Samples: 22097770. Policy #0 lag: (min: 31.0, avg: 31.2, max: 40.0) +[2023-10-14 06:51:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:51:03,767][100917] Updated weights for policy 1, policy_version 43162 (0.0007) +[2023-10-14 06:51:05,029][100936] Updated weights for policy 0, policy_version 43140 (0.0008) +[2023-10-14 06:51:05,434][100936] Updated weights for policy 0, policy_version 43150 (0.0010) +[2023-10-14 06:51:05,805][100936] Updated weights for policy 0, policy_version 43160 (0.0009) +[2023-10-14 06:51:07,907][100917] Updated weights for policy 1, policy_version 43172 (0.0009) +[2023-10-14 06:51:08,283][100917] Updated weights for policy 1, policy_version 43182 (0.0010) +[2023-10-14 06:51:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 88408064. Throughput: 0: 1652.1, 1: 1660.0. Samples: 22117956. Policy #0 lag: (min: 31.0, avg: 31.2, max: 40.0) +[2023-10-14 06:51:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:51:08,648][100917] Updated weights for policy 1, policy_version 43192 (0.0009) +[2023-10-14 06:51:09,964][100936] Updated weights for policy 0, policy_version 43170 (0.0008) +[2023-10-14 06:51:10,329][100936] Updated weights for policy 0, policy_version 43180 (0.0009) +[2023-10-14 06:51:10,696][100936] Updated weights for policy 0, policy_version 43190 (0.0009) +[2023-10-14 06:51:11,071][100936] Updated weights for policy 0, policy_version 43200 (0.0007) +[2023-10-14 06:51:12,499][100917] Updated weights for policy 1, policy_version 43202 (0.0010) +[2023-10-14 06:51:12,862][100917] Updated weights for policy 1, policy_version 43212 (0.0008) +[2023-10-14 06:51:13,246][100917] Updated weights for policy 1, policy_version 43222 (0.0007) +[2023-10-14 06:51:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 88473600. Throughput: 0: 1650.0, 1: 1671.1. Samples: 22127222. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:51:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:51:13,609][100917] Updated weights for policy 1, policy_version 43232 (0.0007) +[2023-10-14 06:51:15,108][100936] Updated weights for policy 0, policy_version 43210 (0.0010) +[2023-10-14 06:51:15,476][100936] Updated weights for policy 0, policy_version 43220 (0.0009) +[2023-10-14 06:51:15,844][100936] Updated weights for policy 0, policy_version 43230 (0.0010) +[2023-10-14 06:51:17,772][100917] Updated weights for policy 1, policy_version 43242 (0.0010) +[2023-10-14 06:51:18,136][100917] Updated weights for policy 1, policy_version 43252 (0.0008) +[2023-10-14 06:51:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 88539136. Throughput: 0: 1643.8, 1: 1671.7. Samples: 22147498. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:51:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:51:18,515][100917] Updated weights for policy 1, policy_version 43262 (0.0007) +[2023-10-14 06:51:20,175][100936] Updated weights for policy 0, policy_version 43240 (0.0009) +[2023-10-14 06:51:20,541][100936] Updated weights for policy 0, policy_version 43250 (0.0009) +[2023-10-14 06:51:20,920][100936] Updated weights for policy 0, policy_version 43260 (0.0010) +[2023-10-14 06:51:22,623][100917] Updated weights for policy 1, policy_version 43272 (0.0007) +[2023-10-14 06:51:22,991][100917] Updated weights for policy 1, policy_version 43282 (0.0007) +[2023-10-14 06:51:23,359][100917] Updated weights for policy 1, policy_version 43292 (0.0007) +[2023-10-14 06:51:23,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 88637440. Throughput: 0: 1645.5, 1: 1660.0. Samples: 22167228. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:51:23,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:51:25,063][100936] Updated weights for policy 0, policy_version 43270 (0.0010) +[2023-10-14 06:51:25,439][100936] Updated weights for policy 0, policy_version 43280 (0.0009) +[2023-10-14 06:51:25,808][100936] Updated weights for policy 0, policy_version 43290 (0.0010) +[2023-10-14 06:51:27,534][100917] Updated weights for policy 1, policy_version 43302 (0.0007) +[2023-10-14 06:51:27,914][100917] Updated weights for policy 1, policy_version 43312 (0.0009) +[2023-10-14 06:51:28,277][100917] Updated weights for policy 1, policy_version 43322 (0.0010) +[2023-10-14 06:51:28,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 88702976. Throughput: 0: 1642.9, 1: 1673.5. Samples: 22176770. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:51:28,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:51:29,986][100936] Updated weights for policy 0, policy_version 43300 (0.0009) +[2023-10-14 06:51:30,350][100936] Updated weights for policy 0, policy_version 43310 (0.0008) +[2023-10-14 06:51:30,723][100936] Updated weights for policy 0, policy_version 43320 (0.0009) +[2023-10-14 06:51:32,483][100917] Updated weights for policy 1, policy_version 43332 (0.0008) +[2023-10-14 06:51:32,874][100917] Updated weights for policy 1, policy_version 43342 (0.0007) +[2023-10-14 06:51:33,247][100917] Updated weights for policy 1, policy_version 43352 (0.0007) +[2023-10-14 06:51:33,512][99942] Fps is (10 sec: 9830.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 88735744. Throughput: 0: 1648.2, 1: 1676.7. Samples: 22197494. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:51:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:51:34,974][100936] Updated weights for policy 0, policy_version 43330 (0.0010) +[2023-10-14 06:51:35,340][100936] Updated weights for policy 0, policy_version 43340 (0.0009) +[2023-10-14 06:51:35,711][100936] Updated weights for policy 0, policy_version 43350 (0.0010) +[2023-10-14 06:51:36,081][100936] Updated weights for policy 0, policy_version 43360 (0.0010) +[2023-10-14 06:51:37,355][100917] Updated weights for policy 1, policy_version 43362 (0.0009) +[2023-10-14 06:51:37,734][100917] Updated weights for policy 1, policy_version 43372 (0.0007) +[2023-10-14 06:51:38,100][100917] Updated weights for policy 1, policy_version 43382 (0.0009) +[2023-10-14 06:51:38,471][100917] Updated weights for policy 1, policy_version 43392 (0.0011) +[2023-10-14 06:51:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 88834048. Throughput: 0: 1647.5, 1: 1658.9. Samples: 22216994. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:51:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:51:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000043360_44400640.pth... +[2023-10-14 06:51:38,522][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000043392_44433408.pth... +[2023-10-14 06:51:38,561][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000041824_42827776.pth +[2023-10-14 06:51:38,561][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000041824_42827776.pth +[2023-10-14 06:51:40,245][100936] Updated weights for policy 0, policy_version 43370 (0.0009) +[2023-10-14 06:51:40,625][100936] Updated weights for policy 0, policy_version 43380 (0.0010) +[2023-10-14 06:51:41,006][100936] Updated weights for policy 0, policy_version 43390 (0.0008) +[2023-10-14 06:51:42,537][100917] Updated weights for policy 1, policy_version 43402 (0.0009) +[2023-10-14 06:51:42,919][100917] Updated weights for policy 1, policy_version 43412 (0.0008) +[2023-10-14 06:51:43,287][100917] Updated weights for policy 1, policy_version 43422 (0.0010) +[2023-10-14 06:51:43,512][99942] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 88899584. Throughput: 0: 1644.3, 1: 1672.5. Samples: 22226550. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:51:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:51:45,198][100936] Updated weights for policy 0, policy_version 43400 (0.0008) +[2023-10-14 06:51:45,571][100936] Updated weights for policy 0, policy_version 43410 (0.0008) +[2023-10-14 06:51:45,942][100936] Updated weights for policy 0, policy_version 43420 (0.0008) +[2023-10-14 06:51:47,440][100917] Updated weights for policy 1, policy_version 43432 (0.0009) +[2023-10-14 06:51:47,806][100917] Updated weights for policy 1, policy_version 43442 (0.0009) +[2023-10-14 06:51:48,188][100917] Updated weights for policy 1, policy_version 43452 (0.0008) +[2023-10-14 06:51:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 88965120. Throughput: 0: 1648.6, 1: 1666.4. Samples: 22246946. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:51:48,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:51:49,927][100936] Updated weights for policy 0, policy_version 43430 (0.0009) +[2023-10-14 06:51:50,301][100936] Updated weights for policy 0, policy_version 43440 (0.0008) +[2023-10-14 06:51:50,671][100936] Updated weights for policy 0, policy_version 43450 (0.0007) +[2023-10-14 06:51:52,300][100917] Updated weights for policy 1, policy_version 43462 (0.0010) +[2023-10-14 06:51:52,654][100917] Updated weights for policy 1, policy_version 43472 (0.0011) +[2023-10-14 06:51:53,023][100917] Updated weights for policy 1, policy_version 43482 (0.0010) +[2023-10-14 06:51:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 89030656. Throughput: 0: 1649.0, 1: 1649.2. Samples: 22266374. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:51:53,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:51:54,991][100936] Updated weights for policy 0, policy_version 43460 (0.0007) +[2023-10-14 06:51:55,379][100936] Updated weights for policy 0, policy_version 43470 (0.0007) +[2023-10-14 06:51:55,753][100936] Updated weights for policy 0, policy_version 43480 (0.0007) +[2023-10-14 06:51:57,188][100917] Updated weights for policy 1, policy_version 43492 (0.0007) +[2023-10-14 06:51:57,563][100917] Updated weights for policy 1, policy_version 43502 (0.0008) +[2023-10-14 06:51:57,937][100917] Updated weights for policy 1, policy_version 43512 (0.0009) +[2023-10-14 06:51:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 89096192. Throughput: 0: 1643.4, 1: 1664.1. Samples: 22276060. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:51:58,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:51:59,713][100936] Updated weights for policy 0, policy_version 43490 (0.0009) +[2023-10-14 06:52:00,081][100936] Updated weights for policy 0, policy_version 43500 (0.0007) +[2023-10-14 06:52:00,447][100936] Updated weights for policy 0, policy_version 43510 (0.0007) +[2023-10-14 06:52:00,818][100936] Updated weights for policy 0, policy_version 43520 (0.0007) +[2023-10-14 06:52:01,857][100917] Updated weights for policy 1, policy_version 43522 (0.0010) +[2023-10-14 06:52:02,236][100917] Updated weights for policy 1, policy_version 43532 (0.0008) +[2023-10-14 06:52:02,606][100917] Updated weights for policy 1, policy_version 43542 (0.0007) +[2023-10-14 06:52:02,974][100917] Updated weights for policy 1, policy_version 43552 (0.0008) +[2023-10-14 06:52:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 89161728. Throughput: 0: 1653.7, 1: 1659.3. Samples: 22296584. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:52:03,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:52:05,073][100936] Updated weights for policy 0, policy_version 43530 (0.0010) +[2023-10-14 06:52:05,442][100936] Updated weights for policy 0, policy_version 43540 (0.0010) +[2023-10-14 06:52:05,817][100936] Updated weights for policy 0, policy_version 43550 (0.0009) +[2023-10-14 06:52:06,905][100917] Updated weights for policy 1, policy_version 43562 (0.0010) +[2023-10-14 06:52:07,278][100917] Updated weights for policy 1, policy_version 43572 (0.0008) +[2023-10-14 06:52:07,658][100917] Updated weights for policy 1, policy_version 43582 (0.0008) +[2023-10-14 06:52:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 89227264. Throughput: 0: 1656.7, 1: 1650.0. Samples: 22316032. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:52:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:52:10,042][100936] Updated weights for policy 0, policy_version 43560 (0.0009) +[2023-10-14 06:52:10,420][100936] Updated weights for policy 0, policy_version 43570 (0.0011) +[2023-10-14 06:52:10,783][100936] Updated weights for policy 0, policy_version 43580 (0.0011) +[2023-10-14 06:52:11,704][100917] Updated weights for policy 1, policy_version 43592 (0.0007) +[2023-10-14 06:52:12,074][100917] Updated weights for policy 1, policy_version 43602 (0.0007) +[2023-10-14 06:52:12,448][100917] Updated weights for policy 1, policy_version 43612 (0.0007) +[2023-10-14 06:52:13,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 89292800. Throughput: 0: 1655.6, 1: 1665.8. Samples: 22326234. Policy #0 lag: (min: 31.0, avg: 38.3, max: 63.0) +[2023-10-14 06:52:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:52:14,943][100936] Updated weights for policy 0, policy_version 43590 (0.0011) +[2023-10-14 06:52:15,314][100936] Updated weights for policy 0, policy_version 43600 (0.0009) +[2023-10-14 06:52:15,687][100936] Updated weights for policy 0, policy_version 43610 (0.0007) +[2023-10-14 06:52:16,712][100917] Updated weights for policy 1, policy_version 43622 (0.0008) +[2023-10-14 06:52:17,099][100917] Updated weights for policy 1, policy_version 43632 (0.0012) +[2023-10-14 06:52:17,471][100917] Updated weights for policy 1, policy_version 43642 (0.0009) +[2023-10-14 06:52:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 89358336. Throughput: 0: 1651.0, 1: 1654.4. Samples: 22346234. Policy #0 lag: (min: 31.0, avg: 38.3, max: 63.0) +[2023-10-14 06:52:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:52:19,694][100936] Updated weights for policy 0, policy_version 43620 (0.0008) +[2023-10-14 06:52:20,074][100936] Updated weights for policy 0, policy_version 43630 (0.0008) +[2023-10-14 06:52:20,429][100936] Updated weights for policy 0, policy_version 43640 (0.0008) +[2023-10-14 06:52:21,500][100917] Updated weights for policy 1, policy_version 43652 (0.0008) +[2023-10-14 06:52:21,876][100917] Updated weights for policy 1, policy_version 43662 (0.0009) +[2023-10-14 06:52:22,245][100917] Updated weights for policy 1, policy_version 43672 (0.0007) +[2023-10-14 06:52:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 89423872. Throughput: 0: 1654.6, 1: 1654.5. Samples: 22365902. Policy #0 lag: (min: 31.0, avg: 38.3, max: 63.0) +[2023-10-14 06:52:23,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:52:24,567][100936] Updated weights for policy 0, policy_version 43650 (0.0009) +[2023-10-14 06:52:24,926][100936] Updated weights for policy 0, policy_version 43660 (0.0007) +[2023-10-14 06:52:25,299][100936] Updated weights for policy 0, policy_version 43670 (0.0007) +[2023-10-14 06:52:25,672][100936] Updated weights for policy 0, policy_version 43680 (0.0007) +[2023-10-14 06:52:26,607][100917] Updated weights for policy 1, policy_version 43682 (0.0008) +[2023-10-14 06:52:26,985][100917] Updated weights for policy 1, policy_version 43692 (0.0008) +[2023-10-14 06:52:27,363][100917] Updated weights for policy 1, policy_version 43702 (0.0007) +[2023-10-14 06:52:27,731][100917] Updated weights for policy 1, policy_version 43712 (0.0011) +[2023-10-14 06:52:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13329.4). Total num frames: 89489408. Throughput: 0: 1656.4, 1: 1668.0. Samples: 22376152. Policy #0 lag: (min: 31.0, avg: 38.3, max: 63.0) +[2023-10-14 06:52:28,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:52:29,907][100936] Updated weights for policy 0, policy_version 43690 (0.0007) +[2023-10-14 06:52:30,283][100936] Updated weights for policy 0, policy_version 43700 (0.0007) +[2023-10-14 06:52:30,642][100936] Updated weights for policy 0, policy_version 43710 (0.0009) +[2023-10-14 06:52:31,678][100917] Updated weights for policy 1, policy_version 43722 (0.0009) +[2023-10-14 06:52:32,043][100917] Updated weights for policy 1, policy_version 43732 (0.0011) +[2023-10-14 06:52:32,432][100917] Updated weights for policy 1, policy_version 43742 (0.0011) +[2023-10-14 06:52:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 89554944. Throughput: 0: 1655.8, 1: 1657.3. Samples: 22396034. Policy #0 lag: (min: 31.0, avg: 38.3, max: 63.0) +[2023-10-14 06:52:33,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:52:34,592][100936] Updated weights for policy 0, policy_version 43720 (0.0011) +[2023-10-14 06:52:34,966][100936] Updated weights for policy 0, policy_version 43730 (0.0007) +[2023-10-14 06:52:35,326][100936] Updated weights for policy 0, policy_version 43740 (0.0008) +[2023-10-14 06:52:36,470][100917] Updated weights for policy 1, policy_version 43752 (0.0008) +[2023-10-14 06:52:36,837][100917] Updated weights for policy 1, policy_version 43762 (0.0009) +[2023-10-14 06:52:37,214][100917] Updated weights for policy 1, policy_version 43772 (0.0010) +[2023-10-14 06:52:38,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 89620480. Throughput: 0: 1657.5, 1: 1666.4. Samples: 22415952. Policy #0 lag: (min: 31.0, avg: 38.3, max: 63.0) +[2023-10-14 06:52:38,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:52:39,516][100936] Updated weights for policy 0, policy_version 43750 (0.0009) +[2023-10-14 06:52:39,895][100936] Updated weights for policy 0, policy_version 43760 (0.0009) +[2023-10-14 06:52:40,266][100936] Updated weights for policy 0, policy_version 43770 (0.0010) +[2023-10-14 06:52:41,345][100917] Updated weights for policy 1, policy_version 43782 (0.0009) +[2023-10-14 06:52:41,723][100917] Updated weights for policy 1, policy_version 43792 (0.0008) +[2023-10-14 06:52:42,098][100917] Updated weights for policy 1, policy_version 43802 (0.0007) +[2023-10-14 06:52:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 89686016. Throughput: 0: 1661.6, 1: 1673.8. Samples: 22426154. Policy #0 lag: (min: 11.0, avg: 11.6, max: 28.0) +[2023-10-14 06:52:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:52:44,327][100936] Updated weights for policy 0, policy_version 43780 (0.0008) +[2023-10-14 06:52:44,690][100936] Updated weights for policy 0, policy_version 43790 (0.0008) +[2023-10-14 06:52:45,055][100936] Updated weights for policy 0, policy_version 43800 (0.0007) +[2023-10-14 06:52:46,221][100917] Updated weights for policy 1, policy_version 43812 (0.0008) +[2023-10-14 06:52:46,598][100917] Updated weights for policy 1, policy_version 43822 (0.0009) +[2023-10-14 06:52:46,960][100917] Updated weights for policy 1, policy_version 43832 (0.0007) +[2023-10-14 06:52:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 89751552. Throughput: 0: 1656.7, 1: 1655.9. Samples: 22445656. Policy #0 lag: (min: 11.0, avg: 11.6, max: 28.0) +[2023-10-14 06:52:48,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:52:49,185][100936] Updated weights for policy 0, policy_version 43810 (0.0007) +[2023-10-14 06:52:49,553][100936] Updated weights for policy 0, policy_version 43820 (0.0009) +[2023-10-14 06:52:49,918][100936] Updated weights for policy 0, policy_version 43830 (0.0011) +[2023-10-14 06:52:50,290][100936] Updated weights for policy 0, policy_version 43840 (0.0007) +[2023-10-14 06:52:50,994][100917] Updated weights for policy 1, policy_version 43842 (0.0010) +[2023-10-14 06:52:51,371][100917] Updated weights for policy 1, policy_version 43852 (0.0008) +[2023-10-14 06:52:51,740][100917] Updated weights for policy 1, policy_version 43862 (0.0007) +[2023-10-14 06:52:52,102][100917] Updated weights for policy 1, policy_version 43872 (0.0009) +[2023-10-14 06:52:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 89817088. Throughput: 0: 1652.8, 1: 1666.6. Samples: 22465408. Policy #0 lag: (min: 11.0, avg: 11.6, max: 28.0) +[2023-10-14 06:52:53,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:52:54,546][100936] Updated weights for policy 0, policy_version 43850 (0.0007) +[2023-10-14 06:52:54,910][100936] Updated weights for policy 0, policy_version 43860 (0.0009) +[2023-10-14 06:52:55,280][100936] Updated weights for policy 0, policy_version 43870 (0.0008) +[2023-10-14 06:52:56,099][100917] Updated weights for policy 1, policy_version 43882 (0.0008) +[2023-10-14 06:52:56,478][100917] Updated weights for policy 1, policy_version 43892 (0.0009) +[2023-10-14 06:52:56,856][100917] Updated weights for policy 1, policy_version 43902 (0.0007) +[2023-10-14 06:52:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 89882624. Throughput: 0: 1654.4, 1: 1660.4. Samples: 22475404. Policy #0 lag: (min: 11.0, avg: 11.6, max: 28.0) +[2023-10-14 06:52:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:52:59,399][100936] Updated weights for policy 0, policy_version 43880 (0.0008) +[2023-10-14 06:52:59,760][100936] Updated weights for policy 0, policy_version 43890 (0.0008) +[2023-10-14 06:53:00,131][100936] Updated weights for policy 0, policy_version 43900 (0.0009) +[2023-10-14 06:53:01,039][100917] Updated weights for policy 1, policy_version 43912 (0.0009) +[2023-10-14 06:53:01,415][100917] Updated weights for policy 1, policy_version 43922 (0.0009) +[2023-10-14 06:53:01,797][100917] Updated weights for policy 1, policy_version 43932 (0.0009) +[2023-10-14 06:53:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 89948160. Throughput: 0: 1655.1, 1: 1653.8. Samples: 22495136. Policy #0 lag: (min: 11.0, avg: 11.6, max: 28.0) +[2023-10-14 06:53:03,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:53:04,322][100936] Updated weights for policy 0, policy_version 43910 (0.0008) +[2023-10-14 06:53:04,687][100936] Updated weights for policy 0, policy_version 43920 (0.0010) +[2023-10-14 06:53:05,064][100936] Updated weights for policy 0, policy_version 43930 (0.0010) +[2023-10-14 06:53:06,007][100917] Updated weights for policy 1, policy_version 43942 (0.0010) +[2023-10-14 06:53:06,389][100917] Updated weights for policy 1, policy_version 43952 (0.0009) +[2023-10-14 06:53:06,772][100917] Updated weights for policy 1, policy_version 43962 (0.0008) +[2023-10-14 06:53:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 90013696. Throughput: 0: 1654.6, 1: 1669.4. Samples: 22515484. Policy #0 lag: (min: 11.0, avg: 11.6, max: 28.0) +[2023-10-14 06:53:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:53:09,054][100936] Updated weights for policy 0, policy_version 43940 (0.0010) +[2023-10-14 06:53:09,421][100936] Updated weights for policy 0, policy_version 43950 (0.0007) +[2023-10-14 06:53:09,794][100936] Updated weights for policy 0, policy_version 43960 (0.0008) +[2023-10-14 06:53:10,891][100917] Updated weights for policy 1, policy_version 43972 (0.0010) +[2023-10-14 06:53:11,268][100917] Updated weights for policy 1, policy_version 43982 (0.0008) +[2023-10-14 06:53:11,631][100917] Updated weights for policy 1, policy_version 43992 (0.0008) +[2023-10-14 06:53:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 90079232. Throughput: 0: 1655.5, 1: 1662.3. Samples: 22525454. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:53:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:53:13,991][100936] Updated weights for policy 0, policy_version 43970 (0.0008) +[2023-10-14 06:53:14,356][100936] Updated weights for policy 0, policy_version 43980 (0.0008) +[2023-10-14 06:53:14,724][100936] Updated weights for policy 0, policy_version 43990 (0.0008) +[2023-10-14 06:53:15,090][100936] Updated weights for policy 0, policy_version 44000 (0.0009) +[2023-10-14 06:53:15,764][100917] Updated weights for policy 1, policy_version 44002 (0.0009) +[2023-10-14 06:53:16,136][100917] Updated weights for policy 1, policy_version 44012 (0.0009) +[2023-10-14 06:53:16,523][100917] Updated weights for policy 1, policy_version 44022 (0.0008) +[2023-10-14 06:53:16,886][100917] Updated weights for policy 1, policy_version 44032 (0.0009) +[2023-10-14 06:53:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 90144768. Throughput: 0: 1655.4, 1: 1648.7. Samples: 22544716. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:53:18,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:53:19,246][100936] Updated weights for policy 0, policy_version 44010 (0.0009) +[2023-10-14 06:53:19,632][100936] Updated weights for policy 0, policy_version 44020 (0.0008) +[2023-10-14 06:53:19,994][100936] Updated weights for policy 0, policy_version 44030 (0.0009) +[2023-10-14 06:53:20,906][100917] Updated weights for policy 1, policy_version 44042 (0.0007) +[2023-10-14 06:53:21,277][100917] Updated weights for policy 1, policy_version 44052 (0.0011) +[2023-10-14 06:53:21,648][100917] Updated weights for policy 1, policy_version 44062 (0.0008) +[2023-10-14 06:53:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 90210304. Throughput: 0: 1657.6, 1: 1661.9. Samples: 22565328. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:53:23,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:53:24,136][100936] Updated weights for policy 0, policy_version 44040 (0.0007) +[2023-10-14 06:53:24,512][100936] Updated weights for policy 0, policy_version 44050 (0.0008) +[2023-10-14 06:53:24,880][100936] Updated weights for policy 0, policy_version 44060 (0.0009) +[2023-10-14 06:53:25,803][100917] Updated weights for policy 1, policy_version 44072 (0.0008) +[2023-10-14 06:53:26,186][100917] Updated weights for policy 1, policy_version 44082 (0.0007) +[2023-10-14 06:53:26,559][100917] Updated weights for policy 1, policy_version 44092 (0.0010) +[2023-10-14 06:53:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 90275840. Throughput: 0: 1658.9, 1: 1653.1. Samples: 22575196. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:53:28,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:53:28,977][100936] Updated weights for policy 0, policy_version 44070 (0.0009) +[2023-10-14 06:53:29,356][100936] Updated weights for policy 0, policy_version 44080 (0.0009) +[2023-10-14 06:53:29,726][100936] Updated weights for policy 0, policy_version 44090 (0.0009) +[2023-10-14 06:53:30,671][100917] Updated weights for policy 1, policy_version 44102 (0.0008) +[2023-10-14 06:53:31,054][100917] Updated weights for policy 1, policy_version 44112 (0.0008) +[2023-10-14 06:53:31,426][100917] Updated weights for policy 1, policy_version 44122 (0.0009) +[2023-10-14 06:53:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 90341376. Throughput: 0: 1657.6, 1: 1657.7. Samples: 22594842. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:53:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:53:33,838][100936] Updated weights for policy 0, policy_version 44100 (0.0010) +[2023-10-14 06:53:34,216][100936] Updated weights for policy 0, policy_version 44110 (0.0008) +[2023-10-14 06:53:34,578][100936] Updated weights for policy 0, policy_version 44120 (0.0007) +[2023-10-14 06:53:35,349][100917] Updated weights for policy 1, policy_version 44132 (0.0007) +[2023-10-14 06:53:35,721][100917] Updated weights for policy 1, policy_version 44142 (0.0008) +[2023-10-14 06:53:36,098][100917] Updated weights for policy 1, policy_version 44152 (0.0009) +[2023-10-14 06:53:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 90406912. Throughput: 0: 1662.8, 1: 1672.4. Samples: 22615490. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:53:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:53:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000044128_45187072.pth... +[2023-10-14 06:53:38,522][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000044160_45219840.pth... +[2023-10-14 06:53:38,551][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000042592_43614208.pth +[2023-10-14 06:53:38,559][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000042592_43614208.pth +[2023-10-14 06:53:38,769][100936] Updated weights for policy 0, policy_version 44130 (0.0010) +[2023-10-14 06:53:39,139][100936] Updated weights for policy 0, policy_version 44140 (0.0008) +[2023-10-14 06:53:39,498][100936] Updated weights for policy 0, policy_version 44150 (0.0010) +[2023-10-14 06:53:39,877][100936] Updated weights for policy 0, policy_version 44160 (0.0009) +[2023-10-14 06:53:40,287][100917] Updated weights for policy 1, policy_version 44162 (0.0008) +[2023-10-14 06:53:40,660][100917] Updated weights for policy 1, policy_version 44172 (0.0008) +[2023-10-14 06:53:41,036][100917] Updated weights for policy 1, policy_version 44182 (0.0010) +[2023-10-14 06:53:41,406][100917] Updated weights for policy 1, policy_version 44192 (0.0009) +[2023-10-14 06:53:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 90472448. Throughput: 0: 1663.7, 1: 1659.2. Samples: 22624936. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:53:43,513][99942] Avg episode reward: [(0, '1.010'), (1, '1.000')] +[2023-10-14 06:53:43,932][100936] Updated weights for policy 0, policy_version 44170 (0.0007) +[2023-10-14 06:53:44,301][100936] Updated weights for policy 0, policy_version 44180 (0.0008) +[2023-10-14 06:53:44,679][100936] Updated weights for policy 0, policy_version 44190 (0.0009) +[2023-10-14 06:53:44,746][100560] Saving new best policy, reward=1.010! +[2023-10-14 06:53:45,445][100917] Updated weights for policy 1, policy_version 44202 (0.0007) +[2023-10-14 06:53:45,807][100917] Updated weights for policy 1, policy_version 44212 (0.0009) +[2023-10-14 06:53:46,179][100917] Updated weights for policy 1, policy_version 44222 (0.0010) +[2023-10-14 06:53:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 90537984. Throughput: 0: 1669.6, 1: 1663.9. Samples: 22645140. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:53:48,512][99942] Avg episode reward: [(0, '1.010'), (1, '1.000')] +[2023-10-14 06:53:48,734][100936] Updated weights for policy 0, policy_version 44200 (0.0008) +[2023-10-14 06:53:49,100][100936] Updated weights for policy 0, policy_version 44210 (0.0007) +[2023-10-14 06:53:49,474][100936] Updated weights for policy 0, policy_version 44220 (0.0008) +[2023-10-14 06:53:50,354][100917] Updated weights for policy 1, policy_version 44232 (0.0008) +[2023-10-14 06:53:50,728][100917] Updated weights for policy 1, policy_version 44242 (0.0007) +[2023-10-14 06:53:51,099][100917] Updated weights for policy 1, policy_version 44252 (0.0007) +[2023-10-14 06:53:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 90603520. Throughput: 0: 1662.8, 1: 1665.9. Samples: 22665276. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:53:53,513][99942] Avg episode reward: [(0, '1.010'), (1, '1.000')] +[2023-10-14 06:53:53,542][100936] Updated weights for policy 0, policy_version 44230 (0.0008) +[2023-10-14 06:53:53,920][100936] Updated weights for policy 0, policy_version 44240 (0.0009) +[2023-10-14 06:53:54,291][100936] Updated weights for policy 0, policy_version 44250 (0.0009) +[2023-10-14 06:53:55,302][100917] Updated weights for policy 1, policy_version 44262 (0.0007) +[2023-10-14 06:53:55,689][100917] Updated weights for policy 1, policy_version 44272 (0.0008) +[2023-10-14 06:53:56,066][100917] Updated weights for policy 1, policy_version 44282 (0.0007) +[2023-10-14 06:53:58,383][100936] Updated weights for policy 0, policy_version 44260 (0.0008) +[2023-10-14 06:53:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 90669056. Throughput: 0: 1665.9, 1: 1648.0. Samples: 22674578. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:53:58,512][99942] Avg episode reward: [(0, '1.010'), (1, '1.000')] +[2023-10-14 06:53:58,747][100936] Updated weights for policy 0, policy_version 44270 (0.0009) +[2023-10-14 06:53:59,109][100936] Updated weights for policy 0, policy_version 44280 (0.0008) +[2023-10-14 06:54:00,087][100917] Updated weights for policy 1, policy_version 44292 (0.0007) +[2023-10-14 06:54:00,457][100917] Updated weights for policy 1, policy_version 44302 (0.0007) +[2023-10-14 06:54:00,837][100917] Updated weights for policy 1, policy_version 44312 (0.0007) +[2023-10-14 06:54:03,316][100936] Updated weights for policy 0, policy_version 44290 (0.0009) +[2023-10-14 06:54:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 90734592. Throughput: 0: 1665.2, 1: 1666.7. Samples: 22694656. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:54:03,513][99942] Avg episode reward: [(0, '1.010'), (1, '1.000')] +[2023-10-14 06:54:03,685][100936] Updated weights for policy 0, policy_version 44300 (0.0007) +[2023-10-14 06:54:04,050][100936] Updated weights for policy 0, policy_version 44310 (0.0008) +[2023-10-14 06:54:04,416][100936] Updated weights for policy 0, policy_version 44320 (0.0008) +[2023-10-14 06:54:04,720][100917] Updated weights for policy 1, policy_version 44322 (0.0009) +[2023-10-14 06:54:05,092][100917] Updated weights for policy 1, policy_version 44332 (0.0009) +[2023-10-14 06:54:05,461][100917] Updated weights for policy 1, policy_version 44342 (0.0009) +[2023-10-14 06:54:05,837][100917] Updated weights for policy 1, policy_version 44352 (0.0010) +[2023-10-14 06:54:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 90800128. Throughput: 0: 1654.7, 1: 1671.3. Samples: 22714996. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:54:08,513][99942] Avg episode reward: [(0, '1.010'), (1, '1.000')] +[2023-10-14 06:54:08,563][100936] Updated weights for policy 0, policy_version 44330 (0.0008) +[2023-10-14 06:54:08,936][100936] Updated weights for policy 0, policy_version 44340 (0.0007) +[2023-10-14 06:54:09,305][100936] Updated weights for policy 0, policy_version 44350 (0.0008) +[2023-10-14 06:54:09,985][100917] Updated weights for policy 1, policy_version 44362 (0.0011) +[2023-10-14 06:54:10,351][100917] Updated weights for policy 1, policy_version 44372 (0.0009) +[2023-10-14 06:54:10,719][100917] Updated weights for policy 1, policy_version 44382 (0.0007) +[2023-10-14 06:54:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 90865664. Throughput: 0: 1659.7, 1: 1650.0. Samples: 22724134. Policy #0 lag: (min: 31.0, avg: 31.5, max: 47.0) +[2023-10-14 06:54:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:54:13,672][100936] Updated weights for policy 0, policy_version 44360 (0.0010) +[2023-10-14 06:54:14,038][100936] Updated weights for policy 0, policy_version 44370 (0.0008) +[2023-10-14 06:54:14,410][100936] Updated weights for policy 0, policy_version 44380 (0.0008) +[2023-10-14 06:54:14,833][100917] Updated weights for policy 1, policy_version 44392 (0.0007) +[2023-10-14 06:54:15,207][100917] Updated weights for policy 1, policy_version 44402 (0.0010) +[2023-10-14 06:54:15,582][100917] Updated weights for policy 1, policy_version 44412 (0.0009) +[2023-10-14 06:54:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 90931200. Throughput: 0: 1660.0, 1: 1670.3. Samples: 22744708. Policy #0 lag: (min: 31.0, avg: 31.5, max: 47.0) +[2023-10-14 06:54:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:54:18,597][100936] Updated weights for policy 0, policy_version 44390 (0.0008) +[2023-10-14 06:54:18,963][100936] Updated weights for policy 0, policy_version 44400 (0.0009) +[2023-10-14 06:54:19,326][100936] Updated weights for policy 0, policy_version 44410 (0.0009) +[2023-10-14 06:54:19,535][100917] Updated weights for policy 1, policy_version 44422 (0.0007) +[2023-10-14 06:54:19,904][100917] Updated weights for policy 1, policy_version 44432 (0.0008) +[2023-10-14 06:54:20,278][100917] Updated weights for policy 1, policy_version 44442 (0.0010) +[2023-10-14 06:54:23,434][100936] Updated weights for policy 0, policy_version 44420 (0.0008) +[2023-10-14 06:54:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 90996736. Throughput: 0: 1653.6, 1: 1670.5. Samples: 22765072. Policy #0 lag: (min: 31.0, avg: 31.5, max: 47.0) +[2023-10-14 06:54:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:54:23,808][100936] Updated weights for policy 0, policy_version 44430 (0.0009) +[2023-10-14 06:54:24,179][100936] Updated weights for policy 0, policy_version 44440 (0.0009) +[2023-10-14 06:54:24,494][100917] Updated weights for policy 1, policy_version 44452 (0.0009) +[2023-10-14 06:54:24,855][100917] Updated weights for policy 1, policy_version 44462 (0.0007) +[2023-10-14 06:54:25,228][100917] Updated weights for policy 1, policy_version 44472 (0.0009) +[2023-10-14 06:54:28,330][100936] Updated weights for policy 0, policy_version 44450 (0.0008) +[2023-10-14 06:54:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 91062272. Throughput: 0: 1657.8, 1: 1658.7. Samples: 22774178. Policy #0 lag: (min: 31.0, avg: 31.5, max: 47.0) +[2023-10-14 06:54:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:54:28,704][100936] Updated weights for policy 0, policy_version 44460 (0.0010) +[2023-10-14 06:54:29,082][100936] Updated weights for policy 0, policy_version 44470 (0.0009) +[2023-10-14 06:54:29,462][100936] Updated weights for policy 0, policy_version 44480 (0.0010) +[2023-10-14 06:54:29,489][100917] Updated weights for policy 1, policy_version 44482 (0.0009) +[2023-10-14 06:54:29,858][100917] Updated weights for policy 1, policy_version 44492 (0.0008) +[2023-10-14 06:54:30,236][100917] Updated weights for policy 1, policy_version 44502 (0.0007) +[2023-10-14 06:54:30,611][100917] Updated weights for policy 1, policy_version 44512 (0.0010) +[2023-10-14 06:54:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 91127808. Throughput: 0: 1645.8, 1: 1674.9. Samples: 22794574. Policy #0 lag: (min: 31.0, avg: 31.5, max: 47.0) +[2023-10-14 06:54:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:54:33,658][100936] Updated weights for policy 0, policy_version 44490 (0.0009) +[2023-10-14 06:54:34,026][100936] Updated weights for policy 0, policy_version 44500 (0.0008) +[2023-10-14 06:54:34,397][100936] Updated weights for policy 0, policy_version 44510 (0.0008) +[2023-10-14 06:54:34,698][100917] Updated weights for policy 1, policy_version 44522 (0.0007) +[2023-10-14 06:54:35,076][100917] Updated weights for policy 1, policy_version 44532 (0.0007) +[2023-10-14 06:54:35,441][100917] Updated weights for policy 1, policy_version 44542 (0.0007) +[2023-10-14 06:54:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 91193344. Throughput: 0: 1642.9, 1: 1675.7. Samples: 22814614. Policy #0 lag: (min: 31.0, avg: 31.5, max: 47.0) +[2023-10-14 06:54:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:54:38,725][100936] Updated weights for policy 0, policy_version 44520 (0.0008) +[2023-10-14 06:54:39,094][100936] Updated weights for policy 0, policy_version 44530 (0.0010) +[2023-10-14 06:54:39,457][100936] Updated weights for policy 0, policy_version 44540 (0.0009) +[2023-10-14 06:54:39,570][100917] Updated weights for policy 1, policy_version 44552 (0.0008) +[2023-10-14 06:54:39,950][100917] Updated weights for policy 1, policy_version 44562 (0.0011) +[2023-10-14 06:54:40,321][100917] Updated weights for policy 1, policy_version 44572 (0.0008) +[2023-10-14 06:54:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 91258880. Throughput: 0: 1638.9, 1: 1671.8. Samples: 22823558. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:54:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:54:43,650][100936] Updated weights for policy 0, policy_version 44550 (0.0008) +[2023-10-14 06:54:44,022][100936] Updated weights for policy 0, policy_version 44560 (0.0009) +[2023-10-14 06:54:44,385][100936] Updated weights for policy 0, policy_version 44570 (0.0008) +[2023-10-14 06:54:44,400][100917] Updated weights for policy 1, policy_version 44582 (0.0010) +[2023-10-14 06:54:44,777][100917] Updated weights for policy 1, policy_version 44592 (0.0009) +[2023-10-14 06:54:45,158][100917] Updated weights for policy 1, policy_version 44602 (0.0008) +[2023-10-14 06:54:48,504][100936] Updated weights for policy 0, policy_version 44580 (0.0009) +[2023-10-14 06:54:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 91324416. Throughput: 0: 1632.7, 1: 1679.9. Samples: 22843724. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:54:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:54:48,882][100936] Updated weights for policy 0, policy_version 44590 (0.0008) +[2023-10-14 06:54:49,247][100936] Updated weights for policy 0, policy_version 44600 (0.0009) +[2023-10-14 06:54:49,285][100917] Updated weights for policy 1, policy_version 44612 (0.0008) +[2023-10-14 06:54:49,690][100917] Updated weights for policy 1, policy_version 44622 (0.0008) +[2023-10-14 06:54:50,059][100917] Updated weights for policy 1, policy_version 44632 (0.0009) +[2023-10-14 06:54:53,436][100936] Updated weights for policy 0, policy_version 44610 (0.0007) +[2023-10-14 06:54:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 91389952. Throughput: 0: 1637.5, 1: 1669.5. Samples: 22863812. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:54:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:54:53,810][100936] Updated weights for policy 0, policy_version 44620 (0.0008) +[2023-10-14 06:54:54,188][100917] Updated weights for policy 1, policy_version 44642 (0.0008) +[2023-10-14 06:54:54,194][100936] Updated weights for policy 0, policy_version 44630 (0.0008) +[2023-10-14 06:54:54,552][100917] Updated weights for policy 1, policy_version 44652 (0.0007) +[2023-10-14 06:54:54,555][100936] Updated weights for policy 0, policy_version 44640 (0.0007) +[2023-10-14 06:54:54,925][100917] Updated weights for policy 1, policy_version 44662 (0.0011) +[2023-10-14 06:54:55,297][100917] Updated weights for policy 1, policy_version 44672 (0.0010) +[2023-10-14 06:54:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 91455488. Throughput: 0: 1632.8, 1: 1670.7. Samples: 22872794. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:54:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:54:58,814][100936] Updated weights for policy 0, policy_version 44650 (0.0009) +[2023-10-14 06:54:59,180][100936] Updated weights for policy 0, policy_version 44660 (0.0008) +[2023-10-14 06:54:59,396][100917] Updated weights for policy 1, policy_version 44682 (0.0007) +[2023-10-14 06:54:59,555][100936] Updated weights for policy 0, policy_version 44670 (0.0009) +[2023-10-14 06:54:59,771][100917] Updated weights for policy 1, policy_version 44692 (0.0009) +[2023-10-14 06:55:00,143][100917] Updated weights for policy 1, policy_version 44702 (0.0011) +[2023-10-14 06:55:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 91521024. Throughput: 0: 1627.6, 1: 1666.4. Samples: 22892938. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:55:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:55:03,667][100936] Updated weights for policy 0, policy_version 44680 (0.0010) +[2023-10-14 06:55:04,038][100936] Updated weights for policy 0, policy_version 44690 (0.0009) +[2023-10-14 06:55:04,215][100917] Updated weights for policy 1, policy_version 44712 (0.0007) +[2023-10-14 06:55:04,408][100936] Updated weights for policy 0, policy_version 44700 (0.0009) +[2023-10-14 06:55:04,587][100917] Updated weights for policy 1, policy_version 44722 (0.0008) +[2023-10-14 06:55:04,951][100917] Updated weights for policy 1, policy_version 44732 (0.0010) +[2023-10-14 06:55:08,503][100936] Updated weights for policy 0, policy_version 44710 (0.0010) +[2023-10-14 06:55:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 91586560. Throughput: 0: 1629.5, 1: 1663.2. Samples: 22913244. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:55:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 06:55:08,872][100936] Updated weights for policy 0, policy_version 44720 (0.0008) +[2023-10-14 06:55:09,079][100917] Updated weights for policy 1, policy_version 44742 (0.0008) +[2023-10-14 06:55:09,239][100936] Updated weights for policy 0, policy_version 44730 (0.0008) +[2023-10-14 06:55:09,451][100917] Updated weights for policy 1, policy_version 44752 (0.0009) +[2023-10-14 06:55:09,819][100917] Updated weights for policy 1, policy_version 44762 (0.0008) +[2023-10-14 06:55:13,483][100936] Updated weights for policy 0, policy_version 44740 (0.0008) +[2023-10-14 06:55:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 91652096. Throughput: 0: 1627.0, 1: 1662.5. Samples: 22922206. Policy #0 lag: (min: 28.0, avg: 28.4, max: 42.0) +[2023-10-14 06:55:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:55:13,846][100936] Updated weights for policy 0, policy_version 44750 (0.0009) +[2023-10-14 06:55:14,034][100917] Updated weights for policy 1, policy_version 44772 (0.0008) +[2023-10-14 06:55:14,212][100936] Updated weights for policy 0, policy_version 44760 (0.0008) +[2023-10-14 06:55:14,409][100917] Updated weights for policy 1, policy_version 44782 (0.0008) +[2023-10-14 06:55:14,780][100917] Updated weights for policy 1, policy_version 44792 (0.0007) +[2023-10-14 06:55:18,285][100936] Updated weights for policy 0, policy_version 44770 (0.0007) +[2023-10-14 06:55:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 91717632. Throughput: 0: 1631.8, 1: 1657.5. Samples: 22942594. Policy #0 lag: (min: 28.0, avg: 28.4, max: 42.0) +[2023-10-14 06:55:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:55:18,664][100936] Updated weights for policy 0, policy_version 44780 (0.0007) +[2023-10-14 06:55:18,845][100917] Updated weights for policy 1, policy_version 44802 (0.0008) +[2023-10-14 06:55:19,029][100936] Updated weights for policy 0, policy_version 44790 (0.0008) +[2023-10-14 06:55:19,209][100917] Updated weights for policy 1, policy_version 44812 (0.0008) +[2023-10-14 06:55:19,389][100936] Updated weights for policy 0, policy_version 44800 (0.0008) +[2023-10-14 06:55:19,585][100917] Updated weights for policy 1, policy_version 44822 (0.0009) +[2023-10-14 06:55:19,962][100917] Updated weights for policy 1, policy_version 44832 (0.0008) +[2023-10-14 06:55:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 91783168. Throughput: 0: 1636.1, 1: 1661.9. Samples: 22963026. Policy #0 lag: (min: 28.0, avg: 28.4, max: 42.0) +[2023-10-14 06:55:23,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:55:23,797][100936] Updated weights for policy 0, policy_version 44810 (0.0009) +[2023-10-14 06:55:24,078][100917] Updated weights for policy 1, policy_version 44842 (0.0009) +[2023-10-14 06:55:24,169][100936] Updated weights for policy 0, policy_version 44820 (0.0007) +[2023-10-14 06:55:24,440][100917] Updated weights for policy 1, policy_version 44852 (0.0009) +[2023-10-14 06:55:24,544][100936] Updated weights for policy 0, policy_version 44830 (0.0007) +[2023-10-14 06:55:24,816][100917] Updated weights for policy 1, policy_version 44862 (0.0007) +[2023-10-14 06:55:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 91848704. Throughput: 0: 1633.6, 1: 1661.7. Samples: 22971848. Policy #0 lag: (min: 28.0, avg: 28.4, max: 42.0) +[2023-10-14 06:55:28,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:55:28,700][100936] Updated weights for policy 0, policy_version 44840 (0.0008) +[2023-10-14 06:55:28,945][100917] Updated weights for policy 1, policy_version 44872 (0.0009) +[2023-10-14 06:55:29,078][100936] Updated weights for policy 0, policy_version 44850 (0.0008) +[2023-10-14 06:55:29,316][100917] Updated weights for policy 1, policy_version 44882 (0.0009) +[2023-10-14 06:55:29,440][100936] Updated weights for policy 0, policy_version 44860 (0.0008) +[2023-10-14 06:55:29,691][100917] Updated weights for policy 1, policy_version 44892 (0.0010) +[2023-10-14 06:55:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 91914240. Throughput: 0: 1635.7, 1: 1658.7. Samples: 22991970. Policy #0 lag: (min: 28.0, avg: 28.4, max: 42.0) +[2023-10-14 06:55:33,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:55:33,739][100917] Updated weights for policy 1, policy_version 44902 (0.0009) +[2023-10-14 06:55:33,743][100936] Updated weights for policy 0, policy_version 44870 (0.0008) +[2023-10-14 06:55:34,097][100936] Updated weights for policy 0, policy_version 44880 (0.0009) +[2023-10-14 06:55:34,106][100917] Updated weights for policy 1, policy_version 44912 (0.0008) +[2023-10-14 06:55:34,472][100936] Updated weights for policy 0, policy_version 44890 (0.0008) +[2023-10-14 06:55:34,475][100917] Updated weights for policy 1, policy_version 44922 (0.0008) +[2023-10-14 06:55:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 91979776. Throughput: 0: 1631.6, 1: 1659.7. Samples: 23011922. Policy #0 lag: (min: 28.0, avg: 28.4, max: 42.0) +[2023-10-14 06:55:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:55:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000044896_45973504.pth... +[2023-10-14 06:55:38,557][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000043360_44400640.pth +[2023-10-14 06:55:38,714][100917] Updated weights for policy 1, policy_version 44932 (0.0009) +[2023-10-14 06:55:38,920][100936] Updated weights for policy 0, policy_version 44900 (0.0007) +[2023-10-14 06:55:39,108][100917] Updated weights for policy 1, policy_version 44942 (0.0008) +[2023-10-14 06:55:39,278][100936] Updated weights for policy 0, policy_version 44910 (0.0009) +[2023-10-14 06:55:39,473][100917] Updated weights for policy 1, policy_version 44952 (0.0009) +[2023-10-14 06:55:39,647][100936] Updated weights for policy 0, policy_version 44920 (0.0007) +[2023-10-14 06:55:39,766][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000044960_46039040.pth... +[2023-10-14 06:55:39,796][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000043392_44433408.pth +[2023-10-14 06:55:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 92045312. Throughput: 0: 1629.1, 1: 1657.8. Samples: 23020702. Policy #0 lag: (min: 31.0, avg: 31.8, max: 51.0) +[2023-10-14 06:55:43,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:55:43,517][100917] Updated weights for policy 1, policy_version 44962 (0.0008) +[2023-10-14 06:55:43,900][100917] Updated weights for policy 1, policy_version 44972 (0.0007) +[2023-10-14 06:55:43,941][100936] Updated weights for policy 0, policy_version 44930 (0.0007) +[2023-10-14 06:55:44,272][100917] Updated weights for policy 1, policy_version 44982 (0.0008) +[2023-10-14 06:55:44,337][100936] Updated weights for policy 0, policy_version 44940 (0.0007) +[2023-10-14 06:55:44,639][100917] Updated weights for policy 1, policy_version 44992 (0.0008) +[2023-10-14 06:55:44,704][100936] Updated weights for policy 0, policy_version 44950 (0.0009) +[2023-10-14 06:55:45,073][100936] Updated weights for policy 0, policy_version 44960 (0.0008) +[2023-10-14 06:55:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 92110848. Throughput: 0: 1631.4, 1: 1653.6. Samples: 23040760. Policy #0 lag: (min: 31.0, avg: 31.8, max: 51.0) +[2023-10-14 06:55:48,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:55:48,901][100917] Updated weights for policy 1, policy_version 45002 (0.0007) +[2023-10-14 06:55:49,272][100936] Updated weights for policy 0, policy_version 44970 (0.0009) +[2023-10-14 06:55:49,283][100917] Updated weights for policy 1, policy_version 45012 (0.0010) +[2023-10-14 06:55:49,643][100936] Updated weights for policy 0, policy_version 44980 (0.0007) +[2023-10-14 06:55:49,648][100917] Updated weights for policy 1, policy_version 45022 (0.0008) +[2023-10-14 06:55:50,004][100936] Updated weights for policy 0, policy_version 44990 (0.0009) +[2023-10-14 06:55:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 92176384. Throughput: 0: 1636.6, 1: 1648.4. Samples: 23061072. Policy #0 lag: (min: 31.0, avg: 31.8, max: 51.0) +[2023-10-14 06:55:53,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:55:53,897][100917] Updated weights for policy 1, policy_version 45032 (0.0009) +[2023-10-14 06:55:53,922][100936] Updated weights for policy 0, policy_version 45000 (0.0009) +[2023-10-14 06:55:54,269][100917] Updated weights for policy 1, policy_version 45042 (0.0009) +[2023-10-14 06:55:54,288][100936] Updated weights for policy 0, policy_version 45010 (0.0008) +[2023-10-14 06:55:54,643][100917] Updated weights for policy 1, policy_version 45052 (0.0010) +[2023-10-14 06:55:54,655][100936] Updated weights for policy 0, policy_version 45020 (0.0007) +[2023-10-14 06:55:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 92241920. Throughput: 0: 1635.6, 1: 1648.3. Samples: 23069980. Policy #0 lag: (min: 31.0, avg: 31.8, max: 51.0) +[2023-10-14 06:55:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:55:58,803][100936] Updated weights for policy 0, policy_version 45030 (0.0008) +[2023-10-14 06:55:58,829][100917] Updated weights for policy 1, policy_version 45062 (0.0008) +[2023-10-14 06:55:59,174][100936] Updated weights for policy 0, policy_version 45040 (0.0009) +[2023-10-14 06:55:59,193][100917] Updated weights for policy 1, policy_version 45072 (0.0007) +[2023-10-14 06:55:59,538][100936] Updated weights for policy 0, policy_version 45050 (0.0009) +[2023-10-14 06:55:59,561][100917] Updated weights for policy 1, policy_version 45082 (0.0007) +[2023-10-14 06:56:03,471][100917] Updated weights for policy 1, policy_version 45092 (0.0009) +[2023-10-14 06:56:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 92307456. Throughput: 0: 1632.5, 1: 1649.9. Samples: 23090302. Policy #0 lag: (min: 31.0, avg: 31.8, max: 51.0) +[2023-10-14 06:56:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:56:03,668][100936] Updated weights for policy 0, policy_version 45060 (0.0007) +[2023-10-14 06:56:03,852][100917] Updated weights for policy 1, policy_version 45102 (0.0007) +[2023-10-14 06:56:04,030][100936] Updated weights for policy 0, policy_version 45070 (0.0007) +[2023-10-14 06:56:04,223][100917] Updated weights for policy 1, policy_version 45112 (0.0007) +[2023-10-14 06:56:04,402][100936] Updated weights for policy 0, policy_version 45080 (0.0009) +[2023-10-14 06:56:08,329][100936] Updated weights for policy 0, policy_version 45090 (0.0007) +[2023-10-14 06:56:08,385][100917] Updated weights for policy 1, policy_version 45122 (0.0009) +[2023-10-14 06:56:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 92372992. Throughput: 0: 1636.1, 1: 1646.1. Samples: 23110726. Policy #0 lag: (min: 31.0, avg: 31.8, max: 51.0) +[2023-10-14 06:56:08,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:56:08,690][100936] Updated weights for policy 0, policy_version 45100 (0.0010) +[2023-10-14 06:56:08,749][100917] Updated weights for policy 1, policy_version 45132 (0.0009) +[2023-10-14 06:56:09,072][100936] Updated weights for policy 0, policy_version 45110 (0.0009) +[2023-10-14 06:56:09,120][100917] Updated weights for policy 1, policy_version 45142 (0.0010) +[2023-10-14 06:56:09,431][100936] Updated weights for policy 0, policy_version 45120 (0.0008) +[2023-10-14 06:56:09,485][100917] Updated weights for policy 1, policy_version 45152 (0.0007) +[2023-10-14 06:56:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 92438528. Throughput: 0: 1640.0, 1: 1648.9. Samples: 23119852. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) +[2023-10-14 06:56:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:56:13,590][100917] Updated weights for policy 1, policy_version 45162 (0.0008) +[2023-10-14 06:56:13,841][100936] Updated weights for policy 0, policy_version 45130 (0.0009) +[2023-10-14 06:56:13,959][100917] Updated weights for policy 1, policy_version 45172 (0.0008) +[2023-10-14 06:56:14,210][100936] Updated weights for policy 0, policy_version 45140 (0.0009) +[2023-10-14 06:56:14,319][100917] Updated weights for policy 1, policy_version 45182 (0.0007) +[2023-10-14 06:56:14,577][100936] Updated weights for policy 0, policy_version 45150 (0.0009) +[2023-10-14 06:56:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 92504064. Throughput: 0: 1635.3, 1: 1653.6. Samples: 23139970. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) +[2023-10-14 06:56:18,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:56:18,664][100917] Updated weights for policy 1, policy_version 45192 (0.0007) +[2023-10-14 06:56:18,804][100936] Updated weights for policy 0, policy_version 45160 (0.0008) +[2023-10-14 06:56:19,022][100917] Updated weights for policy 1, policy_version 45202 (0.0007) +[2023-10-14 06:56:19,175][100936] Updated weights for policy 0, policy_version 45170 (0.0007) +[2023-10-14 06:56:19,390][100917] Updated weights for policy 1, policy_version 45212 (0.0008) +[2023-10-14 06:56:19,545][100936] Updated weights for policy 0, policy_version 45180 (0.0008) +[2023-10-14 06:56:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.1, 300 sec: 13107.2). Total num frames: 92569600. Throughput: 0: 1643.0, 1: 1652.1. Samples: 23160200. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) +[2023-10-14 06:56:23,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:56:23,638][100917] Updated weights for policy 1, policy_version 45222 (0.0009) +[2023-10-14 06:56:23,648][100936] Updated weights for policy 0, policy_version 45190 (0.0007) +[2023-10-14 06:56:24,009][100936] Updated weights for policy 0, policy_version 45200 (0.0007) +[2023-10-14 06:56:24,033][100917] Updated weights for policy 1, policy_version 45232 (0.0009) +[2023-10-14 06:56:24,374][100936] Updated weights for policy 0, policy_version 45210 (0.0008) +[2023-10-14 06:56:24,415][100917] Updated weights for policy 1, policy_version 45242 (0.0009) +[2023-10-14 06:56:28,479][100917] Updated weights for policy 1, policy_version 45252 (0.0009) +[2023-10-14 06:56:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 92635136. Throughput: 0: 1646.0, 1: 1648.2. Samples: 23168938. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) +[2023-10-14 06:56:28,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:56:28,636][100936] Updated weights for policy 0, policy_version 45220 (0.0008) +[2023-10-14 06:56:28,854][100917] Updated weights for policy 1, policy_version 45262 (0.0007) +[2023-10-14 06:56:29,032][100936] Updated weights for policy 0, policy_version 45230 (0.0008) +[2023-10-14 06:56:29,222][100917] Updated weights for policy 1, policy_version 45272 (0.0008) +[2023-10-14 06:56:29,390][100936] Updated weights for policy 0, policy_version 45240 (0.0008) +[2023-10-14 06:56:33,396][100936] Updated weights for policy 0, policy_version 45250 (0.0009) +[2023-10-14 06:56:33,453][100917] Updated weights for policy 1, policy_version 45282 (0.0007) +[2023-10-14 06:56:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 92700672. Throughput: 0: 1644.8, 1: 1654.0. Samples: 23189206. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) +[2023-10-14 06:56:33,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:56:33,770][100936] Updated weights for policy 0, policy_version 45260 (0.0008) +[2023-10-14 06:56:33,816][100917] Updated weights for policy 1, policy_version 45292 (0.0008) +[2023-10-14 06:56:34,124][100936] Updated weights for policy 0, policy_version 45270 (0.0009) +[2023-10-14 06:56:34,191][100917] Updated weights for policy 1, policy_version 45302 (0.0008) +[2023-10-14 06:56:34,486][100936] Updated weights for policy 0, policy_version 45280 (0.0008) +[2023-10-14 06:56:34,567][100917] Updated weights for policy 1, policy_version 45312 (0.0008) +[2023-10-14 06:56:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 92766208. Throughput: 0: 1643.7, 1: 1660.5. Samples: 23209762. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) +[2023-10-14 06:56:38,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:56:38,638][100936] Updated weights for policy 0, policy_version 45290 (0.0008) +[2023-10-14 06:56:38,644][100917] Updated weights for policy 1, policy_version 45322 (0.0010) +[2023-10-14 06:56:39,008][100936] Updated weights for policy 0, policy_version 45300 (0.0007) +[2023-10-14 06:56:39,019][100917] Updated weights for policy 1, policy_version 45332 (0.0008) +[2023-10-14 06:56:39,374][100936] Updated weights for policy 0, policy_version 45310 (0.0007) +[2023-10-14 06:56:39,381][100917] Updated weights for policy 1, policy_version 45342 (0.0009) +[2023-10-14 06:56:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 92831744. Throughput: 0: 1648.0, 1: 1661.4. Samples: 23218902. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 06:56:43,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:56:43,556][100917] Updated weights for policy 1, policy_version 45352 (0.0008) +[2023-10-14 06:56:43,610][100936] Updated weights for policy 0, policy_version 45320 (0.0008) +[2023-10-14 06:56:43,934][100917] Updated weights for policy 1, policy_version 45362 (0.0008) +[2023-10-14 06:56:43,975][100936] Updated weights for policy 0, policy_version 45330 (0.0007) +[2023-10-14 06:56:44,302][100917] Updated weights for policy 1, policy_version 45372 (0.0008) +[2023-10-14 06:56:44,347][100936] Updated weights for policy 0, policy_version 45340 (0.0009) +[2023-10-14 06:56:48,300][100917] Updated weights for policy 1, policy_version 45382 (0.0009) +[2023-10-14 06:56:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 92897280. Throughput: 0: 1649.2, 1: 1659.7. Samples: 23239204. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 06:56:48,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 06:56:48,555][100936] Updated weights for policy 0, policy_version 45350 (0.0007) +[2023-10-14 06:56:48,662][100917] Updated weights for policy 1, policy_version 45392 (0.0008) +[2023-10-14 06:56:48,936][100936] Updated weights for policy 0, policy_version 45360 (0.0007) +[2023-10-14 06:56:49,044][100917] Updated weights for policy 1, policy_version 45402 (0.0010) +[2023-10-14 06:56:49,316][100936] Updated weights for policy 0, policy_version 45370 (0.0007) +[2023-10-14 06:56:53,114][100917] Updated weights for policy 1, policy_version 45412 (0.0011) +[2023-10-14 06:56:53,333][100936] Updated weights for policy 0, policy_version 45380 (0.0008) +[2023-10-14 06:56:53,483][100917] Updated weights for policy 1, policy_version 45422 (0.0008) +[2023-10-14 06:56:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 92962816. Throughput: 0: 1646.0, 1: 1656.6. Samples: 23259346. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 06:56:53,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:56:53,703][100936] Updated weights for policy 0, policy_version 45390 (0.0008) +[2023-10-14 06:56:53,861][100917] Updated weights for policy 1, policy_version 45432 (0.0010) +[2023-10-14 06:56:54,075][100936] Updated weights for policy 0, policy_version 45400 (0.0009) +[2023-10-14 06:56:57,881][100917] Updated weights for policy 1, policy_version 45442 (0.0008) +[2023-10-14 06:56:58,135][100936] Updated weights for policy 0, policy_version 45410 (0.0009) +[2023-10-14 06:56:58,258][100917] Updated weights for policy 1, policy_version 45452 (0.0010) +[2023-10-14 06:56:58,510][100936] Updated weights for policy 0, policy_version 45420 (0.0007) +[2023-10-14 06:56:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 93028352. Throughput: 0: 1646.3, 1: 1654.8. Samples: 23268398. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 06:56:58,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:56:58,629][100917] Updated weights for policy 1, policy_version 45462 (0.0009) +[2023-10-14 06:56:58,872][100936] Updated weights for policy 0, policy_version 45430 (0.0007) +[2023-10-14 06:56:58,998][100917] Updated weights for policy 1, policy_version 45472 (0.0010) +[2023-10-14 06:56:59,241][100936] Updated weights for policy 0, policy_version 45440 (0.0007) +[2023-10-14 06:57:03,146][100917] Updated weights for policy 1, policy_version 45482 (0.0009) +[2023-10-14 06:57:03,330][100936] Updated weights for policy 0, policy_version 45450 (0.0008) +[2023-10-14 06:57:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 93093888. Throughput: 0: 1656.6, 1: 1654.9. Samples: 23288986. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 06:57:03,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:57:03,520][100917] Updated weights for policy 1, policy_version 45492 (0.0009) +[2023-10-14 06:57:03,692][100936] Updated weights for policy 0, policy_version 45460 (0.0007) +[2023-10-14 06:57:03,890][100917] Updated weights for policy 1, policy_version 45502 (0.0008) +[2023-10-14 06:57:04,057][100936] Updated weights for policy 0, policy_version 45470 (0.0008) +[2023-10-14 06:57:08,213][100917] Updated weights for policy 1, policy_version 45512 (0.0009) +[2023-10-14 06:57:08,388][100936] Updated weights for policy 0, policy_version 45480 (0.0008) +[2023-10-14 06:57:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 93159424. Throughput: 0: 1649.0, 1: 1655.0. Samples: 23308880. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 06:57:08,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:57:08,572][100917] Updated weights for policy 1, policy_version 45522 (0.0007) +[2023-10-14 06:57:08,749][100936] Updated weights for policy 0, policy_version 45490 (0.0009) +[2023-10-14 06:57:08,941][100917] Updated weights for policy 1, policy_version 45532 (0.0007) +[2023-10-14 06:57:09,122][100936] Updated weights for policy 0, policy_version 45500 (0.0008) +[2023-10-14 06:57:13,009][100917] Updated weights for policy 1, policy_version 45542 (0.0009) +[2023-10-14 06:57:13,348][100936] Updated weights for policy 0, policy_version 45510 (0.0007) +[2023-10-14 06:57:13,386][100917] Updated weights for policy 1, policy_version 45552 (0.0009) +[2023-10-14 06:57:13,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 93224960. Throughput: 0: 1654.0, 1: 1661.3. Samples: 23318128. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:57:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:57:13,726][100936] Updated weights for policy 0, policy_version 45520 (0.0007) +[2023-10-14 06:57:13,756][100917] Updated weights for policy 1, policy_version 45562 (0.0009) +[2023-10-14 06:57:14,100][100936] Updated weights for policy 0, policy_version 45530 (0.0009) +[2023-10-14 06:57:18,002][100917] Updated weights for policy 1, policy_version 45572 (0.0008) +[2023-10-14 06:57:18,267][100936] Updated weights for policy 0, policy_version 45540 (0.0008) +[2023-10-14 06:57:18,384][100917] Updated weights for policy 1, policy_version 45582 (0.0010) +[2023-10-14 06:57:18,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 93290496. Throughput: 0: 1653.8, 1: 1656.6. Samples: 23338172. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:57:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:57:18,632][100936] Updated weights for policy 0, policy_version 45550 (0.0010) +[2023-10-14 06:57:18,744][100917] Updated weights for policy 1, policy_version 45592 (0.0008) +[2023-10-14 06:57:19,010][100936] Updated weights for policy 0, policy_version 45560 (0.0007) +[2023-10-14 06:57:22,905][100917] Updated weights for policy 1, policy_version 45602 (0.0008) +[2023-10-14 06:57:23,211][100936] Updated weights for policy 0, policy_version 45570 (0.0008) +[2023-10-14 06:57:23,271][100917] Updated weights for policy 1, policy_version 45612 (0.0007) +[2023-10-14 06:57:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 93356032. Throughput: 0: 1646.0, 1: 1648.0. Samples: 23357992. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:57:23,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 06:57:23,583][100936] Updated weights for policy 0, policy_version 45580 (0.0008) +[2023-10-14 06:57:23,639][100917] Updated weights for policy 1, policy_version 45622 (0.0008) +[2023-10-14 06:57:23,958][100936] Updated weights for policy 0, policy_version 45590 (0.0007) +[2023-10-14 06:57:24,011][100917] Updated weights for policy 1, policy_version 45632 (0.0007) +[2023-10-14 06:57:24,325][100936] Updated weights for policy 0, policy_version 45600 (0.0010) +[2023-10-14 06:57:28,194][100917] Updated weights for policy 1, policy_version 45642 (0.0007) +[2023-10-14 06:57:28,391][100936] Updated weights for policy 0, policy_version 45610 (0.0008) +[2023-10-14 06:57:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 93421568. Throughput: 0: 1647.0, 1: 1649.8. Samples: 23367260. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:57:28,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.940')] +[2023-10-14 06:57:28,567][100917] Updated weights for policy 1, policy_version 45652 (0.0007) +[2023-10-14 06:57:28,762][100936] Updated weights for policy 0, policy_version 45620 (0.0007) +[2023-10-14 06:57:28,943][100917] Updated weights for policy 1, policy_version 45662 (0.0009) +[2023-10-14 06:57:29,122][100936] Updated weights for policy 0, policy_version 45630 (0.0009) +[2023-10-14 06:57:33,055][100917] Updated weights for policy 1, policy_version 45672 (0.0009) +[2023-10-14 06:57:33,342][100936] Updated weights for policy 0, policy_version 45640 (0.0008) +[2023-10-14 06:57:33,433][100917] Updated weights for policy 1, policy_version 45682 (0.0007) +[2023-10-14 06:57:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 93487104. Throughput: 0: 1644.6, 1: 1654.0. Samples: 23387640. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:57:33,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.940')] +[2023-10-14 06:57:33,717][100936] Updated weights for policy 0, policy_version 45650 (0.0010) +[2023-10-14 06:57:33,794][100917] Updated weights for policy 1, policy_version 45692 (0.0007) +[2023-10-14 06:57:34,093][100936] Updated weights for policy 0, policy_version 45660 (0.0007) +[2023-10-14 06:57:37,921][100917] Updated weights for policy 1, policy_version 45702 (0.0010) +[2023-10-14 06:57:38,248][100936] Updated weights for policy 0, policy_version 45670 (0.0007) +[2023-10-14 06:57:38,297][100917] Updated weights for policy 1, policy_version 45712 (0.0008) +[2023-10-14 06:57:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 93552640. Throughput: 0: 1642.0, 1: 1645.6. Samples: 23407288. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:57:38,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.940')] +[2023-10-14 06:57:38,609][100936] Updated weights for policy 0, policy_version 45680 (0.0009) +[2023-10-14 06:57:38,662][100917] Updated weights for policy 1, policy_version 45722 (0.0007) +[2023-10-14 06:57:38,881][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000045728_46825472.pth... +[2023-10-14 06:57:38,910][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000044160_45219840.pth +[2023-10-14 06:57:38,969][100936] Updated weights for policy 0, policy_version 45690 (0.0008) +[2023-10-14 06:57:39,197][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000045696_46792704.pth... +[2023-10-14 06:57:39,231][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000044128_45187072.pth +[2023-10-14 06:57:42,547][100917] Updated weights for policy 1, policy_version 45732 (0.0010) +[2023-10-14 06:57:42,929][100917] Updated weights for policy 1, policy_version 45742 (0.0010) +[2023-10-14 06:57:43,135][100936] Updated weights for policy 0, policy_version 45700 (0.0009) +[2023-10-14 06:57:43,300][100917] Updated weights for policy 1, policy_version 45752 (0.0010) +[2023-10-14 06:57:43,503][100936] Updated weights for policy 0, policy_version 45710 (0.0007) +[2023-10-14 06:57:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 93618176. Throughput: 0: 1648.2, 1: 1652.8. Samples: 23416940. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) +[2023-10-14 06:57:43,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.940')] +[2023-10-14 06:57:43,873][100936] Updated weights for policy 0, policy_version 45720 (0.0009) +[2023-10-14 06:57:47,591][100917] Updated weights for policy 1, policy_version 45762 (0.0008) +[2023-10-14 06:57:47,935][100936] Updated weights for policy 0, policy_version 45730 (0.0009) +[2023-10-14 06:57:47,972][100917] Updated weights for policy 1, policy_version 45772 (0.0009) +[2023-10-14 06:57:48,309][100936] Updated weights for policy 0, policy_version 45740 (0.0008) +[2023-10-14 06:57:48,345][100917] Updated weights for policy 1, policy_version 45782 (0.0008) +[2023-10-14 06:57:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 93683712. Throughput: 0: 1646.6, 1: 1650.7. Samples: 23437362. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) +[2023-10-14 06:57:48,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.940')] +[2023-10-14 06:57:48,683][100936] Updated weights for policy 0, policy_version 45750 (0.0007) +[2023-10-14 06:57:48,712][100917] Updated weights for policy 1, policy_version 45792 (0.0009) +[2023-10-14 06:57:49,053][100936] Updated weights for policy 0, policy_version 45760 (0.0010) +[2023-10-14 06:57:52,763][100917] Updated weights for policy 1, policy_version 45802 (0.0009) +[2023-10-14 06:57:53,024][100936] Updated weights for policy 0, policy_version 45770 (0.0008) +[2023-10-14 06:57:53,138][100917] Updated weights for policy 1, policy_version 45812 (0.0007) +[2023-10-14 06:57:53,388][100936] Updated weights for policy 0, policy_version 45780 (0.0008) +[2023-10-14 06:57:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 93749248. Throughput: 0: 1637.9, 1: 1643.2. Samples: 23456532. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) +[2023-10-14 06:57:53,513][100917] Updated weights for policy 1, policy_version 45822 (0.0007) +[2023-10-14 06:57:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:57:53,757][100936] Updated weights for policy 0, policy_version 45790 (0.0007) +[2023-10-14 06:57:57,732][100917] Updated weights for policy 1, policy_version 45832 (0.0009) +[2023-10-14 06:57:58,108][100917] Updated weights for policy 1, policy_version 45842 (0.0009) +[2023-10-14 06:57:58,124][100936] Updated weights for policy 0, policy_version 45800 (0.0007) +[2023-10-14 06:57:58,473][100917] Updated weights for policy 1, policy_version 45852 (0.0010) +[2023-10-14 06:57:58,502][100936] Updated weights for policy 0, policy_version 45810 (0.0008) +[2023-10-14 06:57:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13107.2). Total num frames: 93814784. Throughput: 0: 1648.4, 1: 1655.0. Samples: 23466782. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) +[2023-10-14 06:57:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:57:58,877][100936] Updated weights for policy 0, policy_version 45820 (0.0010) +[2023-10-14 06:58:02,530][100917] Updated weights for policy 1, policy_version 45862 (0.0008) +[2023-10-14 06:58:02,903][100917] Updated weights for policy 1, policy_version 45872 (0.0009) +[2023-10-14 06:58:02,957][100936] Updated weights for policy 0, policy_version 45830 (0.0008) +[2023-10-14 06:58:03,284][100917] Updated weights for policy 1, policy_version 45882 (0.0007) +[2023-10-14 06:58:03,343][100936] Updated weights for policy 0, policy_version 45840 (0.0008) +[2023-10-14 06:58:03,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 93913088. Throughput: 0: 1655.2, 1: 1659.5. Samples: 23487336. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) +[2023-10-14 06:58:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:58:03,714][100936] Updated weights for policy 0, policy_version 45850 (0.0009) +[2023-10-14 06:58:07,539][100917] Updated weights for policy 1, policy_version 45892 (0.0009) +[2023-10-14 06:58:07,913][100917] Updated weights for policy 1, policy_version 45902 (0.0009) +[2023-10-14 06:58:07,918][100936] Updated weights for policy 0, policy_version 45860 (0.0009) +[2023-10-14 06:58:08,285][100936] Updated weights for policy 0, policy_version 45870 (0.0008) +[2023-10-14 06:58:08,285][100917] Updated weights for policy 1, policy_version 45912 (0.0008) +[2023-10-14 06:58:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 93945856. Throughput: 0: 1643.5, 1: 1651.3. Samples: 23506258. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) +[2023-10-14 06:58:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:58:08,651][100936] Updated weights for policy 0, policy_version 45880 (0.0007) +[2023-10-14 06:58:12,456][100917] Updated weights for policy 1, policy_version 45922 (0.0009) +[2023-10-14 06:58:12,825][100917] Updated weights for policy 1, policy_version 45932 (0.0007) +[2023-10-14 06:58:12,840][100936] Updated weights for policy 0, policy_version 45890 (0.0009) +[2023-10-14 06:58:13,186][100917] Updated weights for policy 1, policy_version 45942 (0.0008) +[2023-10-14 06:58:13,203][100936] Updated weights for policy 0, policy_version 45900 (0.0007) +[2023-10-14 06:58:13,512][99942] Fps is (10 sec: 9830.6, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 94011392. Throughput: 0: 1653.9, 1: 1660.3. Samples: 23516398. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) +[2023-10-14 06:58:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:58:13,556][100917] Updated weights for policy 1, policy_version 45952 (0.0009) +[2023-10-14 06:58:13,575][100936] Updated weights for policy 0, policy_version 45910 (0.0007) +[2023-10-14 06:58:13,943][100936] Updated weights for policy 0, policy_version 45920 (0.0010) +[2023-10-14 06:58:17,755][100917] Updated weights for policy 1, policy_version 45962 (0.0007) +[2023-10-14 06:58:18,126][100917] Updated weights for policy 1, policy_version 45972 (0.0007) +[2023-10-14 06:58:18,211][100936] Updated weights for policy 0, policy_version 45930 (0.0008) +[2023-10-14 06:58:18,489][100917] Updated weights for policy 1, policy_version 45982 (0.0009) +[2023-10-14 06:58:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 94076928. Throughput: 0: 1653.2, 1: 1653.9. Samples: 23536460. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:58:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:58:18,573][100936] Updated weights for policy 0, policy_version 45940 (0.0007) +[2023-10-14 06:58:18,948][100936] Updated weights for policy 0, policy_version 45950 (0.0007) +[2023-10-14 06:58:22,554][100917] Updated weights for policy 1, policy_version 45992 (0.0008) +[2023-10-14 06:58:22,912][100917] Updated weights for policy 1, policy_version 46002 (0.0009) +[2023-10-14 06:58:23,097][100936] Updated weights for policy 0, policy_version 45960 (0.0008) +[2023-10-14 06:58:23,281][100917] Updated weights for policy 1, policy_version 46012 (0.0007) +[2023-10-14 06:58:23,458][100936] Updated weights for policy 0, policy_version 45970 (0.0009) +[2023-10-14 06:58:23,512][99942] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 94175232. Throughput: 0: 1647.4, 1: 1648.9. Samples: 23555622. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:58:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:58:23,834][100936] Updated weights for policy 0, policy_version 45980 (0.0008) +[2023-10-14 06:58:27,520][100917] Updated weights for policy 1, policy_version 46022 (0.0009) +[2023-10-14 06:58:27,871][100936] Updated weights for policy 0, policy_version 45990 (0.0007) +[2023-10-14 06:58:27,879][100917] Updated weights for policy 1, policy_version 46032 (0.0008) +[2023-10-14 06:58:28,239][100936] Updated weights for policy 0, policy_version 46000 (0.0008) +[2023-10-14 06:58:28,252][100917] Updated weights for policy 1, policy_version 46042 (0.0010) +[2023-10-14 06:58:28,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 94240768. Throughput: 0: 1654.9, 1: 1653.6. Samples: 23565820. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:58:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:58:28,617][100936] Updated weights for policy 0, policy_version 46010 (0.0007) +[2023-10-14 06:58:32,377][100917] Updated weights for policy 1, policy_version 46052 (0.0009) +[2023-10-14 06:58:32,747][100917] Updated weights for policy 1, policy_version 46062 (0.0009) +[2023-10-14 06:58:32,750][100936] Updated weights for policy 0, policy_version 46020 (0.0007) +[2023-10-14 06:58:33,117][100936] Updated weights for policy 0, policy_version 46030 (0.0007) +[2023-10-14 06:58:33,124][100917] Updated weights for policy 1, policy_version 46072 (0.0010) +[2023-10-14 06:58:33,486][100936] Updated weights for policy 0, policy_version 46040 (0.0007) +[2023-10-14 06:58:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 94306304. Throughput: 0: 1659.0, 1: 1648.9. Samples: 23586218. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:58:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:58:37,215][100917] Updated weights for policy 1, policy_version 46082 (0.0008) +[2023-10-14 06:58:37,359][100936] Updated weights for policy 0, policy_version 46050 (0.0008) +[2023-10-14 06:58:37,586][100917] Updated weights for policy 1, policy_version 46092 (0.0007) +[2023-10-14 06:58:37,742][100936] Updated weights for policy 0, policy_version 46060 (0.0008) +[2023-10-14 06:58:37,954][100917] Updated weights for policy 1, policy_version 46102 (0.0010) +[2023-10-14 06:58:38,111][100936] Updated weights for policy 0, policy_version 46070 (0.0009) +[2023-10-14 06:58:38,316][100917] Updated weights for policy 1, policy_version 46112 (0.0008) +[2023-10-14 06:58:38,481][100936] Updated weights for policy 0, policy_version 46080 (0.0007) +[2023-10-14 06:58:38,512][99942] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13329.4). Total num frames: 94404608. Throughput: 0: 1651.6, 1: 1644.1. Samples: 23604838. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:58:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:58:42,608][100917] Updated weights for policy 1, policy_version 46122 (0.0010) +[2023-10-14 06:58:42,818][100936] Updated weights for policy 0, policy_version 46090 (0.0007) +[2023-10-14 06:58:42,991][100917] Updated weights for policy 1, policy_version 46132 (0.0009) +[2023-10-14 06:58:43,181][100936] Updated weights for policy 0, policy_version 46100 (0.0007) +[2023-10-14 06:58:43,355][100917] Updated weights for policy 1, policy_version 46142 (0.0009) +[2023-10-14 06:58:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 94437376. Throughput: 0: 1657.4, 1: 1650.6. Samples: 23615640. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:58:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:58:43,555][100936] Updated weights for policy 0, policy_version 46110 (0.0008) +[2023-10-14 06:58:47,405][100917] Updated weights for policy 1, policy_version 46152 (0.0008) +[2023-10-14 06:58:47,776][100917] Updated weights for policy 1, policy_version 46162 (0.0010) +[2023-10-14 06:58:47,835][100936] Updated weights for policy 0, policy_version 46120 (0.0009) +[2023-10-14 06:58:48,145][100917] Updated weights for policy 1, policy_version 46172 (0.0008) +[2023-10-14 06:58:48,209][100936] Updated weights for policy 0, policy_version 46130 (0.0008) +[2023-10-14 06:58:48,512][99942] Fps is (10 sec: 9830.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 94502912. Throughput: 0: 1652.1, 1: 1646.5. Samples: 23635772. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:58:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:58:48,572][100936] Updated weights for policy 0, policy_version 46140 (0.0007) +[2023-10-14 06:58:52,435][100917] Updated weights for policy 1, policy_version 46182 (0.0010) +[2023-10-14 06:58:52,675][100936] Updated weights for policy 0, policy_version 46150 (0.0008) +[2023-10-14 06:58:52,798][100917] Updated weights for policy 1, policy_version 46192 (0.0009) +[2023-10-14 06:58:53,035][100936] Updated weights for policy 0, policy_version 46160 (0.0008) +[2023-10-14 06:58:53,172][100917] Updated weights for policy 1, policy_version 46202 (0.0008) +[2023-10-14 06:58:53,403][100936] Updated weights for policy 0, policy_version 46170 (0.0008) +[2023-10-14 06:58:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 94568448. Throughput: 0: 1646.8, 1: 1639.3. Samples: 23654134. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:58:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:58:57,170][100917] Updated weights for policy 1, policy_version 46212 (0.0009) +[2023-10-14 06:58:57,546][100917] Updated weights for policy 1, policy_version 46222 (0.0009) +[2023-10-14 06:58:57,702][100936] Updated weights for policy 0, policy_version 46180 (0.0009) +[2023-10-14 06:58:57,918][100917] Updated weights for policy 1, policy_version 46232 (0.0008) +[2023-10-14 06:58:58,074][100936] Updated weights for policy 0, policy_version 46190 (0.0010) +[2023-10-14 06:58:58,438][100936] Updated weights for policy 0, policy_version 46200 (0.0010) +[2023-10-14 06:58:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 94633984. Throughput: 0: 1647.4, 1: 1648.4. Samples: 23664710. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:58:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:59:02,154][100917] Updated weights for policy 1, policy_version 46242 (0.0009) +[2023-10-14 06:59:02,459][100936] Updated weights for policy 0, policy_version 46210 (0.0007) +[2023-10-14 06:59:02,523][100917] Updated weights for policy 1, policy_version 46252 (0.0008) +[2023-10-14 06:59:02,819][100936] Updated weights for policy 0, policy_version 46220 (0.0008) +[2023-10-14 06:59:02,893][100917] Updated weights for policy 1, policy_version 46262 (0.0008) +[2023-10-14 06:59:03,195][100936] Updated weights for policy 0, policy_version 46230 (0.0008) +[2023-10-14 06:59:03,262][100917] Updated weights for policy 1, policy_version 46272 (0.0009) +[2023-10-14 06:59:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 94699520. Throughput: 0: 1650.8, 1: 1653.6. Samples: 23685158. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:59:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:59:03,560][100936] Updated weights for policy 0, policy_version 46240 (0.0011) +[2023-10-14 06:59:07,176][100917] Updated weights for policy 1, policy_version 46282 (0.0008) +[2023-10-14 06:59:07,555][100917] Updated weights for policy 1, policy_version 46292 (0.0010) +[2023-10-14 06:59:07,878][100936] Updated weights for policy 0, policy_version 46250 (0.0008) +[2023-10-14 06:59:07,924][100917] Updated weights for policy 1, policy_version 46302 (0.0008) +[2023-10-14 06:59:08,248][100936] Updated weights for policy 0, policy_version 46260 (0.0009) +[2023-10-14 06:59:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 94765056. Throughput: 0: 1639.1, 1: 1646.6. Samples: 23703480. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:59:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:59:08,622][100936] Updated weights for policy 0, policy_version 46270 (0.0010) +[2023-10-14 06:59:12,114][100917] Updated weights for policy 1, policy_version 46312 (0.0009) +[2023-10-14 06:59:12,490][100917] Updated weights for policy 1, policy_version 46322 (0.0007) +[2023-10-14 06:59:12,753][100936] Updated weights for policy 0, policy_version 46280 (0.0008) +[2023-10-14 06:59:12,872][100917] Updated weights for policy 1, policy_version 46332 (0.0008) +[2023-10-14 06:59:13,122][100936] Updated weights for policy 0, policy_version 46290 (0.0007) +[2023-10-14 06:59:13,496][100936] Updated weights for policy 0, policy_version 46300 (0.0007) +[2023-10-14 06:59:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 94830592. Throughput: 0: 1643.2, 1: 1665.3. Samples: 23714704. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:59:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:59:16,856][100917] Updated weights for policy 1, policy_version 46342 (0.0008) +[2023-10-14 06:59:17,241][100917] Updated weights for policy 1, policy_version 46352 (0.0009) +[2023-10-14 06:59:17,575][100936] Updated weights for policy 0, policy_version 46310 (0.0008) +[2023-10-14 06:59:17,604][100917] Updated weights for policy 1, policy_version 46362 (0.0007) +[2023-10-14 06:59:17,934][100936] Updated weights for policy 0, policy_version 46320 (0.0009) +[2023-10-14 06:59:18,302][100936] Updated weights for policy 0, policy_version 46330 (0.0009) +[2023-10-14 06:59:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 94896128. Throughput: 0: 1640.2, 1: 1662.3. Samples: 23734828. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 06:59:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:59:21,609][100917] Updated weights for policy 1, policy_version 46372 (0.0007) +[2023-10-14 06:59:21,974][100917] Updated weights for policy 1, policy_version 46382 (0.0009) +[2023-10-14 06:59:22,360][100917] Updated weights for policy 1, policy_version 46392 (0.0010) +[2023-10-14 06:59:22,383][100936] Updated weights for policy 0, policy_version 46340 (0.0009) +[2023-10-14 06:59:22,753][100936] Updated weights for policy 0, policy_version 46350 (0.0007) +[2023-10-14 06:59:23,111][100936] Updated weights for policy 0, policy_version 46360 (0.0009) +[2023-10-14 06:59:23,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 94994432. Throughput: 0: 1640.9, 1: 1657.6. Samples: 23753270. Policy #0 lag: (min: 31.0, avg: 31.1, max: 36.0) +[2023-10-14 06:59:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 06:59:26,500][100917] Updated weights for policy 1, policy_version 46402 (0.0008) +[2023-10-14 06:59:26,876][100917] Updated weights for policy 1, policy_version 46412 (0.0010) +[2023-10-14 06:59:27,145][100936] Updated weights for policy 0, policy_version 46370 (0.0010) +[2023-10-14 06:59:27,244][100917] Updated weights for policy 1, policy_version 46422 (0.0008) +[2023-10-14 06:59:27,511][100936] Updated weights for policy 0, policy_version 46380 (0.0008) +[2023-10-14 06:59:27,608][100917] Updated weights for policy 1, policy_version 46432 (0.0010) +[2023-10-14 06:59:27,875][100936] Updated weights for policy 0, policy_version 46390 (0.0007) +[2023-10-14 06:59:28,248][100936] Updated weights for policy 0, policy_version 46400 (0.0007) +[2023-10-14 06:59:28,512][99942] Fps is (10 sec: 16384.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 95059968. Throughput: 0: 1646.7, 1: 1667.3. Samples: 23764772. Policy #0 lag: (min: 31.0, avg: 31.1, max: 36.0) +[2023-10-14 06:59:28,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.940')] +[2023-10-14 06:59:31,679][100917] Updated weights for policy 1, policy_version 46442 (0.0007) +[2023-10-14 06:59:32,052][100917] Updated weights for policy 1, policy_version 46452 (0.0008) +[2023-10-14 06:59:32,419][100917] Updated weights for policy 1, policy_version 46462 (0.0009) +[2023-10-14 06:59:32,671][100936] Updated weights for policy 0, policy_version 46410 (0.0008) +[2023-10-14 06:59:33,039][100936] Updated weights for policy 0, policy_version 46420 (0.0008) +[2023-10-14 06:59:33,413][100936] Updated weights for policy 0, policy_version 46430 (0.0008) +[2023-10-14 06:59:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 95125504. Throughput: 0: 1646.7, 1: 1653.3. Samples: 23784274. Policy #0 lag: (min: 31.0, avg: 31.1, max: 36.0) +[2023-10-14 06:59:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.940')] +[2023-10-14 06:59:36,577][100917] Updated weights for policy 1, policy_version 46472 (0.0009) +[2023-10-14 06:59:36,942][100917] Updated weights for policy 1, policy_version 46482 (0.0009) +[2023-10-14 06:59:37,323][100917] Updated weights for policy 1, policy_version 46492 (0.0008) +[2023-10-14 06:59:37,477][100936] Updated weights for policy 0, policy_version 46440 (0.0009) +[2023-10-14 06:59:37,852][100936] Updated weights for policy 0, policy_version 46450 (0.0010) +[2023-10-14 06:59:38,216][100936] Updated weights for policy 0, policy_version 46460 (0.0010) +[2023-10-14 06:59:38,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 95191040. Throughput: 0: 1642.5, 1: 1660.1. Samples: 23802752. Policy #0 lag: (min: 31.0, avg: 31.1, max: 36.0) +[2023-10-14 06:59:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.940')] +[2023-10-14 06:59:38,525][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000046464_47579136.pth... +[2023-10-14 06:59:38,526][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000046496_47611904.pth... +[2023-10-14 06:59:38,560][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000044960_46039040.pth +[2023-10-14 06:59:38,562][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000044896_45973504.pth +[2023-10-14 06:59:38,564][100681] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p1/milestones/checkpoint_000046496_47611904.pth +[2023-10-14 06:59:38,566][100560] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p0/milestones/checkpoint_000046464_47579136.pth +[2023-10-14 06:59:41,496][100917] Updated weights for policy 1, policy_version 46502 (0.0010) +[2023-10-14 06:59:41,867][100917] Updated weights for policy 1, policy_version 46512 (0.0009) +[2023-10-14 06:59:42,250][100917] Updated weights for policy 1, policy_version 46522 (0.0007) +[2023-10-14 06:59:42,338][100936] Updated weights for policy 0, policy_version 46470 (0.0007) +[2023-10-14 06:59:42,699][100936] Updated weights for policy 0, policy_version 46480 (0.0009) +[2023-10-14 06:59:43,069][100936] Updated weights for policy 0, policy_version 46490 (0.0009) +[2023-10-14 06:59:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 95256576. Throughput: 0: 1651.7, 1: 1667.5. Samples: 23814074. Policy #0 lag: (min: 31.0, avg: 31.1, max: 36.0) +[2023-10-14 06:59:43,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.940')] +[2023-10-14 06:59:46,454][100917] Updated weights for policy 1, policy_version 46532 (0.0008) +[2023-10-14 06:59:46,836][100917] Updated weights for policy 1, policy_version 46542 (0.0010) +[2023-10-14 06:59:47,216][100917] Updated weights for policy 1, policy_version 46552 (0.0008) +[2023-10-14 06:59:47,301][100936] Updated weights for policy 0, policy_version 46500 (0.0007) +[2023-10-14 06:59:47,670][100936] Updated weights for policy 0, policy_version 46510 (0.0008) +[2023-10-14 06:59:48,037][100936] Updated weights for policy 0, policy_version 46520 (0.0008) +[2023-10-14 06:59:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 95322112. Throughput: 0: 1647.4, 1: 1649.8. Samples: 23833532. Policy #0 lag: (min: 31.0, avg: 31.1, max: 36.0) +[2023-10-14 06:59:48,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.940')] +[2023-10-14 06:59:51,528][100917] Updated weights for policy 1, policy_version 46562 (0.0009) +[2023-10-14 06:59:51,905][100917] Updated weights for policy 1, policy_version 46572 (0.0010) +[2023-10-14 06:59:52,133][100936] Updated weights for policy 0, policy_version 46530 (0.0010) +[2023-10-14 06:59:52,286][100917] Updated weights for policy 1, policy_version 46582 (0.0009) +[2023-10-14 06:59:52,505][100936] Updated weights for policy 0, policy_version 46540 (0.0008) +[2023-10-14 06:59:52,658][100917] Updated weights for policy 1, policy_version 46592 (0.0007) +[2023-10-14 06:59:52,876][100936] Updated weights for policy 0, policy_version 46550 (0.0009) +[2023-10-14 06:59:53,259][100936] Updated weights for policy 0, policy_version 46560 (0.0009) +[2023-10-14 06:59:53,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.2, 300 sec: 13329.3). Total num frames: 95387648. Throughput: 0: 1650.0, 1: 1654.6. Samples: 23852190. Policy #0 lag: (min: 31.0, avg: 31.1, max: 36.0) +[2023-10-14 06:59:53,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.940')] +[2023-10-14 06:59:56,782][100917] Updated weights for policy 1, policy_version 46602 (0.0008) +[2023-10-14 06:59:57,161][100917] Updated weights for policy 1, policy_version 46612 (0.0008) +[2023-10-14 06:59:57,330][100936] Updated weights for policy 0, policy_version 46570 (0.0009) +[2023-10-14 06:59:57,533][100917] Updated weights for policy 1, policy_version 46622 (0.0010) +[2023-10-14 06:59:57,712][100936] Updated weights for policy 0, policy_version 46580 (0.0009) +[2023-10-14 06:59:58,090][100936] Updated weights for policy 0, policy_version 46590 (0.0007) +[2023-10-14 06:59:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 95453184. Throughput: 0: 1659.6, 1: 1654.1. Samples: 23863820. Policy #0 lag: (min: 31.0, avg: 32.5, max: 57.0) +[2023-10-14 06:59:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.940')] +[2023-10-14 07:00:01,567][100917] Updated weights for policy 1, policy_version 46632 (0.0011) +[2023-10-14 07:00:01,939][100917] Updated weights for policy 1, policy_version 46642 (0.0010) +[2023-10-14 07:00:02,250][100936] Updated weights for policy 0, policy_version 46600 (0.0007) +[2023-10-14 07:00:02,310][100917] Updated weights for policy 1, policy_version 46652 (0.0007) +[2023-10-14 07:00:02,622][100936] Updated weights for policy 0, policy_version 46610 (0.0008) +[2023-10-14 07:00:02,987][100936] Updated weights for policy 0, policy_version 46620 (0.0007) +[2023-10-14 07:00:03,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 95518720. Throughput: 0: 1645.4, 1: 1647.8. Samples: 23883020. Policy #0 lag: (min: 31.0, avg: 32.5, max: 57.0) +[2023-10-14 07:00:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.940')] +[2023-10-14 07:00:06,500][100917] Updated weights for policy 1, policy_version 46662 (0.0008) +[2023-10-14 07:00:06,867][100917] Updated weights for policy 1, policy_version 46672 (0.0007) +[2023-10-14 07:00:07,170][100936] Updated weights for policy 0, policy_version 46630 (0.0007) +[2023-10-14 07:00:07,240][100917] Updated weights for policy 1, policy_version 46682 (0.0007) +[2023-10-14 07:00:07,538][100936] Updated weights for policy 0, policy_version 46640 (0.0009) +[2023-10-14 07:00:07,906][100936] Updated weights for policy 0, policy_version 46650 (0.0010) +[2023-10-14 07:00:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 95584256. Throughput: 0: 1647.2, 1: 1655.5. Samples: 23901894. Policy #0 lag: (min: 31.0, avg: 32.5, max: 57.0) +[2023-10-14 07:00:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.940')] +[2023-10-14 07:00:11,142][100917] Updated weights for policy 1, policy_version 46692 (0.0008) +[2023-10-14 07:00:11,518][100917] Updated weights for policy 1, policy_version 46702 (0.0010) +[2023-10-14 07:00:11,882][100917] Updated weights for policy 1, policy_version 46712 (0.0009) +[2023-10-14 07:00:12,115][100936] Updated weights for policy 0, policy_version 46660 (0.0010) +[2023-10-14 07:00:12,481][100936] Updated weights for policy 0, policy_version 46670 (0.0010) +[2023-10-14 07:00:12,851][100936] Updated weights for policy 0, policy_version 46680 (0.0008) +[2023-10-14 07:00:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 95649792. Throughput: 0: 1646.8, 1: 1659.1. Samples: 23913536. Policy #0 lag: (min: 31.0, avg: 32.5, max: 57.0) +[2023-10-14 07:00:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:00:16,043][100917] Updated weights for policy 1, policy_version 46722 (0.0009) +[2023-10-14 07:00:16,459][100917] Updated weights for policy 1, policy_version 46732 (0.0009) +[2023-10-14 07:00:16,827][100917] Updated weights for policy 1, policy_version 46742 (0.0010) +[2023-10-14 07:00:17,184][100936] Updated weights for policy 0, policy_version 46690 (0.0009) +[2023-10-14 07:00:17,195][100917] Updated weights for policy 1, policy_version 46752 (0.0009) +[2023-10-14 07:00:17,587][100936] Updated weights for policy 0, policy_version 46700 (0.0007) +[2023-10-14 07:00:17,944][100936] Updated weights for policy 0, policy_version 46710 (0.0008) +[2023-10-14 07:00:18,310][100936] Updated weights for policy 0, policy_version 46720 (0.0007) +[2023-10-14 07:00:18,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 95715328. Throughput: 0: 1640.9, 1: 1651.7. Samples: 23932438. Policy #0 lag: (min: 31.0, avg: 32.5, max: 57.0) +[2023-10-14 07:00:18,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:00:21,245][100917] Updated weights for policy 1, policy_version 46762 (0.0009) +[2023-10-14 07:00:21,610][100917] Updated weights for policy 1, policy_version 46772 (0.0010) +[2023-10-14 07:00:21,982][100917] Updated weights for policy 1, policy_version 46782 (0.0010) +[2023-10-14 07:00:22,493][100936] Updated weights for policy 0, policy_version 46730 (0.0008) +[2023-10-14 07:00:22,869][100936] Updated weights for policy 0, policy_version 46740 (0.0009) +[2023-10-14 07:00:23,232][100936] Updated weights for policy 0, policy_version 46750 (0.0008) +[2023-10-14 07:00:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 95780864. Throughput: 0: 1645.0, 1: 1661.2. Samples: 23951532. Policy #0 lag: (min: 31.0, avg: 32.5, max: 57.0) +[2023-10-14 07:00:23,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:00:26,112][100917] Updated weights for policy 1, policy_version 46792 (0.0008) +[2023-10-14 07:00:26,488][100917] Updated weights for policy 1, policy_version 46802 (0.0009) +[2023-10-14 07:00:26,851][100917] Updated weights for policy 1, policy_version 46812 (0.0010) +[2023-10-14 07:00:27,246][100936] Updated weights for policy 0, policy_version 46760 (0.0008) +[2023-10-14 07:00:27,620][100936] Updated weights for policy 0, policy_version 46770 (0.0009) +[2023-10-14 07:00:27,992][100936] Updated weights for policy 0, policy_version 46780 (0.0009) +[2023-10-14 07:00:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 95846400. Throughput: 0: 1648.8, 1: 1658.4. Samples: 23962896. Policy #0 lag: (min: 31.0, avg: 32.5, max: 57.0) +[2023-10-14 07:00:28,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:00:30,776][100917] Updated weights for policy 1, policy_version 46822 (0.0008) +[2023-10-14 07:00:31,157][100917] Updated weights for policy 1, policy_version 46832 (0.0010) +[2023-10-14 07:00:31,534][100917] Updated weights for policy 1, policy_version 46842 (0.0011) +[2023-10-14 07:00:32,066][100936] Updated weights for policy 0, policy_version 46790 (0.0011) +[2023-10-14 07:00:32,440][100936] Updated weights for policy 0, policy_version 46800 (0.0009) +[2023-10-14 07:00:32,815][100936] Updated weights for policy 0, policy_version 46810 (0.0008) +[2023-10-14 07:00:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 95911936. Throughput: 0: 1637.7, 1: 1656.4. Samples: 23981770. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 07:00:33,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:00:35,611][100917] Updated weights for policy 1, policy_version 46852 (0.0010) +[2023-10-14 07:00:35,988][100917] Updated weights for policy 1, policy_version 46862 (0.0009) +[2023-10-14 07:00:36,355][100917] Updated weights for policy 1, policy_version 46872 (0.0009) +[2023-10-14 07:00:36,995][100936] Updated weights for policy 0, policy_version 46820 (0.0008) +[2023-10-14 07:00:37,353][100936] Updated weights for policy 0, policy_version 46830 (0.0011) +[2023-10-14 07:00:37,713][100936] Updated weights for policy 0, policy_version 46840 (0.0009) +[2023-10-14 07:00:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 95977472. Throughput: 0: 1642.5, 1: 1676.0. Samples: 24001524. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 07:00:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:00:40,525][100917] Updated weights for policy 1, policy_version 46882 (0.0009) +[2023-10-14 07:00:40,896][100917] Updated weights for policy 1, policy_version 46892 (0.0007) +[2023-10-14 07:00:41,263][100917] Updated weights for policy 1, policy_version 46902 (0.0011) +[2023-10-14 07:00:41,634][100917] Updated weights for policy 1, policy_version 46912 (0.0007) +[2023-10-14 07:00:41,780][100936] Updated weights for policy 0, policy_version 46850 (0.0010) +[2023-10-14 07:00:42,155][100936] Updated weights for policy 0, policy_version 46860 (0.0007) +[2023-10-14 07:00:42,529][100936] Updated weights for policy 0, policy_version 46870 (0.0008) +[2023-10-14 07:00:42,902][100936] Updated weights for policy 0, policy_version 46880 (0.0009) +[2023-10-14 07:00:43,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 96043008. Throughput: 0: 1647.1, 1: 1658.8. Samples: 24012584. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 07:00:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:00:45,514][100917] Updated weights for policy 1, policy_version 46922 (0.0008) +[2023-10-14 07:00:45,901][100917] Updated weights for policy 1, policy_version 46932 (0.0010) +[2023-10-14 07:00:46,265][100917] Updated weights for policy 1, policy_version 46942 (0.0009) +[2023-10-14 07:00:46,984][100936] Updated weights for policy 0, policy_version 46890 (0.0008) +[2023-10-14 07:00:47,351][100936] Updated weights for policy 0, policy_version 46900 (0.0009) +[2023-10-14 07:00:47,728][100936] Updated weights for policy 0, policy_version 46910 (0.0009) +[2023-10-14 07:00:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 96108544. Throughput: 0: 1644.9, 1: 1662.1. Samples: 24031836. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 07:00:48,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:00:50,482][100917] Updated weights for policy 1, policy_version 46952 (0.0011) +[2023-10-14 07:00:50,842][100917] Updated weights for policy 1, policy_version 46962 (0.0010) +[2023-10-14 07:00:51,214][100917] Updated weights for policy 1, policy_version 46972 (0.0009) +[2023-10-14 07:00:52,012][100936] Updated weights for policy 0, policy_version 46920 (0.0010) +[2023-10-14 07:00:52,378][100936] Updated weights for policy 0, policy_version 46930 (0.0011) +[2023-10-14 07:00:52,749][100936] Updated weights for policy 0, policy_version 46940 (0.0008) +[2023-10-14 07:00:53,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 96174080. Throughput: 0: 1655.7, 1: 1673.5. Samples: 24051706. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 07:00:53,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:00:55,426][100917] Updated weights for policy 1, policy_version 46982 (0.0008) +[2023-10-14 07:00:55,792][100917] Updated weights for policy 1, policy_version 46992 (0.0008) +[2023-10-14 07:00:56,177][100917] Updated weights for policy 1, policy_version 47002 (0.0009) +[2023-10-14 07:00:56,877][100936] Updated weights for policy 0, policy_version 46950 (0.0009) +[2023-10-14 07:00:57,250][100936] Updated weights for policy 0, policy_version 46960 (0.0008) +[2023-10-14 07:00:57,618][100936] Updated weights for policy 0, policy_version 46970 (0.0008) +[2023-10-14 07:00:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 96239616. Throughput: 0: 1654.7, 1: 1648.9. Samples: 24062196. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 07:00:58,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:01:00,307][100917] Updated weights for policy 1, policy_version 47012 (0.0009) +[2023-10-14 07:01:00,675][100917] Updated weights for policy 1, policy_version 47022 (0.0008) +[2023-10-14 07:01:01,036][100917] Updated weights for policy 1, policy_version 47032 (0.0007) +[2023-10-14 07:01:01,798][100936] Updated weights for policy 0, policy_version 46980 (0.0008) +[2023-10-14 07:01:02,169][100936] Updated weights for policy 0, policy_version 46990 (0.0008) +[2023-10-14 07:01:02,536][100936] Updated weights for policy 0, policy_version 47000 (0.0009) +[2023-10-14 07:01:03,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 96305152. Throughput: 0: 1645.1, 1: 1660.7. Samples: 24081198. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 07:01:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:01:04,969][100917] Updated weights for policy 1, policy_version 47042 (0.0007) +[2023-10-14 07:01:05,352][100917] Updated weights for policy 1, policy_version 47052 (0.0009) +[2023-10-14 07:01:05,727][100917] Updated weights for policy 1, policy_version 47062 (0.0009) +[2023-10-14 07:01:06,096][100917] Updated weights for policy 1, policy_version 47072 (0.0009) +[2023-10-14 07:01:06,806][100936] Updated weights for policy 0, policy_version 47010 (0.0008) +[2023-10-14 07:01:07,220][100936] Updated weights for policy 0, policy_version 47020 (0.0008) +[2023-10-14 07:01:07,593][100936] Updated weights for policy 0, policy_version 47030 (0.0008) +[2023-10-14 07:01:07,959][100936] Updated weights for policy 0, policy_version 47040 (0.0008) +[2023-10-14 07:01:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 96370688. Throughput: 0: 1654.3, 1: 1672.1. Samples: 24101220. Policy #0 lag: (min: 31.0, avg: 38.6, max: 63.0) +[2023-10-14 07:01:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:01:10,324][100917] Updated weights for policy 1, policy_version 47082 (0.0010) +[2023-10-14 07:01:10,691][100917] Updated weights for policy 1, policy_version 47092 (0.0008) +[2023-10-14 07:01:11,071][100917] Updated weights for policy 1, policy_version 47102 (0.0007) +[2023-10-14 07:01:12,149][100936] Updated weights for policy 0, policy_version 47050 (0.0009) +[2023-10-14 07:01:12,521][100936] Updated weights for policy 0, policy_version 47060 (0.0008) +[2023-10-14 07:01:12,899][100936] Updated weights for policy 0, policy_version 47070 (0.0008) +[2023-10-14 07:01:13,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 96436224. Throughput: 0: 1648.6, 1: 1652.3. Samples: 24111436. Policy #0 lag: (min: 31.0, avg: 38.6, max: 63.0) +[2023-10-14 07:01:13,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:01:15,330][100917] Updated weights for policy 1, policy_version 47112 (0.0007) +[2023-10-14 07:01:15,699][100917] Updated weights for policy 1, policy_version 47122 (0.0011) +[2023-10-14 07:01:16,070][100917] Updated weights for policy 1, policy_version 47132 (0.0010) +[2023-10-14 07:01:17,117][100936] Updated weights for policy 0, policy_version 47080 (0.0009) +[2023-10-14 07:01:17,490][100936] Updated weights for policy 0, policy_version 47090 (0.0009) +[2023-10-14 07:01:17,866][100936] Updated weights for policy 0, policy_version 47100 (0.0008) +[2023-10-14 07:01:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 96501760. Throughput: 0: 1647.8, 1: 1661.0. Samples: 24130666. Policy #0 lag: (min: 31.0, avg: 38.6, max: 63.0) +[2023-10-14 07:01:18,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:01:20,184][100917] Updated weights for policy 1, policy_version 47142 (0.0009) +[2023-10-14 07:01:20,561][100917] Updated weights for policy 1, policy_version 47152 (0.0009) +[2023-10-14 07:01:20,940][100917] Updated weights for policy 1, policy_version 47162 (0.0009) +[2023-10-14 07:01:21,946][100936] Updated weights for policy 0, policy_version 47110 (0.0008) +[2023-10-14 07:01:22,313][100936] Updated weights for policy 0, policy_version 47120 (0.0009) +[2023-10-14 07:01:22,687][100936] Updated weights for policy 0, policy_version 47130 (0.0009) +[2023-10-14 07:01:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 96567296. Throughput: 0: 1645.7, 1: 1660.4. Samples: 24150300. Policy #0 lag: (min: 31.0, avg: 38.6, max: 63.0) +[2023-10-14 07:01:23,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:01:25,190][100917] Updated weights for policy 1, policy_version 47172 (0.0010) +[2023-10-14 07:01:25,562][100917] Updated weights for policy 1, policy_version 47182 (0.0010) +[2023-10-14 07:01:25,939][100917] Updated weights for policy 1, policy_version 47192 (0.0009) +[2023-10-14 07:01:26,836][100936] Updated weights for policy 0, policy_version 47140 (0.0008) +[2023-10-14 07:01:27,216][100936] Updated weights for policy 0, policy_version 47150 (0.0009) +[2023-10-14 07:01:27,584][100936] Updated weights for policy 0, policy_version 47160 (0.0009) +[2023-10-14 07:01:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 96632832. Throughput: 0: 1641.4, 1: 1647.2. Samples: 24160568. Policy #0 lag: (min: 31.0, avg: 38.6, max: 63.0) +[2023-10-14 07:01:28,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:01:29,902][100917] Updated weights for policy 1, policy_version 47202 (0.0008) +[2023-10-14 07:01:30,269][100917] Updated weights for policy 1, policy_version 47212 (0.0009) +[2023-10-14 07:01:30,640][100917] Updated weights for policy 1, policy_version 47222 (0.0009) +[2023-10-14 07:01:31,014][100917] Updated weights for policy 1, policy_version 47232 (0.0009) +[2023-10-14 07:01:31,557][100936] Updated weights for policy 0, policy_version 47170 (0.0008) +[2023-10-14 07:01:31,930][100936] Updated weights for policy 0, policy_version 47180 (0.0007) +[2023-10-14 07:01:32,297][100936] Updated weights for policy 0, policy_version 47190 (0.0007) +[2023-10-14 07:01:32,669][100936] Updated weights for policy 0, policy_version 47200 (0.0007) +[2023-10-14 07:01:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 96698368. Throughput: 0: 1639.8, 1: 1653.2. Samples: 24180022. Policy #0 lag: (min: 31.0, avg: 38.6, max: 63.0) +[2023-10-14 07:01:33,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:01:35,176][100917] Updated weights for policy 1, policy_version 47242 (0.0008) +[2023-10-14 07:01:35,549][100917] Updated weights for policy 1, policy_version 47252 (0.0007) +[2023-10-14 07:01:35,930][100917] Updated weights for policy 1, policy_version 47262 (0.0007) +[2023-10-14 07:01:36,811][100936] Updated weights for policy 0, policy_version 47210 (0.0010) +[2023-10-14 07:01:37,178][100936] Updated weights for policy 0, policy_version 47220 (0.0010) +[2023-10-14 07:01:37,551][100936] Updated weights for policy 0, policy_version 47230 (0.0010) +[2023-10-14 07:01:38,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 96763904. Throughput: 0: 1643.7, 1: 1657.2. Samples: 24200244. Policy #0 lag: (min: 31.0, avg: 38.6, max: 63.0) +[2023-10-14 07:01:38,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:01:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000047232_48365568.pth... +[2023-10-14 07:01:38,522][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000047264_48398336.pth... +[2023-10-14 07:01:38,559][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000045696_46792704.pth +[2023-10-14 07:01:38,565][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000045728_46825472.pth +[2023-10-14 07:01:39,901][100917] Updated weights for policy 1, policy_version 47272 (0.0007) +[2023-10-14 07:01:40,277][100917] Updated weights for policy 1, policy_version 47282 (0.0007) +[2023-10-14 07:01:40,648][100917] Updated weights for policy 1, policy_version 47292 (0.0008) +[2023-10-14 07:01:41,714][100936] Updated weights for policy 0, policy_version 47240 (0.0008) +[2023-10-14 07:01:42,073][100936] Updated weights for policy 0, policy_version 47250 (0.0007) +[2023-10-14 07:01:42,446][100936] Updated weights for policy 0, policy_version 47260 (0.0009) +[2023-10-14 07:01:43,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 96829440. Throughput: 0: 1645.6, 1: 1650.8. Samples: 24210534. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 07:01:43,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:01:44,818][100917] Updated weights for policy 1, policy_version 47302 (0.0008) +[2023-10-14 07:01:45,187][100917] Updated weights for policy 1, policy_version 47312 (0.0010) +[2023-10-14 07:01:45,561][100917] Updated weights for policy 1, policy_version 47322 (0.0011) +[2023-10-14 07:01:46,453][100936] Updated weights for policy 0, policy_version 47270 (0.0008) +[2023-10-14 07:01:46,826][100936] Updated weights for policy 0, policy_version 47280 (0.0007) +[2023-10-14 07:01:47,199][100936] Updated weights for policy 0, policy_version 47290 (0.0007) +[2023-10-14 07:01:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 96894976. Throughput: 0: 1638.6, 1: 1666.1. Samples: 24229910. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 07:01:48,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:01:49,711][100917] Updated weights for policy 1, policy_version 47332 (0.0010) +[2023-10-14 07:01:50,091][100917] Updated weights for policy 1, policy_version 47342 (0.0009) +[2023-10-14 07:01:50,456][100917] Updated weights for policy 1, policy_version 47352 (0.0010) +[2023-10-14 07:01:51,445][100936] Updated weights for policy 0, policy_version 47300 (0.0008) +[2023-10-14 07:01:51,834][100936] Updated weights for policy 0, policy_version 47310 (0.0007) +[2023-10-14 07:01:52,204][100936] Updated weights for policy 0, policy_version 47320 (0.0010) +[2023-10-14 07:01:53,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 96960512. Throughput: 0: 1641.0, 1: 1662.9. Samples: 24249892. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 07:01:53,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:01:54,596][100917] Updated weights for policy 1, policy_version 47362 (0.0010) +[2023-10-14 07:01:55,011][100917] Updated weights for policy 1, policy_version 47372 (0.0009) +[2023-10-14 07:01:55,384][100917] Updated weights for policy 1, policy_version 47382 (0.0008) +[2023-10-14 07:01:55,757][100917] Updated weights for policy 1, policy_version 47392 (0.0008) +[2023-10-14 07:01:56,321][100936] Updated weights for policy 0, policy_version 47330 (0.0008) +[2023-10-14 07:01:56,690][100936] Updated weights for policy 0, policy_version 47340 (0.0008) +[2023-10-14 07:01:57,059][100936] Updated weights for policy 0, policy_version 47350 (0.0009) +[2023-10-14 07:01:57,425][100936] Updated weights for policy 0, policy_version 47360 (0.0008) +[2023-10-14 07:01:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 97026048. Throughput: 0: 1641.9, 1: 1654.5. Samples: 24259774. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 07:01:58,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:01:59,961][100917] Updated weights for policy 1, policy_version 47402 (0.0011) +[2023-10-14 07:02:00,336][100917] Updated weights for policy 1, policy_version 47412 (0.0009) +[2023-10-14 07:02:00,702][100917] Updated weights for policy 1, policy_version 47422 (0.0009) +[2023-10-14 07:02:01,764][100936] Updated weights for policy 0, policy_version 47370 (0.0009) +[2023-10-14 07:02:02,138][100936] Updated weights for policy 0, policy_version 47380 (0.0007) +[2023-10-14 07:02:02,506][100936] Updated weights for policy 0, policy_version 47390 (0.0008) +[2023-10-14 07:02:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 97091584. Throughput: 0: 1639.2, 1: 1661.0. Samples: 24279174. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 07:02:03,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:02:04,706][100917] Updated weights for policy 1, policy_version 47432 (0.0010) +[2023-10-14 07:02:05,076][100917] Updated weights for policy 1, policy_version 47442 (0.0009) +[2023-10-14 07:02:05,450][100917] Updated weights for policy 1, policy_version 47452 (0.0007) +[2023-10-14 07:02:06,638][100936] Updated weights for policy 0, policy_version 47400 (0.0010) +[2023-10-14 07:02:07,006][100936] Updated weights for policy 0, policy_version 47410 (0.0009) +[2023-10-14 07:02:07,367][100936] Updated weights for policy 0, policy_version 47420 (0.0009) +[2023-10-14 07:02:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 97157120. Throughput: 0: 1654.7, 1: 1665.5. Samples: 24299710. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 07:02:08,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:02:09,381][100917] Updated weights for policy 1, policy_version 47462 (0.0010) +[2023-10-14 07:02:09,754][100917] Updated weights for policy 1, policy_version 47472 (0.0009) +[2023-10-14 07:02:10,120][100917] Updated weights for policy 1, policy_version 47482 (0.0009) +[2023-10-14 07:02:11,245][100936] Updated weights for policy 0, policy_version 47430 (0.0009) +[2023-10-14 07:02:11,612][100936] Updated weights for policy 0, policy_version 47440 (0.0010) +[2023-10-14 07:02:11,985][100936] Updated weights for policy 0, policy_version 47450 (0.0010) +[2023-10-14 07:02:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 97222656. Throughput: 0: 1649.3, 1: 1668.6. Samples: 24309872. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 07:02:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:02:14,247][100917] Updated weights for policy 1, policy_version 47492 (0.0010) +[2023-10-14 07:02:14,623][100917] Updated weights for policy 1, policy_version 47502 (0.0009) +[2023-10-14 07:02:15,002][100917] Updated weights for policy 1, policy_version 47512 (0.0010) +[2023-10-14 07:02:16,074][100936] Updated weights for policy 0, policy_version 47460 (0.0010) +[2023-10-14 07:02:16,444][100936] Updated weights for policy 0, policy_version 47470 (0.0007) +[2023-10-14 07:02:16,817][100936] Updated weights for policy 0, policy_version 47480 (0.0009) +[2023-10-14 07:02:18,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 97288192. Throughput: 0: 1650.1, 1: 1671.3. Samples: 24329484. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) +[2023-10-14 07:02:18,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:02:19,116][100917] Updated weights for policy 1, policy_version 47522 (0.0009) +[2023-10-14 07:02:19,483][100917] Updated weights for policy 1, policy_version 47532 (0.0007) +[2023-10-14 07:02:19,857][100917] Updated weights for policy 1, policy_version 47542 (0.0009) +[2023-10-14 07:02:20,235][100917] Updated weights for policy 1, policy_version 47552 (0.0009) +[2023-10-14 07:02:20,863][100936] Updated weights for policy 0, policy_version 47490 (0.0009) +[2023-10-14 07:02:21,227][100936] Updated weights for policy 0, policy_version 47500 (0.0009) +[2023-10-14 07:02:21,598][100936] Updated weights for policy 0, policy_version 47510 (0.0011) +[2023-10-14 07:02:21,976][100936] Updated weights for policy 0, policy_version 47520 (0.0010) +[2023-10-14 07:02:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 97353728. Throughput: 0: 1663.2, 1: 1670.0. Samples: 24350238. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) +[2023-10-14 07:02:23,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:02:24,457][100917] Updated weights for policy 1, policy_version 47562 (0.0008) +[2023-10-14 07:02:24,829][100917] Updated weights for policy 1, policy_version 47572 (0.0009) +[2023-10-14 07:02:25,210][100917] Updated weights for policy 1, policy_version 47582 (0.0009) +[2023-10-14 07:02:26,117][100936] Updated weights for policy 0, policy_version 47530 (0.0009) +[2023-10-14 07:02:26,499][100936] Updated weights for policy 0, policy_version 47540 (0.0010) +[2023-10-14 07:02:26,865][100936] Updated weights for policy 0, policy_version 47550 (0.0009) +[2023-10-14 07:02:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 97419264. Throughput: 0: 1653.3, 1: 1665.4. Samples: 24359876. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) +[2023-10-14 07:02:28,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:02:29,246][100917] Updated weights for policy 1, policy_version 47592 (0.0009) +[2023-10-14 07:02:29,608][100917] Updated weights for policy 1, policy_version 47602 (0.0007) +[2023-10-14 07:02:29,983][100917] Updated weights for policy 1, policy_version 47612 (0.0009) +[2023-10-14 07:02:30,935][100936] Updated weights for policy 0, policy_version 47560 (0.0009) +[2023-10-14 07:02:31,302][100936] Updated weights for policy 0, policy_version 47570 (0.0009) +[2023-10-14 07:02:31,682][100936] Updated weights for policy 0, policy_version 47580 (0.0008) +[2023-10-14 07:02:33,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 97484800. Throughput: 0: 1664.8, 1: 1665.6. Samples: 24379778. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) +[2023-10-14 07:02:33,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:02:34,317][100917] Updated weights for policy 1, policy_version 47622 (0.0010) +[2023-10-14 07:02:34,691][100917] Updated weights for policy 1, policy_version 47632 (0.0009) +[2023-10-14 07:02:35,065][100917] Updated weights for policy 1, policy_version 47642 (0.0010) +[2023-10-14 07:02:35,856][100936] Updated weights for policy 0, policy_version 47590 (0.0007) +[2023-10-14 07:02:36,229][100936] Updated weights for policy 0, policy_version 47600 (0.0007) +[2023-10-14 07:02:36,598][100936] Updated weights for policy 0, policy_version 47610 (0.0008) +[2023-10-14 07:02:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 97550336. Throughput: 0: 1675.5, 1: 1662.9. Samples: 24400124. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) +[2023-10-14 07:02:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:02:39,086][100917] Updated weights for policy 1, policy_version 47652 (0.0011) +[2023-10-14 07:02:39,487][100917] Updated weights for policy 1, policy_version 47662 (0.0009) +[2023-10-14 07:02:39,858][100917] Updated weights for policy 1, policy_version 47672 (0.0010) +[2023-10-14 07:02:40,859][100936] Updated weights for policy 0, policy_version 47620 (0.0008) +[2023-10-14 07:02:41,229][100936] Updated weights for policy 0, policy_version 47630 (0.0009) +[2023-10-14 07:02:41,603][100936] Updated weights for policy 0, policy_version 47640 (0.0010) +[2023-10-14 07:02:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 97615872. Throughput: 0: 1661.3, 1: 1665.7. Samples: 24409490. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) +[2023-10-14 07:02:43,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:02:43,804][100917] Updated weights for policy 1, policy_version 47682 (0.0007) +[2023-10-14 07:02:44,170][100917] Updated weights for policy 1, policy_version 47692 (0.0009) +[2023-10-14 07:02:44,539][100917] Updated weights for policy 1, policy_version 47702 (0.0008) +[2023-10-14 07:02:44,917][100917] Updated weights for policy 1, policy_version 47712 (0.0008) +[2023-10-14 07:02:45,659][100936] Updated weights for policy 0, policy_version 47650 (0.0009) +[2023-10-14 07:02:46,026][100936] Updated weights for policy 0, policy_version 47660 (0.0007) +[2023-10-14 07:02:46,386][100936] Updated weights for policy 0, policy_version 47670 (0.0010) +[2023-10-14 07:02:46,757][100936] Updated weights for policy 0, policy_version 47680 (0.0011) +[2023-10-14 07:02:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 97681408. Throughput: 0: 1670.8, 1: 1674.3. Samples: 24429702. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) +[2023-10-14 07:02:48,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:02:49,058][100917] Updated weights for policy 1, policy_version 47722 (0.0008) +[2023-10-14 07:02:49,436][100917] Updated weights for policy 1, policy_version 47732 (0.0010) +[2023-10-14 07:02:49,804][100917] Updated weights for policy 1, policy_version 47742 (0.0009) +[2023-10-14 07:02:50,847][100936] Updated weights for policy 0, policy_version 47690 (0.0007) +[2023-10-14 07:02:51,217][100936] Updated weights for policy 0, policy_version 47700 (0.0009) +[2023-10-14 07:02:51,582][100936] Updated weights for policy 0, policy_version 47710 (0.0010) +[2023-10-14 07:02:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 97746944. Throughput: 0: 1674.2, 1: 1672.8. Samples: 24450326. Policy #0 lag: (min: 9.0, avg: 18.4, max: 41.0) +[2023-10-14 07:02:53,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:02:53,660][100917] Updated weights for policy 1, policy_version 47752 (0.0009) +[2023-10-14 07:02:54,027][100917] Updated weights for policy 1, policy_version 47762 (0.0008) +[2023-10-14 07:02:54,402][100917] Updated weights for policy 1, policy_version 47772 (0.0008) +[2023-10-14 07:02:55,683][100936] Updated weights for policy 0, policy_version 47720 (0.0009) +[2023-10-14 07:02:56,049][100936] Updated weights for policy 0, policy_version 47730 (0.0008) +[2023-10-14 07:02:56,411][100936] Updated weights for policy 0, policy_version 47740 (0.0007) +[2023-10-14 07:02:58,405][100917] Updated weights for policy 1, policy_version 47782 (0.0010) +[2023-10-14 07:02:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 97812480. Throughput: 0: 1656.1, 1: 1673.1. Samples: 24459682. Policy #0 lag: (min: 9.0, avg: 18.4, max: 41.0) +[2023-10-14 07:02:58,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:02:58,779][100917] Updated weights for policy 1, policy_version 47792 (0.0010) +[2023-10-14 07:02:59,161][100917] Updated weights for policy 1, policy_version 47802 (0.0009) +[2023-10-14 07:03:00,443][100936] Updated weights for policy 0, policy_version 47750 (0.0009) +[2023-10-14 07:03:00,811][100936] Updated weights for policy 0, policy_version 47760 (0.0011) +[2023-10-14 07:03:01,179][100936] Updated weights for policy 0, policy_version 47770 (0.0011) +[2023-10-14 07:03:03,412][100917] Updated weights for policy 1, policy_version 47812 (0.0007) +[2023-10-14 07:03:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 97878016. Throughput: 0: 1672.1, 1: 1670.1. Samples: 24479884. Policy #0 lag: (min: 9.0, avg: 18.4, max: 41.0) +[2023-10-14 07:03:03,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:03:03,790][100917] Updated weights for policy 1, policy_version 47822 (0.0008) +[2023-10-14 07:03:04,160][100917] Updated weights for policy 1, policy_version 47832 (0.0011) +[2023-10-14 07:03:05,398][100936] Updated weights for policy 0, policy_version 47780 (0.0009) +[2023-10-14 07:03:05,764][100936] Updated weights for policy 0, policy_version 47790 (0.0010) +[2023-10-14 07:03:06,138][100936] Updated weights for policy 0, policy_version 47800 (0.0007) +[2023-10-14 07:03:08,365][100917] Updated weights for policy 1, policy_version 47842 (0.0009) +[2023-10-14 07:03:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 97943552. Throughput: 0: 1668.4, 1: 1670.4. Samples: 24500484. Policy #0 lag: (min: 9.0, avg: 18.4, max: 41.0) +[2023-10-14 07:03:08,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:03:08,735][100917] Updated weights for policy 1, policy_version 47852 (0.0007) +[2023-10-14 07:03:09,109][100917] Updated weights for policy 1, policy_version 47862 (0.0007) +[2023-10-14 07:03:09,486][100917] Updated weights for policy 1, policy_version 47872 (0.0010) +[2023-10-14 07:03:10,080][100936] Updated weights for policy 0, policy_version 47810 (0.0009) +[2023-10-14 07:03:10,460][100936] Updated weights for policy 0, policy_version 47820 (0.0008) +[2023-10-14 07:03:10,829][100936] Updated weights for policy 0, policy_version 47830 (0.0009) +[2023-10-14 07:03:11,201][100936] Updated weights for policy 0, policy_version 47840 (0.0008) +[2023-10-14 07:03:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 98009088. Throughput: 0: 1650.5, 1: 1674.9. Samples: 24509520. Policy #0 lag: (min: 9.0, avg: 18.4, max: 41.0) +[2023-10-14 07:03:13,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:03:13,651][100917] Updated weights for policy 1, policy_version 47882 (0.0007) +[2023-10-14 07:03:14,029][100917] Updated weights for policy 1, policy_version 47892 (0.0008) +[2023-10-14 07:03:14,404][100917] Updated weights for policy 1, policy_version 47902 (0.0007) +[2023-10-14 07:03:15,292][100936] Updated weights for policy 0, policy_version 47850 (0.0008) +[2023-10-14 07:03:15,658][100936] Updated weights for policy 0, policy_version 47860 (0.0009) +[2023-10-14 07:03:16,032][100936] Updated weights for policy 0, policy_version 47870 (0.0007) +[2023-10-14 07:03:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 98074624. Throughput: 0: 1665.5, 1: 1672.6. Samples: 24529992. Policy #0 lag: (min: 9.0, avg: 18.4, max: 41.0) +[2023-10-14 07:03:18,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:03:18,516][100917] Updated weights for policy 1, policy_version 47912 (0.0008) +[2023-10-14 07:03:18,896][100917] Updated weights for policy 1, policy_version 47922 (0.0009) +[2023-10-14 07:03:19,267][100917] Updated weights for policy 1, policy_version 47932 (0.0007) +[2023-10-14 07:03:20,139][100936] Updated weights for policy 0, policy_version 47880 (0.0010) +[2023-10-14 07:03:20,510][100936] Updated weights for policy 0, policy_version 47890 (0.0008) +[2023-10-14 07:03:20,890][100936] Updated weights for policy 0, policy_version 47900 (0.0009) +[2023-10-14 07:03:23,358][100917] Updated weights for policy 1, policy_version 47942 (0.0009) +[2023-10-14 07:03:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 98140160. Throughput: 0: 1667.5, 1: 1670.4. Samples: 24550328. Policy #0 lag: (min: 9.0, avg: 18.4, max: 41.0) +[2023-10-14 07:03:23,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:03:23,725][100917] Updated weights for policy 1, policy_version 47952 (0.0008) +[2023-10-14 07:03:24,103][100917] Updated weights for policy 1, policy_version 47962 (0.0009) +[2023-10-14 07:03:25,020][100936] Updated weights for policy 0, policy_version 47910 (0.0009) +[2023-10-14 07:03:25,389][100936] Updated weights for policy 0, policy_version 47920 (0.0008) +[2023-10-14 07:03:25,766][100936] Updated weights for policy 0, policy_version 47930 (0.0010) +[2023-10-14 07:03:28,260][100917] Updated weights for policy 1, policy_version 47972 (0.0011) +[2023-10-14 07:03:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 98205696. Throughput: 0: 1657.2, 1: 1671.8. Samples: 24559296. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:03:28,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:03:28,656][100917] Updated weights for policy 1, policy_version 47982 (0.0010) +[2023-10-14 07:03:29,030][100917] Updated weights for policy 1, policy_version 47992 (0.0007) +[2023-10-14 07:03:29,931][100936] Updated weights for policy 0, policy_version 47940 (0.0007) +[2023-10-14 07:03:30,320][100936] Updated weights for policy 0, policy_version 47950 (0.0010) +[2023-10-14 07:03:30,693][100936] Updated weights for policy 0, policy_version 47960 (0.0010) +[2023-10-14 07:03:33,218][100917] Updated weights for policy 1, policy_version 48002 (0.0010) +[2023-10-14 07:03:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 98271232. Throughput: 0: 1673.0, 1: 1663.2. Samples: 24579830. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:03:33,512][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:03:33,583][100917] Updated weights for policy 1, policy_version 48012 (0.0009) +[2023-10-14 07:03:33,961][100917] Updated weights for policy 1, policy_version 48022 (0.0010) +[2023-10-14 07:03:34,329][100917] Updated weights for policy 1, policy_version 48032 (0.0009) +[2023-10-14 07:03:34,822][100936] Updated weights for policy 0, policy_version 47970 (0.0008) +[2023-10-14 07:03:35,192][100936] Updated weights for policy 0, policy_version 47980 (0.0007) +[2023-10-14 07:03:35,570][100936] Updated weights for policy 0, policy_version 47990 (0.0007) +[2023-10-14 07:03:35,931][100936] Updated weights for policy 0, policy_version 48000 (0.0009) +[2023-10-14 07:03:38,172][100917] Updated weights for policy 1, policy_version 48042 (0.0008) +[2023-10-14 07:03:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 98336768. Throughput: 0: 1668.5, 1: 1657.5. Samples: 24599998. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:03:38,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:03:38,523][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000048000_49152000.pth... +[2023-10-14 07:03:38,544][100917] Updated weights for policy 1, policy_version 48052 (0.0009) +[2023-10-14 07:03:38,560][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000046464_47579136.pth +[2023-10-14 07:03:38,933][100917] Updated weights for policy 1, policy_version 48062 (0.0010) +[2023-10-14 07:03:39,000][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000048064_49217536.pth... +[2023-10-14 07:03:39,042][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000046496_47611904.pth +[2023-10-14 07:03:40,168][100936] Updated weights for policy 0, policy_version 48010 (0.0007) +[2023-10-14 07:03:40,534][100936] Updated weights for policy 0, policy_version 48020 (0.0008) +[2023-10-14 07:03:40,903][100936] Updated weights for policy 0, policy_version 48030 (0.0007) +[2023-10-14 07:03:43,220][100917] Updated weights for policy 1, policy_version 48072 (0.0007) +[2023-10-14 07:03:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 98402304. Throughput: 0: 1663.2, 1: 1656.8. Samples: 24609084. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:03:43,512][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:03:43,603][100917] Updated weights for policy 1, policy_version 48082 (0.0009) +[2023-10-14 07:03:43,970][100917] Updated weights for policy 1, policy_version 48092 (0.0008) +[2023-10-14 07:03:45,003][100936] Updated weights for policy 0, policy_version 48040 (0.0011) +[2023-10-14 07:03:45,367][100936] Updated weights for policy 0, policy_version 48050 (0.0010) +[2023-10-14 07:03:45,740][100936] Updated weights for policy 0, policy_version 48060 (0.0008) +[2023-10-14 07:03:48,141][100917] Updated weights for policy 1, policy_version 48102 (0.0009) +[2023-10-14 07:03:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 98467840. Throughput: 0: 1663.6, 1: 1665.2. Samples: 24629684. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:03:48,513][100917] Updated weights for policy 1, policy_version 48112 (0.0007) +[2023-10-14 07:03:48,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:03:48,888][100917] Updated weights for policy 1, policy_version 48122 (0.0009) +[2023-10-14 07:03:49,750][100936] Updated weights for policy 0, policy_version 48070 (0.0010) +[2023-10-14 07:03:50,126][100936] Updated weights for policy 0, policy_version 48080 (0.0009) +[2023-10-14 07:03:50,495][100936] Updated weights for policy 0, policy_version 48090 (0.0008) +[2023-10-14 07:03:52,993][100917] Updated weights for policy 1, policy_version 48132 (0.0009) +[2023-10-14 07:03:53,360][100917] Updated weights for policy 1, policy_version 48142 (0.0008) +[2023-10-14 07:03:53,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 98533376. Throughput: 0: 1663.6, 1: 1664.0. Samples: 24650228. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:03:53,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:03:53,731][100917] Updated weights for policy 1, policy_version 48152 (0.0008) +[2023-10-14 07:03:54,508][100936] Updated weights for policy 0, policy_version 48100 (0.0007) +[2023-10-14 07:03:54,879][100936] Updated weights for policy 0, policy_version 48110 (0.0008) +[2023-10-14 07:03:55,261][100936] Updated weights for policy 0, policy_version 48120 (0.0010) +[2023-10-14 07:03:57,891][100917] Updated weights for policy 1, policy_version 48162 (0.0009) +[2023-10-14 07:03:58,263][100917] Updated weights for policy 1, policy_version 48172 (0.0008) +[2023-10-14 07:03:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 98598912. Throughput: 0: 1663.9, 1: 1666.6. Samples: 24659392. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:03:58,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:03:58,636][100917] Updated weights for policy 1, policy_version 48182 (0.0008) +[2023-10-14 07:03:59,003][100917] Updated weights for policy 1, policy_version 48192 (0.0009) +[2023-10-14 07:03:59,273][100936] Updated weights for policy 0, policy_version 48130 (0.0010) +[2023-10-14 07:03:59,648][100936] Updated weights for policy 0, policy_version 48140 (0.0009) +[2023-10-14 07:04:00,021][100936] Updated weights for policy 0, policy_version 48150 (0.0009) +[2023-10-14 07:04:00,390][100936] Updated weights for policy 0, policy_version 48160 (0.0010) +[2023-10-14 07:04:03,043][100917] Updated weights for policy 1, policy_version 48202 (0.0008) +[2023-10-14 07:04:03,424][100917] Updated weights for policy 1, policy_version 48212 (0.0009) +[2023-10-14 07:04:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 98664448. Throughput: 0: 1661.3, 1: 1663.9. Samples: 24679626. Policy #0 lag: (min: 10.0, avg: 14.4, max: 42.0) +[2023-10-14 07:04:03,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:04:03,798][100917] Updated weights for policy 1, policy_version 48222 (0.0011) +[2023-10-14 07:04:04,697][100936] Updated weights for policy 0, policy_version 48170 (0.0007) +[2023-10-14 07:04:05,081][100936] Updated weights for policy 0, policy_version 48180 (0.0009) +[2023-10-14 07:04:05,441][100936] Updated weights for policy 0, policy_version 48190 (0.0009) +[2023-10-14 07:04:07,831][100917] Updated weights for policy 1, policy_version 48232 (0.0009) +[2023-10-14 07:04:08,207][100917] Updated weights for policy 1, policy_version 48242 (0.0009) +[2023-10-14 07:04:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 98729984. Throughput: 0: 1661.7, 1: 1656.1. Samples: 24699630. Policy #0 lag: (min: 10.0, avg: 14.4, max: 42.0) +[2023-10-14 07:04:08,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:04:08,578][100917] Updated weights for policy 1, policy_version 48252 (0.0008) +[2023-10-14 07:04:09,607][100936] Updated weights for policy 0, policy_version 48200 (0.0008) +[2023-10-14 07:04:09,975][100936] Updated weights for policy 0, policy_version 48210 (0.0010) +[2023-10-14 07:04:10,345][100936] Updated weights for policy 0, policy_version 48220 (0.0007) +[2023-10-14 07:04:12,608][100917] Updated weights for policy 1, policy_version 48262 (0.0007) +[2023-10-14 07:04:12,972][100917] Updated weights for policy 1, policy_version 48272 (0.0009) +[2023-10-14 07:04:13,345][100917] Updated weights for policy 1, policy_version 48282 (0.0009) +[2023-10-14 07:04:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 98795520. Throughput: 0: 1660.6, 1: 1667.2. Samples: 24709048. Policy #0 lag: (min: 10.0, avg: 14.4, max: 42.0) +[2023-10-14 07:04:13,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 07:04:14,637][100936] Updated weights for policy 0, policy_version 48230 (0.0009) +[2023-10-14 07:04:14,995][100936] Updated weights for policy 0, policy_version 48240 (0.0009) +[2023-10-14 07:04:15,367][100936] Updated weights for policy 0, policy_version 48250 (0.0009) +[2023-10-14 07:04:17,574][100917] Updated weights for policy 1, policy_version 48292 (0.0009) +[2023-10-14 07:04:17,977][100917] Updated weights for policy 1, policy_version 48302 (0.0007) +[2023-10-14 07:04:18,357][100917] Updated weights for policy 1, policy_version 48312 (0.0008) +[2023-10-14 07:04:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 98861056. Throughput: 0: 1654.4, 1: 1669.4. Samples: 24729398. Policy #0 lag: (min: 10.0, avg: 14.4, max: 42.0) +[2023-10-14 07:04:18,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 07:04:19,566][100936] Updated weights for policy 0, policy_version 48260 (0.0008) +[2023-10-14 07:04:19,958][100936] Updated weights for policy 0, policy_version 48270 (0.0009) +[2023-10-14 07:04:20,324][100936] Updated weights for policy 0, policy_version 48280 (0.0009) +[2023-10-14 07:04:22,430][100917] Updated weights for policy 1, policy_version 48322 (0.0009) +[2023-10-14 07:04:22,791][100917] Updated weights for policy 1, policy_version 48332 (0.0011) +[2023-10-14 07:04:23,164][100917] Updated weights for policy 1, policy_version 48342 (0.0009) +[2023-10-14 07:04:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13107.2). Total num frames: 98926592. Throughput: 0: 1660.2, 1: 1653.8. Samples: 24749128. Policy #0 lag: (min: 10.0, avg: 14.4, max: 42.0) +[2023-10-14 07:04:23,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:04:23,537][100917] Updated weights for policy 1, policy_version 48352 (0.0010) +[2023-10-14 07:04:24,449][100936] Updated weights for policy 0, policy_version 48290 (0.0009) +[2023-10-14 07:04:24,816][100936] Updated weights for policy 0, policy_version 48300 (0.0008) +[2023-10-14 07:04:25,191][100936] Updated weights for policy 0, policy_version 48310 (0.0008) +[2023-10-14 07:04:25,565][100936] Updated weights for policy 0, policy_version 48320 (0.0008) +[2023-10-14 07:04:27,803][100917] Updated weights for policy 1, policy_version 48362 (0.0009) +[2023-10-14 07:04:28,186][100917] Updated weights for policy 1, policy_version 48372 (0.0008) +[2023-10-14 07:04:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 98992128. Throughput: 0: 1660.7, 1: 1662.6. Samples: 24758632. Policy #0 lag: (min: 10.0, avg: 14.4, max: 42.0) +[2023-10-14 07:04:28,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:04:28,556][100917] Updated weights for policy 1, policy_version 48382 (0.0009) +[2023-10-14 07:04:29,699][100936] Updated weights for policy 0, policy_version 48330 (0.0008) +[2023-10-14 07:04:30,079][100936] Updated weights for policy 0, policy_version 48340 (0.0009) +[2023-10-14 07:04:30,449][100936] Updated weights for policy 0, policy_version 48350 (0.0008) +[2023-10-14 07:04:32,570][100917] Updated weights for policy 1, policy_version 48392 (0.0009) +[2023-10-14 07:04:32,949][100917] Updated weights for policy 1, policy_version 48402 (0.0010) +[2023-10-14 07:04:33,326][100917] Updated weights for policy 1, policy_version 48412 (0.0010) +[2023-10-14 07:04:33,512][99942] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 99090432. Throughput: 0: 1654.6, 1: 1659.8. Samples: 24778832. Policy #0 lag: (min: 10.0, avg: 14.4, max: 42.0) +[2023-10-14 07:04:33,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:04:34,676][100936] Updated weights for policy 0, policy_version 48360 (0.0008) +[2023-10-14 07:04:35,050][100936] Updated weights for policy 0, policy_version 48370 (0.0009) +[2023-10-14 07:04:35,425][100936] Updated weights for policy 0, policy_version 48380 (0.0007) +[2023-10-14 07:04:37,430][100917] Updated weights for policy 1, policy_version 48422 (0.0007) +[2023-10-14 07:04:37,796][100917] Updated weights for policy 1, policy_version 48432 (0.0008) +[2023-10-14 07:04:38,176][100917] Updated weights for policy 1, policy_version 48442 (0.0007) +[2023-10-14 07:04:38,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 99155968. Throughput: 0: 1652.0, 1: 1644.2. Samples: 24798556. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 07:04:38,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:04:39,514][100936] Updated weights for policy 0, policy_version 48390 (0.0008) +[2023-10-14 07:04:39,878][100936] Updated weights for policy 0, policy_version 48400 (0.0008) +[2023-10-14 07:04:40,256][100936] Updated weights for policy 0, policy_version 48410 (0.0009) +[2023-10-14 07:04:42,211][100917] Updated weights for policy 1, policy_version 48452 (0.0009) +[2023-10-14 07:04:42,593][100917] Updated weights for policy 1, policy_version 48462 (0.0009) +[2023-10-14 07:04:42,968][100917] Updated weights for policy 1, policy_version 48472 (0.0009) +[2023-10-14 07:04:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 99221504. Throughput: 0: 1651.2, 1: 1660.0. Samples: 24808394. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 07:04:43,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:04:44,242][100936] Updated weights for policy 0, policy_version 48420 (0.0008) +[2023-10-14 07:04:44,613][100936] Updated weights for policy 0, policy_version 48430 (0.0007) +[2023-10-14 07:04:44,974][100936] Updated weights for policy 0, policy_version 48440 (0.0009) +[2023-10-14 07:04:47,230][100917] Updated weights for policy 1, policy_version 48482 (0.0008) +[2023-10-14 07:04:47,608][100917] Updated weights for policy 1, policy_version 48492 (0.0009) +[2023-10-14 07:04:47,978][100917] Updated weights for policy 1, policy_version 48502 (0.0009) +[2023-10-14 07:04:48,350][100917] Updated weights for policy 1, policy_version 48512 (0.0011) +[2023-10-14 07:04:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 99287040. Throughput: 0: 1654.9, 1: 1658.8. Samples: 24828740. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 07:04:48,512][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:04:49,150][100936] Updated weights for policy 0, policy_version 48450 (0.0008) +[2023-10-14 07:04:49,529][100936] Updated weights for policy 0, policy_version 48460 (0.0007) +[2023-10-14 07:04:49,908][100936] Updated weights for policy 0, policy_version 48470 (0.0007) +[2023-10-14 07:04:50,269][100936] Updated weights for policy 0, policy_version 48480 (0.0009) +[2023-10-14 07:04:52,421][100917] Updated weights for policy 1, policy_version 48522 (0.0007) +[2023-10-14 07:04:52,789][100917] Updated weights for policy 1, policy_version 48532 (0.0008) +[2023-10-14 07:04:53,164][100917] Updated weights for policy 1, policy_version 48542 (0.0009) +[2023-10-14 07:04:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 99352576. Throughput: 0: 1657.3, 1: 1646.2. Samples: 24848288. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 07:04:53,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 07:04:54,333][100936] Updated weights for policy 0, policy_version 48490 (0.0009) +[2023-10-14 07:04:54,699][100936] Updated weights for policy 0, policy_version 48500 (0.0009) +[2023-10-14 07:04:55,070][100936] Updated weights for policy 0, policy_version 48510 (0.0009) +[2023-10-14 07:04:57,084][100917] Updated weights for policy 1, policy_version 48552 (0.0008) +[2023-10-14 07:04:57,463][100917] Updated weights for policy 1, policy_version 48562 (0.0007) +[2023-10-14 07:04:57,837][100917] Updated weights for policy 1, policy_version 48572 (0.0009) +[2023-10-14 07:04:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 99418112. Throughput: 0: 1658.2, 1: 1663.4. Samples: 24858522. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 07:04:58,512][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 07:04:59,166][100936] Updated weights for policy 0, policy_version 48520 (0.0007) +[2023-10-14 07:04:59,541][100936] Updated weights for policy 0, policy_version 48530 (0.0008) +[2023-10-14 07:04:59,915][100936] Updated weights for policy 0, policy_version 48540 (0.0009) +[2023-10-14 07:05:02,146][100917] Updated weights for policy 1, policy_version 48582 (0.0007) +[2023-10-14 07:05:02,534][100917] Updated weights for policy 1, policy_version 48592 (0.0007) +[2023-10-14 07:05:02,916][100917] Updated weights for policy 1, policy_version 48602 (0.0007) +[2023-10-14 07:05:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 99483648. Throughput: 0: 1656.9, 1: 1660.2. Samples: 24878670. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 07:05:03,512][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 07:05:04,039][100936] Updated weights for policy 0, policy_version 48550 (0.0009) +[2023-10-14 07:05:04,411][100936] Updated weights for policy 0, policy_version 48560 (0.0011) +[2023-10-14 07:05:04,789][100936] Updated weights for policy 0, policy_version 48570 (0.0009) +[2023-10-14 07:05:06,831][100917] Updated weights for policy 1, policy_version 48612 (0.0007) +[2023-10-14 07:05:07,207][100917] Updated weights for policy 1, policy_version 48622 (0.0009) +[2023-10-14 07:05:07,580][100917] Updated weights for policy 1, policy_version 48632 (0.0009) +[2023-10-14 07:05:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 99549184. Throughput: 0: 1654.2, 1: 1645.0. Samples: 24897592. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 07:05:08,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 07:05:09,003][100936] Updated weights for policy 0, policy_version 48580 (0.0009) +[2023-10-14 07:05:09,396][100936] Updated weights for policy 0, policy_version 48590 (0.0008) +[2023-10-14 07:05:09,756][100936] Updated weights for policy 0, policy_version 48600 (0.0010) +[2023-10-14 07:05:11,644][100917] Updated weights for policy 1, policy_version 48642 (0.0010) +[2023-10-14 07:05:12,018][100917] Updated weights for policy 1, policy_version 48652 (0.0009) +[2023-10-14 07:05:12,391][100917] Updated weights for policy 1, policy_version 48662 (0.0008) +[2023-10-14 07:05:12,758][100917] Updated weights for policy 1, policy_version 48672 (0.0009) +[2023-10-14 07:05:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 99614720. Throughput: 0: 1653.4, 1: 1663.6. Samples: 24907896. Policy #0 lag: (min: 26.0, avg: 34.5, max: 58.0) +[2023-10-14 07:05:13,512][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 07:05:13,866][100936] Updated weights for policy 0, policy_version 48610 (0.0011) +[2023-10-14 07:05:14,241][100936] Updated weights for policy 0, policy_version 48620 (0.0007) +[2023-10-14 07:05:14,615][100936] Updated weights for policy 0, policy_version 48630 (0.0007) +[2023-10-14 07:05:14,990][100936] Updated weights for policy 0, policy_version 48640 (0.0010) +[2023-10-14 07:05:17,088][100917] Updated weights for policy 1, policy_version 48682 (0.0009) +[2023-10-14 07:05:17,459][100917] Updated weights for policy 1, policy_version 48692 (0.0010) +[2023-10-14 07:05:17,829][100917] Updated weights for policy 1, policy_version 48702 (0.0008) +[2023-10-14 07:05:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 99680256. Throughput: 0: 1657.8, 1: 1655.2. Samples: 24927914. Policy #0 lag: (min: 26.0, avg: 34.5, max: 58.0) +[2023-10-14 07:05:18,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 07:05:19,275][100936] Updated weights for policy 0, policy_version 48650 (0.0009) +[2023-10-14 07:05:19,648][100936] Updated weights for policy 0, policy_version 48660 (0.0010) +[2023-10-14 07:05:20,024][100936] Updated weights for policy 0, policy_version 48670 (0.0009) +[2023-10-14 07:05:21,845][100917] Updated weights for policy 1, policy_version 48712 (0.0010) +[2023-10-14 07:05:22,230][100917] Updated weights for policy 1, policy_version 48722 (0.0010) +[2023-10-14 07:05:22,600][100917] Updated weights for policy 1, policy_version 48732 (0.0010) +[2023-10-14 07:05:23,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 99745792. Throughput: 0: 1657.0, 1: 1647.4. Samples: 24947254. Policy #0 lag: (min: 26.0, avg: 34.5, max: 58.0) +[2023-10-14 07:05:23,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 07:05:24,056][100936] Updated weights for policy 0, policy_version 48680 (0.0010) +[2023-10-14 07:05:24,429][100936] Updated weights for policy 0, policy_version 48690 (0.0009) +[2023-10-14 07:05:24,809][100936] Updated weights for policy 0, policy_version 48700 (0.0009) +[2023-10-14 07:05:26,685][100917] Updated weights for policy 1, policy_version 48742 (0.0008) +[2023-10-14 07:05:27,063][100917] Updated weights for policy 1, policy_version 48752 (0.0007) +[2023-10-14 07:05:27,443][100917] Updated weights for policy 1, policy_version 48762 (0.0007) +[2023-10-14 07:05:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 99811328. Throughput: 0: 1656.5, 1: 1657.4. Samples: 24957520. Policy #0 lag: (min: 26.0, avg: 34.5, max: 58.0) +[2023-10-14 07:05:28,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 07:05:29,021][100936] Updated weights for policy 0, policy_version 48710 (0.0008) +[2023-10-14 07:05:29,390][100936] Updated weights for policy 0, policy_version 48720 (0.0009) +[2023-10-14 07:05:29,761][100936] Updated weights for policy 0, policy_version 48730 (0.0010) +[2023-10-14 07:05:31,506][100917] Updated weights for policy 1, policy_version 48772 (0.0007) +[2023-10-14 07:05:31,875][100917] Updated weights for policy 1, policy_version 48782 (0.0007) +[2023-10-14 07:05:32,244][100917] Updated weights for policy 1, policy_version 48792 (0.0007) +[2023-10-14 07:05:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 99876864. Throughput: 0: 1648.8, 1: 1648.2. Samples: 24977106. Policy #0 lag: (min: 26.0, avg: 34.5, max: 58.0) +[2023-10-14 07:05:33,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 07:05:34,014][100936] Updated weights for policy 0, policy_version 48740 (0.0009) +[2023-10-14 07:05:34,379][100936] Updated weights for policy 0, policy_version 48750 (0.0009) +[2023-10-14 07:05:34,749][100936] Updated weights for policy 0, policy_version 48760 (0.0009) +[2023-10-14 07:05:36,312][100917] Updated weights for policy 1, policy_version 48802 (0.0009) +[2023-10-14 07:05:36,671][100917] Updated weights for policy 1, policy_version 48812 (0.0011) +[2023-10-14 07:05:37,042][100917] Updated weights for policy 1, policy_version 48822 (0.0011) +[2023-10-14 07:05:37,418][100917] Updated weights for policy 1, policy_version 48832 (0.0008) +[2023-10-14 07:05:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 99942400. Throughput: 0: 1642.5, 1: 1655.2. Samples: 24996688. Policy #0 lag: (min: 26.0, avg: 34.5, max: 58.0) +[2023-10-14 07:05:38,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 07:05:38,527][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000048768_49938432.pth... +[2023-10-14 07:05:38,527][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000048832_50003968.pth... +[2023-10-14 07:05:38,563][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000047264_48398336.pth +[2023-10-14 07:05:38,563][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000047232_48365568.pth +[2023-10-14 07:05:38,993][100936] Updated weights for policy 0, policy_version 48770 (0.0010) +[2023-10-14 07:05:39,357][100936] Updated weights for policy 0, policy_version 48780 (0.0011) +[2023-10-14 07:05:39,735][100936] Updated weights for policy 0, policy_version 48790 (0.0011) +[2023-10-14 07:05:40,097][100936] Updated weights for policy 0, policy_version 48800 (0.0011) +[2023-10-14 07:05:41,620][100917] Updated weights for policy 1, policy_version 48842 (0.0009) +[2023-10-14 07:05:41,998][100917] Updated weights for policy 1, policy_version 48852 (0.0009) +[2023-10-14 07:05:42,380][100917] Updated weights for policy 1, policy_version 48862 (0.0010) +[2023-10-14 07:05:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 100007936. Throughput: 0: 1637.5, 1: 1657.1. Samples: 25006778. Policy #0 lag: (min: 26.0, avg: 34.5, max: 58.0) +[2023-10-14 07:05:43,512][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 07:05:44,393][100936] Updated weights for policy 0, policy_version 48810 (0.0007) +[2023-10-14 07:05:44,773][100936] Updated weights for policy 0, policy_version 48820 (0.0008) +[2023-10-14 07:05:45,151][100936] Updated weights for policy 0, policy_version 48830 (0.0010) +[2023-10-14 07:05:46,458][100917] Updated weights for policy 1, policy_version 48872 (0.0008) +[2023-10-14 07:05:46,843][100917] Updated weights for policy 1, policy_version 48882 (0.0010) +[2023-10-14 07:05:47,221][100917] Updated weights for policy 1, policy_version 48892 (0.0010) +[2023-10-14 07:05:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 100073472. Throughput: 0: 1636.4, 1: 1639.9. Samples: 25026104. Policy #0 lag: (min: 31.0, avg: 34.4, max: 63.0) +[2023-10-14 07:05:48,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 07:05:49,308][100936] Updated weights for policy 0, policy_version 48840 (0.0009) +[2023-10-14 07:05:49,681][100936] Updated weights for policy 0, policy_version 48850 (0.0007) +[2023-10-14 07:05:50,044][100936] Updated weights for policy 0, policy_version 48860 (0.0008) +[2023-10-14 07:05:51,365][100917] Updated weights for policy 1, policy_version 48902 (0.0010) +[2023-10-14 07:05:51,735][100917] Updated weights for policy 1, policy_version 48912 (0.0008) +[2023-10-14 07:05:52,103][100917] Updated weights for policy 1, policy_version 48922 (0.0007) +[2023-10-14 07:05:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 100139008. Throughput: 0: 1641.5, 1: 1662.8. Samples: 25046286. Policy #0 lag: (min: 31.0, avg: 34.4, max: 63.0) +[2023-10-14 07:05:53,512][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 07:05:54,071][100936] Updated weights for policy 0, policy_version 48870 (0.0009) +[2023-10-14 07:05:54,441][100936] Updated weights for policy 0, policy_version 48880 (0.0009) +[2023-10-14 07:05:54,811][100936] Updated weights for policy 0, policy_version 48890 (0.0010) +[2023-10-14 07:05:56,222][100917] Updated weights for policy 1, policy_version 48932 (0.0007) +[2023-10-14 07:05:56,599][100917] Updated weights for policy 1, policy_version 48942 (0.0008) +[2023-10-14 07:05:56,971][100917] Updated weights for policy 1, policy_version 48952 (0.0009) +[2023-10-14 07:05:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 100204544. Throughput: 0: 1638.0, 1: 1660.7. Samples: 25056338. Policy #0 lag: (min: 31.0, avg: 34.4, max: 63.0) +[2023-10-14 07:05:58,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 07:05:58,978][100936] Updated weights for policy 0, policy_version 48900 (0.0008) +[2023-10-14 07:05:59,343][100936] Updated weights for policy 0, policy_version 48910 (0.0009) +[2023-10-14 07:05:59,713][100936] Updated weights for policy 0, policy_version 48920 (0.0009) +[2023-10-14 07:06:01,215][100917] Updated weights for policy 1, policy_version 48962 (0.0010) +[2023-10-14 07:06:01,595][100917] Updated weights for policy 1, policy_version 48972 (0.0010) +[2023-10-14 07:06:01,977][100917] Updated weights for policy 1, policy_version 48982 (0.0007) +[2023-10-14 07:06:02,359][100917] Updated weights for policy 1, policy_version 48992 (0.0007) +[2023-10-14 07:06:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 100270080. Throughput: 0: 1646.0, 1: 1645.3. Samples: 25076026. Policy #0 lag: (min: 31.0, avg: 34.4, max: 63.0) +[2023-10-14 07:06:03,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 07:06:03,830][100936] Updated weights for policy 0, policy_version 48930 (0.0007) +[2023-10-14 07:06:04,199][100936] Updated weights for policy 0, policy_version 48940 (0.0007) +[2023-10-14 07:06:04,572][100936] Updated weights for policy 0, policy_version 48950 (0.0008) +[2023-10-14 07:06:04,940][100936] Updated weights for policy 0, policy_version 48960 (0.0008) +[2023-10-14 07:06:06,513][100917] Updated weights for policy 1, policy_version 49002 (0.0009) +[2023-10-14 07:06:06,876][100917] Updated weights for policy 1, policy_version 49012 (0.0007) +[2023-10-14 07:06:07,252][100917] Updated weights for policy 1, policy_version 49022 (0.0007) +[2023-10-14 07:06:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 100335616. Throughput: 0: 1644.5, 1: 1662.8. Samples: 25096080. Policy #0 lag: (min: 31.0, avg: 34.4, max: 63.0) +[2023-10-14 07:06:08,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 07:06:09,116][100936] Updated weights for policy 0, policy_version 48970 (0.0008) +[2023-10-14 07:06:09,490][100936] Updated weights for policy 0, policy_version 48980 (0.0009) +[2023-10-14 07:06:09,857][100936] Updated weights for policy 0, policy_version 48990 (0.0010) +[2023-10-14 07:06:11,309][100917] Updated weights for policy 1, policy_version 49032 (0.0010) +[2023-10-14 07:06:11,688][100917] Updated weights for policy 1, policy_version 49042 (0.0009) +[2023-10-14 07:06:12,066][100917] Updated weights for policy 1, policy_version 49052 (0.0007) +[2023-10-14 07:06:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 100401152. Throughput: 0: 1646.2, 1: 1660.1. Samples: 25106302. Policy #0 lag: (min: 31.0, avg: 34.4, max: 63.0) +[2023-10-14 07:06:13,512][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 07:06:13,876][100936] Updated weights for policy 0, policy_version 49000 (0.0010) +[2023-10-14 07:06:14,246][100936] Updated weights for policy 0, policy_version 49010 (0.0010) +[2023-10-14 07:06:14,615][100936] Updated weights for policy 0, policy_version 49020 (0.0007) +[2023-10-14 07:06:16,171][100917] Updated weights for policy 1, policy_version 49062 (0.0009) +[2023-10-14 07:06:16,551][100917] Updated weights for policy 1, policy_version 49072 (0.0008) +[2023-10-14 07:06:16,938][100917] Updated weights for policy 1, policy_version 49082 (0.0010) +[2023-10-14 07:06:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 100466688. Throughput: 0: 1649.9, 1: 1651.6. Samples: 25125672. Policy #0 lag: (min: 31.0, avg: 34.4, max: 63.0) +[2023-10-14 07:06:18,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 07:06:18,801][100936] Updated weights for policy 0, policy_version 49030 (0.0009) +[2023-10-14 07:06:19,168][100936] Updated weights for policy 0, policy_version 49040 (0.0009) +[2023-10-14 07:06:19,536][100936] Updated weights for policy 0, policy_version 49050 (0.0007) +[2023-10-14 07:06:21,038][100917] Updated weights for policy 1, policy_version 49092 (0.0008) +[2023-10-14 07:06:21,410][100917] Updated weights for policy 1, policy_version 49102 (0.0007) +[2023-10-14 07:06:21,784][100917] Updated weights for policy 1, policy_version 49112 (0.0010) +[2023-10-14 07:06:23,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 100532224. Throughput: 0: 1650.0, 1: 1662.1. Samples: 25145734. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) +[2023-10-14 07:06:23,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 07:06:23,589][100936] Updated weights for policy 0, policy_version 49060 (0.0007) +[2023-10-14 07:06:23,961][100936] Updated weights for policy 0, policy_version 49070 (0.0008) +[2023-10-14 07:06:24,328][100936] Updated weights for policy 0, policy_version 49080 (0.0008) +[2023-10-14 07:06:25,880][100917] Updated weights for policy 1, policy_version 49122 (0.0008) +[2023-10-14 07:06:26,238][100917] Updated weights for policy 1, policy_version 49132 (0.0009) +[2023-10-14 07:06:26,616][100917] Updated weights for policy 1, policy_version 49142 (0.0010) +[2023-10-14 07:06:26,991][100917] Updated weights for policy 1, policy_version 49152 (0.0009) +[2023-10-14 07:06:28,491][100936] Updated weights for policy 0, policy_version 49090 (0.0009) +[2023-10-14 07:06:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 100597760. Throughput: 0: 1656.0, 1: 1656.4. Samples: 25155838. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) +[2023-10-14 07:06:28,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 07:06:28,872][100936] Updated weights for policy 0, policy_version 49100 (0.0009) +[2023-10-14 07:06:29,234][100936] Updated weights for policy 0, policy_version 49110 (0.0008) +[2023-10-14 07:06:29,604][100936] Updated weights for policy 0, policy_version 49120 (0.0008) +[2023-10-14 07:06:31,112][100917] Updated weights for policy 1, policy_version 49162 (0.0007) +[2023-10-14 07:06:31,473][100917] Updated weights for policy 1, policy_version 49172 (0.0008) +[2023-10-14 07:06:31,845][100917] Updated weights for policy 1, policy_version 49182 (0.0009) +[2023-10-14 07:06:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 100663296. Throughput: 0: 1657.1, 1: 1655.6. Samples: 25175174. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) +[2023-10-14 07:06:33,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 07:06:33,898][100936] Updated weights for policy 0, policy_version 49130 (0.0009) +[2023-10-14 07:06:34,277][100936] Updated weights for policy 0, policy_version 49140 (0.0009) +[2023-10-14 07:06:34,644][100936] Updated weights for policy 0, policy_version 49150 (0.0008) +[2023-10-14 07:06:36,109][100917] Updated weights for policy 1, policy_version 49192 (0.0008) +[2023-10-14 07:06:36,497][100917] Updated weights for policy 1, policy_version 49202 (0.0008) +[2023-10-14 07:06:36,862][100917] Updated weights for policy 1, policy_version 49212 (0.0009) +[2023-10-14 07:06:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 100728832. Throughput: 0: 1651.7, 1: 1659.4. Samples: 25195288. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) +[2023-10-14 07:06:38,514][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 07:06:38,870][100936] Updated weights for policy 0, policy_version 49160 (0.0007) +[2023-10-14 07:06:39,246][100936] Updated weights for policy 0, policy_version 49170 (0.0009) +[2023-10-14 07:06:39,625][100936] Updated weights for policy 0, policy_version 49180 (0.0009) +[2023-10-14 07:06:41,014][100917] Updated weights for policy 1, policy_version 49222 (0.0010) +[2023-10-14 07:06:41,395][100917] Updated weights for policy 1, policy_version 49232 (0.0009) +[2023-10-14 07:06:41,764][100917] Updated weights for policy 1, policy_version 49242 (0.0010) +[2023-10-14 07:06:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 100794368. Throughput: 0: 1655.1, 1: 1657.8. Samples: 25205418. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) +[2023-10-14 07:06:43,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 07:06:43,846][100936] Updated weights for policy 0, policy_version 49190 (0.0010) +[2023-10-14 07:06:44,219][100936] Updated weights for policy 0, policy_version 49200 (0.0009) +[2023-10-14 07:06:44,586][100936] Updated weights for policy 0, policy_version 49210 (0.0008) +[2023-10-14 07:06:45,727][100917] Updated weights for policy 1, policy_version 49252 (0.0008) +[2023-10-14 07:06:46,106][100917] Updated weights for policy 1, policy_version 49262 (0.0009) +[2023-10-14 07:06:46,479][100917] Updated weights for policy 1, policy_version 49272 (0.0009) +[2023-10-14 07:06:48,512][99942] Fps is (10 sec: 13107.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 100859904. Throughput: 0: 1647.3, 1: 1659.2. Samples: 25224822. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) +[2023-10-14 07:06:48,512][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 07:06:48,649][100936] Updated weights for policy 0, policy_version 49220 (0.0009) +[2023-10-14 07:06:49,019][100936] Updated weights for policy 0, policy_version 49230 (0.0007) +[2023-10-14 07:06:49,391][100936] Updated weights for policy 0, policy_version 49240 (0.0008) +[2023-10-14 07:06:50,668][100917] Updated weights for policy 1, policy_version 49282 (0.0007) +[2023-10-14 07:06:51,033][100917] Updated weights for policy 1, policy_version 49292 (0.0009) +[2023-10-14 07:06:51,409][100917] Updated weights for policy 1, policy_version 49302 (0.0009) +[2023-10-14 07:06:51,777][100917] Updated weights for policy 1, policy_version 49312 (0.0007) +[2023-10-14 07:06:53,394][100936] Updated weights for policy 0, policy_version 49250 (0.0008) +[2023-10-14 07:06:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 100925440. Throughput: 0: 1650.4, 1: 1663.7. Samples: 25245214. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) +[2023-10-14 07:06:53,512][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 07:06:53,763][100936] Updated weights for policy 0, policy_version 49260 (0.0008) +[2023-10-14 07:06:54,147][100936] Updated weights for policy 0, policy_version 49270 (0.0010) +[2023-10-14 07:06:54,520][100936] Updated weights for policy 0, policy_version 49280 (0.0008) +[2023-10-14 07:06:55,894][100917] Updated weights for policy 1, policy_version 49322 (0.0009) +[2023-10-14 07:06:56,253][100917] Updated weights for policy 1, policy_version 49332 (0.0009) +[2023-10-14 07:06:56,620][100917] Updated weights for policy 1, policy_version 49342 (0.0009) +[2023-10-14 07:06:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 100990976. Throughput: 0: 1650.2, 1: 1657.5. Samples: 25255148. Policy #0 lag: (min: 21.0, avg: 22.0, max: 42.0) +[2023-10-14 07:06:58,512][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 07:06:58,644][100936] Updated weights for policy 0, policy_version 49290 (0.0008) +[2023-10-14 07:06:59,014][100936] Updated weights for policy 0, policy_version 49300 (0.0007) +[2023-10-14 07:06:59,390][100936] Updated weights for policy 0, policy_version 49310 (0.0007) +[2023-10-14 07:07:00,748][100917] Updated weights for policy 1, policy_version 49352 (0.0009) +[2023-10-14 07:07:01,116][100917] Updated weights for policy 1, policy_version 49362 (0.0010) +[2023-10-14 07:07:01,494][100917] Updated weights for policy 1, policy_version 49372 (0.0007) +[2023-10-14 07:07:03,464][100936] Updated weights for policy 0, policy_version 49320 (0.0007) +[2023-10-14 07:07:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 101056512. Throughput: 0: 1656.0, 1: 1665.8. Samples: 25275154. Policy #0 lag: (min: 21.0, avg: 22.0, max: 42.0) +[2023-10-14 07:07:03,513][99942] Avg episode reward: [(0, '0.870'), (1, '1.000')] +[2023-10-14 07:07:03,842][100936] Updated weights for policy 0, policy_version 49330 (0.0007) +[2023-10-14 07:07:04,220][100936] Updated weights for policy 0, policy_version 49340 (0.0008) +[2023-10-14 07:07:05,547][100917] Updated weights for policy 1, policy_version 49382 (0.0007) +[2023-10-14 07:07:05,927][100917] Updated weights for policy 1, policy_version 49392 (0.0009) +[2023-10-14 07:07:06,301][100917] Updated weights for policy 1, policy_version 49402 (0.0007) +[2023-10-14 07:07:08,180][100936] Updated weights for policy 0, policy_version 49350 (0.0008) +[2023-10-14 07:07:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 101122048. Throughput: 0: 1649.8, 1: 1671.0. Samples: 25295168. Policy #0 lag: (min: 21.0, avg: 22.0, max: 42.0) +[2023-10-14 07:07:08,513][99942] Avg episode reward: [(0, '0.870'), (1, '1.000')] +[2023-10-14 07:07:08,549][100936] Updated weights for policy 0, policy_version 49360 (0.0012) +[2023-10-14 07:07:08,928][100936] Updated weights for policy 0, policy_version 49370 (0.0008) +[2023-10-14 07:07:10,238][100917] Updated weights for policy 1, policy_version 49412 (0.0008) +[2023-10-14 07:07:10,605][100917] Updated weights for policy 1, policy_version 49422 (0.0009) +[2023-10-14 07:07:10,972][100917] Updated weights for policy 1, policy_version 49432 (0.0008) +[2023-10-14 07:07:13,252][100936] Updated weights for policy 0, policy_version 49380 (0.0008) +[2023-10-14 07:07:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 101187584. Throughput: 0: 1658.3, 1: 1654.6. Samples: 25304916. Policy #0 lag: (min: 21.0, avg: 22.0, max: 42.0) +[2023-10-14 07:07:13,513][99942] Avg episode reward: [(0, '0.870'), (1, '1.000')] +[2023-10-14 07:07:13,614][100936] Updated weights for policy 0, policy_version 49390 (0.0009) +[2023-10-14 07:07:13,983][100936] Updated weights for policy 0, policy_version 49400 (0.0010) +[2023-10-14 07:07:15,180][100917] Updated weights for policy 1, policy_version 49442 (0.0009) +[2023-10-14 07:07:15,558][100917] Updated weights for policy 1, policy_version 49452 (0.0008) +[2023-10-14 07:07:15,931][100917] Updated weights for policy 1, policy_version 49462 (0.0008) +[2023-10-14 07:07:16,296][100917] Updated weights for policy 1, policy_version 49472 (0.0009) +[2023-10-14 07:07:18,192][100936] Updated weights for policy 0, policy_version 49410 (0.0010) +[2023-10-14 07:07:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 101253120. Throughput: 0: 1657.2, 1: 1670.8. Samples: 25324936. Policy #0 lag: (min: 21.0, avg: 22.0, max: 42.0) +[2023-10-14 07:07:18,513][99942] Avg episode reward: [(0, '0.870'), (1, '1.000')] +[2023-10-14 07:07:18,556][100936] Updated weights for policy 0, policy_version 49420 (0.0009) +[2023-10-14 07:07:18,925][100936] Updated weights for policy 0, policy_version 49430 (0.0007) +[2023-10-14 07:07:19,296][100936] Updated weights for policy 0, policy_version 49440 (0.0008) +[2023-10-14 07:07:20,256][100917] Updated weights for policy 1, policy_version 49482 (0.0010) +[2023-10-14 07:07:20,642][100917] Updated weights for policy 1, policy_version 49492 (0.0007) +[2023-10-14 07:07:21,015][100917] Updated weights for policy 1, policy_version 49502 (0.0007) +[2023-10-14 07:07:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 101318656. Throughput: 0: 1654.2, 1: 1681.5. Samples: 25345392. Policy #0 lag: (min: 21.0, avg: 22.0, max: 42.0) +[2023-10-14 07:07:23,513][99942] Avg episode reward: [(0, '0.870'), (1, '1.000')] +[2023-10-14 07:07:23,528][100936] Updated weights for policy 0, policy_version 49450 (0.0007) +[2023-10-14 07:07:23,904][100936] Updated weights for policy 0, policy_version 49460 (0.0008) +[2023-10-14 07:07:24,269][100936] Updated weights for policy 0, policy_version 49470 (0.0009) +[2023-10-14 07:07:25,048][100917] Updated weights for policy 1, policy_version 49512 (0.0008) +[2023-10-14 07:07:25,431][100917] Updated weights for policy 1, policy_version 49522 (0.0008) +[2023-10-14 07:07:25,797][100917] Updated weights for policy 1, policy_version 49532 (0.0007) +[2023-10-14 07:07:28,339][100936] Updated weights for policy 0, policy_version 49480 (0.0010) +[2023-10-14 07:07:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 101384192. Throughput: 0: 1663.4, 1: 1655.2. Samples: 25354756. Policy #0 lag: (min: 21.0, avg: 22.0, max: 42.0) +[2023-10-14 07:07:28,513][99942] Avg episode reward: [(0, '0.870'), (1, '1.000')] +[2023-10-14 07:07:28,709][100936] Updated weights for policy 0, policy_version 49490 (0.0010) +[2023-10-14 07:07:29,098][100936] Updated weights for policy 0, policy_version 49500 (0.0011) +[2023-10-14 07:07:29,894][100917] Updated weights for policy 1, policy_version 49542 (0.0011) +[2023-10-14 07:07:30,270][100917] Updated weights for policy 1, policy_version 49552 (0.0007) +[2023-10-14 07:07:30,650][100917] Updated weights for policy 1, policy_version 49562 (0.0008) +[2023-10-14 07:07:33,170][100936] Updated weights for policy 0, policy_version 49510 (0.0008) +[2023-10-14 07:07:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 101449728. Throughput: 0: 1661.8, 1: 1671.0. Samples: 25374800. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:07:33,512][99942] Avg episode reward: [(0, '0.870'), (1, '1.000')] +[2023-10-14 07:07:33,535][100936] Updated weights for policy 0, policy_version 49520 (0.0007) +[2023-10-14 07:07:33,902][100936] Updated weights for policy 0, policy_version 49530 (0.0008) +[2023-10-14 07:07:34,742][100917] Updated weights for policy 1, policy_version 49572 (0.0008) +[2023-10-14 07:07:35,116][100917] Updated weights for policy 1, policy_version 49582 (0.0008) +[2023-10-14 07:07:35,487][100917] Updated weights for policy 1, policy_version 49592 (0.0010) +[2023-10-14 07:07:38,174][100936] Updated weights for policy 0, policy_version 49540 (0.0008) +[2023-10-14 07:07:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 101515264. Throughput: 0: 1650.7, 1: 1674.9. Samples: 25394868. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:07:38,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:07:38,525][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000049600_50790400.pth... +[2023-10-14 07:07:38,545][100936] Updated weights for policy 0, policy_version 49550 (0.0008) +[2023-10-14 07:07:38,560][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000048064_49217536.pth +[2023-10-14 07:07:38,911][100936] Updated weights for policy 0, policy_version 49560 (0.0012) +[2023-10-14 07:07:39,202][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000049568_50757632.pth... +[2023-10-14 07:07:39,231][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000048000_49152000.pth +[2023-10-14 07:07:39,607][100917] Updated weights for policy 1, policy_version 49602 (0.0008) +[2023-10-14 07:07:39,975][100917] Updated weights for policy 1, policy_version 49612 (0.0010) +[2023-10-14 07:07:40,348][100917] Updated weights for policy 1, policy_version 49622 (0.0009) +[2023-10-14 07:07:40,717][100917] Updated weights for policy 1, policy_version 49632 (0.0009) +[2023-10-14 07:07:42,949][100936] Updated weights for policy 0, policy_version 49570 (0.0009) +[2023-10-14 07:07:43,329][100936] Updated weights for policy 0, policy_version 49580 (0.0008) +[2023-10-14 07:07:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 101580800. Throughput: 0: 1657.5, 1: 1650.9. Samples: 25404026. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:07:43,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:07:43,707][100936] Updated weights for policy 0, policy_version 49590 (0.0009) +[2023-10-14 07:07:44,080][100936] Updated weights for policy 0, policy_version 49600 (0.0009) +[2023-10-14 07:07:44,880][100917] Updated weights for policy 1, policy_version 49642 (0.0008) +[2023-10-14 07:07:45,250][100917] Updated weights for policy 1, policy_version 49652 (0.0010) +[2023-10-14 07:07:45,618][100917] Updated weights for policy 1, policy_version 49662 (0.0009) +[2023-10-14 07:07:48,221][100936] Updated weights for policy 0, policy_version 49610 (0.0009) +[2023-10-14 07:07:48,512][99942] Fps is (10 sec: 13107.7, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 101646336. Throughput: 0: 1655.6, 1: 1664.1. Samples: 25424538. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:07:48,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:07:48,586][100936] Updated weights for policy 0, policy_version 49620 (0.0009) +[2023-10-14 07:07:48,962][100936] Updated weights for policy 0, policy_version 49630 (0.0008) +[2023-10-14 07:07:49,910][100917] Updated weights for policy 1, policy_version 49672 (0.0009) +[2023-10-14 07:07:50,275][100917] Updated weights for policy 1, policy_version 49682 (0.0011) +[2023-10-14 07:07:50,656][100917] Updated weights for policy 1, policy_version 49692 (0.0009) +[2023-10-14 07:07:52,960][100936] Updated weights for policy 0, policy_version 49640 (0.0011) +[2023-10-14 07:07:53,328][100936] Updated weights for policy 0, policy_version 49650 (0.0010) +[2023-10-14 07:07:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 101711872. Throughput: 0: 1649.7, 1: 1659.3. Samples: 25444074. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:07:53,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:07:53,693][100936] Updated weights for policy 0, policy_version 49660 (0.0011) +[2023-10-14 07:07:54,638][100917] Updated weights for policy 1, policy_version 49702 (0.0009) +[2023-10-14 07:07:55,010][100917] Updated weights for policy 1, policy_version 49712 (0.0009) +[2023-10-14 07:07:55,383][100917] Updated weights for policy 1, policy_version 49722 (0.0010) +[2023-10-14 07:07:57,723][100936] Updated weights for policy 0, policy_version 49670 (0.0009) +[2023-10-14 07:07:58,092][100936] Updated weights for policy 0, policy_version 49680 (0.0007) +[2023-10-14 07:07:58,459][100936] Updated weights for policy 0, policy_version 49690 (0.0007) +[2023-10-14 07:07:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 101777408. Throughput: 0: 1654.1, 1: 1653.0. Samples: 25453738. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:07:58,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 07:07:59,363][100917] Updated weights for policy 1, policy_version 49732 (0.0010) +[2023-10-14 07:07:59,737][100917] Updated weights for policy 1, policy_version 49742 (0.0008) +[2023-10-14 07:08:00,097][100917] Updated weights for policy 1, policy_version 49752 (0.0008) +[2023-10-14 07:08:02,817][100936] Updated weights for policy 0, policy_version 49700 (0.0007) +[2023-10-14 07:08:03,193][100936] Updated weights for policy 0, policy_version 49710 (0.0007) +[2023-10-14 07:08:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 101842944. Throughput: 0: 1659.0, 1: 1663.9. Samples: 25474464. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:08:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 07:08:03,564][100936] Updated weights for policy 0, policy_version 49720 (0.0008) +[2023-10-14 07:08:04,190][100917] Updated weights for policy 1, policy_version 49762 (0.0008) +[2023-10-14 07:08:04,560][100917] Updated weights for policy 1, policy_version 49772 (0.0007) +[2023-10-14 07:08:04,932][100917] Updated weights for policy 1, policy_version 49782 (0.0010) +[2023-10-14 07:08:05,308][100917] Updated weights for policy 1, policy_version 49792 (0.0008) +[2023-10-14 07:08:07,687][100936] Updated weights for policy 0, policy_version 49730 (0.0009) +[2023-10-14 07:08:08,052][100936] Updated weights for policy 0, policy_version 49740 (0.0007) +[2023-10-14 07:08:08,420][100936] Updated weights for policy 0, policy_version 49750 (0.0007) +[2023-10-14 07:08:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 101908480. Throughput: 0: 1646.3, 1: 1663.3. Samples: 25494324. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 07:08:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 07:08:08,790][100936] Updated weights for policy 0, policy_version 49760 (0.0010) +[2023-10-14 07:08:09,346][100917] Updated weights for policy 1, policy_version 49802 (0.0009) +[2023-10-14 07:08:09,726][100917] Updated weights for policy 1, policy_version 49812 (0.0007) +[2023-10-14 07:08:10,094][100917] Updated weights for policy 1, policy_version 49822 (0.0009) +[2023-10-14 07:08:13,080][100936] Updated weights for policy 0, policy_version 49770 (0.0007) +[2023-10-14 07:08:13,456][100936] Updated weights for policy 0, policy_version 49780 (0.0009) +[2023-10-14 07:08:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 101974016. Throughput: 0: 1655.0, 1: 1663.7. Samples: 25504100. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 07:08:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 07:08:13,840][100936] Updated weights for policy 0, policy_version 49790 (0.0008) +[2023-10-14 07:08:14,236][100917] Updated weights for policy 1, policy_version 49832 (0.0010) +[2023-10-14 07:08:14,614][100917] Updated weights for policy 1, policy_version 49842 (0.0008) +[2023-10-14 07:08:14,977][100917] Updated weights for policy 1, policy_version 49852 (0.0009) +[2023-10-14 07:08:17,720][100936] Updated weights for policy 0, policy_version 49800 (0.0008) +[2023-10-14 07:08:18,092][100936] Updated weights for policy 0, policy_version 49810 (0.0007) +[2023-10-14 07:08:18,464][100936] Updated weights for policy 0, policy_version 49820 (0.0009) +[2023-10-14 07:08:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 102039552. Throughput: 0: 1656.9, 1: 1666.4. Samples: 25524352. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 07:08:18,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 07:08:19,128][100917] Updated weights for policy 1, policy_version 49862 (0.0009) +[2023-10-14 07:08:19,498][100917] Updated weights for policy 1, policy_version 49872 (0.0008) +[2023-10-14 07:08:19,871][100917] Updated weights for policy 1, policy_version 49882 (0.0007) +[2023-10-14 07:08:22,461][100936] Updated weights for policy 0, policy_version 49830 (0.0012) +[2023-10-14 07:08:22,834][100936] Updated weights for policy 0, policy_version 49840 (0.0009) +[2023-10-14 07:08:23,205][100936] Updated weights for policy 0, policy_version 49850 (0.0008) +[2023-10-14 07:08:23,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 102137856. Throughput: 0: 1645.8, 1: 1669.2. Samples: 25544044. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 07:08:23,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 07:08:23,832][100917] Updated weights for policy 1, policy_version 49892 (0.0008) +[2023-10-14 07:08:24,203][100917] Updated weights for policy 1, policy_version 49902 (0.0007) +[2023-10-14 07:08:24,567][100917] Updated weights for policy 1, policy_version 49912 (0.0007) +[2023-10-14 07:08:27,287][100936] Updated weights for policy 0, policy_version 49860 (0.0007) +[2023-10-14 07:08:27,658][100936] Updated weights for policy 0, policy_version 49870 (0.0010) +[2023-10-14 07:08:28,038][100936] Updated weights for policy 0, policy_version 49880 (0.0009) +[2023-10-14 07:08:28,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 102203392. Throughput: 0: 1663.0, 1: 1674.4. Samples: 25554208. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 07:08:28,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 07:08:28,727][100917] Updated weights for policy 1, policy_version 49922 (0.0007) +[2023-10-14 07:08:29,098][100917] Updated weights for policy 1, policy_version 49932 (0.0008) +[2023-10-14 07:08:29,471][100917] Updated weights for policy 1, policy_version 49942 (0.0008) +[2023-10-14 07:08:29,843][100917] Updated weights for policy 1, policy_version 49952 (0.0008) +[2023-10-14 07:08:32,262][100936] Updated weights for policy 0, policy_version 49890 (0.0007) +[2023-10-14 07:08:32,629][100936] Updated weights for policy 0, policy_version 49900 (0.0008) +[2023-10-14 07:08:32,988][100936] Updated weights for policy 0, policy_version 49910 (0.0009) +[2023-10-14 07:08:33,362][100936] Updated weights for policy 0, policy_version 49920 (0.0008) +[2023-10-14 07:08:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 102268928. Throughput: 0: 1650.5, 1: 1675.1. Samples: 25574190. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 07:08:33,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 07:08:33,870][100917] Updated weights for policy 1, policy_version 49962 (0.0008) +[2023-10-14 07:08:34,246][100917] Updated weights for policy 1, policy_version 49972 (0.0010) +[2023-10-14 07:08:34,622][100917] Updated weights for policy 1, policy_version 49982 (0.0009) +[2023-10-14 07:08:37,628][100936] Updated weights for policy 0, policy_version 49930 (0.0008) +[2023-10-14 07:08:37,986][100936] Updated weights for policy 0, policy_version 49940 (0.0007) +[2023-10-14 07:08:38,363][100936] Updated weights for policy 0, policy_version 49950 (0.0008) +[2023-10-14 07:08:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.4, 300 sec: 13329.3). Total num frames: 102334464. Throughput: 0: 1645.7, 1: 1678.2. Samples: 25593650. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) +[2023-10-14 07:08:38,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 07:08:38,795][100917] Updated weights for policy 1, policy_version 49992 (0.0011) +[2023-10-14 07:08:39,173][100917] Updated weights for policy 1, policy_version 50002 (0.0008) +[2023-10-14 07:08:39,542][100917] Updated weights for policy 1, policy_version 50012 (0.0008) +[2023-10-14 07:08:42,612][100936] Updated weights for policy 0, policy_version 49960 (0.0007) +[2023-10-14 07:08:42,980][100936] Updated weights for policy 0, policy_version 49970 (0.0008) +[2023-10-14 07:08:43,350][100936] Updated weights for policy 0, policy_version 49980 (0.0009) +[2023-10-14 07:08:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 102400000. Throughput: 0: 1655.1, 1: 1678.6. Samples: 25603754. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) +[2023-10-14 07:08:43,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 07:08:43,559][100917] Updated weights for policy 1, policy_version 50022 (0.0008) +[2023-10-14 07:08:43,925][100917] Updated weights for policy 1, policy_version 50032 (0.0009) +[2023-10-14 07:08:44,307][100917] Updated weights for policy 1, policy_version 50042 (0.0008) +[2023-10-14 07:08:47,557][100936] Updated weights for policy 0, policy_version 49990 (0.0008) +[2023-10-14 07:08:47,924][100936] Updated weights for policy 0, policy_version 50000 (0.0008) +[2023-10-14 07:08:48,291][100936] Updated weights for policy 0, policy_version 50010 (0.0009) +[2023-10-14 07:08:48,335][100917] Updated weights for policy 1, policy_version 50052 (0.0009) +[2023-10-14 07:08:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 102465536. Throughput: 0: 1651.4, 1: 1672.8. Samples: 25624050. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) +[2023-10-14 07:08:48,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 07:08:48,711][100917] Updated weights for policy 1, policy_version 50062 (0.0008) +[2023-10-14 07:08:49,084][100917] Updated weights for policy 1, policy_version 50072 (0.0008) +[2023-10-14 07:08:52,300][100936] Updated weights for policy 0, policy_version 50020 (0.0007) +[2023-10-14 07:08:52,667][100936] Updated weights for policy 0, policy_version 50030 (0.0007) +[2023-10-14 07:08:53,031][100936] Updated weights for policy 0, policy_version 50040 (0.0007) +[2023-10-14 07:08:53,385][100917] Updated weights for policy 1, policy_version 50082 (0.0010) +[2023-10-14 07:08:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 102531072. Throughput: 0: 1640.3, 1: 1669.6. Samples: 25643270. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) +[2023-10-14 07:08:53,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 07:08:53,752][100917] Updated weights for policy 1, policy_version 50092 (0.0008) +[2023-10-14 07:08:54,124][100917] Updated weights for policy 1, policy_version 50102 (0.0009) +[2023-10-14 07:08:54,500][100917] Updated weights for policy 1, policy_version 50112 (0.0007) +[2023-10-14 07:08:57,230][100936] Updated weights for policy 0, policy_version 50050 (0.0010) +[2023-10-14 07:08:57,599][100936] Updated weights for policy 0, policy_version 50060 (0.0008) +[2023-10-14 07:08:57,969][100936] Updated weights for policy 0, policy_version 50070 (0.0009) +[2023-10-14 07:08:58,330][100936] Updated weights for policy 0, policy_version 50080 (0.0008) +[2023-10-14 07:08:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 102596608. Throughput: 0: 1650.1, 1: 1670.6. Samples: 25653532. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) +[2023-10-14 07:08:58,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 07:08:58,698][100917] Updated weights for policy 1, policy_version 50122 (0.0008) +[2023-10-14 07:08:59,079][100917] Updated weights for policy 1, policy_version 50132 (0.0007) +[2023-10-14 07:08:59,450][100917] Updated weights for policy 1, policy_version 50142 (0.0011) +[2023-10-14 07:09:02,614][100936] Updated weights for policy 0, policy_version 50090 (0.0008) +[2023-10-14 07:09:02,985][100936] Updated weights for policy 0, policy_version 50100 (0.0011) +[2023-10-14 07:09:03,357][100936] Updated weights for policy 0, policy_version 50110 (0.0009) +[2023-10-14 07:09:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 102662144. Throughput: 0: 1649.0, 1: 1668.6. Samples: 25673642. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) +[2023-10-14 07:09:03,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 07:09:03,603][100917] Updated weights for policy 1, policy_version 50152 (0.0008) +[2023-10-14 07:09:03,982][100917] Updated weights for policy 1, policy_version 50162 (0.0008) +[2023-10-14 07:09:04,367][100917] Updated weights for policy 1, policy_version 50172 (0.0007) +[2023-10-14 07:09:07,394][100936] Updated weights for policy 0, policy_version 50120 (0.0009) +[2023-10-14 07:09:07,763][100936] Updated weights for policy 0, policy_version 50130 (0.0009) +[2023-10-14 07:09:08,130][100936] Updated weights for policy 0, policy_version 50140 (0.0008) +[2023-10-14 07:09:08,371][100917] Updated weights for policy 1, policy_version 50182 (0.0008) +[2023-10-14 07:09:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 102727680. Throughput: 0: 1643.5, 1: 1665.4. Samples: 25692946. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) +[2023-10-14 07:09:08,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 07:09:08,748][100917] Updated weights for policy 1, policy_version 50192 (0.0009) +[2023-10-14 07:09:09,129][100917] Updated weights for policy 1, policy_version 50202 (0.0009) +[2023-10-14 07:09:12,355][100936] Updated weights for policy 0, policy_version 50150 (0.0009) +[2023-10-14 07:09:12,722][100936] Updated weights for policy 0, policy_version 50160 (0.0010) +[2023-10-14 07:09:13,094][100936] Updated weights for policy 0, policy_version 50170 (0.0009) +[2023-10-14 07:09:13,199][100917] Updated weights for policy 1, policy_version 50212 (0.0009) +[2023-10-14 07:09:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 102793216. Throughput: 0: 1645.9, 1: 1664.0. Samples: 25703152. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) +[2023-10-14 07:09:13,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 07:09:13,583][100917] Updated weights for policy 1, policy_version 50222 (0.0009) +[2023-10-14 07:09:13,954][100917] Updated weights for policy 1, policy_version 50232 (0.0010) +[2023-10-14 07:09:17,506][100936] Updated weights for policy 0, policy_version 50180 (0.0007) +[2023-10-14 07:09:17,868][100936] Updated weights for policy 0, policy_version 50190 (0.0007) +[2023-10-14 07:09:18,038][100917] Updated weights for policy 1, policy_version 50242 (0.0009) +[2023-10-14 07:09:18,236][100936] Updated weights for policy 0, policy_version 50200 (0.0007) +[2023-10-14 07:09:18,415][100917] Updated weights for policy 1, policy_version 50252 (0.0008) +[2023-10-14 07:09:18,512][99942] Fps is (10 sec: 9830.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 102825984. Throughput: 0: 1652.8, 1: 1663.9. Samples: 25723442. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) +[2023-10-14 07:09:18,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 07:09:18,787][100917] Updated weights for policy 1, policy_version 50262 (0.0007) +[2023-10-14 07:09:19,158][100917] Updated weights for policy 1, policy_version 50272 (0.0007) +[2023-10-14 07:09:22,341][100936] Updated weights for policy 0, policy_version 50210 (0.0008) +[2023-10-14 07:09:22,714][100936] Updated weights for policy 0, policy_version 50220 (0.0008) +[2023-10-14 07:09:23,090][100936] Updated weights for policy 0, policy_version 50230 (0.0008) +[2023-10-14 07:09:23,145][100917] Updated weights for policy 1, policy_version 50282 (0.0010) +[2023-10-14 07:09:23,452][100936] Updated weights for policy 0, policy_version 50240 (0.0009) +[2023-10-14 07:09:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 102924288. Throughput: 0: 1646.7, 1: 1667.8. Samples: 25742800. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) +[2023-10-14 07:09:23,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 07:09:23,524][100917] Updated weights for policy 1, policy_version 50292 (0.0010) +[2023-10-14 07:09:23,889][100917] Updated weights for policy 1, policy_version 50302 (0.0009) +[2023-10-14 07:09:27,539][100936] Updated weights for policy 0, policy_version 50250 (0.0012) +[2023-10-14 07:09:27,915][100936] Updated weights for policy 0, policy_version 50260 (0.0009) +[2023-10-14 07:09:28,112][100917] Updated weights for policy 1, policy_version 50312 (0.0008) +[2023-10-14 07:09:28,289][100936] Updated weights for policy 0, policy_version 50270 (0.0009) +[2023-10-14 07:09:28,481][100917] Updated weights for policy 1, policy_version 50322 (0.0007) +[2023-10-14 07:09:28,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 102989824. Throughput: 0: 1645.9, 1: 1668.3. Samples: 25752892. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) +[2023-10-14 07:09:28,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 07:09:28,861][100917] Updated weights for policy 1, policy_version 50332 (0.0010) +[2023-10-14 07:09:32,309][100936] Updated weights for policy 0, policy_version 50280 (0.0010) +[2023-10-14 07:09:32,682][100936] Updated weights for policy 0, policy_version 50290 (0.0007) +[2023-10-14 07:09:32,966][100917] Updated weights for policy 1, policy_version 50342 (0.0009) +[2023-10-14 07:09:33,060][100936] Updated weights for policy 0, policy_version 50300 (0.0008) +[2023-10-14 07:09:33,328][100917] Updated weights for policy 1, policy_version 50352 (0.0008) +[2023-10-14 07:09:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 103055360. Throughput: 0: 1638.6, 1: 1665.5. Samples: 25772734. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) +[2023-10-14 07:09:33,513][99942] Avg episode reward: [(0, '0.950'), (1, '0.990')] +[2023-10-14 07:09:33,702][100917] Updated weights for policy 1, policy_version 50362 (0.0010) +[2023-10-14 07:09:37,151][100936] Updated weights for policy 0, policy_version 50310 (0.0009) +[2023-10-14 07:09:37,516][100936] Updated weights for policy 0, policy_version 50320 (0.0010) +[2023-10-14 07:09:37,885][100936] Updated weights for policy 0, policy_version 50330 (0.0007) +[2023-10-14 07:09:37,994][100917] Updated weights for policy 1, policy_version 50372 (0.0010) +[2023-10-14 07:09:38,359][100917] Updated weights for policy 1, policy_version 50382 (0.0009) +[2023-10-14 07:09:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 103120896. Throughput: 0: 1648.3, 1: 1658.7. Samples: 25792084. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) +[2023-10-14 07:09:38,513][99942] Avg episode reward: [(0, '0.950'), (1, '0.990')] +[2023-10-14 07:09:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000050336_51544064.pth... +[2023-10-14 07:09:38,559][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000048768_49938432.pth +[2023-10-14 07:09:38,739][100917] Updated weights for policy 1, policy_version 50392 (0.0009) +[2023-10-14 07:09:39,034][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000050400_51609600.pth... +[2023-10-14 07:09:39,063][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000048832_50003968.pth +[2023-10-14 07:09:42,046][100936] Updated weights for policy 0, policy_version 50340 (0.0008) +[2023-10-14 07:09:42,421][100936] Updated weights for policy 0, policy_version 50350 (0.0010) +[2023-10-14 07:09:42,795][100936] Updated weights for policy 0, policy_version 50360 (0.0007) +[2023-10-14 07:09:42,841][100917] Updated weights for policy 1, policy_version 50402 (0.0007) +[2023-10-14 07:09:43,210][100917] Updated weights for policy 1, policy_version 50412 (0.0007) +[2023-10-14 07:09:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 103186432. Throughput: 0: 1646.0, 1: 1659.0. Samples: 25802254. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) +[2023-10-14 07:09:43,513][99942] Avg episode reward: [(0, '0.950'), (1, '0.990')] +[2023-10-14 07:09:43,589][100917] Updated weights for policy 1, policy_version 50422 (0.0008) +[2023-10-14 07:09:43,955][100917] Updated weights for policy 1, policy_version 50432 (0.0007) +[2023-10-14 07:09:47,046][100936] Updated weights for policy 0, policy_version 50370 (0.0008) +[2023-10-14 07:09:47,402][100936] Updated weights for policy 0, policy_version 50380 (0.0011) +[2023-10-14 07:09:47,776][100936] Updated weights for policy 0, policy_version 50390 (0.0009) +[2023-10-14 07:09:48,051][100917] Updated weights for policy 1, policy_version 50442 (0.0009) +[2023-10-14 07:09:48,143][100936] Updated weights for policy 0, policy_version 50400 (0.0007) +[2023-10-14 07:09:48,425][100917] Updated weights for policy 1, policy_version 50452 (0.0010) +[2023-10-14 07:09:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 103251968. Throughput: 0: 1635.0, 1: 1664.6. Samples: 25822122. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:09:48,513][99942] Avg episode reward: [(0, '0.950'), (1, '0.990')] +[2023-10-14 07:09:48,794][100917] Updated weights for policy 1, policy_version 50462 (0.0007) +[2023-10-14 07:09:52,478][100936] Updated weights for policy 0, policy_version 50410 (0.0010) +[2023-10-14 07:09:52,852][100936] Updated weights for policy 0, policy_version 50420 (0.0009) +[2023-10-14 07:09:53,081][100917] Updated weights for policy 1, policy_version 50472 (0.0008) +[2023-10-14 07:09:53,222][100936] Updated weights for policy 0, policy_version 50430 (0.0008) +[2023-10-14 07:09:53,453][100917] Updated weights for policy 1, policy_version 50482 (0.0008) +[2023-10-14 07:09:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 103317504. Throughput: 0: 1637.2, 1: 1656.3. Samples: 25841154. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:09:53,513][99942] Avg episode reward: [(0, '0.950'), (1, '0.990')] +[2023-10-14 07:09:53,824][100917] Updated weights for policy 1, policy_version 50492 (0.0007) +[2023-10-14 07:09:57,230][100936] Updated weights for policy 0, policy_version 50440 (0.0008) +[2023-10-14 07:09:57,596][100936] Updated weights for policy 0, policy_version 50450 (0.0010) +[2023-10-14 07:09:57,787][100917] Updated weights for policy 1, policy_version 50502 (0.0009) +[2023-10-14 07:09:57,959][100936] Updated weights for policy 0, policy_version 50460 (0.0008) +[2023-10-14 07:09:58,154][100917] Updated weights for policy 1, policy_version 50512 (0.0008) +[2023-10-14 07:09:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 103383040. Throughput: 0: 1641.6, 1: 1659.9. Samples: 25851718. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:09:58,513][99942] Avg episode reward: [(0, '0.950'), (1, '0.990')] +[2023-10-14 07:09:58,525][100917] Updated weights for policy 1, policy_version 50522 (0.0007) +[2023-10-14 07:10:02,087][100936] Updated weights for policy 0, policy_version 50470 (0.0007) +[2023-10-14 07:10:02,447][100936] Updated weights for policy 0, policy_version 50480 (0.0007) +[2023-10-14 07:10:02,810][100936] Updated weights for policy 0, policy_version 50490 (0.0009) +[2023-10-14 07:10:02,818][100917] Updated weights for policy 1, policy_version 50532 (0.0008) +[2023-10-14 07:10:03,187][100917] Updated weights for policy 1, policy_version 50542 (0.0007) +[2023-10-14 07:10:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 103448576. Throughput: 0: 1634.0, 1: 1656.9. Samples: 25871534. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:10:03,512][99942] Avg episode reward: [(0, '0.950'), (1, '0.990')] +[2023-10-14 07:10:03,561][100917] Updated weights for policy 1, policy_version 50552 (0.0009) +[2023-10-14 07:10:07,030][100936] Updated weights for policy 0, policy_version 50500 (0.0008) +[2023-10-14 07:10:07,398][100936] Updated weights for policy 0, policy_version 50510 (0.0008) +[2023-10-14 07:10:07,771][100936] Updated weights for policy 0, policy_version 50520 (0.0007) +[2023-10-14 07:10:07,815][100917] Updated weights for policy 1, policy_version 50562 (0.0008) +[2023-10-14 07:10:08,184][100917] Updated weights for policy 1, policy_version 50572 (0.0009) +[2023-10-14 07:10:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 103514112. Throughput: 0: 1643.6, 1: 1645.6. Samples: 25890812. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:10:08,513][99942] Avg episode reward: [(0, '0.950'), (1, '0.990')] +[2023-10-14 07:10:08,555][100917] Updated weights for policy 1, policy_version 50582 (0.0007) +[2023-10-14 07:10:08,926][100917] Updated weights for policy 1, policy_version 50592 (0.0008) +[2023-10-14 07:10:11,780][100936] Updated weights for policy 0, policy_version 50530 (0.0009) +[2023-10-14 07:10:12,156][100936] Updated weights for policy 0, policy_version 50540 (0.0007) +[2023-10-14 07:10:12,525][100936] Updated weights for policy 0, policy_version 50550 (0.0009) +[2023-10-14 07:10:12,874][100917] Updated weights for policy 1, policy_version 50602 (0.0008) +[2023-10-14 07:10:12,892][100936] Updated weights for policy 0, policy_version 50560 (0.0008) +[2023-10-14 07:10:13,249][100917] Updated weights for policy 1, policy_version 50612 (0.0009) +[2023-10-14 07:10:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 103579648. Throughput: 0: 1651.2, 1: 1647.5. Samples: 25901334. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:10:13,513][99942] Avg episode reward: [(0, '0.960'), (1, '0.990')] +[2023-10-14 07:10:13,628][100917] Updated weights for policy 1, policy_version 50622 (0.0010) +[2023-10-14 07:10:17,128][100936] Updated weights for policy 0, policy_version 50570 (0.0009) +[2023-10-14 07:10:17,491][100936] Updated weights for policy 0, policy_version 50580 (0.0007) +[2023-10-14 07:10:17,805][100917] Updated weights for policy 1, policy_version 50632 (0.0008) +[2023-10-14 07:10:17,855][100936] Updated weights for policy 0, policy_version 50590 (0.0010) +[2023-10-14 07:10:18,176][100917] Updated weights for policy 1, policy_version 50642 (0.0007) +[2023-10-14 07:10:18,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 103645184. Throughput: 0: 1643.5, 1: 1650.7. Samples: 25920974. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:10:18,513][99942] Avg episode reward: [(0, '0.960'), (1, '0.990')] +[2023-10-14 07:10:18,549][100917] Updated weights for policy 1, policy_version 50652 (0.0007) +[2023-10-14 07:10:21,982][100936] Updated weights for policy 0, policy_version 50600 (0.0007) +[2023-10-14 07:10:22,354][100936] Updated weights for policy 0, policy_version 50610 (0.0008) +[2023-10-14 07:10:22,720][100936] Updated weights for policy 0, policy_version 50620 (0.0009) +[2023-10-14 07:10:22,767][100917] Updated weights for policy 1, policy_version 50662 (0.0009) +[2023-10-14 07:10:23,145][100917] Updated weights for policy 1, policy_version 50672 (0.0007) +[2023-10-14 07:10:23,509][100917] Updated weights for policy 1, policy_version 50682 (0.0007) +[2023-10-14 07:10:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 103710720. Throughput: 0: 1648.7, 1: 1643.2. Samples: 25940216. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:10:23,513][99942] Avg episode reward: [(0, '0.960'), (1, '0.990')] +[2023-10-14 07:10:26,778][100936] Updated weights for policy 0, policy_version 50630 (0.0009) +[2023-10-14 07:10:27,139][100936] Updated weights for policy 0, policy_version 50640 (0.0008) +[2023-10-14 07:10:27,487][100917] Updated weights for policy 1, policy_version 50692 (0.0008) +[2023-10-14 07:10:27,504][100936] Updated weights for policy 0, policy_version 50650 (0.0007) +[2023-10-14 07:10:27,863][100917] Updated weights for policy 1, policy_version 50702 (0.0010) +[2023-10-14 07:10:28,228][100917] Updated weights for policy 1, policy_version 50712 (0.0009) +[2023-10-14 07:10:28,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 103776256. Throughput: 0: 1654.3, 1: 1650.0. Samples: 25950950. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:10:28,512][99942] Avg episode reward: [(0, '0.960'), (1, '0.990')] +[2023-10-14 07:10:31,640][100936] Updated weights for policy 0, policy_version 50660 (0.0008) +[2023-10-14 07:10:32,004][100936] Updated weights for policy 0, policy_version 50670 (0.0008) +[2023-10-14 07:10:32,286][100917] Updated weights for policy 1, policy_version 50722 (0.0008) +[2023-10-14 07:10:32,364][100936] Updated weights for policy 0, policy_version 50680 (0.0007) +[2023-10-14 07:10:32,652][100917] Updated weights for policy 1, policy_version 50732 (0.0009) +[2023-10-14 07:10:33,028][100917] Updated weights for policy 1, policy_version 50742 (0.0009) +[2023-10-14 07:10:33,404][100917] Updated weights for policy 1, policy_version 50752 (0.0007) +[2023-10-14 07:10:33,512][99942] Fps is (10 sec: 16384.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 103874560. Throughput: 0: 1651.4, 1: 1649.8. Samples: 25970674. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:10:33,513][99942] Avg episode reward: [(0, '0.960'), (1, '0.990')] +[2023-10-14 07:10:36,723][100936] Updated weights for policy 0, policy_version 50690 (0.0009) +[2023-10-14 07:10:37,125][100936] Updated weights for policy 0, policy_version 50700 (0.0011) +[2023-10-14 07:10:37,488][100936] Updated weights for policy 0, policy_version 50710 (0.0010) +[2023-10-14 07:10:37,662][100917] Updated weights for policy 1, policy_version 50762 (0.0009) +[2023-10-14 07:10:37,859][100936] Updated weights for policy 0, policy_version 50720 (0.0009) +[2023-10-14 07:10:38,026][100917] Updated weights for policy 1, policy_version 50772 (0.0008) +[2023-10-14 07:10:38,395][100917] Updated weights for policy 1, policy_version 50782 (0.0007) +[2023-10-14 07:10:38,512][99942] Fps is (10 sec: 16383.7, 60 sec: 13653.4, 300 sec: 13329.3). Total num frames: 103940096. Throughput: 0: 1660.8, 1: 1634.9. Samples: 25989462. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:10:38,513][99942] Avg episode reward: [(0, '0.960'), (1, '0.990')] +[2023-10-14 07:10:42,095][100936] Updated weights for policy 0, policy_version 50730 (0.0010) +[2023-10-14 07:10:42,469][100936] Updated weights for policy 0, policy_version 50740 (0.0008) +[2023-10-14 07:10:42,597][100917] Updated weights for policy 1, policy_version 50792 (0.0008) +[2023-10-14 07:10:42,835][100936] Updated weights for policy 0, policy_version 50750 (0.0007) +[2023-10-14 07:10:42,970][100917] Updated weights for policy 1, policy_version 50802 (0.0009) +[2023-10-14 07:10:43,346][100917] Updated weights for policy 1, policy_version 50812 (0.0011) +[2023-10-14 07:10:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 104005632. Throughput: 0: 1653.2, 1: 1647.8. Samples: 26000262. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:10:43,513][99942] Avg episode reward: [(0, '0.960'), (1, '0.990')] +[2023-10-14 07:10:46,972][100936] Updated weights for policy 0, policy_version 50760 (0.0007) +[2023-10-14 07:10:47,343][100936] Updated weights for policy 0, policy_version 50770 (0.0007) +[2023-10-14 07:10:47,649][100917] Updated weights for policy 1, policy_version 50822 (0.0010) +[2023-10-14 07:10:47,715][100936] Updated weights for policy 0, policy_version 50780 (0.0009) +[2023-10-14 07:10:48,021][100917] Updated weights for policy 1, policy_version 50832 (0.0010) +[2023-10-14 07:10:48,391][100917] Updated weights for policy 1, policy_version 50842 (0.0008) +[2023-10-14 07:10:48,512][99942] Fps is (10 sec: 9830.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 104038400. Throughput: 0: 1646.3, 1: 1645.5. Samples: 26019670. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:10:48,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 07:10:52,014][100936] Updated weights for policy 0, policy_version 50790 (0.0007) +[2023-10-14 07:10:52,367][100917] Updated weights for policy 1, policy_version 50852 (0.0010) +[2023-10-14 07:10:52,377][100936] Updated weights for policy 0, policy_version 50800 (0.0008) +[2023-10-14 07:10:52,734][100917] Updated weights for policy 1, policy_version 50862 (0.0007) +[2023-10-14 07:10:52,751][100936] Updated weights for policy 0, policy_version 50810 (0.0007) +[2023-10-14 07:10:53,110][100917] Updated weights for policy 1, policy_version 50872 (0.0009) +[2023-10-14 07:10:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 104136704. Throughput: 0: 1649.0, 1: 1643.5. Samples: 26038976. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:10:53,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 07:10:56,811][100936] Updated weights for policy 0, policy_version 50820 (0.0007) +[2023-10-14 07:10:57,176][100936] Updated weights for policy 0, policy_version 50830 (0.0007) +[2023-10-14 07:10:57,299][100917] Updated weights for policy 1, policy_version 50882 (0.0009) +[2023-10-14 07:10:57,543][100936] Updated weights for policy 0, policy_version 50840 (0.0007) +[2023-10-14 07:10:57,669][100917] Updated weights for policy 1, policy_version 50892 (0.0007) +[2023-10-14 07:10:58,045][100917] Updated weights for policy 1, policy_version 50902 (0.0009) +[2023-10-14 07:10:58,415][100917] Updated weights for policy 1, policy_version 50912 (0.0007) +[2023-10-14 07:10:58,512][99942] Fps is (10 sec: 16384.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 104202240. Throughput: 0: 1651.3, 1: 1653.2. Samples: 26050036. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:10:58,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 07:11:01,452][100936] Updated weights for policy 0, policy_version 50850 (0.0008) +[2023-10-14 07:11:01,823][100936] Updated weights for policy 0, policy_version 50860 (0.0008) +[2023-10-14 07:11:02,189][100936] Updated weights for policy 0, policy_version 50870 (0.0010) +[2023-10-14 07:11:02,453][100917] Updated weights for policy 1, policy_version 50922 (0.0009) +[2023-10-14 07:11:02,562][100936] Updated weights for policy 0, policy_version 50880 (0.0009) +[2023-10-14 07:11:02,833][100917] Updated weights for policy 1, policy_version 50932 (0.0008) +[2023-10-14 07:11:03,196][100917] Updated weights for policy 1, policy_version 50942 (0.0009) +[2023-10-14 07:11:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 104267776. Throughput: 0: 1652.1, 1: 1657.6. Samples: 26069910. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:11:03,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:11:06,741][100936] Updated weights for policy 0, policy_version 50890 (0.0008) +[2023-10-14 07:11:07,112][100936] Updated weights for policy 0, policy_version 50900 (0.0009) +[2023-10-14 07:11:07,232][100917] Updated weights for policy 1, policy_version 50952 (0.0008) +[2023-10-14 07:11:07,478][100936] Updated weights for policy 0, policy_version 50910 (0.0009) +[2023-10-14 07:11:07,608][100917] Updated weights for policy 1, policy_version 50962 (0.0008) +[2023-10-14 07:11:07,976][100917] Updated weights for policy 1, policy_version 50972 (0.0010) +[2023-10-14 07:11:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 104333312. Throughput: 0: 1664.6, 1: 1648.8. Samples: 26089320. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:11:08,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:11:11,259][100936] Updated weights for policy 0, policy_version 50920 (0.0009) +[2023-10-14 07:11:11,634][100936] Updated weights for policy 0, policy_version 50930 (0.0008) +[2023-10-14 07:11:12,004][100936] Updated weights for policy 0, policy_version 50940 (0.0009) +[2023-10-14 07:11:12,093][100917] Updated weights for policy 1, policy_version 50982 (0.0009) +[2023-10-14 07:11:12,460][100917] Updated weights for policy 1, policy_version 50992 (0.0007) +[2023-10-14 07:11:12,827][100917] Updated weights for policy 1, policy_version 51002 (0.0011) +[2023-10-14 07:11:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 104398848. Throughput: 0: 1656.5, 1: 1660.0. Samples: 26100196. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:11:13,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:11:15,868][100936] Updated weights for policy 0, policy_version 50950 (0.0009) +[2023-10-14 07:11:16,242][100936] Updated weights for policy 0, policy_version 50960 (0.0008) +[2023-10-14 07:11:16,608][100936] Updated weights for policy 0, policy_version 50970 (0.0010) +[2023-10-14 07:11:17,052][100917] Updated weights for policy 1, policy_version 51012 (0.0009) +[2023-10-14 07:11:17,432][100917] Updated weights for policy 1, policy_version 51022 (0.0010) +[2023-10-14 07:11:17,792][100917] Updated weights for policy 1, policy_version 51032 (0.0009) +[2023-10-14 07:11:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 104464384. Throughput: 0: 1660.6, 1: 1657.4. Samples: 26119984. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:11:18,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:11:20,685][100936] Updated weights for policy 0, policy_version 50980 (0.0008) +[2023-10-14 07:11:21,054][100936] Updated weights for policy 0, policy_version 50990 (0.0008) +[2023-10-14 07:11:21,435][100936] Updated weights for policy 0, policy_version 51000 (0.0009) +[2023-10-14 07:11:21,937][100917] Updated weights for policy 1, policy_version 51042 (0.0009) +[2023-10-14 07:11:22,353][100917] Updated weights for policy 1, policy_version 51052 (0.0011) +[2023-10-14 07:11:22,731][100917] Updated weights for policy 1, policy_version 51062 (0.0009) +[2023-10-14 07:11:23,102][100917] Updated weights for policy 1, policy_version 51072 (0.0009) +[2023-10-14 07:11:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 104529920. Throughput: 0: 1680.2, 1: 1653.0. Samples: 26139454. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:11:23,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:11:25,679][100936] Updated weights for policy 0, policy_version 51010 (0.0010) +[2023-10-14 07:11:26,095][100936] Updated weights for policy 0, policy_version 51020 (0.0008) +[2023-10-14 07:11:26,463][100936] Updated weights for policy 0, policy_version 51030 (0.0010) +[2023-10-14 07:11:26,831][100936] Updated weights for policy 0, policy_version 51040 (0.0009) +[2023-10-14 07:11:27,028][100917] Updated weights for policy 1, policy_version 51082 (0.0007) +[2023-10-14 07:11:27,397][100917] Updated weights for policy 1, policy_version 51092 (0.0008) +[2023-10-14 07:11:27,769][100917] Updated weights for policy 1, policy_version 51102 (0.0009) +[2023-10-14 07:11:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 104595456. Throughput: 0: 1661.9, 1: 1663.6. Samples: 26149910. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:11:28,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:11:31,026][100936] Updated weights for policy 0, policy_version 51050 (0.0010) +[2023-10-14 07:11:31,400][100936] Updated weights for policy 0, policy_version 51060 (0.0007) +[2023-10-14 07:11:31,767][100936] Updated weights for policy 0, policy_version 51070 (0.0007) +[2023-10-14 07:11:31,930][100917] Updated weights for policy 1, policy_version 51112 (0.0008) +[2023-10-14 07:11:32,293][100917] Updated weights for policy 1, policy_version 51122 (0.0007) +[2023-10-14 07:11:32,664][100917] Updated weights for policy 1, policy_version 51132 (0.0007) +[2023-10-14 07:11:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 104660992. Throughput: 0: 1668.7, 1: 1662.4. Samples: 26169570. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:11:33,512][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:11:35,859][100936] Updated weights for policy 0, policy_version 51080 (0.0008) +[2023-10-14 07:11:36,245][100936] Updated weights for policy 0, policy_version 51090 (0.0009) +[2023-10-14 07:11:36,618][100936] Updated weights for policy 0, policy_version 51100 (0.0011) +[2023-10-14 07:11:36,905][100917] Updated weights for policy 1, policy_version 51142 (0.0009) +[2023-10-14 07:11:37,286][100917] Updated weights for policy 1, policy_version 51152 (0.0008) +[2023-10-14 07:11:37,651][100917] Updated weights for policy 1, policy_version 51162 (0.0009) +[2023-10-14 07:11:38,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 104726528. Throughput: 0: 1682.5, 1: 1649.3. Samples: 26188910. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:11:38,512][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:11:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000051104_52330496.pth... +[2023-10-14 07:11:38,522][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000051168_52396032.pth... +[2023-10-14 07:11:38,555][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000049600_50790400.pth +[2023-10-14 07:11:38,563][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000049568_50757632.pth +[2023-10-14 07:11:40,707][100936] Updated weights for policy 0, policy_version 51110 (0.0008) +[2023-10-14 07:11:41,084][100936] Updated weights for policy 0, policy_version 51120 (0.0010) +[2023-10-14 07:11:41,448][100936] Updated weights for policy 0, policy_version 51130 (0.0008) +[2023-10-14 07:11:41,834][100917] Updated weights for policy 1, policy_version 51172 (0.0011) +[2023-10-14 07:11:42,211][100917] Updated weights for policy 1, policy_version 51182 (0.0007) +[2023-10-14 07:11:42,580][100917] Updated weights for policy 1, policy_version 51192 (0.0009) +[2023-10-14 07:11:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 104792064. Throughput: 0: 1658.7, 1: 1658.0. Samples: 26199286. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:11:43,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:11:45,628][100936] Updated weights for policy 0, policy_version 51140 (0.0009) +[2023-10-14 07:11:46,006][100936] Updated weights for policy 0, policy_version 51150 (0.0007) +[2023-10-14 07:11:46,371][100936] Updated weights for policy 0, policy_version 51160 (0.0008) +[2023-10-14 07:11:46,525][100917] Updated weights for policy 1, policy_version 51202 (0.0009) +[2023-10-14 07:11:46,902][100917] Updated weights for policy 1, policy_version 51212 (0.0009) +[2023-10-14 07:11:47,266][100917] Updated weights for policy 1, policy_version 51222 (0.0008) +[2023-10-14 07:11:47,648][100917] Updated weights for policy 1, policy_version 51232 (0.0009) +[2023-10-14 07:11:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 104857600. Throughput: 0: 1665.1, 1: 1647.2. Samples: 26218960. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:11:48,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:11:50,523][100936] Updated weights for policy 0, policy_version 51170 (0.0007) +[2023-10-14 07:11:50,894][100936] Updated weights for policy 0, policy_version 51180 (0.0007) +[2023-10-14 07:11:51,258][100936] Updated weights for policy 0, policy_version 51190 (0.0008) +[2023-10-14 07:11:51,625][100936] Updated weights for policy 0, policy_version 51200 (0.0007) +[2023-10-14 07:11:51,759][100917] Updated weights for policy 1, policy_version 51242 (0.0011) +[2023-10-14 07:11:52,127][100917] Updated weights for policy 1, policy_version 51252 (0.0009) +[2023-10-14 07:11:52,507][100917] Updated weights for policy 1, policy_version 51262 (0.0007) +[2023-10-14 07:11:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 104923136. Throughput: 0: 1668.1, 1: 1653.3. Samples: 26238782. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:11:53,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:11:55,646][100936] Updated weights for policy 0, policy_version 51210 (0.0007) +[2023-10-14 07:11:56,019][100936] Updated weights for policy 0, policy_version 51220 (0.0008) +[2023-10-14 07:11:56,384][100936] Updated weights for policy 0, policy_version 51230 (0.0008) +[2023-10-14 07:11:56,485][100917] Updated weights for policy 1, policy_version 51272 (0.0008) +[2023-10-14 07:11:56,861][100917] Updated weights for policy 1, policy_version 51282 (0.0010) +[2023-10-14 07:11:57,229][100917] Updated weights for policy 1, policy_version 51292 (0.0009) +[2023-10-14 07:11:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 104988672. Throughput: 0: 1649.6, 1: 1664.4. Samples: 26249326. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:11:58,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:12:00,653][100936] Updated weights for policy 0, policy_version 51240 (0.0008) +[2023-10-14 07:12:01,025][100936] Updated weights for policy 0, policy_version 51250 (0.0007) +[2023-10-14 07:12:01,389][100936] Updated weights for policy 0, policy_version 51260 (0.0009) +[2023-10-14 07:12:01,492][100917] Updated weights for policy 1, policy_version 51302 (0.0009) +[2023-10-14 07:12:01,853][100917] Updated weights for policy 1, policy_version 51312 (0.0010) +[2023-10-14 07:12:02,237][100917] Updated weights for policy 1, policy_version 51322 (0.0009) +[2023-10-14 07:12:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 105054208. Throughput: 0: 1660.0, 1: 1649.8. Samples: 26268924. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:12:03,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:12:05,317][100936] Updated weights for policy 0, policy_version 51270 (0.0008) +[2023-10-14 07:12:05,683][100936] Updated weights for policy 0, policy_version 51280 (0.0007) +[2023-10-14 07:12:06,047][100936] Updated weights for policy 0, policy_version 51290 (0.0008) +[2023-10-14 07:12:06,481][100917] Updated weights for policy 1, policy_version 51332 (0.0007) +[2023-10-14 07:12:06,855][100917] Updated weights for policy 1, policy_version 51342 (0.0010) +[2023-10-14 07:12:07,225][100917] Updated weights for policy 1, policy_version 51352 (0.0011) +[2023-10-14 07:12:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 105119744. Throughput: 0: 1659.7, 1: 1660.8. Samples: 26288878. Policy #0 lag: (min: 31.0, avg: 46.2, max: 63.0) +[2023-10-14 07:12:08,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:12:10,111][100936] Updated weights for policy 0, policy_version 51300 (0.0010) +[2023-10-14 07:12:10,489][100936] Updated weights for policy 0, policy_version 51310 (0.0007) +[2023-10-14 07:12:10,857][100936] Updated weights for policy 0, policy_version 51320 (0.0007) +[2023-10-14 07:12:11,346][100917] Updated weights for policy 1, policy_version 51362 (0.0008) +[2023-10-14 07:12:11,763][100917] Updated weights for policy 1, policy_version 51372 (0.0008) +[2023-10-14 07:12:12,139][100917] Updated weights for policy 1, policy_version 51382 (0.0010) +[2023-10-14 07:12:12,518][100917] Updated weights for policy 1, policy_version 51392 (0.0009) +[2023-10-14 07:12:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 105185280. Throughput: 0: 1652.8, 1: 1661.2. Samples: 26299036. Policy #0 lag: (min: 31.0, avg: 46.2, max: 63.0) +[2023-10-14 07:12:13,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 07:12:15,091][100936] Updated weights for policy 0, policy_version 51330 (0.0009) +[2023-10-14 07:12:15,462][100936] Updated weights for policy 0, policy_version 51340 (0.0010) +[2023-10-14 07:12:15,841][100936] Updated weights for policy 0, policy_version 51350 (0.0011) +[2023-10-14 07:12:16,206][100936] Updated weights for policy 0, policy_version 51360 (0.0011) +[2023-10-14 07:12:16,686][100917] Updated weights for policy 1, policy_version 51402 (0.0009) +[2023-10-14 07:12:17,055][100917] Updated weights for policy 1, policy_version 51412 (0.0010) +[2023-10-14 07:12:17,434][100917] Updated weights for policy 1, policy_version 51422 (0.0008) +[2023-10-14 07:12:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 105250816. Throughput: 0: 1660.6, 1: 1652.3. Samples: 26318650. Policy #0 lag: (min: 31.0, avg: 46.2, max: 63.0) +[2023-10-14 07:12:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:12:20,549][100936] Updated weights for policy 0, policy_version 51370 (0.0011) +[2023-10-14 07:12:20,923][100936] Updated weights for policy 0, policy_version 51380 (0.0008) +[2023-10-14 07:12:21,297][100936] Updated weights for policy 0, policy_version 51390 (0.0007) +[2023-10-14 07:12:21,461][100917] Updated weights for policy 1, policy_version 51432 (0.0008) +[2023-10-14 07:12:21,827][100917] Updated weights for policy 1, policy_version 51442 (0.0008) +[2023-10-14 07:12:22,197][100917] Updated weights for policy 1, policy_version 51452 (0.0008) +[2023-10-14 07:12:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 105316352. Throughput: 0: 1658.4, 1: 1666.2. Samples: 26338518. Policy #0 lag: (min: 31.0, avg: 46.2, max: 63.0) +[2023-10-14 07:12:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:12:25,512][100936] Updated weights for policy 0, policy_version 51400 (0.0007) +[2023-10-14 07:12:25,876][100936] Updated weights for policy 0, policy_version 51410 (0.0007) +[2023-10-14 07:12:26,250][100936] Updated weights for policy 0, policy_version 51420 (0.0007) +[2023-10-14 07:12:26,334][100917] Updated weights for policy 1, policy_version 51462 (0.0010) +[2023-10-14 07:12:26,698][100917] Updated weights for policy 1, policy_version 51472 (0.0010) +[2023-10-14 07:12:27,077][100917] Updated weights for policy 1, policy_version 51482 (0.0009) +[2023-10-14 07:12:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 105381888. Throughput: 0: 1649.2, 1: 1675.3. Samples: 26348886. Policy #0 lag: (min: 31.0, avg: 46.2, max: 63.0) +[2023-10-14 07:12:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:12:30,210][100936] Updated weights for policy 0, policy_version 51430 (0.0008) +[2023-10-14 07:12:30,585][100936] Updated weights for policy 0, policy_version 51440 (0.0008) +[2023-10-14 07:12:30,950][100936] Updated weights for policy 0, policy_version 51450 (0.0009) +[2023-10-14 07:12:31,176][100917] Updated weights for policy 1, policy_version 51492 (0.0007) +[2023-10-14 07:12:31,554][100917] Updated weights for policy 1, policy_version 51502 (0.0010) +[2023-10-14 07:12:31,921][100917] Updated weights for policy 1, policy_version 51512 (0.0007) +[2023-10-14 07:12:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 105447424. Throughput: 0: 1657.3, 1: 1664.0. Samples: 26368416. Policy #0 lag: (min: 31.0, avg: 46.2, max: 63.0) +[2023-10-14 07:12:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:12:35,256][100936] Updated weights for policy 0, policy_version 51460 (0.0009) +[2023-10-14 07:12:35,623][100936] Updated weights for policy 0, policy_version 51470 (0.0008) +[2023-10-14 07:12:35,789][100917] Updated weights for policy 1, policy_version 51522 (0.0007) +[2023-10-14 07:12:35,988][100936] Updated weights for policy 0, policy_version 51480 (0.0008) +[2023-10-14 07:12:36,165][100917] Updated weights for policy 1, policy_version 51532 (0.0011) +[2023-10-14 07:12:36,542][100917] Updated weights for policy 1, policy_version 51542 (0.0010) +[2023-10-14 07:12:36,902][100917] Updated weights for policy 1, policy_version 51552 (0.0009) +[2023-10-14 07:12:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 105512960. Throughput: 0: 1653.6, 1: 1678.5. Samples: 26388730. Policy #0 lag: (min: 31.0, avg: 46.2, max: 63.0) +[2023-10-14 07:12:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:12:40,232][100936] Updated weights for policy 0, policy_version 51490 (0.0009) +[2023-10-14 07:12:40,598][100936] Updated weights for policy 0, policy_version 51500 (0.0009) +[2023-10-14 07:12:40,865][100917] Updated weights for policy 1, policy_version 51562 (0.0007) +[2023-10-14 07:12:40,965][100936] Updated weights for policy 0, policy_version 51510 (0.0008) +[2023-10-14 07:12:41,240][100917] Updated weights for policy 1, policy_version 51572 (0.0010) +[2023-10-14 07:12:41,339][100936] Updated weights for policy 0, policy_version 51520 (0.0008) +[2023-10-14 07:12:41,603][100917] Updated weights for policy 1, policy_version 51582 (0.0009) +[2023-10-14 07:12:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 105578496. Throughput: 0: 1649.9, 1: 1663.1. Samples: 26398412. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 07:12:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:12:45,447][100936] Updated weights for policy 0, policy_version 51530 (0.0007) +[2023-10-14 07:12:45,818][100936] Updated weights for policy 0, policy_version 51540 (0.0007) +[2023-10-14 07:12:45,863][100917] Updated weights for policy 1, policy_version 51592 (0.0010) +[2023-10-14 07:12:46,185][100936] Updated weights for policy 0, policy_version 51550 (0.0008) +[2023-10-14 07:12:46,230][100917] Updated weights for policy 1, policy_version 51602 (0.0007) +[2023-10-14 07:12:46,595][100917] Updated weights for policy 1, policy_version 51612 (0.0009) +[2023-10-14 07:12:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 105644032. Throughput: 0: 1651.2, 1: 1660.9. Samples: 26417968. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 07:12:48,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:12:50,253][100936] Updated weights for policy 0, policy_version 51560 (0.0010) +[2023-10-14 07:12:50,616][100936] Updated weights for policy 0, policy_version 51570 (0.0008) +[2023-10-14 07:12:50,729][100917] Updated weights for policy 1, policy_version 51622 (0.0008) +[2023-10-14 07:12:50,988][100936] Updated weights for policy 0, policy_version 51580 (0.0008) +[2023-10-14 07:12:51,094][100917] Updated weights for policy 1, policy_version 51632 (0.0009) +[2023-10-14 07:12:51,477][100917] Updated weights for policy 1, policy_version 51642 (0.0009) +[2023-10-14 07:12:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 105709568. Throughput: 0: 1643.8, 1: 1676.0. Samples: 26438268. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 07:12:53,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:12:55,137][100936] Updated weights for policy 0, policy_version 51590 (0.0009) +[2023-10-14 07:12:55,513][100936] Updated weights for policy 0, policy_version 51600 (0.0009) +[2023-10-14 07:12:55,592][100917] Updated weights for policy 1, policy_version 51652 (0.0009) +[2023-10-14 07:12:55,874][100936] Updated weights for policy 0, policy_version 51610 (0.0008) +[2023-10-14 07:12:55,955][100917] Updated weights for policy 1, policy_version 51662 (0.0009) +[2023-10-14 07:12:56,328][100917] Updated weights for policy 1, policy_version 51672 (0.0009) +[2023-10-14 07:12:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 105775104. Throughput: 0: 1645.1, 1: 1661.1. Samples: 26447812. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 07:12:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:13:00,049][100936] Updated weights for policy 0, policy_version 51620 (0.0009) +[2023-10-14 07:13:00,418][100936] Updated weights for policy 0, policy_version 51630 (0.0008) +[2023-10-14 07:13:00,593][100917] Updated weights for policy 1, policy_version 51682 (0.0009) +[2023-10-14 07:13:00,791][100936] Updated weights for policy 0, policy_version 51640 (0.0009) +[2023-10-14 07:13:01,011][100917] Updated weights for policy 1, policy_version 51692 (0.0009) +[2023-10-14 07:13:01,379][100917] Updated weights for policy 1, policy_version 51702 (0.0010) +[2023-10-14 07:13:01,745][100917] Updated weights for policy 1, policy_version 51712 (0.0008) +[2023-10-14 07:13:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 105840640. Throughput: 0: 1650.0, 1: 1656.8. Samples: 26467456. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 07:13:03,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:13:04,952][100936] Updated weights for policy 0, policy_version 51650 (0.0009) +[2023-10-14 07:13:05,335][100936] Updated weights for policy 0, policy_version 51660 (0.0008) +[2023-10-14 07:13:05,701][100936] Updated weights for policy 0, policy_version 51670 (0.0008) +[2023-10-14 07:13:05,864][100917] Updated weights for policy 1, policy_version 51722 (0.0008) +[2023-10-14 07:13:06,072][100936] Updated weights for policy 0, policy_version 51680 (0.0008) +[2023-10-14 07:13:06,243][100917] Updated weights for policy 1, policy_version 51732 (0.0010) +[2023-10-14 07:13:06,609][100917] Updated weights for policy 1, policy_version 51742 (0.0009) +[2023-10-14 07:13:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 105906176. Throughput: 0: 1650.9, 1: 1664.2. Samples: 26487700. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 07:13:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:13:10,072][100936] Updated weights for policy 0, policy_version 51690 (0.0008) +[2023-10-14 07:13:10,447][100936] Updated weights for policy 0, policy_version 51700 (0.0007) +[2023-10-14 07:13:10,531][100917] Updated weights for policy 1, policy_version 51752 (0.0007) +[2023-10-14 07:13:10,814][100936] Updated weights for policy 0, policy_version 51710 (0.0007) +[2023-10-14 07:13:10,895][100917] Updated weights for policy 1, policy_version 51762 (0.0009) +[2023-10-14 07:13:11,267][100917] Updated weights for policy 1, policy_version 51772 (0.0008) +[2023-10-14 07:13:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 105971712. Throughput: 0: 1652.5, 1: 1643.9. Samples: 26497226. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 07:13:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:13:15,025][100936] Updated weights for policy 0, policy_version 51720 (0.0007) +[2023-10-14 07:13:15,404][100936] Updated weights for policy 0, policy_version 51730 (0.0007) +[2023-10-14 07:13:15,488][100917] Updated weights for policy 1, policy_version 51782 (0.0009) +[2023-10-14 07:13:15,774][100936] Updated weights for policy 0, policy_version 51740 (0.0007) +[2023-10-14 07:13:15,855][100917] Updated weights for policy 1, policy_version 51792 (0.0008) +[2023-10-14 07:13:16,219][100917] Updated weights for policy 1, policy_version 51802 (0.0009) +[2023-10-14 07:13:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 106037248. Throughput: 0: 1648.7, 1: 1647.8. Samples: 26516758. Policy #0 lag: (min: 1.0, avg: 8.2, max: 33.0) +[2023-10-14 07:13:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:13:19,871][100936] Updated weights for policy 0, policy_version 51750 (0.0008) +[2023-10-14 07:13:20,250][100936] Updated weights for policy 0, policy_version 51760 (0.0008) +[2023-10-14 07:13:20,389][100917] Updated weights for policy 1, policy_version 51812 (0.0010) +[2023-10-14 07:13:20,620][100936] Updated weights for policy 0, policy_version 51770 (0.0008) +[2023-10-14 07:13:20,757][100917] Updated weights for policy 1, policy_version 51822 (0.0010) +[2023-10-14 07:13:21,126][100917] Updated weights for policy 1, policy_version 51832 (0.0010) +[2023-10-14 07:13:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 106102784. Throughput: 0: 1651.6, 1: 1644.0. Samples: 26537036. Policy #0 lag: (min: 1.0, avg: 8.2, max: 33.0) +[2023-10-14 07:13:23,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:13:24,729][100936] Updated weights for policy 0, policy_version 51780 (0.0009) +[2023-10-14 07:13:25,096][100936] Updated weights for policy 0, policy_version 51790 (0.0009) +[2023-10-14 07:13:25,470][100936] Updated weights for policy 0, policy_version 51800 (0.0010) +[2023-10-14 07:13:25,473][100917] Updated weights for policy 1, policy_version 51842 (0.0010) +[2023-10-14 07:13:25,835][100917] Updated weights for policy 1, policy_version 51852 (0.0009) +[2023-10-14 07:13:26,208][100917] Updated weights for policy 1, policy_version 51862 (0.0009) +[2023-10-14 07:13:26,585][100917] Updated weights for policy 1, policy_version 51872 (0.0007) +[2023-10-14 07:13:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 106168320. Throughput: 0: 1649.1, 1: 1640.0. Samples: 26546422. Policy #0 lag: (min: 1.0, avg: 8.2, max: 33.0) +[2023-10-14 07:13:28,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:13:29,934][100936] Updated weights for policy 0, policy_version 51810 (0.0008) +[2023-10-14 07:13:30,300][100936] Updated weights for policy 0, policy_version 51820 (0.0010) +[2023-10-14 07:13:30,667][100936] Updated weights for policy 0, policy_version 51830 (0.0009) +[2023-10-14 07:13:30,766][100917] Updated weights for policy 1, policy_version 51882 (0.0007) +[2023-10-14 07:13:31,034][100936] Updated weights for policy 0, policy_version 51840 (0.0007) +[2023-10-14 07:13:31,137][100917] Updated weights for policy 1, policy_version 51892 (0.0009) +[2023-10-14 07:13:31,516][100917] Updated weights for policy 1, policy_version 51902 (0.0010) +[2023-10-14 07:13:33,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 106233856. Throughput: 0: 1643.0, 1: 1647.5. Samples: 26566038. Policy #0 lag: (min: 1.0, avg: 8.2, max: 33.0) +[2023-10-14 07:13:33,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:13:35,411][100917] Updated weights for policy 1, policy_version 51912 (0.0009) +[2023-10-14 07:13:35,413][100936] Updated weights for policy 0, policy_version 51850 (0.0008) +[2023-10-14 07:13:35,782][100936] Updated weights for policy 0, policy_version 51860 (0.0008) +[2023-10-14 07:13:35,782][100917] Updated weights for policy 1, policy_version 51922 (0.0007) +[2023-10-14 07:13:36,152][100936] Updated weights for policy 0, policy_version 51870 (0.0008) +[2023-10-14 07:13:36,162][100917] Updated weights for policy 1, policy_version 51932 (0.0008) +[2023-10-14 07:13:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 106299392. Throughput: 0: 1638.3, 1: 1649.5. Samples: 26586222. Policy #0 lag: (min: 1.0, avg: 8.2, max: 33.0) +[2023-10-14 07:13:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:13:38,521][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000051872_53116928.pth... +[2023-10-14 07:13:38,521][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000051936_53182464.pth... +[2023-10-14 07:13:38,550][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000050336_51544064.pth +[2023-10-14 07:13:38,555][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000050400_51609600.pth +[2023-10-14 07:13:40,232][100917] Updated weights for policy 1, policy_version 51942 (0.0009) +[2023-10-14 07:13:40,380][100936] Updated weights for policy 0, policy_version 51880 (0.0007) +[2023-10-14 07:13:40,605][100917] Updated weights for policy 1, policy_version 51952 (0.0008) +[2023-10-14 07:13:40,752][100936] Updated weights for policy 0, policy_version 51890 (0.0007) +[2023-10-14 07:13:40,965][100917] Updated weights for policy 1, policy_version 51962 (0.0009) +[2023-10-14 07:13:41,117][100936] Updated weights for policy 0, policy_version 51900 (0.0009) +[2023-10-14 07:13:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 106364928. Throughput: 0: 1638.8, 1: 1640.4. Samples: 26595376. Policy #0 lag: (min: 1.0, avg: 8.2, max: 33.0) +[2023-10-14 07:13:43,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:13:45,214][100917] Updated weights for policy 1, policy_version 51972 (0.0009) +[2023-10-14 07:13:45,229][100936] Updated weights for policy 0, policy_version 51910 (0.0009) +[2023-10-14 07:13:45,593][100936] Updated weights for policy 0, policy_version 51920 (0.0007) +[2023-10-14 07:13:45,599][100917] Updated weights for policy 1, policy_version 51982 (0.0010) +[2023-10-14 07:13:45,965][100917] Updated weights for policy 1, policy_version 51992 (0.0009) +[2023-10-14 07:13:45,968][100936] Updated weights for policy 0, policy_version 51930 (0.0007) +[2023-10-14 07:13:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 106430464. Throughput: 0: 1637.6, 1: 1652.8. Samples: 26615522. Policy #0 lag: (min: 1.0, avg: 8.2, max: 33.0) +[2023-10-14 07:13:48,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:13:50,085][100936] Updated weights for policy 0, policy_version 51940 (0.0009) +[2023-10-14 07:13:50,312][100917] Updated weights for policy 1, policy_version 52002 (0.0009) +[2023-10-14 07:13:50,480][100936] Updated weights for policy 0, policy_version 51950 (0.0007) +[2023-10-14 07:13:50,719][100917] Updated weights for policy 1, policy_version 52012 (0.0009) +[2023-10-14 07:13:50,845][100936] Updated weights for policy 0, policy_version 51960 (0.0007) +[2023-10-14 07:13:51,096][100917] Updated weights for policy 1, policy_version 52022 (0.0008) +[2023-10-14 07:13:51,475][100917] Updated weights for policy 1, policy_version 52032 (0.0010) +[2023-10-14 07:13:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 106496000. Throughput: 0: 1634.6, 1: 1654.8. Samples: 26635722. Policy #0 lag: (min: 33.0, avg: 39.8, max: 40.0) +[2023-10-14 07:13:53,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:13:54,907][100936] Updated weights for policy 0, policy_version 51970 (0.0007) +[2023-10-14 07:13:55,277][100936] Updated weights for policy 0, policy_version 51980 (0.0008) +[2023-10-14 07:13:55,360][100917] Updated weights for policy 1, policy_version 52042 (0.0008) +[2023-10-14 07:13:55,641][100936] Updated weights for policy 0, policy_version 51990 (0.0008) +[2023-10-14 07:13:55,735][100917] Updated weights for policy 1, policy_version 52052 (0.0010) +[2023-10-14 07:13:56,014][100936] Updated weights for policy 0, policy_version 52000 (0.0009) +[2023-10-14 07:13:56,109][100917] Updated weights for policy 1, policy_version 52062 (0.0008) +[2023-10-14 07:13:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 106561536. Throughput: 0: 1633.3, 1: 1651.6. Samples: 26645044. Policy #0 lag: (min: 33.0, avg: 39.8, max: 40.0) +[2023-10-14 07:13:58,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:14:00,262][100936] Updated weights for policy 0, policy_version 52010 (0.0007) +[2023-10-14 07:14:00,421][100917] Updated weights for policy 1, policy_version 52072 (0.0010) +[2023-10-14 07:14:00,635][100936] Updated weights for policy 0, policy_version 52020 (0.0008) +[2023-10-14 07:14:00,791][100917] Updated weights for policy 1, policy_version 52082 (0.0007) +[2023-10-14 07:14:01,013][100936] Updated weights for policy 0, policy_version 52030 (0.0007) +[2023-10-14 07:14:01,159][100917] Updated weights for policy 1, policy_version 52092 (0.0009) +[2023-10-14 07:14:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 106627072. Throughput: 0: 1637.3, 1: 1657.3. Samples: 26665018. Policy #0 lag: (min: 33.0, avg: 39.8, max: 40.0) +[2023-10-14 07:14:03,512][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:14:05,123][100936] Updated weights for policy 0, policy_version 52040 (0.0009) +[2023-10-14 07:14:05,211][100917] Updated weights for policy 1, policy_version 52102 (0.0010) +[2023-10-14 07:14:05,495][100936] Updated weights for policy 0, policy_version 52050 (0.0008) +[2023-10-14 07:14:05,575][100917] Updated weights for policy 1, policy_version 52112 (0.0008) +[2023-10-14 07:14:05,870][100936] Updated weights for policy 0, policy_version 52060 (0.0007) +[2023-10-14 07:14:05,946][100917] Updated weights for policy 1, policy_version 52122 (0.0007) +[2023-10-14 07:14:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 106692608. Throughput: 0: 1632.6, 1: 1664.9. Samples: 26685424. Policy #0 lag: (min: 33.0, avg: 39.8, max: 40.0) +[2023-10-14 07:14:08,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:14:10,029][100917] Updated weights for policy 1, policy_version 52132 (0.0010) +[2023-10-14 07:14:10,261][100936] Updated weights for policy 0, policy_version 52070 (0.0007) +[2023-10-14 07:14:10,394][100917] Updated weights for policy 1, policy_version 52142 (0.0008) +[2023-10-14 07:14:10,620][100936] Updated weights for policy 0, policy_version 52080 (0.0008) +[2023-10-14 07:14:10,763][100917] Updated weights for policy 1, policy_version 52152 (0.0007) +[2023-10-14 07:14:11,000][100936] Updated weights for policy 0, policy_version 52090 (0.0008) +[2023-10-14 07:14:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 106758144. Throughput: 0: 1631.8, 1: 1656.2. Samples: 26694384. Policy #0 lag: (min: 33.0, avg: 39.8, max: 40.0) +[2023-10-14 07:14:13,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:14:14,876][100917] Updated weights for policy 1, policy_version 52162 (0.0008) +[2023-10-14 07:14:15,146][100936] Updated weights for policy 0, policy_version 52100 (0.0008) +[2023-10-14 07:14:15,246][100917] Updated weights for policy 1, policy_version 52172 (0.0010) +[2023-10-14 07:14:15,514][100936] Updated weights for policy 0, policy_version 52110 (0.0007) +[2023-10-14 07:14:15,617][100917] Updated weights for policy 1, policy_version 52182 (0.0007) +[2023-10-14 07:14:15,878][100936] Updated weights for policy 0, policy_version 52120 (0.0007) +[2023-10-14 07:14:15,995][100917] Updated weights for policy 1, policy_version 52192 (0.0008) +[2023-10-14 07:14:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 106823680. Throughput: 0: 1634.7, 1: 1662.7. Samples: 26714420. Policy #0 lag: (min: 33.0, avg: 39.8, max: 40.0) +[2023-10-14 07:14:18,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:14:19,982][100936] Updated weights for policy 0, policy_version 52130 (0.0008) +[2023-10-14 07:14:20,071][100917] Updated weights for policy 1, policy_version 52202 (0.0007) +[2023-10-14 07:14:20,357][100936] Updated weights for policy 0, policy_version 52140 (0.0007) +[2023-10-14 07:14:20,443][100917] Updated weights for policy 1, policy_version 52212 (0.0008) +[2023-10-14 07:14:20,714][100936] Updated weights for policy 0, policy_version 52150 (0.0008) +[2023-10-14 07:14:20,811][100917] Updated weights for policy 1, policy_version 52222 (0.0009) +[2023-10-14 07:14:21,085][100936] Updated weights for policy 0, policy_version 52160 (0.0009) +[2023-10-14 07:14:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 106889216. Throughput: 0: 1644.4, 1: 1658.9. Samples: 26734868. Policy #0 lag: (min: 33.0, avg: 39.8, max: 40.0) +[2023-10-14 07:14:23,512][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:14:24,786][100917] Updated weights for policy 1, policy_version 52232 (0.0009) +[2023-10-14 07:14:25,154][100936] Updated weights for policy 0, policy_version 52170 (0.0008) +[2023-10-14 07:14:25,155][100917] Updated weights for policy 1, policy_version 52242 (0.0010) +[2023-10-14 07:14:25,523][100936] Updated weights for policy 0, policy_version 52180 (0.0007) +[2023-10-14 07:14:25,536][100917] Updated weights for policy 1, policy_version 52252 (0.0008) +[2023-10-14 07:14:25,899][100936] Updated weights for policy 0, policy_version 52190 (0.0008) +[2023-10-14 07:14:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 106954752. Throughput: 0: 1644.1, 1: 1657.9. Samples: 26743964. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:14:28,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:14:29,719][100917] Updated weights for policy 1, policy_version 52262 (0.0008) +[2023-10-14 07:14:30,085][100917] Updated weights for policy 1, policy_version 52272 (0.0009) +[2023-10-14 07:14:30,135][100936] Updated weights for policy 0, policy_version 52200 (0.0007) +[2023-10-14 07:14:30,460][100917] Updated weights for policy 1, policy_version 52282 (0.0007) +[2023-10-14 07:14:30,505][100936] Updated weights for policy 0, policy_version 52210 (0.0009) +[2023-10-14 07:14:30,879][100936] Updated weights for policy 0, policy_version 52220 (0.0007) +[2023-10-14 07:14:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 107020288. Throughput: 0: 1639.2, 1: 1663.5. Samples: 26764144. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:14:33,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:14:34,625][100917] Updated weights for policy 1, policy_version 52292 (0.0008) +[2023-10-14 07:14:34,956][100936] Updated weights for policy 0, policy_version 52230 (0.0008) +[2023-10-14 07:14:34,996][100917] Updated weights for policy 1, policy_version 52302 (0.0011) +[2023-10-14 07:14:35,327][100936] Updated weights for policy 0, policy_version 52240 (0.0008) +[2023-10-14 07:14:35,365][100917] Updated weights for policy 1, policy_version 52312 (0.0010) +[2023-10-14 07:14:35,700][100936] Updated weights for policy 0, policy_version 52250 (0.0008) +[2023-10-14 07:14:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 107085824. Throughput: 0: 1642.3, 1: 1664.7. Samples: 26784536. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:14:38,512][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:14:39,520][100917] Updated weights for policy 1, policy_version 52322 (0.0010) +[2023-10-14 07:14:39,888][100936] Updated weights for policy 0, policy_version 52260 (0.0008) +[2023-10-14 07:14:39,940][100917] Updated weights for policy 1, policy_version 52332 (0.0008) +[2023-10-14 07:14:40,264][100936] Updated weights for policy 0, policy_version 52270 (0.0009) +[2023-10-14 07:14:40,301][100917] Updated weights for policy 1, policy_version 52342 (0.0008) +[2023-10-14 07:14:40,624][100936] Updated weights for policy 0, policy_version 52280 (0.0007) +[2023-10-14 07:14:40,680][100917] Updated weights for policy 1, policy_version 52352 (0.0007) +[2023-10-14 07:14:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 107151360. Throughput: 0: 1642.3, 1: 1651.3. Samples: 26793254. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:14:43,512][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:14:44,688][100936] Updated weights for policy 0, policy_version 52290 (0.0008) +[2023-10-14 07:14:44,773][100917] Updated weights for policy 1, policy_version 52362 (0.0008) +[2023-10-14 07:14:45,055][100936] Updated weights for policy 0, policy_version 52300 (0.0008) +[2023-10-14 07:14:45,146][100917] Updated weights for policy 1, policy_version 52372 (0.0008) +[2023-10-14 07:14:45,420][100936] Updated weights for policy 0, policy_version 52310 (0.0009) +[2023-10-14 07:14:45,512][100917] Updated weights for policy 1, policy_version 52382 (0.0007) +[2023-10-14 07:14:45,790][100936] Updated weights for policy 0, policy_version 52320 (0.0009) +[2023-10-14 07:14:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 107216896. Throughput: 0: 1641.9, 1: 1660.0. Samples: 26813602. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:14:48,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:14:49,614][100917] Updated weights for policy 1, policy_version 52392 (0.0009) +[2023-10-14 07:14:49,918][100936] Updated weights for policy 0, policy_version 52330 (0.0008) +[2023-10-14 07:14:49,986][100917] Updated weights for policy 1, policy_version 52402 (0.0008) +[2023-10-14 07:14:50,279][100936] Updated weights for policy 0, policy_version 52340 (0.0008) +[2023-10-14 07:14:50,371][100917] Updated weights for policy 1, policy_version 52412 (0.0009) +[2023-10-14 07:14:50,642][100936] Updated weights for policy 0, policy_version 52350 (0.0009) +[2023-10-14 07:14:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 107282432. Throughput: 0: 1646.0, 1: 1657.5. Samples: 26834082. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:14:53,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:14:54,378][100917] Updated weights for policy 1, policy_version 52422 (0.0008) +[2023-10-14 07:14:54,740][100917] Updated weights for policy 1, policy_version 52432 (0.0009) +[2023-10-14 07:14:54,990][100936] Updated weights for policy 0, policy_version 52360 (0.0010) +[2023-10-14 07:14:55,113][100917] Updated weights for policy 1, policy_version 52442 (0.0008) +[2023-10-14 07:14:55,369][100936] Updated weights for policy 0, policy_version 52370 (0.0008) +[2023-10-14 07:14:55,745][100936] Updated weights for policy 0, policy_version 52380 (0.0010) +[2023-10-14 07:14:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 107347968. Throughput: 0: 1646.3, 1: 1653.2. Samples: 26842860. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:14:58,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:14:59,292][100917] Updated weights for policy 1, policy_version 52452 (0.0008) +[2023-10-14 07:14:59,664][100917] Updated weights for policy 1, policy_version 52462 (0.0009) +[2023-10-14 07:14:59,894][100936] Updated weights for policy 0, policy_version 52390 (0.0009) +[2023-10-14 07:15:00,042][100917] Updated weights for policy 1, policy_version 52472 (0.0007) +[2023-10-14 07:15:00,266][100936] Updated weights for policy 0, policy_version 52400 (0.0008) +[2023-10-14 07:15:00,631][100936] Updated weights for policy 0, policy_version 52410 (0.0009) +[2023-10-14 07:15:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 107413504. Throughput: 0: 1648.4, 1: 1661.2. Samples: 26863352. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:15:03,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:15:04,150][100917] Updated weights for policy 1, policy_version 52482 (0.0008) +[2023-10-14 07:15:04,522][100917] Updated weights for policy 1, policy_version 52492 (0.0009) +[2023-10-14 07:15:04,857][100936] Updated weights for policy 0, policy_version 52420 (0.0008) +[2023-10-14 07:15:04,892][100917] Updated weights for policy 1, policy_version 52502 (0.0009) +[2023-10-14 07:15:05,217][100936] Updated weights for policy 0, policy_version 52430 (0.0009) +[2023-10-14 07:15:05,259][100917] Updated weights for policy 1, policy_version 52512 (0.0008) +[2023-10-14 07:15:05,587][100936] Updated weights for policy 0, policy_version 52440 (0.0009) +[2023-10-14 07:15:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 107479040. Throughput: 0: 1643.8, 1: 1658.3. Samples: 26883462. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:15:08,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:15:09,458][100917] Updated weights for policy 1, policy_version 52522 (0.0008) +[2023-10-14 07:15:09,718][100936] Updated weights for policy 0, policy_version 52450 (0.0009) +[2023-10-14 07:15:09,835][100917] Updated weights for policy 1, policy_version 52532 (0.0009) +[2023-10-14 07:15:10,085][100936] Updated weights for policy 0, policy_version 52460 (0.0008) +[2023-10-14 07:15:10,203][100917] Updated weights for policy 1, policy_version 52542 (0.0007) +[2023-10-14 07:15:10,450][100936] Updated weights for policy 0, policy_version 52470 (0.0009) +[2023-10-14 07:15:10,817][100936] Updated weights for policy 0, policy_version 52480 (0.0011) +[2023-10-14 07:15:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 107544576. Throughput: 0: 1644.6, 1: 1655.9. Samples: 26892488. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:15:13,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:15:14,629][100917] Updated weights for policy 1, policy_version 52552 (0.0007) +[2023-10-14 07:15:15,004][100917] Updated weights for policy 1, policy_version 52562 (0.0009) +[2023-10-14 07:15:15,132][100936] Updated weights for policy 0, policy_version 52490 (0.0008) +[2023-10-14 07:15:15,374][100917] Updated weights for policy 1, policy_version 52572 (0.0008) +[2023-10-14 07:15:15,499][100936] Updated weights for policy 0, policy_version 52500 (0.0009) +[2023-10-14 07:15:15,868][100936] Updated weights for policy 0, policy_version 52510 (0.0007) +[2023-10-14 07:15:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 107610112. Throughput: 0: 1650.4, 1: 1648.9. Samples: 26912610. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:15:18,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:15:19,568][100917] Updated weights for policy 1, policy_version 52582 (0.0007) +[2023-10-14 07:15:19,940][100917] Updated weights for policy 1, policy_version 52592 (0.0009) +[2023-10-14 07:15:20,060][100936] Updated weights for policy 0, policy_version 52520 (0.0008) +[2023-10-14 07:15:20,313][100917] Updated weights for policy 1, policy_version 52602 (0.0009) +[2023-10-14 07:15:20,427][100936] Updated weights for policy 0, policy_version 52530 (0.0008) +[2023-10-14 07:15:20,792][100936] Updated weights for policy 0, policy_version 52540 (0.0009) +[2023-10-14 07:15:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 107675648. Throughput: 0: 1644.5, 1: 1645.9. Samples: 26932604. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:15:23,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:15:24,561][100917] Updated weights for policy 1, policy_version 52612 (0.0009) +[2023-10-14 07:15:24,954][100917] Updated weights for policy 1, policy_version 52622 (0.0007) +[2023-10-14 07:15:25,043][100936] Updated weights for policy 0, policy_version 52550 (0.0009) +[2023-10-14 07:15:25,327][100917] Updated weights for policy 1, policy_version 52632 (0.0009) +[2023-10-14 07:15:25,411][100936] Updated weights for policy 0, policy_version 52560 (0.0010) +[2023-10-14 07:15:25,783][100936] Updated weights for policy 0, policy_version 52570 (0.0009) +[2023-10-14 07:15:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13107.2). Total num frames: 107741184. Throughput: 0: 1642.4, 1: 1646.9. Samples: 26941274. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:15:28,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:15:29,604][100917] Updated weights for policy 1, policy_version 52642 (0.0008) +[2023-10-14 07:15:29,981][100917] Updated weights for policy 1, policy_version 52652 (0.0009) +[2023-10-14 07:15:30,062][100936] Updated weights for policy 0, policy_version 52580 (0.0009) +[2023-10-14 07:15:30,349][100917] Updated weights for policy 1, policy_version 52662 (0.0009) +[2023-10-14 07:15:30,423][100936] Updated weights for policy 0, policy_version 52590 (0.0009) +[2023-10-14 07:15:30,724][100917] Updated weights for policy 1, policy_version 52672 (0.0007) +[2023-10-14 07:15:30,793][100936] Updated weights for policy 0, policy_version 52600 (0.0009) +[2023-10-14 07:15:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 107806720. Throughput: 0: 1640.4, 1: 1642.8. Samples: 26961350. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:15:33,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:15:34,919][100936] Updated weights for policy 0, policy_version 52610 (0.0011) +[2023-10-14 07:15:34,956][100917] Updated weights for policy 1, policy_version 52682 (0.0010) +[2023-10-14 07:15:35,289][100936] Updated weights for policy 0, policy_version 52620 (0.0009) +[2023-10-14 07:15:35,334][100917] Updated weights for policy 1, policy_version 52692 (0.0011) +[2023-10-14 07:15:35,658][100936] Updated weights for policy 0, policy_version 52630 (0.0007) +[2023-10-14 07:15:35,702][100917] Updated weights for policy 1, policy_version 52702 (0.0008) +[2023-10-14 07:15:36,036][100936] Updated weights for policy 0, policy_version 52640 (0.0009) +[2023-10-14 07:15:38,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 107872256. Throughput: 0: 1642.8, 1: 1638.2. Samples: 26981724. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) +[2023-10-14 07:15:38,512][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:15:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000052640_53903360.pth... +[2023-10-14 07:15:38,523][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000052704_53968896.pth... +[2023-10-14 07:15:38,551][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000051104_52330496.pth +[2023-10-14 07:15:38,559][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000051168_52396032.pth +[2023-10-14 07:15:39,825][100917] Updated weights for policy 1, policy_version 52712 (0.0009) +[2023-10-14 07:15:40,016][100936] Updated weights for policy 0, policy_version 52650 (0.0008) +[2023-10-14 07:15:40,204][100917] Updated weights for policy 1, policy_version 52722 (0.0007) +[2023-10-14 07:15:40,381][100936] Updated weights for policy 0, policy_version 52660 (0.0008) +[2023-10-14 07:15:40,576][100917] Updated weights for policy 1, policy_version 52732 (0.0007) +[2023-10-14 07:15:40,751][100936] Updated weights for policy 0, policy_version 52670 (0.0007) +[2023-10-14 07:15:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 107937792. Throughput: 0: 1646.9, 1: 1638.8. Samples: 26990718. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) +[2023-10-14 07:15:43,512][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:15:44,586][100917] Updated weights for policy 1, policy_version 52742 (0.0008) +[2023-10-14 07:15:44,639][100936] Updated weights for policy 0, policy_version 52680 (0.0007) +[2023-10-14 07:15:44,965][100917] Updated weights for policy 1, policy_version 52752 (0.0009) +[2023-10-14 07:15:45,011][100936] Updated weights for policy 0, policy_version 52690 (0.0007) +[2023-10-14 07:15:45,343][100917] Updated weights for policy 1, policy_version 52762 (0.0008) +[2023-10-14 07:15:45,390][100936] Updated weights for policy 0, policy_version 52700 (0.0007) +[2023-10-14 07:15:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 108003328. Throughput: 0: 1653.3, 1: 1635.4. Samples: 27011344. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) +[2023-10-14 07:15:48,512][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:15:49,437][100936] Updated weights for policy 0, policy_version 52710 (0.0010) +[2023-10-14 07:15:49,542][100917] Updated weights for policy 1, policy_version 52772 (0.0008) +[2023-10-14 07:15:49,815][100936] Updated weights for policy 0, policy_version 52720 (0.0010) +[2023-10-14 07:15:49,911][100917] Updated weights for policy 1, policy_version 52782 (0.0009) +[2023-10-14 07:15:50,180][100936] Updated weights for policy 0, policy_version 52730 (0.0009) +[2023-10-14 07:15:50,280][100917] Updated weights for policy 1, policy_version 52792 (0.0008) +[2023-10-14 07:15:53,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 108068864. Throughput: 0: 1654.9, 1: 1635.5. Samples: 27031530. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) +[2023-10-14 07:15:53,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:15:54,292][100917] Updated weights for policy 1, policy_version 52802 (0.0008) +[2023-10-14 07:15:54,454][100936] Updated weights for policy 0, policy_version 52740 (0.0008) +[2023-10-14 07:15:54,663][100917] Updated weights for policy 1, policy_version 52812 (0.0008) +[2023-10-14 07:15:54,826][100936] Updated weights for policy 0, policy_version 52750 (0.0008) +[2023-10-14 07:15:55,035][100917] Updated weights for policy 1, policy_version 52822 (0.0009) +[2023-10-14 07:15:55,185][100936] Updated weights for policy 0, policy_version 52760 (0.0008) +[2023-10-14 07:15:55,398][100917] Updated weights for policy 1, policy_version 52832 (0.0008) +[2023-10-14 07:15:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 108134400. Throughput: 0: 1652.0, 1: 1631.2. Samples: 27040228. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) +[2023-10-14 07:15:58,512][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:15:59,246][100936] Updated weights for policy 0, policy_version 52770 (0.0008) +[2023-10-14 07:15:59,603][100936] Updated weights for policy 0, policy_version 52780 (0.0009) +[2023-10-14 07:15:59,606][100917] Updated weights for policy 1, policy_version 52842 (0.0008) +[2023-10-14 07:15:59,970][100936] Updated weights for policy 0, policy_version 52790 (0.0008) +[2023-10-14 07:15:59,976][100917] Updated weights for policy 1, policy_version 52852 (0.0007) +[2023-10-14 07:16:00,339][100936] Updated weights for policy 0, policy_version 52800 (0.0008) +[2023-10-14 07:16:00,353][100917] Updated weights for policy 1, policy_version 52862 (0.0007) +[2023-10-14 07:16:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 108199936. Throughput: 0: 1647.6, 1: 1635.9. Samples: 27060364. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) +[2023-10-14 07:16:03,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:16:04,546][100917] Updated weights for policy 1, policy_version 52872 (0.0008) +[2023-10-14 07:16:04,695][100936] Updated weights for policy 0, policy_version 52810 (0.0008) +[2023-10-14 07:16:04,914][100917] Updated weights for policy 1, policy_version 52882 (0.0009) +[2023-10-14 07:16:05,054][100936] Updated weights for policy 0, policy_version 52820 (0.0009) +[2023-10-14 07:16:05,296][100917] Updated weights for policy 1, policy_version 52892 (0.0007) +[2023-10-14 07:16:05,422][100936] Updated weights for policy 0, policy_version 52830 (0.0008) +[2023-10-14 07:16:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 108265472. Throughput: 0: 1655.1, 1: 1634.2. Samples: 27080622. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) +[2023-10-14 07:16:08,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:16:09,443][100936] Updated weights for policy 0, policy_version 52840 (0.0008) +[2023-10-14 07:16:09,501][100917] Updated weights for policy 1, policy_version 52902 (0.0008) +[2023-10-14 07:16:09,811][100936] Updated weights for policy 0, policy_version 52850 (0.0008) +[2023-10-14 07:16:09,876][100917] Updated weights for policy 1, policy_version 52912 (0.0007) +[2023-10-14 07:16:10,187][100936] Updated weights for policy 0, policy_version 52860 (0.0010) +[2023-10-14 07:16:10,251][100917] Updated weights for policy 1, policy_version 52922 (0.0008) +[2023-10-14 07:16:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 108331008. Throughput: 0: 1655.8, 1: 1635.2. Samples: 27089370. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 07:16:13,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:16:14,251][100917] Updated weights for policy 1, policy_version 52932 (0.0007) +[2023-10-14 07:16:14,449][100936] Updated weights for policy 0, policy_version 52870 (0.0010) +[2023-10-14 07:16:14,623][100917] Updated weights for policy 1, policy_version 52942 (0.0009) +[2023-10-14 07:16:14,815][100936] Updated weights for policy 0, policy_version 52880 (0.0008) +[2023-10-14 07:16:15,002][100917] Updated weights for policy 1, policy_version 52952 (0.0007) +[2023-10-14 07:16:15,178][100936] Updated weights for policy 0, policy_version 52890 (0.0008) +[2023-10-14 07:16:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 108396544. Throughput: 0: 1657.5, 1: 1637.8. Samples: 27109638. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 07:16:18,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:16:19,277][100917] Updated weights for policy 1, policy_version 52962 (0.0007) +[2023-10-14 07:16:19,364][100936] Updated weights for policy 0, policy_version 52900 (0.0009) +[2023-10-14 07:16:19,647][100917] Updated weights for policy 1, policy_version 52972 (0.0009) +[2023-10-14 07:16:19,726][100936] Updated weights for policy 0, policy_version 52910 (0.0009) +[2023-10-14 07:16:20,016][100917] Updated weights for policy 1, policy_version 52982 (0.0007) +[2023-10-14 07:16:20,102][100936] Updated weights for policy 0, policy_version 52920 (0.0007) +[2023-10-14 07:16:20,391][100917] Updated weights for policy 1, policy_version 52992 (0.0007) +[2023-10-14 07:16:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 108462080. Throughput: 0: 1651.7, 1: 1635.8. Samples: 27129662. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 07:16:23,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 07:16:24,239][100936] Updated weights for policy 0, policy_version 52930 (0.0008) +[2023-10-14 07:16:24,612][100936] Updated weights for policy 0, policy_version 52940 (0.0009) +[2023-10-14 07:16:24,613][100917] Updated weights for policy 1, policy_version 53002 (0.0009) +[2023-10-14 07:16:24,981][100936] Updated weights for policy 0, policy_version 52950 (0.0009) +[2023-10-14 07:16:24,985][100917] Updated weights for policy 1, policy_version 53012 (0.0008) +[2023-10-14 07:16:25,348][100936] Updated weights for policy 0, policy_version 52960 (0.0008) +[2023-10-14 07:16:25,355][100917] Updated weights for policy 1, policy_version 53022 (0.0008) +[2023-10-14 07:16:28,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 108527616. Throughput: 0: 1650.3, 1: 1631.3. Samples: 27138390. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 07:16:28,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 07:16:29,532][100917] Updated weights for policy 1, policy_version 53032 (0.0011) +[2023-10-14 07:16:29,573][100936] Updated weights for policy 0, policy_version 52970 (0.0009) +[2023-10-14 07:16:29,896][100917] Updated weights for policy 1, policy_version 53042 (0.0008) +[2023-10-14 07:16:29,944][100936] Updated weights for policy 0, policy_version 52980 (0.0010) +[2023-10-14 07:16:30,265][100917] Updated weights for policy 1, policy_version 53052 (0.0010) +[2023-10-14 07:16:30,320][100936] Updated weights for policy 0, policy_version 52990 (0.0010) +[2023-10-14 07:16:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 108593152. Throughput: 0: 1643.6, 1: 1631.2. Samples: 27158712. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 07:16:33,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 07:16:34,342][100917] Updated weights for policy 1, policy_version 53062 (0.0007) +[2023-10-14 07:16:34,442][100936] Updated weights for policy 0, policy_version 53000 (0.0008) +[2023-10-14 07:16:34,713][100917] Updated weights for policy 1, policy_version 53072 (0.0010) +[2023-10-14 07:16:34,808][100936] Updated weights for policy 0, policy_version 53010 (0.0007) +[2023-10-14 07:16:35,093][100917] Updated weights for policy 1, policy_version 53082 (0.0009) +[2023-10-14 07:16:35,170][100936] Updated weights for policy 0, policy_version 53020 (0.0008) +[2023-10-14 07:16:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 108658688. Throughput: 0: 1641.1, 1: 1639.7. Samples: 27179164. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 07:16:38,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 07:16:39,114][100917] Updated weights for policy 1, policy_version 53092 (0.0009) +[2023-10-14 07:16:39,295][100936] Updated weights for policy 0, policy_version 53030 (0.0007) +[2023-10-14 07:16:39,493][100917] Updated weights for policy 1, policy_version 53102 (0.0008) +[2023-10-14 07:16:39,672][100936] Updated weights for policy 0, policy_version 53040 (0.0012) +[2023-10-14 07:16:39,858][100917] Updated weights for policy 1, policy_version 53112 (0.0009) +[2023-10-14 07:16:40,035][100936] Updated weights for policy 0, policy_version 53050 (0.0009) +[2023-10-14 07:16:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.1, 300 sec: 13107.2). Total num frames: 108724224. Throughput: 0: 1640.7, 1: 1646.2. Samples: 27188140. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 07:16:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:16:43,946][100917] Updated weights for policy 1, policy_version 53122 (0.0009) +[2023-10-14 07:16:44,320][100936] Updated weights for policy 0, policy_version 53060 (0.0009) +[2023-10-14 07:16:44,324][100917] Updated weights for policy 1, policy_version 53132 (0.0010) +[2023-10-14 07:16:44,695][100917] Updated weights for policy 1, policy_version 53142 (0.0009) +[2023-10-14 07:16:44,696][100936] Updated weights for policy 0, policy_version 53070 (0.0007) +[2023-10-14 07:16:45,063][100917] Updated weights for policy 1, policy_version 53152 (0.0008) +[2023-10-14 07:16:45,065][100936] Updated weights for policy 0, policy_version 53080 (0.0008) +[2023-10-14 07:16:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 108789760. Throughput: 0: 1639.5, 1: 1650.3. Samples: 27208404. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:16:48,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:16:49,371][100936] Updated weights for policy 0, policy_version 53090 (0.0008) +[2023-10-14 07:16:49,396][100917] Updated weights for policy 1, policy_version 53162 (0.0008) +[2023-10-14 07:16:49,767][100917] Updated weights for policy 1, policy_version 53172 (0.0008) +[2023-10-14 07:16:49,773][100936] Updated weights for policy 0, policy_version 53100 (0.0007) +[2023-10-14 07:16:50,134][100917] Updated weights for policy 1, policy_version 53182 (0.0007) +[2023-10-14 07:16:50,148][100936] Updated weights for policy 0, policy_version 53110 (0.0007) +[2023-10-14 07:16:50,517][100936] Updated weights for policy 0, policy_version 53120 (0.0009) +[2023-10-14 07:16:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 108855296. Throughput: 0: 1636.2, 1: 1647.6. Samples: 27228396. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:16:53,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:16:54,422][100917] Updated weights for policy 1, policy_version 53192 (0.0007) +[2023-10-14 07:16:54,671][100936] Updated weights for policy 0, policy_version 53130 (0.0008) +[2023-10-14 07:16:54,792][100917] Updated weights for policy 1, policy_version 53202 (0.0009) +[2023-10-14 07:16:55,050][100936] Updated weights for policy 0, policy_version 53140 (0.0007) +[2023-10-14 07:16:55,175][100917] Updated weights for policy 1, policy_version 53212 (0.0010) +[2023-10-14 07:16:55,416][100936] Updated weights for policy 0, policy_version 53150 (0.0009) +[2023-10-14 07:16:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 108920832. Throughput: 0: 1638.4, 1: 1652.0. Samples: 27237440. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:16:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:16:59,297][100917] Updated weights for policy 1, policy_version 53222 (0.0009) +[2023-10-14 07:16:59,669][100917] Updated weights for policy 1, policy_version 53232 (0.0007) +[2023-10-14 07:16:59,722][100936] Updated weights for policy 0, policy_version 53160 (0.0007) +[2023-10-14 07:17:00,041][100917] Updated weights for policy 1, policy_version 53242 (0.0007) +[2023-10-14 07:17:00,091][100936] Updated weights for policy 0, policy_version 53170 (0.0008) +[2023-10-14 07:17:00,462][100936] Updated weights for policy 0, policy_version 53180 (0.0008) +[2023-10-14 07:17:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 108986368. Throughput: 0: 1639.2, 1: 1656.7. Samples: 27257950. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:17:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:17:03,847][100917] Updated weights for policy 1, policy_version 53252 (0.0011) +[2023-10-14 07:17:04,234][100917] Updated weights for policy 1, policy_version 53262 (0.0010) +[2023-10-14 07:17:04,529][100936] Updated weights for policy 0, policy_version 53190 (0.0008) +[2023-10-14 07:17:04,612][100917] Updated weights for policy 1, policy_version 53272 (0.0007) +[2023-10-14 07:17:04,896][100936] Updated weights for policy 0, policy_version 53200 (0.0009) +[2023-10-14 07:17:05,260][100936] Updated weights for policy 0, policy_version 53210 (0.0008) +[2023-10-14 07:17:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 109051904. Throughput: 0: 1637.8, 1: 1665.5. Samples: 27278310. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:17:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:17:08,653][100917] Updated weights for policy 1, policy_version 53282 (0.0010) +[2023-10-14 07:17:09,040][100917] Updated weights for policy 1, policy_version 53292 (0.0008) +[2023-10-14 07:17:09,412][100917] Updated weights for policy 1, policy_version 53302 (0.0007) +[2023-10-14 07:17:09,498][100936] Updated weights for policy 0, policy_version 53220 (0.0009) +[2023-10-14 07:17:09,784][100917] Updated weights for policy 1, policy_version 53312 (0.0007) +[2023-10-14 07:17:09,862][100936] Updated weights for policy 0, policy_version 53230 (0.0007) +[2023-10-14 07:17:10,231][100936] Updated weights for policy 0, policy_version 53240 (0.0009) +[2023-10-14 07:17:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 109117440. Throughput: 0: 1636.8, 1: 1671.3. Samples: 27287258. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:17:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:17:13,854][100917] Updated weights for policy 1, policy_version 53322 (0.0009) +[2023-10-14 07:17:14,236][100917] Updated weights for policy 1, policy_version 53332 (0.0011) +[2023-10-14 07:17:14,473][100936] Updated weights for policy 0, policy_version 53250 (0.0008) +[2023-10-14 07:17:14,597][100917] Updated weights for policy 1, policy_version 53342 (0.0009) +[2023-10-14 07:17:14,844][100936] Updated weights for policy 0, policy_version 53260 (0.0010) +[2023-10-14 07:17:15,217][100936] Updated weights for policy 0, policy_version 53270 (0.0009) +[2023-10-14 07:17:15,596][100936] Updated weights for policy 0, policy_version 53280 (0.0009) +[2023-10-14 07:17:18,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 109182976. Throughput: 0: 1635.4, 1: 1670.1. Samples: 27307460. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:17:18,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:17:18,754][100917] Updated weights for policy 1, policy_version 53352 (0.0008) +[2023-10-14 07:17:19,125][100917] Updated weights for policy 1, policy_version 53362 (0.0009) +[2023-10-14 07:17:19,500][100917] Updated weights for policy 1, policy_version 53372 (0.0008) +[2023-10-14 07:17:19,750][100936] Updated weights for policy 0, policy_version 53290 (0.0010) +[2023-10-14 07:17:20,126][100936] Updated weights for policy 0, policy_version 53300 (0.0011) +[2023-10-14 07:17:20,500][100936] Updated weights for policy 0, policy_version 53310 (0.0010) +[2023-10-14 07:17:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 109248512. Throughput: 0: 1635.3, 1: 1669.7. Samples: 27327890. Policy #0 lag: (min: 10.0, avg: 18.0, max: 42.0) +[2023-10-14 07:17:23,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:17:23,582][100917] Updated weights for policy 1, policy_version 53382 (0.0009) +[2023-10-14 07:17:23,956][100917] Updated weights for policy 1, policy_version 53392 (0.0009) +[2023-10-14 07:17:24,328][100917] Updated weights for policy 1, policy_version 53402 (0.0008) +[2023-10-14 07:17:24,606][100936] Updated weights for policy 0, policy_version 53320 (0.0007) +[2023-10-14 07:17:24,973][100936] Updated weights for policy 0, policy_version 53330 (0.0008) +[2023-10-14 07:17:25,349][100936] Updated weights for policy 0, policy_version 53340 (0.0007) +[2023-10-14 07:17:28,504][100917] Updated weights for policy 1, policy_version 53412 (0.0008) +[2023-10-14 07:17:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 109314048. Throughput: 0: 1639.8, 1: 1666.3. Samples: 27336914. Policy #0 lag: (min: 10.0, avg: 18.0, max: 42.0) +[2023-10-14 07:17:28,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:17:28,883][100917] Updated weights for policy 1, policy_version 53422 (0.0009) +[2023-10-14 07:17:29,257][100917] Updated weights for policy 1, policy_version 53432 (0.0007) +[2023-10-14 07:17:29,464][100936] Updated weights for policy 0, policy_version 53350 (0.0007) +[2023-10-14 07:17:29,834][100936] Updated weights for policy 0, policy_version 53360 (0.0008) +[2023-10-14 07:17:30,204][100936] Updated weights for policy 0, policy_version 53370 (0.0007) +[2023-10-14 07:17:33,363][100917] Updated weights for policy 1, policy_version 53442 (0.0007) +[2023-10-14 07:17:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 109379584. Throughput: 0: 1643.2, 1: 1668.1. Samples: 27357416. Policy #0 lag: (min: 10.0, avg: 18.0, max: 42.0) +[2023-10-14 07:17:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:17:33,723][100917] Updated weights for policy 1, policy_version 53452 (0.0008) +[2023-10-14 07:17:34,091][100917] Updated weights for policy 1, policy_version 53462 (0.0010) +[2023-10-14 07:17:34,259][100936] Updated weights for policy 0, policy_version 53380 (0.0008) +[2023-10-14 07:17:34,470][100917] Updated weights for policy 1, policy_version 53472 (0.0010) +[2023-10-14 07:17:34,650][100936] Updated weights for policy 0, policy_version 53390 (0.0008) +[2023-10-14 07:17:35,015][100936] Updated weights for policy 0, policy_version 53400 (0.0008) +[2023-10-14 07:17:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 109445120. Throughput: 0: 1639.7, 1: 1675.8. Samples: 27377592. Policy #0 lag: (min: 10.0, avg: 18.0, max: 42.0) +[2023-10-14 07:17:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:17:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000053408_54689792.pth... +[2023-10-14 07:17:38,553][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000051872_53116928.pth +[2023-10-14 07:17:38,635][100917] Updated weights for policy 1, policy_version 53482 (0.0008) +[2023-10-14 07:17:39,004][100917] Updated weights for policy 1, policy_version 53492 (0.0008) +[2023-10-14 07:17:39,220][100936] Updated weights for policy 0, policy_version 53410 (0.0010) +[2023-10-14 07:17:39,383][100917] Updated weights for policy 1, policy_version 53502 (0.0007) +[2023-10-14 07:17:39,458][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000053504_54788096.pth... +[2023-10-14 07:17:39,487][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000051936_53182464.pth +[2023-10-14 07:17:39,585][100936] Updated weights for policy 0, policy_version 53420 (0.0008) +[2023-10-14 07:17:39,950][100936] Updated weights for policy 0, policy_version 53430 (0.0009) +[2023-10-14 07:17:40,310][100936] Updated weights for policy 0, policy_version 53440 (0.0007) +[2023-10-14 07:17:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 109510656. Throughput: 0: 1637.8, 1: 1672.9. Samples: 27386424. Policy #0 lag: (min: 10.0, avg: 18.0, max: 42.0) +[2023-10-14 07:17:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:17:43,634][100917] Updated weights for policy 1, policy_version 53512 (0.0007) +[2023-10-14 07:17:44,013][100917] Updated weights for policy 1, policy_version 53522 (0.0009) +[2023-10-14 07:17:44,373][100917] Updated weights for policy 1, policy_version 53532 (0.0008) +[2023-10-14 07:17:44,486][100936] Updated weights for policy 0, policy_version 53450 (0.0008) +[2023-10-14 07:17:44,860][100936] Updated weights for policy 0, policy_version 53460 (0.0009) +[2023-10-14 07:17:45,222][100936] Updated weights for policy 0, policy_version 53470 (0.0008) +[2023-10-14 07:17:48,432][100917] Updated weights for policy 1, policy_version 53542 (0.0011) +[2023-10-14 07:17:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 109576192. Throughput: 0: 1636.1, 1: 1668.4. Samples: 27406652. Policy #0 lag: (min: 10.0, avg: 18.0, max: 42.0) +[2023-10-14 07:17:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:17:48,800][100917] Updated weights for policy 1, policy_version 53552 (0.0008) +[2023-10-14 07:17:49,178][100917] Updated weights for policy 1, policy_version 53562 (0.0007) +[2023-10-14 07:17:49,393][100936] Updated weights for policy 0, policy_version 53480 (0.0009) +[2023-10-14 07:17:49,765][100936] Updated weights for policy 0, policy_version 53490 (0.0009) +[2023-10-14 07:17:50,128][100936] Updated weights for policy 0, policy_version 53500 (0.0008) +[2023-10-14 07:17:53,244][100917] Updated weights for policy 1, policy_version 53572 (0.0009) +[2023-10-14 07:17:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 109641728. Throughput: 0: 1639.7, 1: 1663.7. Samples: 27426962. Policy #0 lag: (min: 10.0, avg: 18.0, max: 42.0) +[2023-10-14 07:17:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:17:53,616][100917] Updated weights for policy 1, policy_version 53582 (0.0009) +[2023-10-14 07:17:53,989][100917] Updated weights for policy 1, policy_version 53592 (0.0009) +[2023-10-14 07:17:54,248][100936] Updated weights for policy 0, policy_version 53510 (0.0008) +[2023-10-14 07:17:54,618][100936] Updated weights for policy 0, policy_version 53520 (0.0010) +[2023-10-14 07:17:54,987][100936] Updated weights for policy 0, policy_version 53530 (0.0007) +[2023-10-14 07:17:58,182][100917] Updated weights for policy 1, policy_version 53602 (0.0009) +[2023-10-14 07:17:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 109707264. Throughput: 0: 1641.2, 1: 1663.2. Samples: 27435952. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 07:17:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:17:58,555][100917] Updated weights for policy 1, policy_version 53612 (0.0010) +[2023-10-14 07:17:58,922][100917] Updated weights for policy 1, policy_version 53622 (0.0009) +[2023-10-14 07:17:59,275][100936] Updated weights for policy 0, policy_version 53540 (0.0008) +[2023-10-14 07:17:59,295][100917] Updated weights for policy 1, policy_version 53632 (0.0008) +[2023-10-14 07:17:59,652][100936] Updated weights for policy 0, policy_version 53550 (0.0008) +[2023-10-14 07:18:00,020][100936] Updated weights for policy 0, policy_version 53560 (0.0007) +[2023-10-14 07:18:03,409][100917] Updated weights for policy 1, policy_version 53642 (0.0009) +[2023-10-14 07:18:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 109772800. Throughput: 0: 1640.8, 1: 1669.0. Samples: 27456398. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 07:18:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:18:03,790][100917] Updated weights for policy 1, policy_version 53652 (0.0008) +[2023-10-14 07:18:04,093][100936] Updated weights for policy 0, policy_version 53570 (0.0009) +[2023-10-14 07:18:04,153][100917] Updated weights for policy 1, policy_version 53662 (0.0007) +[2023-10-14 07:18:04,466][100936] Updated weights for policy 0, policy_version 53580 (0.0009) +[2023-10-14 07:18:04,836][100936] Updated weights for policy 0, policy_version 53590 (0.0009) +[2023-10-14 07:18:05,197][100936] Updated weights for policy 0, policy_version 53600 (0.0009) +[2023-10-14 07:18:08,340][100917] Updated weights for policy 1, policy_version 53672 (0.0008) +[2023-10-14 07:18:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 109838336. Throughput: 0: 1645.2, 1: 1663.0. Samples: 27476758. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 07:18:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:18:08,711][100917] Updated weights for policy 1, policy_version 53682 (0.0007) +[2023-10-14 07:18:09,079][100917] Updated weights for policy 1, policy_version 53692 (0.0009) +[2023-10-14 07:18:09,434][100936] Updated weights for policy 0, policy_version 53610 (0.0008) +[2023-10-14 07:18:09,804][100936] Updated weights for policy 0, policy_version 53620 (0.0010) +[2023-10-14 07:18:10,180][100936] Updated weights for policy 0, policy_version 53630 (0.0010) +[2023-10-14 07:18:13,217][100917] Updated weights for policy 1, policy_version 53702 (0.0008) +[2023-10-14 07:18:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 109903872. Throughput: 0: 1642.5, 1: 1661.7. Samples: 27485606. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 07:18:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:18:13,596][100917] Updated weights for policy 1, policy_version 53712 (0.0008) +[2023-10-14 07:18:13,969][100917] Updated weights for policy 1, policy_version 53722 (0.0008) +[2023-10-14 07:18:14,460][100936] Updated weights for policy 0, policy_version 53640 (0.0011) +[2023-10-14 07:18:14,834][100936] Updated weights for policy 0, policy_version 53650 (0.0009) +[2023-10-14 07:18:15,204][100936] Updated weights for policy 0, policy_version 53660 (0.0009) +[2023-10-14 07:18:17,944][100917] Updated weights for policy 1, policy_version 53732 (0.0008) +[2023-10-14 07:18:18,326][100917] Updated weights for policy 1, policy_version 53742 (0.0010) +[2023-10-14 07:18:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 109969408. Throughput: 0: 1639.1, 1: 1659.1. Samples: 27505834. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 07:18:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:18:18,697][100917] Updated weights for policy 1, policy_version 53752 (0.0009) +[2023-10-14 07:18:19,410][100936] Updated weights for policy 0, policy_version 53670 (0.0009) +[2023-10-14 07:18:19,795][100936] Updated weights for policy 0, policy_version 53680 (0.0008) +[2023-10-14 07:18:20,160][100936] Updated weights for policy 0, policy_version 53690 (0.0007) +[2023-10-14 07:18:22,937][100917] Updated weights for policy 1, policy_version 53762 (0.0008) +[2023-10-14 07:18:23,301][100917] Updated weights for policy 1, policy_version 53772 (0.0010) +[2023-10-14 07:18:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 110034944. Throughput: 0: 1646.5, 1: 1653.5. Samples: 27526094. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 07:18:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:18:23,683][100917] Updated weights for policy 1, policy_version 53782 (0.0010) +[2023-10-14 07:18:24,045][100917] Updated weights for policy 1, policy_version 53792 (0.0007) +[2023-10-14 07:18:24,173][100936] Updated weights for policy 0, policy_version 53700 (0.0009) +[2023-10-14 07:18:24,543][100936] Updated weights for policy 0, policy_version 53710 (0.0009) +[2023-10-14 07:18:24,917][100936] Updated weights for policy 0, policy_version 53720 (0.0009) +[2023-10-14 07:18:28,239][100917] Updated weights for policy 1, policy_version 53802 (0.0009) +[2023-10-14 07:18:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 110100480. Throughput: 0: 1647.2, 1: 1654.6. Samples: 27535008. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 07:18:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:18:28,613][100917] Updated weights for policy 1, policy_version 53812 (0.0009) +[2023-10-14 07:18:28,943][100936] Updated weights for policy 0, policy_version 53730 (0.0009) +[2023-10-14 07:18:28,988][100917] Updated weights for policy 1, policy_version 53822 (0.0007) +[2023-10-14 07:18:29,316][100936] Updated weights for policy 0, policy_version 53740 (0.0007) +[2023-10-14 07:18:29,682][100936] Updated weights for policy 0, policy_version 53750 (0.0008) +[2023-10-14 07:18:30,057][100936] Updated weights for policy 0, policy_version 53760 (0.0008) +[2023-10-14 07:18:32,945][100917] Updated weights for policy 1, policy_version 53832 (0.0007) +[2023-10-14 07:18:33,316][100917] Updated weights for policy 1, policy_version 53842 (0.0008) +[2023-10-14 07:18:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 110166016. Throughput: 0: 1651.3, 1: 1655.4. Samples: 27555452. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) +[2023-10-14 07:18:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:18:33,687][100917] Updated weights for policy 1, policy_version 53852 (0.0007) +[2023-10-14 07:18:34,162][100936] Updated weights for policy 0, policy_version 53770 (0.0011) +[2023-10-14 07:18:34,535][100936] Updated weights for policy 0, policy_version 53780 (0.0010) +[2023-10-14 07:18:34,908][100936] Updated weights for policy 0, policy_version 53790 (0.0009) +[2023-10-14 07:18:37,904][100917] Updated weights for policy 1, policy_version 53862 (0.0009) +[2023-10-14 07:18:38,282][100917] Updated weights for policy 1, policy_version 53872 (0.0010) +[2023-10-14 07:18:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 110231552. Throughput: 0: 1653.7, 1: 1652.2. Samples: 27575728. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) +[2023-10-14 07:18:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:18:38,662][100917] Updated weights for policy 1, policy_version 53882 (0.0009) +[2023-10-14 07:18:39,033][100936] Updated weights for policy 0, policy_version 53800 (0.0009) +[2023-10-14 07:18:39,411][100936] Updated weights for policy 0, policy_version 53810 (0.0009) +[2023-10-14 07:18:39,787][100936] Updated weights for policy 0, policy_version 53820 (0.0009) +[2023-10-14 07:18:42,841][100917] Updated weights for policy 1, policy_version 53892 (0.0009) +[2023-10-14 07:18:43,220][100917] Updated weights for policy 1, policy_version 53902 (0.0007) +[2023-10-14 07:18:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 110297088. Throughput: 0: 1652.2, 1: 1656.2. Samples: 27584830. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) +[2023-10-14 07:18:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:18:43,585][100917] Updated weights for policy 1, policy_version 53912 (0.0008) +[2023-10-14 07:18:44,191][100936] Updated weights for policy 0, policy_version 53830 (0.0009) +[2023-10-14 07:18:44,565][100936] Updated weights for policy 0, policy_version 53840 (0.0009) +[2023-10-14 07:18:44,936][100936] Updated weights for policy 0, policy_version 53850 (0.0009) +[2023-10-14 07:18:47,608][100917] Updated weights for policy 1, policy_version 53922 (0.0007) +[2023-10-14 07:18:47,983][100917] Updated weights for policy 1, policy_version 53932 (0.0008) +[2023-10-14 07:18:48,354][100917] Updated weights for policy 1, policy_version 53942 (0.0009) +[2023-10-14 07:18:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 110362624. Throughput: 0: 1653.1, 1: 1657.4. Samples: 27605372. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) +[2023-10-14 07:18:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:18:48,718][100917] Updated weights for policy 1, policy_version 53952 (0.0009) +[2023-10-14 07:18:49,075][100936] Updated weights for policy 0, policy_version 53860 (0.0008) +[2023-10-14 07:18:49,449][100936] Updated weights for policy 0, policy_version 53870 (0.0007) +[2023-10-14 07:18:49,817][100936] Updated weights for policy 0, policy_version 53880 (0.0007) +[2023-10-14 07:18:52,860][100917] Updated weights for policy 1, policy_version 53962 (0.0007) +[2023-10-14 07:18:53,233][100917] Updated weights for policy 1, policy_version 53972 (0.0007) +[2023-10-14 07:18:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 110428160. Throughput: 0: 1656.4, 1: 1649.5. Samples: 27625520. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) +[2023-10-14 07:18:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:18:53,596][100917] Updated weights for policy 1, policy_version 53982 (0.0009) +[2023-10-14 07:18:53,786][100936] Updated weights for policy 0, policy_version 53890 (0.0008) +[2023-10-14 07:18:54,149][100936] Updated weights for policy 0, policy_version 53900 (0.0011) +[2023-10-14 07:18:54,529][100936] Updated weights for policy 0, policy_version 53910 (0.0010) +[2023-10-14 07:18:54,894][100936] Updated weights for policy 0, policy_version 53920 (0.0010) +[2023-10-14 07:18:57,742][100917] Updated weights for policy 1, policy_version 53992 (0.0007) +[2023-10-14 07:18:58,106][100917] Updated weights for policy 1, policy_version 54002 (0.0011) +[2023-10-14 07:18:58,485][100917] Updated weights for policy 1, policy_version 54012 (0.0009) +[2023-10-14 07:18:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 110493696. Throughput: 0: 1657.5, 1: 1659.9. Samples: 27634888. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) +[2023-10-14 07:18:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:18:59,081][100936] Updated weights for policy 0, policy_version 53930 (0.0010) +[2023-10-14 07:18:59,446][100936] Updated weights for policy 0, policy_version 53940 (0.0008) +[2023-10-14 07:18:59,826][100936] Updated weights for policy 0, policy_version 53950 (0.0010) +[2023-10-14 07:19:02,552][100917] Updated weights for policy 1, policy_version 54022 (0.0007) +[2023-10-14 07:19:02,929][100917] Updated weights for policy 1, policy_version 54032 (0.0007) +[2023-10-14 07:19:03,304][100917] Updated weights for policy 1, policy_version 54042 (0.0009) +[2023-10-14 07:19:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 110559232. Throughput: 0: 1659.5, 1: 1657.0. Samples: 27655074. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) +[2023-10-14 07:19:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:19:03,862][100936] Updated weights for policy 0, policy_version 53960 (0.0009) +[2023-10-14 07:19:04,231][100936] Updated weights for policy 0, policy_version 53970 (0.0007) +[2023-10-14 07:19:04,598][100936] Updated weights for policy 0, policy_version 53980 (0.0008) +[2023-10-14 07:19:07,453][100917] Updated weights for policy 1, policy_version 54052 (0.0010) +[2023-10-14 07:19:07,812][100917] Updated weights for policy 1, policy_version 54062 (0.0009) +[2023-10-14 07:19:08,183][100917] Updated weights for policy 1, policy_version 54072 (0.0009) +[2023-10-14 07:19:08,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 110657536. Throughput: 0: 1660.5, 1: 1645.1. Samples: 27674846. Policy #0 lag: (min: 31.0, avg: 31.3, max: 42.0) +[2023-10-14 07:19:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:19:08,659][100936] Updated weights for policy 0, policy_version 53990 (0.0009) +[2023-10-14 07:19:09,026][100936] Updated weights for policy 0, policy_version 54000 (0.0011) +[2023-10-14 07:19:09,399][100936] Updated weights for policy 0, policy_version 54010 (0.0008) +[2023-10-14 07:19:12,220][100917] Updated weights for policy 1, policy_version 54082 (0.0008) +[2023-10-14 07:19:12,599][100917] Updated weights for policy 1, policy_version 54092 (0.0008) +[2023-10-14 07:19:12,963][100917] Updated weights for policy 1, policy_version 54102 (0.0008) +[2023-10-14 07:19:13,344][100917] Updated weights for policy 1, policy_version 54112 (0.0010) +[2023-10-14 07:19:13,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 110723072. Throughput: 0: 1657.2, 1: 1660.9. Samples: 27684322. Policy #0 lag: (min: 31.0, avg: 31.3, max: 42.0) +[2023-10-14 07:19:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:19:13,626][100936] Updated weights for policy 0, policy_version 54020 (0.0009) +[2023-10-14 07:19:13,984][100936] Updated weights for policy 0, policy_version 54030 (0.0008) +[2023-10-14 07:19:14,357][100936] Updated weights for policy 0, policy_version 54040 (0.0007) +[2023-10-14 07:19:17,569][100917] Updated weights for policy 1, policy_version 54122 (0.0008) +[2023-10-14 07:19:17,953][100917] Updated weights for policy 1, policy_version 54132 (0.0007) +[2023-10-14 07:19:18,327][100917] Updated weights for policy 1, policy_version 54142 (0.0009) +[2023-10-14 07:19:18,423][100936] Updated weights for policy 0, policy_version 54050 (0.0008) +[2023-10-14 07:19:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 110788608. Throughput: 0: 1660.3, 1: 1660.0. Samples: 27704862. Policy #0 lag: (min: 31.0, avg: 31.3, max: 42.0) +[2023-10-14 07:19:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:19:18,792][100936] Updated weights for policy 0, policy_version 54060 (0.0009) +[2023-10-14 07:19:19,162][100936] Updated weights for policy 0, policy_version 54070 (0.0009) +[2023-10-14 07:19:19,534][100936] Updated weights for policy 0, policy_version 54080 (0.0008) +[2023-10-14 07:19:22,424][100917] Updated weights for policy 1, policy_version 54152 (0.0007) +[2023-10-14 07:19:22,812][100917] Updated weights for policy 1, policy_version 54162 (0.0007) +[2023-10-14 07:19:23,187][100917] Updated weights for policy 1, policy_version 54172 (0.0007) +[2023-10-14 07:19:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 110854144. Throughput: 0: 1658.4, 1: 1643.3. Samples: 27724302. Policy #0 lag: (min: 31.0, avg: 31.3, max: 42.0) +[2023-10-14 07:19:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:19:23,958][100936] Updated weights for policy 0, policy_version 54090 (0.0008) +[2023-10-14 07:19:24,330][100936] Updated weights for policy 0, policy_version 54100 (0.0008) +[2023-10-14 07:19:24,697][100936] Updated weights for policy 0, policy_version 54110 (0.0008) +[2023-10-14 07:19:27,198][100917] Updated weights for policy 1, policy_version 54182 (0.0007) +[2023-10-14 07:19:27,566][100917] Updated weights for policy 1, policy_version 54192 (0.0008) +[2023-10-14 07:19:27,940][100917] Updated weights for policy 1, policy_version 54202 (0.0010) +[2023-10-14 07:19:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 110919680. Throughput: 0: 1656.2, 1: 1656.4. Samples: 27733898. Policy #0 lag: (min: 31.0, avg: 31.3, max: 42.0) +[2023-10-14 07:19:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:19:28,755][100936] Updated weights for policy 0, policy_version 54120 (0.0008) +[2023-10-14 07:19:29,131][100936] Updated weights for policy 0, policy_version 54130 (0.0008) +[2023-10-14 07:19:29,508][100936] Updated weights for policy 0, policy_version 54140 (0.0008) +[2023-10-14 07:19:32,000][100917] Updated weights for policy 1, policy_version 54212 (0.0009) +[2023-10-14 07:19:32,372][100917] Updated weights for policy 1, policy_version 54222 (0.0008) +[2023-10-14 07:19:32,751][100917] Updated weights for policy 1, policy_version 54232 (0.0009) +[2023-10-14 07:19:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 110985216. Throughput: 0: 1661.6, 1: 1648.6. Samples: 27754334. Policy #0 lag: (min: 31.0, avg: 31.3, max: 42.0) +[2023-10-14 07:19:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:19:33,545][100936] Updated weights for policy 0, policy_version 54150 (0.0010) +[2023-10-14 07:19:33,903][100936] Updated weights for policy 0, policy_version 54160 (0.0007) +[2023-10-14 07:19:34,278][100936] Updated weights for policy 0, policy_version 54170 (0.0008) +[2023-10-14 07:19:36,923][100917] Updated weights for policy 1, policy_version 54242 (0.0009) +[2023-10-14 07:19:37,300][100917] Updated weights for policy 1, policy_version 54252 (0.0009) +[2023-10-14 07:19:37,668][100917] Updated weights for policy 1, policy_version 54262 (0.0008) +[2023-10-14 07:19:38,044][100917] Updated weights for policy 1, policy_version 54272 (0.0007) +[2023-10-14 07:19:38,401][100936] Updated weights for policy 0, policy_version 54180 (0.0008) +[2023-10-14 07:19:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 111050752. Throughput: 0: 1656.4, 1: 1634.5. Samples: 27773610. Policy #0 lag: (min: 31.0, avg: 31.3, max: 42.0) +[2023-10-14 07:19:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:19:38,523][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000054272_55574528.pth... +[2023-10-14 07:19:38,562][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000052704_53968896.pth +[2023-10-14 07:19:38,567][100681] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p1/milestones/checkpoint_000054272_55574528.pth +[2023-10-14 07:19:38,768][100936] Updated weights for policy 0, policy_version 54190 (0.0010) +[2023-10-14 07:19:39,143][100936] Updated weights for policy 0, policy_version 54200 (0.0010) +[2023-10-14 07:19:39,436][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000054208_55508992.pth... +[2023-10-14 07:19:39,476][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000052640_53903360.pth +[2023-10-14 07:19:39,480][100560] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p0/milestones/checkpoint_000054208_55508992.pth +[2023-10-14 07:19:42,236][100917] Updated weights for policy 1, policy_version 54282 (0.0010) +[2023-10-14 07:19:42,608][100917] Updated weights for policy 1, policy_version 54292 (0.0010) +[2023-10-14 07:19:42,979][100917] Updated weights for policy 1, policy_version 54302 (0.0009) +[2023-10-14 07:19:43,352][100936] Updated weights for policy 0, policy_version 54210 (0.0009) +[2023-10-14 07:19:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 111116288. Throughput: 0: 1657.3, 1: 1651.4. Samples: 27783780. Policy #0 lag: (min: 27.0, avg: 33.1, max: 59.0) +[2023-10-14 07:19:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:19:43,725][100936] Updated weights for policy 0, policy_version 54220 (0.0010) +[2023-10-14 07:19:44,096][100936] Updated weights for policy 0, policy_version 54230 (0.0010) +[2023-10-14 07:19:44,470][100936] Updated weights for policy 0, policy_version 54240 (0.0011) +[2023-10-14 07:19:47,223][100917] Updated weights for policy 1, policy_version 54312 (0.0008) +[2023-10-14 07:19:47,597][100917] Updated weights for policy 1, policy_version 54322 (0.0009) +[2023-10-14 07:19:47,964][100917] Updated weights for policy 1, policy_version 54332 (0.0011) +[2023-10-14 07:19:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 111181824. Throughput: 0: 1652.8, 1: 1649.3. Samples: 27803666. Policy #0 lag: (min: 27.0, avg: 33.1, max: 59.0) +[2023-10-14 07:19:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:19:48,664][100936] Updated weights for policy 0, policy_version 54250 (0.0008) +[2023-10-14 07:19:49,038][100936] Updated weights for policy 0, policy_version 54260 (0.0008) +[2023-10-14 07:19:49,412][100936] Updated weights for policy 0, policy_version 54270 (0.0007) +[2023-10-14 07:19:52,400][100917] Updated weights for policy 1, policy_version 54342 (0.0009) +[2023-10-14 07:19:52,781][100917] Updated weights for policy 1, policy_version 54352 (0.0007) +[2023-10-14 07:19:53,163][100917] Updated weights for policy 1, policy_version 54362 (0.0009) +[2023-10-14 07:19:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 111247360. Throughput: 0: 1646.4, 1: 1639.7. Samples: 27822718. Policy #0 lag: (min: 27.0, avg: 33.1, max: 59.0) +[2023-10-14 07:19:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:19:53,705][100936] Updated weights for policy 0, policy_version 54280 (0.0010) +[2023-10-14 07:19:54,081][100936] Updated weights for policy 0, policy_version 54290 (0.0007) +[2023-10-14 07:19:54,455][100936] Updated weights for policy 0, policy_version 54300 (0.0007) +[2023-10-14 07:19:57,483][100917] Updated weights for policy 1, policy_version 54372 (0.0008) +[2023-10-14 07:19:57,844][100917] Updated weights for policy 1, policy_version 54382 (0.0008) +[2023-10-14 07:19:58,224][100917] Updated weights for policy 1, policy_version 54392 (0.0008) +[2023-10-14 07:19:58,512][99942] Fps is (10 sec: 9830.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 111280128. Throughput: 0: 1649.4, 1: 1641.8. Samples: 27832428. Policy #0 lag: (min: 27.0, avg: 33.1, max: 59.0) +[2023-10-14 07:19:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:19:58,623][100936] Updated weights for policy 0, policy_version 54310 (0.0009) +[2023-10-14 07:19:58,999][100936] Updated weights for policy 0, policy_version 54320 (0.0009) +[2023-10-14 07:19:59,367][100936] Updated weights for policy 0, policy_version 54330 (0.0010) +[2023-10-14 07:20:02,443][100917] Updated weights for policy 1, policy_version 54402 (0.0008) +[2023-10-14 07:20:02,852][100917] Updated weights for policy 1, policy_version 54412 (0.0007) +[2023-10-14 07:20:03,230][100917] Updated weights for policy 1, policy_version 54422 (0.0007) +[2023-10-14 07:20:03,473][100936] Updated weights for policy 0, policy_version 54340 (0.0009) +[2023-10-14 07:20:03,512][99942] Fps is (10 sec: 9830.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 111345664. Throughput: 0: 1643.2, 1: 1643.5. Samples: 27852762. Policy #0 lag: (min: 27.0, avg: 33.1, max: 59.0) +[2023-10-14 07:20:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:20:03,598][100917] Updated weights for policy 1, policy_version 54432 (0.0009) +[2023-10-14 07:20:03,844][100936] Updated weights for policy 0, policy_version 54350 (0.0007) +[2023-10-14 07:20:04,204][100936] Updated weights for policy 0, policy_version 54360 (0.0008) +[2023-10-14 07:20:07,714][100917] Updated weights for policy 1, policy_version 54442 (0.0010) +[2023-10-14 07:20:08,082][100917] Updated weights for policy 1, policy_version 54452 (0.0008) +[2023-10-14 07:20:08,268][100936] Updated weights for policy 0, policy_version 54370 (0.0008) +[2023-10-14 07:20:08,459][100917] Updated weights for policy 1, policy_version 54462 (0.0009) +[2023-10-14 07:20:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 12561.1, 300 sec: 13107.2). Total num frames: 111411200. Throughput: 0: 1640.7, 1: 1644.5. Samples: 27872134. Policy #0 lag: (min: 27.0, avg: 33.1, max: 59.0) +[2023-10-14 07:20:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:20:08,631][100936] Updated weights for policy 0, policy_version 54380 (0.0008) +[2023-10-14 07:20:09,006][100936] Updated weights for policy 0, policy_version 54390 (0.0009) +[2023-10-14 07:20:09,371][100936] Updated weights for policy 0, policy_version 54400 (0.0008) +[2023-10-14 07:20:12,571][100917] Updated weights for policy 1, policy_version 54472 (0.0010) +[2023-10-14 07:20:12,957][100917] Updated weights for policy 1, policy_version 54482 (0.0009) +[2023-10-14 07:20:13,331][100917] Updated weights for policy 1, policy_version 54492 (0.0009) +[2023-10-14 07:20:13,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 111509504. Throughput: 0: 1647.1, 1: 1641.3. Samples: 27881880. Policy #0 lag: (min: 27.0, avg: 33.1, max: 59.0) +[2023-10-14 07:20:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:20:13,602][100936] Updated weights for policy 0, policy_version 54410 (0.0009) +[2023-10-14 07:20:13,969][100936] Updated weights for policy 0, policy_version 54420 (0.0009) +[2023-10-14 07:20:14,337][100936] Updated weights for policy 0, policy_version 54430 (0.0009) +[2023-10-14 07:20:17,518][100917] Updated weights for policy 1, policy_version 54502 (0.0007) +[2023-10-14 07:20:17,894][100917] Updated weights for policy 1, policy_version 54512 (0.0007) +[2023-10-14 07:20:18,265][100917] Updated weights for policy 1, policy_version 54522 (0.0007) +[2023-10-14 07:20:18,323][100936] Updated weights for policy 0, policy_version 54440 (0.0009) +[2023-10-14 07:20:18,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 111575040. Throughput: 0: 1645.8, 1: 1642.9. Samples: 27902326. Policy #0 lag: (min: 25.0, avg: 35.2, max: 57.0) +[2023-10-14 07:20:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:20:18,695][100936] Updated weights for policy 0, policy_version 54450 (0.0008) +[2023-10-14 07:20:19,060][100936] Updated weights for policy 0, policy_version 54460 (0.0009) +[2023-10-14 07:20:22,376][100917] Updated weights for policy 1, policy_version 54532 (0.0008) +[2023-10-14 07:20:22,752][100917] Updated weights for policy 1, policy_version 54542 (0.0007) +[2023-10-14 07:20:23,132][100917] Updated weights for policy 1, policy_version 54552 (0.0007) +[2023-10-14 07:20:23,288][100936] Updated weights for policy 0, policy_version 54470 (0.0009) +[2023-10-14 07:20:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 111640576. Throughput: 0: 1635.7, 1: 1646.5. Samples: 27921310. Policy #0 lag: (min: 25.0, avg: 35.2, max: 57.0) +[2023-10-14 07:20:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:20:23,664][100936] Updated weights for policy 0, policy_version 54480 (0.0008) +[2023-10-14 07:20:24,032][100936] Updated weights for policy 0, policy_version 54490 (0.0008) +[2023-10-14 07:20:27,237][100917] Updated weights for policy 1, policy_version 54562 (0.0007) +[2023-10-14 07:20:27,605][100917] Updated weights for policy 1, policy_version 54572 (0.0007) +[2023-10-14 07:20:27,988][100917] Updated weights for policy 1, policy_version 54582 (0.0009) +[2023-10-14 07:20:28,088][100936] Updated weights for policy 0, policy_version 54500 (0.0009) +[2023-10-14 07:20:28,352][100917] Updated weights for policy 1, policy_version 54592 (0.0007) +[2023-10-14 07:20:28,456][100936] Updated weights for policy 0, policy_version 54510 (0.0007) +[2023-10-14 07:20:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 111706112. Throughput: 0: 1640.6, 1: 1638.4. Samples: 27931338. Policy #0 lag: (min: 25.0, avg: 35.2, max: 57.0) +[2023-10-14 07:20:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:20:28,827][100936] Updated weights for policy 0, policy_version 54520 (0.0010) +[2023-10-14 07:20:32,411][100917] Updated weights for policy 1, policy_version 54602 (0.0010) +[2023-10-14 07:20:32,783][100917] Updated weights for policy 1, policy_version 54612 (0.0009) +[2023-10-14 07:20:33,143][100936] Updated weights for policy 0, policy_version 54530 (0.0009) +[2023-10-14 07:20:33,148][100917] Updated weights for policy 1, policy_version 54622 (0.0010) +[2023-10-14 07:20:33,511][100936] Updated weights for policy 0, policy_version 54540 (0.0008) +[2023-10-14 07:20:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 111771648. Throughput: 0: 1642.4, 1: 1644.4. Samples: 27951572. Policy #0 lag: (min: 25.0, avg: 35.2, max: 57.0) +[2023-10-14 07:20:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:20:33,875][100936] Updated weights for policy 0, policy_version 54550 (0.0009) +[2023-10-14 07:20:34,242][100936] Updated weights for policy 0, policy_version 54560 (0.0008) +[2023-10-14 07:20:37,083][100917] Updated weights for policy 1, policy_version 54632 (0.0009) +[2023-10-14 07:20:37,455][100917] Updated weights for policy 1, policy_version 54642 (0.0008) +[2023-10-14 07:20:37,827][100917] Updated weights for policy 1, policy_version 54652 (0.0010) +[2023-10-14 07:20:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 111837184. Throughput: 0: 1639.4, 1: 1644.6. Samples: 27970498. Policy #0 lag: (min: 25.0, avg: 35.2, max: 57.0) +[2023-10-14 07:20:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:20:38,598][100936] Updated weights for policy 0, policy_version 54570 (0.0009) +[2023-10-14 07:20:38,959][100936] Updated weights for policy 0, policy_version 54580 (0.0007) +[2023-10-14 07:20:39,322][100936] Updated weights for policy 0, policy_version 54590 (0.0008) +[2023-10-14 07:20:41,992][100917] Updated weights for policy 1, policy_version 54662 (0.0007) +[2023-10-14 07:20:42,371][100917] Updated weights for policy 1, policy_version 54672 (0.0007) +[2023-10-14 07:20:42,754][100917] Updated weights for policy 1, policy_version 54682 (0.0009) +[2023-10-14 07:20:43,423][100936] Updated weights for policy 0, policy_version 54600 (0.0007) +[2023-10-14 07:20:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 111902720. Throughput: 0: 1645.4, 1: 1655.6. Samples: 27980972. Policy #0 lag: (min: 25.0, avg: 35.2, max: 57.0) +[2023-10-14 07:20:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:20:43,798][100936] Updated weights for policy 0, policy_version 54610 (0.0010) +[2023-10-14 07:20:44,165][100936] Updated weights for policy 0, policy_version 54620 (0.0009) +[2023-10-14 07:20:46,913][100917] Updated weights for policy 1, policy_version 54692 (0.0007) +[2023-10-14 07:20:47,306][100917] Updated weights for policy 1, policy_version 54702 (0.0007) +[2023-10-14 07:20:47,678][100917] Updated weights for policy 1, policy_version 54712 (0.0008) +[2023-10-14 07:20:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 111968256. Throughput: 0: 1639.2, 1: 1648.1. Samples: 28000694. Policy #0 lag: (min: 25.0, avg: 35.2, max: 57.0) +[2023-10-14 07:20:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:20:48,554][100936] Updated weights for policy 0, policy_version 54630 (0.0013) +[2023-10-14 07:20:48,918][100936] Updated weights for policy 0, policy_version 54640 (0.0007) +[2023-10-14 07:20:49,289][100936] Updated weights for policy 0, policy_version 54650 (0.0008) +[2023-10-14 07:20:51,787][100917] Updated weights for policy 1, policy_version 54722 (0.0008) +[2023-10-14 07:20:52,149][100917] Updated weights for policy 1, policy_version 54732 (0.0010) +[2023-10-14 07:20:52,537][100917] Updated weights for policy 1, policy_version 54742 (0.0009) +[2023-10-14 07:20:52,905][100917] Updated weights for policy 1, policy_version 54752 (0.0009) +[2023-10-14 07:20:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 112033792. Throughput: 0: 1637.7, 1: 1643.8. Samples: 28019802. Policy #0 lag: (min: 9.0, avg: 27.5, max: 41.0) +[2023-10-14 07:20:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:20:53,567][100936] Updated weights for policy 0, policy_version 54660 (0.0010) +[2023-10-14 07:20:53,938][100936] Updated weights for policy 0, policy_version 54670 (0.0010) +[2023-10-14 07:20:54,305][100936] Updated weights for policy 0, policy_version 54680 (0.0008) +[2023-10-14 07:20:56,974][100917] Updated weights for policy 1, policy_version 54762 (0.0007) +[2023-10-14 07:20:57,352][100917] Updated weights for policy 1, policy_version 54772 (0.0007) +[2023-10-14 07:20:57,716][100917] Updated weights for policy 1, policy_version 54782 (0.0010) +[2023-10-14 07:20:58,353][100936] Updated weights for policy 0, policy_version 54690 (0.0008) +[2023-10-14 07:20:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.2, 300 sec: 13218.3). Total num frames: 112099328. Throughput: 0: 1632.8, 1: 1659.0. Samples: 28030010. Policy #0 lag: (min: 9.0, avg: 27.5, max: 41.0) +[2023-10-14 07:20:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:20:58,722][100936] Updated weights for policy 0, policy_version 54700 (0.0009) +[2023-10-14 07:20:59,085][100936] Updated weights for policy 0, policy_version 54710 (0.0008) +[2023-10-14 07:20:59,464][100936] Updated weights for policy 0, policy_version 54720 (0.0011) +[2023-10-14 07:21:01,926][100917] Updated weights for policy 1, policy_version 54792 (0.0008) +[2023-10-14 07:21:02,295][100917] Updated weights for policy 1, policy_version 54802 (0.0009) +[2023-10-14 07:21:02,669][100917] Updated weights for policy 1, policy_version 54812 (0.0008) +[2023-10-14 07:21:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 112164864. Throughput: 0: 1635.5, 1: 1653.9. Samples: 28050350. Policy #0 lag: (min: 9.0, avg: 27.5, max: 41.0) +[2023-10-14 07:21:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:21:03,529][100936] Updated weights for policy 0, policy_version 54730 (0.0007) +[2023-10-14 07:21:03,887][100936] Updated weights for policy 0, policy_version 54740 (0.0008) +[2023-10-14 07:21:04,261][100936] Updated weights for policy 0, policy_version 54750 (0.0009) +[2023-10-14 07:21:06,655][100917] Updated weights for policy 1, policy_version 54822 (0.0009) +[2023-10-14 07:21:07,021][100917] Updated weights for policy 1, policy_version 54832 (0.0009) +[2023-10-14 07:21:07,392][100917] Updated weights for policy 1, policy_version 54842 (0.0009) +[2023-10-14 07:21:08,467][100936] Updated weights for policy 0, policy_version 54760 (0.0008) +[2023-10-14 07:21:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 112230400. Throughput: 0: 1641.6, 1: 1655.2. Samples: 28069666. Policy #0 lag: (min: 9.0, avg: 27.5, max: 41.0) +[2023-10-14 07:21:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:21:08,836][100936] Updated weights for policy 0, policy_version 54770 (0.0009) +[2023-10-14 07:21:09,194][100936] Updated weights for policy 0, policy_version 54780 (0.0009) +[2023-10-14 07:21:11,457][100917] Updated weights for policy 1, policy_version 54852 (0.0009) +[2023-10-14 07:21:11,825][100917] Updated weights for policy 1, policy_version 54862 (0.0008) +[2023-10-14 07:21:12,200][100917] Updated weights for policy 1, policy_version 54872 (0.0009) +[2023-10-14 07:21:13,474][100936] Updated weights for policy 0, policy_version 54790 (0.0007) +[2023-10-14 07:21:13,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 112295936. Throughput: 0: 1637.8, 1: 1669.5. Samples: 28080166. Policy #0 lag: (min: 9.0, avg: 27.5, max: 41.0) +[2023-10-14 07:21:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:21:13,839][100936] Updated weights for policy 0, policy_version 54800 (0.0007) +[2023-10-14 07:21:14,204][100936] Updated weights for policy 0, policy_version 54810 (0.0007) +[2023-10-14 07:21:16,242][100917] Updated weights for policy 1, policy_version 54882 (0.0007) +[2023-10-14 07:21:16,612][100917] Updated weights for policy 1, policy_version 54892 (0.0009) +[2023-10-14 07:21:16,977][100917] Updated weights for policy 1, policy_version 54902 (0.0009) +[2023-10-14 07:21:17,347][100917] Updated weights for policy 1, policy_version 54912 (0.0010) +[2023-10-14 07:21:18,252][100936] Updated weights for policy 0, policy_version 54820 (0.0007) +[2023-10-14 07:21:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 112361472. Throughput: 0: 1640.5, 1: 1655.1. Samples: 28099874. Policy #0 lag: (min: 9.0, avg: 27.5, max: 41.0) +[2023-10-14 07:21:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:21:18,622][100936] Updated weights for policy 0, policy_version 54830 (0.0008) +[2023-10-14 07:21:18,987][100936] Updated weights for policy 0, policy_version 54840 (0.0009) +[2023-10-14 07:21:21,327][100917] Updated weights for policy 1, policy_version 54922 (0.0009) +[2023-10-14 07:21:21,700][100917] Updated weights for policy 1, policy_version 54932 (0.0007) +[2023-10-14 07:21:22,070][100917] Updated weights for policy 1, policy_version 54942 (0.0009) +[2023-10-14 07:21:23,363][100936] Updated weights for policy 0, policy_version 54850 (0.0010) +[2023-10-14 07:21:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 112427008. Throughput: 0: 1641.6, 1: 1672.0. Samples: 28119608. Policy #0 lag: (min: 9.0, avg: 27.5, max: 41.0) +[2023-10-14 07:21:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:21:23,733][100936] Updated weights for policy 0, policy_version 54860 (0.0009) +[2023-10-14 07:21:24,113][100936] Updated weights for policy 0, policy_version 54870 (0.0010) +[2023-10-14 07:21:24,482][100936] Updated weights for policy 0, policy_version 54880 (0.0007) +[2023-10-14 07:21:26,342][100917] Updated weights for policy 1, policy_version 54952 (0.0012) +[2023-10-14 07:21:26,712][100917] Updated weights for policy 1, policy_version 54962 (0.0010) +[2023-10-14 07:21:27,090][100917] Updated weights for policy 1, policy_version 54972 (0.0010) +[2023-10-14 07:21:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 112492544. Throughput: 0: 1641.1, 1: 1673.6. Samples: 28130136. Policy #0 lag: (min: 2.0, avg: 9.9, max: 34.0) +[2023-10-14 07:21:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:21:28,591][100936] Updated weights for policy 0, policy_version 54890 (0.0011) +[2023-10-14 07:21:28,951][100936] Updated weights for policy 0, policy_version 54900 (0.0010) +[2023-10-14 07:21:29,313][100936] Updated weights for policy 0, policy_version 54910 (0.0010) +[2023-10-14 07:21:31,094][100917] Updated weights for policy 1, policy_version 54982 (0.0008) +[2023-10-14 07:21:31,472][100917] Updated weights for policy 1, policy_version 54992 (0.0008) +[2023-10-14 07:21:31,844][100917] Updated weights for policy 1, policy_version 55002 (0.0008) +[2023-10-14 07:21:33,472][100936] Updated weights for policy 0, policy_version 54920 (0.0008) +[2023-10-14 07:21:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 112558080. Throughput: 0: 1649.1, 1: 1658.3. Samples: 28149528. Policy #0 lag: (min: 2.0, avg: 9.9, max: 34.0) +[2023-10-14 07:21:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:21:33,836][100936] Updated weights for policy 0, policy_version 54930 (0.0007) +[2023-10-14 07:21:34,204][100936] Updated weights for policy 0, policy_version 54940 (0.0007) +[2023-10-14 07:21:36,126][100917] Updated weights for policy 1, policy_version 55012 (0.0009) +[2023-10-14 07:21:36,522][100917] Updated weights for policy 1, policy_version 55022 (0.0011) +[2023-10-14 07:21:36,894][100917] Updated weights for policy 1, policy_version 55032 (0.0009) +[2023-10-14 07:21:38,239][100936] Updated weights for policy 0, policy_version 54950 (0.0007) +[2023-10-14 07:21:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 112623616. Throughput: 0: 1641.2, 1: 1673.5. Samples: 28168964. Policy #0 lag: (min: 2.0, avg: 9.9, max: 34.0) +[2023-10-14 07:21:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:21:38,521][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000055040_56360960.pth... +[2023-10-14 07:21:38,556][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000053504_54788096.pth +[2023-10-14 07:21:38,613][100936] Updated weights for policy 0, policy_version 54960 (0.0007) +[2023-10-14 07:21:38,978][100936] Updated weights for policy 0, policy_version 54970 (0.0010) +[2023-10-14 07:21:39,196][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000054976_56295424.pth... +[2023-10-14 07:21:39,227][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000053408_54689792.pth +[2023-10-14 07:21:41,017][100917] Updated weights for policy 1, policy_version 55042 (0.0010) +[2023-10-14 07:21:41,392][100917] Updated weights for policy 1, policy_version 55052 (0.0009) +[2023-10-14 07:21:41,770][100917] Updated weights for policy 1, policy_version 55062 (0.0008) +[2023-10-14 07:21:42,144][100917] Updated weights for policy 1, policy_version 55072 (0.0007) +[2023-10-14 07:21:42,946][100936] Updated weights for policy 0, policy_version 54980 (0.0010) +[2023-10-14 07:21:43,323][100936] Updated weights for policy 0, policy_version 54990 (0.0009) +[2023-10-14 07:21:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 112689152. Throughput: 0: 1655.4, 1: 1670.5. Samples: 28179678. Policy #0 lag: (min: 2.0, avg: 9.9, max: 34.0) +[2023-10-14 07:21:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:21:43,683][100936] Updated weights for policy 0, policy_version 55000 (0.0008) +[2023-10-14 07:21:46,309][100917] Updated weights for policy 1, policy_version 55082 (0.0010) +[2023-10-14 07:21:46,688][100917] Updated weights for policy 1, policy_version 55092 (0.0008) +[2023-10-14 07:21:47,058][100917] Updated weights for policy 1, policy_version 55102 (0.0010) +[2023-10-14 07:21:47,868][100936] Updated weights for policy 0, policy_version 55010 (0.0009) +[2023-10-14 07:21:48,242][100936] Updated weights for policy 0, policy_version 55020 (0.0009) +[2023-10-14 07:21:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 112754688. Throughput: 0: 1656.2, 1: 1655.7. Samples: 28199384. Policy #0 lag: (min: 2.0, avg: 9.9, max: 34.0) +[2023-10-14 07:21:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:21:48,601][100936] Updated weights for policy 0, policy_version 55030 (0.0010) +[2023-10-14 07:21:48,963][100936] Updated weights for policy 0, policy_version 55040 (0.0009) +[2023-10-14 07:21:51,150][100917] Updated weights for policy 1, policy_version 55112 (0.0009) +[2023-10-14 07:21:51,530][100917] Updated weights for policy 1, policy_version 55122 (0.0009) +[2023-10-14 07:21:51,901][100917] Updated weights for policy 1, policy_version 55132 (0.0007) +[2023-10-14 07:21:53,168][100936] Updated weights for policy 0, policy_version 55050 (0.0009) +[2023-10-14 07:21:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 112820224. Throughput: 0: 1646.1, 1: 1671.3. Samples: 28218948. Policy #0 lag: (min: 2.0, avg: 9.9, max: 34.0) +[2023-10-14 07:21:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:21:53,539][100936] Updated weights for policy 0, policy_version 55060 (0.0009) +[2023-10-14 07:21:53,917][100936] Updated weights for policy 0, policy_version 55070 (0.0008) +[2023-10-14 07:21:55,976][100917] Updated weights for policy 1, policy_version 55142 (0.0008) +[2023-10-14 07:21:56,347][100917] Updated weights for policy 1, policy_version 55152 (0.0009) +[2023-10-14 07:21:56,709][100917] Updated weights for policy 1, policy_version 55162 (0.0008) +[2023-10-14 07:21:57,967][100936] Updated weights for policy 0, policy_version 55080 (0.0009) +[2023-10-14 07:21:58,336][100936] Updated weights for policy 0, policy_version 55090 (0.0009) +[2023-10-14 07:21:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 112885760. Throughput: 0: 1656.2, 1: 1666.5. Samples: 28229686. Policy #0 lag: (min: 2.0, avg: 9.9, max: 34.0) +[2023-10-14 07:21:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:21:58,718][100936] Updated weights for policy 0, policy_version 55100 (0.0010) +[2023-10-14 07:22:00,591][100917] Updated weights for policy 1, policy_version 55172 (0.0009) +[2023-10-14 07:22:00,959][100917] Updated weights for policy 1, policy_version 55182 (0.0007) +[2023-10-14 07:22:01,332][100917] Updated weights for policy 1, policy_version 55192 (0.0007) +[2023-10-14 07:22:02,937][100936] Updated weights for policy 0, policy_version 55110 (0.0008) +[2023-10-14 07:22:03,312][100936] Updated weights for policy 0, policy_version 55120 (0.0009) +[2023-10-14 07:22:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 112951296. Throughput: 0: 1653.8, 1: 1667.2. Samples: 28249320. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:22:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:22:03,684][100936] Updated weights for policy 0, policy_version 55130 (0.0007) +[2023-10-14 07:22:05,367][100917] Updated weights for policy 1, policy_version 55202 (0.0008) +[2023-10-14 07:22:05,731][100917] Updated weights for policy 1, policy_version 55212 (0.0007) +[2023-10-14 07:22:06,112][100917] Updated weights for policy 1, policy_version 55222 (0.0009) +[2023-10-14 07:22:06,481][100917] Updated weights for policy 1, policy_version 55232 (0.0008) +[2023-10-14 07:22:07,692][100936] Updated weights for policy 0, policy_version 55140 (0.0009) +[2023-10-14 07:22:08,067][100936] Updated weights for policy 0, policy_version 55150 (0.0007) +[2023-10-14 07:22:08,437][100936] Updated weights for policy 0, policy_version 55160 (0.0009) +[2023-10-14 07:22:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 113016832. Throughput: 0: 1645.7, 1: 1676.2. Samples: 28269094. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:22:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:22:10,537][100917] Updated weights for policy 1, policy_version 55242 (0.0007) +[2023-10-14 07:22:10,913][100917] Updated weights for policy 1, policy_version 55252 (0.0007) +[2023-10-14 07:22:11,282][100917] Updated weights for policy 1, policy_version 55262 (0.0008) +[2023-10-14 07:22:12,650][100936] Updated weights for policy 0, policy_version 55170 (0.0008) +[2023-10-14 07:22:13,024][100936] Updated weights for policy 0, policy_version 55180 (0.0010) +[2023-10-14 07:22:13,395][100936] Updated weights for policy 0, policy_version 55190 (0.0010) +[2023-10-14 07:22:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 113082368. Throughput: 0: 1657.6, 1: 1656.4. Samples: 28279268. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:22:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:22:13,758][100936] Updated weights for policy 0, policy_version 55200 (0.0010) +[2023-10-14 07:22:15,454][100917] Updated weights for policy 1, policy_version 55272 (0.0008) +[2023-10-14 07:22:15,830][100917] Updated weights for policy 1, policy_version 55282 (0.0010) +[2023-10-14 07:22:16,205][100917] Updated weights for policy 1, policy_version 55292 (0.0008) +[2023-10-14 07:22:18,044][100936] Updated weights for policy 0, policy_version 55210 (0.0009) +[2023-10-14 07:22:18,408][100936] Updated weights for policy 0, policy_version 55220 (0.0008) +[2023-10-14 07:22:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 113147904. Throughput: 0: 1656.9, 1: 1666.3. Samples: 28299072. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:22:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:22:18,776][100936] Updated weights for policy 0, policy_version 55230 (0.0007) +[2023-10-14 07:22:20,303][100917] Updated weights for policy 1, policy_version 55302 (0.0007) +[2023-10-14 07:22:20,679][100917] Updated weights for policy 1, policy_version 55312 (0.0010) +[2023-10-14 07:22:21,044][100917] Updated weights for policy 1, policy_version 55322 (0.0011) +[2023-10-14 07:22:22,929][100936] Updated weights for policy 0, policy_version 55240 (0.0007) +[2023-10-14 07:22:23,303][100936] Updated weights for policy 0, policy_version 55250 (0.0008) +[2023-10-14 07:22:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 113213440. Throughput: 0: 1648.5, 1: 1678.1. Samples: 28318664. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:22:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:22:23,671][100936] Updated weights for policy 0, policy_version 55260 (0.0008) +[2023-10-14 07:22:25,192][100917] Updated weights for policy 1, policy_version 55332 (0.0008) +[2023-10-14 07:22:25,572][100917] Updated weights for policy 1, policy_version 55342 (0.0009) +[2023-10-14 07:22:25,946][100917] Updated weights for policy 1, policy_version 55352 (0.0010) +[2023-10-14 07:22:27,804][100936] Updated weights for policy 0, policy_version 55270 (0.0009) +[2023-10-14 07:22:28,180][100936] Updated weights for policy 0, policy_version 55280 (0.0007) +[2023-10-14 07:22:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 113278976. Throughput: 0: 1653.4, 1: 1655.5. Samples: 28328580. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:22:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:22:28,552][100936] Updated weights for policy 0, policy_version 55290 (0.0008) +[2023-10-14 07:22:30,071][100917] Updated weights for policy 1, policy_version 55362 (0.0010) +[2023-10-14 07:22:30,436][100917] Updated weights for policy 1, policy_version 55372 (0.0007) +[2023-10-14 07:22:30,811][100917] Updated weights for policy 1, policy_version 55382 (0.0010) +[2023-10-14 07:22:31,170][100917] Updated weights for policy 1, policy_version 55392 (0.0009) +[2023-10-14 07:22:32,738][100936] Updated weights for policy 0, policy_version 55300 (0.0008) +[2023-10-14 07:22:33,109][100936] Updated weights for policy 0, policy_version 55310 (0.0008) +[2023-10-14 07:22:33,481][100936] Updated weights for policy 0, policy_version 55320 (0.0008) +[2023-10-14 07:22:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 113344512. Throughput: 0: 1642.9, 1: 1665.2. Samples: 28348246. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:22:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:22:35,353][100917] Updated weights for policy 1, policy_version 55402 (0.0010) +[2023-10-14 07:22:35,735][100917] Updated weights for policy 1, policy_version 55412 (0.0008) +[2023-10-14 07:22:36,118][100917] Updated weights for policy 1, policy_version 55422 (0.0008) +[2023-10-14 07:22:37,616][100936] Updated weights for policy 0, policy_version 55330 (0.0009) +[2023-10-14 07:22:37,990][100936] Updated weights for policy 0, policy_version 55340 (0.0007) +[2023-10-14 07:22:38,356][100936] Updated weights for policy 0, policy_version 55350 (0.0008) +[2023-10-14 07:22:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 113410048. Throughput: 0: 1642.8, 1: 1671.7. Samples: 28368104. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:22:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:22:38,728][100936] Updated weights for policy 0, policy_version 55360 (0.0009) +[2023-10-14 07:22:40,155][100917] Updated weights for policy 1, policy_version 55432 (0.0011) +[2023-10-14 07:22:40,527][100917] Updated weights for policy 1, policy_version 55442 (0.0008) +[2023-10-14 07:22:40,910][100917] Updated weights for policy 1, policy_version 55452 (0.0009) +[2023-10-14 07:22:42,892][100936] Updated weights for policy 0, policy_version 55370 (0.0009) +[2023-10-14 07:22:43,258][100936] Updated weights for policy 0, policy_version 55380 (0.0008) +[2023-10-14 07:22:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 113475584. Throughput: 0: 1650.3, 1: 1649.3. Samples: 28378168. Policy #0 lag: (min: 31.0, avg: 37.8, max: 63.0) +[2023-10-14 07:22:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:22:43,638][100936] Updated weights for policy 0, policy_version 55390 (0.0008) +[2023-10-14 07:22:44,881][100917] Updated weights for policy 1, policy_version 55462 (0.0010) +[2023-10-14 07:22:45,257][100917] Updated weights for policy 1, policy_version 55472 (0.0009) +[2023-10-14 07:22:45,637][100917] Updated weights for policy 1, policy_version 55482 (0.0008) +[2023-10-14 07:22:47,748][100936] Updated weights for policy 0, policy_version 55400 (0.0008) +[2023-10-14 07:22:48,118][100936] Updated weights for policy 0, policy_version 55410 (0.0007) +[2023-10-14 07:22:48,481][100936] Updated weights for policy 0, policy_version 55420 (0.0010) +[2023-10-14 07:22:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 113541120. Throughput: 0: 1654.0, 1: 1660.6. Samples: 28398478. Policy #0 lag: (min: 31.0, avg: 37.8, max: 63.0) +[2023-10-14 07:22:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:22:49,830][100917] Updated weights for policy 1, policy_version 55492 (0.0007) +[2023-10-14 07:22:50,210][100917] Updated weights for policy 1, policy_version 55502 (0.0009) +[2023-10-14 07:22:50,570][100917] Updated weights for policy 1, policy_version 55512 (0.0009) +[2023-10-14 07:22:52,453][100936] Updated weights for policy 0, policy_version 55430 (0.0008) +[2023-10-14 07:22:52,816][100936] Updated weights for policy 0, policy_version 55440 (0.0008) +[2023-10-14 07:22:53,190][100936] Updated weights for policy 0, policy_version 55450 (0.0010) +[2023-10-14 07:22:53,512][99942] Fps is (10 sec: 16383.3, 60 sec: 13653.2, 300 sec: 13329.3). Total num frames: 113639424. Throughput: 0: 1644.9, 1: 1656.5. Samples: 28417660. Policy #0 lag: (min: 31.0, avg: 37.8, max: 63.0) +[2023-10-14 07:22:53,514][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:22:54,721][100917] Updated weights for policy 1, policy_version 55522 (0.0007) +[2023-10-14 07:22:55,078][100917] Updated weights for policy 1, policy_version 55532 (0.0011) +[2023-10-14 07:22:55,453][100917] Updated weights for policy 1, policy_version 55542 (0.0008) +[2023-10-14 07:22:55,830][100917] Updated weights for policy 1, policy_version 55552 (0.0007) +[2023-10-14 07:22:57,404][100936] Updated weights for policy 0, policy_version 55460 (0.0008) +[2023-10-14 07:22:57,776][100936] Updated weights for policy 0, policy_version 55470 (0.0008) +[2023-10-14 07:22:58,139][100936] Updated weights for policy 0, policy_version 55480 (0.0008) +[2023-10-14 07:22:58,512][99942] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 113704960. Throughput: 0: 1656.3, 1: 1647.2. Samples: 28427928. Policy #0 lag: (min: 31.0, avg: 37.8, max: 63.0) +[2023-10-14 07:22:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:22:59,863][100917] Updated weights for policy 1, policy_version 55562 (0.0007) +[2023-10-14 07:23:00,236][100917] Updated weights for policy 1, policy_version 55572 (0.0010) +[2023-10-14 07:23:00,594][100917] Updated weights for policy 1, policy_version 55582 (0.0009) +[2023-10-14 07:23:02,303][100936] Updated weights for policy 0, policy_version 55490 (0.0010) +[2023-10-14 07:23:02,705][100936] Updated weights for policy 0, policy_version 55500 (0.0009) +[2023-10-14 07:23:03,066][100936] Updated weights for policy 0, policy_version 55510 (0.0009) +[2023-10-14 07:23:03,441][100936] Updated weights for policy 0, policy_version 55520 (0.0008) +[2023-10-14 07:23:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 113770496. Throughput: 0: 1650.7, 1: 1661.0. Samples: 28448100. Policy #0 lag: (min: 31.0, avg: 37.8, max: 63.0) +[2023-10-14 07:23:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:23:04,803][100917] Updated weights for policy 1, policy_version 55592 (0.0009) +[2023-10-14 07:23:05,178][100917] Updated weights for policy 1, policy_version 55602 (0.0008) +[2023-10-14 07:23:05,550][100917] Updated weights for policy 1, policy_version 55612 (0.0007) +[2023-10-14 07:23:07,523][100936] Updated weights for policy 0, policy_version 55530 (0.0008) +[2023-10-14 07:23:07,897][100936] Updated weights for policy 0, policy_version 55540 (0.0008) +[2023-10-14 07:23:08,264][100936] Updated weights for policy 0, policy_version 55550 (0.0007) +[2023-10-14 07:23:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 113836032. Throughput: 0: 1649.4, 1: 1656.7. Samples: 28467438. Policy #0 lag: (min: 31.0, avg: 37.8, max: 63.0) +[2023-10-14 07:23:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:23:09,511][100917] Updated weights for policy 1, policy_version 55622 (0.0008) +[2023-10-14 07:23:09,891][100917] Updated weights for policy 1, policy_version 55632 (0.0007) +[2023-10-14 07:23:10,262][100917] Updated weights for policy 1, policy_version 55642 (0.0008) +[2023-10-14 07:23:12,435][100936] Updated weights for policy 0, policy_version 55560 (0.0009) +[2023-10-14 07:23:12,800][100936] Updated weights for policy 0, policy_version 55570 (0.0008) +[2023-10-14 07:23:13,171][100936] Updated weights for policy 0, policy_version 55580 (0.0008) +[2023-10-14 07:23:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 113901568. Throughput: 0: 1662.0, 1: 1652.8. Samples: 28477744. Policy #0 lag: (min: 31.0, avg: 37.8, max: 63.0) +[2023-10-14 07:23:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:23:14,513][100917] Updated weights for policy 1, policy_version 55652 (0.0007) +[2023-10-14 07:23:14,882][100917] Updated weights for policy 1, policy_version 55662 (0.0010) +[2023-10-14 07:23:15,259][100917] Updated weights for policy 1, policy_version 55672 (0.0010) +[2023-10-14 07:23:17,361][100936] Updated weights for policy 0, policy_version 55590 (0.0010) +[2023-10-14 07:23:17,742][100936] Updated weights for policy 0, policy_version 55600 (0.0009) +[2023-10-14 07:23:18,102][100936] Updated weights for policy 0, policy_version 55610 (0.0009) +[2023-10-14 07:23:18,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 113967104. Throughput: 0: 1660.0, 1: 1662.3. Samples: 28497752. Policy #0 lag: (min: 4.0, avg: 23.8, max: 36.0) +[2023-10-14 07:23:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:23:19,424][100917] Updated weights for policy 1, policy_version 55682 (0.0009) +[2023-10-14 07:23:19,840][100917] Updated weights for policy 1, policy_version 55692 (0.0009) +[2023-10-14 07:23:20,224][100917] Updated weights for policy 1, policy_version 55702 (0.0008) +[2023-10-14 07:23:20,594][100917] Updated weights for policy 1, policy_version 55712 (0.0008) +[2023-10-14 07:23:22,118][100936] Updated weights for policy 0, policy_version 55620 (0.0011) +[2023-10-14 07:23:22,483][100936] Updated weights for policy 0, policy_version 55630 (0.0009) +[2023-10-14 07:23:22,855][100936] Updated weights for policy 0, policy_version 55640 (0.0008) +[2023-10-14 07:23:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 114032640. Throughput: 0: 1653.5, 1: 1655.7. Samples: 28517020. Policy #0 lag: (min: 4.0, avg: 23.8, max: 36.0) +[2023-10-14 07:23:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:23:24,690][100917] Updated weights for policy 1, policy_version 55722 (0.0011) +[2023-10-14 07:23:25,061][100917] Updated weights for policy 1, policy_version 55732 (0.0011) +[2023-10-14 07:23:25,422][100917] Updated weights for policy 1, policy_version 55742 (0.0009) +[2023-10-14 07:23:26,761][100936] Updated weights for policy 0, policy_version 55650 (0.0008) +[2023-10-14 07:23:27,125][100936] Updated weights for policy 0, policy_version 55660 (0.0009) +[2023-10-14 07:23:27,492][100936] Updated weights for policy 0, policy_version 55670 (0.0007) +[2023-10-14 07:23:27,863][100936] Updated weights for policy 0, policy_version 55680 (0.0008) +[2023-10-14 07:23:28,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 114098176. Throughput: 0: 1665.4, 1: 1649.3. Samples: 28527332. Policy #0 lag: (min: 4.0, avg: 23.8, max: 36.0) +[2023-10-14 07:23:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:23:29,526][100917] Updated weights for policy 1, policy_version 55752 (0.0010) +[2023-10-14 07:23:29,900][100917] Updated weights for policy 1, policy_version 55762 (0.0010) +[2023-10-14 07:23:30,272][100917] Updated weights for policy 1, policy_version 55772 (0.0009) +[2023-10-14 07:23:32,255][100936] Updated weights for policy 0, policy_version 55690 (0.0009) +[2023-10-14 07:23:32,619][100936] Updated weights for policy 0, policy_version 55700 (0.0008) +[2023-10-14 07:23:32,985][100936] Updated weights for policy 0, policy_version 55710 (0.0009) +[2023-10-14 07:23:33,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 114163712. Throughput: 0: 1653.0, 1: 1652.5. Samples: 28547226. Policy #0 lag: (min: 4.0, avg: 23.8, max: 36.0) +[2023-10-14 07:23:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:23:34,413][100917] Updated weights for policy 1, policy_version 55782 (0.0008) +[2023-10-14 07:23:34,790][100917] Updated weights for policy 1, policy_version 55792 (0.0009) +[2023-10-14 07:23:35,167][100917] Updated weights for policy 1, policy_version 55802 (0.0009) +[2023-10-14 07:23:37,065][100936] Updated weights for policy 0, policy_version 55720 (0.0009) +[2023-10-14 07:23:37,441][100936] Updated weights for policy 0, policy_version 55730 (0.0008) +[2023-10-14 07:23:37,802][100936] Updated weights for policy 0, policy_version 55740 (0.0010) +[2023-10-14 07:23:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 114229248. Throughput: 0: 1658.4, 1: 1655.8. Samples: 28566796. Policy #0 lag: (min: 4.0, avg: 23.8, max: 36.0) +[2023-10-14 07:23:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:23:38,521][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000055808_57147392.pth... +[2023-10-14 07:23:38,521][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000055744_57081856.pth... +[2023-10-14 07:23:38,550][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000054272_55574528.pth +[2023-10-14 07:23:38,560][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000054208_55508992.pth +[2023-10-14 07:23:39,411][100917] Updated weights for policy 1, policy_version 55812 (0.0007) +[2023-10-14 07:23:39,775][100917] Updated weights for policy 1, policy_version 55822 (0.0010) +[2023-10-14 07:23:40,150][100917] Updated weights for policy 1, policy_version 55832 (0.0007) +[2023-10-14 07:23:42,059][100936] Updated weights for policy 0, policy_version 55750 (0.0009) +[2023-10-14 07:23:42,437][100936] Updated weights for policy 0, policy_version 55760 (0.0007) +[2023-10-14 07:23:42,811][100936] Updated weights for policy 0, policy_version 55770 (0.0008) +[2023-10-14 07:23:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 114294784. Throughput: 0: 1656.8, 1: 1653.6. Samples: 28576892. Policy #0 lag: (min: 4.0, avg: 23.8, max: 36.0) +[2023-10-14 07:23:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:23:44,244][100917] Updated weights for policy 1, policy_version 55842 (0.0008) +[2023-10-14 07:23:44,607][100917] Updated weights for policy 1, policy_version 55852 (0.0011) +[2023-10-14 07:23:44,978][100917] Updated weights for policy 1, policy_version 55862 (0.0010) +[2023-10-14 07:23:45,356][100917] Updated weights for policy 1, policy_version 55872 (0.0010) +[2023-10-14 07:23:47,014][100936] Updated weights for policy 0, policy_version 55780 (0.0009) +[2023-10-14 07:23:47,393][100936] Updated weights for policy 0, policy_version 55790 (0.0007) +[2023-10-14 07:23:47,760][100936] Updated weights for policy 0, policy_version 55800 (0.0008) +[2023-10-14 07:23:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 114360320. Throughput: 0: 1647.9, 1: 1656.0. Samples: 28596772. Policy #0 lag: (min: 4.0, avg: 23.8, max: 36.0) +[2023-10-14 07:23:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:23:49,495][100917] Updated weights for policy 1, policy_version 55882 (0.0009) +[2023-10-14 07:23:49,871][100917] Updated weights for policy 1, policy_version 55892 (0.0010) +[2023-10-14 07:23:50,255][100917] Updated weights for policy 1, policy_version 55902 (0.0008) +[2023-10-14 07:23:52,011][100936] Updated weights for policy 0, policy_version 55810 (0.0009) +[2023-10-14 07:23:52,423][100936] Updated weights for policy 0, policy_version 55820 (0.0008) +[2023-10-14 07:23:52,798][100936] Updated weights for policy 0, policy_version 55830 (0.0008) +[2023-10-14 07:23:53,176][100936] Updated weights for policy 0, policy_version 55840 (0.0010) +[2023-10-14 07:23:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 114425856. Throughput: 0: 1647.5, 1: 1657.0. Samples: 28616138. Policy #0 lag: (min: 31.0, avg: 37.1, max: 63.0) +[2023-10-14 07:23:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:23:54,265][100917] Updated weights for policy 1, policy_version 55912 (0.0007) +[2023-10-14 07:23:54,641][100917] Updated weights for policy 1, policy_version 55922 (0.0008) +[2023-10-14 07:23:55,006][100917] Updated weights for policy 1, policy_version 55932 (0.0008) +[2023-10-14 07:23:57,419][100936] Updated weights for policy 0, policy_version 55850 (0.0007) +[2023-10-14 07:23:57,784][100936] Updated weights for policy 0, policy_version 55860 (0.0008) +[2023-10-14 07:23:58,151][100936] Updated weights for policy 0, policy_version 55870 (0.0008) +[2023-10-14 07:23:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 114491392. Throughput: 0: 1646.2, 1: 1655.3. Samples: 28626314. Policy #0 lag: (min: 31.0, avg: 37.1, max: 63.0) +[2023-10-14 07:23:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:23:59,223][100917] Updated weights for policy 1, policy_version 55942 (0.0008) +[2023-10-14 07:23:59,598][100917] Updated weights for policy 1, policy_version 55952 (0.0007) +[2023-10-14 07:23:59,964][100917] Updated weights for policy 1, policy_version 55962 (0.0009) +[2023-10-14 07:24:02,342][100936] Updated weights for policy 0, policy_version 55880 (0.0010) +[2023-10-14 07:24:02,707][100936] Updated weights for policy 0, policy_version 55890 (0.0011) +[2023-10-14 07:24:03,071][100936] Updated weights for policy 0, policy_version 55900 (0.0009) +[2023-10-14 07:24:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 114556928. Throughput: 0: 1642.9, 1: 1664.2. Samples: 28646574. Policy #0 lag: (min: 31.0, avg: 37.1, max: 63.0) +[2023-10-14 07:24:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:24:03,960][100917] Updated weights for policy 1, policy_version 55972 (0.0008) +[2023-10-14 07:24:04,351][100917] Updated weights for policy 1, policy_version 55982 (0.0008) +[2023-10-14 07:24:04,735][100917] Updated weights for policy 1, policy_version 55992 (0.0007) +[2023-10-14 07:24:07,312][100936] Updated weights for policy 0, policy_version 55910 (0.0009) +[2023-10-14 07:24:07,677][100936] Updated weights for policy 0, policy_version 55920 (0.0009) +[2023-10-14 07:24:08,050][100936] Updated weights for policy 0, policy_version 55930 (0.0008) +[2023-10-14 07:24:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 114622464. Throughput: 0: 1639.9, 1: 1673.2. Samples: 28666108. Policy #0 lag: (min: 31.0, avg: 37.1, max: 63.0) +[2023-10-14 07:24:08,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:24:08,669][100917] Updated weights for policy 1, policy_version 56002 (0.0008) +[2023-10-14 07:24:09,058][100917] Updated weights for policy 1, policy_version 56012 (0.0009) +[2023-10-14 07:24:09,426][100917] Updated weights for policy 1, policy_version 56022 (0.0010) +[2023-10-14 07:24:09,797][100917] Updated weights for policy 1, policy_version 56032 (0.0009) +[2023-10-14 07:24:12,094][100936] Updated weights for policy 0, policy_version 55940 (0.0008) +[2023-10-14 07:24:12,466][100936] Updated weights for policy 0, policy_version 55950 (0.0009) +[2023-10-14 07:24:12,842][100936] Updated weights for policy 0, policy_version 55960 (0.0010) +[2023-10-14 07:24:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 114688000. Throughput: 0: 1636.7, 1: 1673.9. Samples: 28676310. Policy #0 lag: (min: 31.0, avg: 37.1, max: 63.0) +[2023-10-14 07:24:13,513][99942] Avg episode reward: [(0, '0.820'), (1, '1.000')] +[2023-10-14 07:24:13,957][100917] Updated weights for policy 1, policy_version 56042 (0.0008) +[2023-10-14 07:24:14,328][100917] Updated weights for policy 1, policy_version 56052 (0.0007) +[2023-10-14 07:24:14,698][100917] Updated weights for policy 1, policy_version 56062 (0.0009) +[2023-10-14 07:24:17,079][100936] Updated weights for policy 0, policy_version 55970 (0.0010) +[2023-10-14 07:24:17,449][100936] Updated weights for policy 0, policy_version 55980 (0.0012) +[2023-10-14 07:24:17,828][100936] Updated weights for policy 0, policy_version 55990 (0.0010) +[2023-10-14 07:24:18,204][100936] Updated weights for policy 0, policy_version 56000 (0.0008) +[2023-10-14 07:24:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 114753536. Throughput: 0: 1637.4, 1: 1672.9. Samples: 28696188. Policy #0 lag: (min: 31.0, avg: 37.1, max: 63.0) +[2023-10-14 07:24:18,513][99942] Avg episode reward: [(0, '0.820'), (1, '1.000')] +[2023-10-14 07:24:18,919][100917] Updated weights for policy 1, policy_version 56072 (0.0007) +[2023-10-14 07:24:19,296][100917] Updated weights for policy 1, policy_version 56082 (0.0007) +[2023-10-14 07:24:19,666][100917] Updated weights for policy 1, policy_version 56092 (0.0008) +[2023-10-14 07:24:22,465][100936] Updated weights for policy 0, policy_version 56010 (0.0010) +[2023-10-14 07:24:22,841][100936] Updated weights for policy 0, policy_version 56020 (0.0009) +[2023-10-14 07:24:23,218][100936] Updated weights for policy 0, policy_version 56030 (0.0008) +[2023-10-14 07:24:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 114819072. Throughput: 0: 1633.9, 1: 1671.4. Samples: 28715532. Policy #0 lag: (min: 31.0, avg: 37.1, max: 63.0) +[2023-10-14 07:24:23,513][99942] Avg episode reward: [(0, '0.820'), (1, '1.000')] +[2023-10-14 07:24:23,910][100917] Updated weights for policy 1, policy_version 56102 (0.0009) +[2023-10-14 07:24:24,285][100917] Updated weights for policy 1, policy_version 56112 (0.0010) +[2023-10-14 07:24:24,657][100917] Updated weights for policy 1, policy_version 56122 (0.0008) +[2023-10-14 07:24:27,468][100936] Updated weights for policy 0, policy_version 56040 (0.0007) +[2023-10-14 07:24:27,842][100936] Updated weights for policy 0, policy_version 56050 (0.0009) +[2023-10-14 07:24:28,206][100936] Updated weights for policy 0, policy_version 56060 (0.0007) +[2023-10-14 07:24:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 114884608. Throughput: 0: 1634.4, 1: 1670.2. Samples: 28725596. Policy #0 lag: (min: 31.0, avg: 37.1, max: 63.0) +[2023-10-14 07:24:28,513][99942] Avg episode reward: [(0, '0.820'), (1, '1.000')] +[2023-10-14 07:24:28,720][100917] Updated weights for policy 1, policy_version 56132 (0.0008) +[2023-10-14 07:24:29,091][100917] Updated weights for policy 1, policy_version 56142 (0.0007) +[2023-10-14 07:24:29,462][100917] Updated weights for policy 1, policy_version 56152 (0.0007) +[2023-10-14 07:24:32,276][100936] Updated weights for policy 0, policy_version 56070 (0.0007) +[2023-10-14 07:24:32,644][100936] Updated weights for policy 0, policy_version 56080 (0.0008) +[2023-10-14 07:24:33,011][100936] Updated weights for policy 0, policy_version 56090 (0.0010) +[2023-10-14 07:24:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 114950144. Throughput: 0: 1642.4, 1: 1662.9. Samples: 28745510. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:24:33,513][99942] Avg episode reward: [(0, '0.670'), (1, '1.000')] +[2023-10-14 07:24:33,607][100917] Updated weights for policy 1, policy_version 56162 (0.0008) +[2023-10-14 07:24:33,977][100917] Updated weights for policy 1, policy_version 56172 (0.0007) +[2023-10-14 07:24:34,355][100917] Updated weights for policy 1, policy_version 56182 (0.0007) +[2023-10-14 07:24:34,719][100917] Updated weights for policy 1, policy_version 56192 (0.0008) +[2023-10-14 07:24:36,968][100936] Updated weights for policy 0, policy_version 56100 (0.0008) +[2023-10-14 07:24:37,355][100936] Updated weights for policy 0, policy_version 56110 (0.0007) +[2023-10-14 07:24:37,727][100936] Updated weights for policy 0, policy_version 56120 (0.0009) +[2023-10-14 07:24:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 115015680. Throughput: 0: 1642.8, 1: 1665.5. Samples: 28765012. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:24:38,513][99942] Avg episode reward: [(0, '0.670'), (1, '1.000')] +[2023-10-14 07:24:38,797][100917] Updated weights for policy 1, policy_version 56202 (0.0009) +[2023-10-14 07:24:39,159][100917] Updated weights for policy 1, policy_version 56212 (0.0008) +[2023-10-14 07:24:39,531][100917] Updated weights for policy 1, policy_version 56222 (0.0010) +[2023-10-14 07:24:41,770][100936] Updated weights for policy 0, policy_version 56130 (0.0008) +[2023-10-14 07:24:42,146][100936] Updated weights for policy 0, policy_version 56140 (0.0011) +[2023-10-14 07:24:42,516][100936] Updated weights for policy 0, policy_version 56150 (0.0010) +[2023-10-14 07:24:42,880][100936] Updated weights for policy 0, policy_version 56160 (0.0009) +[2023-10-14 07:24:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 115081216. Throughput: 0: 1647.6, 1: 1664.9. Samples: 28775376. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:24:43,513][99942] Avg episode reward: [(0, '0.670'), (1, '1.000')] +[2023-10-14 07:24:43,553][100917] Updated weights for policy 1, policy_version 56232 (0.0010) +[2023-10-14 07:24:43,925][100917] Updated weights for policy 1, policy_version 56242 (0.0008) +[2023-10-14 07:24:44,309][100917] Updated weights for policy 1, policy_version 56252 (0.0009) +[2023-10-14 07:24:47,099][100936] Updated weights for policy 0, policy_version 56170 (0.0009) +[2023-10-14 07:24:47,461][100936] Updated weights for policy 0, policy_version 56180 (0.0010) +[2023-10-14 07:24:47,842][100936] Updated weights for policy 0, policy_version 56190 (0.0011) +[2023-10-14 07:24:48,489][100917] Updated weights for policy 1, policy_version 56262 (0.0009) +[2023-10-14 07:24:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 115146752. Throughput: 0: 1641.5, 1: 1657.0. Samples: 28795004. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:24:48,513][99942] Avg episode reward: [(0, '0.670'), (1, '1.000')] +[2023-10-14 07:24:48,869][100917] Updated weights for policy 1, policy_version 56272 (0.0012) +[2023-10-14 07:24:49,241][100917] Updated weights for policy 1, policy_version 56282 (0.0008) +[2023-10-14 07:24:51,875][100936] Updated weights for policy 0, policy_version 56200 (0.0011) +[2023-10-14 07:24:52,250][100936] Updated weights for policy 0, policy_version 56210 (0.0011) +[2023-10-14 07:24:52,617][100936] Updated weights for policy 0, policy_version 56220 (0.0010) +[2023-10-14 07:24:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 115212288. Throughput: 0: 1654.3, 1: 1652.1. Samples: 28814894. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:24:53,513][99942] Avg episode reward: [(0, '0.620'), (1, '1.000')] +[2023-10-14 07:24:53,646][100917] Updated weights for policy 1, policy_version 56292 (0.0009) +[2023-10-14 07:24:54,045][100917] Updated weights for policy 1, policy_version 56302 (0.0009) +[2023-10-14 07:24:54,406][100917] Updated weights for policy 1, policy_version 56312 (0.0009) +[2023-10-14 07:24:56,761][100936] Updated weights for policy 0, policy_version 56230 (0.0008) +[2023-10-14 07:24:57,130][100936] Updated weights for policy 0, policy_version 56240 (0.0008) +[2023-10-14 07:24:57,502][100936] Updated weights for policy 0, policy_version 56250 (0.0008) +[2023-10-14 07:24:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 115277824. Throughput: 0: 1655.2, 1: 1646.1. Samples: 28824868. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:24:58,512][99942] Avg episode reward: [(0, '0.620'), (1, '1.000')] +[2023-10-14 07:24:58,634][100917] Updated weights for policy 1, policy_version 56322 (0.0007) +[2023-10-14 07:24:59,002][100917] Updated weights for policy 1, policy_version 56332 (0.0010) +[2023-10-14 07:24:59,380][100917] Updated weights for policy 1, policy_version 56342 (0.0009) +[2023-10-14 07:24:59,746][100917] Updated weights for policy 1, policy_version 56352 (0.0007) +[2023-10-14 07:25:01,546][100936] Updated weights for policy 0, policy_version 56260 (0.0008) +[2023-10-14 07:25:01,922][100936] Updated weights for policy 0, policy_version 56270 (0.0008) +[2023-10-14 07:25:02,288][100936] Updated weights for policy 0, policy_version 56280 (0.0009) +[2023-10-14 07:25:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 115343360. Throughput: 0: 1644.2, 1: 1646.6. Samples: 28844276. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:25:03,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:25:03,893][100917] Updated weights for policy 1, policy_version 56362 (0.0009) +[2023-10-14 07:25:04,271][100917] Updated weights for policy 1, policy_version 56372 (0.0010) +[2023-10-14 07:25:04,647][100917] Updated weights for policy 1, policy_version 56382 (0.0009) +[2023-10-14 07:25:06,445][100936] Updated weights for policy 0, policy_version 56290 (0.0008) +[2023-10-14 07:25:06,813][100936] Updated weights for policy 0, policy_version 56300 (0.0010) +[2023-10-14 07:25:07,182][100936] Updated weights for policy 0, policy_version 56310 (0.0009) +[2023-10-14 07:25:07,546][100936] Updated weights for policy 0, policy_version 56320 (0.0008) +[2023-10-14 07:25:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 115408896. Throughput: 0: 1660.1, 1: 1650.9. Samples: 28864528. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:25:08,512][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:25:08,617][100917] Updated weights for policy 1, policy_version 56392 (0.0009) +[2023-10-14 07:25:08,997][100917] Updated weights for policy 1, policy_version 56402 (0.0010) +[2023-10-14 07:25:09,379][100917] Updated weights for policy 1, policy_version 56412 (0.0008) +[2023-10-14 07:25:11,518][100936] Updated weights for policy 0, policy_version 56330 (0.0011) +[2023-10-14 07:25:11,892][100936] Updated weights for policy 0, policy_version 56340 (0.0008) +[2023-10-14 07:25:12,268][100936] Updated weights for policy 0, policy_version 56350 (0.0008) +[2023-10-14 07:25:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 115474432. Throughput: 0: 1660.4, 1: 1651.6. Samples: 28874634. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:25:13,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:25:13,534][100917] Updated weights for policy 1, policy_version 56422 (0.0010) +[2023-10-14 07:25:13,907][100917] Updated weights for policy 1, policy_version 56432 (0.0010) +[2023-10-14 07:25:14,278][100917] Updated weights for policy 1, policy_version 56442 (0.0010) +[2023-10-14 07:25:16,421][100936] Updated weights for policy 0, policy_version 56360 (0.0007) +[2023-10-14 07:25:16,784][100936] Updated weights for policy 0, policy_version 56370 (0.0011) +[2023-10-14 07:25:17,151][100936] Updated weights for policy 0, policy_version 56380 (0.0008) +[2023-10-14 07:25:18,383][100917] Updated weights for policy 1, policy_version 56452 (0.0010) +[2023-10-14 07:25:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 115539968. Throughput: 0: 1645.5, 1: 1652.8. Samples: 28893932. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:25:18,512][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:25:18,752][100917] Updated weights for policy 1, policy_version 56462 (0.0007) +[2023-10-14 07:25:19,123][100917] Updated weights for policy 1, policy_version 56472 (0.0010) +[2023-10-14 07:25:21,368][100936] Updated weights for policy 0, policy_version 56390 (0.0009) +[2023-10-14 07:25:21,743][100936] Updated weights for policy 0, policy_version 56400 (0.0010) +[2023-10-14 07:25:22,116][100936] Updated weights for policy 0, policy_version 56410 (0.0010) +[2023-10-14 07:25:23,282][100917] Updated weights for policy 1, policy_version 56482 (0.0011) +[2023-10-14 07:25:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 115605504. Throughput: 0: 1662.7, 1: 1651.3. Samples: 28914142. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:25:23,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:25:23,651][100917] Updated weights for policy 1, policy_version 56492 (0.0010) +[2023-10-14 07:25:24,030][100917] Updated weights for policy 1, policy_version 56502 (0.0010) +[2023-10-14 07:25:24,405][100917] Updated weights for policy 1, policy_version 56512 (0.0008) +[2023-10-14 07:25:26,345][100936] Updated weights for policy 0, policy_version 56420 (0.0009) +[2023-10-14 07:25:26,743][100936] Updated weights for policy 0, policy_version 56430 (0.0008) +[2023-10-14 07:25:27,123][100936] Updated weights for policy 0, policy_version 56440 (0.0009) +[2023-10-14 07:25:28,385][100917] Updated weights for policy 1, policy_version 56522 (0.0008) +[2023-10-14 07:25:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 115671040. Throughput: 0: 1652.8, 1: 1655.3. Samples: 28924242. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:25:28,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:25:28,751][100917] Updated weights for policy 1, policy_version 56532 (0.0009) +[2023-10-14 07:25:29,127][100917] Updated weights for policy 1, policy_version 56542 (0.0007) +[2023-10-14 07:25:31,316][100936] Updated weights for policy 0, policy_version 56450 (0.0010) +[2023-10-14 07:25:31,677][100936] Updated weights for policy 0, policy_version 56460 (0.0008) +[2023-10-14 07:25:32,050][100936] Updated weights for policy 0, policy_version 56470 (0.0009) +[2023-10-14 07:25:32,427][100936] Updated weights for policy 0, policy_version 56480 (0.0010) +[2023-10-14 07:25:33,156][100917] Updated weights for policy 1, policy_version 56552 (0.0010) +[2023-10-14 07:25:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 115736576. Throughput: 0: 1643.2, 1: 1659.4. Samples: 28943620. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:25:33,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:25:33,522][100917] Updated weights for policy 1, policy_version 56562 (0.0010) +[2023-10-14 07:25:33,897][100917] Updated weights for policy 1, policy_version 56572 (0.0009) +[2023-10-14 07:25:36,516][100936] Updated weights for policy 0, policy_version 56490 (0.0009) +[2023-10-14 07:25:36,880][100936] Updated weights for policy 0, policy_version 56500 (0.0009) +[2023-10-14 07:25:37,248][100936] Updated weights for policy 0, policy_version 56510 (0.0007) +[2023-10-14 07:25:38,035][100917] Updated weights for policy 1, policy_version 56582 (0.0008) +[2023-10-14 07:25:38,423][100917] Updated weights for policy 1, policy_version 56592 (0.0008) +[2023-10-14 07:25:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 115802112. Throughput: 0: 1654.8, 1: 1657.4. Samples: 28963944. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:25:38,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:25:38,520][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000056512_57868288.pth... +[2023-10-14 07:25:38,558][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000054976_56295424.pth +[2023-10-14 07:25:38,792][100917] Updated weights for policy 1, policy_version 56602 (0.0008) +[2023-10-14 07:25:39,014][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000056608_57966592.pth... +[2023-10-14 07:25:39,043][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000055040_56360960.pth +[2023-10-14 07:25:41,467][100936] Updated weights for policy 0, policy_version 56520 (0.0008) +[2023-10-14 07:25:41,835][100936] Updated weights for policy 0, policy_version 56530 (0.0007) +[2023-10-14 07:25:42,205][100936] Updated weights for policy 0, policy_version 56540 (0.0009) +[2023-10-14 07:25:42,837][100917] Updated weights for policy 1, policy_version 56612 (0.0009) +[2023-10-14 07:25:43,210][100917] Updated weights for policy 1, policy_version 56622 (0.0008) +[2023-10-14 07:25:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 115867648. Throughput: 0: 1650.8, 1: 1668.7. Samples: 28974244. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:25:43,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:25:43,588][100917] Updated weights for policy 1, policy_version 56632 (0.0007) +[2023-10-14 07:25:46,477][100936] Updated weights for policy 0, policy_version 56550 (0.0010) +[2023-10-14 07:25:46,850][100936] Updated weights for policy 0, policy_version 56560 (0.0008) +[2023-10-14 07:25:47,207][100936] Updated weights for policy 0, policy_version 56570 (0.0009) +[2023-10-14 07:25:47,673][100917] Updated weights for policy 1, policy_version 56642 (0.0008) +[2023-10-14 07:25:48,040][100917] Updated weights for policy 1, policy_version 56652 (0.0009) +[2023-10-14 07:25:48,420][100917] Updated weights for policy 1, policy_version 56662 (0.0009) +[2023-10-14 07:25:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 115933184. Throughput: 0: 1648.1, 1: 1672.6. Samples: 28993710. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:25:48,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:25:48,792][100917] Updated weights for policy 1, policy_version 56672 (0.0007) +[2023-10-14 07:25:51,352][100936] Updated weights for policy 0, policy_version 56580 (0.0009) +[2023-10-14 07:25:51,726][100936] Updated weights for policy 0, policy_version 56590 (0.0010) +[2023-10-14 07:25:52,091][100936] Updated weights for policy 0, policy_version 56600 (0.0008) +[2023-10-14 07:25:52,784][100917] Updated weights for policy 1, policy_version 56682 (0.0009) +[2023-10-14 07:25:53,160][100917] Updated weights for policy 1, policy_version 56692 (0.0008) +[2023-10-14 07:25:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 115998720. Throughput: 0: 1650.7, 1: 1662.8. Samples: 29013636. Policy #0 lag: (min: 7.0, avg: 15.0, max: 39.0) +[2023-10-14 07:25:53,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:25:53,547][100917] Updated weights for policy 1, policy_version 56702 (0.0009) +[2023-10-14 07:25:56,243][100936] Updated weights for policy 0, policy_version 56610 (0.0009) +[2023-10-14 07:25:56,616][100936] Updated weights for policy 0, policy_version 56620 (0.0008) +[2023-10-14 07:25:56,991][100936] Updated weights for policy 0, policy_version 56630 (0.0008) +[2023-10-14 07:25:57,363][100936] Updated weights for policy 0, policy_version 56640 (0.0008) +[2023-10-14 07:25:57,649][100917] Updated weights for policy 1, policy_version 56712 (0.0008) +[2023-10-14 07:25:58,022][100917] Updated weights for policy 1, policy_version 56722 (0.0009) +[2023-10-14 07:25:58,396][100917] Updated weights for policy 1, policy_version 56732 (0.0007) +[2023-10-14 07:25:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 116064256. Throughput: 0: 1645.6, 1: 1676.1. Samples: 29024106. Policy #0 lag: (min: 7.0, avg: 15.0, max: 39.0) +[2023-10-14 07:25:58,512][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:26:01,625][100936] Updated weights for policy 0, policy_version 56650 (0.0007) +[2023-10-14 07:26:01,999][100936] Updated weights for policy 0, policy_version 56660 (0.0007) +[2023-10-14 07:26:02,359][100936] Updated weights for policy 0, policy_version 56670 (0.0009) +[2023-10-14 07:26:02,492][100917] Updated weights for policy 1, policy_version 56742 (0.0008) +[2023-10-14 07:26:02,869][100917] Updated weights for policy 1, policy_version 56752 (0.0007) +[2023-10-14 07:26:03,231][100917] Updated weights for policy 1, policy_version 56762 (0.0007) +[2023-10-14 07:26:03,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 116162560. Throughput: 0: 1644.9, 1: 1678.4. Samples: 29043482. Policy #0 lag: (min: 7.0, avg: 15.0, max: 39.0) +[2023-10-14 07:26:03,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:26:06,424][100936] Updated weights for policy 0, policy_version 56680 (0.0009) +[2023-10-14 07:26:06,798][100936] Updated weights for policy 0, policy_version 56690 (0.0009) +[2023-10-14 07:26:07,170][100936] Updated weights for policy 0, policy_version 56700 (0.0008) +[2023-10-14 07:26:07,302][100917] Updated weights for policy 1, policy_version 56772 (0.0007) +[2023-10-14 07:26:07,671][100917] Updated weights for policy 1, policy_version 56782 (0.0009) +[2023-10-14 07:26:08,054][100917] Updated weights for policy 1, policy_version 56792 (0.0007) +[2023-10-14 07:26:08,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 116228096. Throughput: 0: 1647.0, 1: 1662.4. Samples: 29063064. Policy #0 lag: (min: 7.0, avg: 15.0, max: 39.0) +[2023-10-14 07:26:08,512][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:26:11,251][100936] Updated weights for policy 0, policy_version 56710 (0.0010) +[2023-10-14 07:26:11,643][100936] Updated weights for policy 0, policy_version 56720 (0.0008) +[2023-10-14 07:26:12,015][100936] Updated weights for policy 0, policy_version 56730 (0.0007) +[2023-10-14 07:26:12,196][100917] Updated weights for policy 1, policy_version 56802 (0.0008) +[2023-10-14 07:26:12,570][100917] Updated weights for policy 1, policy_version 56812 (0.0007) +[2023-10-14 07:26:12,944][100917] Updated weights for policy 1, policy_version 56822 (0.0008) +[2023-10-14 07:26:13,307][100917] Updated weights for policy 1, policy_version 56832 (0.0007) +[2023-10-14 07:26:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 116293632. Throughput: 0: 1643.1, 1: 1674.3. Samples: 29073524. Policy #0 lag: (min: 7.0, avg: 15.0, max: 39.0) +[2023-10-14 07:26:13,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:26:15,935][100936] Updated weights for policy 0, policy_version 56740 (0.0009) +[2023-10-14 07:26:16,296][100936] Updated weights for policy 0, policy_version 56750 (0.0007) +[2023-10-14 07:26:16,668][100936] Updated weights for policy 0, policy_version 56760 (0.0007) +[2023-10-14 07:26:17,433][100917] Updated weights for policy 1, policy_version 56842 (0.0009) +[2023-10-14 07:26:17,808][100917] Updated weights for policy 1, policy_version 56852 (0.0009) +[2023-10-14 07:26:18,186][100917] Updated weights for policy 1, policy_version 56862 (0.0009) +[2023-10-14 07:26:18,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13653.2, 300 sec: 13329.3). Total num frames: 116359168. Throughput: 0: 1654.1, 1: 1671.4. Samples: 29093268. Policy #0 lag: (min: 7.0, avg: 15.0, max: 39.0) +[2023-10-14 07:26:18,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:26:20,774][100936] Updated weights for policy 0, policy_version 56770 (0.0007) +[2023-10-14 07:26:21,139][100936] Updated weights for policy 0, policy_version 56780 (0.0008) +[2023-10-14 07:26:21,512][100936] Updated weights for policy 0, policy_version 56790 (0.0009) +[2023-10-14 07:26:21,881][100936] Updated weights for policy 0, policy_version 56800 (0.0007) +[2023-10-14 07:26:22,324][100917] Updated weights for policy 1, policy_version 56872 (0.0009) +[2023-10-14 07:26:22,701][100917] Updated weights for policy 1, policy_version 56882 (0.0008) +[2023-10-14 07:26:23,079][100917] Updated weights for policy 1, policy_version 56892 (0.0008) +[2023-10-14 07:26:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 116424704. Throughput: 0: 1654.8, 1: 1649.3. Samples: 29112628. Policy #0 lag: (min: 7.0, avg: 15.0, max: 39.0) +[2023-10-14 07:26:23,514][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:26:26,164][100936] Updated weights for policy 0, policy_version 56810 (0.0009) +[2023-10-14 07:26:26,531][100936] Updated weights for policy 0, policy_version 56820 (0.0008) +[2023-10-14 07:26:26,909][100936] Updated weights for policy 0, policy_version 56830 (0.0009) +[2023-10-14 07:26:27,324][100917] Updated weights for policy 1, policy_version 56902 (0.0008) +[2023-10-14 07:26:27,703][100917] Updated weights for policy 1, policy_version 56912 (0.0007) +[2023-10-14 07:26:28,081][100917] Updated weights for policy 1, policy_version 56922 (0.0007) +[2023-10-14 07:26:28,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 116490240. Throughput: 0: 1640.9, 1: 1662.5. Samples: 29122898. Policy #0 lag: (min: 7.0, avg: 15.0, max: 39.0) +[2023-10-14 07:26:28,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:26:31,109][100936] Updated weights for policy 0, policy_version 56840 (0.0009) +[2023-10-14 07:26:31,485][100936] Updated weights for policy 0, policy_version 56850 (0.0009) +[2023-10-14 07:26:31,855][100936] Updated weights for policy 0, policy_version 56860 (0.0009) +[2023-10-14 07:26:32,311][100917] Updated weights for policy 1, policy_version 56932 (0.0009) +[2023-10-14 07:26:32,695][100917] Updated weights for policy 1, policy_version 56942 (0.0008) +[2023-10-14 07:26:33,061][100917] Updated weights for policy 1, policy_version 56952 (0.0008) +[2023-10-14 07:26:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 116555776. Throughput: 0: 1653.4, 1: 1653.4. Samples: 29142514. Policy #0 lag: (min: 15.0, avg: 15.2, max: 24.0) +[2023-10-14 07:26:33,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:26:36,039][100936] Updated weights for policy 0, policy_version 56870 (0.0008) +[2023-10-14 07:26:36,404][100936] Updated weights for policy 0, policy_version 56880 (0.0008) +[2023-10-14 07:26:36,776][100936] Updated weights for policy 0, policy_version 56890 (0.0009) +[2023-10-14 07:26:37,194][100917] Updated weights for policy 1, policy_version 56962 (0.0008) +[2023-10-14 07:26:37,570][100917] Updated weights for policy 1, policy_version 56972 (0.0008) +[2023-10-14 07:26:37,939][100917] Updated weights for policy 1, policy_version 56982 (0.0009) +[2023-10-14 07:26:38,305][100917] Updated weights for policy 1, policy_version 56992 (0.0009) +[2023-10-14 07:26:38,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 116621312. Throughput: 0: 1656.8, 1: 1639.2. Samples: 29161956. Policy #0 lag: (min: 15.0, avg: 15.2, max: 24.0) +[2023-10-14 07:26:38,512][99942] Avg episode reward: [(0, '0.480'), (1, '1.000')] +[2023-10-14 07:26:40,920][100936] Updated weights for policy 0, policy_version 56900 (0.0010) +[2023-10-14 07:26:41,297][100936] Updated weights for policy 0, policy_version 56910 (0.0010) +[2023-10-14 07:26:41,654][100936] Updated weights for policy 0, policy_version 56920 (0.0007) +[2023-10-14 07:26:42,497][100917] Updated weights for policy 1, policy_version 57002 (0.0008) +[2023-10-14 07:26:42,876][100917] Updated weights for policy 1, policy_version 57012 (0.0008) +[2023-10-14 07:26:43,251][100917] Updated weights for policy 1, policy_version 57022 (0.0008) +[2023-10-14 07:26:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 116686848. Throughput: 0: 1647.4, 1: 1642.8. Samples: 29172164. Policy #0 lag: (min: 15.0, avg: 15.2, max: 24.0) +[2023-10-14 07:26:43,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:26:45,645][100936] Updated weights for policy 0, policy_version 56930 (0.0010) +[2023-10-14 07:26:46,013][100936] Updated weights for policy 0, policy_version 56940 (0.0009) +[2023-10-14 07:26:46,373][100936] Updated weights for policy 0, policy_version 56950 (0.0009) +[2023-10-14 07:26:46,742][100936] Updated weights for policy 0, policy_version 56960 (0.0010) +[2023-10-14 07:26:47,315][100917] Updated weights for policy 1, policy_version 57032 (0.0008) +[2023-10-14 07:26:47,681][100917] Updated weights for policy 1, policy_version 57042 (0.0009) +[2023-10-14 07:26:48,054][100917] Updated weights for policy 1, policy_version 57052 (0.0007) +[2023-10-14 07:26:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 116752384. Throughput: 0: 1659.0, 1: 1648.3. Samples: 29192312. Policy #0 lag: (min: 15.0, avg: 15.2, max: 24.0) +[2023-10-14 07:26:48,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:26:50,880][100936] Updated weights for policy 0, policy_version 56970 (0.0009) +[2023-10-14 07:26:51,242][100936] Updated weights for policy 0, policy_version 56980 (0.0009) +[2023-10-14 07:26:51,617][100936] Updated weights for policy 0, policy_version 56990 (0.0007) +[2023-10-14 07:26:52,165][100917] Updated weights for policy 1, policy_version 57062 (0.0009) +[2023-10-14 07:26:52,545][100917] Updated weights for policy 1, policy_version 57072 (0.0008) +[2023-10-14 07:26:52,917][100917] Updated weights for policy 1, policy_version 57082 (0.0007) +[2023-10-14 07:26:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 116817920. Throughput: 0: 1658.1, 1: 1638.5. Samples: 29211412. Policy #0 lag: (min: 15.0, avg: 15.2, max: 24.0) +[2023-10-14 07:26:53,512][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:26:55,827][100936] Updated weights for policy 0, policy_version 57000 (0.0009) +[2023-10-14 07:26:56,194][100936] Updated weights for policy 0, policy_version 57010 (0.0010) +[2023-10-14 07:26:56,562][100936] Updated weights for policy 0, policy_version 57020 (0.0010) +[2023-10-14 07:26:57,088][100917] Updated weights for policy 1, policy_version 57092 (0.0009) +[2023-10-14 07:26:57,455][100917] Updated weights for policy 1, policy_version 57102 (0.0008) +[2023-10-14 07:26:57,831][100917] Updated weights for policy 1, policy_version 57112 (0.0012) +[2023-10-14 07:26:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 116883456. Throughput: 0: 1645.0, 1: 1647.9. Samples: 29221702. Policy #0 lag: (min: 15.0, avg: 15.2, max: 24.0) +[2023-10-14 07:26:58,513][99942] Avg episode reward: [(0, '0.490'), (1, '1.000')] +[2023-10-14 07:27:00,704][100936] Updated weights for policy 0, policy_version 57030 (0.0009) +[2023-10-14 07:27:01,076][100936] Updated weights for policy 0, policy_version 57040 (0.0007) +[2023-10-14 07:27:01,450][100936] Updated weights for policy 0, policy_version 57050 (0.0010) +[2023-10-14 07:27:02,023][100917] Updated weights for policy 1, policy_version 57122 (0.0010) +[2023-10-14 07:27:02,393][100917] Updated weights for policy 1, policy_version 57132 (0.0011) +[2023-10-14 07:27:02,771][100917] Updated weights for policy 1, policy_version 57142 (0.0010) +[2023-10-14 07:27:03,151][100917] Updated weights for policy 1, policy_version 57152 (0.0011) +[2023-10-14 07:27:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 116948992. Throughput: 0: 1655.4, 1: 1643.8. Samples: 29241732. Policy #0 lag: (min: 15.0, avg: 15.2, max: 24.0) +[2023-10-14 07:27:03,512][99942] Avg episode reward: [(0, '0.660'), (1, '1.000')] +[2023-10-14 07:27:05,598][100936] Updated weights for policy 0, policy_version 57060 (0.0009) +[2023-10-14 07:27:05,973][100936] Updated weights for policy 0, policy_version 57070 (0.0010) +[2023-10-14 07:27:06,352][100936] Updated weights for policy 0, policy_version 57080 (0.0008) +[2023-10-14 07:27:07,134][100917] Updated weights for policy 1, policy_version 57162 (0.0011) +[2023-10-14 07:27:07,508][100917] Updated weights for policy 1, policy_version 57172 (0.0008) +[2023-10-14 07:27:07,872][100917] Updated weights for policy 1, policy_version 57182 (0.0008) +[2023-10-14 07:27:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 117014528. Throughput: 0: 1656.6, 1: 1644.8. Samples: 29261188. Policy #0 lag: (min: 15.0, avg: 15.2, max: 24.0) +[2023-10-14 07:27:08,513][99942] Avg episode reward: [(0, '0.660'), (1, '1.000')] +[2023-10-14 07:27:10,535][100936] Updated weights for policy 0, policy_version 57090 (0.0009) +[2023-10-14 07:27:10,905][100936] Updated weights for policy 0, policy_version 57100 (0.0010) +[2023-10-14 07:27:11,276][100936] Updated weights for policy 0, policy_version 57110 (0.0009) +[2023-10-14 07:27:11,648][100936] Updated weights for policy 0, policy_version 57120 (0.0009) +[2023-10-14 07:27:12,077][100917] Updated weights for policy 1, policy_version 57192 (0.0009) +[2023-10-14 07:27:12,455][100917] Updated weights for policy 1, policy_version 57202 (0.0008) +[2023-10-14 07:27:12,821][100917] Updated weights for policy 1, policy_version 57212 (0.0009) +[2023-10-14 07:27:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 117080064. Throughput: 0: 1652.2, 1: 1654.6. Samples: 29271702. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:27:13,513][99942] Avg episode reward: [(0, '0.810'), (1, '1.000')] +[2023-10-14 07:27:15,712][100936] Updated weights for policy 0, policy_version 57130 (0.0008) +[2023-10-14 07:27:16,078][100936] Updated weights for policy 0, policy_version 57140 (0.0008) +[2023-10-14 07:27:16,450][100936] Updated weights for policy 0, policy_version 57150 (0.0011) +[2023-10-14 07:27:16,871][100917] Updated weights for policy 1, policy_version 57222 (0.0009) +[2023-10-14 07:27:17,257][100917] Updated weights for policy 1, policy_version 57232 (0.0009) +[2023-10-14 07:27:17,626][100917] Updated weights for policy 1, policy_version 57242 (0.0009) +[2023-10-14 07:27:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 117145600. Throughput: 0: 1658.2, 1: 1651.7. Samples: 29291458. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:27:18,513][99942] Avg episode reward: [(0, '0.810'), (1, '1.000')] +[2023-10-14 07:27:20,585][100936] Updated weights for policy 0, policy_version 57160 (0.0008) +[2023-10-14 07:27:20,957][100936] Updated weights for policy 0, policy_version 57170 (0.0007) +[2023-10-14 07:27:21,326][100936] Updated weights for policy 0, policy_version 57180 (0.0010) +[2023-10-14 07:27:21,679][100917] Updated weights for policy 1, policy_version 57252 (0.0009) +[2023-10-14 07:27:22,058][100917] Updated weights for policy 1, policy_version 57262 (0.0008) +[2023-10-14 07:27:22,423][100917] Updated weights for policy 1, policy_version 57272 (0.0009) +[2023-10-14 07:27:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 117211136. Throughput: 0: 1657.8, 1: 1653.3. Samples: 29310958. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:27:23,513][99942] Avg episode reward: [(0, '0.810'), (1, '1.000')] +[2023-10-14 07:27:25,443][100936] Updated weights for policy 0, policy_version 57190 (0.0008) +[2023-10-14 07:27:25,806][100936] Updated weights for policy 0, policy_version 57200 (0.0009) +[2023-10-14 07:27:26,180][100936] Updated weights for policy 0, policy_version 57210 (0.0007) +[2023-10-14 07:27:26,651][100917] Updated weights for policy 1, policy_version 57282 (0.0010) +[2023-10-14 07:27:27,017][100917] Updated weights for policy 1, policy_version 57292 (0.0009) +[2023-10-14 07:27:27,404][100917] Updated weights for policy 1, policy_version 57302 (0.0010) +[2023-10-14 07:27:27,769][100917] Updated weights for policy 1, policy_version 57312 (0.0008) +[2023-10-14 07:27:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 117276672. Throughput: 0: 1647.2, 1: 1664.9. Samples: 29321212. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:27:28,513][99942] Avg episode reward: [(0, '0.810'), (1, '1.000')] +[2023-10-14 07:27:30,309][100936] Updated weights for policy 0, policy_version 57220 (0.0010) +[2023-10-14 07:27:30,678][100936] Updated weights for policy 0, policy_version 57230 (0.0009) +[2023-10-14 07:27:31,055][100936] Updated weights for policy 0, policy_version 57240 (0.0009) +[2023-10-14 07:27:31,718][100917] Updated weights for policy 1, policy_version 57322 (0.0009) +[2023-10-14 07:27:32,084][100917] Updated weights for policy 1, policy_version 57332 (0.0011) +[2023-10-14 07:27:32,455][100917] Updated weights for policy 1, policy_version 57342 (0.0010) +[2023-10-14 07:27:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 117342208. Throughput: 0: 1655.9, 1: 1647.8. Samples: 29340978. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:27:33,512][99942] Avg episode reward: [(0, '0.810'), (1, '1.000')] +[2023-10-14 07:27:35,223][100936] Updated weights for policy 0, policy_version 57250 (0.0009) +[2023-10-14 07:27:35,596][100936] Updated weights for policy 0, policy_version 57260 (0.0007) +[2023-10-14 07:27:35,968][100936] Updated weights for policy 0, policy_version 57270 (0.0009) +[2023-10-14 07:27:36,336][100936] Updated weights for policy 0, policy_version 57280 (0.0008) +[2023-10-14 07:27:36,610][100917] Updated weights for policy 1, policy_version 57352 (0.0010) +[2023-10-14 07:27:36,989][100917] Updated weights for policy 1, policy_version 57362 (0.0009) +[2023-10-14 07:27:37,361][100917] Updated weights for policy 1, policy_version 57372 (0.0009) +[2023-10-14 07:27:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 117407744. Throughput: 0: 1656.0, 1: 1659.1. Samples: 29360590. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:27:38,512][99942] Avg episode reward: [(0, '0.860'), (1, '1.000')] +[2023-10-14 07:27:38,520][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000057280_58654720.pth... +[2023-10-14 07:27:38,520][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000057376_58753024.pth... +[2023-10-14 07:27:38,559][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000055744_57081856.pth +[2023-10-14 07:27:38,560][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000055808_57147392.pth +[2023-10-14 07:27:40,537][100936] Updated weights for policy 0, policy_version 57290 (0.0009) +[2023-10-14 07:27:40,899][100936] Updated weights for policy 0, policy_version 57300 (0.0009) +[2023-10-14 07:27:41,265][100936] Updated weights for policy 0, policy_version 57310 (0.0008) +[2023-10-14 07:27:41,415][100917] Updated weights for policy 1, policy_version 57382 (0.0008) +[2023-10-14 07:27:41,786][100917] Updated weights for policy 1, policy_version 57392 (0.0008) +[2023-10-14 07:27:42,166][100917] Updated weights for policy 1, policy_version 57402 (0.0007) +[2023-10-14 07:27:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 117473280. Throughput: 0: 1647.0, 1: 1663.6. Samples: 29370680. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:27:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:27:45,559][100936] Updated weights for policy 0, policy_version 57320 (0.0007) +[2023-10-14 07:27:45,922][100936] Updated weights for policy 0, policy_version 57330 (0.0007) +[2023-10-14 07:27:46,287][100917] Updated weights for policy 1, policy_version 57412 (0.0009) +[2023-10-14 07:27:46,287][100936] Updated weights for policy 0, policy_version 57340 (0.0008) +[2023-10-14 07:27:46,673][100917] Updated weights for policy 1, policy_version 57422 (0.0010) +[2023-10-14 07:27:47,050][100917] Updated weights for policy 1, policy_version 57432 (0.0009) +[2023-10-14 07:27:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 117538816. Throughput: 0: 1652.4, 1: 1649.8. Samples: 29390330. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:27:48,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:27:50,381][100936] Updated weights for policy 0, policy_version 57350 (0.0008) +[2023-10-14 07:27:50,759][100936] Updated weights for policy 0, policy_version 57360 (0.0009) +[2023-10-14 07:27:50,999][100917] Updated weights for policy 1, policy_version 57442 (0.0010) +[2023-10-14 07:27:51,123][100936] Updated weights for policy 0, policy_version 57370 (0.0008) +[2023-10-14 07:27:51,372][100917] Updated weights for policy 1, policy_version 57452 (0.0009) +[2023-10-14 07:27:51,751][100917] Updated weights for policy 1, policy_version 57462 (0.0009) +[2023-10-14 07:27:52,120][100917] Updated weights for policy 1, policy_version 57472 (0.0007) +[2023-10-14 07:27:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 117604352. Throughput: 0: 1644.8, 1: 1663.6. Samples: 29410062. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:27:53,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:27:55,278][100936] Updated weights for policy 0, policy_version 57380 (0.0009) +[2023-10-14 07:27:55,655][100936] Updated weights for policy 0, policy_version 57390 (0.0011) +[2023-10-14 07:27:56,017][100936] Updated weights for policy 0, policy_version 57400 (0.0010) +[2023-10-14 07:27:56,445][100917] Updated weights for policy 1, policy_version 57482 (0.0010) +[2023-10-14 07:27:56,818][100917] Updated weights for policy 1, policy_version 57492 (0.0008) +[2023-10-14 07:27:57,182][100917] Updated weights for policy 1, policy_version 57502 (0.0007) +[2023-10-14 07:27:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 117669888. Throughput: 0: 1636.5, 1: 1661.5. Samples: 29420112. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:27:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:28:00,121][100936] Updated weights for policy 0, policy_version 57410 (0.0009) +[2023-10-14 07:28:00,485][100936] Updated weights for policy 0, policy_version 57420 (0.0010) +[2023-10-14 07:28:00,861][100936] Updated weights for policy 0, policy_version 57430 (0.0010) +[2023-10-14 07:28:01,225][100936] Updated weights for policy 0, policy_version 57440 (0.0010) +[2023-10-14 07:28:01,455][100917] Updated weights for policy 1, policy_version 57512 (0.0010) +[2023-10-14 07:28:01,822][100917] Updated weights for policy 1, policy_version 57522 (0.0010) +[2023-10-14 07:28:02,200][100917] Updated weights for policy 1, policy_version 57532 (0.0008) +[2023-10-14 07:28:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 117735424. Throughput: 0: 1645.8, 1: 1646.4. Samples: 29439608. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:28:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:28:05,513][100936] Updated weights for policy 0, policy_version 57450 (0.0009) +[2023-10-14 07:28:05,886][100936] Updated weights for policy 0, policy_version 57460 (0.0009) +[2023-10-14 07:28:06,195][100917] Updated weights for policy 1, policy_version 57542 (0.0007) +[2023-10-14 07:28:06,262][100936] Updated weights for policy 0, policy_version 57470 (0.0008) +[2023-10-14 07:28:06,567][100917] Updated weights for policy 1, policy_version 57552 (0.0009) +[2023-10-14 07:28:06,946][100917] Updated weights for policy 1, policy_version 57562 (0.0010) +[2023-10-14 07:28:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 117800960. Throughput: 0: 1641.8, 1: 1659.8. Samples: 29459532. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:28:08,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:28:10,352][100936] Updated weights for policy 0, policy_version 57480 (0.0008) +[2023-10-14 07:28:10,712][100936] Updated weights for policy 0, policy_version 57490 (0.0011) +[2023-10-14 07:28:11,092][100936] Updated weights for policy 0, policy_version 57500 (0.0009) +[2023-10-14 07:28:11,120][100917] Updated weights for policy 1, policy_version 57572 (0.0009) +[2023-10-14 07:28:11,499][100917] Updated weights for policy 1, policy_version 57582 (0.0008) +[2023-10-14 07:28:11,870][100917] Updated weights for policy 1, policy_version 57592 (0.0007) +[2023-10-14 07:28:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 117866496. Throughput: 0: 1642.2, 1: 1661.4. Samples: 29469872. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:28:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:28:15,258][100936] Updated weights for policy 0, policy_version 57510 (0.0009) +[2023-10-14 07:28:15,631][100936] Updated weights for policy 0, policy_version 57520 (0.0007) +[2023-10-14 07:28:15,995][100936] Updated weights for policy 0, policy_version 57530 (0.0008) +[2023-10-14 07:28:16,126][100917] Updated weights for policy 1, policy_version 57602 (0.0010) +[2023-10-14 07:28:16,503][100917] Updated weights for policy 1, policy_version 57612 (0.0009) +[2023-10-14 07:28:16,874][100917] Updated weights for policy 1, policy_version 57622 (0.0008) +[2023-10-14 07:28:17,254][100917] Updated weights for policy 1, policy_version 57632 (0.0008) +[2023-10-14 07:28:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 117932032. Throughput: 0: 1643.4, 1: 1653.4. Samples: 29489334. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:28:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:28:20,072][100936] Updated weights for policy 0, policy_version 57540 (0.0008) +[2023-10-14 07:28:20,447][100936] Updated weights for policy 0, policy_version 57550 (0.0010) +[2023-10-14 07:28:20,805][100936] Updated weights for policy 0, policy_version 57560 (0.0010) +[2023-10-14 07:28:21,242][100917] Updated weights for policy 1, policy_version 57642 (0.0007) +[2023-10-14 07:28:21,603][100917] Updated weights for policy 1, policy_version 57652 (0.0008) +[2023-10-14 07:28:21,980][100917] Updated weights for policy 1, policy_version 57662 (0.0009) +[2023-10-14 07:28:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 117997568. Throughput: 0: 1644.2, 1: 1660.9. Samples: 29509320. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:28:23,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:28:24,916][100936] Updated weights for policy 0, policy_version 57570 (0.0009) +[2023-10-14 07:28:25,285][100936] Updated weights for policy 0, policy_version 57580 (0.0009) +[2023-10-14 07:28:25,661][100936] Updated weights for policy 0, policy_version 57590 (0.0010) +[2023-10-14 07:28:26,037][100917] Updated weights for policy 1, policy_version 57672 (0.0009) +[2023-10-14 07:28:26,043][100936] Updated weights for policy 0, policy_version 57600 (0.0007) +[2023-10-14 07:28:26,407][100917] Updated weights for policy 1, policy_version 57682 (0.0010) +[2023-10-14 07:28:26,778][100917] Updated weights for policy 1, policy_version 57692 (0.0009) +[2023-10-14 07:28:28,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 118063104. Throughput: 0: 1645.4, 1: 1656.2. Samples: 29519250. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:28:28,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:28:30,187][100936] Updated weights for policy 0, policy_version 57610 (0.0010) +[2023-10-14 07:28:30,557][100936] Updated weights for policy 0, policy_version 57620 (0.0010) +[2023-10-14 07:28:30,890][100917] Updated weights for policy 1, policy_version 57702 (0.0007) +[2023-10-14 07:28:30,930][100936] Updated weights for policy 0, policy_version 57630 (0.0009) +[2023-10-14 07:28:31,263][100917] Updated weights for policy 1, policy_version 57712 (0.0009) +[2023-10-14 07:28:31,641][100917] Updated weights for policy 1, policy_version 57722 (0.0009) +[2023-10-14 07:28:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 118128640. Throughput: 0: 1648.7, 1: 1647.2. Samples: 29538648. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 07:28:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:28:35,122][100936] Updated weights for policy 0, policy_version 57640 (0.0007) +[2023-10-14 07:28:35,508][100936] Updated weights for policy 0, policy_version 57650 (0.0009) +[2023-10-14 07:28:35,764][100917] Updated weights for policy 1, policy_version 57732 (0.0008) +[2023-10-14 07:28:35,873][100936] Updated weights for policy 0, policy_version 57660 (0.0007) +[2023-10-14 07:28:36,137][100917] Updated weights for policy 1, policy_version 57742 (0.0009) +[2023-10-14 07:28:36,506][100917] Updated weights for policy 1, policy_version 57752 (0.0008) +[2023-10-14 07:28:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 118194176. Throughput: 0: 1653.4, 1: 1658.7. Samples: 29559104. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 07:28:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:28:39,918][100936] Updated weights for policy 0, policy_version 57670 (0.0009) +[2023-10-14 07:28:40,302][100936] Updated weights for policy 0, policy_version 57680 (0.0007) +[2023-10-14 07:28:40,566][100917] Updated weights for policy 1, policy_version 57762 (0.0009) +[2023-10-14 07:28:40,682][100936] Updated weights for policy 0, policy_version 57690 (0.0010) +[2023-10-14 07:28:40,927][100917] Updated weights for policy 1, policy_version 57772 (0.0007) +[2023-10-14 07:28:41,305][100917] Updated weights for policy 1, policy_version 57782 (0.0007) +[2023-10-14 07:28:41,676][100917] Updated weights for policy 1, policy_version 57792 (0.0009) +[2023-10-14 07:28:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 118259712. Throughput: 0: 1655.4, 1: 1652.9. Samples: 29568984. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 07:28:43,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:28:44,727][100936] Updated weights for policy 0, policy_version 57700 (0.0008) +[2023-10-14 07:28:45,093][100936] Updated weights for policy 0, policy_version 57710 (0.0008) +[2023-10-14 07:28:45,458][100936] Updated weights for policy 0, policy_version 57720 (0.0007) +[2023-10-14 07:28:45,808][100917] Updated weights for policy 1, policy_version 57802 (0.0008) +[2023-10-14 07:28:46,175][100917] Updated weights for policy 1, policy_version 57812 (0.0009) +[2023-10-14 07:28:46,552][100917] Updated weights for policy 1, policy_version 57822 (0.0007) +[2023-10-14 07:28:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 118325248. Throughput: 0: 1656.6, 1: 1659.7. Samples: 29588838. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 07:28:48,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:28:49,523][100936] Updated weights for policy 0, policy_version 57730 (0.0007) +[2023-10-14 07:28:49,889][100936] Updated weights for policy 0, policy_version 57740 (0.0009) +[2023-10-14 07:28:50,262][100936] Updated weights for policy 0, policy_version 57750 (0.0007) +[2023-10-14 07:28:50,635][100936] Updated weights for policy 0, policy_version 57760 (0.0008) +[2023-10-14 07:28:50,883][100917] Updated weights for policy 1, policy_version 57832 (0.0009) +[2023-10-14 07:28:51,255][100917] Updated weights for policy 1, policy_version 57842 (0.0011) +[2023-10-14 07:28:51,629][100917] Updated weights for policy 1, policy_version 57852 (0.0008) +[2023-10-14 07:28:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 118390784. Throughput: 0: 1664.0, 1: 1663.6. Samples: 29609274. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 07:28:53,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:28:54,688][100936] Updated weights for policy 0, policy_version 57770 (0.0007) +[2023-10-14 07:28:55,055][100936] Updated weights for policy 0, policy_version 57780 (0.0007) +[2023-10-14 07:28:55,436][100936] Updated weights for policy 0, policy_version 57790 (0.0008) +[2023-10-14 07:28:55,724][100917] Updated weights for policy 1, policy_version 57862 (0.0008) +[2023-10-14 07:28:56,098][100917] Updated weights for policy 1, policy_version 57872 (0.0008) +[2023-10-14 07:28:56,482][100917] Updated weights for policy 1, policy_version 57882 (0.0009) +[2023-10-14 07:28:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 118456320. Throughput: 0: 1662.2, 1: 1653.9. Samples: 29619096. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 07:28:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:28:59,464][100936] Updated weights for policy 0, policy_version 57800 (0.0009) +[2023-10-14 07:28:59,827][100936] Updated weights for policy 0, policy_version 57810 (0.0008) +[2023-10-14 07:29:00,194][100936] Updated weights for policy 0, policy_version 57820 (0.0007) +[2023-10-14 07:29:00,486][100917] Updated weights for policy 1, policy_version 57892 (0.0009) +[2023-10-14 07:29:00,863][100917] Updated weights for policy 1, policy_version 57902 (0.0008) +[2023-10-14 07:29:01,223][100917] Updated weights for policy 1, policy_version 57912 (0.0009) +[2023-10-14 07:29:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 118521856. Throughput: 0: 1667.0, 1: 1657.2. Samples: 29638922. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 07:29:03,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:29:04,458][100936] Updated weights for policy 0, policy_version 57830 (0.0007) +[2023-10-14 07:29:04,827][100936] Updated weights for policy 0, policy_version 57840 (0.0007) +[2023-10-14 07:29:05,197][100936] Updated weights for policy 0, policy_version 57850 (0.0007) +[2023-10-14 07:29:05,410][100917] Updated weights for policy 1, policy_version 57922 (0.0009) +[2023-10-14 07:29:05,782][100917] Updated weights for policy 1, policy_version 57932 (0.0007) +[2023-10-14 07:29:06,150][100917] Updated weights for policy 1, policy_version 57942 (0.0007) +[2023-10-14 07:29:06,531][100917] Updated weights for policy 1, policy_version 57952 (0.0010) +[2023-10-14 07:29:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 118587392. Throughput: 0: 1672.9, 1: 1664.4. Samples: 29659496. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 07:29:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:29:09,350][100936] Updated weights for policy 0, policy_version 57860 (0.0008) +[2023-10-14 07:29:09,712][100936] Updated weights for policy 0, policy_version 57870 (0.0008) +[2023-10-14 07:29:10,084][100936] Updated weights for policy 0, policy_version 57880 (0.0008) +[2023-10-14 07:29:10,538][100917] Updated weights for policy 1, policy_version 57962 (0.0009) +[2023-10-14 07:29:10,916][100917] Updated weights for policy 1, policy_version 57972 (0.0010) +[2023-10-14 07:29:11,294][100917] Updated weights for policy 1, policy_version 57982 (0.0011) +[2023-10-14 07:29:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 118652928. Throughput: 0: 1673.8, 1: 1651.2. Samples: 29668874. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 07:29:13,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:29:14,269][100936] Updated weights for policy 0, policy_version 57890 (0.0009) +[2023-10-14 07:29:14,643][100936] Updated weights for policy 0, policy_version 57900 (0.0009) +[2023-10-14 07:29:15,009][100936] Updated weights for policy 0, policy_version 57910 (0.0011) +[2023-10-14 07:29:15,258][100917] Updated weights for policy 1, policy_version 57992 (0.0007) +[2023-10-14 07:29:15,378][100936] Updated weights for policy 0, policy_version 57920 (0.0008) +[2023-10-14 07:29:15,631][100917] Updated weights for policy 1, policy_version 58002 (0.0008) +[2023-10-14 07:29:16,018][100917] Updated weights for policy 1, policy_version 58012 (0.0008) +[2023-10-14 07:29:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 118718464. Throughput: 0: 1668.5, 1: 1667.6. Samples: 29688772. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 07:29:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:29:19,506][100936] Updated weights for policy 0, policy_version 57930 (0.0009) +[2023-10-14 07:29:19,872][100936] Updated weights for policy 0, policy_version 57940 (0.0009) +[2023-10-14 07:29:20,154][100917] Updated weights for policy 1, policy_version 58022 (0.0008) +[2023-10-14 07:29:20,249][100936] Updated weights for policy 0, policy_version 57950 (0.0007) +[2023-10-14 07:29:20,526][100917] Updated weights for policy 1, policy_version 58032 (0.0010) +[2023-10-14 07:29:20,898][100917] Updated weights for policy 1, policy_version 58042 (0.0008) +[2023-10-14 07:29:23,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 118784000. Throughput: 0: 1671.0, 1: 1670.2. Samples: 29709458. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 07:29:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:29:24,386][100936] Updated weights for policy 0, policy_version 57960 (0.0010) +[2023-10-14 07:29:24,748][100936] Updated weights for policy 0, policy_version 57970 (0.0010) +[2023-10-14 07:29:25,006][100917] Updated weights for policy 1, policy_version 58052 (0.0009) +[2023-10-14 07:29:25,120][100936] Updated weights for policy 0, policy_version 57980 (0.0008) +[2023-10-14 07:29:25,371][100917] Updated weights for policy 1, policy_version 58062 (0.0009) +[2023-10-14 07:29:25,752][100917] Updated weights for policy 1, policy_version 58072 (0.0009) +[2023-10-14 07:29:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 118849536. Throughput: 0: 1669.5, 1: 1652.0. Samples: 29718452. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 07:29:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:29:29,286][100936] Updated weights for policy 0, policy_version 57990 (0.0007) +[2023-10-14 07:29:29,678][100936] Updated weights for policy 0, policy_version 58000 (0.0010) +[2023-10-14 07:29:29,919][100917] Updated weights for policy 1, policy_version 58082 (0.0009) +[2023-10-14 07:29:30,049][100936] Updated weights for policy 0, policy_version 58010 (0.0007) +[2023-10-14 07:29:30,291][100917] Updated weights for policy 1, policy_version 58092 (0.0008) +[2023-10-14 07:29:30,669][100917] Updated weights for policy 1, policy_version 58102 (0.0007) +[2023-10-14 07:29:31,036][100917] Updated weights for policy 1, policy_version 58112 (0.0007) +[2023-10-14 07:29:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 118915072. Throughput: 0: 1660.5, 1: 1662.1. Samples: 29738356. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 07:29:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:29:34,109][100936] Updated weights for policy 0, policy_version 58020 (0.0007) +[2023-10-14 07:29:34,479][100936] Updated weights for policy 0, policy_version 58030 (0.0010) +[2023-10-14 07:29:34,843][100936] Updated weights for policy 0, policy_version 58040 (0.0010) +[2023-10-14 07:29:35,289][100917] Updated weights for policy 1, policy_version 58122 (0.0008) +[2023-10-14 07:29:35,662][100917] Updated weights for policy 1, policy_version 58132 (0.0008) +[2023-10-14 07:29:36,034][100917] Updated weights for policy 1, policy_version 58142 (0.0009) +[2023-10-14 07:29:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 118980608. Throughput: 0: 1658.0, 1: 1661.7. Samples: 29758664. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 07:29:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:29:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000058048_59441152.pth... +[2023-10-14 07:29:38,522][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000058144_59539456.pth... +[2023-10-14 07:29:38,560][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000056608_57966592.pth +[2023-10-14 07:29:38,562][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000056512_57868288.pth +[2023-10-14 07:29:38,950][100936] Updated weights for policy 0, policy_version 58050 (0.0009) +[2023-10-14 07:29:39,320][100936] Updated weights for policy 0, policy_version 58060 (0.0011) +[2023-10-14 07:29:39,694][100936] Updated weights for policy 0, policy_version 58070 (0.0009) +[2023-10-14 07:29:40,004][100917] Updated weights for policy 1, policy_version 58152 (0.0007) +[2023-10-14 07:29:40,062][100936] Updated weights for policy 0, policy_version 58080 (0.0008) +[2023-10-14 07:29:40,372][100917] Updated weights for policy 1, policy_version 58162 (0.0008) +[2023-10-14 07:29:40,743][100917] Updated weights for policy 1, policy_version 58172 (0.0009) +[2023-10-14 07:29:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 119046144. Throughput: 0: 1654.7, 1: 1642.6. Samples: 29767472. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 07:29:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:29:44,165][100936] Updated weights for policy 0, policy_version 58090 (0.0009) +[2023-10-14 07:29:44,534][100936] Updated weights for policy 0, policy_version 58100 (0.0007) +[2023-10-14 07:29:44,895][100936] Updated weights for policy 0, policy_version 58110 (0.0009) +[2023-10-14 07:29:44,937][100917] Updated weights for policy 1, policy_version 58182 (0.0010) +[2023-10-14 07:29:45,314][100917] Updated weights for policy 1, policy_version 58192 (0.0008) +[2023-10-14 07:29:45,684][100917] Updated weights for policy 1, policy_version 58202 (0.0007) +[2023-10-14 07:29:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 119111680. Throughput: 0: 1649.6, 1: 1658.5. Samples: 29787784. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) +[2023-10-14 07:29:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:29:49,142][100936] Updated weights for policy 0, policy_version 58120 (0.0008) +[2023-10-14 07:29:49,506][100936] Updated weights for policy 0, policy_version 58130 (0.0007) +[2023-10-14 07:29:49,873][100936] Updated weights for policy 0, policy_version 58140 (0.0008) +[2023-10-14 07:29:49,982][100917] Updated weights for policy 1, policy_version 58212 (0.0009) +[2023-10-14 07:29:50,363][100917] Updated weights for policy 1, policy_version 58222 (0.0009) +[2023-10-14 07:29:50,738][100917] Updated weights for policy 1, policy_version 58232 (0.0007) +[2023-10-14 07:29:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 119177216. Throughput: 0: 1644.4, 1: 1653.2. Samples: 29807886. Policy #0 lag: (min: 16.0, avg: 39.7, max: 48.0) +[2023-10-14 07:29:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:29:54,078][100936] Updated weights for policy 0, policy_version 58150 (0.0008) +[2023-10-14 07:29:54,454][100936] Updated weights for policy 0, policy_version 58160 (0.0009) +[2023-10-14 07:29:54,819][100936] Updated weights for policy 0, policy_version 58170 (0.0007) +[2023-10-14 07:29:54,831][100917] Updated weights for policy 1, policy_version 58242 (0.0007) +[2023-10-14 07:29:55,206][100917] Updated weights for policy 1, policy_version 58252 (0.0007) +[2023-10-14 07:29:55,573][100917] Updated weights for policy 1, policy_version 58262 (0.0007) +[2023-10-14 07:29:55,941][100917] Updated weights for policy 1, policy_version 58272 (0.0009) +[2023-10-14 07:29:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 119242752. Throughput: 0: 1644.5, 1: 1646.0. Samples: 29816948. Policy #0 lag: (min: 16.0, avg: 39.7, max: 48.0) +[2023-10-14 07:29:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:29:59,033][100936] Updated weights for policy 0, policy_version 58180 (0.0009) +[2023-10-14 07:29:59,402][100936] Updated weights for policy 0, policy_version 58190 (0.0007) +[2023-10-14 07:29:59,771][100936] Updated weights for policy 0, policy_version 58200 (0.0008) +[2023-10-14 07:30:00,015][100917] Updated weights for policy 1, policy_version 58282 (0.0008) +[2023-10-14 07:30:00,391][100917] Updated weights for policy 1, policy_version 58292 (0.0010) +[2023-10-14 07:30:00,758][100917] Updated weights for policy 1, policy_version 58302 (0.0007) +[2023-10-14 07:30:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 119308288. Throughput: 0: 1647.8, 1: 1652.9. Samples: 29837304. Policy #0 lag: (min: 16.0, avg: 39.7, max: 48.0) +[2023-10-14 07:30:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:30:03,886][100936] Updated weights for policy 0, policy_version 58210 (0.0009) +[2023-10-14 07:30:04,264][100936] Updated weights for policy 0, policy_version 58220 (0.0008) +[2023-10-14 07:30:04,627][100936] Updated weights for policy 0, policy_version 58230 (0.0008) +[2023-10-14 07:30:04,854][100917] Updated weights for policy 1, policy_version 58312 (0.0008) +[2023-10-14 07:30:04,998][100936] Updated weights for policy 0, policy_version 58240 (0.0009) +[2023-10-14 07:30:05,218][100917] Updated weights for policy 1, policy_version 58322 (0.0008) +[2023-10-14 07:30:05,593][100917] Updated weights for policy 1, policy_version 58332 (0.0007) +[2023-10-14 07:30:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 119373824. Throughput: 0: 1649.7, 1: 1645.1. Samples: 29857724. Policy #0 lag: (min: 16.0, avg: 39.7, max: 48.0) +[2023-10-14 07:30:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:30:09,238][100936] Updated weights for policy 0, policy_version 58250 (0.0010) +[2023-10-14 07:30:09,600][100936] Updated weights for policy 0, policy_version 58260 (0.0009) +[2023-10-14 07:30:09,710][100917] Updated weights for policy 1, policy_version 58342 (0.0007) +[2023-10-14 07:30:09,975][100936] Updated weights for policy 0, policy_version 58270 (0.0009) +[2023-10-14 07:30:10,084][100917] Updated weights for policy 1, policy_version 58352 (0.0010) +[2023-10-14 07:30:10,462][100917] Updated weights for policy 1, policy_version 58362 (0.0010) +[2023-10-14 07:30:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 119439360. Throughput: 0: 1647.4, 1: 1645.6. Samples: 29866636. Policy #0 lag: (min: 16.0, avg: 39.7, max: 48.0) +[2023-10-14 07:30:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:30:14,343][100936] Updated weights for policy 0, policy_version 58280 (0.0007) +[2023-10-14 07:30:14,638][100917] Updated weights for policy 1, policy_version 58372 (0.0010) +[2023-10-14 07:30:14,711][100936] Updated weights for policy 0, policy_version 58290 (0.0008) +[2023-10-14 07:30:15,016][100917] Updated weights for policy 1, policy_version 58382 (0.0008) +[2023-10-14 07:30:15,086][100936] Updated weights for policy 0, policy_version 58300 (0.0008) +[2023-10-14 07:30:15,397][100917] Updated weights for policy 1, policy_version 58392 (0.0008) +[2023-10-14 07:30:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 119504896. Throughput: 0: 1649.6, 1: 1646.1. Samples: 29886664. Policy #0 lag: (min: 16.0, avg: 39.7, max: 48.0) +[2023-10-14 07:30:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:30:19,050][100936] Updated weights for policy 0, policy_version 58310 (0.0008) +[2023-10-14 07:30:19,411][100936] Updated weights for policy 0, policy_version 58320 (0.0008) +[2023-10-14 07:30:19,614][100917] Updated weights for policy 1, policy_version 58402 (0.0009) +[2023-10-14 07:30:19,781][100936] Updated weights for policy 0, policy_version 58330 (0.0009) +[2023-10-14 07:30:19,985][100917] Updated weights for policy 1, policy_version 58412 (0.0008) +[2023-10-14 07:30:20,355][100917] Updated weights for policy 1, policy_version 58422 (0.0008) +[2023-10-14 07:30:20,730][100917] Updated weights for policy 1, policy_version 58432 (0.0010) +[2023-10-14 07:30:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 119570432. Throughput: 0: 1649.2, 1: 1650.5. Samples: 29907152. Policy #0 lag: (min: 16.0, avg: 39.7, max: 48.0) +[2023-10-14 07:30:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:30:23,987][100936] Updated weights for policy 0, policy_version 58340 (0.0008) +[2023-10-14 07:30:24,357][100936] Updated weights for policy 0, policy_version 58350 (0.0008) +[2023-10-14 07:30:24,718][100936] Updated weights for policy 0, policy_version 58360 (0.0008) +[2023-10-14 07:30:24,944][100917] Updated weights for policy 1, policy_version 58442 (0.0008) +[2023-10-14 07:30:25,320][100917] Updated weights for policy 1, policy_version 58452 (0.0009) +[2023-10-14 07:30:25,688][100917] Updated weights for policy 1, policy_version 58462 (0.0011) +[2023-10-14 07:30:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 119635968. Throughput: 0: 1653.2, 1: 1648.4. Samples: 29916044. Policy #0 lag: (min: 16.0, avg: 39.7, max: 48.0) +[2023-10-14 07:30:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:30:28,897][100936] Updated weights for policy 0, policy_version 58370 (0.0009) +[2023-10-14 07:30:29,264][100936] Updated weights for policy 0, policy_version 58380 (0.0007) +[2023-10-14 07:30:29,628][100936] Updated weights for policy 0, policy_version 58390 (0.0008) +[2023-10-14 07:30:29,762][100917] Updated weights for policy 1, policy_version 58472 (0.0007) +[2023-10-14 07:30:29,995][100936] Updated weights for policy 0, policy_version 58400 (0.0007) +[2023-10-14 07:30:30,130][100917] Updated weights for policy 1, policy_version 58482 (0.0009) +[2023-10-14 07:30:30,502][100917] Updated weights for policy 1, policy_version 58492 (0.0009) +[2023-10-14 07:30:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 119701504. Throughput: 0: 1655.8, 1: 1649.3. Samples: 29936514. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) +[2023-10-14 07:30:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:30:34,118][100936] Updated weights for policy 0, policy_version 58410 (0.0007) +[2023-10-14 07:30:34,483][100936] Updated weights for policy 0, policy_version 58420 (0.0007) +[2023-10-14 07:30:34,624][100917] Updated weights for policy 1, policy_version 58502 (0.0010) +[2023-10-14 07:30:34,851][100936] Updated weights for policy 0, policy_version 58430 (0.0007) +[2023-10-14 07:30:34,992][100917] Updated weights for policy 1, policy_version 58512 (0.0009) +[2023-10-14 07:30:35,361][100917] Updated weights for policy 1, policy_version 58522 (0.0007) +[2023-10-14 07:30:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 119767040. Throughput: 0: 1654.9, 1: 1653.2. Samples: 29956748. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) +[2023-10-14 07:30:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:30:39,061][100936] Updated weights for policy 0, policy_version 58440 (0.0009) +[2023-10-14 07:30:39,421][100936] Updated weights for policy 0, policy_version 58450 (0.0009) +[2023-10-14 07:30:39,463][100917] Updated weights for policy 1, policy_version 58532 (0.0009) +[2023-10-14 07:30:39,792][100936] Updated weights for policy 0, policy_version 58460 (0.0009) +[2023-10-14 07:30:39,840][100917] Updated weights for policy 1, policy_version 58542 (0.0007) +[2023-10-14 07:30:40,217][100917] Updated weights for policy 1, policy_version 58552 (0.0008) +[2023-10-14 07:30:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 119832576. Throughput: 0: 1655.6, 1: 1650.5. Samples: 29965724. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) +[2023-10-14 07:30:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:30:43,899][100936] Updated weights for policy 0, policy_version 58470 (0.0009) +[2023-10-14 07:30:44,268][100936] Updated weights for policy 0, policy_version 58480 (0.0008) +[2023-10-14 07:30:44,351][100917] Updated weights for policy 1, policy_version 58562 (0.0007) +[2023-10-14 07:30:44,639][100936] Updated weights for policy 0, policy_version 58490 (0.0007) +[2023-10-14 07:30:44,725][100917] Updated weights for policy 1, policy_version 58572 (0.0009) +[2023-10-14 07:30:45,088][100917] Updated weights for policy 1, policy_version 58582 (0.0009) +[2023-10-14 07:30:45,464][100917] Updated weights for policy 1, policy_version 58592 (0.0010) +[2023-10-14 07:30:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 119898112. Throughput: 0: 1651.2, 1: 1659.4. Samples: 29986282. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) +[2023-10-14 07:30:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:30:48,767][100936] Updated weights for policy 0, policy_version 58500 (0.0009) +[2023-10-14 07:30:49,153][100936] Updated weights for policy 0, policy_version 58510 (0.0009) +[2023-10-14 07:30:49,511][100936] Updated weights for policy 0, policy_version 58520 (0.0008) +[2023-10-14 07:30:49,584][100917] Updated weights for policy 1, policy_version 58602 (0.0007) +[2023-10-14 07:30:49,964][100917] Updated weights for policy 1, policy_version 58612 (0.0008) +[2023-10-14 07:30:50,338][100917] Updated weights for policy 1, policy_version 58622 (0.0008) +[2023-10-14 07:30:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 119963648. Throughput: 0: 1649.2, 1: 1660.4. Samples: 30006654. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) +[2023-10-14 07:30:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:30:53,518][100936] Updated weights for policy 0, policy_version 58530 (0.0008) +[2023-10-14 07:30:53,887][100936] Updated weights for policy 0, policy_version 58540 (0.0008) +[2023-10-14 07:30:54,259][100936] Updated weights for policy 0, policy_version 58550 (0.0007) +[2023-10-14 07:30:54,483][100917] Updated weights for policy 1, policy_version 58632 (0.0008) +[2023-10-14 07:30:54,627][100936] Updated weights for policy 0, policy_version 58560 (0.0008) +[2023-10-14 07:30:54,860][100917] Updated weights for policy 1, policy_version 58642 (0.0009) +[2023-10-14 07:30:55,228][100917] Updated weights for policy 1, policy_version 58652 (0.0008) +[2023-10-14 07:30:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 120029184. Throughput: 0: 1654.0, 1: 1659.3. Samples: 30015734. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) +[2023-10-14 07:30:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:30:58,782][100936] Updated weights for policy 0, policy_version 58570 (0.0007) +[2023-10-14 07:30:59,145][100936] Updated weights for policy 0, policy_version 58580 (0.0008) +[2023-10-14 07:30:59,388][100917] Updated weights for policy 1, policy_version 58662 (0.0007) +[2023-10-14 07:30:59,524][100936] Updated weights for policy 0, policy_version 58590 (0.0007) +[2023-10-14 07:30:59,761][100917] Updated weights for policy 1, policy_version 58672 (0.0009) +[2023-10-14 07:31:00,140][100917] Updated weights for policy 1, policy_version 58682 (0.0009) +[2023-10-14 07:31:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13107.2). Total num frames: 120094720. Throughput: 0: 1657.6, 1: 1658.3. Samples: 30035878. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) +[2023-10-14 07:31:03,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:31:03,519][100936] Updated weights for policy 0, policy_version 58600 (0.0007) +[2023-10-14 07:31:03,895][100936] Updated weights for policy 0, policy_version 58610 (0.0008) +[2023-10-14 07:31:04,263][100936] Updated weights for policy 0, policy_version 58620 (0.0007) +[2023-10-14 07:31:04,333][100917] Updated weights for policy 1, policy_version 58692 (0.0008) +[2023-10-14 07:31:04,694][100917] Updated weights for policy 1, policy_version 58702 (0.0007) +[2023-10-14 07:31:05,070][100917] Updated weights for policy 1, policy_version 58712 (0.0007) +[2023-10-14 07:31:08,407][100936] Updated weights for policy 0, policy_version 58630 (0.0007) +[2023-10-14 07:31:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 120160256. Throughput: 0: 1652.6, 1: 1651.7. Samples: 30055844. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) +[2023-10-14 07:31:08,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:31:08,770][100936] Updated weights for policy 0, policy_version 58640 (0.0009) +[2023-10-14 07:31:09,148][100936] Updated weights for policy 0, policy_version 58650 (0.0010) +[2023-10-14 07:31:09,326][100917] Updated weights for policy 1, policy_version 58722 (0.0007) +[2023-10-14 07:31:09,736][100917] Updated weights for policy 1, policy_version 58732 (0.0007) +[2023-10-14 07:31:10,121][100917] Updated weights for policy 1, policy_version 58742 (0.0008) +[2023-10-14 07:31:10,493][100917] Updated weights for policy 1, policy_version 58752 (0.0009) +[2023-10-14 07:31:13,397][100936] Updated weights for policy 0, policy_version 58660 (0.0008) +[2023-10-14 07:31:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 120225792. Throughput: 0: 1655.3, 1: 1652.6. Samples: 30064900. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) +[2023-10-14 07:31:13,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:31:13,772][100936] Updated weights for policy 0, policy_version 58670 (0.0008) +[2023-10-14 07:31:14,149][100936] Updated weights for policy 0, policy_version 58680 (0.0008) +[2023-10-14 07:31:14,603][100917] Updated weights for policy 1, policy_version 58762 (0.0009) +[2023-10-14 07:31:14,977][100917] Updated weights for policy 1, policy_version 58772 (0.0009) +[2023-10-14 07:31:15,344][100917] Updated weights for policy 1, policy_version 58782 (0.0010) +[2023-10-14 07:31:18,302][100936] Updated weights for policy 0, policy_version 58690 (0.0007) +[2023-10-14 07:31:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 120291328. Throughput: 0: 1650.1, 1: 1651.9. Samples: 30085102. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) +[2023-10-14 07:31:18,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:31:18,681][100936] Updated weights for policy 0, policy_version 58700 (0.0008) +[2023-10-14 07:31:19,052][100936] Updated weights for policy 0, policy_version 58710 (0.0009) +[2023-10-14 07:31:19,420][100936] Updated weights for policy 0, policy_version 58720 (0.0007) +[2023-10-14 07:31:19,527][100917] Updated weights for policy 1, policy_version 58792 (0.0009) +[2023-10-14 07:31:19,901][100917] Updated weights for policy 1, policy_version 58802 (0.0011) +[2023-10-14 07:31:20,279][100917] Updated weights for policy 1, policy_version 58812 (0.0011) +[2023-10-14 07:31:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 120356864. Throughput: 0: 1648.5, 1: 1656.9. Samples: 30105494. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) +[2023-10-14 07:31:23,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:31:23,611][100936] Updated weights for policy 0, policy_version 58730 (0.0008) +[2023-10-14 07:31:23,986][100936] Updated weights for policy 0, policy_version 58740 (0.0009) +[2023-10-14 07:31:24,328][100917] Updated weights for policy 1, policy_version 58822 (0.0009) +[2023-10-14 07:31:24,351][100936] Updated weights for policy 0, policy_version 58750 (0.0009) +[2023-10-14 07:31:24,704][100917] Updated weights for policy 1, policy_version 58832 (0.0009) +[2023-10-14 07:31:25,070][100917] Updated weights for policy 1, policy_version 58842 (0.0009) +[2023-10-14 07:31:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 120422400. Throughput: 0: 1655.5, 1: 1656.3. Samples: 30114752. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) +[2023-10-14 07:31:28,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:31:28,656][100936] Updated weights for policy 0, policy_version 58760 (0.0009) +[2023-10-14 07:31:29,026][100936] Updated weights for policy 0, policy_version 58770 (0.0010) +[2023-10-14 07:31:29,131][100917] Updated weights for policy 1, policy_version 58852 (0.0009) +[2023-10-14 07:31:29,387][100936] Updated weights for policy 0, policy_version 58780 (0.0010) +[2023-10-14 07:31:29,506][100917] Updated weights for policy 1, policy_version 58862 (0.0009) +[2023-10-14 07:31:29,871][100917] Updated weights for policy 1, policy_version 58872 (0.0009) +[2023-10-14 07:31:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 120487936. Throughput: 0: 1650.7, 1: 1652.9. Samples: 30134944. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) +[2023-10-14 07:31:33,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:31:33,545][100936] Updated weights for policy 0, policy_version 58790 (0.0008) +[2023-10-14 07:31:33,922][100936] Updated weights for policy 0, policy_version 58800 (0.0007) +[2023-10-14 07:31:33,983][100917] Updated weights for policy 1, policy_version 58882 (0.0007) +[2023-10-14 07:31:34,288][100936] Updated weights for policy 0, policy_version 58810 (0.0007) +[2023-10-14 07:31:34,358][100917] Updated weights for policy 1, policy_version 58892 (0.0009) +[2023-10-14 07:31:34,728][100917] Updated weights for policy 1, policy_version 58902 (0.0008) +[2023-10-14 07:31:35,096][100917] Updated weights for policy 1, policy_version 58912 (0.0007) +[2023-10-14 07:31:38,431][100936] Updated weights for policy 0, policy_version 58820 (0.0009) +[2023-10-14 07:31:38,513][99942] Fps is (10 sec: 13106.6, 60 sec: 13107.1, 300 sec: 13107.2). Total num frames: 120553472. Throughput: 0: 1650.7, 1: 1651.9. Samples: 30155272. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) +[2023-10-14 07:31:38,514][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:31:38,528][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000058912_60325888.pth... +[2023-10-14 07:31:38,556][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000057376_58753024.pth +[2023-10-14 07:31:38,798][100936] Updated weights for policy 0, policy_version 58830 (0.0010) +[2023-10-14 07:31:39,163][100936] Updated weights for policy 0, policy_version 58840 (0.0007) +[2023-10-14 07:31:39,321][100917] Updated weights for policy 1, policy_version 58922 (0.0009) +[2023-10-14 07:31:39,463][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000058848_60260352.pth... +[2023-10-14 07:31:39,497][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000057280_58654720.pth +[2023-10-14 07:31:39,697][100917] Updated weights for policy 1, policy_version 58932 (0.0008) +[2023-10-14 07:31:40,071][100917] Updated weights for policy 1, policy_version 58942 (0.0009) +[2023-10-14 07:31:43,374][100936] Updated weights for policy 0, policy_version 58850 (0.0010) +[2023-10-14 07:31:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 120619008. Throughput: 0: 1651.1, 1: 1652.4. Samples: 30164388. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) +[2023-10-14 07:31:43,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:31:43,765][100936] Updated weights for policy 0, policy_version 58860 (0.0009) +[2023-10-14 07:31:44,136][100936] Updated weights for policy 0, policy_version 58870 (0.0010) +[2023-10-14 07:31:44,160][100917] Updated weights for policy 1, policy_version 58952 (0.0008) +[2023-10-14 07:31:44,495][100936] Updated weights for policy 0, policy_version 58880 (0.0008) +[2023-10-14 07:31:44,535][100917] Updated weights for policy 1, policy_version 58962 (0.0007) +[2023-10-14 07:31:44,900][100917] Updated weights for policy 1, policy_version 58972 (0.0007) +[2023-10-14 07:31:48,512][99942] Fps is (10 sec: 13107.8, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 120684544. Throughput: 0: 1646.8, 1: 1660.6. Samples: 30184708. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) +[2023-10-14 07:31:48,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:31:48,591][100936] Updated weights for policy 0, policy_version 58890 (0.0011) +[2023-10-14 07:31:48,959][100936] Updated weights for policy 0, policy_version 58900 (0.0009) +[2023-10-14 07:31:49,012][100917] Updated weights for policy 1, policy_version 58982 (0.0009) +[2023-10-14 07:31:49,337][100936] Updated weights for policy 0, policy_version 58910 (0.0007) +[2023-10-14 07:31:49,384][100917] Updated weights for policy 1, policy_version 58992 (0.0007) +[2023-10-14 07:31:49,757][100917] Updated weights for policy 1, policy_version 59002 (0.0008) +[2023-10-14 07:31:53,489][100936] Updated weights for policy 0, policy_version 58920 (0.0007) +[2023-10-14 07:31:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 120750080. Throughput: 0: 1648.0, 1: 1663.0. Samples: 30204838. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:31:53,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:31:53,852][100936] Updated weights for policy 0, policy_version 58930 (0.0007) +[2023-10-14 07:31:53,987][100917] Updated weights for policy 1, policy_version 59012 (0.0008) +[2023-10-14 07:31:54,223][100936] Updated weights for policy 0, policy_version 58940 (0.0009) +[2023-10-14 07:31:54,375][100917] Updated weights for policy 1, policy_version 59022 (0.0008) +[2023-10-14 07:31:54,749][100917] Updated weights for policy 1, policy_version 59032 (0.0008) +[2023-10-14 07:31:58,385][100936] Updated weights for policy 0, policy_version 58950 (0.0008) +[2023-10-14 07:31:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 120815616. Throughput: 0: 1648.1, 1: 1663.3. Samples: 30213912. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:31:58,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:31:58,749][100936] Updated weights for policy 0, policy_version 58960 (0.0007) +[2023-10-14 07:31:58,882][100917] Updated weights for policy 1, policy_version 59042 (0.0010) +[2023-10-14 07:31:59,126][100936] Updated weights for policy 0, policy_version 58970 (0.0007) +[2023-10-14 07:31:59,245][100917] Updated weights for policy 1, policy_version 59052 (0.0008) +[2023-10-14 07:31:59,609][100917] Updated weights for policy 1, policy_version 59062 (0.0007) +[2023-10-14 07:31:59,983][100917] Updated weights for policy 1, policy_version 59072 (0.0008) +[2023-10-14 07:32:03,386][100936] Updated weights for policy 0, policy_version 58980 (0.0008) +[2023-10-14 07:32:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 120881152. Throughput: 0: 1650.8, 1: 1664.3. Samples: 30234282. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:32:03,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:32:03,750][100936] Updated weights for policy 0, policy_version 58990 (0.0007) +[2023-10-14 07:32:04,117][100936] Updated weights for policy 0, policy_version 59000 (0.0010) +[2023-10-14 07:32:04,243][100917] Updated weights for policy 1, policy_version 59082 (0.0007) +[2023-10-14 07:32:04,614][100917] Updated weights for policy 1, policy_version 59092 (0.0009) +[2023-10-14 07:32:04,980][100917] Updated weights for policy 1, policy_version 59102 (0.0008) +[2023-10-14 07:32:08,298][100936] Updated weights for policy 0, policy_version 59010 (0.0008) +[2023-10-14 07:32:08,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 120946688. Throughput: 0: 1647.3, 1: 1660.1. Samples: 30254328. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:32:08,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:32:08,671][100936] Updated weights for policy 0, policy_version 59020 (0.0007) +[2023-10-14 07:32:09,010][100917] Updated weights for policy 1, policy_version 59112 (0.0009) +[2023-10-14 07:32:09,030][100936] Updated weights for policy 0, policy_version 59030 (0.0009) +[2023-10-14 07:32:09,385][100917] Updated weights for policy 1, policy_version 59122 (0.0009) +[2023-10-14 07:32:09,403][100936] Updated weights for policy 0, policy_version 59040 (0.0008) +[2023-10-14 07:32:09,762][100917] Updated weights for policy 1, policy_version 59132 (0.0009) +[2023-10-14 07:32:13,435][100936] Updated weights for policy 0, policy_version 59050 (0.0010) +[2023-10-14 07:32:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 121012224. Throughput: 0: 1646.3, 1: 1658.6. Samples: 30263472. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:32:13,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:32:13,801][100936] Updated weights for policy 0, policy_version 59060 (0.0007) +[2023-10-14 07:32:13,921][100917] Updated weights for policy 1, policy_version 59142 (0.0010) +[2023-10-14 07:32:14,174][100936] Updated weights for policy 0, policy_version 59070 (0.0010) +[2023-10-14 07:32:14,290][100917] Updated weights for policy 1, policy_version 59152 (0.0007) +[2023-10-14 07:32:14,670][100917] Updated weights for policy 1, policy_version 59162 (0.0009) +[2023-10-14 07:32:18,289][100936] Updated weights for policy 0, policy_version 59080 (0.0008) +[2023-10-14 07:32:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.1, 300 sec: 13107.2). Total num frames: 121077760. Throughput: 0: 1656.7, 1: 1656.0. Samples: 30284018. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:32:18,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:32:18,652][100936] Updated weights for policy 0, policy_version 59090 (0.0007) +[2023-10-14 07:32:18,672][100917] Updated weights for policy 1, policy_version 59172 (0.0007) +[2023-10-14 07:32:19,026][100936] Updated weights for policy 0, policy_version 59100 (0.0008) +[2023-10-14 07:32:19,052][100917] Updated weights for policy 1, policy_version 59182 (0.0009) +[2023-10-14 07:32:19,420][100917] Updated weights for policy 1, policy_version 59192 (0.0008) +[2023-10-14 07:32:23,082][100936] Updated weights for policy 0, policy_version 59110 (0.0010) +[2023-10-14 07:32:23,446][100936] Updated weights for policy 0, policy_version 59120 (0.0009) +[2023-10-14 07:32:23,485][100917] Updated weights for policy 1, policy_version 59202 (0.0009) +[2023-10-14 07:32:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 121143296. Throughput: 0: 1645.5, 1: 1658.0. Samples: 30303928. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:32:23,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:32:23,824][100936] Updated weights for policy 0, policy_version 59130 (0.0009) +[2023-10-14 07:32:23,854][100917] Updated weights for policy 1, policy_version 59212 (0.0008) +[2023-10-14 07:32:24,229][100917] Updated weights for policy 1, policy_version 59222 (0.0009) +[2023-10-14 07:32:24,596][100917] Updated weights for policy 1, policy_version 59232 (0.0008) +[2023-10-14 07:32:28,014][100936] Updated weights for policy 0, policy_version 59140 (0.0009) +[2023-10-14 07:32:28,400][100936] Updated weights for policy 0, policy_version 59150 (0.0008) +[2023-10-14 07:32:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 121208832. Throughput: 0: 1655.6, 1: 1655.5. Samples: 30313386. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:32:28,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:32:28,685][100917] Updated weights for policy 1, policy_version 59242 (0.0008) +[2023-10-14 07:32:28,763][100936] Updated weights for policy 0, policy_version 59160 (0.0008) +[2023-10-14 07:32:29,050][100917] Updated weights for policy 1, policy_version 59252 (0.0009) +[2023-10-14 07:32:29,425][100917] Updated weights for policy 1, policy_version 59262 (0.0011) +[2023-10-14 07:32:32,996][100936] Updated weights for policy 0, policy_version 59170 (0.0010) +[2023-10-14 07:32:33,408][100936] Updated weights for policy 0, policy_version 59180 (0.0007) +[2023-10-14 07:32:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 121274368. Throughput: 0: 1657.0, 1: 1654.8. Samples: 30333740. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) +[2023-10-14 07:32:33,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:32:33,679][100917] Updated weights for policy 1, policy_version 59272 (0.0007) +[2023-10-14 07:32:33,771][100936] Updated weights for policy 0, policy_version 59190 (0.0008) +[2023-10-14 07:32:34,050][100917] Updated weights for policy 1, policy_version 59282 (0.0007) +[2023-10-14 07:32:34,140][100936] Updated weights for policy 0, policy_version 59200 (0.0008) +[2023-10-14 07:32:34,421][100917] Updated weights for policy 1, policy_version 59292 (0.0009) +[2023-10-14 07:32:38,012][100936] Updated weights for policy 0, policy_version 59210 (0.0010) +[2023-10-14 07:32:38,381][100936] Updated weights for policy 0, policy_version 59220 (0.0010) +[2023-10-14 07:32:38,429][100917] Updated weights for policy 1, policy_version 59302 (0.0008) +[2023-10-14 07:32:38,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 121339904. Throughput: 0: 1640.4, 1: 1658.0. Samples: 30353264. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) +[2023-10-14 07:32:38,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:32:38,751][100936] Updated weights for policy 0, policy_version 59230 (0.0010) +[2023-10-14 07:32:38,800][100917] Updated weights for policy 1, policy_version 59312 (0.0007) +[2023-10-14 07:32:39,184][100917] Updated weights for policy 1, policy_version 59322 (0.0008) +[2023-10-14 07:32:43,008][100936] Updated weights for policy 0, policy_version 59240 (0.0008) +[2023-10-14 07:32:43,375][100936] Updated weights for policy 0, policy_version 59250 (0.0008) +[2023-10-14 07:32:43,449][100917] Updated weights for policy 1, policy_version 59332 (0.0008) +[2023-10-14 07:32:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 121405440. Throughput: 0: 1656.0, 1: 1657.9. Samples: 30363036. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) +[2023-10-14 07:32:43,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:32:43,750][100936] Updated weights for policy 0, policy_version 59260 (0.0009) +[2023-10-14 07:32:43,830][100917] Updated weights for policy 1, policy_version 59342 (0.0008) +[2023-10-14 07:32:44,203][100917] Updated weights for policy 1, policy_version 59352 (0.0008) +[2023-10-14 07:32:47,818][100936] Updated weights for policy 0, policy_version 59270 (0.0008) +[2023-10-14 07:32:48,195][100936] Updated weights for policy 0, policy_version 59280 (0.0008) +[2023-10-14 07:32:48,298][100917] Updated weights for policy 1, policy_version 59362 (0.0008) +[2023-10-14 07:32:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 121470976. Throughput: 0: 1657.2, 1: 1654.1. Samples: 30383288. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) +[2023-10-14 07:32:48,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:32:48,555][100936] Updated weights for policy 0, policy_version 59290 (0.0010) +[2023-10-14 07:32:48,679][100917] Updated weights for policy 1, policy_version 59372 (0.0008) +[2023-10-14 07:32:49,043][100917] Updated weights for policy 1, policy_version 59382 (0.0007) +[2023-10-14 07:32:49,418][100917] Updated weights for policy 1, policy_version 59392 (0.0008) +[2023-10-14 07:32:52,590][100936] Updated weights for policy 0, policy_version 59300 (0.0009) +[2023-10-14 07:32:52,969][100936] Updated weights for policy 0, policy_version 59310 (0.0007) +[2023-10-14 07:32:53,333][100936] Updated weights for policy 0, policy_version 59320 (0.0008) +[2023-10-14 07:32:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 121536512. Throughput: 0: 1646.1, 1: 1653.6. Samples: 30402814. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) +[2023-10-14 07:32:53,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:32:53,583][100917] Updated weights for policy 1, policy_version 59402 (0.0009) +[2023-10-14 07:32:53,961][100917] Updated weights for policy 1, policy_version 59412 (0.0008) +[2023-10-14 07:32:54,334][100917] Updated weights for policy 1, policy_version 59422 (0.0008) +[2023-10-14 07:32:57,506][100936] Updated weights for policy 0, policy_version 59330 (0.0008) +[2023-10-14 07:32:57,858][100936] Updated weights for policy 0, policy_version 59340 (0.0010) +[2023-10-14 07:32:58,222][100936] Updated weights for policy 0, policy_version 59350 (0.0008) +[2023-10-14 07:32:58,486][100917] Updated weights for policy 1, policy_version 59432 (0.0007) +[2023-10-14 07:32:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 121602048. Throughput: 0: 1660.2, 1: 1654.4. Samples: 30412626. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) +[2023-10-14 07:32:58,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:32:58,585][100936] Updated weights for policy 0, policy_version 59360 (0.0009) +[2023-10-14 07:32:58,864][100917] Updated weights for policy 1, policy_version 59442 (0.0008) +[2023-10-14 07:32:59,235][100917] Updated weights for policy 1, policy_version 59452 (0.0007) +[2023-10-14 07:33:02,711][100936] Updated weights for policy 0, policy_version 59370 (0.0008) +[2023-10-14 07:33:03,074][100936] Updated weights for policy 0, policy_version 59380 (0.0008) +[2023-10-14 07:33:03,175][100917] Updated weights for policy 1, policy_version 59462 (0.0008) +[2023-10-14 07:33:03,447][100936] Updated weights for policy 0, policy_version 59390 (0.0009) +[2023-10-14 07:33:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 121667584. Throughput: 0: 1655.6, 1: 1656.1. Samples: 30433046. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) +[2023-10-14 07:33:03,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:33:03,549][100917] Updated weights for policy 1, policy_version 59472 (0.0009) +[2023-10-14 07:33:03,916][100917] Updated weights for policy 1, policy_version 59482 (0.0011) +[2023-10-14 07:33:07,492][100936] Updated weights for policy 0, policy_version 59400 (0.0009) +[2023-10-14 07:33:07,859][100936] Updated weights for policy 0, policy_version 59410 (0.0009) +[2023-10-14 07:33:08,093][100917] Updated weights for policy 1, policy_version 59492 (0.0009) +[2023-10-14 07:33:08,224][100936] Updated weights for policy 0, policy_version 59420 (0.0008) +[2023-10-14 07:33:08,466][100917] Updated weights for policy 1, policy_version 59502 (0.0008) +[2023-10-14 07:33:08,512][99942] Fps is (10 sec: 16383.5, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 121765888. Throughput: 0: 1640.7, 1: 1657.7. Samples: 30452358. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:33:08,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:33:08,838][100917] Updated weights for policy 1, policy_version 59512 (0.0008) +[2023-10-14 07:33:12,497][100936] Updated weights for policy 0, policy_version 59430 (0.0010) +[2023-10-14 07:33:12,861][100936] Updated weights for policy 0, policy_version 59440 (0.0008) +[2023-10-14 07:33:13,002][100917] Updated weights for policy 1, policy_version 59522 (0.0008) +[2023-10-14 07:33:13,232][100936] Updated weights for policy 0, policy_version 59450 (0.0009) +[2023-10-14 07:33:13,381][100917] Updated weights for policy 1, policy_version 59532 (0.0008) +[2023-10-14 07:33:13,512][99942] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 121831424. Throughput: 0: 1649.7, 1: 1657.5. Samples: 30462208. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:33:13,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:33:13,763][100917] Updated weights for policy 1, policy_version 59542 (0.0009) +[2023-10-14 07:33:14,140][100917] Updated weights for policy 1, policy_version 59552 (0.0007) +[2023-10-14 07:33:17,496][100936] Updated weights for policy 0, policy_version 59460 (0.0008) +[2023-10-14 07:33:17,879][100936] Updated weights for policy 0, policy_version 59470 (0.0007) +[2023-10-14 07:33:18,253][100936] Updated weights for policy 0, policy_version 59480 (0.0007) +[2023-10-14 07:33:18,339][100917] Updated weights for policy 1, policy_version 59562 (0.0008) +[2023-10-14 07:33:18,512][99942] Fps is (10 sec: 9830.6, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 121864192. Throughput: 0: 1646.2, 1: 1653.0. Samples: 30482204. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:33:18,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:33:18,709][100917] Updated weights for policy 1, policy_version 59572 (0.0009) +[2023-10-14 07:33:19,071][100917] Updated weights for policy 1, policy_version 59582 (0.0010) +[2023-10-14 07:33:22,504][100936] Updated weights for policy 0, policy_version 59490 (0.0007) +[2023-10-14 07:33:22,872][100936] Updated weights for policy 0, policy_version 59500 (0.0009) +[2023-10-14 07:33:23,054][100917] Updated weights for policy 1, policy_version 59592 (0.0007) +[2023-10-14 07:33:23,242][100936] Updated weights for policy 0, policy_version 59510 (0.0008) +[2023-10-14 07:33:23,426][100917] Updated weights for policy 1, policy_version 59602 (0.0008) +[2023-10-14 07:33:23,512][99942] Fps is (10 sec: 9830.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 121929728. Throughput: 0: 1648.3, 1: 1654.0. Samples: 30501868. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:33:23,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:33:23,608][100936] Updated weights for policy 0, policy_version 59520 (0.0008) +[2023-10-14 07:33:23,799][100917] Updated weights for policy 1, policy_version 59612 (0.0008) +[2023-10-14 07:33:27,562][100936] Updated weights for policy 0, policy_version 59530 (0.0008) +[2023-10-14 07:33:27,916][100917] Updated weights for policy 1, policy_version 59622 (0.0009) +[2023-10-14 07:33:27,923][100936] Updated weights for policy 0, policy_version 59540 (0.0007) +[2023-10-14 07:33:28,288][100917] Updated weights for policy 1, policy_version 59632 (0.0009) +[2023-10-14 07:33:28,301][100936] Updated weights for policy 0, policy_version 59550 (0.0008) +[2023-10-14 07:33:28,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 122028032. Throughput: 0: 1650.2, 1: 1658.8. Samples: 30511938. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:33:28,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:33:28,672][100917] Updated weights for policy 1, policy_version 59642 (0.0007) +[2023-10-14 07:33:32,553][100936] Updated weights for policy 0, policy_version 59560 (0.0008) +[2023-10-14 07:33:32,924][100917] Updated weights for policy 1, policy_version 59652 (0.0008) +[2023-10-14 07:33:32,927][100936] Updated weights for policy 0, policy_version 59570 (0.0009) +[2023-10-14 07:33:33,287][100936] Updated weights for policy 0, policy_version 59580 (0.0008) +[2023-10-14 07:33:33,309][100917] Updated weights for policy 1, policy_version 59662 (0.0008) +[2023-10-14 07:33:33,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 122093568. Throughput: 0: 1648.8, 1: 1659.3. Samples: 30532154. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:33:33,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:33:33,680][100917] Updated weights for policy 1, policy_version 59672 (0.0007) +[2023-10-14 07:33:37,454][100936] Updated weights for policy 0, policy_version 59590 (0.0008) +[2023-10-14 07:33:37,628][100917] Updated weights for policy 1, policy_version 59682 (0.0008) +[2023-10-14 07:33:37,828][100936] Updated weights for policy 0, policy_version 59600 (0.0008) +[2023-10-14 07:33:38,006][100917] Updated weights for policy 1, policy_version 59692 (0.0009) +[2023-10-14 07:33:38,203][100936] Updated weights for policy 0, policy_version 59610 (0.0008) +[2023-10-14 07:33:38,380][100917] Updated weights for policy 1, policy_version 59702 (0.0009) +[2023-10-14 07:33:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 122159104. Throughput: 0: 1642.0, 1: 1654.2. Samples: 30551144. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:33:38,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:33:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000059616_61046784.pth... +[2023-10-14 07:33:38,561][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000058048_59441152.pth +[2023-10-14 07:33:38,744][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000059712_61145088.pth... +[2023-10-14 07:33:38,749][100917] Updated weights for policy 1, policy_version 59712 (0.0009) +[2023-10-14 07:33:38,784][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000058144_59539456.pth +[2023-10-14 07:33:42,363][100936] Updated weights for policy 0, policy_version 59620 (0.0009) +[2023-10-14 07:33:42,726][100936] Updated weights for policy 0, policy_version 59630 (0.0008) +[2023-10-14 07:33:42,945][100917] Updated weights for policy 1, policy_version 59722 (0.0008) +[2023-10-14 07:33:43,094][100936] Updated weights for policy 0, policy_version 59640 (0.0007) +[2023-10-14 07:33:43,315][100917] Updated weights for policy 1, policy_version 59732 (0.0009) +[2023-10-14 07:33:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 122224640. Throughput: 0: 1645.1, 1: 1664.4. Samples: 30561556. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:33:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:33:43,685][100917] Updated weights for policy 1, policy_version 59742 (0.0011) +[2023-10-14 07:33:47,170][100936] Updated weights for policy 0, policy_version 59650 (0.0008) +[2023-10-14 07:33:47,543][100936] Updated weights for policy 0, policy_version 59660 (0.0009) +[2023-10-14 07:33:47,759][100917] Updated weights for policy 1, policy_version 59752 (0.0010) +[2023-10-14 07:33:47,913][100936] Updated weights for policy 0, policy_version 59670 (0.0008) +[2023-10-14 07:33:48,137][100917] Updated weights for policy 1, policy_version 59762 (0.0008) +[2023-10-14 07:33:48,289][100936] Updated weights for policy 0, policy_version 59680 (0.0009) +[2023-10-14 07:33:48,512][100917] Updated weights for policy 1, policy_version 59772 (0.0009) +[2023-10-14 07:33:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 122290176. Throughput: 0: 1640.7, 1: 1662.1. Samples: 30581672. Policy #0 lag: (min: 19.0, avg: 22.1, max: 51.0) +[2023-10-14 07:33:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:33:52,483][100936] Updated weights for policy 0, policy_version 59690 (0.0008) +[2023-10-14 07:33:52,707][100917] Updated weights for policy 1, policy_version 59782 (0.0008) +[2023-10-14 07:33:52,848][100936] Updated weights for policy 0, policy_version 59700 (0.0007) +[2023-10-14 07:33:53,064][100917] Updated weights for policy 1, policy_version 59792 (0.0008) +[2023-10-14 07:33:53,221][100936] Updated weights for policy 0, policy_version 59710 (0.0007) +[2023-10-14 07:33:53,434][100917] Updated weights for policy 1, policy_version 59802 (0.0009) +[2023-10-14 07:33:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 122355712. Throughput: 0: 1643.0, 1: 1653.4. Samples: 30600696. Policy #0 lag: (min: 19.0, avg: 22.1, max: 51.0) +[2023-10-14 07:33:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:33:57,381][100936] Updated weights for policy 0, policy_version 59720 (0.0008) +[2023-10-14 07:33:57,581][100917] Updated weights for policy 1, policy_version 59812 (0.0008) +[2023-10-14 07:33:57,756][100936] Updated weights for policy 0, policy_version 59730 (0.0007) +[2023-10-14 07:33:57,948][100917] Updated weights for policy 1, policy_version 59822 (0.0009) +[2023-10-14 07:33:58,125][100936] Updated weights for policy 0, policy_version 59740 (0.0007) +[2023-10-14 07:33:58,317][100917] Updated weights for policy 1, policy_version 59832 (0.0009) +[2023-10-14 07:33:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 122421248. Throughput: 0: 1649.2, 1: 1667.7. Samples: 30611470. Policy #0 lag: (min: 19.0, avg: 22.1, max: 51.0) +[2023-10-14 07:33:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:34:02,258][100936] Updated weights for policy 0, policy_version 59750 (0.0009) +[2023-10-14 07:34:02,377][100917] Updated weights for policy 1, policy_version 59842 (0.0007) +[2023-10-14 07:34:02,631][100936] Updated weights for policy 0, policy_version 59760 (0.0007) +[2023-10-14 07:34:02,742][100917] Updated weights for policy 1, policy_version 59852 (0.0007) +[2023-10-14 07:34:02,992][100936] Updated weights for policy 0, policy_version 59770 (0.0007) +[2023-10-14 07:34:03,116][100917] Updated weights for policy 1, policy_version 59862 (0.0007) +[2023-10-14 07:34:03,497][100917] Updated weights for policy 1, policy_version 59872 (0.0007) +[2023-10-14 07:34:03,512][99942] Fps is (10 sec: 16384.1, 60 sec: 14199.4, 300 sec: 13329.4). Total num frames: 122519552. Throughput: 0: 1645.6, 1: 1670.9. Samples: 30631448. Policy #0 lag: (min: 19.0, avg: 22.1, max: 51.0) +[2023-10-14 07:34:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:34:07,281][100936] Updated weights for policy 0, policy_version 59780 (0.0007) +[2023-10-14 07:34:07,567][100917] Updated weights for policy 1, policy_version 59882 (0.0008) +[2023-10-14 07:34:07,681][100936] Updated weights for policy 0, policy_version 59790 (0.0010) +[2023-10-14 07:34:07,950][100917] Updated weights for policy 1, policy_version 59892 (0.0009) +[2023-10-14 07:34:08,049][100936] Updated weights for policy 0, policy_version 59800 (0.0008) +[2023-10-14 07:34:08,313][100917] Updated weights for policy 1, policy_version 59902 (0.0007) +[2023-10-14 07:34:08,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 122585088. Throughput: 0: 1637.1, 1: 1649.9. Samples: 30649780. Policy #0 lag: (min: 19.0, avg: 22.1, max: 51.0) +[2023-10-14 07:34:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:34:12,186][100936] Updated weights for policy 0, policy_version 59810 (0.0007) +[2023-10-14 07:34:12,432][100917] Updated weights for policy 1, policy_version 59912 (0.0009) +[2023-10-14 07:34:12,561][100936] Updated weights for policy 0, policy_version 59820 (0.0009) +[2023-10-14 07:34:12,800][100917] Updated weights for policy 1, policy_version 59922 (0.0010) +[2023-10-14 07:34:12,931][100936] Updated weights for policy 0, policy_version 59830 (0.0007) +[2023-10-14 07:34:13,174][100917] Updated weights for policy 1, policy_version 59932 (0.0010) +[2023-10-14 07:34:13,306][100936] Updated weights for policy 0, policy_version 59840 (0.0008) +[2023-10-14 07:34:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 122650624. Throughput: 0: 1643.5, 1: 1664.0. Samples: 30660776. Policy #0 lag: (min: 19.0, avg: 22.1, max: 51.0) +[2023-10-14 07:34:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:34:17,455][100917] Updated weights for policy 1, policy_version 59942 (0.0009) +[2023-10-14 07:34:17,511][100936] Updated weights for policy 0, policy_version 59850 (0.0008) +[2023-10-14 07:34:17,846][100917] Updated weights for policy 1, policy_version 59952 (0.0007) +[2023-10-14 07:34:17,882][100936] Updated weights for policy 0, policy_version 59860 (0.0008) +[2023-10-14 07:34:18,207][100917] Updated weights for policy 1, policy_version 59962 (0.0008) +[2023-10-14 07:34:18,239][100936] Updated weights for policy 0, policy_version 59870 (0.0009) +[2023-10-14 07:34:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13329.4). Total num frames: 122716160. Throughput: 0: 1639.1, 1: 1664.9. Samples: 30680836. Policy #0 lag: (min: 19.0, avg: 22.1, max: 51.0) +[2023-10-14 07:34:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:34:22,340][100936] Updated weights for policy 0, policy_version 59880 (0.0007) +[2023-10-14 07:34:22,403][100917] Updated weights for policy 1, policy_version 59972 (0.0009) +[2023-10-14 07:34:22,707][100936] Updated weights for policy 0, policy_version 59890 (0.0007) +[2023-10-14 07:34:22,776][100917] Updated weights for policy 1, policy_version 59982 (0.0010) +[2023-10-14 07:34:23,072][100936] Updated weights for policy 0, policy_version 59900 (0.0008) +[2023-10-14 07:34:23,148][100917] Updated weights for policy 1, policy_version 59992 (0.0008) +[2023-10-14 07:34:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13329.3). Total num frames: 122781696. Throughput: 0: 1643.0, 1: 1651.9. Samples: 30699414. Policy #0 lag: (min: 19.0, avg: 22.1, max: 51.0) +[2023-10-14 07:34:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:34:27,215][100917] Updated weights for policy 1, policy_version 60002 (0.0007) +[2023-10-14 07:34:27,259][100936] Updated weights for policy 0, policy_version 59910 (0.0009) +[2023-10-14 07:34:27,579][100917] Updated weights for policy 1, policy_version 60012 (0.0008) +[2023-10-14 07:34:27,629][100936] Updated weights for policy 0, policy_version 59920 (0.0008) +[2023-10-14 07:34:27,946][100917] Updated weights for policy 1, policy_version 60022 (0.0009) +[2023-10-14 07:34:27,993][100936] Updated weights for policy 0, policy_version 59930 (0.0009) +[2023-10-14 07:34:28,321][100917] Updated weights for policy 1, policy_version 60032 (0.0009) +[2023-10-14 07:34:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 122847232. Throughput: 0: 1650.1, 1: 1657.8. Samples: 30710412. Policy #0 lag: (min: 31.0, avg: 36.8, max: 63.0) +[2023-10-14 07:34:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:34:32,206][100936] Updated weights for policy 0, policy_version 59940 (0.0008) +[2023-10-14 07:34:32,508][100917] Updated weights for policy 1, policy_version 60042 (0.0009) +[2023-10-14 07:34:32,572][100936] Updated weights for policy 0, policy_version 59950 (0.0008) +[2023-10-14 07:34:32,884][100917] Updated weights for policy 1, policy_version 60052 (0.0007) +[2023-10-14 07:34:32,949][100936] Updated weights for policy 0, policy_version 59960 (0.0009) +[2023-10-14 07:34:33,250][100917] Updated weights for policy 1, policy_version 60062 (0.0007) +[2023-10-14 07:34:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 122912768. Throughput: 0: 1645.0, 1: 1656.7. Samples: 30730248. Policy #0 lag: (min: 31.0, avg: 36.8, max: 63.0) +[2023-10-14 07:34:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:34:37,109][100936] Updated weights for policy 0, policy_version 59970 (0.0009) +[2023-10-14 07:34:37,293][100917] Updated weights for policy 1, policy_version 60072 (0.0009) +[2023-10-14 07:34:37,477][100936] Updated weights for policy 0, policy_version 59980 (0.0008) +[2023-10-14 07:34:37,659][100917] Updated weights for policy 1, policy_version 60082 (0.0008) +[2023-10-14 07:34:37,845][100936] Updated weights for policy 0, policy_version 59990 (0.0007) +[2023-10-14 07:34:38,036][100917] Updated weights for policy 1, policy_version 60092 (0.0008) +[2023-10-14 07:34:38,212][100936] Updated weights for policy 0, policy_version 60000 (0.0008) +[2023-10-14 07:34:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 122978304. Throughput: 0: 1651.2, 1: 1643.9. Samples: 30748976. Policy #0 lag: (min: 31.0, avg: 36.8, max: 63.0) +[2023-10-14 07:34:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:34:42,018][100917] Updated weights for policy 1, policy_version 60102 (0.0008) +[2023-10-14 07:34:42,304][100936] Updated weights for policy 0, policy_version 60010 (0.0007) +[2023-10-14 07:34:42,391][100917] Updated weights for policy 1, policy_version 60112 (0.0007) +[2023-10-14 07:34:42,671][100936] Updated weights for policy 0, policy_version 60020 (0.0008) +[2023-10-14 07:34:42,761][100917] Updated weights for policy 1, policy_version 60122 (0.0009) +[2023-10-14 07:34:43,045][100936] Updated weights for policy 0, policy_version 60030 (0.0008) +[2023-10-14 07:34:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 123043840. Throughput: 0: 1651.5, 1: 1654.1. Samples: 30760222. Policy #0 lag: (min: 31.0, avg: 36.8, max: 63.0) +[2023-10-14 07:34:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:34:46,931][100917] Updated weights for policy 1, policy_version 60132 (0.0009) +[2023-10-14 07:34:47,309][100917] Updated weights for policy 1, policy_version 60142 (0.0007) +[2023-10-14 07:34:47,381][100936] Updated weights for policy 0, policy_version 60040 (0.0009) +[2023-10-14 07:34:47,680][100917] Updated weights for policy 1, policy_version 60152 (0.0008) +[2023-10-14 07:34:47,744][100936] Updated weights for policy 0, policy_version 60050 (0.0010) +[2023-10-14 07:34:48,109][100936] Updated weights for policy 0, policy_version 60060 (0.0008) +[2023-10-14 07:34:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 123109376. Throughput: 0: 1649.1, 1: 1649.6. Samples: 30779892. Policy #0 lag: (min: 31.0, avg: 36.8, max: 63.0) +[2023-10-14 07:34:48,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:34:51,816][100917] Updated weights for policy 1, policy_version 60162 (0.0010) +[2023-10-14 07:34:52,187][100917] Updated weights for policy 1, policy_version 60172 (0.0009) +[2023-10-14 07:34:52,256][100936] Updated weights for policy 0, policy_version 60070 (0.0008) +[2023-10-14 07:34:52,563][100917] Updated weights for policy 1, policy_version 60182 (0.0009) +[2023-10-14 07:34:52,632][100936] Updated weights for policy 0, policy_version 60080 (0.0007) +[2023-10-14 07:34:52,927][100917] Updated weights for policy 1, policy_version 60192 (0.0008) +[2023-10-14 07:34:53,014][100936] Updated weights for policy 0, policy_version 60090 (0.0008) +[2023-10-14 07:34:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 123174912. Throughput: 0: 1649.1, 1: 1644.0. Samples: 30797972. Policy #0 lag: (min: 31.0, avg: 36.8, max: 63.0) +[2023-10-14 07:34:53,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:34:57,082][100936] Updated weights for policy 0, policy_version 60100 (0.0008) +[2023-10-14 07:34:57,217][100917] Updated weights for policy 1, policy_version 60202 (0.0008) +[2023-10-14 07:34:57,450][100936] Updated weights for policy 0, policy_version 60110 (0.0008) +[2023-10-14 07:34:57,596][100917] Updated weights for policy 1, policy_version 60212 (0.0007) +[2023-10-14 07:34:57,815][100936] Updated weights for policy 0, policy_version 60120 (0.0008) +[2023-10-14 07:34:57,965][100917] Updated weights for policy 1, policy_version 60222 (0.0008) +[2023-10-14 07:34:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 123240448. Throughput: 0: 1653.0, 1: 1653.8. Samples: 30809584. Policy #0 lag: (min: 31.0, avg: 36.8, max: 63.0) +[2023-10-14 07:34:58,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:35:02,077][100936] Updated weights for policy 0, policy_version 60130 (0.0010) +[2023-10-14 07:35:02,152][100917] Updated weights for policy 1, policy_version 60232 (0.0008) +[2023-10-14 07:35:02,442][100936] Updated weights for policy 0, policy_version 60140 (0.0008) +[2023-10-14 07:35:02,529][100917] Updated weights for policy 1, policy_version 60242 (0.0007) +[2023-10-14 07:35:02,817][100936] Updated weights for policy 0, policy_version 60150 (0.0008) +[2023-10-14 07:35:02,893][100917] Updated weights for policy 1, policy_version 60252 (0.0008) +[2023-10-14 07:35:03,187][100936] Updated weights for policy 0, policy_version 60160 (0.0008) +[2023-10-14 07:35:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 123305984. Throughput: 0: 1648.0, 1: 1654.5. Samples: 30829448. Policy #0 lag: (min: 31.0, avg: 36.8, max: 63.0) +[2023-10-14 07:35:03,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:35:07,045][100917] Updated weights for policy 1, policy_version 60262 (0.0009) +[2023-10-14 07:35:07,356][100936] Updated weights for policy 0, policy_version 60170 (0.0009) +[2023-10-14 07:35:07,405][100917] Updated weights for policy 1, policy_version 60272 (0.0009) +[2023-10-14 07:35:07,720][100936] Updated weights for policy 0, policy_version 60180 (0.0008) +[2023-10-14 07:35:07,777][100917] Updated weights for policy 1, policy_version 60282 (0.0008) +[2023-10-14 07:35:08,087][100936] Updated weights for policy 0, policy_version 60190 (0.0007) +[2023-10-14 07:35:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 123371520. Throughput: 0: 1652.4, 1: 1646.4. Samples: 30847864. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:35:08,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:35:11,923][100917] Updated weights for policy 1, policy_version 60292 (0.0008) +[2023-10-14 07:35:12,285][100917] Updated weights for policy 1, policy_version 60302 (0.0007) +[2023-10-14 07:35:12,298][100936] Updated weights for policy 0, policy_version 60200 (0.0008) +[2023-10-14 07:35:12,658][100917] Updated weights for policy 1, policy_version 60312 (0.0007) +[2023-10-14 07:35:12,661][100936] Updated weights for policy 0, policy_version 60210 (0.0007) +[2023-10-14 07:35:13,029][100936] Updated weights for policy 0, policy_version 60220 (0.0009) +[2023-10-14 07:35:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 123437056. Throughput: 0: 1647.1, 1: 1655.2. Samples: 30859014. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:35:13,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:35:16,729][100917] Updated weights for policy 1, policy_version 60322 (0.0007) +[2023-10-14 07:35:17,104][100917] Updated weights for policy 1, policy_version 60332 (0.0010) +[2023-10-14 07:35:17,317][100936] Updated weights for policy 0, policy_version 60230 (0.0007) +[2023-10-14 07:35:17,473][100917] Updated weights for policy 1, policy_version 60342 (0.0009) +[2023-10-14 07:35:17,694][100936] Updated weights for policy 0, policy_version 60240 (0.0009) +[2023-10-14 07:35:17,835][100917] Updated weights for policy 1, policy_version 60352 (0.0009) +[2023-10-14 07:35:18,070][100936] Updated weights for policy 0, policy_version 60250 (0.0007) +[2023-10-14 07:35:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 123502592. Throughput: 0: 1649.9, 1: 1645.9. Samples: 30878558. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:35:18,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:35:22,073][100936] Updated weights for policy 0, policy_version 60260 (0.0008) +[2023-10-14 07:35:22,133][100917] Updated weights for policy 1, policy_version 60362 (0.0007) +[2023-10-14 07:35:22,440][100936] Updated weights for policy 0, policy_version 60270 (0.0010) +[2023-10-14 07:35:22,507][100917] Updated weights for policy 1, policy_version 60372 (0.0008) +[2023-10-14 07:35:22,804][100936] Updated weights for policy 0, policy_version 60280 (0.0008) +[2023-10-14 07:35:22,877][100917] Updated weights for policy 1, policy_version 60382 (0.0007) +[2023-10-14 07:35:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 123568128. Throughput: 0: 1641.8, 1: 1643.9. Samples: 30896834. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:35:23,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:35:26,996][100917] Updated weights for policy 1, policy_version 60392 (0.0009) +[2023-10-14 07:35:27,134][100936] Updated weights for policy 0, policy_version 60290 (0.0010) +[2023-10-14 07:35:27,380][100917] Updated weights for policy 1, policy_version 60402 (0.0008) +[2023-10-14 07:35:27,515][100936] Updated weights for policy 0, policy_version 60300 (0.0008) +[2023-10-14 07:35:27,752][100917] Updated weights for policy 1, policy_version 60412 (0.0007) +[2023-10-14 07:35:27,881][100936] Updated weights for policy 0, policy_version 60310 (0.0007) +[2023-10-14 07:35:28,252][100936] Updated weights for policy 0, policy_version 60320 (0.0011) +[2023-10-14 07:35:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 123633664. Throughput: 0: 1637.3, 1: 1645.0. Samples: 30907926. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:35:28,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:35:31,926][100917] Updated weights for policy 1, policy_version 60422 (0.0007) +[2023-10-14 07:35:32,310][100917] Updated weights for policy 1, policy_version 60432 (0.0007) +[2023-10-14 07:35:32,421][100936] Updated weights for policy 0, policy_version 60330 (0.0008) +[2023-10-14 07:35:32,686][100917] Updated weights for policy 1, policy_version 60442 (0.0009) +[2023-10-14 07:35:32,797][100936] Updated weights for policy 0, policy_version 60340 (0.0007) +[2023-10-14 07:35:33,176][100936] Updated weights for policy 0, policy_version 60350 (0.0011) +[2023-10-14 07:35:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 123699200. Throughput: 0: 1641.6, 1: 1642.9. Samples: 30927696. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:35:33,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:35:36,796][100917] Updated weights for policy 1, policy_version 60452 (0.0008) +[2023-10-14 07:35:37,166][100917] Updated weights for policy 1, policy_version 60462 (0.0009) +[2023-10-14 07:35:37,354][100936] Updated weights for policy 0, policy_version 60360 (0.0009) +[2023-10-14 07:35:37,546][100917] Updated weights for policy 1, policy_version 60472 (0.0009) +[2023-10-14 07:35:37,736][100936] Updated weights for policy 0, policy_version 60370 (0.0008) +[2023-10-14 07:35:38,104][100936] Updated weights for policy 0, policy_version 60380 (0.0008) +[2023-10-14 07:35:38,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 123764736. Throughput: 0: 1643.4, 1: 1643.2. Samples: 30945866. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:35:38,512][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:35:38,519][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000060480_61931520.pth... +[2023-10-14 07:35:38,519][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000060384_61833216.pth... +[2023-10-14 07:35:38,549][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000058912_60325888.pth +[2023-10-14 07:35:38,555][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000058848_60260352.pth +[2023-10-14 07:35:41,736][100917] Updated weights for policy 1, policy_version 60482 (0.0008) +[2023-10-14 07:35:42,110][100917] Updated weights for policy 1, policy_version 60492 (0.0010) +[2023-10-14 07:35:42,177][100936] Updated weights for policy 0, policy_version 60390 (0.0008) +[2023-10-14 07:35:42,490][100917] Updated weights for policy 1, policy_version 60502 (0.0009) +[2023-10-14 07:35:42,541][100936] Updated weights for policy 0, policy_version 60400 (0.0009) +[2023-10-14 07:35:42,861][100917] Updated weights for policy 1, policy_version 60512 (0.0009) +[2023-10-14 07:35:42,915][100936] Updated weights for policy 0, policy_version 60410 (0.0008) +[2023-10-14 07:35:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 123830272. Throughput: 0: 1640.6, 1: 1643.3. Samples: 30957362. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:35:43,513][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:35:47,013][100917] Updated weights for policy 1, policy_version 60522 (0.0010) +[2023-10-14 07:35:47,125][100936] Updated weights for policy 0, policy_version 60420 (0.0007) +[2023-10-14 07:35:47,391][100917] Updated weights for policy 1, policy_version 60532 (0.0009) +[2023-10-14 07:35:47,485][100936] Updated weights for policy 0, policy_version 60430 (0.0010) +[2023-10-14 07:35:47,767][100917] Updated weights for policy 1, policy_version 60542 (0.0009) +[2023-10-14 07:35:47,857][100936] Updated weights for policy 0, policy_version 60440 (0.0010) +[2023-10-14 07:35:48,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 123895808. Throughput: 0: 1640.2, 1: 1636.2. Samples: 30976884. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:35:48,513][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:35:51,979][100936] Updated weights for policy 0, policy_version 60450 (0.0008) +[2023-10-14 07:35:52,020][100917] Updated weights for policy 1, policy_version 60552 (0.0009) +[2023-10-14 07:35:52,339][100936] Updated weights for policy 0, policy_version 60460 (0.0008) +[2023-10-14 07:35:52,390][100917] Updated weights for policy 1, policy_version 60562 (0.0009) +[2023-10-14 07:35:52,717][100936] Updated weights for policy 0, policy_version 60470 (0.0008) +[2023-10-14 07:35:52,761][100917] Updated weights for policy 1, policy_version 60572 (0.0007) +[2023-10-14 07:35:53,079][100936] Updated weights for policy 0, policy_version 60480 (0.0009) +[2023-10-14 07:35:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 123961344. Throughput: 0: 1636.4, 1: 1641.4. Samples: 30995364. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:35:53,513][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:35:56,919][100917] Updated weights for policy 1, policy_version 60582 (0.0009) +[2023-10-14 07:35:57,137][100936] Updated weights for policy 0, policy_version 60490 (0.0008) +[2023-10-14 07:35:57,281][100917] Updated weights for policy 1, policy_version 60592 (0.0008) +[2023-10-14 07:35:57,507][100936] Updated weights for policy 0, policy_version 60500 (0.0008) +[2023-10-14 07:35:57,648][100917] Updated weights for policy 1, policy_version 60602 (0.0009) +[2023-10-14 07:35:57,879][100936] Updated weights for policy 0, policy_version 60510 (0.0007) +[2023-10-14 07:35:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 124026880. Throughput: 0: 1640.7, 1: 1638.9. Samples: 31006594. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:35:58,513][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:36:01,789][100917] Updated weights for policy 1, policy_version 60612 (0.0010) +[2023-10-14 07:36:02,152][100936] Updated weights for policy 0, policy_version 60520 (0.0009) +[2023-10-14 07:36:02,155][100917] Updated weights for policy 1, policy_version 60622 (0.0009) +[2023-10-14 07:36:02,513][100936] Updated weights for policy 0, policy_version 60530 (0.0009) +[2023-10-14 07:36:02,529][100917] Updated weights for policy 1, policy_version 60632 (0.0009) +[2023-10-14 07:36:02,882][100936] Updated weights for policy 0, policy_version 60540 (0.0008) +[2023-10-14 07:36:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 124092416. Throughput: 0: 1631.8, 1: 1644.1. Samples: 31025976. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:36:03,513][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:36:06,596][100917] Updated weights for policy 1, policy_version 60642 (0.0007) +[2023-10-14 07:36:06,973][100917] Updated weights for policy 1, policy_version 60652 (0.0010) +[2023-10-14 07:36:07,214][100936] Updated weights for policy 0, policy_version 60550 (0.0007) +[2023-10-14 07:36:07,350][100917] Updated weights for policy 1, policy_version 60662 (0.0007) +[2023-10-14 07:36:07,586][100936] Updated weights for policy 0, policy_version 60560 (0.0009) +[2023-10-14 07:36:07,728][100917] Updated weights for policy 1, policy_version 60672 (0.0007) +[2023-10-14 07:36:07,964][100936] Updated weights for policy 0, policy_version 60570 (0.0010) +[2023-10-14 07:36:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 124157952. Throughput: 0: 1635.2, 1: 1648.8. Samples: 31044610. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:36:08,513][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:36:11,535][100917] Updated weights for policy 1, policy_version 60682 (0.0010) +[2023-10-14 07:36:11,906][100917] Updated weights for policy 1, policy_version 60692 (0.0009) +[2023-10-14 07:36:12,123][100936] Updated weights for policy 0, policy_version 60580 (0.0010) +[2023-10-14 07:36:12,284][100917] Updated weights for policy 1, policy_version 60702 (0.0009) +[2023-10-14 07:36:12,501][100936] Updated weights for policy 0, policy_version 60590 (0.0008) +[2023-10-14 07:36:12,869][100936] Updated weights for policy 0, policy_version 60600 (0.0008) +[2023-10-14 07:36:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 124223488. Throughput: 0: 1638.9, 1: 1656.8. Samples: 31056236. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:36:13,513][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:36:16,509][100917] Updated weights for policy 1, policy_version 60712 (0.0010) +[2023-10-14 07:36:16,881][100917] Updated weights for policy 1, policy_version 60722 (0.0007) +[2023-10-14 07:36:16,954][100936] Updated weights for policy 0, policy_version 60610 (0.0008) +[2023-10-14 07:36:17,246][100917] Updated weights for policy 1, policy_version 60732 (0.0007) +[2023-10-14 07:36:17,323][100936] Updated weights for policy 0, policy_version 60620 (0.0008) +[2023-10-14 07:36:17,691][100936] Updated weights for policy 0, policy_version 60630 (0.0009) +[2023-10-14 07:36:18,054][100936] Updated weights for policy 0, policy_version 60640 (0.0008) +[2023-10-14 07:36:18,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 124289024. Throughput: 0: 1639.2, 1: 1641.5. Samples: 31075328. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:36:18,513][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:36:21,433][100917] Updated weights for policy 1, policy_version 60742 (0.0009) +[2023-10-14 07:36:21,808][100917] Updated weights for policy 1, policy_version 60752 (0.0007) +[2023-10-14 07:36:22,178][100917] Updated weights for policy 1, policy_version 60762 (0.0008) +[2023-10-14 07:36:22,285][100936] Updated weights for policy 0, policy_version 60650 (0.0007) +[2023-10-14 07:36:22,652][100936] Updated weights for policy 0, policy_version 60660 (0.0008) +[2023-10-14 07:36:23,032][100936] Updated weights for policy 0, policy_version 60670 (0.0008) +[2023-10-14 07:36:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 124354560. Throughput: 0: 1640.2, 1: 1657.7. Samples: 31094274. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:36:23,512][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:36:26,206][100917] Updated weights for policy 1, policy_version 60772 (0.0008) +[2023-10-14 07:36:26,572][100917] Updated weights for policy 1, policy_version 60782 (0.0007) +[2023-10-14 07:36:26,956][100917] Updated weights for policy 1, policy_version 60792 (0.0008) +[2023-10-14 07:36:27,053][100936] Updated weights for policy 0, policy_version 60680 (0.0007) +[2023-10-14 07:36:27,428][100936] Updated weights for policy 0, policy_version 60690 (0.0007) +[2023-10-14 07:36:27,795][100936] Updated weights for policy 0, policy_version 60700 (0.0009) +[2023-10-14 07:36:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 124420096. Throughput: 0: 1643.6, 1: 1659.4. Samples: 31105996. Policy #0 lag: (min: 3.0, avg: 3.9, max: 23.0) +[2023-10-14 07:36:28,513][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:36:31,005][100917] Updated weights for policy 1, policy_version 60802 (0.0008) +[2023-10-14 07:36:31,379][100917] Updated weights for policy 1, policy_version 60812 (0.0009) +[2023-10-14 07:36:31,755][100917] Updated weights for policy 1, policy_version 60822 (0.0009) +[2023-10-14 07:36:31,948][100936] Updated weights for policy 0, policy_version 60710 (0.0008) +[2023-10-14 07:36:32,128][100917] Updated weights for policy 1, policy_version 60832 (0.0009) +[2023-10-14 07:36:32,312][100936] Updated weights for policy 0, policy_version 60720 (0.0008) +[2023-10-14 07:36:32,680][100936] Updated weights for policy 0, policy_version 60730 (0.0010) +[2023-10-14 07:36:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 124485632. Throughput: 0: 1638.0, 1: 1648.7. Samples: 31124782. Policy #0 lag: (min: 3.0, avg: 3.9, max: 23.0) +[2023-10-14 07:36:33,512][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:36:36,473][100917] Updated weights for policy 1, policy_version 60842 (0.0008) +[2023-10-14 07:36:36,661][100936] Updated weights for policy 0, policy_version 60740 (0.0010) +[2023-10-14 07:36:36,848][100917] Updated weights for policy 1, policy_version 60852 (0.0008) +[2023-10-14 07:36:37,028][100936] Updated weights for policy 0, policy_version 60750 (0.0008) +[2023-10-14 07:36:37,205][100917] Updated weights for policy 1, policy_version 60862 (0.0008) +[2023-10-14 07:36:37,396][100936] Updated weights for policy 0, policy_version 60760 (0.0009) +[2023-10-14 07:36:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 124551168. Throughput: 0: 1649.9, 1: 1660.0. Samples: 31144310. Policy #0 lag: (min: 3.0, avg: 3.9, max: 23.0) +[2023-10-14 07:36:38,513][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:36:41,480][100917] Updated weights for policy 1, policy_version 60872 (0.0009) +[2023-10-14 07:36:41,539][100936] Updated weights for policy 0, policy_version 60770 (0.0009) +[2023-10-14 07:36:41,848][100917] Updated weights for policy 1, policy_version 60882 (0.0010) +[2023-10-14 07:36:41,911][100936] Updated weights for policy 0, policy_version 60780 (0.0009) +[2023-10-14 07:36:42,225][100917] Updated weights for policy 1, policy_version 60892 (0.0009) +[2023-10-14 07:36:42,270][100936] Updated weights for policy 0, policy_version 60790 (0.0008) +[2023-10-14 07:36:42,642][100936] Updated weights for policy 0, policy_version 60800 (0.0010) +[2023-10-14 07:36:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 124616704. Throughput: 0: 1645.1, 1: 1667.1. Samples: 31155640. Policy #0 lag: (min: 3.0, avg: 3.9, max: 23.0) +[2023-10-14 07:36:43,512][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:36:46,159][100917] Updated weights for policy 1, policy_version 60902 (0.0008) +[2023-10-14 07:36:46,544][100917] Updated weights for policy 1, policy_version 60912 (0.0009) +[2023-10-14 07:36:46,834][100936] Updated weights for policy 0, policy_version 60810 (0.0007) +[2023-10-14 07:36:46,912][100917] Updated weights for policy 1, policy_version 60922 (0.0010) +[2023-10-14 07:36:47,207][100936] Updated weights for policy 0, policy_version 60820 (0.0007) +[2023-10-14 07:36:47,584][100936] Updated weights for policy 0, policy_version 60830 (0.0008) +[2023-10-14 07:36:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 124682240. Throughput: 0: 1638.0, 1: 1653.2. Samples: 31174080. Policy #0 lag: (min: 3.0, avg: 3.9, max: 23.0) +[2023-10-14 07:36:48,513][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:36:51,072][100917] Updated weights for policy 1, policy_version 60932 (0.0008) +[2023-10-14 07:36:51,445][100917] Updated weights for policy 1, policy_version 60942 (0.0011) +[2023-10-14 07:36:51,820][100917] Updated weights for policy 1, policy_version 60952 (0.0007) +[2023-10-14 07:36:51,838][100936] Updated weights for policy 0, policy_version 60840 (0.0009) +[2023-10-14 07:36:52,213][100936] Updated weights for policy 0, policy_version 60850 (0.0009) +[2023-10-14 07:36:52,569][100936] Updated weights for policy 0, policy_version 60860 (0.0008) +[2023-10-14 07:36:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 124747776. Throughput: 0: 1650.3, 1: 1666.7. Samples: 31193874. Policy #0 lag: (min: 3.0, avg: 3.9, max: 23.0) +[2023-10-14 07:36:53,512][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:36:55,881][100917] Updated weights for policy 1, policy_version 60962 (0.0007) +[2023-10-14 07:36:56,262][100917] Updated weights for policy 1, policy_version 60972 (0.0008) +[2023-10-14 07:36:56,625][100917] Updated weights for policy 1, policy_version 60982 (0.0007) +[2023-10-14 07:36:56,846][100936] Updated weights for policy 0, policy_version 60870 (0.0009) +[2023-10-14 07:36:56,998][100917] Updated weights for policy 1, policy_version 60992 (0.0007) +[2023-10-14 07:36:57,215][100936] Updated weights for policy 0, policy_version 60880 (0.0008) +[2023-10-14 07:36:57,587][100936] Updated weights for policy 0, policy_version 60890 (0.0007) +[2023-10-14 07:36:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 124813312. Throughput: 0: 1649.0, 1: 1659.8. Samples: 31205132. Policy #0 lag: (min: 3.0, avg: 3.9, max: 23.0) +[2023-10-14 07:36:58,513][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:37:00,962][100917] Updated weights for policy 1, policy_version 61002 (0.0009) +[2023-10-14 07:37:01,342][100917] Updated weights for policy 1, policy_version 61012 (0.0009) +[2023-10-14 07:37:01,714][100917] Updated weights for policy 1, policy_version 61022 (0.0011) +[2023-10-14 07:37:01,778][100936] Updated weights for policy 0, policy_version 60900 (0.0009) +[2023-10-14 07:37:02,147][100936] Updated weights for policy 0, policy_version 60910 (0.0011) +[2023-10-14 07:37:02,520][100936] Updated weights for policy 0, policy_version 60920 (0.0008) +[2023-10-14 07:37:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 124878848. Throughput: 0: 1636.9, 1: 1661.6. Samples: 31223758. Policy #0 lag: (min: 3.0, avg: 3.9, max: 23.0) +[2023-10-14 07:37:03,513][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:37:05,939][100917] Updated weights for policy 1, policy_version 61032 (0.0007) +[2023-10-14 07:37:06,311][100917] Updated weights for policy 1, policy_version 61042 (0.0007) +[2023-10-14 07:37:06,523][100936] Updated weights for policy 0, policy_version 60930 (0.0008) +[2023-10-14 07:37:06,681][100917] Updated weights for policy 1, policy_version 61052 (0.0008) +[2023-10-14 07:37:06,934][100936] Updated weights for policy 0, policy_version 60940 (0.0007) +[2023-10-14 07:37:07,300][100936] Updated weights for policy 0, policy_version 60950 (0.0007) +[2023-10-14 07:37:07,670][100936] Updated weights for policy 0, policy_version 60960 (0.0010) +[2023-10-14 07:37:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 124944384. Throughput: 0: 1655.1, 1: 1670.2. Samples: 31243910. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:37:08,513][99942] Avg episode reward: [(0, '0.930'), (1, '1.000')] +[2023-10-14 07:37:10,770][100917] Updated weights for policy 1, policy_version 61062 (0.0008) +[2023-10-14 07:37:11,150][100917] Updated weights for policy 1, policy_version 61072 (0.0007) +[2023-10-14 07:37:11,522][100917] Updated weights for policy 1, policy_version 61082 (0.0010) +[2023-10-14 07:37:11,880][100936] Updated weights for policy 0, policy_version 60970 (0.0007) +[2023-10-14 07:37:12,257][100936] Updated weights for policy 0, policy_version 60980 (0.0007) +[2023-10-14 07:37:12,628][100936] Updated weights for policy 0, policy_version 60990 (0.0008) +[2023-10-14 07:37:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 125009920. Throughput: 0: 1650.1, 1: 1661.3. Samples: 31255008. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:37:13,512][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:37:15,542][100917] Updated weights for policy 1, policy_version 61092 (0.0009) +[2023-10-14 07:37:15,918][100917] Updated weights for policy 1, policy_version 61102 (0.0009) +[2023-10-14 07:37:16,284][100917] Updated weights for policy 1, policy_version 61112 (0.0009) +[2023-10-14 07:37:16,697][100936] Updated weights for policy 0, policy_version 61000 (0.0009) +[2023-10-14 07:37:17,063][100936] Updated weights for policy 0, policy_version 61010 (0.0008) +[2023-10-14 07:37:17,436][100936] Updated weights for policy 0, policy_version 61020 (0.0008) +[2023-10-14 07:37:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 125075456. Throughput: 0: 1645.6, 1: 1663.5. Samples: 31273690. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:37:18,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:37:20,356][100917] Updated weights for policy 1, policy_version 61122 (0.0009) +[2023-10-14 07:37:20,724][100917] Updated weights for policy 1, policy_version 61132 (0.0007) +[2023-10-14 07:37:21,092][100917] Updated weights for policy 1, policy_version 61142 (0.0008) +[2023-10-14 07:37:21,461][100917] Updated weights for policy 1, policy_version 61152 (0.0010) +[2023-10-14 07:37:21,473][100936] Updated weights for policy 0, policy_version 61030 (0.0008) +[2023-10-14 07:37:21,844][100936] Updated weights for policy 0, policy_version 61040 (0.0007) +[2023-10-14 07:37:22,213][100936] Updated weights for policy 0, policy_version 61050 (0.0009) +[2023-10-14 07:37:23,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13329.4). Total num frames: 125140992. Throughput: 0: 1649.5, 1: 1674.8. Samples: 31293904. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:37:23,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 07:37:25,629][100917] Updated weights for policy 1, policy_version 61162 (0.0007) +[2023-10-14 07:37:26,000][100917] Updated weights for policy 1, policy_version 61172 (0.0008) +[2023-10-14 07:37:26,371][100917] Updated weights for policy 1, policy_version 61182 (0.0008) +[2023-10-14 07:37:26,451][100936] Updated weights for policy 0, policy_version 61060 (0.0010) +[2023-10-14 07:37:26,834][100936] Updated weights for policy 0, policy_version 61070 (0.0010) +[2023-10-14 07:37:27,195][100936] Updated weights for policy 0, policy_version 61080 (0.0010) +[2023-10-14 07:37:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 125206528. Throughput: 0: 1649.1, 1: 1658.7. Samples: 31304492. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:37:28,513][99942] Avg episode reward: [(0, '0.560'), (1, '1.000')] +[2023-10-14 07:37:30,458][100917] Updated weights for policy 1, policy_version 61192 (0.0008) +[2023-10-14 07:37:30,830][100917] Updated weights for policy 1, policy_version 61202 (0.0008) +[2023-10-14 07:37:31,207][100936] Updated weights for policy 0, policy_version 61090 (0.0010) +[2023-10-14 07:37:31,217][100917] Updated weights for policy 1, policy_version 61212 (0.0009) +[2023-10-14 07:37:31,564][100936] Updated weights for policy 0, policy_version 61100 (0.0007) +[2023-10-14 07:37:31,934][100936] Updated weights for policy 0, policy_version 61110 (0.0007) +[2023-10-14 07:37:32,315][100936] Updated weights for policy 0, policy_version 61120 (0.0007) +[2023-10-14 07:37:33,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 125272064. Throughput: 0: 1650.9, 1: 1665.3. Samples: 31323308. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:37:33,512][99942] Avg episode reward: [(0, '0.560'), (1, '1.000')] +[2023-10-14 07:37:35,347][100917] Updated weights for policy 1, policy_version 61222 (0.0007) +[2023-10-14 07:37:35,721][100917] Updated weights for policy 1, policy_version 61232 (0.0010) +[2023-10-14 07:37:36,085][100917] Updated weights for policy 1, policy_version 61242 (0.0009) +[2023-10-14 07:37:36,364][100936] Updated weights for policy 0, policy_version 61130 (0.0007) +[2023-10-14 07:37:36,727][100936] Updated weights for policy 0, policy_version 61140 (0.0010) +[2023-10-14 07:37:37,103][100936] Updated weights for policy 0, policy_version 61150 (0.0008) +[2023-10-14 07:37:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 125337600. Throughput: 0: 1660.3, 1: 1672.6. Samples: 31343856. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:37:38,513][99942] Avg episode reward: [(0, '0.570'), (1, '1.000')] +[2023-10-14 07:37:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000061152_62619648.pth... +[2023-10-14 07:37:38,524][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000061248_62717952.pth... +[2023-10-14 07:37:38,563][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000059712_61145088.pth +[2023-10-14 07:37:38,564][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000059616_61046784.pth +[2023-10-14 07:37:40,167][100917] Updated weights for policy 1, policy_version 61252 (0.0008) +[2023-10-14 07:37:40,542][100917] Updated weights for policy 1, policy_version 61262 (0.0009) +[2023-10-14 07:37:40,908][100917] Updated weights for policy 1, policy_version 61272 (0.0008) +[2023-10-14 07:37:41,122][100936] Updated weights for policy 0, policy_version 61160 (0.0008) +[2023-10-14 07:37:41,494][100936] Updated weights for policy 0, policy_version 61170 (0.0011) +[2023-10-14 07:37:41,868][100936] Updated weights for policy 0, policy_version 61180 (0.0007) +[2023-10-14 07:37:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 125403136. Throughput: 0: 1651.3, 1: 1654.3. Samples: 31353882. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:37:43,512][99942] Avg episode reward: [(0, '0.570'), (1, '1.000')] +[2023-10-14 07:37:44,982][100917] Updated weights for policy 1, policy_version 61282 (0.0007) +[2023-10-14 07:37:45,357][100917] Updated weights for policy 1, policy_version 61292 (0.0009) +[2023-10-14 07:37:45,737][100917] Updated weights for policy 1, policy_version 61302 (0.0010) +[2023-10-14 07:37:46,110][100917] Updated weights for policy 1, policy_version 61312 (0.0010) +[2023-10-14 07:37:46,187][100936] Updated weights for policy 0, policy_version 61190 (0.0009) +[2023-10-14 07:37:46,552][100936] Updated weights for policy 0, policy_version 61200 (0.0007) +[2023-10-14 07:37:46,920][100936] Updated weights for policy 0, policy_version 61210 (0.0010) +[2023-10-14 07:37:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 125468672. Throughput: 0: 1655.0, 1: 1668.7. Samples: 31373324. Policy #0 lag: (min: 30.0, avg: 35.5, max: 62.0) +[2023-10-14 07:37:48,513][99942] Avg episode reward: [(0, '0.610'), (1, '1.000')] +[2023-10-14 07:37:50,257][100917] Updated weights for policy 1, policy_version 61322 (0.0010) +[2023-10-14 07:37:50,633][100917] Updated weights for policy 1, policy_version 61332 (0.0008) +[2023-10-14 07:37:51,003][100917] Updated weights for policy 1, policy_version 61342 (0.0008) +[2023-10-14 07:37:51,229][100936] Updated weights for policy 0, policy_version 61220 (0.0010) +[2023-10-14 07:37:51,597][100936] Updated weights for policy 0, policy_version 61230 (0.0008) +[2023-10-14 07:37:51,967][100936] Updated weights for policy 0, policy_version 61240 (0.0009) +[2023-10-14 07:37:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 125534208. Throughput: 0: 1661.8, 1: 1669.7. Samples: 31393828. Policy #0 lag: (min: 30.0, avg: 35.5, max: 62.0) +[2023-10-14 07:37:53,513][99942] Avg episode reward: [(0, '0.610'), (1, '1.000')] +[2023-10-14 07:37:55,066][100917] Updated weights for policy 1, policy_version 61352 (0.0009) +[2023-10-14 07:37:55,426][100917] Updated weights for policy 1, policy_version 61362 (0.0009) +[2023-10-14 07:37:55,804][100917] Updated weights for policy 1, policy_version 61372 (0.0008) +[2023-10-14 07:37:56,130][100936] Updated weights for policy 0, policy_version 61250 (0.0009) +[2023-10-14 07:37:56,527][100936] Updated weights for policy 0, policy_version 61260 (0.0010) +[2023-10-14 07:37:56,894][100936] Updated weights for policy 0, policy_version 61270 (0.0009) +[2023-10-14 07:37:57,265][100936] Updated weights for policy 0, policy_version 61280 (0.0008) +[2023-10-14 07:37:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 125599744. Throughput: 0: 1652.0, 1: 1651.2. Samples: 31403650. Policy #0 lag: (min: 30.0, avg: 35.5, max: 62.0) +[2023-10-14 07:37:58,513][99942] Avg episode reward: [(0, '0.610'), (1, '1.000')] +[2023-10-14 07:37:59,793][100917] Updated weights for policy 1, policy_version 61382 (0.0009) +[2023-10-14 07:38:00,171][100917] Updated weights for policy 1, policy_version 61392 (0.0008) +[2023-10-14 07:38:00,543][100917] Updated weights for policy 1, policy_version 61402 (0.0008) +[2023-10-14 07:38:01,444][100936] Updated weights for policy 0, policy_version 61290 (0.0007) +[2023-10-14 07:38:01,811][100936] Updated weights for policy 0, policy_version 61300 (0.0007) +[2023-10-14 07:38:02,176][100936] Updated weights for policy 0, policy_version 61310 (0.0007) +[2023-10-14 07:38:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 125665280. Throughput: 0: 1655.5, 1: 1673.6. Samples: 31423496. Policy #0 lag: (min: 30.0, avg: 35.5, max: 62.0) +[2023-10-14 07:38:03,512][99942] Avg episode reward: [(0, '0.610'), (1, '1.000')] +[2023-10-14 07:38:04,683][100917] Updated weights for policy 1, policy_version 61412 (0.0007) +[2023-10-14 07:38:05,051][100917] Updated weights for policy 1, policy_version 61422 (0.0008) +[2023-10-14 07:38:05,426][100917] Updated weights for policy 1, policy_version 61432 (0.0009) +[2023-10-14 07:38:06,309][100936] Updated weights for policy 0, policy_version 61320 (0.0009) +[2023-10-14 07:38:06,672][100936] Updated weights for policy 0, policy_version 61330 (0.0009) +[2023-10-14 07:38:07,057][100936] Updated weights for policy 0, policy_version 61340 (0.0008) +[2023-10-14 07:38:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 125730816. Throughput: 0: 1655.4, 1: 1675.0. Samples: 31443770. Policy #0 lag: (min: 30.0, avg: 35.5, max: 62.0) +[2023-10-14 07:38:08,513][99942] Avg episode reward: [(0, '0.610'), (1, '1.000')] +[2023-10-14 07:38:09,431][100917] Updated weights for policy 1, policy_version 61442 (0.0009) +[2023-10-14 07:38:09,793][100917] Updated weights for policy 1, policy_version 61452 (0.0008) +[2023-10-14 07:38:10,164][100917] Updated weights for policy 1, policy_version 61462 (0.0009) +[2023-10-14 07:38:10,543][100917] Updated weights for policy 1, policy_version 61472 (0.0007) +[2023-10-14 07:38:11,253][100936] Updated weights for policy 0, policy_version 61350 (0.0009) +[2023-10-14 07:38:11,621][100936] Updated weights for policy 0, policy_version 61360 (0.0010) +[2023-10-14 07:38:11,999][100936] Updated weights for policy 0, policy_version 61370 (0.0010) +[2023-10-14 07:38:13,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 125796352. Throughput: 0: 1656.7, 1: 1665.2. Samples: 31453978. Policy #0 lag: (min: 30.0, avg: 35.5, max: 62.0) +[2023-10-14 07:38:13,513][99942] Avg episode reward: [(0, '0.610'), (1, '1.000')] +[2023-10-14 07:38:14,638][100917] Updated weights for policy 1, policy_version 61482 (0.0009) +[2023-10-14 07:38:15,019][100917] Updated weights for policy 1, policy_version 61492 (0.0012) +[2023-10-14 07:38:15,386][100917] Updated weights for policy 1, policy_version 61502 (0.0010) +[2023-10-14 07:38:15,987][100936] Updated weights for policy 0, policy_version 61380 (0.0010) +[2023-10-14 07:38:16,355][100936] Updated weights for policy 0, policy_version 61390 (0.0010) +[2023-10-14 07:38:16,732][100936] Updated weights for policy 0, policy_version 61400 (0.0010) +[2023-10-14 07:38:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 125861888. Throughput: 0: 1660.6, 1: 1676.1. Samples: 31473462. Policy #0 lag: (min: 30.0, avg: 35.5, max: 62.0) +[2023-10-14 07:38:18,512][99942] Avg episode reward: [(0, '0.610'), (1, '1.000')] +[2023-10-14 07:38:19,597][100917] Updated weights for policy 1, policy_version 61512 (0.0010) +[2023-10-14 07:38:19,970][100917] Updated weights for policy 1, policy_version 61522 (0.0008) +[2023-10-14 07:38:20,341][100917] Updated weights for policy 1, policy_version 61532 (0.0007) +[2023-10-14 07:38:20,639][100936] Updated weights for policy 0, policy_version 61410 (0.0010) +[2023-10-14 07:38:21,012][100936] Updated weights for policy 0, policy_version 61420 (0.0010) +[2023-10-14 07:38:21,379][100936] Updated weights for policy 0, policy_version 61430 (0.0008) +[2023-10-14 07:38:21,743][100936] Updated weights for policy 0, policy_version 61440 (0.0011) +[2023-10-14 07:38:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 125927424. Throughput: 0: 1663.0, 1: 1672.5. Samples: 31493954. Policy #0 lag: (min: 30.0, avg: 35.5, max: 62.0) +[2023-10-14 07:38:23,512][99942] Avg episode reward: [(0, '0.610'), (1, '1.000')] +[2023-10-14 07:38:24,360][100917] Updated weights for policy 1, policy_version 61542 (0.0009) +[2023-10-14 07:38:24,731][100917] Updated weights for policy 1, policy_version 61552 (0.0009) +[2023-10-14 07:38:25,099][100917] Updated weights for policy 1, policy_version 61562 (0.0008) +[2023-10-14 07:38:25,891][100936] Updated weights for policy 0, policy_version 61450 (0.0009) +[2023-10-14 07:38:26,269][100936] Updated weights for policy 0, policy_version 61460 (0.0008) +[2023-10-14 07:38:26,634][100936] Updated weights for policy 0, policy_version 61470 (0.0009) +[2023-10-14 07:38:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 125992960. Throughput: 0: 1653.9, 1: 1663.7. Samples: 31503172. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:38:28,513][99942] Avg episode reward: [(0, '0.610'), (1, '1.000')] +[2023-10-14 07:38:29,252][100917] Updated weights for policy 1, policy_version 61572 (0.0007) +[2023-10-14 07:38:29,625][100917] Updated weights for policy 1, policy_version 61582 (0.0009) +[2023-10-14 07:38:30,002][100917] Updated weights for policy 1, policy_version 61592 (0.0009) +[2023-10-14 07:38:30,660][100936] Updated weights for policy 0, policy_version 61480 (0.0008) +[2023-10-14 07:38:31,028][100936] Updated weights for policy 0, policy_version 61490 (0.0009) +[2023-10-14 07:38:31,402][100936] Updated weights for policy 0, policy_version 61500 (0.0007) +[2023-10-14 07:38:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 126058496. Throughput: 0: 1660.8, 1: 1671.5. Samples: 31523280. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:38:33,513][99942] Avg episode reward: [(0, '0.610'), (1, '1.000')] +[2023-10-14 07:38:34,129][100917] Updated weights for policy 1, policy_version 61602 (0.0010) +[2023-10-14 07:38:34,503][100917] Updated weights for policy 1, policy_version 61612 (0.0010) +[2023-10-14 07:38:34,879][100917] Updated weights for policy 1, policy_version 61622 (0.0008) +[2023-10-14 07:38:35,252][100917] Updated weights for policy 1, policy_version 61632 (0.0008) +[2023-10-14 07:38:35,633][100936] Updated weights for policy 0, policy_version 61510 (0.0007) +[2023-10-14 07:38:36,003][100936] Updated weights for policy 0, policy_version 61520 (0.0008) +[2023-10-14 07:38:36,370][100936] Updated weights for policy 0, policy_version 61530 (0.0007) +[2023-10-14 07:38:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 126124032. Throughput: 0: 1659.9, 1: 1670.4. Samples: 31543688. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:38:38,513][99942] Avg episode reward: [(0, '0.610'), (1, '0.900')] +[2023-10-14 07:38:39,367][100917] Updated weights for policy 1, policy_version 61642 (0.0011) +[2023-10-14 07:38:39,746][100917] Updated weights for policy 1, policy_version 61652 (0.0009) +[2023-10-14 07:38:40,125][100917] Updated weights for policy 1, policy_version 61662 (0.0007) +[2023-10-14 07:38:40,539][100936] Updated weights for policy 0, policy_version 61540 (0.0009) +[2023-10-14 07:38:40,940][100936] Updated weights for policy 0, policy_version 61550 (0.0010) +[2023-10-14 07:38:41,307][100936] Updated weights for policy 0, policy_version 61560 (0.0009) +[2023-10-14 07:38:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 126189568. Throughput: 0: 1644.3, 1: 1670.4. Samples: 31552812. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:38:43,513][99942] Avg episode reward: [(0, '0.610'), (1, '0.900')] +[2023-10-14 07:38:44,215][100917] Updated weights for policy 1, policy_version 61672 (0.0008) +[2023-10-14 07:38:44,599][100917] Updated weights for policy 1, policy_version 61682 (0.0008) +[2023-10-14 07:38:44,968][100917] Updated weights for policy 1, policy_version 61692 (0.0007) +[2023-10-14 07:38:45,349][100936] Updated weights for policy 0, policy_version 61570 (0.0009) +[2023-10-14 07:38:45,725][100936] Updated weights for policy 0, policy_version 61580 (0.0007) +[2023-10-14 07:38:46,090][100936] Updated weights for policy 0, policy_version 61590 (0.0007) +[2023-10-14 07:38:46,456][100936] Updated weights for policy 0, policy_version 61600 (0.0007) +[2023-10-14 07:38:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 126255104. Throughput: 0: 1656.0, 1: 1666.5. Samples: 31573008. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:38:48,513][99942] Avg episode reward: [(0, '0.610'), (1, '0.900')] +[2023-10-14 07:38:49,051][100917] Updated weights for policy 1, policy_version 61702 (0.0007) +[2023-10-14 07:38:49,421][100917] Updated weights for policy 1, policy_version 61712 (0.0008) +[2023-10-14 07:38:49,794][100917] Updated weights for policy 1, policy_version 61722 (0.0009) +[2023-10-14 07:38:50,697][100936] Updated weights for policy 0, policy_version 61610 (0.0009) +[2023-10-14 07:38:51,071][100936] Updated weights for policy 0, policy_version 61620 (0.0009) +[2023-10-14 07:38:51,443][100936] Updated weights for policy 0, policy_version 61630 (0.0010) +[2023-10-14 07:38:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 126320640. Throughput: 0: 1660.8, 1: 1666.4. Samples: 31593496. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:38:53,513][99942] Avg episode reward: [(0, '0.610'), (1, '0.900')] +[2023-10-14 07:38:53,899][100917] Updated weights for policy 1, policy_version 61732 (0.0009) +[2023-10-14 07:38:54,280][100917] Updated weights for policy 1, policy_version 61742 (0.0010) +[2023-10-14 07:38:54,660][100917] Updated weights for policy 1, policy_version 61752 (0.0010) +[2023-10-14 07:38:55,637][100936] Updated weights for policy 0, policy_version 61640 (0.0007) +[2023-10-14 07:38:56,016][100936] Updated weights for policy 0, policy_version 61650 (0.0008) +[2023-10-14 07:38:56,380][100936] Updated weights for policy 0, policy_version 61660 (0.0009) +[2023-10-14 07:38:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 126386176. Throughput: 0: 1640.9, 1: 1664.6. Samples: 31602722. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:38:58,512][99942] Avg episode reward: [(0, '0.610'), (1, '0.900')] +[2023-10-14 07:38:58,740][100917] Updated weights for policy 1, policy_version 61762 (0.0008) +[2023-10-14 07:38:59,119][100917] Updated weights for policy 1, policy_version 61772 (0.0008) +[2023-10-14 07:38:59,504][100917] Updated weights for policy 1, policy_version 61782 (0.0009) +[2023-10-14 07:38:59,883][100917] Updated weights for policy 1, policy_version 61792 (0.0008) +[2023-10-14 07:39:00,624][100936] Updated weights for policy 0, policy_version 61670 (0.0010) +[2023-10-14 07:39:01,001][100936] Updated weights for policy 0, policy_version 61680 (0.0009) +[2023-10-14 07:39:01,371][100936] Updated weights for policy 0, policy_version 61690 (0.0008) +[2023-10-14 07:39:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 126451712. Throughput: 0: 1653.4, 1: 1669.3. Samples: 31622982. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:39:03,513][99942] Avg episode reward: [(0, '0.610'), (1, '0.900')] +[2023-10-14 07:39:04,011][100917] Updated weights for policy 1, policy_version 61802 (0.0007) +[2023-10-14 07:39:04,387][100917] Updated weights for policy 1, policy_version 61812 (0.0009) +[2023-10-14 07:39:04,759][100917] Updated weights for policy 1, policy_version 61822 (0.0010) +[2023-10-14 07:39:05,544][100936] Updated weights for policy 0, policy_version 61700 (0.0010) +[2023-10-14 07:39:05,909][100936] Updated weights for policy 0, policy_version 61710 (0.0010) +[2023-10-14 07:39:06,279][100936] Updated weights for policy 0, policy_version 61720 (0.0010) +[2023-10-14 07:39:08,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 126517248. Throughput: 0: 1648.6, 1: 1672.7. Samples: 31643414. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:39:08,513][99942] Avg episode reward: [(0, '0.610'), (1, '0.900')] +[2023-10-14 07:39:08,878][100917] Updated weights for policy 1, policy_version 61832 (0.0008) +[2023-10-14 07:39:09,245][100917] Updated weights for policy 1, policy_version 61842 (0.0007) +[2023-10-14 07:39:09,628][100917] Updated weights for policy 1, policy_version 61852 (0.0009) +[2023-10-14 07:39:10,303][100936] Updated weights for policy 0, policy_version 61730 (0.0009) +[2023-10-14 07:39:10,675][100936] Updated weights for policy 0, policy_version 61740 (0.0007) +[2023-10-14 07:39:11,050][100936] Updated weights for policy 0, policy_version 61750 (0.0008) +[2023-10-14 07:39:11,421][100936] Updated weights for policy 0, policy_version 61760 (0.0009) +[2023-10-14 07:39:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 126582784. Throughput: 0: 1646.5, 1: 1675.5. Samples: 31652660. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:39:13,512][99942] Avg episode reward: [(0, '0.600'), (1, '0.900')] +[2023-10-14 07:39:13,525][100917] Updated weights for policy 1, policy_version 61862 (0.0009) +[2023-10-14 07:39:13,896][100917] Updated weights for policy 1, policy_version 61872 (0.0009) +[2023-10-14 07:39:14,274][100917] Updated weights for policy 1, policy_version 61882 (0.0008) +[2023-10-14 07:39:15,321][100936] Updated weights for policy 0, policy_version 61770 (0.0009) +[2023-10-14 07:39:15,701][100936] Updated weights for policy 0, policy_version 61780 (0.0009) +[2023-10-14 07:39:16,073][100936] Updated weights for policy 0, policy_version 61790 (0.0007) +[2023-10-14 07:39:18,421][100917] Updated weights for policy 1, policy_version 61892 (0.0008) +[2023-10-14 07:39:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 126648320. Throughput: 0: 1657.1, 1: 1675.6. Samples: 31673254. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:39:18,512][99942] Avg episode reward: [(0, '0.600'), (1, '0.900')] +[2023-10-14 07:39:18,793][100917] Updated weights for policy 1, policy_version 61902 (0.0008) +[2023-10-14 07:39:19,171][100917] Updated weights for policy 1, policy_version 61912 (0.0009) +[2023-10-14 07:39:20,141][100936] Updated weights for policy 0, policy_version 61800 (0.0007) +[2023-10-14 07:39:20,508][100936] Updated weights for policy 0, policy_version 61810 (0.0009) +[2023-10-14 07:39:20,884][100936] Updated weights for policy 0, policy_version 61820 (0.0011) +[2023-10-14 07:39:23,025][100917] Updated weights for policy 1, policy_version 61922 (0.0007) +[2023-10-14 07:39:23,395][100917] Updated weights for policy 1, policy_version 61932 (0.0007) +[2023-10-14 07:39:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 126713856. Throughput: 0: 1662.9, 1: 1677.3. Samples: 31693998. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:39:23,513][99942] Avg episode reward: [(0, '0.600'), (1, '0.900')] +[2023-10-14 07:39:23,776][100917] Updated weights for policy 1, policy_version 61942 (0.0008) +[2023-10-14 07:39:24,142][100917] Updated weights for policy 1, policy_version 61952 (0.0010) +[2023-10-14 07:39:25,043][100936] Updated weights for policy 0, policy_version 61830 (0.0008) +[2023-10-14 07:39:25,404][100936] Updated weights for policy 0, policy_version 61840 (0.0008) +[2023-10-14 07:39:25,770][100936] Updated weights for policy 0, policy_version 61850 (0.0007) +[2023-10-14 07:39:28,207][100917] Updated weights for policy 1, policy_version 61962 (0.0009) +[2023-10-14 07:39:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 126779392. Throughput: 0: 1660.8, 1: 1677.0. Samples: 31703014. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:39:28,513][99942] Avg episode reward: [(0, '0.600'), (1, '0.900')] +[2023-10-14 07:39:28,586][100917] Updated weights for policy 1, policy_version 61972 (0.0008) +[2023-10-14 07:39:28,953][100917] Updated weights for policy 1, policy_version 61982 (0.0007) +[2023-10-14 07:39:30,155][100936] Updated weights for policy 0, policy_version 61860 (0.0008) +[2023-10-14 07:39:30,539][100936] Updated weights for policy 0, policy_version 61870 (0.0010) +[2023-10-14 07:39:30,915][100936] Updated weights for policy 0, policy_version 61880 (0.0011) +[2023-10-14 07:39:33,219][100917] Updated weights for policy 1, policy_version 61992 (0.0008) +[2023-10-14 07:39:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 126844928. Throughput: 0: 1660.4, 1: 1675.2. Samples: 31723108. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:39:33,512][99942] Avg episode reward: [(0, '0.600'), (1, '0.900')] +[2023-10-14 07:39:33,574][100917] Updated weights for policy 1, policy_version 62002 (0.0008) +[2023-10-14 07:39:33,938][100917] Updated weights for policy 1, policy_version 62012 (0.0008) +[2023-10-14 07:39:34,978][100936] Updated weights for policy 0, policy_version 61890 (0.0010) +[2023-10-14 07:39:35,354][100936] Updated weights for policy 0, policy_version 61900 (0.0007) +[2023-10-14 07:39:35,727][100936] Updated weights for policy 0, policy_version 61910 (0.0007) +[2023-10-14 07:39:36,093][100936] Updated weights for policy 0, policy_version 61920 (0.0007) +[2023-10-14 07:39:38,027][100917] Updated weights for policy 1, policy_version 62022 (0.0008) +[2023-10-14 07:39:38,384][100917] Updated weights for policy 1, policy_version 62032 (0.0007) +[2023-10-14 07:39:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 126910464. Throughput: 0: 1663.9, 1: 1667.8. Samples: 31743422. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:39:38,513][99942] Avg episode reward: [(0, '0.600'), (1, '0.900')] +[2023-10-14 07:39:38,519][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000061920_63406080.pth... +[2023-10-14 07:39:38,554][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000060384_61833216.pth +[2023-10-14 07:39:38,559][100560] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p0/milestones/checkpoint_000061920_63406080.pth +[2023-10-14 07:39:38,764][100917] Updated weights for policy 1, policy_version 62042 (0.0007) +[2023-10-14 07:39:38,981][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000062048_63537152.pth... +[2023-10-14 07:39:39,010][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000060480_61931520.pth +[2023-10-14 07:39:39,014][100681] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p1/milestones/checkpoint_000062048_63537152.pth +[2023-10-14 07:39:40,017][100936] Updated weights for policy 0, policy_version 61930 (0.0007) +[2023-10-14 07:39:40,395][100936] Updated weights for policy 0, policy_version 61940 (0.0007) +[2023-10-14 07:39:40,751][100936] Updated weights for policy 0, policy_version 61950 (0.0007) +[2023-10-14 07:39:42,962][100917] Updated weights for policy 1, policy_version 62052 (0.0008) +[2023-10-14 07:39:43,326][100917] Updated weights for policy 1, policy_version 62062 (0.0008) +[2023-10-14 07:39:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 126976000. Throughput: 0: 1660.7, 1: 1668.9. Samples: 31752552. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:39:43,512][99942] Avg episode reward: [(0, '0.600'), (1, '0.900')] +[2023-10-14 07:39:43,700][100917] Updated weights for policy 1, policy_version 62072 (0.0008) +[2023-10-14 07:39:44,896][100936] Updated weights for policy 0, policy_version 61960 (0.0010) +[2023-10-14 07:39:45,271][100936] Updated weights for policy 0, policy_version 61970 (0.0008) +[2023-10-14 07:39:45,637][100936] Updated weights for policy 0, policy_version 61980 (0.0007) +[2023-10-14 07:39:47,711][100917] Updated weights for policy 1, policy_version 62082 (0.0008) +[2023-10-14 07:39:48,080][100917] Updated weights for policy 1, policy_version 62092 (0.0008) +[2023-10-14 07:39:48,452][100917] Updated weights for policy 1, policy_version 62102 (0.0009) +[2023-10-14 07:39:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 127041536. Throughput: 0: 1666.3, 1: 1668.0. Samples: 31773026. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 07:39:48,513][99942] Avg episode reward: [(0, '0.600'), (1, '0.900')] +[2023-10-14 07:39:48,824][100917] Updated weights for policy 1, policy_version 62112 (0.0010) +[2023-10-14 07:39:49,870][100936] Updated weights for policy 0, policy_version 61990 (0.0008) +[2023-10-14 07:39:50,229][100936] Updated weights for policy 0, policy_version 62000 (0.0009) +[2023-10-14 07:39:50,594][100936] Updated weights for policy 0, policy_version 62010 (0.0010) +[2023-10-14 07:39:52,997][100917] Updated weights for policy 1, policy_version 62122 (0.0007) +[2023-10-14 07:39:53,379][100917] Updated weights for policy 1, policy_version 62132 (0.0010) +[2023-10-14 07:39:53,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 127107072. Throughput: 0: 1666.9, 1: 1657.8. Samples: 31793026. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 07:39:53,513][99942] Avg episode reward: [(0, '0.600'), (1, '0.900')] +[2023-10-14 07:39:53,743][100917] Updated weights for policy 1, policy_version 62142 (0.0009) +[2023-10-14 07:39:54,672][100936] Updated weights for policy 0, policy_version 62020 (0.0009) +[2023-10-14 07:39:55,038][100936] Updated weights for policy 0, policy_version 62030 (0.0010) +[2023-10-14 07:39:55,418][100936] Updated weights for policy 0, policy_version 62040 (0.0008) +[2023-10-14 07:39:58,062][100917] Updated weights for policy 1, policy_version 62152 (0.0008) +[2023-10-14 07:39:58,446][100917] Updated weights for policy 1, policy_version 62162 (0.0007) +[2023-10-14 07:39:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 127172608. Throughput: 0: 1660.8, 1: 1662.1. Samples: 31802188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 07:39:58,512][99942] Avg episode reward: [(0, '0.600'), (1, '0.900')] +[2023-10-14 07:39:58,819][100917] Updated weights for policy 1, policy_version 62172 (0.0011) +[2023-10-14 07:39:59,566][100936] Updated weights for policy 0, policy_version 62050 (0.0008) +[2023-10-14 07:39:59,946][100936] Updated weights for policy 0, policy_version 62060 (0.0010) +[2023-10-14 07:40:00,311][100936] Updated weights for policy 0, policy_version 62070 (0.0010) +[2023-10-14 07:40:00,682][100936] Updated weights for policy 0, policy_version 62080 (0.0009) +[2023-10-14 07:40:02,760][100917] Updated weights for policy 1, policy_version 62182 (0.0009) +[2023-10-14 07:40:03,134][100917] Updated weights for policy 1, policy_version 62192 (0.0009) +[2023-10-14 07:40:03,512][100917] Updated weights for policy 1, policy_version 62202 (0.0007) +[2023-10-14 07:40:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 127238144. Throughput: 0: 1657.5, 1: 1659.4. Samples: 31822514. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 07:40:03,513][99942] Avg episode reward: [(0, '0.600'), (1, '0.900')] +[2023-10-14 07:40:04,912][100936] Updated weights for policy 0, policy_version 62090 (0.0009) +[2023-10-14 07:40:05,283][100936] Updated weights for policy 0, policy_version 62100 (0.0007) +[2023-10-14 07:40:05,646][100936] Updated weights for policy 0, policy_version 62110 (0.0011) +[2023-10-14 07:40:07,530][100917] Updated weights for policy 1, policy_version 62212 (0.0008) +[2023-10-14 07:40:07,901][100917] Updated weights for policy 1, policy_version 62222 (0.0010) +[2023-10-14 07:40:08,271][100917] Updated weights for policy 1, policy_version 62232 (0.0008) +[2023-10-14 07:40:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 127303680. Throughput: 0: 1653.0, 1: 1648.7. Samples: 31842574. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 07:40:08,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.900')] +[2023-10-14 07:40:09,881][100936] Updated weights for policy 0, policy_version 62120 (0.0009) +[2023-10-14 07:40:10,255][100936] Updated weights for policy 0, policy_version 62130 (0.0008) +[2023-10-14 07:40:10,622][100936] Updated weights for policy 0, policy_version 62140 (0.0009) +[2023-10-14 07:40:12,394][100917] Updated weights for policy 1, policy_version 62242 (0.0008) +[2023-10-14 07:40:12,764][100917] Updated weights for policy 1, policy_version 62252 (0.0008) +[2023-10-14 07:40:13,143][100917] Updated weights for policy 1, policy_version 62262 (0.0011) +[2023-10-14 07:40:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 127369216. Throughput: 0: 1650.5, 1: 1660.6. Samples: 31852014. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 07:40:13,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.900')] +[2023-10-14 07:40:13,523][100917] Updated weights for policy 1, policy_version 62272 (0.0008) +[2023-10-14 07:40:14,834][100936] Updated weights for policy 0, policy_version 62150 (0.0009) +[2023-10-14 07:40:15,199][100936] Updated weights for policy 0, policy_version 62160 (0.0011) +[2023-10-14 07:40:15,571][100936] Updated weights for policy 0, policy_version 62170 (0.0008) +[2023-10-14 07:40:17,746][100917] Updated weights for policy 1, policy_version 62282 (0.0010) +[2023-10-14 07:40:18,124][100917] Updated weights for policy 1, policy_version 62292 (0.0010) +[2023-10-14 07:40:18,495][100917] Updated weights for policy 1, policy_version 62302 (0.0010) +[2023-10-14 07:40:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 127434752. Throughput: 0: 1656.8, 1: 1660.9. Samples: 31872404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 07:40:18,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.900')] +[2023-10-14 07:40:19,737][100936] Updated weights for policy 0, policy_version 62180 (0.0007) +[2023-10-14 07:40:20,126][100936] Updated weights for policy 0, policy_version 62190 (0.0008) +[2023-10-14 07:40:20,501][100936] Updated weights for policy 0, policy_version 62200 (0.0010) +[2023-10-14 07:40:22,577][100917] Updated weights for policy 1, policy_version 62312 (0.0008) +[2023-10-14 07:40:22,941][100917] Updated weights for policy 1, policy_version 62322 (0.0007) +[2023-10-14 07:40:23,315][100917] Updated weights for policy 1, policy_version 62332 (0.0009) +[2023-10-14 07:40:23,512][99942] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 127533056. Throughput: 0: 1652.3, 1: 1653.0. Samples: 31892160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 07:40:23,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.900')] +[2023-10-14 07:40:24,676][100936] Updated weights for policy 0, policy_version 62210 (0.0007) +[2023-10-14 07:40:25,045][100936] Updated weights for policy 0, policy_version 62220 (0.0007) +[2023-10-14 07:40:25,425][100936] Updated weights for policy 0, policy_version 62230 (0.0008) +[2023-10-14 07:40:25,787][100936] Updated weights for policy 0, policy_version 62240 (0.0009) +[2023-10-14 07:40:27,454][100917] Updated weights for policy 1, policy_version 62342 (0.0009) +[2023-10-14 07:40:27,823][100917] Updated weights for policy 1, policy_version 62352 (0.0010) +[2023-10-14 07:40:28,200][100917] Updated weights for policy 1, policy_version 62362 (0.0008) +[2023-10-14 07:40:28,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 127598592. Throughput: 0: 1650.7, 1: 1666.6. Samples: 31901830. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:40:28,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.900')] +[2023-10-14 07:40:29,822][100936] Updated weights for policy 0, policy_version 62250 (0.0009) +[2023-10-14 07:40:30,190][100936] Updated weights for policy 0, policy_version 62260 (0.0008) +[2023-10-14 07:40:30,566][100936] Updated weights for policy 0, policy_version 62270 (0.0009) +[2023-10-14 07:40:32,429][100917] Updated weights for policy 1, policy_version 62372 (0.0009) +[2023-10-14 07:40:32,805][100917] Updated weights for policy 1, policy_version 62382 (0.0011) +[2023-10-14 07:40:33,188][100917] Updated weights for policy 1, policy_version 62392 (0.0010) +[2023-10-14 07:40:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 127664128. Throughput: 0: 1647.8, 1: 1664.8. Samples: 31922092. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:40:33,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.900')] +[2023-10-14 07:40:34,891][100936] Updated weights for policy 0, policy_version 62280 (0.0009) +[2023-10-14 07:40:35,257][100936] Updated weights for policy 0, policy_version 62290 (0.0011) +[2023-10-14 07:40:35,620][100936] Updated weights for policy 0, policy_version 62300 (0.0008) +[2023-10-14 07:40:37,292][100917] Updated weights for policy 1, policy_version 62402 (0.0009) +[2023-10-14 07:40:37,667][100917] Updated weights for policy 1, policy_version 62412 (0.0008) +[2023-10-14 07:40:38,042][100917] Updated weights for policy 1, policy_version 62422 (0.0009) +[2023-10-14 07:40:38,416][100917] Updated weights for policy 1, policy_version 62432 (0.0007) +[2023-10-14 07:40:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 127729664. Throughput: 0: 1650.9, 1: 1651.7. Samples: 31941642. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:40:38,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.900')] +[2023-10-14 07:40:39,620][100936] Updated weights for policy 0, policy_version 62310 (0.0009) +[2023-10-14 07:40:39,993][100936] Updated weights for policy 0, policy_version 62320 (0.0008) +[2023-10-14 07:40:40,365][100936] Updated weights for policy 0, policy_version 62330 (0.0008) +[2023-10-14 07:40:42,621][100917] Updated weights for policy 1, policy_version 62442 (0.0009) +[2023-10-14 07:40:42,992][100917] Updated weights for policy 1, policy_version 62452 (0.0008) +[2023-10-14 07:40:43,370][100917] Updated weights for policy 1, policy_version 62462 (0.0009) +[2023-10-14 07:40:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 127795200. Throughput: 0: 1649.7, 1: 1665.9. Samples: 31951388. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:40:43,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.900')] +[2023-10-14 07:40:44,614][100936] Updated weights for policy 0, policy_version 62340 (0.0009) +[2023-10-14 07:40:44,987][100936] Updated weights for policy 0, policy_version 62350 (0.0007) +[2023-10-14 07:40:45,357][100936] Updated weights for policy 0, policy_version 62360 (0.0007) +[2023-10-14 07:40:47,472][100917] Updated weights for policy 1, policy_version 62472 (0.0008) +[2023-10-14 07:40:47,852][100917] Updated weights for policy 1, policy_version 62482 (0.0010) +[2023-10-14 07:40:48,226][100917] Updated weights for policy 1, policy_version 62492 (0.0010) +[2023-10-14 07:40:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 127860736. Throughput: 0: 1647.4, 1: 1664.7. Samples: 31971562. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:40:48,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.900')] +[2023-10-14 07:40:49,374][100936] Updated weights for policy 0, policy_version 62370 (0.0010) +[2023-10-14 07:40:49,758][100936] Updated weights for policy 0, policy_version 62380 (0.0008) +[2023-10-14 07:40:50,117][100936] Updated weights for policy 0, policy_version 62390 (0.0009) +[2023-10-14 07:40:50,491][100936] Updated weights for policy 0, policy_version 62400 (0.0008) +[2023-10-14 07:40:52,291][100917] Updated weights for policy 1, policy_version 62502 (0.0010) +[2023-10-14 07:40:52,666][100917] Updated weights for policy 1, policy_version 62512 (0.0010) +[2023-10-14 07:40:53,044][100917] Updated weights for policy 1, policy_version 62522 (0.0011) +[2023-10-14 07:40:53,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 127926272. Throughput: 0: 1648.0, 1: 1653.7. Samples: 31991150. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:40:53,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.900')] +[2023-10-14 07:40:54,562][100936] Updated weights for policy 0, policy_version 62410 (0.0009) +[2023-10-14 07:40:54,918][100936] Updated weights for policy 0, policy_version 62420 (0.0008) +[2023-10-14 07:40:55,287][100936] Updated weights for policy 0, policy_version 62430 (0.0010) +[2023-10-14 07:40:56,956][100917] Updated weights for policy 1, policy_version 62532 (0.0008) +[2023-10-14 07:40:57,329][100917] Updated weights for policy 1, policy_version 62542 (0.0009) +[2023-10-14 07:40:57,702][100917] Updated weights for policy 1, policy_version 62552 (0.0010) +[2023-10-14 07:40:58,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 127991808. Throughput: 0: 1647.7, 1: 1661.7. Samples: 32000938. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:40:58,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.900')] +[2023-10-14 07:40:59,312][100936] Updated weights for policy 0, policy_version 62440 (0.0009) +[2023-10-14 07:40:59,679][100936] Updated weights for policy 0, policy_version 62450 (0.0008) +[2023-10-14 07:41:00,054][100936] Updated weights for policy 0, policy_version 62460 (0.0008) +[2023-10-14 07:41:01,695][100917] Updated weights for policy 1, policy_version 62562 (0.0011) +[2023-10-14 07:41:02,062][100917] Updated weights for policy 1, policy_version 62572 (0.0010) +[2023-10-14 07:41:02,445][100917] Updated weights for policy 1, policy_version 62582 (0.0011) +[2023-10-14 07:41:02,821][100917] Updated weights for policy 1, policy_version 62592 (0.0009) +[2023-10-14 07:41:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 128057344. Throughput: 0: 1648.3, 1: 1656.5. Samples: 32021120. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 07:41:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.900')] +[2023-10-14 07:41:04,300][100936] Updated weights for policy 0, policy_version 62470 (0.0008) +[2023-10-14 07:41:04,671][100936] Updated weights for policy 0, policy_version 62480 (0.0009) +[2023-10-14 07:41:05,032][100936] Updated weights for policy 0, policy_version 62490 (0.0007) +[2023-10-14 07:41:06,840][100917] Updated weights for policy 1, policy_version 62602 (0.0011) +[2023-10-14 07:41:07,213][100917] Updated weights for policy 1, policy_version 62612 (0.0010) +[2023-10-14 07:41:07,583][100917] Updated weights for policy 1, policy_version 62622 (0.0010) +[2023-10-14 07:41:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 128122880. Throughput: 0: 1648.6, 1: 1653.4. Samples: 32040752. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:41:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.900')] +[2023-10-14 07:41:09,269][100936] Updated weights for policy 0, policy_version 62500 (0.0007) +[2023-10-14 07:41:09,637][100936] Updated weights for policy 0, policy_version 62510 (0.0007) +[2023-10-14 07:41:10,010][100936] Updated weights for policy 0, policy_version 62520 (0.0009) +[2023-10-14 07:41:11,707][100917] Updated weights for policy 1, policy_version 62632 (0.0008) +[2023-10-14 07:41:12,081][100917] Updated weights for policy 1, policy_version 62642 (0.0008) +[2023-10-14 07:41:12,463][100917] Updated weights for policy 1, policy_version 62652 (0.0009) +[2023-10-14 07:41:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 128188416. Throughput: 0: 1646.5, 1: 1666.6. Samples: 32050920. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:41:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.900')] +[2023-10-14 07:41:14,019][100936] Updated weights for policy 0, policy_version 62530 (0.0009) +[2023-10-14 07:41:14,387][100936] Updated weights for policy 0, policy_version 62540 (0.0007) +[2023-10-14 07:41:14,767][100936] Updated weights for policy 0, policy_version 62550 (0.0010) +[2023-10-14 07:41:15,132][100936] Updated weights for policy 0, policy_version 62560 (0.0009) +[2023-10-14 07:41:16,580][100917] Updated weights for policy 1, policy_version 62662 (0.0009) +[2023-10-14 07:41:16,944][100917] Updated weights for policy 1, policy_version 62672 (0.0010) +[2023-10-14 07:41:17,323][100917] Updated weights for policy 1, policy_version 62682 (0.0011) +[2023-10-14 07:41:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 128253952. Throughput: 0: 1653.9, 1: 1651.5. Samples: 32070838. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:41:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:41:19,462][100936] Updated weights for policy 0, policy_version 62570 (0.0007) +[2023-10-14 07:41:19,845][100936] Updated weights for policy 0, policy_version 62580 (0.0008) +[2023-10-14 07:41:20,220][100936] Updated weights for policy 0, policy_version 62590 (0.0009) +[2023-10-14 07:41:21,429][100917] Updated weights for policy 1, policy_version 62692 (0.0009) +[2023-10-14 07:41:21,808][100917] Updated weights for policy 1, policy_version 62702 (0.0007) +[2023-10-14 07:41:22,177][100917] Updated weights for policy 1, policy_version 62712 (0.0010) +[2023-10-14 07:41:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 128319488. Throughput: 0: 1653.3, 1: 1661.1. Samples: 32090790. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:41:23,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:41:24,351][100936] Updated weights for policy 0, policy_version 62600 (0.0008) +[2023-10-14 07:41:24,712][100936] Updated weights for policy 0, policy_version 62610 (0.0009) +[2023-10-14 07:41:25,084][100936] Updated weights for policy 0, policy_version 62620 (0.0008) +[2023-10-14 07:41:26,191][100917] Updated weights for policy 1, policy_version 62722 (0.0007) +[2023-10-14 07:41:26,558][100917] Updated weights for policy 1, policy_version 62732 (0.0008) +[2023-10-14 07:41:26,936][100917] Updated weights for policy 1, policy_version 62742 (0.0010) +[2023-10-14 07:41:27,298][100917] Updated weights for policy 1, policy_version 62752 (0.0009) +[2023-10-14 07:41:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 128385024. Throughput: 0: 1652.5, 1: 1672.0. Samples: 32100992. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:41:28,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:41:29,482][100936] Updated weights for policy 0, policy_version 62630 (0.0007) +[2023-10-14 07:41:29,850][100936] Updated weights for policy 0, policy_version 62640 (0.0010) +[2023-10-14 07:41:30,228][100936] Updated weights for policy 0, policy_version 62650 (0.0009) +[2023-10-14 07:41:31,933][100917] Updated weights for policy 1, policy_version 62762 (0.0008) +[2023-10-14 07:41:32,309][100917] Updated weights for policy 1, policy_version 62772 (0.0009) +[2023-10-14 07:41:32,685][100917] Updated weights for policy 1, policy_version 62782 (0.0008) +[2023-10-14 07:41:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 128450560. Throughput: 0: 1641.7, 1: 1643.9. Samples: 32119416. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:41:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:41:34,591][100936] Updated weights for policy 0, policy_version 62660 (0.0011) +[2023-10-14 07:41:34,940][100936] Updated weights for policy 0, policy_version 62670 (0.0008) +[2023-10-14 07:41:35,313][100936] Updated weights for policy 0, policy_version 62680 (0.0008) +[2023-10-14 07:41:37,074][100917] Updated weights for policy 1, policy_version 62792 (0.0008) +[2023-10-14 07:41:37,451][100917] Updated weights for policy 1, policy_version 62802 (0.0009) +[2023-10-14 07:41:37,820][100917] Updated weights for policy 1, policy_version 62812 (0.0009) +[2023-10-14 07:41:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 128516096. Throughput: 0: 1633.4, 1: 1635.9. Samples: 32138270. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:41:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:41:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000062688_64192512.pth... +[2023-10-14 07:41:38,523][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000062816_64323584.pth... +[2023-10-14 07:41:38,555][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000061248_62717952.pth +[2023-10-14 07:41:38,562][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000061152_62619648.pth +[2023-10-14 07:41:39,614][100936] Updated weights for policy 0, policy_version 62690 (0.0010) +[2023-10-14 07:41:39,982][100936] Updated weights for policy 0, policy_version 62700 (0.0009) +[2023-10-14 07:41:40,348][100936] Updated weights for policy 0, policy_version 62710 (0.0009) +[2023-10-14 07:41:40,708][100936] Updated weights for policy 0, policy_version 62720 (0.0008) +[2023-10-14 07:41:42,505][100917] Updated weights for policy 1, policy_version 62822 (0.0008) +[2023-10-14 07:41:42,871][100917] Updated weights for policy 1, policy_version 62832 (0.0007) +[2023-10-14 07:41:43,238][100917] Updated weights for policy 1, policy_version 62842 (0.0008) +[2023-10-14 07:41:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 128581632. Throughput: 0: 1629.8, 1: 1630.1. Samples: 32147634. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:41:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:41:45,214][100936] Updated weights for policy 0, policy_version 62730 (0.0010) +[2023-10-14 07:41:45,581][100936] Updated weights for policy 0, policy_version 62740 (0.0009) +[2023-10-14 07:41:45,960][100936] Updated weights for policy 0, policy_version 62750 (0.0011) +[2023-10-14 07:41:47,912][100917] Updated weights for policy 1, policy_version 62852 (0.0011) +[2023-10-14 07:41:48,281][100917] Updated weights for policy 1, policy_version 62862 (0.0011) +[2023-10-14 07:41:48,512][99942] Fps is (10 sec: 9830.5, 60 sec: 12561.1, 300 sec: 13107.2). Total num frames: 128614400. Throughput: 0: 1602.9, 1: 1608.0. Samples: 32165608. Policy #0 lag: (min: 21.0, avg: 26.0, max: 53.0) +[2023-10-14 07:41:48,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 07:41:48,663][100917] Updated weights for policy 1, policy_version 62872 (0.0009) +[2023-10-14 07:41:50,633][100936] Updated weights for policy 0, policy_version 62760 (0.0009) +[2023-10-14 07:41:50,995][100936] Updated weights for policy 0, policy_version 62770 (0.0010) +[2023-10-14 07:41:51,363][100936] Updated weights for policy 0, policy_version 62780 (0.0010) +[2023-10-14 07:41:52,860][100917] Updated weights for policy 1, policy_version 62882 (0.0011) +[2023-10-14 07:41:53,230][100917] Updated weights for policy 1, policy_version 62892 (0.0009) +[2023-10-14 07:41:53,512][99942] Fps is (10 sec: 9830.2, 60 sec: 12561.1, 300 sec: 13107.2). Total num frames: 128679936. Throughput: 0: 1591.9, 1: 1606.8. Samples: 32184696. Policy #0 lag: (min: 21.0, avg: 26.0, max: 53.0) +[2023-10-14 07:41:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:41:53,611][100917] Updated weights for policy 1, policy_version 62902 (0.0007) +[2023-10-14 07:41:53,980][100917] Updated weights for policy 1, policy_version 62912 (0.0008) +[2023-10-14 07:41:55,761][100936] Updated weights for policy 0, policy_version 62790 (0.0009) +[2023-10-14 07:41:56,142][100936] Updated weights for policy 0, policy_version 62800 (0.0010) +[2023-10-14 07:41:56,519][100936] Updated weights for policy 0, policy_version 62810 (0.0007) +[2023-10-14 07:41:58,259][100917] Updated weights for policy 1, policy_version 62922 (0.0009) +[2023-10-14 07:41:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 12561.1, 300 sec: 13107.2). Total num frames: 128745472. Throughput: 0: 1594.2, 1: 1580.1. Samples: 32193766. Policy #0 lag: (min: 21.0, avg: 26.0, max: 53.0) +[2023-10-14 07:41:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:41:58,632][100917] Updated weights for policy 1, policy_version 62932 (0.0009) +[2023-10-14 07:41:59,006][100917] Updated weights for policy 1, policy_version 62942 (0.0008) +[2023-10-14 07:42:00,891][100936] Updated weights for policy 0, policy_version 62820 (0.0009) +[2023-10-14 07:42:01,252][100936] Updated weights for policy 0, policy_version 62830 (0.0009) +[2023-10-14 07:42:01,631][100936] Updated weights for policy 0, policy_version 62840 (0.0009) +[2023-10-14 07:42:03,494][100917] Updated weights for policy 1, policy_version 62952 (0.0009) +[2023-10-14 07:42:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 12561.1, 300 sec: 13107.2). Total num frames: 128811008. Throughput: 0: 1569.3, 1: 1581.2. Samples: 32212606. Policy #0 lag: (min: 21.0, avg: 26.0, max: 53.0) +[2023-10-14 07:42:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:42:03,867][100917] Updated weights for policy 1, policy_version 62962 (0.0010) +[2023-10-14 07:42:04,241][100917] Updated weights for policy 1, policy_version 62972 (0.0009) +[2023-10-14 07:42:06,136][100936] Updated weights for policy 0, policy_version 62850 (0.0009) +[2023-10-14 07:42:06,498][100936] Updated weights for policy 0, policy_version 62860 (0.0009) +[2023-10-14 07:42:06,871][100936] Updated weights for policy 0, policy_version 62870 (0.0008) +[2023-10-14 07:42:07,236][100936] Updated weights for policy 0, policy_version 62880 (0.0008) +[2023-10-14 07:42:08,423][100917] Updated weights for policy 1, policy_version 62982 (0.0009) +[2023-10-14 07:42:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 12561.1, 300 sec: 13107.2). Total num frames: 128876544. Throughput: 0: 1554.6, 1: 1584.1. Samples: 32232032. Policy #0 lag: (min: 21.0, avg: 26.0, max: 53.0) +[2023-10-14 07:42:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:42:08,799][100917] Updated weights for policy 1, policy_version 62992 (0.0009) +[2023-10-14 07:42:09,166][100917] Updated weights for policy 1, policy_version 63002 (0.0009) +[2023-10-14 07:42:11,300][100936] Updated weights for policy 0, policy_version 62890 (0.0009) +[2023-10-14 07:42:11,669][100936] Updated weights for policy 0, policy_version 62900 (0.0009) +[2023-10-14 07:42:12,041][100936] Updated weights for policy 0, policy_version 62910 (0.0008) +[2023-10-14 07:42:13,258][100917] Updated weights for policy 1, policy_version 63012 (0.0008) +[2023-10-14 07:42:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 12561.1, 300 sec: 13107.2). Total num frames: 128942080. Throughput: 0: 1574.5, 1: 1554.9. Samples: 32241814. Policy #0 lag: (min: 21.0, avg: 26.0, max: 53.0) +[2023-10-14 07:42:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:42:13,623][100917] Updated weights for policy 1, policy_version 63022 (0.0008) +[2023-10-14 07:42:13,995][100917] Updated weights for policy 1, policy_version 63032 (0.0008) +[2023-10-14 07:42:16,255][100936] Updated weights for policy 0, policy_version 62920 (0.0007) +[2023-10-14 07:42:16,621][100936] Updated weights for policy 0, policy_version 62930 (0.0008) +[2023-10-14 07:42:16,995][100936] Updated weights for policy 0, policy_version 62940 (0.0010) +[2023-10-14 07:42:17,893][100917] Updated weights for policy 1, policy_version 63042 (0.0008) +[2023-10-14 07:42:18,305][100917] Updated weights for policy 1, policy_version 63052 (0.0009) +[2023-10-14 07:42:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 12561.1, 300 sec: 13107.2). Total num frames: 129007616. Throughput: 0: 1569.9, 1: 1595.1. Samples: 32261842. Policy #0 lag: (min: 21.0, avg: 26.0, max: 53.0) +[2023-10-14 07:42:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:42:18,683][100917] Updated weights for policy 1, policy_version 63062 (0.0010) +[2023-10-14 07:42:19,060][100917] Updated weights for policy 1, policy_version 63072 (0.0007) +[2023-10-14 07:42:20,967][100936] Updated weights for policy 0, policy_version 62950 (0.0009) +[2023-10-14 07:42:21,333][100936] Updated weights for policy 0, policy_version 62960 (0.0010) +[2023-10-14 07:42:21,704][100936] Updated weights for policy 0, policy_version 62970 (0.0007) +[2023-10-14 07:42:23,214][100917] Updated weights for policy 1, policy_version 63082 (0.0010) +[2023-10-14 07:42:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 12561.0, 300 sec: 13107.2). Total num frames: 129073152. Throughput: 0: 1579.1, 1: 1619.8. Samples: 32282218. Policy #0 lag: (min: 21.0, avg: 26.0, max: 53.0) +[2023-10-14 07:42:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:42:23,582][100917] Updated weights for policy 1, policy_version 63092 (0.0008) +[2023-10-14 07:42:23,966][100917] Updated weights for policy 1, policy_version 63102 (0.0009) +[2023-10-14 07:42:25,723][100936] Updated weights for policy 0, policy_version 62980 (0.0007) +[2023-10-14 07:42:26,095][100936] Updated weights for policy 0, policy_version 62990 (0.0008) +[2023-10-14 07:42:26,469][100936] Updated weights for policy 0, policy_version 63000 (0.0009) +[2023-10-14 07:42:28,086][100917] Updated weights for policy 1, policy_version 63112 (0.0008) +[2023-10-14 07:42:28,461][100917] Updated weights for policy 1, policy_version 63122 (0.0010) +[2023-10-14 07:42:28,512][99942] Fps is (10 sec: 13107.5, 60 sec: 12561.1, 300 sec: 13107.2). Total num frames: 129138688. Throughput: 0: 1596.1, 1: 1606.6. Samples: 32291754. Policy #0 lag: (min: 31.0, avg: 32.7, max: 60.0) +[2023-10-14 07:42:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:42:28,838][100917] Updated weights for policy 1, policy_version 63132 (0.0007) +[2023-10-14 07:42:30,655][100936] Updated weights for policy 0, policy_version 63010 (0.0008) +[2023-10-14 07:42:31,023][100936] Updated weights for policy 0, policy_version 63020 (0.0007) +[2023-10-14 07:42:31,395][100936] Updated weights for policy 0, policy_version 63030 (0.0009) +[2023-10-14 07:42:31,755][100936] Updated weights for policy 0, policy_version 63040 (0.0008) +[2023-10-14 07:42:32,992][100917] Updated weights for policy 1, policy_version 63142 (0.0007) +[2023-10-14 07:42:33,370][100917] Updated weights for policy 1, policy_version 63152 (0.0009) +[2023-10-14 07:42:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 12561.0, 300 sec: 13107.2). Total num frames: 129204224. Throughput: 0: 1613.9, 1: 1633.0. Samples: 32311716. Policy #0 lag: (min: 31.0, avg: 32.7, max: 60.0) +[2023-10-14 07:42:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:42:33,755][100917] Updated weights for policy 1, policy_version 63162 (0.0010) +[2023-10-14 07:42:35,926][100936] Updated weights for policy 0, policy_version 63050 (0.0011) +[2023-10-14 07:42:36,292][100936] Updated weights for policy 0, policy_version 63060 (0.0010) +[2023-10-14 07:42:36,658][100936] Updated weights for policy 0, policy_version 63070 (0.0009) +[2023-10-14 07:42:37,806][100917] Updated weights for policy 1, policy_version 63172 (0.0008) +[2023-10-14 07:42:38,180][100917] Updated weights for policy 1, policy_version 63182 (0.0008) +[2023-10-14 07:42:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 12561.1, 300 sec: 13107.2). Total num frames: 129269760. Throughput: 0: 1625.4, 1: 1643.7. Samples: 32331806. Policy #0 lag: (min: 31.0, avg: 32.7, max: 60.0) +[2023-10-14 07:42:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:42:38,553][100917] Updated weights for policy 1, policy_version 63192 (0.0010) +[2023-10-14 07:42:40,857][100936] Updated weights for policy 0, policy_version 63080 (0.0008) +[2023-10-14 07:42:41,228][100936] Updated weights for policy 0, policy_version 63090 (0.0009) +[2023-10-14 07:42:41,599][100936] Updated weights for policy 0, policy_version 63100 (0.0010) +[2023-10-14 07:42:42,701][100917] Updated weights for policy 1, policy_version 63202 (0.0011) +[2023-10-14 07:42:43,069][100917] Updated weights for policy 1, policy_version 63212 (0.0009) +[2023-10-14 07:42:43,448][100917] Updated weights for policy 1, policy_version 63222 (0.0007) +[2023-10-14 07:42:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 12561.1, 300 sec: 13107.2). Total num frames: 129335296. Throughput: 0: 1634.0, 1: 1648.4. Samples: 32341472. Policy #0 lag: (min: 31.0, avg: 32.7, max: 60.0) +[2023-10-14 07:42:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:42:43,829][100917] Updated weights for policy 1, policy_version 63232 (0.0008) +[2023-10-14 07:42:45,829][100936] Updated weights for policy 0, policy_version 63110 (0.0009) +[2023-10-14 07:42:46,201][100936] Updated weights for policy 0, policy_version 63120 (0.0009) +[2023-10-14 07:42:46,580][100936] Updated weights for policy 0, policy_version 63130 (0.0008) +[2023-10-14 07:42:47,950][100917] Updated weights for policy 1, policy_version 63242 (0.0008) +[2023-10-14 07:42:48,322][100917] Updated weights for policy 1, policy_version 63252 (0.0009) +[2023-10-14 07:42:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 129400832. Throughput: 0: 1643.9, 1: 1661.1. Samples: 32361332. Policy #0 lag: (min: 31.0, avg: 32.7, max: 60.0) +[2023-10-14 07:42:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:42:48,689][100917] Updated weights for policy 1, policy_version 63262 (0.0010) +[2023-10-14 07:42:50,774][100936] Updated weights for policy 0, policy_version 63140 (0.0008) +[2023-10-14 07:42:51,147][100936] Updated weights for policy 0, policy_version 63150 (0.0009) +[2023-10-14 07:42:51,520][100936] Updated weights for policy 0, policy_version 63160 (0.0009) +[2023-10-14 07:42:52,751][100917] Updated weights for policy 1, policy_version 63272 (0.0007) +[2023-10-14 07:42:53,127][100917] Updated weights for policy 1, policy_version 63282 (0.0009) +[2023-10-14 07:42:53,500][100917] Updated weights for policy 1, policy_version 63292 (0.0009) +[2023-10-14 07:42:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 129466368. Throughput: 0: 1654.7, 1: 1662.6. Samples: 32381308. Policy #0 lag: (min: 31.0, avg: 32.7, max: 60.0) +[2023-10-14 07:42:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:42:55,636][100936] Updated weights for policy 0, policy_version 63170 (0.0009) +[2023-10-14 07:42:55,998][100936] Updated weights for policy 0, policy_version 63180 (0.0010) +[2023-10-14 07:42:56,364][100936] Updated weights for policy 0, policy_version 63190 (0.0011) +[2023-10-14 07:42:56,732][100936] Updated weights for policy 0, policy_version 63200 (0.0008) +[2023-10-14 07:42:57,748][100917] Updated weights for policy 1, policy_version 63302 (0.0008) +[2023-10-14 07:42:58,119][100917] Updated weights for policy 1, policy_version 63312 (0.0007) +[2023-10-14 07:42:58,504][100917] Updated weights for policy 1, policy_version 63322 (0.0008) +[2023-10-14 07:42:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 129531904. Throughput: 0: 1643.0, 1: 1672.3. Samples: 32391000. Policy #0 lag: (min: 31.0, avg: 32.7, max: 60.0) +[2023-10-14 07:42:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:43:00,866][100936] Updated weights for policy 0, policy_version 63210 (0.0008) +[2023-10-14 07:43:01,235][100936] Updated weights for policy 0, policy_version 63220 (0.0009) +[2023-10-14 07:43:01,590][100936] Updated weights for policy 0, policy_version 63230 (0.0007) +[2023-10-14 07:43:02,577][100917] Updated weights for policy 1, policy_version 63332 (0.0007) +[2023-10-14 07:43:02,947][100917] Updated weights for policy 1, policy_version 63342 (0.0010) +[2023-10-14 07:43:03,312][100917] Updated weights for policy 1, policy_version 63352 (0.0008) +[2023-10-14 07:43:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 129597440. Throughput: 0: 1655.5, 1: 1657.5. Samples: 32410928. Policy #0 lag: (min: 31.0, avg: 32.7, max: 60.0) +[2023-10-14 07:43:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:43:05,671][100936] Updated weights for policy 0, policy_version 63240 (0.0011) +[2023-10-14 07:43:06,046][100936] Updated weights for policy 0, policy_version 63250 (0.0007) +[2023-10-14 07:43:06,418][100936] Updated weights for policy 0, policy_version 63260 (0.0007) +[2023-10-14 07:43:07,434][100917] Updated weights for policy 1, policy_version 63362 (0.0009) +[2023-10-14 07:43:07,843][100917] Updated weights for policy 1, policy_version 63372 (0.0007) +[2023-10-14 07:43:08,212][100917] Updated weights for policy 1, policy_version 63382 (0.0008) +[2023-10-14 07:43:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 129662976. Throughput: 0: 1650.3, 1: 1651.2. Samples: 32430784. Policy #0 lag: (min: 31.0, avg: 32.7, max: 60.0) +[2023-10-14 07:43:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:43:08,588][100917] Updated weights for policy 1, policy_version 63392 (0.0009) +[2023-10-14 07:43:10,637][100936] Updated weights for policy 0, policy_version 63270 (0.0009) +[2023-10-14 07:43:11,011][100936] Updated weights for policy 0, policy_version 63280 (0.0007) +[2023-10-14 07:43:11,379][100936] Updated weights for policy 0, policy_version 63290 (0.0008) +[2023-10-14 07:43:12,664][100917] Updated weights for policy 1, policy_version 63402 (0.0008) +[2023-10-14 07:43:13,031][100917] Updated weights for policy 1, policy_version 63412 (0.0007) +[2023-10-14 07:43:13,403][100917] Updated weights for policy 1, policy_version 63422 (0.0007) +[2023-10-14 07:43:13,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 129761280. Throughput: 0: 1644.2, 1: 1662.5. Samples: 32440556. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:43:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:43:15,566][100936] Updated weights for policy 0, policy_version 63300 (0.0007) +[2023-10-14 07:43:15,939][100936] Updated weights for policy 0, policy_version 63310 (0.0007) +[2023-10-14 07:43:16,309][100936] Updated weights for policy 0, policy_version 63320 (0.0008) +[2023-10-14 07:43:17,446][100917] Updated weights for policy 1, policy_version 63432 (0.0008) +[2023-10-14 07:43:17,829][100917] Updated weights for policy 1, policy_version 63442 (0.0008) +[2023-10-14 07:43:18,205][100917] Updated weights for policy 1, policy_version 63452 (0.0009) +[2023-10-14 07:43:18,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 129826816. Throughput: 0: 1645.7, 1: 1664.8. Samples: 32460686. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:43:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:43:20,523][100936] Updated weights for policy 0, policy_version 63330 (0.0008) +[2023-10-14 07:43:20,890][100936] Updated weights for policy 0, policy_version 63340 (0.0009) +[2023-10-14 07:43:21,265][100936] Updated weights for policy 0, policy_version 63350 (0.0009) +[2023-10-14 07:43:21,639][100936] Updated weights for policy 0, policy_version 63360 (0.0009) +[2023-10-14 07:43:22,351][100917] Updated weights for policy 1, policy_version 63462 (0.0008) +[2023-10-14 07:43:22,722][100917] Updated weights for policy 1, policy_version 63472 (0.0010) +[2023-10-14 07:43:23,085][100917] Updated weights for policy 1, policy_version 63482 (0.0010) +[2023-10-14 07:43:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 129892352. Throughput: 0: 1647.1, 1: 1650.0. Samples: 32480178. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:43:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:43:25,716][100936] Updated weights for policy 0, policy_version 63370 (0.0007) +[2023-10-14 07:43:26,086][100936] Updated weights for policy 0, policy_version 63380 (0.0008) +[2023-10-14 07:43:26,464][100936] Updated weights for policy 0, policy_version 63390 (0.0008) +[2023-10-14 07:43:27,244][100917] Updated weights for policy 1, policy_version 63492 (0.0010) +[2023-10-14 07:43:27,620][100917] Updated weights for policy 1, policy_version 63502 (0.0010) +[2023-10-14 07:43:27,991][100917] Updated weights for policy 1, policy_version 63512 (0.0007) +[2023-10-14 07:43:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 129957888. Throughput: 0: 1642.9, 1: 1663.6. Samples: 32490266. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:43:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:43:30,486][100936] Updated weights for policy 0, policy_version 63400 (0.0008) +[2023-10-14 07:43:30,848][100936] Updated weights for policy 0, policy_version 63410 (0.0007) +[2023-10-14 07:43:31,219][100936] Updated weights for policy 0, policy_version 63420 (0.0008) +[2023-10-14 07:43:31,956][100917] Updated weights for policy 1, policy_version 63522 (0.0009) +[2023-10-14 07:43:32,329][100917] Updated weights for policy 1, policy_version 63532 (0.0009) +[2023-10-14 07:43:32,706][100917] Updated weights for policy 1, policy_version 63542 (0.0009) +[2023-10-14 07:43:33,082][100917] Updated weights for policy 1, policy_version 63552 (0.0010) +[2023-10-14 07:43:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 130023424. Throughput: 0: 1651.6, 1: 1665.4. Samples: 32510598. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:43:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:43:35,503][100936] Updated weights for policy 0, policy_version 63430 (0.0009) +[2023-10-14 07:43:35,883][100936] Updated weights for policy 0, policy_version 63440 (0.0012) +[2023-10-14 07:43:36,254][100936] Updated weights for policy 0, policy_version 63450 (0.0007) +[2023-10-14 07:43:37,185][100917] Updated weights for policy 1, policy_version 63562 (0.0008) +[2023-10-14 07:43:37,550][100917] Updated weights for policy 1, policy_version 63572 (0.0007) +[2023-10-14 07:43:37,927][100917] Updated weights for policy 1, policy_version 63582 (0.0008) +[2023-10-14 07:43:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 130088960. Throughput: 0: 1651.7, 1: 1647.1. Samples: 32529754. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:43:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:43:38,524][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000063584_65110016.pth... +[2023-10-14 07:43:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000063456_64978944.pth... +[2023-10-14 07:43:38,556][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000061920_63406080.pth +[2023-10-14 07:43:38,561][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000062048_63537152.pth +[2023-10-14 07:43:40,307][100936] Updated weights for policy 0, policy_version 63460 (0.0009) +[2023-10-14 07:43:40,680][100936] Updated weights for policy 0, policy_version 63470 (0.0008) +[2023-10-14 07:43:41,044][100936] Updated weights for policy 0, policy_version 63480 (0.0008) +[2023-10-14 07:43:41,979][100917] Updated weights for policy 1, policy_version 63592 (0.0009) +[2023-10-14 07:43:42,355][100917] Updated weights for policy 1, policy_version 63602 (0.0007) +[2023-10-14 07:43:42,729][100917] Updated weights for policy 1, policy_version 63612 (0.0010) +[2023-10-14 07:43:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 130154496. Throughput: 0: 1642.0, 1: 1667.2. Samples: 32539916. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:43:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:43:45,251][100936] Updated weights for policy 0, policy_version 63490 (0.0008) +[2023-10-14 07:43:45,616][100936] Updated weights for policy 0, policy_version 63500 (0.0007) +[2023-10-14 07:43:45,979][100936] Updated weights for policy 0, policy_version 63510 (0.0008) +[2023-10-14 07:43:46,350][100936] Updated weights for policy 0, policy_version 63520 (0.0010) +[2023-10-14 07:43:46,868][100917] Updated weights for policy 1, policy_version 63622 (0.0008) +[2023-10-14 07:43:47,243][100917] Updated weights for policy 1, policy_version 63632 (0.0008) +[2023-10-14 07:43:47,625][100917] Updated weights for policy 1, policy_version 63642 (0.0008) +[2023-10-14 07:43:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 130220032. Throughput: 0: 1646.6, 1: 1668.9. Samples: 32560126. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:43:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:43:50,521][100936] Updated weights for policy 0, policy_version 63530 (0.0010) +[2023-10-14 07:43:50,885][100936] Updated weights for policy 0, policy_version 63540 (0.0009) +[2023-10-14 07:43:51,257][100936] Updated weights for policy 0, policy_version 63550 (0.0008) +[2023-10-14 07:43:51,577][100917] Updated weights for policy 1, policy_version 63652 (0.0008) +[2023-10-14 07:43:51,939][100917] Updated weights for policy 1, policy_version 63662 (0.0007) +[2023-10-14 07:43:52,316][100917] Updated weights for policy 1, policy_version 63672 (0.0009) +[2023-10-14 07:43:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 130285568. Throughput: 0: 1650.1, 1: 1657.2. Samples: 32579614. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:43:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:43:55,288][100936] Updated weights for policy 0, policy_version 63560 (0.0008) +[2023-10-14 07:43:55,670][100936] Updated weights for policy 0, policy_version 63570 (0.0007) +[2023-10-14 07:43:56,040][100936] Updated weights for policy 0, policy_version 63580 (0.0009) +[2023-10-14 07:43:56,523][100917] Updated weights for policy 1, policy_version 63682 (0.0009) +[2023-10-14 07:43:56,931][100917] Updated weights for policy 1, policy_version 63692 (0.0010) +[2023-10-14 07:43:57,306][100917] Updated weights for policy 1, policy_version 63702 (0.0008) +[2023-10-14 07:43:57,677][100917] Updated weights for policy 1, policy_version 63712 (0.0010) +[2023-10-14 07:43:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 130351104. Throughput: 0: 1645.9, 1: 1674.7. Samples: 32589982. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-10-14 07:43:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:44:00,190][100936] Updated weights for policy 0, policy_version 63590 (0.0009) +[2023-10-14 07:44:00,557][100936] Updated weights for policy 0, policy_version 63600 (0.0010) +[2023-10-14 07:44:00,946][100936] Updated weights for policy 0, policy_version 63610 (0.0010) +[2023-10-14 07:44:01,832][100917] Updated weights for policy 1, policy_version 63722 (0.0008) +[2023-10-14 07:44:02,196][100917] Updated weights for policy 1, policy_version 63732 (0.0007) +[2023-10-14 07:44:02,577][100917] Updated weights for policy 1, policy_version 63742 (0.0007) +[2023-10-14 07:44:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 130416640. Throughput: 0: 1653.7, 1: 1655.1. Samples: 32609582. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-10-14 07:44:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:44:05,064][100936] Updated weights for policy 0, policy_version 63620 (0.0010) +[2023-10-14 07:44:05,433][100936] Updated weights for policy 0, policy_version 63630 (0.0007) +[2023-10-14 07:44:05,805][100936] Updated weights for policy 0, policy_version 63640 (0.0009) +[2023-10-14 07:44:06,789][100917] Updated weights for policy 1, policy_version 63752 (0.0007) +[2023-10-14 07:44:07,159][100917] Updated weights for policy 1, policy_version 63762 (0.0007) +[2023-10-14 07:44:07,533][100917] Updated weights for policy 1, policy_version 63772 (0.0007) +[2023-10-14 07:44:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 130482176. Throughput: 0: 1651.5, 1: 1658.3. Samples: 32629118. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-10-14 07:44:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:44:10,035][100936] Updated weights for policy 0, policy_version 63650 (0.0009) +[2023-10-14 07:44:10,408][100936] Updated weights for policy 0, policy_version 63660 (0.0007) +[2023-10-14 07:44:10,782][100936] Updated weights for policy 0, policy_version 63670 (0.0007) +[2023-10-14 07:44:11,155][100936] Updated weights for policy 0, policy_version 63680 (0.0008) +[2023-10-14 07:44:11,637][100917] Updated weights for policy 1, policy_version 63782 (0.0007) +[2023-10-14 07:44:12,017][100917] Updated weights for policy 1, policy_version 63792 (0.0008) +[2023-10-14 07:44:12,377][100917] Updated weights for policy 1, policy_version 63802 (0.0010) +[2023-10-14 07:44:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 130547712. Throughput: 0: 1644.7, 1: 1669.9. Samples: 32639424. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-10-14 07:44:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:44:15,361][100936] Updated weights for policy 0, policy_version 63690 (0.0009) +[2023-10-14 07:44:15,734][100936] Updated weights for policy 0, policy_version 63700 (0.0010) +[2023-10-14 07:44:16,114][100936] Updated weights for policy 0, policy_version 63710 (0.0009) +[2023-10-14 07:44:16,616][100917] Updated weights for policy 1, policy_version 63812 (0.0010) +[2023-10-14 07:44:16,979][100917] Updated weights for policy 1, policy_version 63822 (0.0010) +[2023-10-14 07:44:17,345][100917] Updated weights for policy 1, policy_version 63832 (0.0009) +[2023-10-14 07:44:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 130613248. Throughput: 0: 1648.5, 1: 1653.9. Samples: 32659206. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-10-14 07:44:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:44:20,219][100936] Updated weights for policy 0, policy_version 63720 (0.0009) +[2023-10-14 07:44:20,595][100936] Updated weights for policy 0, policy_version 63730 (0.0008) +[2023-10-14 07:44:20,963][100936] Updated weights for policy 0, policy_version 63740 (0.0008) +[2023-10-14 07:44:21,567][100917] Updated weights for policy 1, policy_version 63842 (0.0010) +[2023-10-14 07:44:21,936][100917] Updated weights for policy 1, policy_version 63852 (0.0009) +[2023-10-14 07:44:22,321][100917] Updated weights for policy 1, policy_version 63862 (0.0007) +[2023-10-14 07:44:22,698][100917] Updated weights for policy 1, policy_version 63872 (0.0010) +[2023-10-14 07:44:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 130678784. Throughput: 0: 1652.8, 1: 1655.7. Samples: 32678636. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-10-14 07:44:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:44:24,978][100936] Updated weights for policy 0, policy_version 63750 (0.0010) +[2023-10-14 07:44:25,340][100936] Updated weights for policy 0, policy_version 63760 (0.0009) +[2023-10-14 07:44:25,718][100936] Updated weights for policy 0, policy_version 63770 (0.0008) +[2023-10-14 07:44:26,819][100917] Updated weights for policy 1, policy_version 63882 (0.0010) +[2023-10-14 07:44:27,194][100917] Updated weights for policy 1, policy_version 63892 (0.0007) +[2023-10-14 07:44:27,563][100917] Updated weights for policy 1, policy_version 63902 (0.0007) +[2023-10-14 07:44:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 130744320. Throughput: 0: 1656.3, 1: 1657.9. Samples: 32689054. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-10-14 07:44:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:44:29,904][100936] Updated weights for policy 0, policy_version 63780 (0.0009) +[2023-10-14 07:44:30,281][100936] Updated weights for policy 0, policy_version 63790 (0.0007) +[2023-10-14 07:44:30,643][100936] Updated weights for policy 0, policy_version 63800 (0.0008) +[2023-10-14 07:44:31,774][100917] Updated weights for policy 1, policy_version 63912 (0.0007) +[2023-10-14 07:44:32,136][100917] Updated weights for policy 1, policy_version 63922 (0.0008) +[2023-10-14 07:44:32,503][100917] Updated weights for policy 1, policy_version 63932 (0.0009) +[2023-10-14 07:44:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 130809856. Throughput: 0: 1656.8, 1: 1646.7. Samples: 32708786. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-10-14 07:44:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:44:34,703][100936] Updated weights for policy 0, policy_version 63810 (0.0008) +[2023-10-14 07:44:35,069][100936] Updated weights for policy 0, policy_version 63820 (0.0007) +[2023-10-14 07:44:35,438][100936] Updated weights for policy 0, policy_version 63830 (0.0007) +[2023-10-14 07:44:35,808][100936] Updated weights for policy 0, policy_version 63840 (0.0007) +[2023-10-14 07:44:36,532][100917] Updated weights for policy 1, policy_version 63942 (0.0010) +[2023-10-14 07:44:36,893][100917] Updated weights for policy 1, policy_version 63952 (0.0009) +[2023-10-14 07:44:37,267][100917] Updated weights for policy 1, policy_version 63962 (0.0008) +[2023-10-14 07:44:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 130875392. Throughput: 0: 1662.5, 1: 1652.5. Samples: 32728790. Policy #0 lag: (min: 17.0, avg: 30.6, max: 49.0) +[2023-10-14 07:44:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:44:39,927][100936] Updated weights for policy 0, policy_version 63850 (0.0010) +[2023-10-14 07:44:40,290][100936] Updated weights for policy 0, policy_version 63860 (0.0009) +[2023-10-14 07:44:40,653][100936] Updated weights for policy 0, policy_version 63870 (0.0010) +[2023-10-14 07:44:41,495][100917] Updated weights for policy 1, policy_version 63972 (0.0009) +[2023-10-14 07:44:41,891][100917] Updated weights for policy 1, policy_version 63982 (0.0010) +[2023-10-14 07:44:42,258][100917] Updated weights for policy 1, policy_version 63992 (0.0009) +[2023-10-14 07:44:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 130940928. Throughput: 0: 1660.4, 1: 1650.8. Samples: 32738986. Policy #0 lag: (min: 17.0, avg: 30.6, max: 49.0) +[2023-10-14 07:44:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:44:44,781][100936] Updated weights for policy 0, policy_version 63880 (0.0010) +[2023-10-14 07:44:45,158][100936] Updated weights for policy 0, policy_version 63890 (0.0010) +[2023-10-14 07:44:45,524][100936] Updated weights for policy 0, policy_version 63900 (0.0008) +[2023-10-14 07:44:46,462][100917] Updated weights for policy 1, policy_version 64002 (0.0010) +[2023-10-14 07:44:46,844][100917] Updated weights for policy 1, policy_version 64012 (0.0008) +[2023-10-14 07:44:47,206][100917] Updated weights for policy 1, policy_version 64022 (0.0007) +[2023-10-14 07:44:47,581][100917] Updated weights for policy 1, policy_version 64032 (0.0008) +[2023-10-14 07:44:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 131006464. Throughput: 0: 1660.3, 1: 1653.3. Samples: 32758694. Policy #0 lag: (min: 17.0, avg: 30.6, max: 49.0) +[2023-10-14 07:44:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:44:49,730][100936] Updated weights for policy 0, policy_version 63910 (0.0009) +[2023-10-14 07:44:50,102][100936] Updated weights for policy 0, policy_version 63920 (0.0008) +[2023-10-14 07:44:50,470][100936] Updated weights for policy 0, policy_version 63930 (0.0009) +[2023-10-14 07:44:51,615][100917] Updated weights for policy 1, policy_version 64042 (0.0009) +[2023-10-14 07:44:51,993][100917] Updated weights for policy 1, policy_version 64052 (0.0008) +[2023-10-14 07:44:52,357][100917] Updated weights for policy 1, policy_version 64062 (0.0008) +[2023-10-14 07:44:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 131072000. Throughput: 0: 1658.1, 1: 1659.8. Samples: 32778424. Policy #0 lag: (min: 17.0, avg: 30.6, max: 49.0) +[2023-10-14 07:44:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:44:54,739][100936] Updated weights for policy 0, policy_version 63940 (0.0008) +[2023-10-14 07:44:55,105][100936] Updated weights for policy 0, policy_version 63950 (0.0007) +[2023-10-14 07:44:55,478][100936] Updated weights for policy 0, policy_version 63960 (0.0009) +[2023-10-14 07:44:56,353][100917] Updated weights for policy 1, policy_version 64072 (0.0011) +[2023-10-14 07:44:56,726][100917] Updated weights for policy 1, policy_version 64082 (0.0008) +[2023-10-14 07:44:57,106][100917] Updated weights for policy 1, policy_version 64092 (0.0008) +[2023-10-14 07:44:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 131137536. Throughput: 0: 1657.9, 1: 1658.9. Samples: 32788678. Policy #0 lag: (min: 17.0, avg: 30.6, max: 49.0) +[2023-10-14 07:44:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:44:59,666][100936] Updated weights for policy 0, policy_version 63970 (0.0007) +[2023-10-14 07:45:00,036][100936] Updated weights for policy 0, policy_version 63980 (0.0008) +[2023-10-14 07:45:00,405][100936] Updated weights for policy 0, policy_version 63990 (0.0009) +[2023-10-14 07:45:00,776][100936] Updated weights for policy 0, policy_version 64000 (0.0007) +[2023-10-14 07:45:01,117][100917] Updated weights for policy 1, policy_version 64102 (0.0009) +[2023-10-14 07:45:01,489][100917] Updated weights for policy 1, policy_version 64112 (0.0009) +[2023-10-14 07:45:01,852][100917] Updated weights for policy 1, policy_version 64122 (0.0010) +[2023-10-14 07:45:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 131203072. Throughput: 0: 1659.3, 1: 1648.3. Samples: 32808046. Policy #0 lag: (min: 17.0, avg: 30.6, max: 49.0) +[2023-10-14 07:45:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:45:04,878][100936] Updated weights for policy 0, policy_version 64010 (0.0007) +[2023-10-14 07:45:05,247][100936] Updated weights for policy 0, policy_version 64020 (0.0008) +[2023-10-14 07:45:05,619][100936] Updated weights for policy 0, policy_version 64030 (0.0010) +[2023-10-14 07:45:06,056][100917] Updated weights for policy 1, policy_version 64132 (0.0009) +[2023-10-14 07:45:06,423][100917] Updated weights for policy 1, policy_version 64142 (0.0007) +[2023-10-14 07:45:06,797][100917] Updated weights for policy 1, policy_version 64152 (0.0007) +[2023-10-14 07:45:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 131268608. Throughput: 0: 1651.0, 1: 1666.8. Samples: 32827936. Policy #0 lag: (min: 17.0, avg: 30.6, max: 49.0) +[2023-10-14 07:45:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:45:09,834][100936] Updated weights for policy 0, policy_version 64040 (0.0007) +[2023-10-14 07:45:10,211][100936] Updated weights for policy 0, policy_version 64050 (0.0007) +[2023-10-14 07:45:10,581][100936] Updated weights for policy 0, policy_version 64060 (0.0007) +[2023-10-14 07:45:10,843][100917] Updated weights for policy 1, policy_version 64162 (0.0007) +[2023-10-14 07:45:11,213][100917] Updated weights for policy 1, policy_version 64172 (0.0008) +[2023-10-14 07:45:11,577][100917] Updated weights for policy 1, policy_version 64182 (0.0009) +[2023-10-14 07:45:11,958][100917] Updated weights for policy 1, policy_version 64192 (0.0007) +[2023-10-14 07:45:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 131334144. Throughput: 0: 1652.8, 1: 1659.0. Samples: 32838088. Policy #0 lag: (min: 17.0, avg: 30.6, max: 49.0) +[2023-10-14 07:45:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:45:14,599][100936] Updated weights for policy 0, policy_version 64070 (0.0007) +[2023-10-14 07:45:14,969][100936] Updated weights for policy 0, policy_version 64080 (0.0009) +[2023-10-14 07:45:15,337][100936] Updated weights for policy 0, policy_version 64090 (0.0007) +[2023-10-14 07:45:16,200][100917] Updated weights for policy 1, policy_version 64202 (0.0007) +[2023-10-14 07:45:16,574][100917] Updated weights for policy 1, policy_version 64212 (0.0008) +[2023-10-14 07:45:16,952][100917] Updated weights for policy 1, policy_version 64222 (0.0009) +[2023-10-14 07:45:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 131399680. Throughput: 0: 1656.4, 1: 1651.4. Samples: 32857636. Policy #0 lag: (min: 17.0, avg: 30.6, max: 49.0) +[2023-10-14 07:45:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:45:19,529][100936] Updated weights for policy 0, policy_version 64100 (0.0008) +[2023-10-14 07:45:19,900][100936] Updated weights for policy 0, policy_version 64110 (0.0010) +[2023-10-14 07:45:20,265][100936] Updated weights for policy 0, policy_version 64120 (0.0010) +[2023-10-14 07:45:20,874][100917] Updated weights for policy 1, policy_version 64232 (0.0008) +[2023-10-14 07:45:21,247][100917] Updated weights for policy 1, policy_version 64242 (0.0009) +[2023-10-14 07:45:21,614][100917] Updated weights for policy 1, policy_version 64252 (0.0010) +[2023-10-14 07:45:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 131465216. Throughput: 0: 1650.0, 1: 1664.6. Samples: 32877948. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:45:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:45:24,354][100936] Updated weights for policy 0, policy_version 64130 (0.0010) +[2023-10-14 07:45:24,721][100936] Updated weights for policy 0, policy_version 64140 (0.0008) +[2023-10-14 07:45:25,093][100936] Updated weights for policy 0, policy_version 64150 (0.0007) +[2023-10-14 07:45:25,462][100936] Updated weights for policy 0, policy_version 64160 (0.0008) +[2023-10-14 07:45:25,738][100917] Updated weights for policy 1, policy_version 64262 (0.0008) +[2023-10-14 07:45:26,112][100917] Updated weights for policy 1, policy_version 64272 (0.0010) +[2023-10-14 07:45:26,484][100917] Updated weights for policy 1, policy_version 64282 (0.0008) +[2023-10-14 07:45:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 131530752. Throughput: 0: 1653.7, 1: 1651.1. Samples: 32887700. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:45:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:45:29,524][100936] Updated weights for policy 0, policy_version 64170 (0.0009) +[2023-10-14 07:45:29,886][100936] Updated weights for policy 0, policy_version 64180 (0.0007) +[2023-10-14 07:45:30,256][100936] Updated weights for policy 0, policy_version 64190 (0.0009) +[2023-10-14 07:45:30,621][100917] Updated weights for policy 1, policy_version 64292 (0.0007) +[2023-10-14 07:45:31,037][100917] Updated weights for policy 1, policy_version 64302 (0.0008) +[2023-10-14 07:45:31,401][100917] Updated weights for policy 1, policy_version 64312 (0.0009) +[2023-10-14 07:45:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 131596288. Throughput: 0: 1658.0, 1: 1643.8. Samples: 32907272. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:45:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:45:34,339][100936] Updated weights for policy 0, policy_version 64200 (0.0009) +[2023-10-14 07:45:34,708][100936] Updated weights for policy 0, policy_version 64210 (0.0008) +[2023-10-14 07:45:35,069][100936] Updated weights for policy 0, policy_version 64220 (0.0008) +[2023-10-14 07:45:35,726][100917] Updated weights for policy 1, policy_version 64322 (0.0008) +[2023-10-14 07:45:36,094][100917] Updated weights for policy 1, policy_version 64332 (0.0008) +[2023-10-14 07:45:36,454][100917] Updated weights for policy 1, policy_version 64342 (0.0008) +[2023-10-14 07:45:36,823][100917] Updated weights for policy 1, policy_version 64352 (0.0011) +[2023-10-14 07:45:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 131661824. Throughput: 0: 1656.2, 1: 1653.2. Samples: 32927346. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:45:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:45:38,523][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000064352_65896448.pth... +[2023-10-14 07:45:38,523][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000064224_65765376.pth... +[2023-10-14 07:45:38,559][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000062816_64323584.pth +[2023-10-14 07:45:38,563][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000062688_64192512.pth +[2023-10-14 07:45:39,282][100936] Updated weights for policy 0, policy_version 64230 (0.0010) +[2023-10-14 07:45:39,643][100936] Updated weights for policy 0, policy_version 64240 (0.0009) +[2023-10-14 07:45:40,024][100936] Updated weights for policy 0, policy_version 64250 (0.0011) +[2023-10-14 07:45:40,936][100917] Updated weights for policy 1, policy_version 64362 (0.0008) +[2023-10-14 07:45:41,303][100917] Updated weights for policy 1, policy_version 64372 (0.0011) +[2023-10-14 07:45:41,684][100917] Updated weights for policy 1, policy_version 64382 (0.0007) +[2023-10-14 07:45:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 131727360. Throughput: 0: 1655.7, 1: 1644.4. Samples: 32937180. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:45:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:45:44,273][100936] Updated weights for policy 0, policy_version 64260 (0.0007) +[2023-10-14 07:45:44,650][100936] Updated weights for policy 0, policy_version 64270 (0.0008) +[2023-10-14 07:45:45,024][100936] Updated weights for policy 0, policy_version 64280 (0.0007) +[2023-10-14 07:45:45,903][100917] Updated weights for policy 1, policy_version 64392 (0.0011) +[2023-10-14 07:45:46,282][100917] Updated weights for policy 1, policy_version 64402 (0.0009) +[2023-10-14 07:45:46,660][100917] Updated weights for policy 1, policy_version 64412 (0.0010) +[2023-10-14 07:45:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 131792896. Throughput: 0: 1655.3, 1: 1650.0. Samples: 32956782. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:45:48,512][99942] Avg episode reward: [(0, '0.820'), (1, '1.000')] +[2023-10-14 07:45:49,133][100936] Updated weights for policy 0, policy_version 64290 (0.0007) +[2023-10-14 07:45:49,504][100936] Updated weights for policy 0, policy_version 64300 (0.0009) +[2023-10-14 07:45:49,879][100936] Updated weights for policy 0, policy_version 64310 (0.0007) +[2023-10-14 07:45:50,253][100936] Updated weights for policy 0, policy_version 64320 (0.0008) +[2023-10-14 07:45:50,749][100917] Updated weights for policy 1, policy_version 64422 (0.0011) +[2023-10-14 07:45:51,120][100917] Updated weights for policy 1, policy_version 64432 (0.0011) +[2023-10-14 07:45:51,499][100917] Updated weights for policy 1, policy_version 64442 (0.0010) +[2023-10-14 07:45:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13107.2). Total num frames: 131858432. Throughput: 0: 1665.8, 1: 1656.7. Samples: 32977448. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:45:53,513][99942] Avg episode reward: [(0, '0.820'), (1, '1.000')] +[2023-10-14 07:45:54,397][100936] Updated weights for policy 0, policy_version 64330 (0.0009) +[2023-10-14 07:45:54,753][100936] Updated weights for policy 0, policy_version 64340 (0.0009) +[2023-10-14 07:45:55,130][100936] Updated weights for policy 0, policy_version 64350 (0.0009) +[2023-10-14 07:45:55,634][100917] Updated weights for policy 1, policy_version 64452 (0.0009) +[2023-10-14 07:45:56,006][100917] Updated weights for policy 1, policy_version 64462 (0.0009) +[2023-10-14 07:45:56,387][100917] Updated weights for policy 1, policy_version 64472 (0.0008) +[2023-10-14 07:45:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 131923968. Throughput: 0: 1662.3, 1: 1649.0. Samples: 32987096. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:45:58,512][99942] Avg episode reward: [(0, '0.820'), (1, '1.000')] +[2023-10-14 07:45:59,155][100936] Updated weights for policy 0, policy_version 64360 (0.0009) +[2023-10-14 07:45:59,522][100936] Updated weights for policy 0, policy_version 64370 (0.0011) +[2023-10-14 07:45:59,892][100936] Updated weights for policy 0, policy_version 64380 (0.0009) +[2023-10-14 07:46:00,472][100917] Updated weights for policy 1, policy_version 64482 (0.0011) +[2023-10-14 07:46:00,842][100917] Updated weights for policy 1, policy_version 64492 (0.0008) +[2023-10-14 07:46:01,224][100917] Updated weights for policy 1, policy_version 64502 (0.0009) +[2023-10-14 07:46:01,590][100917] Updated weights for policy 1, policy_version 64512 (0.0008) +[2023-10-14 07:46:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 131989504. Throughput: 0: 1660.5, 1: 1648.6. Samples: 33006548. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:46:03,513][99942] Avg episode reward: [(0, '0.820'), (1, '1.000')] +[2023-10-14 07:46:04,059][100936] Updated weights for policy 0, policy_version 64390 (0.0009) +[2023-10-14 07:46:04,427][100936] Updated weights for policy 0, policy_version 64400 (0.0008) +[2023-10-14 07:46:04,797][100936] Updated weights for policy 0, policy_version 64410 (0.0010) +[2023-10-14 07:46:05,714][100917] Updated weights for policy 1, policy_version 64522 (0.0011) +[2023-10-14 07:46:06,091][100917] Updated weights for policy 1, policy_version 64532 (0.0007) +[2023-10-14 07:46:06,459][100917] Updated weights for policy 1, policy_version 64542 (0.0007) +[2023-10-14 07:46:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 132055040. Throughput: 0: 1662.2, 1: 1652.5. Samples: 33027110. Policy #0 lag: (min: 12.0, avg: 23.2, max: 44.0) +[2023-10-14 07:46:08,513][99942] Avg episode reward: [(0, '0.640'), (1, '1.000')] +[2023-10-14 07:46:08,875][100936] Updated weights for policy 0, policy_version 64420 (0.0008) +[2023-10-14 07:46:09,248][100936] Updated weights for policy 0, policy_version 64430 (0.0007) +[2023-10-14 07:46:09,614][100936] Updated weights for policy 0, policy_version 64440 (0.0008) +[2023-10-14 07:46:10,516][100917] Updated weights for policy 1, policy_version 64552 (0.0008) +[2023-10-14 07:46:10,896][100917] Updated weights for policy 1, policy_version 64562 (0.0007) +[2023-10-14 07:46:11,257][100917] Updated weights for policy 1, policy_version 64572 (0.0009) +[2023-10-14 07:46:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 132120576. Throughput: 0: 1657.4, 1: 1651.2. Samples: 33036588. Policy #0 lag: (min: 12.0, avg: 23.2, max: 44.0) +[2023-10-14 07:46:13,513][99942] Avg episode reward: [(0, '0.640'), (1, '1.000')] +[2023-10-14 07:46:13,951][100936] Updated weights for policy 0, policy_version 64450 (0.0009) +[2023-10-14 07:46:14,315][100936] Updated weights for policy 0, policy_version 64460 (0.0007) +[2023-10-14 07:46:14,690][100936] Updated weights for policy 0, policy_version 64470 (0.0009) +[2023-10-14 07:46:15,070][100936] Updated weights for policy 0, policy_version 64480 (0.0009) +[2023-10-14 07:46:15,234][100917] Updated weights for policy 1, policy_version 64582 (0.0010) +[2023-10-14 07:46:15,612][100917] Updated weights for policy 1, policy_version 64592 (0.0010) +[2023-10-14 07:46:15,995][100917] Updated weights for policy 1, policy_version 64602 (0.0009) +[2023-10-14 07:46:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 132186112. Throughput: 0: 1648.2, 1: 1661.8. Samples: 33056220. Policy #0 lag: (min: 12.0, avg: 23.2, max: 44.0) +[2023-10-14 07:46:18,513][99942] Avg episode reward: [(0, '0.640'), (1, '1.000')] +[2023-10-14 07:46:19,159][100936] Updated weights for policy 0, policy_version 64490 (0.0009) +[2023-10-14 07:46:19,537][100936] Updated weights for policy 0, policy_version 64500 (0.0007) +[2023-10-14 07:46:19,905][100936] Updated weights for policy 0, policy_version 64510 (0.0011) +[2023-10-14 07:46:20,265][100917] Updated weights for policy 1, policy_version 64612 (0.0007) +[2023-10-14 07:46:20,648][100917] Updated weights for policy 1, policy_version 64622 (0.0009) +[2023-10-14 07:46:21,031][100917] Updated weights for policy 1, policy_version 64632 (0.0009) +[2023-10-14 07:46:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 132251648. Throughput: 0: 1649.4, 1: 1664.3. Samples: 33076460. Policy #0 lag: (min: 12.0, avg: 23.2, max: 44.0) +[2023-10-14 07:46:23,513][99942] Avg episode reward: [(0, '0.640'), (1, '1.000')] +[2023-10-14 07:46:24,328][100936] Updated weights for policy 0, policy_version 64520 (0.0008) +[2023-10-14 07:46:24,698][100936] Updated weights for policy 0, policy_version 64530 (0.0007) +[2023-10-14 07:46:24,939][100917] Updated weights for policy 1, policy_version 64642 (0.0008) +[2023-10-14 07:46:25,066][100936] Updated weights for policy 0, policy_version 64540 (0.0008) +[2023-10-14 07:46:25,312][100917] Updated weights for policy 1, policy_version 64652 (0.0009) +[2023-10-14 07:46:25,690][100917] Updated weights for policy 1, policy_version 64662 (0.0008) +[2023-10-14 07:46:26,073][100917] Updated weights for policy 1, policy_version 64672 (0.0009) +[2023-10-14 07:46:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 132317184. Throughput: 0: 1651.3, 1: 1644.0. Samples: 33085470. Policy #0 lag: (min: 12.0, avg: 23.2, max: 44.0) +[2023-10-14 07:46:28,512][99942] Avg episode reward: [(0, '0.640'), (1, '1.000')] +[2023-10-14 07:46:29,051][100936] Updated weights for policy 0, policy_version 64550 (0.0008) +[2023-10-14 07:46:29,411][100936] Updated weights for policy 0, policy_version 64560 (0.0009) +[2023-10-14 07:46:29,788][100936] Updated weights for policy 0, policy_version 64570 (0.0011) +[2023-10-14 07:46:30,149][100917] Updated weights for policy 1, policy_version 64682 (0.0010) +[2023-10-14 07:46:30,521][100917] Updated weights for policy 1, policy_version 64692 (0.0010) +[2023-10-14 07:46:30,907][100917] Updated weights for policy 1, policy_version 64702 (0.0010) +[2023-10-14 07:46:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 132382720. Throughput: 0: 1644.0, 1: 1661.6. Samples: 33105530. Policy #0 lag: (min: 12.0, avg: 23.2, max: 44.0) +[2023-10-14 07:46:33,512][99942] Avg episode reward: [(0, '0.640'), (1, '1.000')] +[2023-10-14 07:46:34,106][100936] Updated weights for policy 0, policy_version 64580 (0.0008) +[2023-10-14 07:46:34,468][100936] Updated weights for policy 0, policy_version 64590 (0.0008) +[2023-10-14 07:46:34,839][100936] Updated weights for policy 0, policy_version 64600 (0.0010) +[2023-10-14 07:46:35,229][100917] Updated weights for policy 1, policy_version 64712 (0.0008) +[2023-10-14 07:46:35,613][100917] Updated weights for policy 1, policy_version 64722 (0.0007) +[2023-10-14 07:46:35,978][100917] Updated weights for policy 1, policy_version 64732 (0.0009) +[2023-10-14 07:46:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 132448256. Throughput: 0: 1640.9, 1: 1658.0. Samples: 33125900. Policy #0 lag: (min: 12.0, avg: 23.2, max: 44.0) +[2023-10-14 07:46:38,513][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:46:39,081][100936] Updated weights for policy 0, policy_version 64610 (0.0009) +[2023-10-14 07:46:39,496][100936] Updated weights for policy 0, policy_version 64620 (0.0009) +[2023-10-14 07:46:39,865][100936] Updated weights for policy 0, policy_version 64630 (0.0007) +[2023-10-14 07:46:40,059][100917] Updated weights for policy 1, policy_version 64742 (0.0009) +[2023-10-14 07:46:40,228][100936] Updated weights for policy 0, policy_version 64640 (0.0007) +[2023-10-14 07:46:40,437][100917] Updated weights for policy 1, policy_version 64752 (0.0009) +[2023-10-14 07:46:40,807][100917] Updated weights for policy 1, policy_version 64762 (0.0010) +[2023-10-14 07:46:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 132513792. Throughput: 0: 1642.2, 1: 1646.6. Samples: 33135094. Policy #0 lag: (min: 12.0, avg: 23.2, max: 44.0) +[2023-10-14 07:46:43,513][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:46:44,237][100936] Updated weights for policy 0, policy_version 64650 (0.0009) +[2023-10-14 07:46:44,603][100936] Updated weights for policy 0, policy_version 64660 (0.0008) +[2023-10-14 07:46:44,959][100917] Updated weights for policy 1, policy_version 64772 (0.0008) +[2023-10-14 07:46:44,970][100936] Updated weights for policy 0, policy_version 64670 (0.0010) +[2023-10-14 07:46:45,326][100917] Updated weights for policy 1, policy_version 64782 (0.0007) +[2023-10-14 07:46:45,713][100917] Updated weights for policy 1, policy_version 64792 (0.0010) +[2023-10-14 07:46:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 132579328. Throughput: 0: 1642.1, 1: 1662.8. Samples: 33155268. Policy #0 lag: (min: 4.0, avg: 11.7, max: 36.0) +[2023-10-14 07:46:48,513][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:46:49,099][100936] Updated weights for policy 0, policy_version 64680 (0.0010) +[2023-10-14 07:46:49,464][100936] Updated weights for policy 0, policy_version 64690 (0.0011) +[2023-10-14 07:46:49,633][100917] Updated weights for policy 1, policy_version 64802 (0.0009) +[2023-10-14 07:46:49,841][100936] Updated weights for policy 0, policy_version 64700 (0.0008) +[2023-10-14 07:46:50,010][100917] Updated weights for policy 1, policy_version 64812 (0.0007) +[2023-10-14 07:46:50,384][100917] Updated weights for policy 1, policy_version 64822 (0.0009) +[2023-10-14 07:46:50,760][100917] Updated weights for policy 1, policy_version 64832 (0.0009) +[2023-10-14 07:46:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 132644864. Throughput: 0: 1640.2, 1: 1663.6. Samples: 33175778. Policy #0 lag: (min: 4.0, avg: 11.7, max: 36.0) +[2023-10-14 07:46:53,513][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:46:54,037][100936] Updated weights for policy 0, policy_version 64710 (0.0007) +[2023-10-14 07:46:54,413][100936] Updated weights for policy 0, policy_version 64720 (0.0007) +[2023-10-14 07:46:54,770][100936] Updated weights for policy 0, policy_version 64730 (0.0007) +[2023-10-14 07:46:54,835][100917] Updated weights for policy 1, policy_version 64842 (0.0009) +[2023-10-14 07:46:55,204][100917] Updated weights for policy 1, policy_version 64852 (0.0010) +[2023-10-14 07:46:55,577][100917] Updated weights for policy 1, policy_version 64862 (0.0008) +[2023-10-14 07:46:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 132710400. Throughput: 0: 1643.7, 1: 1648.8. Samples: 33184754. Policy #0 lag: (min: 4.0, avg: 11.7, max: 36.0) +[2023-10-14 07:46:58,513][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:46:58,891][100936] Updated weights for policy 0, policy_version 64740 (0.0009) +[2023-10-14 07:46:59,263][100936] Updated weights for policy 0, policy_version 64750 (0.0007) +[2023-10-14 07:46:59,626][100936] Updated weights for policy 0, policy_version 64760 (0.0008) +[2023-10-14 07:46:59,776][100917] Updated weights for policy 1, policy_version 64872 (0.0009) +[2023-10-14 07:47:00,144][100917] Updated weights for policy 1, policy_version 64882 (0.0009) +[2023-10-14 07:47:00,515][100917] Updated weights for policy 1, policy_version 64892 (0.0008) +[2023-10-14 07:47:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 132775936. Throughput: 0: 1650.7, 1: 1660.8. Samples: 33205240. Policy #0 lag: (min: 4.0, avg: 11.7, max: 36.0) +[2023-10-14 07:47:03,513][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:47:03,599][100936] Updated weights for policy 0, policy_version 64770 (0.0008) +[2023-10-14 07:47:03,977][100936] Updated weights for policy 0, policy_version 64780 (0.0008) +[2023-10-14 07:47:04,343][100936] Updated weights for policy 0, policy_version 64790 (0.0011) +[2023-10-14 07:47:04,696][100917] Updated weights for policy 1, policy_version 64902 (0.0009) +[2023-10-14 07:47:04,709][100936] Updated weights for policy 0, policy_version 64800 (0.0010) +[2023-10-14 07:47:05,074][100917] Updated weights for policy 1, policy_version 64912 (0.0007) +[2023-10-14 07:47:05,441][100917] Updated weights for policy 1, policy_version 64922 (0.0007) +[2023-10-14 07:47:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 132841472. Throughput: 0: 1657.5, 1: 1659.9. Samples: 33225742. Policy #0 lag: (min: 4.0, avg: 11.7, max: 36.0) +[2023-10-14 07:47:08,512][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:47:08,834][100936] Updated weights for policy 0, policy_version 64810 (0.0009) +[2023-10-14 07:47:09,202][100936] Updated weights for policy 0, policy_version 64820 (0.0008) +[2023-10-14 07:47:09,534][100917] Updated weights for policy 1, policy_version 64932 (0.0007) +[2023-10-14 07:47:09,573][100936] Updated weights for policy 0, policy_version 64830 (0.0007) +[2023-10-14 07:47:09,931][100917] Updated weights for policy 1, policy_version 64942 (0.0008) +[2023-10-14 07:47:10,313][100917] Updated weights for policy 1, policy_version 64952 (0.0010) +[2023-10-14 07:47:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 132907008. Throughput: 0: 1659.1, 1: 1658.4. Samples: 33234762. Policy #0 lag: (min: 4.0, avg: 11.7, max: 36.0) +[2023-10-14 07:47:13,513][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:47:13,629][100936] Updated weights for policy 0, policy_version 64840 (0.0008) +[2023-10-14 07:47:14,000][100936] Updated weights for policy 0, policy_version 64850 (0.0007) +[2023-10-14 07:47:14,374][100936] Updated weights for policy 0, policy_version 64860 (0.0008) +[2023-10-14 07:47:14,480][100917] Updated weights for policy 1, policy_version 64962 (0.0010) +[2023-10-14 07:47:14,855][100917] Updated weights for policy 1, policy_version 64972 (0.0010) +[2023-10-14 07:47:15,217][100917] Updated weights for policy 1, policy_version 64982 (0.0010) +[2023-10-14 07:47:15,600][100917] Updated weights for policy 1, policy_version 64992 (0.0007) +[2023-10-14 07:47:18,441][100936] Updated weights for policy 0, policy_version 64870 (0.0007) +[2023-10-14 07:47:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 132972544. Throughput: 0: 1668.8, 1: 1656.2. Samples: 33255156. Policy #0 lag: (min: 4.0, avg: 11.7, max: 36.0) +[2023-10-14 07:47:18,512][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:47:18,814][100936] Updated weights for policy 0, policy_version 64880 (0.0007) +[2023-10-14 07:47:19,175][100936] Updated weights for policy 0, policy_version 64890 (0.0009) +[2023-10-14 07:47:19,723][100917] Updated weights for policy 1, policy_version 65002 (0.0011) +[2023-10-14 07:47:20,098][100917] Updated weights for policy 1, policy_version 65012 (0.0008) +[2023-10-14 07:47:20,479][100917] Updated weights for policy 1, policy_version 65022 (0.0008) +[2023-10-14 07:47:23,436][100936] Updated weights for policy 0, policy_version 64900 (0.0009) +[2023-10-14 07:47:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 133038080. Throughput: 0: 1660.0, 1: 1661.8. Samples: 33275382. Policy #0 lag: (min: 4.0, avg: 11.7, max: 36.0) +[2023-10-14 07:47:23,513][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:47:23,806][100936] Updated weights for policy 0, policy_version 64910 (0.0010) +[2023-10-14 07:47:24,168][100936] Updated weights for policy 0, policy_version 64920 (0.0008) +[2023-10-14 07:47:24,608][100917] Updated weights for policy 1, policy_version 65032 (0.0008) +[2023-10-14 07:47:24,982][100917] Updated weights for policy 1, policy_version 65042 (0.0010) +[2023-10-14 07:47:25,346][100917] Updated weights for policy 1, policy_version 65052 (0.0010) +[2023-10-14 07:47:28,432][100936] Updated weights for policy 0, policy_version 64930 (0.0008) +[2023-10-14 07:47:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 133103616. Throughput: 0: 1666.3, 1: 1654.0. Samples: 33284508. Policy #0 lag: (min: 4.0, avg: 11.7, max: 36.0) +[2023-10-14 07:47:28,513][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:47:28,842][100936] Updated weights for policy 0, policy_version 64940 (0.0010) +[2023-10-14 07:47:29,216][100936] Updated weights for policy 0, policy_version 64950 (0.0011) +[2023-10-14 07:47:29,358][100917] Updated weights for policy 1, policy_version 65062 (0.0009) +[2023-10-14 07:47:29,581][100936] Updated weights for policy 0, policy_version 64960 (0.0008) +[2023-10-14 07:47:29,728][100917] Updated weights for policy 1, policy_version 65072 (0.0009) +[2023-10-14 07:47:30,100][100917] Updated weights for policy 1, policy_version 65082 (0.0008) +[2023-10-14 07:47:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 133169152. Throughput: 0: 1658.6, 1: 1666.6. Samples: 33304900. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) +[2023-10-14 07:47:33,513][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:47:33,770][100936] Updated weights for policy 0, policy_version 64970 (0.0009) +[2023-10-14 07:47:34,139][100936] Updated weights for policy 0, policy_version 64980 (0.0009) +[2023-10-14 07:47:34,205][100917] Updated weights for policy 1, policy_version 65092 (0.0009) +[2023-10-14 07:47:34,504][100936] Updated weights for policy 0, policy_version 64990 (0.0008) +[2023-10-14 07:47:34,582][100917] Updated weights for policy 1, policy_version 65102 (0.0008) +[2023-10-14 07:47:34,947][100917] Updated weights for policy 1, policy_version 65112 (0.0011) +[2023-10-14 07:47:38,444][100936] Updated weights for policy 0, policy_version 65000 (0.0008) +[2023-10-14 07:47:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 133234688. Throughput: 0: 1654.1, 1: 1664.0. Samples: 33325092. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) +[2023-10-14 07:47:38,512][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:47:38,521][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000065120_66682880.pth... +[2023-10-14 07:47:38,559][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000063584_65110016.pth +[2023-10-14 07:47:38,815][100936] Updated weights for policy 0, policy_version 65010 (0.0009) +[2023-10-14 07:47:38,990][100917] Updated weights for policy 1, policy_version 65122 (0.0009) +[2023-10-14 07:47:39,178][100936] Updated weights for policy 0, policy_version 65020 (0.0007) +[2023-10-14 07:47:39,327][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000065024_66584576.pth... +[2023-10-14 07:47:39,361][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000063456_64978944.pth +[2023-10-14 07:47:39,365][100917] Updated weights for policy 1, policy_version 65132 (0.0007) +[2023-10-14 07:47:39,738][100917] Updated weights for policy 1, policy_version 65142 (0.0009) +[2023-10-14 07:47:40,112][100917] Updated weights for policy 1, policy_version 65152 (0.0007) +[2023-10-14 07:47:43,472][100936] Updated weights for policy 0, policy_version 65030 (0.0010) +[2023-10-14 07:47:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 133300224. Throughput: 0: 1664.2, 1: 1663.9. Samples: 33334520. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) +[2023-10-14 07:47:43,512][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:47:43,832][100936] Updated weights for policy 0, policy_version 65040 (0.0010) +[2023-10-14 07:47:44,206][100936] Updated weights for policy 0, policy_version 65050 (0.0008) +[2023-10-14 07:47:44,236][100917] Updated weights for policy 1, policy_version 65162 (0.0009) +[2023-10-14 07:47:44,611][100917] Updated weights for policy 1, policy_version 65172 (0.0010) +[2023-10-14 07:47:44,987][100917] Updated weights for policy 1, policy_version 65182 (0.0008) +[2023-10-14 07:47:48,491][100936] Updated weights for policy 0, policy_version 65060 (0.0007) +[2023-10-14 07:47:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 133365760. Throughput: 0: 1656.5, 1: 1663.8. Samples: 33354652. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) +[2023-10-14 07:47:48,512][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:47:48,853][100936] Updated weights for policy 0, policy_version 65070 (0.0009) +[2023-10-14 07:47:49,203][100917] Updated weights for policy 1, policy_version 65192 (0.0008) +[2023-10-14 07:47:49,220][100936] Updated weights for policy 0, policy_version 65080 (0.0009) +[2023-10-14 07:47:49,587][100917] Updated weights for policy 1, policy_version 65202 (0.0007) +[2023-10-14 07:47:49,958][100917] Updated weights for policy 1, policy_version 65212 (0.0010) +[2023-10-14 07:47:53,334][100936] Updated weights for policy 0, policy_version 65090 (0.0009) +[2023-10-14 07:47:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 133431296. Throughput: 0: 1650.4, 1: 1662.7. Samples: 33374834. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) +[2023-10-14 07:47:53,513][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:47:53,706][100936] Updated weights for policy 0, policy_version 65100 (0.0008) +[2023-10-14 07:47:54,077][100936] Updated weights for policy 0, policy_version 65110 (0.0009) +[2023-10-14 07:47:54,170][100917] Updated weights for policy 1, policy_version 65222 (0.0007) +[2023-10-14 07:47:54,449][100936] Updated weights for policy 0, policy_version 65120 (0.0007) +[2023-10-14 07:47:54,557][100917] Updated weights for policy 1, policy_version 65232 (0.0008) +[2023-10-14 07:47:54,920][100917] Updated weights for policy 1, policy_version 65242 (0.0008) +[2023-10-14 07:47:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 133496832. Throughput: 0: 1652.9, 1: 1658.8. Samples: 33383790. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) +[2023-10-14 07:47:58,512][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:47:58,550][100936] Updated weights for policy 0, policy_version 65130 (0.0011) +[2023-10-14 07:47:58,929][100936] Updated weights for policy 0, policy_version 65140 (0.0010) +[2023-10-14 07:47:59,092][100917] Updated weights for policy 1, policy_version 65252 (0.0008) +[2023-10-14 07:47:59,295][100936] Updated weights for policy 0, policy_version 65150 (0.0008) +[2023-10-14 07:47:59,476][100917] Updated weights for policy 1, policy_version 65262 (0.0008) +[2023-10-14 07:47:59,857][100917] Updated weights for policy 1, policy_version 65272 (0.0008) +[2023-10-14 07:48:03,506][100936] Updated weights for policy 0, policy_version 65160 (0.0008) +[2023-10-14 07:48:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 133562368. Throughput: 0: 1646.2, 1: 1664.2. Samples: 33404126. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) +[2023-10-14 07:48:03,512][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:48:03,845][100917] Updated weights for policy 1, policy_version 65282 (0.0008) +[2023-10-14 07:48:03,878][100936] Updated weights for policy 0, policy_version 65170 (0.0008) +[2023-10-14 07:48:04,211][100917] Updated weights for policy 1, policy_version 65292 (0.0007) +[2023-10-14 07:48:04,244][100936] Updated weights for policy 0, policy_version 65180 (0.0009) +[2023-10-14 07:48:04,575][100917] Updated weights for policy 1, policy_version 65302 (0.0008) +[2023-10-14 07:48:04,946][100917] Updated weights for policy 1, policy_version 65312 (0.0008) +[2023-10-14 07:48:08,326][100936] Updated weights for policy 0, policy_version 65190 (0.0008) +[2023-10-14 07:48:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 133627904. Throughput: 0: 1648.5, 1: 1660.9. Samples: 33424306. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) +[2023-10-14 07:48:08,512][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:48:08,687][100936] Updated weights for policy 0, policy_version 65200 (0.0007) +[2023-10-14 07:48:09,062][100936] Updated weights for policy 0, policy_version 65210 (0.0008) +[2023-10-14 07:48:09,084][100917] Updated weights for policy 1, policy_version 65322 (0.0009) +[2023-10-14 07:48:09,452][100917] Updated weights for policy 1, policy_version 65332 (0.0008) +[2023-10-14 07:48:09,821][100917] Updated weights for policy 1, policy_version 65342 (0.0007) +[2023-10-14 07:48:13,060][100936] Updated weights for policy 0, policy_version 65220 (0.0009) +[2023-10-14 07:48:13,428][100936] Updated weights for policy 0, policy_version 65230 (0.0007) +[2023-10-14 07:48:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 133693440. Throughput: 0: 1650.0, 1: 1662.0. Samples: 33433550. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) +[2023-10-14 07:48:13,513][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:48:13,790][100936] Updated weights for policy 0, policy_version 65240 (0.0007) +[2023-10-14 07:48:13,979][100917] Updated weights for policy 1, policy_version 65352 (0.0009) +[2023-10-14 07:48:14,354][100917] Updated weights for policy 1, policy_version 65362 (0.0007) +[2023-10-14 07:48:14,728][100917] Updated weights for policy 1, policy_version 65372 (0.0007) +[2023-10-14 07:48:17,974][100936] Updated weights for policy 0, policy_version 65250 (0.0009) +[2023-10-14 07:48:18,389][100936] Updated weights for policy 0, policy_version 65260 (0.0007) +[2023-10-14 07:48:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 133758976. Throughput: 0: 1660.1, 1: 1661.6. Samples: 33454372. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:48:18,512][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:48:18,681][100917] Updated weights for policy 1, policy_version 65382 (0.0007) +[2023-10-14 07:48:18,760][100936] Updated weights for policy 0, policy_version 65270 (0.0008) +[2023-10-14 07:48:19,053][100917] Updated weights for policy 1, policy_version 65392 (0.0009) +[2023-10-14 07:48:19,130][100936] Updated weights for policy 0, policy_version 65280 (0.0007) +[2023-10-14 07:48:19,429][100917] Updated weights for policy 1, policy_version 65402 (0.0009) +[2023-10-14 07:48:23,305][100936] Updated weights for policy 0, policy_version 65290 (0.0007) +[2023-10-14 07:48:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 133824512. Throughput: 0: 1648.8, 1: 1665.2. Samples: 33474224. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:48:23,512][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:48:23,601][100917] Updated weights for policy 1, policy_version 65412 (0.0009) +[2023-10-14 07:48:23,685][100936] Updated weights for policy 0, policy_version 65300 (0.0007) +[2023-10-14 07:48:23,977][100917] Updated weights for policy 1, policy_version 65422 (0.0007) +[2023-10-14 07:48:24,046][100936] Updated weights for policy 0, policy_version 65310 (0.0008) +[2023-10-14 07:48:24,339][100917] Updated weights for policy 1, policy_version 65432 (0.0010) +[2023-10-14 07:48:28,102][100936] Updated weights for policy 0, policy_version 65320 (0.0008) +[2023-10-14 07:48:28,472][100936] Updated weights for policy 0, policy_version 65330 (0.0007) +[2023-10-14 07:48:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 133890048. Throughput: 0: 1647.9, 1: 1665.2. Samples: 33483612. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:48:28,512][99942] Avg episode reward: [(0, '0.530'), (1, '1.000')] +[2023-10-14 07:48:28,522][100917] Updated weights for policy 1, policy_version 65442 (0.0008) +[2023-10-14 07:48:28,845][100936] Updated weights for policy 0, policy_version 65340 (0.0008) +[2023-10-14 07:48:28,893][100917] Updated weights for policy 1, policy_version 65452 (0.0009) +[2023-10-14 07:48:29,265][100917] Updated weights for policy 1, policy_version 65462 (0.0008) +[2023-10-14 07:48:29,639][100917] Updated weights for policy 1, policy_version 65472 (0.0009) +[2023-10-14 07:48:32,850][100936] Updated weights for policy 0, policy_version 65350 (0.0008) +[2023-10-14 07:48:33,223][100936] Updated weights for policy 0, policy_version 65360 (0.0010) +[2023-10-14 07:48:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 133955584. Throughput: 0: 1658.5, 1: 1663.1. Samples: 33504124. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:48:33,512][99942] Avg episode reward: [(0, '0.530'), (1, '0.880')] +[2023-10-14 07:48:33,584][100936] Updated weights for policy 0, policy_version 65370 (0.0010) +[2023-10-14 07:48:33,644][100917] Updated weights for policy 1, policy_version 65482 (0.0009) +[2023-10-14 07:48:34,012][100917] Updated weights for policy 1, policy_version 65492 (0.0009) +[2023-10-14 07:48:34,397][100917] Updated weights for policy 1, policy_version 65502 (0.0009) +[2023-10-14 07:48:37,741][100936] Updated weights for policy 0, policy_version 65380 (0.0008) +[2023-10-14 07:48:38,122][100936] Updated weights for policy 0, policy_version 65390 (0.0010) +[2023-10-14 07:48:38,473][100917] Updated weights for policy 1, policy_version 65512 (0.0007) +[2023-10-14 07:48:38,481][100936] Updated weights for policy 0, policy_version 65400 (0.0008) +[2023-10-14 07:48:38,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13107.1, 300 sec: 13107.2). Total num frames: 134021120. Throughput: 0: 1641.1, 1: 1664.7. Samples: 33523596. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:48:38,514][99942] Avg episode reward: [(0, '0.710'), (1, '0.880')] +[2023-10-14 07:48:38,847][100917] Updated weights for policy 1, policy_version 65522 (0.0007) +[2023-10-14 07:48:39,219][100917] Updated weights for policy 1, policy_version 65532 (0.0008) +[2023-10-14 07:48:42,780][100936] Updated weights for policy 0, policy_version 65410 (0.0007) +[2023-10-14 07:48:43,149][100936] Updated weights for policy 0, policy_version 65420 (0.0009) +[2023-10-14 07:48:43,383][100917] Updated weights for policy 1, policy_version 65542 (0.0009) +[2023-10-14 07:48:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 134086656. Throughput: 0: 1654.4, 1: 1669.5. Samples: 33533366. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:48:43,513][99942] Avg episode reward: [(0, '0.710'), (1, '0.880')] +[2023-10-14 07:48:43,524][100936] Updated weights for policy 0, policy_version 65430 (0.0009) +[2023-10-14 07:48:43,774][100917] Updated weights for policy 1, policy_version 65552 (0.0007) +[2023-10-14 07:48:43,892][100936] Updated weights for policy 0, policy_version 65440 (0.0009) +[2023-10-14 07:48:44,147][100917] Updated weights for policy 1, policy_version 65562 (0.0009) +[2023-10-14 07:48:48,134][100936] Updated weights for policy 0, policy_version 65450 (0.0007) +[2023-10-14 07:48:48,252][100917] Updated weights for policy 1, policy_version 65572 (0.0007) +[2023-10-14 07:48:48,506][100936] Updated weights for policy 0, policy_version 65460 (0.0008) +[2023-10-14 07:48:48,512][99942] Fps is (10 sec: 13107.7, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 134152192. Throughput: 0: 1652.0, 1: 1661.0. Samples: 33553208. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:48:48,512][99942] Avg episode reward: [(0, '0.890'), (1, '0.880')] +[2023-10-14 07:48:48,620][100917] Updated weights for policy 1, policy_version 65582 (0.0008) +[2023-10-14 07:48:48,876][100936] Updated weights for policy 0, policy_version 65470 (0.0008) +[2023-10-14 07:48:48,999][100917] Updated weights for policy 1, policy_version 65592 (0.0009) +[2023-10-14 07:48:52,966][100936] Updated weights for policy 0, policy_version 65480 (0.0007) +[2023-10-14 07:48:53,332][100917] Updated weights for policy 1, policy_version 65602 (0.0007) +[2023-10-14 07:48:53,339][100936] Updated weights for policy 0, policy_version 65490 (0.0007) +[2023-10-14 07:48:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 134217728. Throughput: 0: 1644.0, 1: 1657.9. Samples: 33572894. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:48:53,512][99942] Avg episode reward: [(0, '0.890'), (1, '0.880')] +[2023-10-14 07:48:53,705][100917] Updated weights for policy 1, policy_version 65612 (0.0007) +[2023-10-14 07:48:53,709][100936] Updated weights for policy 0, policy_version 65500 (0.0008) +[2023-10-14 07:48:54,070][100917] Updated weights for policy 1, policy_version 65622 (0.0009) +[2023-10-14 07:48:54,446][100917] Updated weights for policy 1, policy_version 65632 (0.0008) +[2023-10-14 07:48:57,755][100936] Updated weights for policy 0, policy_version 65510 (0.0009) +[2023-10-14 07:48:58,115][100936] Updated weights for policy 0, policy_version 65520 (0.0010) +[2023-10-14 07:48:58,484][100936] Updated weights for policy 0, policy_version 65530 (0.0011) +[2023-10-14 07:48:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 134283264. Throughput: 0: 1656.6, 1: 1655.0. Samples: 33582574. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:48:58,513][99942] Avg episode reward: [(0, '0.890'), (1, '0.880')] +[2023-10-14 07:48:58,538][100917] Updated weights for policy 1, policy_version 65642 (0.0009) +[2023-10-14 07:48:58,904][100917] Updated weights for policy 1, policy_version 65652 (0.0010) +[2023-10-14 07:48:59,274][100917] Updated weights for policy 1, policy_version 65662 (0.0008) +[2023-10-14 07:49:02,683][100936] Updated weights for policy 0, policy_version 65540 (0.0008) +[2023-10-14 07:49:03,064][100936] Updated weights for policy 0, policy_version 65550 (0.0009) +[2023-10-14 07:49:03,438][100936] Updated weights for policy 0, policy_version 65560 (0.0009) +[2023-10-14 07:49:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 134348800. Throughput: 0: 1653.4, 1: 1649.5. Samples: 33603000. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-10-14 07:49:03,513][99942] Avg episode reward: [(0, '0.890'), (1, '0.880')] +[2023-10-14 07:49:03,522][100917] Updated weights for policy 1, policy_version 65672 (0.0008) +[2023-10-14 07:49:03,886][100917] Updated weights for policy 1, policy_version 65682 (0.0010) +[2023-10-14 07:49:04,258][100917] Updated weights for policy 1, policy_version 65692 (0.0011) +[2023-10-14 07:49:07,549][100936] Updated weights for policy 0, policy_version 65570 (0.0009) +[2023-10-14 07:49:07,919][100936] Updated weights for policy 0, policy_version 65580 (0.0007) +[2023-10-14 07:49:08,287][100936] Updated weights for policy 0, policy_version 65590 (0.0007) +[2023-10-14 07:49:08,500][100917] Updated weights for policy 1, policy_version 65702 (0.0007) +[2023-10-14 07:49:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 134414336. Throughput: 0: 1643.4, 1: 1642.7. Samples: 33622100. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-10-14 07:49:08,512][99942] Avg episode reward: [(0, '0.890'), (1, '0.880')] +[2023-10-14 07:49:08,657][100936] Updated weights for policy 0, policy_version 65600 (0.0008) +[2023-10-14 07:49:08,871][100917] Updated weights for policy 1, policy_version 65712 (0.0008) +[2023-10-14 07:49:09,237][100917] Updated weights for policy 1, policy_version 65722 (0.0009) +[2023-10-14 07:49:12,732][100936] Updated weights for policy 0, policy_version 65610 (0.0010) +[2023-10-14 07:49:13,109][100936] Updated weights for policy 0, policy_version 65620 (0.0010) +[2023-10-14 07:49:13,453][100917] Updated weights for policy 1, policy_version 65732 (0.0009) +[2023-10-14 07:49:13,480][100936] Updated weights for policy 0, policy_version 65630 (0.0009) +[2023-10-14 07:49:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 134479872. Throughput: 0: 1659.9, 1: 1641.6. Samples: 33632180. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-10-14 07:49:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 07:49:13,822][100917] Updated weights for policy 1, policy_version 65742 (0.0007) +[2023-10-14 07:49:14,202][100917] Updated weights for policy 1, policy_version 65752 (0.0007) +[2023-10-14 07:49:17,635][100936] Updated weights for policy 0, policy_version 65640 (0.0009) +[2023-10-14 07:49:18,000][100936] Updated weights for policy 0, policy_version 65650 (0.0008) +[2023-10-14 07:49:18,342][100917] Updated weights for policy 1, policy_version 65762 (0.0008) +[2023-10-14 07:49:18,369][100936] Updated weights for policy 0, policy_version 65660 (0.0009) +[2023-10-14 07:49:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 134545408. Throughput: 0: 1653.4, 1: 1638.1. Samples: 33652244. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-10-14 07:49:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 07:49:18,721][100917] Updated weights for policy 1, policy_version 65772 (0.0008) +[2023-10-14 07:49:19,094][100917] Updated weights for policy 1, policy_version 65782 (0.0008) +[2023-10-14 07:49:19,472][100917] Updated weights for policy 1, policy_version 65792 (0.0009) +[2023-10-14 07:49:22,587][100936] Updated weights for policy 0, policy_version 65670 (0.0007) +[2023-10-14 07:49:22,953][100936] Updated weights for policy 0, policy_version 65680 (0.0007) +[2023-10-14 07:49:23,316][100936] Updated weights for policy 0, policy_version 65690 (0.0007) +[2023-10-14 07:49:23,503][100917] Updated weights for policy 1, policy_version 65802 (0.0009) +[2023-10-14 07:49:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 134610944. Throughput: 0: 1653.9, 1: 1638.9. Samples: 33671770. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-10-14 07:49:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 07:49:23,887][100917] Updated weights for policy 1, policy_version 65812 (0.0009) +[2023-10-14 07:49:24,255][100917] Updated weights for policy 1, policy_version 65822 (0.0011) +[2023-10-14 07:49:27,612][100936] Updated weights for policy 0, policy_version 65700 (0.0008) +[2023-10-14 07:49:27,984][100936] Updated weights for policy 0, policy_version 65710 (0.0007) +[2023-10-14 07:49:28,356][100936] Updated weights for policy 0, policy_version 65720 (0.0007) +[2023-10-14 07:49:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 134676480. Throughput: 0: 1654.6, 1: 1636.5. Samples: 33681466. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-10-14 07:49:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 07:49:28,567][100917] Updated weights for policy 1, policy_version 65832 (0.0008) +[2023-10-14 07:49:28,940][100917] Updated weights for policy 1, policy_version 65842 (0.0007) +[2023-10-14 07:49:29,318][100917] Updated weights for policy 1, policy_version 65852 (0.0007) +[2023-10-14 07:49:32,425][100936] Updated weights for policy 0, policy_version 65730 (0.0009) +[2023-10-14 07:49:32,800][100936] Updated weights for policy 0, policy_version 65740 (0.0007) +[2023-10-14 07:49:33,170][100936] Updated weights for policy 0, policy_version 65750 (0.0007) +[2023-10-14 07:49:33,473][100917] Updated weights for policy 1, policy_version 65862 (0.0008) +[2023-10-14 07:49:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 134742016. Throughput: 0: 1656.4, 1: 1645.0. Samples: 33701768. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-10-14 07:49:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 07:49:33,535][100936] Updated weights for policy 0, policy_version 65760 (0.0008) +[2023-10-14 07:49:33,852][100917] Updated weights for policy 1, policy_version 65872 (0.0010) +[2023-10-14 07:49:34,224][100917] Updated weights for policy 1, policy_version 65882 (0.0007) +[2023-10-14 07:49:37,713][100936] Updated weights for policy 0, policy_version 65770 (0.0010) +[2023-10-14 07:49:38,090][100936] Updated weights for policy 0, policy_version 65780 (0.0008) +[2023-10-14 07:49:38,269][100917] Updated weights for policy 1, policy_version 65892 (0.0008) +[2023-10-14 07:49:38,462][100936] Updated weights for policy 0, policy_version 65790 (0.0007) +[2023-10-14 07:49:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 134807552. Throughput: 0: 1646.7, 1: 1646.1. Samples: 33721068. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-10-14 07:49:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 07:49:38,531][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000065792_67371008.pth... +[2023-10-14 07:49:38,559][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000064224_65765376.pth +[2023-10-14 07:49:38,646][100917] Updated weights for policy 1, policy_version 65902 (0.0011) +[2023-10-14 07:49:39,028][100917] Updated weights for policy 1, policy_version 65912 (0.0009) +[2023-10-14 07:49:39,320][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000065920_67502080.pth... +[2023-10-14 07:49:39,348][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000064352_65896448.pth +[2023-10-14 07:49:42,484][100936] Updated weights for policy 0, policy_version 65800 (0.0009) +[2023-10-14 07:49:42,843][100936] Updated weights for policy 0, policy_version 65810 (0.0008) +[2023-10-14 07:49:42,993][100917] Updated weights for policy 1, policy_version 65922 (0.0009) +[2023-10-14 07:49:43,207][100936] Updated weights for policy 0, policy_version 65820 (0.0008) +[2023-10-14 07:49:43,378][100917] Updated weights for policy 1, policy_version 65932 (0.0010) +[2023-10-14 07:49:43,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 134905856. Throughput: 0: 1650.6, 1: 1648.6. Samples: 33731036. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 07:49:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 07:49:43,746][100917] Updated weights for policy 1, policy_version 65942 (0.0008) +[2023-10-14 07:49:44,112][100917] Updated weights for policy 1, policy_version 65952 (0.0008) +[2023-10-14 07:49:47,287][100936] Updated weights for policy 0, policy_version 65830 (0.0009) +[2023-10-14 07:49:47,645][100936] Updated weights for policy 0, policy_version 65840 (0.0007) +[2023-10-14 07:49:48,022][100936] Updated weights for policy 0, policy_version 65850 (0.0008) +[2023-10-14 07:49:48,227][100917] Updated weights for policy 1, policy_version 65962 (0.0009) +[2023-10-14 07:49:48,512][99942] Fps is (10 sec: 16384.5, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 134971392. Throughput: 0: 1644.6, 1: 1649.1. Samples: 33751218. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 07:49:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:49:48,606][100917] Updated weights for policy 1, policy_version 65972 (0.0009) +[2023-10-14 07:49:48,988][100917] Updated weights for policy 1, policy_version 65982 (0.0007) +[2023-10-14 07:49:52,235][100936] Updated weights for policy 0, policy_version 65860 (0.0008) +[2023-10-14 07:49:52,623][100936] Updated weights for policy 0, policy_version 65870 (0.0007) +[2023-10-14 07:49:53,005][100936] Updated weights for policy 0, policy_version 65880 (0.0007) +[2023-10-14 07:49:53,110][100917] Updated weights for policy 1, policy_version 65992 (0.0007) +[2023-10-14 07:49:53,478][100917] Updated weights for policy 1, policy_version 66002 (0.0007) +[2023-10-14 07:49:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 135036928. Throughput: 0: 1647.3, 1: 1648.1. Samples: 33770396. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 07:49:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:49:53,855][100917] Updated weights for policy 1, policy_version 66012 (0.0007) +[2023-10-14 07:49:57,198][100936] Updated weights for policy 0, policy_version 65890 (0.0008) +[2023-10-14 07:49:57,571][100936] Updated weights for policy 0, policy_version 65900 (0.0010) +[2023-10-14 07:49:57,940][100936] Updated weights for policy 0, policy_version 65910 (0.0007) +[2023-10-14 07:49:58,184][100917] Updated weights for policy 1, policy_version 66022 (0.0007) +[2023-10-14 07:49:58,303][100936] Updated weights for policy 0, policy_version 65920 (0.0008) +[2023-10-14 07:49:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 135102464. Throughput: 0: 1651.2, 1: 1650.0. Samples: 33780738. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 07:49:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:49:58,554][100917] Updated weights for policy 1, policy_version 66032 (0.0007) +[2023-10-14 07:49:58,926][100917] Updated weights for policy 1, policy_version 66042 (0.0007) +[2023-10-14 07:50:02,412][100936] Updated weights for policy 0, policy_version 65930 (0.0011) +[2023-10-14 07:50:02,786][100936] Updated weights for policy 0, policy_version 65940 (0.0009) +[2023-10-14 07:50:03,128][100917] Updated weights for policy 1, policy_version 66052 (0.0011) +[2023-10-14 07:50:03,162][100936] Updated weights for policy 0, policy_version 65950 (0.0008) +[2023-10-14 07:50:03,502][100917] Updated weights for policy 1, policy_version 66062 (0.0009) +[2023-10-14 07:50:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 135168000. Throughput: 0: 1647.7, 1: 1653.9. Samples: 33800820. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 07:50:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:50:03,877][100917] Updated weights for policy 1, policy_version 66072 (0.0009) +[2023-10-14 07:50:07,228][100936] Updated weights for policy 0, policy_version 65960 (0.0008) +[2023-10-14 07:50:07,601][100936] Updated weights for policy 0, policy_version 65970 (0.0009) +[2023-10-14 07:50:07,966][100936] Updated weights for policy 0, policy_version 65980 (0.0008) +[2023-10-14 07:50:08,135][100917] Updated weights for policy 1, policy_version 66082 (0.0009) +[2023-10-14 07:50:08,507][100917] Updated weights for policy 1, policy_version 66092 (0.0008) +[2023-10-14 07:50:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 135233536. Throughput: 0: 1646.2, 1: 1651.6. Samples: 33820172. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 07:50:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:50:08,885][100917] Updated weights for policy 1, policy_version 66102 (0.0009) +[2023-10-14 07:50:09,264][100917] Updated weights for policy 1, policy_version 66112 (0.0009) +[2023-10-14 07:50:12,058][100936] Updated weights for policy 0, policy_version 65990 (0.0008) +[2023-10-14 07:50:12,429][100936] Updated weights for policy 0, policy_version 66000 (0.0008) +[2023-10-14 07:50:12,799][100936] Updated weights for policy 0, policy_version 66010 (0.0007) +[2023-10-14 07:50:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 135299072. Throughput: 0: 1660.4, 1: 1649.5. Samples: 33830414. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 07:50:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:50:13,683][100917] Updated weights for policy 1, policy_version 66122 (0.0007) +[2023-10-14 07:50:14,057][100917] Updated weights for policy 1, policy_version 66132 (0.0009) +[2023-10-14 07:50:14,424][100917] Updated weights for policy 1, policy_version 66142 (0.0010) +[2023-10-14 07:50:16,970][100936] Updated weights for policy 0, policy_version 66020 (0.0007) +[2023-10-14 07:50:17,330][100936] Updated weights for policy 0, policy_version 66030 (0.0008) +[2023-10-14 07:50:17,714][100936] Updated weights for policy 0, policy_version 66040 (0.0009) +[2023-10-14 07:50:18,402][100917] Updated weights for policy 1, policy_version 66152 (0.0008) +[2023-10-14 07:50:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 135364608. Throughput: 0: 1649.2, 1: 1644.8. Samples: 33849994. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 07:50:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:50:18,780][100917] Updated weights for policy 1, policy_version 66162 (0.0008) +[2023-10-14 07:50:19,146][100917] Updated weights for policy 1, policy_version 66172 (0.0009) +[2023-10-14 07:50:21,818][100936] Updated weights for policy 0, policy_version 66050 (0.0009) +[2023-10-14 07:50:22,196][100936] Updated weights for policy 0, policy_version 66060 (0.0009) +[2023-10-14 07:50:22,556][100936] Updated weights for policy 0, policy_version 66070 (0.0007) +[2023-10-14 07:50:22,930][100936] Updated weights for policy 0, policy_version 66080 (0.0007) +[2023-10-14 07:50:23,188][100917] Updated weights for policy 1, policy_version 66182 (0.0008) +[2023-10-14 07:50:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 135430144. Throughput: 0: 1660.9, 1: 1651.2. Samples: 33870108. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 07:50:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:50:23,556][100917] Updated weights for policy 1, policy_version 66192 (0.0008) +[2023-10-14 07:50:23,936][100917] Updated weights for policy 1, policy_version 66202 (0.0010) +[2023-10-14 07:50:26,826][100936] Updated weights for policy 0, policy_version 66090 (0.0009) +[2023-10-14 07:50:27,195][100936] Updated weights for policy 0, policy_version 66100 (0.0008) +[2023-10-14 07:50:27,564][100936] Updated weights for policy 0, policy_version 66110 (0.0008) +[2023-10-14 07:50:28,091][100917] Updated weights for policy 1, policy_version 66212 (0.0009) +[2023-10-14 07:50:28,470][100917] Updated weights for policy 1, policy_version 66222 (0.0009) +[2023-10-14 07:50:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 135495680. Throughput: 0: 1663.8, 1: 1652.5. Samples: 33880268. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:50:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:50:28,849][100917] Updated weights for policy 1, policy_version 66232 (0.0007) +[2023-10-14 07:50:31,554][100936] Updated weights for policy 0, policy_version 66120 (0.0008) +[2023-10-14 07:50:31,928][100936] Updated weights for policy 0, policy_version 66130 (0.0009) +[2023-10-14 07:50:32,298][100936] Updated weights for policy 0, policy_version 66140 (0.0008) +[2023-10-14 07:50:33,022][100917] Updated weights for policy 1, policy_version 66242 (0.0008) +[2023-10-14 07:50:33,404][100917] Updated weights for policy 1, policy_version 66252 (0.0009) +[2023-10-14 07:50:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 135561216. Throughput: 0: 1647.8, 1: 1647.9. Samples: 33899526. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:50:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:50:33,765][100917] Updated weights for policy 1, policy_version 66262 (0.0007) +[2023-10-14 07:50:34,144][100917] Updated weights for policy 1, policy_version 66272 (0.0007) +[2023-10-14 07:50:36,608][100936] Updated weights for policy 0, policy_version 66150 (0.0010) +[2023-10-14 07:50:36,971][100936] Updated weights for policy 0, policy_version 66160 (0.0011) +[2023-10-14 07:50:37,345][100936] Updated weights for policy 0, policy_version 66170 (0.0010) +[2023-10-14 07:50:38,120][100917] Updated weights for policy 1, policy_version 66282 (0.0010) +[2023-10-14 07:50:38,494][100917] Updated weights for policy 1, policy_version 66292 (0.0008) +[2023-10-14 07:50:38,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 135626752. Throughput: 0: 1667.2, 1: 1648.8. Samples: 33919618. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:50:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:50:38,875][100917] Updated weights for policy 1, policy_version 66302 (0.0011) +[2023-10-14 07:50:41,647][100936] Updated weights for policy 0, policy_version 66180 (0.0007) +[2023-10-14 07:50:42,026][100936] Updated weights for policy 0, policy_version 66190 (0.0008) +[2023-10-14 07:50:42,401][100936] Updated weights for policy 0, policy_version 66200 (0.0008) +[2023-10-14 07:50:42,953][100917] Updated weights for policy 1, policy_version 66312 (0.0009) +[2023-10-14 07:50:43,335][100917] Updated weights for policy 1, policy_version 66322 (0.0008) +[2023-10-14 07:50:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 135692288. Throughput: 0: 1665.8, 1: 1651.6. Samples: 33930020. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:50:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:50:43,705][100917] Updated weights for policy 1, policy_version 66332 (0.0009) +[2023-10-14 07:50:46,473][100936] Updated weights for policy 0, policy_version 66210 (0.0009) +[2023-10-14 07:50:46,840][100936] Updated weights for policy 0, policy_version 66220 (0.0008) +[2023-10-14 07:50:47,222][100936] Updated weights for policy 0, policy_version 66230 (0.0009) +[2023-10-14 07:50:47,588][100936] Updated weights for policy 0, policy_version 66240 (0.0011) +[2023-10-14 07:50:47,809][100917] Updated weights for policy 1, policy_version 66342 (0.0010) +[2023-10-14 07:50:48,187][100917] Updated weights for policy 1, policy_version 66352 (0.0008) +[2023-10-14 07:50:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 135757824. Throughput: 0: 1646.9, 1: 1655.3. Samples: 33949420. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:50:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:50:48,560][100917] Updated weights for policy 1, policy_version 66362 (0.0009) +[2023-10-14 07:50:51,613][100936] Updated weights for policy 0, policy_version 66250 (0.0008) +[2023-10-14 07:50:51,978][100936] Updated weights for policy 0, policy_version 66260 (0.0008) +[2023-10-14 07:50:52,346][100936] Updated weights for policy 0, policy_version 66270 (0.0007) +[2023-10-14 07:50:52,673][100917] Updated weights for policy 1, policy_version 66372 (0.0008) +[2023-10-14 07:50:53,047][100917] Updated weights for policy 1, policy_version 66382 (0.0011) +[2023-10-14 07:50:53,413][100917] Updated weights for policy 1, policy_version 66392 (0.0010) +[2023-10-14 07:50:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 135823360. Throughput: 0: 1665.0, 1: 1645.1. Samples: 33969124. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:50:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:50:56,653][100936] Updated weights for policy 0, policy_version 66280 (0.0009) +[2023-10-14 07:50:57,018][100936] Updated weights for policy 0, policy_version 66290 (0.0007) +[2023-10-14 07:50:57,389][100936] Updated weights for policy 0, policy_version 66300 (0.0008) +[2023-10-14 07:50:57,501][100917] Updated weights for policy 1, policy_version 66402 (0.0010) +[2023-10-14 07:50:57,865][100917] Updated weights for policy 1, policy_version 66412 (0.0009) +[2023-10-14 07:50:58,236][100917] Updated weights for policy 1, policy_version 66422 (0.0008) +[2023-10-14 07:50:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 135888896. Throughput: 0: 1659.5, 1: 1655.7. Samples: 33979598. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:50:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:50:58,616][100917] Updated weights for policy 1, policy_version 66432 (0.0008) +[2023-10-14 07:51:01,641][100936] Updated weights for policy 0, policy_version 66310 (0.0008) +[2023-10-14 07:51:02,019][100936] Updated weights for policy 0, policy_version 66320 (0.0009) +[2023-10-14 07:51:02,385][100936] Updated weights for policy 0, policy_version 66330 (0.0010) +[2023-10-14 07:51:02,856][100917] Updated weights for policy 1, policy_version 66442 (0.0007) +[2023-10-14 07:51:03,227][100917] Updated weights for policy 1, policy_version 66452 (0.0007) +[2023-10-14 07:51:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 135954432. Throughput: 0: 1653.5, 1: 1660.9. Samples: 33999140. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:51:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:51:03,598][100917] Updated weights for policy 1, policy_version 66462 (0.0008) +[2023-10-14 07:51:06,580][100936] Updated weights for policy 0, policy_version 66340 (0.0008) +[2023-10-14 07:51:06,952][100936] Updated weights for policy 0, policy_version 66350 (0.0008) +[2023-10-14 07:51:07,311][100936] Updated weights for policy 0, policy_version 66360 (0.0007) +[2023-10-14 07:51:07,765][100917] Updated weights for policy 1, policy_version 66472 (0.0009) +[2023-10-14 07:51:08,146][100917] Updated weights for policy 1, policy_version 66482 (0.0009) +[2023-10-14 07:51:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 136019968. Throughput: 0: 1658.5, 1: 1642.7. Samples: 34018664. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:51:08,513][100917] Updated weights for policy 1, policy_version 66492 (0.0009) +[2023-10-14 07:51:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:51:11,488][100936] Updated weights for policy 0, policy_version 66370 (0.0008) +[2023-10-14 07:51:11,856][100936] Updated weights for policy 0, policy_version 66380 (0.0007) +[2023-10-14 07:51:12,234][100936] Updated weights for policy 0, policy_version 66390 (0.0008) +[2023-10-14 07:51:12,597][100936] Updated weights for policy 0, policy_version 66400 (0.0007) +[2023-10-14 07:51:12,624][100917] Updated weights for policy 1, policy_version 66502 (0.0009) +[2023-10-14 07:51:12,999][100917] Updated weights for policy 1, policy_version 66512 (0.0007) +[2023-10-14 07:51:13,372][100917] Updated weights for policy 1, policy_version 66522 (0.0008) +[2023-10-14 07:51:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 136085504. Throughput: 0: 1659.2, 1: 1654.7. Samples: 34029392. Policy #0 lag: (min: 28.0, avg: 36.0, max: 60.0) +[2023-10-14 07:51:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.660')] +[2023-10-14 07:51:16,548][100936] Updated weights for policy 0, policy_version 66410 (0.0008) +[2023-10-14 07:51:16,915][100936] Updated weights for policy 0, policy_version 66420 (0.0009) +[2023-10-14 07:51:17,288][100936] Updated weights for policy 0, policy_version 66430 (0.0007) +[2023-10-14 07:51:17,449][100917] Updated weights for policy 1, policy_version 66532 (0.0009) +[2023-10-14 07:51:17,826][100917] Updated weights for policy 1, policy_version 66542 (0.0010) +[2023-10-14 07:51:18,198][100917] Updated weights for policy 1, policy_version 66552 (0.0010) +[2023-10-14 07:51:18,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 136183808. Throughput: 0: 1658.1, 1: 1654.3. Samples: 34048582. Policy #0 lag: (min: 28.0, avg: 36.0, max: 60.0) +[2023-10-14 07:51:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.780')] +[2023-10-14 07:51:21,436][100936] Updated weights for policy 0, policy_version 66440 (0.0008) +[2023-10-14 07:51:21,808][100936] Updated weights for policy 0, policy_version 66450 (0.0007) +[2023-10-14 07:51:22,172][100936] Updated weights for policy 0, policy_version 66460 (0.0007) +[2023-10-14 07:51:22,460][100917] Updated weights for policy 1, policy_version 66562 (0.0009) +[2023-10-14 07:51:22,837][100917] Updated weights for policy 1, policy_version 66572 (0.0007) +[2023-10-14 07:51:23,200][100917] Updated weights for policy 1, policy_version 66582 (0.0008) +[2023-10-14 07:51:23,512][99942] Fps is (10 sec: 13106.5, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 136216576. Throughput: 0: 1659.7, 1: 1644.3. Samples: 34068298. Policy #0 lag: (min: 28.0, avg: 36.0, max: 60.0) +[2023-10-14 07:51:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.780')] +[2023-10-14 07:51:23,575][100917] Updated weights for policy 1, policy_version 66592 (0.0009) +[2023-10-14 07:51:26,246][100936] Updated weights for policy 0, policy_version 66470 (0.0008) +[2023-10-14 07:51:26,618][100936] Updated weights for policy 0, policy_version 66480 (0.0012) +[2023-10-14 07:51:26,977][100936] Updated weights for policy 0, policy_version 66490 (0.0010) +[2023-10-14 07:51:27,755][100917] Updated weights for policy 1, policy_version 66602 (0.0009) +[2023-10-14 07:51:28,128][100917] Updated weights for policy 1, policy_version 66612 (0.0009) +[2023-10-14 07:51:28,506][100917] Updated weights for policy 1, policy_version 66622 (0.0009) +[2023-10-14 07:51:28,512][99942] Fps is (10 sec: 9830.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 136282112. Throughput: 0: 1649.6, 1: 1651.2. Samples: 34078552. Policy #0 lag: (min: 28.0, avg: 36.0, max: 60.0) +[2023-10-14 07:51:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.780')] +[2023-10-14 07:51:31,152][100936] Updated weights for policy 0, policy_version 66500 (0.0011) +[2023-10-14 07:51:31,540][100936] Updated weights for policy 0, policy_version 66510 (0.0009) +[2023-10-14 07:51:31,910][100936] Updated weights for policy 0, policy_version 66520 (0.0008) +[2023-10-14 07:51:32,759][100917] Updated weights for policy 1, policy_version 66632 (0.0009) +[2023-10-14 07:51:33,118][100917] Updated weights for policy 1, policy_version 66642 (0.0010) +[2023-10-14 07:51:33,488][100917] Updated weights for policy 1, policy_version 66652 (0.0010) +[2023-10-14 07:51:33,512][99942] Fps is (10 sec: 13107.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 136347648. Throughput: 0: 1649.7, 1: 1646.4. Samples: 34097746. Policy #0 lag: (min: 28.0, avg: 36.0, max: 60.0) +[2023-10-14 07:51:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.780')] +[2023-10-14 07:51:35,943][100936] Updated weights for policy 0, policy_version 66530 (0.0008) +[2023-10-14 07:51:36,313][100936] Updated weights for policy 0, policy_version 66540 (0.0010) +[2023-10-14 07:51:36,680][100936] Updated weights for policy 0, policy_version 66550 (0.0011) +[2023-10-14 07:51:37,056][100936] Updated weights for policy 0, policy_version 66560 (0.0010) +[2023-10-14 07:51:37,588][100917] Updated weights for policy 1, policy_version 66662 (0.0007) +[2023-10-14 07:51:37,960][100917] Updated weights for policy 1, policy_version 66672 (0.0009) +[2023-10-14 07:51:38,323][100917] Updated weights for policy 1, policy_version 66682 (0.0008) +[2023-10-14 07:51:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 136413184. Throughput: 0: 1656.9, 1: 1645.5. Samples: 34117732. Policy #0 lag: (min: 28.0, avg: 36.0, max: 60.0) +[2023-10-14 07:51:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 07:51:38,521][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000066560_68157440.pth... +[2023-10-14 07:51:38,546][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000066688_68288512.pth... +[2023-10-14 07:51:38,551][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000065024_66584576.pth +[2023-10-14 07:51:38,583][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000065120_66682880.pth +[2023-10-14 07:51:41,231][100936] Updated weights for policy 0, policy_version 66570 (0.0008) +[2023-10-14 07:51:41,596][100936] Updated weights for policy 0, policy_version 66580 (0.0007) +[2023-10-14 07:51:41,960][100936] Updated weights for policy 0, policy_version 66590 (0.0007) +[2023-10-14 07:51:42,462][100917] Updated weights for policy 1, policy_version 66692 (0.0007) +[2023-10-14 07:51:42,841][100917] Updated weights for policy 1, policy_version 66702 (0.0009) +[2023-10-14 07:51:43,218][100917] Updated weights for policy 1, policy_version 66712 (0.0009) +[2023-10-14 07:51:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 136478720. Throughput: 0: 1644.2, 1: 1654.6. Samples: 34128044. Policy #0 lag: (min: 28.0, avg: 36.0, max: 60.0) +[2023-10-14 07:51:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 07:51:46,129][100936] Updated weights for policy 0, policy_version 66600 (0.0008) +[2023-10-14 07:51:46,492][100936] Updated weights for policy 0, policy_version 66610 (0.0007) +[2023-10-14 07:51:46,870][100936] Updated weights for policy 0, policy_version 66620 (0.0007) +[2023-10-14 07:51:47,420][100917] Updated weights for policy 1, policy_version 66722 (0.0010) +[2023-10-14 07:51:47,804][100917] Updated weights for policy 1, policy_version 66732 (0.0009) +[2023-10-14 07:51:48,180][100917] Updated weights for policy 1, policy_version 66742 (0.0007) +[2023-10-14 07:51:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 136544256. Throughput: 0: 1649.3, 1: 1652.8. Samples: 34147738. Policy #0 lag: (min: 28.0, avg: 36.0, max: 60.0) +[2023-10-14 07:51:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 07:51:48,551][100917] Updated weights for policy 1, policy_version 66752 (0.0009) +[2023-10-14 07:51:51,036][100936] Updated weights for policy 0, policy_version 66630 (0.0008) +[2023-10-14 07:51:51,404][100936] Updated weights for policy 0, policy_version 66640 (0.0008) +[2023-10-14 07:51:51,772][100936] Updated weights for policy 0, policy_version 66650 (0.0008) +[2023-10-14 07:51:52,597][100917] Updated weights for policy 1, policy_version 66762 (0.0007) +[2023-10-14 07:51:52,964][100917] Updated weights for policy 1, policy_version 66772 (0.0009) +[2023-10-14 07:51:53,351][100917] Updated weights for policy 1, policy_version 66782 (0.0008) +[2023-10-14 07:51:53,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 136642560. Throughput: 0: 1657.0, 1: 1648.1. Samples: 34167394. Policy #0 lag: (min: 28.0, avg: 36.0, max: 60.0) +[2023-10-14 07:51:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 07:51:55,791][100936] Updated weights for policy 0, policy_version 66660 (0.0009) +[2023-10-14 07:51:56,157][100936] Updated weights for policy 0, policy_version 66670 (0.0009) +[2023-10-14 07:51:56,525][100936] Updated weights for policy 0, policy_version 66680 (0.0008) +[2023-10-14 07:51:57,632][100917] Updated weights for policy 1, policy_version 66792 (0.0009) +[2023-10-14 07:51:58,005][100917] Updated weights for policy 1, policy_version 66802 (0.0007) +[2023-10-14 07:51:58,384][100917] Updated weights for policy 1, policy_version 66812 (0.0007) +[2023-10-14 07:51:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 136675328. Throughput: 0: 1642.4, 1: 1648.4. Samples: 34177478. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 07:51:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 07:52:00,680][100936] Updated weights for policy 0, policy_version 66690 (0.0007) +[2023-10-14 07:52:01,049][100936] Updated weights for policy 0, policy_version 66700 (0.0011) +[2023-10-14 07:52:01,421][100936] Updated weights for policy 0, policy_version 66710 (0.0009) +[2023-10-14 07:52:01,789][100936] Updated weights for policy 0, policy_version 66720 (0.0007) +[2023-10-14 07:52:02,302][100917] Updated weights for policy 1, policy_version 66822 (0.0009) +[2023-10-14 07:52:02,674][100917] Updated weights for policy 1, policy_version 66832 (0.0008) +[2023-10-14 07:52:03,037][100917] Updated weights for policy 1, policy_version 66842 (0.0009) +[2023-10-14 07:52:03,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 136773632. Throughput: 0: 1654.8, 1: 1651.5. Samples: 34197368. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 07:52:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 07:52:05,959][100936] Updated weights for policy 0, policy_version 66730 (0.0007) +[2023-10-14 07:52:06,325][100936] Updated weights for policy 0, policy_version 66740 (0.0007) +[2023-10-14 07:52:06,695][100936] Updated weights for policy 0, policy_version 66750 (0.0011) +[2023-10-14 07:52:07,205][100917] Updated weights for policy 1, policy_version 66852 (0.0009) +[2023-10-14 07:52:07,562][100917] Updated weights for policy 1, policy_version 66862 (0.0011) +[2023-10-14 07:52:07,945][100917] Updated weights for policy 1, policy_version 66872 (0.0010) +[2023-10-14 07:52:08,512][99942] Fps is (10 sec: 16383.8, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 136839168. Throughput: 0: 1653.5, 1: 1645.0. Samples: 34216730. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 07:52:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 07:52:11,099][100936] Updated weights for policy 0, policy_version 66760 (0.0009) +[2023-10-14 07:52:11,466][100936] Updated weights for policy 0, policy_version 66770 (0.0009) +[2023-10-14 07:52:11,828][100917] Updated weights for policy 1, policy_version 66882 (0.0009) +[2023-10-14 07:52:11,832][100936] Updated weights for policy 0, policy_version 66780 (0.0009) +[2023-10-14 07:52:12,201][100917] Updated weights for policy 1, policy_version 66892 (0.0008) +[2023-10-14 07:52:12,567][100917] Updated weights for policy 1, policy_version 66902 (0.0009) +[2023-10-14 07:52:12,945][100917] Updated weights for policy 1, policy_version 66912 (0.0011) +[2023-10-14 07:52:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 136904704. Throughput: 0: 1644.4, 1: 1656.0. Samples: 34227070. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 07:52:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 07:52:15,882][100936] Updated weights for policy 0, policy_version 66790 (0.0008) +[2023-10-14 07:52:16,264][100936] Updated weights for policy 0, policy_version 66800 (0.0010) +[2023-10-14 07:52:16,633][100936] Updated weights for policy 0, policy_version 66810 (0.0010) +[2023-10-14 07:52:17,006][100917] Updated weights for policy 1, policy_version 66922 (0.0009) +[2023-10-14 07:52:17,380][100917] Updated weights for policy 1, policy_version 66932 (0.0009) +[2023-10-14 07:52:17,739][100917] Updated weights for policy 1, policy_version 66942 (0.0009) +[2023-10-14 07:52:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 136970240. Throughput: 0: 1655.0, 1: 1653.2. Samples: 34246618. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 07:52:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 07:52:20,736][100936] Updated weights for policy 0, policy_version 66820 (0.0008) +[2023-10-14 07:52:21,111][100936] Updated weights for policy 0, policy_version 66830 (0.0007) +[2023-10-14 07:52:21,481][100936] Updated weights for policy 0, policy_version 66840 (0.0009) +[2023-10-14 07:52:22,076][100917] Updated weights for policy 1, policy_version 66952 (0.0007) +[2023-10-14 07:52:22,451][100917] Updated weights for policy 1, policy_version 66962 (0.0007) +[2023-10-14 07:52:22,824][100917] Updated weights for policy 1, policy_version 66972 (0.0009) +[2023-10-14 07:52:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 137035776. Throughput: 0: 1656.8, 1: 1647.9. Samples: 34266444. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 07:52:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 07:52:25,473][100936] Updated weights for policy 0, policy_version 66850 (0.0009) +[2023-10-14 07:52:25,844][100936] Updated weights for policy 0, policy_version 66860 (0.0007) +[2023-10-14 07:52:26,210][100936] Updated weights for policy 0, policy_version 66870 (0.0008) +[2023-10-14 07:52:26,564][100936] Updated weights for policy 0, policy_version 66880 (0.0007) +[2023-10-14 07:52:27,124][100917] Updated weights for policy 1, policy_version 66982 (0.0010) +[2023-10-14 07:52:27,509][100917] Updated weights for policy 1, policy_version 66992 (0.0011) +[2023-10-14 07:52:27,878][100917] Updated weights for policy 1, policy_version 67002 (0.0010) +[2023-10-14 07:52:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 137101312. Throughput: 0: 1649.9, 1: 1656.7. Samples: 34276842. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 07:52:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:52:30,722][100936] Updated weights for policy 0, policy_version 66890 (0.0008) +[2023-10-14 07:52:31,099][100936] Updated weights for policy 0, policy_version 66900 (0.0007) +[2023-10-14 07:52:31,464][100936] Updated weights for policy 0, policy_version 66910 (0.0008) +[2023-10-14 07:52:31,855][100917] Updated weights for policy 1, policy_version 67012 (0.0011) +[2023-10-14 07:52:32,234][100917] Updated weights for policy 1, policy_version 67022 (0.0009) +[2023-10-14 07:52:32,599][100917] Updated weights for policy 1, policy_version 67032 (0.0011) +[2023-10-14 07:52:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 137166848. Throughput: 0: 1663.3, 1: 1650.7. Samples: 34296868. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 07:52:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:52:35,478][100936] Updated weights for policy 0, policy_version 66920 (0.0008) +[2023-10-14 07:52:35,856][100936] Updated weights for policy 0, policy_version 66930 (0.0009) +[2023-10-14 07:52:36,227][100936] Updated weights for policy 0, policy_version 66940 (0.0011) +[2023-10-14 07:52:36,818][100917] Updated weights for policy 1, policy_version 67042 (0.0009) +[2023-10-14 07:52:37,235][100917] Updated weights for policy 1, policy_version 67052 (0.0009) +[2023-10-14 07:52:37,605][100917] Updated weights for policy 1, policy_version 67062 (0.0011) +[2023-10-14 07:52:37,981][100917] Updated weights for policy 1, policy_version 67072 (0.0009) +[2023-10-14 07:52:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 137232384. Throughput: 0: 1668.5, 1: 1642.4. Samples: 34316384. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 07:52:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:52:40,317][100936] Updated weights for policy 0, policy_version 66950 (0.0010) +[2023-10-14 07:52:40,691][100936] Updated weights for policy 0, policy_version 66960 (0.0008) +[2023-10-14 07:52:41,063][100936] Updated weights for policy 0, policy_version 66970 (0.0009) +[2023-10-14 07:52:42,084][100917] Updated weights for policy 1, policy_version 67082 (0.0009) +[2023-10-14 07:52:42,458][100917] Updated weights for policy 1, policy_version 67092 (0.0010) +[2023-10-14 07:52:42,830][100917] Updated weights for policy 1, policy_version 67102 (0.0012) +[2023-10-14 07:52:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 137297920. Throughput: 0: 1653.7, 1: 1657.3. Samples: 34326474. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:52:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:52:45,223][100936] Updated weights for policy 0, policy_version 66980 (0.0008) +[2023-10-14 07:52:45,593][100936] Updated weights for policy 0, policy_version 66990 (0.0007) +[2023-10-14 07:52:45,962][100936] Updated weights for policy 0, policy_version 67000 (0.0007) +[2023-10-14 07:52:46,896][100917] Updated weights for policy 1, policy_version 67112 (0.0011) +[2023-10-14 07:52:47,261][100917] Updated weights for policy 1, policy_version 67122 (0.0011) +[2023-10-14 07:52:47,627][100917] Updated weights for policy 1, policy_version 67132 (0.0011) +[2023-10-14 07:52:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 137363456. Throughput: 0: 1666.4, 1: 1646.3. Samples: 34346440. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:52:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:52:50,006][100936] Updated weights for policy 0, policy_version 67010 (0.0009) +[2023-10-14 07:52:50,385][100936] Updated weights for policy 0, policy_version 67020 (0.0007) +[2023-10-14 07:52:50,744][100936] Updated weights for policy 0, policy_version 67030 (0.0007) +[2023-10-14 07:52:51,112][100936] Updated weights for policy 0, policy_version 67040 (0.0008) +[2023-10-14 07:52:51,802][100917] Updated weights for policy 1, policy_version 67142 (0.0009) +[2023-10-14 07:52:52,177][100917] Updated weights for policy 1, policy_version 67152 (0.0009) +[2023-10-14 07:52:52,542][100917] Updated weights for policy 1, policy_version 67162 (0.0009) +[2023-10-14 07:52:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 137428992. Throughput: 0: 1671.0, 1: 1647.2. Samples: 34366048. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:52:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:52:55,188][100936] Updated weights for policy 0, policy_version 67050 (0.0009) +[2023-10-14 07:52:55,562][100936] Updated weights for policy 0, policy_version 67060 (0.0008) +[2023-10-14 07:52:55,932][100936] Updated weights for policy 0, policy_version 67070 (0.0008) +[2023-10-14 07:52:56,675][100917] Updated weights for policy 1, policy_version 67172 (0.0009) +[2023-10-14 07:52:57,059][100917] Updated weights for policy 1, policy_version 67182 (0.0009) +[2023-10-14 07:52:57,435][100917] Updated weights for policy 1, policy_version 67192 (0.0008) +[2023-10-14 07:52:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 137494528. Throughput: 0: 1660.4, 1: 1655.2. Samples: 34376268. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:52:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:53:00,068][100936] Updated weights for policy 0, policy_version 67080 (0.0008) +[2023-10-14 07:53:00,443][100936] Updated weights for policy 0, policy_version 67090 (0.0011) +[2023-10-14 07:53:00,818][100936] Updated weights for policy 0, policy_version 67100 (0.0011) +[2023-10-14 07:53:01,556][100917] Updated weights for policy 1, policy_version 67202 (0.0009) +[2023-10-14 07:53:01,928][100917] Updated weights for policy 1, policy_version 67212 (0.0007) +[2023-10-14 07:53:02,307][100917] Updated weights for policy 1, policy_version 67222 (0.0009) +[2023-10-14 07:53:02,674][100917] Updated weights for policy 1, policy_version 67232 (0.0008) +[2023-10-14 07:53:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 137560064. Throughput: 0: 1672.9, 1: 1652.6. Samples: 34396268. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:53:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:53:05,058][100936] Updated weights for policy 0, policy_version 67110 (0.0010) +[2023-10-14 07:53:05,414][100936] Updated weights for policy 0, policy_version 67120 (0.0009) +[2023-10-14 07:53:05,790][100936] Updated weights for policy 0, policy_version 67130 (0.0007) +[2023-10-14 07:53:06,751][100917] Updated weights for policy 1, policy_version 67242 (0.0009) +[2023-10-14 07:53:07,126][100917] Updated weights for policy 1, policy_version 67252 (0.0010) +[2023-10-14 07:53:07,491][100917] Updated weights for policy 1, policy_version 67262 (0.0009) +[2023-10-14 07:53:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 137625600. Throughput: 0: 1667.2, 1: 1657.2. Samples: 34416044. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:53:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:53:09,794][100936] Updated weights for policy 0, policy_version 67140 (0.0008) +[2023-10-14 07:53:10,171][100936] Updated weights for policy 0, policy_version 67150 (0.0008) +[2023-10-14 07:53:10,540][100936] Updated weights for policy 0, policy_version 67160 (0.0008) +[2023-10-14 07:53:11,570][100917] Updated weights for policy 1, policy_version 67272 (0.0008) +[2023-10-14 07:53:11,937][100917] Updated weights for policy 1, policy_version 67282 (0.0008) +[2023-10-14 07:53:12,319][100917] Updated weights for policy 1, policy_version 67292 (0.0009) +[2023-10-14 07:53:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 137691136. Throughput: 0: 1659.4, 1: 1661.2. Samples: 34426268. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:53:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:53:14,616][100936] Updated weights for policy 0, policy_version 67170 (0.0009) +[2023-10-14 07:53:14,994][100936] Updated weights for policy 0, policy_version 67180 (0.0007) +[2023-10-14 07:53:15,374][100936] Updated weights for policy 0, policy_version 67190 (0.0008) +[2023-10-14 07:53:15,739][100936] Updated weights for policy 0, policy_version 67200 (0.0007) +[2023-10-14 07:53:16,401][100917] Updated weights for policy 1, policy_version 67302 (0.0009) +[2023-10-14 07:53:16,769][100917] Updated weights for policy 1, policy_version 67312 (0.0008) +[2023-10-14 07:53:17,136][100917] Updated weights for policy 1, policy_version 67322 (0.0008) +[2023-10-14 07:53:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 137756672. Throughput: 0: 1665.8, 1: 1651.6. Samples: 34446150. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:53:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:53:19,847][100936] Updated weights for policy 0, policy_version 67210 (0.0009) +[2023-10-14 07:53:20,211][100936] Updated weights for policy 0, policy_version 67220 (0.0007) +[2023-10-14 07:53:20,578][100936] Updated weights for policy 0, policy_version 67230 (0.0007) +[2023-10-14 07:53:21,265][100917] Updated weights for policy 1, policy_version 67332 (0.0008) +[2023-10-14 07:53:21,681][100917] Updated weights for policy 1, policy_version 67342 (0.0008) +[2023-10-14 07:53:22,055][100917] Updated weights for policy 1, policy_version 67352 (0.0007) +[2023-10-14 07:53:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 137822208. Throughput: 0: 1653.5, 1: 1668.3. Samples: 34465864. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:53:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:53:24,943][100936] Updated weights for policy 0, policy_version 67240 (0.0009) +[2023-10-14 07:53:25,307][100936] Updated weights for policy 0, policy_version 67250 (0.0008) +[2023-10-14 07:53:25,675][100936] Updated weights for policy 0, policy_version 67260 (0.0008) +[2023-10-14 07:53:26,058][100917] Updated weights for policy 1, policy_version 67362 (0.0008) +[2023-10-14 07:53:26,422][100917] Updated weights for policy 1, policy_version 67372 (0.0009) +[2023-10-14 07:53:26,802][100917] Updated weights for policy 1, policy_version 67382 (0.0008) +[2023-10-14 07:53:27,164][100917] Updated weights for policy 1, policy_version 67392 (0.0009) +[2023-10-14 07:53:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 137887744. Throughput: 0: 1655.3, 1: 1672.2. Samples: 34476212. Policy #0 lag: (min: 2.0, avg: 2.7, max: 20.0) +[2023-10-14 07:53:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:53:29,719][100936] Updated weights for policy 0, policy_version 67270 (0.0010) +[2023-10-14 07:53:30,090][100936] Updated weights for policy 0, policy_version 67280 (0.0008) +[2023-10-14 07:53:30,446][100936] Updated weights for policy 0, policy_version 67290 (0.0007) +[2023-10-14 07:53:31,126][100917] Updated weights for policy 1, policy_version 67402 (0.0007) +[2023-10-14 07:53:31,492][100917] Updated weights for policy 1, policy_version 67412 (0.0009) +[2023-10-14 07:53:31,881][100917] Updated weights for policy 1, policy_version 67422 (0.0008) +[2023-10-14 07:53:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 137953280. Throughput: 0: 1660.1, 1: 1659.2. Samples: 34495808. Policy #0 lag: (min: 2.0, avg: 2.7, max: 20.0) +[2023-10-14 07:53:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:53:34,584][100936] Updated weights for policy 0, policy_version 67300 (0.0007) +[2023-10-14 07:53:34,949][100936] Updated weights for policy 0, policy_version 67310 (0.0011) +[2023-10-14 07:53:35,318][100936] Updated weights for policy 0, policy_version 67320 (0.0008) +[2023-10-14 07:53:35,920][100917] Updated weights for policy 1, policy_version 67432 (0.0009) +[2023-10-14 07:53:36,286][100917] Updated weights for policy 1, policy_version 67442 (0.0009) +[2023-10-14 07:53:36,658][100917] Updated weights for policy 1, policy_version 67452 (0.0010) +[2023-10-14 07:53:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 138018816. Throughput: 0: 1657.7, 1: 1675.0. Samples: 34516018. Policy #0 lag: (min: 2.0, avg: 2.7, max: 20.0) +[2023-10-14 07:53:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:53:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000067328_68943872.pth... +[2023-10-14 07:53:38,525][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000067456_69074944.pth... +[2023-10-14 07:53:38,563][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000065792_67371008.pth +[2023-10-14 07:53:38,571][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000065920_67502080.pth +[2023-10-14 07:53:39,345][100936] Updated weights for policy 0, policy_version 67330 (0.0008) +[2023-10-14 07:53:39,719][100936] Updated weights for policy 0, policy_version 67340 (0.0010) +[2023-10-14 07:53:40,091][100936] Updated weights for policy 0, policy_version 67350 (0.0007) +[2023-10-14 07:53:40,457][100936] Updated weights for policy 0, policy_version 67360 (0.0008) +[2023-10-14 07:53:40,790][100917] Updated weights for policy 1, policy_version 67462 (0.0009) +[2023-10-14 07:53:41,157][100917] Updated weights for policy 1, policy_version 67472 (0.0008) +[2023-10-14 07:53:41,537][100917] Updated weights for policy 1, policy_version 67482 (0.0008) +[2023-10-14 07:53:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 138084352. Throughput: 0: 1655.1, 1: 1665.2. Samples: 34525678. Policy #0 lag: (min: 2.0, avg: 2.7, max: 20.0) +[2023-10-14 07:53:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:53:44,555][100936] Updated weights for policy 0, policy_version 67370 (0.0008) +[2023-10-14 07:53:44,930][100936] Updated weights for policy 0, policy_version 67380 (0.0008) +[2023-10-14 07:53:45,302][100936] Updated weights for policy 0, policy_version 67390 (0.0008) +[2023-10-14 07:53:45,681][100917] Updated weights for policy 1, policy_version 67492 (0.0007) +[2023-10-14 07:53:46,047][100917] Updated weights for policy 1, policy_version 67502 (0.0008) +[2023-10-14 07:53:46,429][100917] Updated weights for policy 1, policy_version 67512 (0.0008) +[2023-10-14 07:53:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 138149888. Throughput: 0: 1658.8, 1: 1655.7. Samples: 34545422. Policy #0 lag: (min: 2.0, avg: 2.7, max: 20.0) +[2023-10-14 07:53:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:53:49,375][100936] Updated weights for policy 0, policy_version 67400 (0.0008) +[2023-10-14 07:53:49,748][100936] Updated weights for policy 0, policy_version 67410 (0.0009) +[2023-10-14 07:53:50,123][100936] Updated weights for policy 0, policy_version 67420 (0.0009) +[2023-10-14 07:53:50,589][100917] Updated weights for policy 1, policy_version 67522 (0.0008) +[2023-10-14 07:53:50,959][100917] Updated weights for policy 1, policy_version 67532 (0.0009) +[2023-10-14 07:53:51,326][100917] Updated weights for policy 1, policy_version 67542 (0.0010) +[2023-10-14 07:53:51,706][100917] Updated weights for policy 1, policy_version 67552 (0.0007) +[2023-10-14 07:53:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 138215424. Throughput: 0: 1661.5, 1: 1676.0. Samples: 34566228. Policy #0 lag: (min: 2.0, avg: 2.7, max: 20.0) +[2023-10-14 07:53:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:53:54,241][100936] Updated weights for policy 0, policy_version 67430 (0.0008) +[2023-10-14 07:53:54,616][100936] Updated weights for policy 0, policy_version 67440 (0.0008) +[2023-10-14 07:53:54,980][100936] Updated weights for policy 0, policy_version 67450 (0.0009) +[2023-10-14 07:53:55,777][100917] Updated weights for policy 1, policy_version 67562 (0.0009) +[2023-10-14 07:53:56,153][100917] Updated weights for policy 1, policy_version 67572 (0.0007) +[2023-10-14 07:53:56,517][100917] Updated weights for policy 1, policy_version 67582 (0.0009) +[2023-10-14 07:53:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 138280960. Throughput: 0: 1661.0, 1: 1662.0. Samples: 34575806. Policy #0 lag: (min: 2.0, avg: 2.7, max: 20.0) +[2023-10-14 07:53:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:53:59,208][100936] Updated weights for policy 0, policy_version 67460 (0.0009) +[2023-10-14 07:53:59,571][100936] Updated weights for policy 0, policy_version 67470 (0.0011) +[2023-10-14 07:53:59,942][100936] Updated weights for policy 0, policy_version 67480 (0.0010) +[2023-10-14 07:54:00,552][100917] Updated weights for policy 1, policy_version 67592 (0.0008) +[2023-10-14 07:54:00,929][100917] Updated weights for policy 1, policy_version 67602 (0.0008) +[2023-10-14 07:54:01,295][100917] Updated weights for policy 1, policy_version 67612 (0.0009) +[2023-10-14 07:54:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 138346496. Throughput: 0: 1652.2, 1: 1667.1. Samples: 34595518. Policy #0 lag: (min: 2.0, avg: 2.7, max: 20.0) +[2023-10-14 07:54:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:54:04,126][100936] Updated weights for policy 0, policy_version 67490 (0.0008) +[2023-10-14 07:54:04,490][100936] Updated weights for policy 0, policy_version 67500 (0.0007) +[2023-10-14 07:54:04,868][100936] Updated weights for policy 0, policy_version 67510 (0.0008) +[2023-10-14 07:54:05,235][100936] Updated weights for policy 0, policy_version 67520 (0.0008) +[2023-10-14 07:54:05,301][100917] Updated weights for policy 1, policy_version 67622 (0.0008) +[2023-10-14 07:54:05,674][100917] Updated weights for policy 1, policy_version 67632 (0.0010) +[2023-10-14 07:54:06,043][100917] Updated weights for policy 1, policy_version 67642 (0.0008) +[2023-10-14 07:54:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 138412032. Throughput: 0: 1660.1, 1: 1677.7. Samples: 34616066. Policy #0 lag: (min: 2.0, avg: 2.7, max: 20.0) +[2023-10-14 07:54:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:54:09,364][100936] Updated weights for policy 0, policy_version 67530 (0.0008) +[2023-10-14 07:54:09,727][100936] Updated weights for policy 0, policy_version 67540 (0.0009) +[2023-10-14 07:54:10,104][100936] Updated weights for policy 0, policy_version 67550 (0.0009) +[2023-10-14 07:54:10,312][100917] Updated weights for policy 1, policy_version 67652 (0.0007) +[2023-10-14 07:54:10,710][100917] Updated weights for policy 1, policy_version 67662 (0.0009) +[2023-10-14 07:54:11,080][100917] Updated weights for policy 1, policy_version 67672 (0.0008) +[2023-10-14 07:54:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 138477568. Throughput: 0: 1662.2, 1: 1654.0. Samples: 34625440. Policy #0 lag: (min: 28.0, avg: 32.7, max: 60.0) +[2023-10-14 07:54:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.940')] +[2023-10-14 07:54:14,202][100936] Updated weights for policy 0, policy_version 67560 (0.0008) +[2023-10-14 07:54:14,566][100936] Updated weights for policy 0, policy_version 67570 (0.0008) +[2023-10-14 07:54:14,934][100936] Updated weights for policy 0, policy_version 67580 (0.0008) +[2023-10-14 07:54:15,273][100917] Updated weights for policy 1, policy_version 67682 (0.0010) +[2023-10-14 07:54:15,639][100917] Updated weights for policy 1, policy_version 67692 (0.0007) +[2023-10-14 07:54:16,021][100917] Updated weights for policy 1, policy_version 67702 (0.0008) +[2023-10-14 07:54:16,400][100917] Updated weights for policy 1, policy_version 67712 (0.0009) +[2023-10-14 07:54:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 138543104. Throughput: 0: 1654.2, 1: 1664.8. Samples: 34645166. Policy #0 lag: (min: 28.0, avg: 32.7, max: 60.0) +[2023-10-14 07:54:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:54:19,187][100936] Updated weights for policy 0, policy_version 67590 (0.0008) +[2023-10-14 07:54:19,565][100936] Updated weights for policy 0, policy_version 67600 (0.0008) +[2023-10-14 07:54:19,937][100936] Updated weights for policy 0, policy_version 67610 (0.0010) +[2023-10-14 07:54:20,474][100917] Updated weights for policy 1, policy_version 67722 (0.0010) +[2023-10-14 07:54:20,852][100917] Updated weights for policy 1, policy_version 67732 (0.0010) +[2023-10-14 07:54:21,229][100917] Updated weights for policy 1, policy_version 67742 (0.0010) +[2023-10-14 07:54:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 138608640. Throughput: 0: 1657.1, 1: 1668.0. Samples: 34665644. Policy #0 lag: (min: 28.0, avg: 32.7, max: 60.0) +[2023-10-14 07:54:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:54:24,000][100936] Updated weights for policy 0, policy_version 67620 (0.0008) +[2023-10-14 07:54:24,363][100936] Updated weights for policy 0, policy_version 67630 (0.0007) +[2023-10-14 07:54:24,730][100936] Updated weights for policy 0, policy_version 67640 (0.0008) +[2023-10-14 07:54:25,172][100917] Updated weights for policy 1, policy_version 67752 (0.0009) +[2023-10-14 07:54:25,549][100917] Updated weights for policy 1, policy_version 67762 (0.0007) +[2023-10-14 07:54:25,922][100917] Updated weights for policy 1, policy_version 67772 (0.0008) +[2023-10-14 07:54:28,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 138674176. Throughput: 0: 1663.6, 1: 1652.8. Samples: 34674918. Policy #0 lag: (min: 28.0, avg: 32.7, max: 60.0) +[2023-10-14 07:54:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 07:54:28,765][100936] Updated weights for policy 0, policy_version 67650 (0.0008) +[2023-10-14 07:54:29,140][100936] Updated weights for policy 0, policy_version 67660 (0.0009) +[2023-10-14 07:54:29,511][100936] Updated weights for policy 0, policy_version 67670 (0.0007) +[2023-10-14 07:54:29,873][100936] Updated weights for policy 0, policy_version 67680 (0.0007) +[2023-10-14 07:54:30,010][100917] Updated weights for policy 1, policy_version 67782 (0.0010) +[2023-10-14 07:54:30,380][100917] Updated weights for policy 1, policy_version 67792 (0.0008) +[2023-10-14 07:54:30,748][100917] Updated weights for policy 1, policy_version 67802 (0.0009) +[2023-10-14 07:54:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 138739712. Throughput: 0: 1667.6, 1: 1667.1. Samples: 34695482. Policy #0 lag: (min: 28.0, avg: 32.7, max: 60.0) +[2023-10-14 07:54:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.910')] +[2023-10-14 07:54:33,773][100936] Updated weights for policy 0, policy_version 67690 (0.0009) +[2023-10-14 07:54:34,139][100936] Updated weights for policy 0, policy_version 67700 (0.0009) +[2023-10-14 07:54:34,520][100936] Updated weights for policy 0, policy_version 67710 (0.0008) +[2023-10-14 07:54:34,957][100917] Updated weights for policy 1, policy_version 67812 (0.0009) +[2023-10-14 07:54:35,312][100917] Updated weights for policy 1, policy_version 67822 (0.0008) +[2023-10-14 07:54:35,688][100917] Updated weights for policy 1, policy_version 67832 (0.0009) +[2023-10-14 07:54:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 138805248. Throughput: 0: 1663.2, 1: 1659.8. Samples: 34715760. Policy #0 lag: (min: 28.0, avg: 32.7, max: 60.0) +[2023-10-14 07:54:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.910')] +[2023-10-14 07:54:38,753][100936] Updated weights for policy 0, policy_version 67720 (0.0008) +[2023-10-14 07:54:39,124][100936] Updated weights for policy 0, policy_version 67730 (0.0009) +[2023-10-14 07:54:39,497][100936] Updated weights for policy 0, policy_version 67740 (0.0008) +[2023-10-14 07:54:39,868][100917] Updated weights for policy 1, policy_version 67842 (0.0011) +[2023-10-14 07:54:40,247][100917] Updated weights for policy 1, policy_version 67852 (0.0011) +[2023-10-14 07:54:40,612][100917] Updated weights for policy 1, policy_version 67862 (0.0012) +[2023-10-14 07:54:40,976][100917] Updated weights for policy 1, policy_version 67872 (0.0008) +[2023-10-14 07:54:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 138870784. Throughput: 0: 1666.1, 1: 1644.8. Samples: 34724800. Policy #0 lag: (min: 28.0, avg: 32.7, max: 60.0) +[2023-10-14 07:54:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.830')] +[2023-10-14 07:54:43,559][100936] Updated weights for policy 0, policy_version 67750 (0.0008) +[2023-10-14 07:54:43,926][100936] Updated weights for policy 0, policy_version 67760 (0.0010) +[2023-10-14 07:54:44,291][100936] Updated weights for policy 0, policy_version 67770 (0.0007) +[2023-10-14 07:54:45,167][100917] Updated weights for policy 1, policy_version 67882 (0.0007) +[2023-10-14 07:54:45,541][100917] Updated weights for policy 1, policy_version 67892 (0.0009) +[2023-10-14 07:54:45,914][100917] Updated weights for policy 1, policy_version 67902 (0.0010) +[2023-10-14 07:54:48,377][100936] Updated weights for policy 0, policy_version 67780 (0.0009) +[2023-10-14 07:54:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 138936320. Throughput: 0: 1674.5, 1: 1651.9. Samples: 34745206. Policy #0 lag: (min: 28.0, avg: 32.7, max: 60.0) +[2023-10-14 07:54:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.830')] +[2023-10-14 07:54:48,747][100936] Updated weights for policy 0, policy_version 67790 (0.0008) +[2023-10-14 07:54:49,114][100936] Updated weights for policy 0, policy_version 67800 (0.0007) +[2023-10-14 07:54:50,210][100917] Updated weights for policy 1, policy_version 67912 (0.0008) +[2023-10-14 07:54:50,586][100917] Updated weights for policy 1, policy_version 67922 (0.0007) +[2023-10-14 07:54:50,947][100917] Updated weights for policy 1, policy_version 67932 (0.0010) +[2023-10-14 07:54:53,153][100936] Updated weights for policy 0, policy_version 67810 (0.0007) +[2023-10-14 07:54:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 139001856. Throughput: 0: 1664.8, 1: 1645.0. Samples: 34765010. Policy #0 lag: (min: 28.0, avg: 32.7, max: 60.0) +[2023-10-14 07:54:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.830')] +[2023-10-14 07:54:53,527][100936] Updated weights for policy 0, policy_version 67820 (0.0009) +[2023-10-14 07:54:53,901][100936] Updated weights for policy 0, policy_version 67830 (0.0011) +[2023-10-14 07:54:54,271][100936] Updated weights for policy 0, policy_version 67840 (0.0008) +[2023-10-14 07:54:55,173][100917] Updated weights for policy 1, policy_version 67942 (0.0010) +[2023-10-14 07:54:55,559][100917] Updated weights for policy 1, policy_version 67952 (0.0008) +[2023-10-14 07:54:55,927][100917] Updated weights for policy 1, policy_version 67962 (0.0009) +[2023-10-14 07:54:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 139067392. Throughput: 0: 1672.8, 1: 1643.6. Samples: 34774680. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 07:54:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.830')] +[2023-10-14 07:54:58,538][100936] Updated weights for policy 0, policy_version 67850 (0.0008) +[2023-10-14 07:54:58,908][100936] Updated weights for policy 0, policy_version 67860 (0.0009) +[2023-10-14 07:54:59,276][100936] Updated weights for policy 0, policy_version 67870 (0.0008) +[2023-10-14 07:54:59,988][100917] Updated weights for policy 1, policy_version 67972 (0.0009) +[2023-10-14 07:55:00,364][100917] Updated weights for policy 1, policy_version 67982 (0.0010) +[2023-10-14 07:55:00,741][100917] Updated weights for policy 1, policy_version 67992 (0.0011) +[2023-10-14 07:55:03,356][100936] Updated weights for policy 0, policy_version 67880 (0.0008) +[2023-10-14 07:55:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 139132928. Throughput: 0: 1675.6, 1: 1655.7. Samples: 34795078. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 07:55:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.830')] +[2023-10-14 07:55:03,727][100936] Updated weights for policy 0, policy_version 67890 (0.0010) +[2023-10-14 07:55:04,103][100936] Updated weights for policy 0, policy_version 67900 (0.0007) +[2023-10-14 07:55:04,911][100917] Updated weights for policy 1, policy_version 68002 (0.0010) +[2023-10-14 07:55:05,286][100917] Updated weights for policy 1, policy_version 68012 (0.0008) +[2023-10-14 07:55:05,662][100917] Updated weights for policy 1, policy_version 68022 (0.0009) +[2023-10-14 07:55:06,031][100917] Updated weights for policy 1, policy_version 68032 (0.0007) +[2023-10-14 07:55:08,301][100936] Updated weights for policy 0, policy_version 67910 (0.0008) +[2023-10-14 07:55:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 139198464. Throughput: 0: 1665.5, 1: 1653.1. Samples: 34814982. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 07:55:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.830')] +[2023-10-14 07:55:08,680][100936] Updated weights for policy 0, policy_version 67920 (0.0007) +[2023-10-14 07:55:09,049][100936] Updated weights for policy 0, policy_version 67930 (0.0007) +[2023-10-14 07:55:10,179][100917] Updated weights for policy 1, policy_version 68042 (0.0010) +[2023-10-14 07:55:10,552][100917] Updated weights for policy 1, policy_version 68052 (0.0009) +[2023-10-14 07:55:10,929][100917] Updated weights for policy 1, policy_version 68062 (0.0010) +[2023-10-14 07:55:13,154][100936] Updated weights for policy 0, policy_version 67940 (0.0010) +[2023-10-14 07:55:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 139264000. Throughput: 0: 1668.7, 1: 1650.7. Samples: 34824290. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 07:55:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.830')] +[2023-10-14 07:55:13,521][100936] Updated weights for policy 0, policy_version 67950 (0.0009) +[2023-10-14 07:55:13,885][100936] Updated weights for policy 0, policy_version 67960 (0.0009) +[2023-10-14 07:55:15,097][100917] Updated weights for policy 1, policy_version 68072 (0.0008) +[2023-10-14 07:55:15,470][100917] Updated weights for policy 1, policy_version 68082 (0.0008) +[2023-10-14 07:55:15,840][100917] Updated weights for policy 1, policy_version 68092 (0.0009) +[2023-10-14 07:55:18,036][100936] Updated weights for policy 0, policy_version 67970 (0.0010) +[2023-10-14 07:55:18,404][100936] Updated weights for policy 0, policy_version 67980 (0.0009) +[2023-10-14 07:55:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 139329536. Throughput: 0: 1665.0, 1: 1648.5. Samples: 34844588. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 07:55:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.830')] +[2023-10-14 07:55:18,776][100936] Updated weights for policy 0, policy_version 67990 (0.0008) +[2023-10-14 07:55:19,135][100936] Updated weights for policy 0, policy_version 68000 (0.0010) +[2023-10-14 07:55:19,867][100917] Updated weights for policy 1, policy_version 68102 (0.0010) +[2023-10-14 07:55:20,247][100917] Updated weights for policy 1, policy_version 68112 (0.0011) +[2023-10-14 07:55:20,626][100917] Updated weights for policy 1, policy_version 68122 (0.0009) +[2023-10-14 07:55:23,116][100936] Updated weights for policy 0, policy_version 68010 (0.0009) +[2023-10-14 07:55:23,481][100936] Updated weights for policy 0, policy_version 68020 (0.0009) +[2023-10-14 07:55:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 139395072. Throughput: 0: 1652.8, 1: 1652.8. Samples: 34864514. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 07:55:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.830')] +[2023-10-14 07:55:23,858][100936] Updated weights for policy 0, policy_version 68030 (0.0008) +[2023-10-14 07:55:24,746][100917] Updated weights for policy 1, policy_version 68132 (0.0008) +[2023-10-14 07:55:25,115][100917] Updated weights for policy 1, policy_version 68142 (0.0007) +[2023-10-14 07:55:25,497][100917] Updated weights for policy 1, policy_version 68152 (0.0007) +[2023-10-14 07:55:28,003][100936] Updated weights for policy 0, policy_version 68040 (0.0008) +[2023-10-14 07:55:28,366][100936] Updated weights for policy 0, policy_version 68050 (0.0007) +[2023-10-14 07:55:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 139460608. Throughput: 0: 1669.2, 1: 1650.7. Samples: 34874194. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 07:55:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.830')] +[2023-10-14 07:55:28,741][100936] Updated weights for policy 0, policy_version 68060 (0.0009) +[2023-10-14 07:55:29,541][100917] Updated weights for policy 1, policy_version 68162 (0.0010) +[2023-10-14 07:55:29,909][100917] Updated weights for policy 1, policy_version 68172 (0.0008) +[2023-10-14 07:55:30,276][100917] Updated weights for policy 1, policy_version 68182 (0.0010) +[2023-10-14 07:55:30,644][100917] Updated weights for policy 1, policy_version 68192 (0.0008) +[2023-10-14 07:55:32,909][100936] Updated weights for policy 0, policy_version 68070 (0.0010) +[2023-10-14 07:55:33,276][100936] Updated weights for policy 0, policy_version 68080 (0.0008) +[2023-10-14 07:55:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 139526144. Throughput: 0: 1661.1, 1: 1655.8. Samples: 34894466. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 07:55:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.830')] +[2023-10-14 07:55:33,654][100936] Updated weights for policy 0, policy_version 68090 (0.0007) +[2023-10-14 07:55:34,635][100917] Updated weights for policy 1, policy_version 68202 (0.0009) +[2023-10-14 07:55:35,011][100917] Updated weights for policy 1, policy_version 68212 (0.0008) +[2023-10-14 07:55:35,391][100917] Updated weights for policy 1, policy_version 68222 (0.0007) +[2023-10-14 07:55:37,979][100936] Updated weights for policy 0, policy_version 68100 (0.0008) +[2023-10-14 07:55:38,350][100936] Updated weights for policy 0, policy_version 68110 (0.0010) +[2023-10-14 07:55:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 139591680. Throughput: 0: 1653.3, 1: 1663.0. Samples: 34914244. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) +[2023-10-14 07:55:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.830')] +[2023-10-14 07:55:38,522][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000068224_69861376.pth... +[2023-10-14 07:55:38,556][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000066688_68288512.pth +[2023-10-14 07:55:38,718][100936] Updated weights for policy 0, policy_version 68120 (0.0010) +[2023-10-14 07:55:39,015][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000068128_69763072.pth... +[2023-10-14 07:55:39,047][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000066560_68157440.pth +[2023-10-14 07:55:39,407][100917] Updated weights for policy 1, policy_version 68232 (0.0009) +[2023-10-14 07:55:39,782][100917] Updated weights for policy 1, policy_version 68242 (0.0010) +[2023-10-14 07:55:40,150][100917] Updated weights for policy 1, policy_version 68252 (0.0009) +[2023-10-14 07:55:42,803][100936] Updated weights for policy 0, policy_version 68130 (0.0010) +[2023-10-14 07:55:43,176][100936] Updated weights for policy 0, policy_version 68140 (0.0008) +[2023-10-14 07:55:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 139657216. Throughput: 0: 1655.4, 1: 1661.2. Samples: 34923924. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 07:55:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.830')] +[2023-10-14 07:55:43,558][100936] Updated weights for policy 0, policy_version 68150 (0.0007) +[2023-10-14 07:55:43,926][100936] Updated weights for policy 0, policy_version 68160 (0.0008) +[2023-10-14 07:55:44,510][100917] Updated weights for policy 1, policy_version 68262 (0.0008) +[2023-10-14 07:55:44,885][100917] Updated weights for policy 1, policy_version 68272 (0.0007) +[2023-10-14 07:55:45,259][100917] Updated weights for policy 1, policy_version 68282 (0.0008) +[2023-10-14 07:55:48,100][100936] Updated weights for policy 0, policy_version 68170 (0.0010) +[2023-10-14 07:55:48,464][100936] Updated weights for policy 0, policy_version 68180 (0.0008) +[2023-10-14 07:55:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 139722752. Throughput: 0: 1654.7, 1: 1659.4. Samples: 34944214. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 07:55:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.830')] +[2023-10-14 07:55:48,841][100936] Updated weights for policy 0, policy_version 68190 (0.0007) +[2023-10-14 07:55:49,382][100917] Updated weights for policy 1, policy_version 68292 (0.0009) +[2023-10-14 07:55:49,761][100917] Updated weights for policy 1, policy_version 68302 (0.0009) +[2023-10-14 07:55:50,129][100917] Updated weights for policy 1, policy_version 68312 (0.0010) +[2023-10-14 07:55:52,881][100936] Updated weights for policy 0, policy_version 68200 (0.0007) +[2023-10-14 07:55:53,265][100936] Updated weights for policy 0, policy_version 68210 (0.0008) +[2023-10-14 07:55:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 139788288. Throughput: 0: 1644.4, 1: 1662.4. Samples: 34963790. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 07:55:53,512][99942] Avg episode reward: [(0, '0.810'), (1, '0.830')] +[2023-10-14 07:55:53,639][100936] Updated weights for policy 0, policy_version 68220 (0.0009) +[2023-10-14 07:55:54,208][100917] Updated weights for policy 1, policy_version 68322 (0.0008) +[2023-10-14 07:55:54,584][100917] Updated weights for policy 1, policy_version 68332 (0.0008) +[2023-10-14 07:55:54,962][100917] Updated weights for policy 1, policy_version 68342 (0.0010) +[2023-10-14 07:55:55,343][100917] Updated weights for policy 1, policy_version 68352 (0.0009) +[2023-10-14 07:55:57,743][100936] Updated weights for policy 0, policy_version 68230 (0.0009) +[2023-10-14 07:55:58,107][100936] Updated weights for policy 0, policy_version 68240 (0.0008) +[2023-10-14 07:55:58,479][100936] Updated weights for policy 0, policy_version 68250 (0.0008) +[2023-10-14 07:55:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 139853824. Throughput: 0: 1656.1, 1: 1660.7. Samples: 34973550. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 07:55:58,513][99942] Avg episode reward: [(0, '0.700'), (1, '0.830')] +[2023-10-14 07:55:59,477][100917] Updated weights for policy 1, policy_version 68362 (0.0008) +[2023-10-14 07:55:59,851][100917] Updated weights for policy 1, policy_version 68372 (0.0010) +[2023-10-14 07:56:00,228][100917] Updated weights for policy 1, policy_version 68382 (0.0008) +[2023-10-14 07:56:02,659][100936] Updated weights for policy 0, policy_version 68260 (0.0008) +[2023-10-14 07:56:03,030][100936] Updated weights for policy 0, policy_version 68270 (0.0009) +[2023-10-14 07:56:03,404][100936] Updated weights for policy 0, policy_version 68280 (0.0009) +[2023-10-14 07:56:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 139919360. Throughput: 0: 1653.6, 1: 1668.3. Samples: 34994072. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 07:56:03,513][99942] Avg episode reward: [(0, '0.700'), (1, '0.830')] +[2023-10-14 07:56:04,380][100917] Updated weights for policy 1, policy_version 68392 (0.0010) +[2023-10-14 07:56:04,753][100917] Updated weights for policy 1, policy_version 68402 (0.0008) +[2023-10-14 07:56:05,118][100917] Updated weights for policy 1, policy_version 68412 (0.0008) +[2023-10-14 07:56:07,601][100936] Updated weights for policy 0, policy_version 68290 (0.0010) +[2023-10-14 07:56:07,976][100936] Updated weights for policy 0, policy_version 68300 (0.0008) +[2023-10-14 07:56:08,352][100936] Updated weights for policy 0, policy_version 68310 (0.0007) +[2023-10-14 07:56:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 139984896. Throughput: 0: 1647.4, 1: 1659.4. Samples: 35013322. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 07:56:08,512][99942] Avg episode reward: [(0, '0.690'), (1, '0.830')] +[2023-10-14 07:56:08,726][100936] Updated weights for policy 0, policy_version 68320 (0.0007) +[2023-10-14 07:56:09,372][100917] Updated weights for policy 1, policy_version 68422 (0.0008) +[2023-10-14 07:56:09,742][100917] Updated weights for policy 1, policy_version 68432 (0.0008) +[2023-10-14 07:56:10,108][100917] Updated weights for policy 1, policy_version 68442 (0.0009) +[2023-10-14 07:56:12,883][100936] Updated weights for policy 0, policy_version 68330 (0.0007) +[2023-10-14 07:56:13,261][100936] Updated weights for policy 0, policy_version 68340 (0.0007) +[2023-10-14 07:56:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13107.2). Total num frames: 140050432. Throughput: 0: 1648.6, 1: 1663.6. Samples: 35023242. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 07:56:13,513][99942] Avg episode reward: [(0, '0.690'), (1, '0.830')] +[2023-10-14 07:56:13,633][100936] Updated weights for policy 0, policy_version 68350 (0.0007) +[2023-10-14 07:56:14,274][100917] Updated weights for policy 1, policy_version 68452 (0.0009) +[2023-10-14 07:56:14,640][100917] Updated weights for policy 1, policy_version 68462 (0.0010) +[2023-10-14 07:56:15,013][100917] Updated weights for policy 1, policy_version 68472 (0.0009) +[2023-10-14 07:56:17,872][100936] Updated weights for policy 0, policy_version 68360 (0.0009) +[2023-10-14 07:56:18,260][100936] Updated weights for policy 0, policy_version 68370 (0.0007) +[2023-10-14 07:56:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 140115968. Throughput: 0: 1650.5, 1: 1661.0. Samples: 35043484. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 07:56:18,512][99942] Avg episode reward: [(0, '0.690'), (1, '0.830')] +[2023-10-14 07:56:18,632][100936] Updated weights for policy 0, policy_version 68380 (0.0008) +[2023-10-14 07:56:18,936][100917] Updated weights for policy 1, policy_version 68482 (0.0009) +[2023-10-14 07:56:19,306][100917] Updated weights for policy 1, policy_version 68492 (0.0010) +[2023-10-14 07:56:19,683][100917] Updated weights for policy 1, policy_version 68502 (0.0007) +[2023-10-14 07:56:20,069][100917] Updated weights for policy 1, policy_version 68512 (0.0007) +[2023-10-14 07:56:22,746][100936] Updated weights for policy 0, policy_version 68390 (0.0008) +[2023-10-14 07:56:23,116][100936] Updated weights for policy 0, policy_version 68400 (0.0008) +[2023-10-14 07:56:23,488][100936] Updated weights for policy 0, policy_version 68410 (0.0010) +[2023-10-14 07:56:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 140181504. Throughput: 0: 1643.3, 1: 1658.1. Samples: 35062804. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) +[2023-10-14 07:56:23,512][99942] Avg episode reward: [(0, '0.690'), (1, '0.830')] +[2023-10-14 07:56:24,261][100917] Updated weights for policy 1, policy_version 68522 (0.0008) +[2023-10-14 07:56:24,636][100917] Updated weights for policy 1, policy_version 68532 (0.0007) +[2023-10-14 07:56:24,998][100917] Updated weights for policy 1, policy_version 68542 (0.0011) +[2023-10-14 07:56:27,704][100936] Updated weights for policy 0, policy_version 68420 (0.0010) +[2023-10-14 07:56:28,072][100936] Updated weights for policy 0, policy_version 68430 (0.0008) +[2023-10-14 07:56:28,446][100936] Updated weights for policy 0, policy_version 68440 (0.0007) +[2023-10-14 07:56:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 140247040. Throughput: 0: 1651.0, 1: 1654.3. Samples: 35072662. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:56:28,512][99942] Avg episode reward: [(0, '0.690'), (1, '0.830')] +[2023-10-14 07:56:29,000][100917] Updated weights for policy 1, policy_version 68552 (0.0008) +[2023-10-14 07:56:29,366][100917] Updated weights for policy 1, policy_version 68562 (0.0007) +[2023-10-14 07:56:29,741][100917] Updated weights for policy 1, policy_version 68572 (0.0008) +[2023-10-14 07:56:32,640][100936] Updated weights for policy 0, policy_version 68450 (0.0009) +[2023-10-14 07:56:33,006][100936] Updated weights for policy 0, policy_version 68460 (0.0010) +[2023-10-14 07:56:33,376][100936] Updated weights for policy 0, policy_version 68470 (0.0008) +[2023-10-14 07:56:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 140312576. Throughput: 0: 1650.3, 1: 1661.1. Samples: 35093226. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:56:33,512][99942] Avg episode reward: [(0, '0.690'), (1, '0.830')] +[2023-10-14 07:56:33,730][100936] Updated weights for policy 0, policy_version 68480 (0.0007) +[2023-10-14 07:56:33,885][100917] Updated weights for policy 1, policy_version 68582 (0.0009) +[2023-10-14 07:56:34,277][100917] Updated weights for policy 1, policy_version 68592 (0.0008) +[2023-10-14 07:56:34,648][100917] Updated weights for policy 1, policy_version 68602 (0.0007) +[2023-10-14 07:56:37,839][100936] Updated weights for policy 0, policy_version 68490 (0.0008) +[2023-10-14 07:56:38,200][100936] Updated weights for policy 0, policy_version 68500 (0.0011) +[2023-10-14 07:56:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 140378112. Throughput: 0: 1646.4, 1: 1658.7. Samples: 35112522. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:56:38,513][99942] Avg episode reward: [(0, '0.690'), (1, '0.830')] +[2023-10-14 07:56:38,573][100936] Updated weights for policy 0, policy_version 68510 (0.0009) +[2023-10-14 07:56:38,751][100917] Updated weights for policy 1, policy_version 68612 (0.0009) +[2023-10-14 07:56:39,117][100917] Updated weights for policy 1, policy_version 68622 (0.0009) +[2023-10-14 07:56:39,486][100917] Updated weights for policy 1, policy_version 68632 (0.0009) +[2023-10-14 07:56:42,773][100936] Updated weights for policy 0, policy_version 68520 (0.0007) +[2023-10-14 07:56:43,142][100936] Updated weights for policy 0, policy_version 68530 (0.0010) +[2023-10-14 07:56:43,512][100936] Updated weights for policy 0, policy_version 68540 (0.0008) +[2023-10-14 07:56:43,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 140443648. Throughput: 0: 1650.6, 1: 1654.6. Samples: 35122286. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:56:43,513][99942] Avg episode reward: [(0, '0.690'), (1, '0.830')] +[2023-10-14 07:56:43,595][100917] Updated weights for policy 1, policy_version 68642 (0.0009) +[2023-10-14 07:56:43,969][100917] Updated weights for policy 1, policy_version 68652 (0.0007) +[2023-10-14 07:56:44,341][100917] Updated weights for policy 1, policy_version 68662 (0.0007) +[2023-10-14 07:56:44,712][100917] Updated weights for policy 1, policy_version 68672 (0.0008) +[2023-10-14 07:56:47,679][100936] Updated weights for policy 0, policy_version 68550 (0.0008) +[2023-10-14 07:56:48,047][100936] Updated weights for policy 0, policy_version 68560 (0.0009) +[2023-10-14 07:56:48,417][100936] Updated weights for policy 0, policy_version 68570 (0.0009) +[2023-10-14 07:56:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 140509184. Throughput: 0: 1650.9, 1: 1655.0. Samples: 35142838. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:56:48,512][99942] Avg episode reward: [(0, '0.690'), (1, '0.830')] +[2023-10-14 07:56:48,875][100917] Updated weights for policy 1, policy_version 68682 (0.0008) +[2023-10-14 07:56:49,246][100917] Updated weights for policy 1, policy_version 68692 (0.0009) +[2023-10-14 07:56:49,617][100917] Updated weights for policy 1, policy_version 68702 (0.0010) +[2023-10-14 07:56:52,535][100936] Updated weights for policy 0, policy_version 68580 (0.0007) +[2023-10-14 07:56:52,902][100936] Updated weights for policy 0, policy_version 68590 (0.0007) +[2023-10-14 07:56:53,272][100936] Updated weights for policy 0, policy_version 68600 (0.0007) +[2023-10-14 07:56:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 140574720. Throughput: 0: 1651.3, 1: 1660.4. Samples: 35162348. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:56:53,512][99942] Avg episode reward: [(0, '0.690'), (1, '0.830')] +[2023-10-14 07:56:53,767][100917] Updated weights for policy 1, policy_version 68712 (0.0008) +[2023-10-14 07:56:54,138][100917] Updated weights for policy 1, policy_version 68722 (0.0009) +[2023-10-14 07:56:54,497][100917] Updated weights for policy 1, policy_version 68732 (0.0009) +[2023-10-14 07:56:57,273][100936] Updated weights for policy 0, policy_version 68610 (0.0009) +[2023-10-14 07:56:57,643][100936] Updated weights for policy 0, policy_version 68620 (0.0009) +[2023-10-14 07:56:58,013][100936] Updated weights for policy 0, policy_version 68630 (0.0010) +[2023-10-14 07:56:58,384][100936] Updated weights for policy 0, policy_version 68640 (0.0008) +[2023-10-14 07:56:58,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 140673024. Throughput: 0: 1655.6, 1: 1657.7. Samples: 35172338. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:56:58,512][99942] Avg episode reward: [(0, '0.690'), (1, '0.830')] +[2023-10-14 07:56:58,685][100917] Updated weights for policy 1, policy_version 68742 (0.0008) +[2023-10-14 07:56:59,057][100917] Updated weights for policy 1, policy_version 68752 (0.0007) +[2023-10-14 07:56:59,426][100917] Updated weights for policy 1, policy_version 68762 (0.0008) +[2023-10-14 07:57:02,617][100936] Updated weights for policy 0, policy_version 68650 (0.0011) +[2023-10-14 07:57:03,000][100936] Updated weights for policy 0, policy_version 68660 (0.0010) +[2023-10-14 07:57:03,374][100936] Updated weights for policy 0, policy_version 68670 (0.0009) +[2023-10-14 07:57:03,432][100917] Updated weights for policy 1, policy_version 68772 (0.0008) +[2023-10-14 07:57:03,512][99942] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 140738560. Throughput: 0: 1650.5, 1: 1660.2. Samples: 35192464. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:57:03,513][99942] Avg episode reward: [(0, '0.690'), (1, '0.830')] +[2023-10-14 07:57:03,799][100917] Updated weights for policy 1, policy_version 68782 (0.0008) +[2023-10-14 07:57:04,174][100917] Updated weights for policy 1, policy_version 68792 (0.0007) +[2023-10-14 07:57:07,508][100936] Updated weights for policy 0, policy_version 68680 (0.0010) +[2023-10-14 07:57:07,891][100936] Updated weights for policy 0, policy_version 68690 (0.0009) +[2023-10-14 07:57:08,257][100936] Updated weights for policy 0, policy_version 68700 (0.0007) +[2023-10-14 07:57:08,287][100917] Updated weights for policy 1, policy_version 68802 (0.0008) +[2023-10-14 07:57:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 140804096. Throughput: 0: 1649.0, 1: 1668.3. Samples: 35212086. Policy #0 lag: (min: 29.0, avg: 39.7, max: 40.0) +[2023-10-14 07:57:08,513][99942] Avg episode reward: [(0, '0.690'), (1, '0.830')] +[2023-10-14 07:57:08,663][100917] Updated weights for policy 1, policy_version 68812 (0.0008) +[2023-10-14 07:57:09,039][100917] Updated weights for policy 1, policy_version 68822 (0.0009) +[2023-10-14 07:57:09,417][100917] Updated weights for policy 1, policy_version 68832 (0.0010) +[2023-10-14 07:57:12,465][100936] Updated weights for policy 0, policy_version 68710 (0.0009) +[2023-10-14 07:57:12,826][100936] Updated weights for policy 0, policy_version 68720 (0.0010) +[2023-10-14 07:57:13,198][100936] Updated weights for policy 0, policy_version 68730 (0.0007) +[2023-10-14 07:57:13,422][100917] Updated weights for policy 1, policy_version 68842 (0.0008) +[2023-10-14 07:57:13,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 140869632. Throughput: 0: 1657.8, 1: 1668.7. Samples: 35222354. Policy #0 lag: (min: 29.0, avg: 39.7, max: 40.0) +[2023-10-14 07:57:13,512][99942] Avg episode reward: [(0, '0.690'), (1, '0.830')] +[2023-10-14 07:57:13,799][100917] Updated weights for policy 1, policy_version 68852 (0.0008) +[2023-10-14 07:57:14,177][100917] Updated weights for policy 1, policy_version 68862 (0.0010) +[2023-10-14 07:57:17,290][100936] Updated weights for policy 0, policy_version 68740 (0.0008) +[2023-10-14 07:57:17,664][100936] Updated weights for policy 0, policy_version 68750 (0.0009) +[2023-10-14 07:57:18,036][100936] Updated weights for policy 0, policy_version 68760 (0.0007) +[2023-10-14 07:57:18,448][100917] Updated weights for policy 1, policy_version 68872 (0.0008) +[2023-10-14 07:57:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 140935168. Throughput: 0: 1650.5, 1: 1661.9. Samples: 35242284. Policy #0 lag: (min: 29.0, avg: 39.7, max: 40.0) +[2023-10-14 07:57:18,513][99942] Avg episode reward: [(0, '0.690'), (1, '0.920')] +[2023-10-14 07:57:18,826][100917] Updated weights for policy 1, policy_version 68882 (0.0007) +[2023-10-14 07:57:19,203][100917] Updated weights for policy 1, policy_version 68892 (0.0009) +[2023-10-14 07:57:22,234][100936] Updated weights for policy 0, policy_version 68770 (0.0008) +[2023-10-14 07:57:22,597][100936] Updated weights for policy 0, policy_version 68780 (0.0009) +[2023-10-14 07:57:22,965][100936] Updated weights for policy 0, policy_version 68790 (0.0010) +[2023-10-14 07:57:23,299][100917] Updated weights for policy 1, policy_version 68902 (0.0008) +[2023-10-14 07:57:23,328][100936] Updated weights for policy 0, policy_version 68800 (0.0008) +[2023-10-14 07:57:23,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 141000704. Throughput: 0: 1647.7, 1: 1661.9. Samples: 35261458. Policy #0 lag: (min: 29.0, avg: 39.7, max: 40.0) +[2023-10-14 07:57:23,513][99942] Avg episode reward: [(0, '0.690'), (1, '0.920')] +[2023-10-14 07:57:23,683][100917] Updated weights for policy 1, policy_version 68912 (0.0008) +[2023-10-14 07:57:24,046][100917] Updated weights for policy 1, policy_version 68922 (0.0010) +[2023-10-14 07:57:27,480][100936] Updated weights for policy 0, policy_version 68810 (0.0008) +[2023-10-14 07:57:27,850][100936] Updated weights for policy 0, policy_version 68820 (0.0007) +[2023-10-14 07:57:28,172][100917] Updated weights for policy 1, policy_version 68932 (0.0008) +[2023-10-14 07:57:28,215][100936] Updated weights for policy 0, policy_version 68830 (0.0009) +[2023-10-14 07:57:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 141066240. Throughput: 0: 1655.0, 1: 1662.3. Samples: 35271564. Policy #0 lag: (min: 29.0, avg: 39.7, max: 40.0) +[2023-10-14 07:57:28,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 07:57:28,544][100917] Updated weights for policy 1, policy_version 68942 (0.0009) +[2023-10-14 07:57:28,925][100917] Updated weights for policy 1, policy_version 68952 (0.0007) +[2023-10-14 07:57:32,191][100936] Updated weights for policy 0, policy_version 68840 (0.0010) +[2023-10-14 07:57:32,552][100936] Updated weights for policy 0, policy_version 68850 (0.0010) +[2023-10-14 07:57:32,914][100917] Updated weights for policy 1, policy_version 68962 (0.0007) +[2023-10-14 07:57:32,923][100936] Updated weights for policy 0, policy_version 68860 (0.0008) +[2023-10-14 07:57:33,287][100917] Updated weights for policy 1, policy_version 68972 (0.0008) +[2023-10-14 07:57:33,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 141131776. Throughput: 0: 1643.8, 1: 1661.6. Samples: 35291582. Policy #0 lag: (min: 29.0, avg: 39.7, max: 40.0) +[2023-10-14 07:57:33,512][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 07:57:33,667][100917] Updated weights for policy 1, policy_version 68982 (0.0009) +[2023-10-14 07:57:34,027][100917] Updated weights for policy 1, policy_version 68992 (0.0010) +[2023-10-14 07:57:36,995][100936] Updated weights for policy 0, policy_version 68870 (0.0009) +[2023-10-14 07:57:37,368][100936] Updated weights for policy 0, policy_version 68880 (0.0007) +[2023-10-14 07:57:37,731][100936] Updated weights for policy 0, policy_version 68890 (0.0008) +[2023-10-14 07:57:38,216][100917] Updated weights for policy 1, policy_version 69002 (0.0008) +[2023-10-14 07:57:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 141197312. Throughput: 0: 1647.1, 1: 1657.7. Samples: 35311064. Policy #0 lag: (min: 29.0, avg: 39.7, max: 40.0) +[2023-10-14 07:57:38,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 07:57:38,523][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000068896_70549504.pth... +[2023-10-14 07:57:38,556][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000067328_68943872.pth +[2023-10-14 07:57:38,592][100917] Updated weights for policy 1, policy_version 69012 (0.0011) +[2023-10-14 07:57:38,959][100917] Updated weights for policy 1, policy_version 69022 (0.0008) +[2023-10-14 07:57:39,033][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000069024_70680576.pth... +[2023-10-14 07:57:39,071][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000067456_69074944.pth +[2023-10-14 07:57:41,783][100936] Updated weights for policy 0, policy_version 68900 (0.0008) +[2023-10-14 07:57:42,158][100936] Updated weights for policy 0, policy_version 68910 (0.0010) +[2023-10-14 07:57:42,522][100936] Updated weights for policy 0, policy_version 68920 (0.0010) +[2023-10-14 07:57:43,123][100917] Updated weights for policy 1, policy_version 69032 (0.0008) +[2023-10-14 07:57:43,500][100917] Updated weights for policy 1, policy_version 69042 (0.0007) +[2023-10-14 07:57:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 141262848. Throughput: 0: 1653.9, 1: 1661.3. Samples: 35321522. Policy #0 lag: (min: 29.0, avg: 39.7, max: 40.0) +[2023-10-14 07:57:43,512][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 07:57:43,874][100917] Updated weights for policy 1, policy_version 69052 (0.0010) +[2023-10-14 07:57:46,589][100936] Updated weights for policy 0, policy_version 68930 (0.0008) +[2023-10-14 07:57:46,957][100936] Updated weights for policy 0, policy_version 68940 (0.0008) +[2023-10-14 07:57:47,326][100936] Updated weights for policy 0, policy_version 68950 (0.0009) +[2023-10-14 07:57:47,705][100936] Updated weights for policy 0, policy_version 68960 (0.0010) +[2023-10-14 07:57:48,103][100917] Updated weights for policy 1, policy_version 69062 (0.0009) +[2023-10-14 07:57:48,475][100917] Updated weights for policy 1, policy_version 69072 (0.0010) +[2023-10-14 07:57:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 141328384. Throughput: 0: 1643.7, 1: 1661.4. Samples: 35341192. Policy #0 lag: (min: 29.0, avg: 39.7, max: 40.0) +[2023-10-14 07:57:48,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 07:57:48,848][100917] Updated weights for policy 1, policy_version 69082 (0.0009) +[2023-10-14 07:57:51,892][100936] Updated weights for policy 0, policy_version 68970 (0.0009) +[2023-10-14 07:57:52,270][100936] Updated weights for policy 0, policy_version 68980 (0.0008) +[2023-10-14 07:57:52,638][100936] Updated weights for policy 0, policy_version 68990 (0.0008) +[2023-10-14 07:57:52,915][100917] Updated weights for policy 1, policy_version 69092 (0.0009) +[2023-10-14 07:57:53,290][100917] Updated weights for policy 1, policy_version 69102 (0.0009) +[2023-10-14 07:57:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 141393920. Throughput: 0: 1658.4, 1: 1650.0. Samples: 35360966. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:57:53,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 07:57:53,675][100917] Updated weights for policy 1, policy_version 69112 (0.0010) +[2023-10-14 07:57:56,566][100936] Updated weights for policy 0, policy_version 69000 (0.0007) +[2023-10-14 07:57:56,932][100936] Updated weights for policy 0, policy_version 69010 (0.0009) +[2023-10-14 07:57:57,300][100936] Updated weights for policy 0, policy_version 69020 (0.0008) +[2023-10-14 07:57:57,678][100917] Updated weights for policy 1, policy_version 69122 (0.0009) +[2023-10-14 07:57:58,048][100917] Updated weights for policy 1, policy_version 69132 (0.0010) +[2023-10-14 07:57:58,418][100917] Updated weights for policy 1, policy_version 69142 (0.0011) +[2023-10-14 07:57:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 141459456. Throughput: 0: 1659.5, 1: 1653.7. Samples: 35371446. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:57:58,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 07:57:58,798][100917] Updated weights for policy 1, policy_version 69152 (0.0010) +[2023-10-14 07:58:01,526][100936] Updated weights for policy 0, policy_version 69030 (0.0008) +[2023-10-14 07:58:01,896][100936] Updated weights for policy 0, policy_version 69040 (0.0010) +[2023-10-14 07:58:02,264][100936] Updated weights for policy 0, policy_version 69050 (0.0010) +[2023-10-14 07:58:02,936][100917] Updated weights for policy 1, policy_version 69162 (0.0008) +[2023-10-14 07:58:03,317][100917] Updated weights for policy 1, policy_version 69172 (0.0007) +[2023-10-14 07:58:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 141524992. Throughput: 0: 1645.6, 1: 1657.9. Samples: 35390942. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:58:03,512][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 07:58:03,681][100917] Updated weights for policy 1, policy_version 69182 (0.0008) +[2023-10-14 07:58:06,542][100936] Updated weights for policy 0, policy_version 69060 (0.0010) +[2023-10-14 07:58:06,913][100936] Updated weights for policy 0, policy_version 69070 (0.0009) +[2023-10-14 07:58:07,276][100936] Updated weights for policy 0, policy_version 69080 (0.0011) +[2023-10-14 07:58:08,024][100917] Updated weights for policy 1, policy_version 69192 (0.0010) +[2023-10-14 07:58:08,404][100917] Updated weights for policy 1, policy_version 69202 (0.0009) +[2023-10-14 07:58:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 141590528. Throughput: 0: 1665.7, 1: 1650.0. Samples: 35410662. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:58:08,512][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 07:58:08,780][100917] Updated weights for policy 1, policy_version 69212 (0.0010) +[2023-10-14 07:58:11,464][100936] Updated weights for policy 0, policy_version 69090 (0.0011) +[2023-10-14 07:58:11,834][100936] Updated weights for policy 0, policy_version 69100 (0.0008) +[2023-10-14 07:58:12,203][100936] Updated weights for policy 0, policy_version 69110 (0.0007) +[2023-10-14 07:58:12,573][100936] Updated weights for policy 0, policy_version 69120 (0.0008) +[2023-10-14 07:58:12,761][100917] Updated weights for policy 1, policy_version 69222 (0.0009) +[2023-10-14 07:58:13,137][100917] Updated weights for policy 1, policy_version 69232 (0.0010) +[2023-10-14 07:58:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 141656064. Throughput: 0: 1664.4, 1: 1662.0. Samples: 35421248. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:58:13,512][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 07:58:13,522][100917] Updated weights for policy 1, policy_version 69242 (0.0009) +[2023-10-14 07:58:16,717][100936] Updated weights for policy 0, policy_version 69130 (0.0009) +[2023-10-14 07:58:17,081][100936] Updated weights for policy 0, policy_version 69140 (0.0008) +[2023-10-14 07:58:17,419][100917] Updated weights for policy 1, policy_version 69252 (0.0009) +[2023-10-14 07:58:17,450][100936] Updated weights for policy 0, policy_version 69150 (0.0008) +[2023-10-14 07:58:17,788][100917] Updated weights for policy 1, policy_version 69262 (0.0009) +[2023-10-14 07:58:18,159][100917] Updated weights for policy 1, policy_version 69272 (0.0009) +[2023-10-14 07:58:18,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 141754368. Throughput: 0: 1651.2, 1: 1667.4. Samples: 35440916. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:58:18,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 07:58:21,751][100936] Updated weights for policy 0, policy_version 69160 (0.0007) +[2023-10-14 07:58:22,119][100936] Updated weights for policy 0, policy_version 69170 (0.0009) +[2023-10-14 07:58:22,280][100917] Updated weights for policy 1, policy_version 69282 (0.0009) +[2023-10-14 07:58:22,493][100936] Updated weights for policy 0, policy_version 69180 (0.0008) +[2023-10-14 07:58:22,650][100917] Updated weights for policy 1, policy_version 69292 (0.0008) +[2023-10-14 07:58:23,020][100917] Updated weights for policy 1, policy_version 69302 (0.0009) +[2023-10-14 07:58:23,396][100917] Updated weights for policy 1, policy_version 69312 (0.0008) +[2023-10-14 07:58:23,512][99942] Fps is (10 sec: 16383.5, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 141819904. Throughput: 0: 1664.4, 1: 1652.0. Samples: 35460306. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:58:23,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 07:58:26,624][100936] Updated weights for policy 0, policy_version 69190 (0.0008) +[2023-10-14 07:58:27,004][100936] Updated weights for policy 0, policy_version 69200 (0.0009) +[2023-10-14 07:58:27,369][100936] Updated weights for policy 0, policy_version 69210 (0.0009) +[2023-10-14 07:58:27,580][100917] Updated weights for policy 1, policy_version 69322 (0.0008) +[2023-10-14 07:58:27,955][100917] Updated weights for policy 1, policy_version 69332 (0.0010) +[2023-10-14 07:58:28,325][100917] Updated weights for policy 1, policy_version 69342 (0.0008) +[2023-10-14 07:58:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 141885440. Throughput: 0: 1659.3, 1: 1663.7. Samples: 35471060. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:58:28,512][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 07:58:31,269][100936] Updated weights for policy 0, policy_version 69220 (0.0009) +[2023-10-14 07:58:31,632][100936] Updated weights for policy 0, policy_version 69230 (0.0009) +[2023-10-14 07:58:32,003][100936] Updated weights for policy 0, policy_version 69240 (0.0007) +[2023-10-14 07:58:32,512][100917] Updated weights for policy 1, policy_version 69352 (0.0008) +[2023-10-14 07:58:32,875][100917] Updated weights for policy 1, policy_version 69362 (0.0008) +[2023-10-14 07:58:33,259][100917] Updated weights for policy 1, policy_version 69372 (0.0008) +[2023-10-14 07:58:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 141950976. Throughput: 0: 1652.4, 1: 1663.4. Samples: 35490404. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:58:33,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 07:58:35,996][100936] Updated weights for policy 0, policy_version 69250 (0.0007) +[2023-10-14 07:58:36,371][100936] Updated weights for policy 0, policy_version 69260 (0.0009) +[2023-10-14 07:58:36,737][100936] Updated weights for policy 0, policy_version 69270 (0.0009) +[2023-10-14 07:58:37,108][100936] Updated weights for policy 0, policy_version 69280 (0.0009) +[2023-10-14 07:58:37,627][100917] Updated weights for policy 1, policy_version 69382 (0.0008) +[2023-10-14 07:58:37,996][100917] Updated weights for policy 1, policy_version 69392 (0.0008) +[2023-10-14 07:58:38,352][100917] Updated weights for policy 1, policy_version 69402 (0.0009) +[2023-10-14 07:58:38,512][99942] Fps is (10 sec: 9830.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 141983744. Throughput: 0: 1664.3, 1: 1653.7. Samples: 35510274. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:58:38,513][99942] Avg episode reward: [(0, '0.870'), (1, '1.000')] +[2023-10-14 07:58:41,341][100936] Updated weights for policy 0, policy_version 69290 (0.0007) +[2023-10-14 07:58:41,707][100936] Updated weights for policy 0, policy_version 69300 (0.0010) +[2023-10-14 07:58:42,092][100936] Updated weights for policy 0, policy_version 69310 (0.0008) +[2023-10-14 07:58:42,517][100917] Updated weights for policy 1, policy_version 69412 (0.0010) +[2023-10-14 07:58:42,894][100917] Updated weights for policy 1, policy_version 69422 (0.0009) +[2023-10-14 07:58:43,259][100917] Updated weights for policy 1, policy_version 69432 (0.0009) +[2023-10-14 07:58:43,512][99942] Fps is (10 sec: 9830.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 142049280. Throughput: 0: 1651.2, 1: 1661.6. Samples: 35520520. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:58:43,512][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 07:58:46,150][100936] Updated weights for policy 0, policy_version 69320 (0.0010) +[2023-10-14 07:58:46,521][100936] Updated weights for policy 0, policy_version 69330 (0.0009) +[2023-10-14 07:58:46,880][100936] Updated weights for policy 0, policy_version 69340 (0.0010) +[2023-10-14 07:58:47,398][100917] Updated weights for policy 1, policy_version 69442 (0.0010) +[2023-10-14 07:58:47,768][100917] Updated weights for policy 1, policy_version 69452 (0.0009) +[2023-10-14 07:58:48,147][100917] Updated weights for policy 1, policy_version 69462 (0.0010) +[2023-10-14 07:58:48,511][100917] Updated weights for policy 1, policy_version 69472 (0.0010) +[2023-10-14 07:58:48,512][99942] Fps is (10 sec: 16384.5, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 142147584. Throughput: 0: 1659.5, 1: 1659.8. Samples: 35540310. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:58:48,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 07:58:50,984][100936] Updated weights for policy 0, policy_version 69350 (0.0009) +[2023-10-14 07:58:51,346][100936] Updated weights for policy 0, policy_version 69360 (0.0010) +[2023-10-14 07:58:51,714][100936] Updated weights for policy 0, policy_version 69370 (0.0009) +[2023-10-14 07:58:52,810][100917] Updated weights for policy 1, policy_version 69482 (0.0011) +[2023-10-14 07:58:53,180][100917] Updated weights for policy 1, policy_version 69492 (0.0011) +[2023-10-14 07:58:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 142180352. Throughput: 0: 1664.4, 1: 1654.5. Samples: 35560016. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:58:53,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 07:58:53,550][100917] Updated weights for policy 1, policy_version 69502 (0.0011) +[2023-10-14 07:58:55,752][100936] Updated weights for policy 0, policy_version 69380 (0.0007) +[2023-10-14 07:58:56,122][100936] Updated weights for policy 0, policy_version 69390 (0.0008) +[2023-10-14 07:58:56,490][100936] Updated weights for policy 0, policy_version 69400 (0.0010) +[2023-10-14 07:58:57,611][100917] Updated weights for policy 1, policy_version 69512 (0.0007) +[2023-10-14 07:58:57,981][100917] Updated weights for policy 1, policy_version 69522 (0.0009) +[2023-10-14 07:58:58,351][100917] Updated weights for policy 1, policy_version 69532 (0.0009) +[2023-10-14 07:58:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 142278656. Throughput: 0: 1647.9, 1: 1658.3. Samples: 35570024. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:58:58,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 07:59:00,700][100936] Updated weights for policy 0, policy_version 69410 (0.0010) +[2023-10-14 07:59:01,067][100936] Updated weights for policy 0, policy_version 69420 (0.0008) +[2023-10-14 07:59:01,438][100936] Updated weights for policy 0, policy_version 69430 (0.0011) +[2023-10-14 07:59:01,812][100936] Updated weights for policy 0, policy_version 69440 (0.0011) +[2023-10-14 07:59:02,343][100917] Updated weights for policy 1, policy_version 69542 (0.0007) +[2023-10-14 07:59:02,714][100917] Updated weights for policy 1, policy_version 69552 (0.0010) +[2023-10-14 07:59:03,084][100917] Updated weights for policy 1, policy_version 69562 (0.0009) +[2023-10-14 07:59:03,512][99942] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 142344192. Throughput: 0: 1662.2, 1: 1655.8. Samples: 35590226. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:59:03,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 07:59:05,797][100936] Updated weights for policy 0, policy_version 69450 (0.0008) +[2023-10-14 07:59:06,164][100936] Updated weights for policy 0, policy_version 69460 (0.0008) +[2023-10-14 07:59:06,540][100936] Updated weights for policy 0, policy_version 69470 (0.0008) +[2023-10-14 07:59:07,275][100917] Updated weights for policy 1, policy_version 69572 (0.0010) +[2023-10-14 07:59:07,657][100917] Updated weights for policy 1, policy_version 69582 (0.0010) +[2023-10-14 07:59:08,033][100917] Updated weights for policy 1, policy_version 69592 (0.0010) +[2023-10-14 07:59:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 142409728. Throughput: 0: 1668.3, 1: 1651.2. Samples: 35609684. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:59:08,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 07:59:10,880][100936] Updated weights for policy 0, policy_version 69480 (0.0008) +[2023-10-14 07:59:11,239][100936] Updated weights for policy 0, policy_version 69490 (0.0009) +[2023-10-14 07:59:11,606][100936] Updated weights for policy 0, policy_version 69500 (0.0009) +[2023-10-14 07:59:12,009][100917] Updated weights for policy 1, policy_version 69602 (0.0010) +[2023-10-14 07:59:12,385][100917] Updated weights for policy 1, policy_version 69612 (0.0010) +[2023-10-14 07:59:12,766][100917] Updated weights for policy 1, policy_version 69622 (0.0010) +[2023-10-14 07:59:13,133][100917] Updated weights for policy 1, policy_version 69632 (0.0011) +[2023-10-14 07:59:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 142475264. Throughput: 0: 1653.7, 1: 1655.0. Samples: 35619950. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:59:13,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 07:59:15,684][100936] Updated weights for policy 0, policy_version 69510 (0.0010) +[2023-10-14 07:59:16,048][100936] Updated weights for policy 0, policy_version 69520 (0.0009) +[2023-10-14 07:59:16,423][100936] Updated weights for policy 0, policy_version 69530 (0.0009) +[2023-10-14 07:59:17,369][100917] Updated weights for policy 1, policy_version 69642 (0.0008) +[2023-10-14 07:59:17,739][100917] Updated weights for policy 1, policy_version 69652 (0.0007) +[2023-10-14 07:59:18,109][100917] Updated weights for policy 1, policy_version 69662 (0.0009) +[2023-10-14 07:59:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 142540800. Throughput: 0: 1667.6, 1: 1653.2. Samples: 35639838. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 07:59:18,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 07:59:20,672][100936] Updated weights for policy 0, policy_version 69540 (0.0008) +[2023-10-14 07:59:21,033][100936] Updated weights for policy 0, policy_version 69550 (0.0008) +[2023-10-14 07:59:21,414][100936] Updated weights for policy 0, policy_version 69560 (0.0010) +[2023-10-14 07:59:22,235][100917] Updated weights for policy 1, policy_version 69672 (0.0010) +[2023-10-14 07:59:22,606][100917] Updated weights for policy 1, policy_version 69682 (0.0010) +[2023-10-14 07:59:22,974][100917] Updated weights for policy 1, policy_version 69692 (0.0009) +[2023-10-14 07:59:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 142606336. Throughput: 0: 1669.1, 1: 1642.0. Samples: 35659272. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:59:23,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 07:59:25,468][100936] Updated weights for policy 0, policy_version 69570 (0.0007) +[2023-10-14 07:59:25,887][100936] Updated weights for policy 0, policy_version 69580 (0.0008) +[2023-10-14 07:59:26,262][100936] Updated weights for policy 0, policy_version 69590 (0.0008) +[2023-10-14 07:59:26,627][100936] Updated weights for policy 0, policy_version 69600 (0.0009) +[2023-10-14 07:59:27,137][100917] Updated weights for policy 1, policy_version 69702 (0.0008) +[2023-10-14 07:59:27,527][100917] Updated weights for policy 1, policy_version 69712 (0.0008) +[2023-10-14 07:59:27,903][100917] Updated weights for policy 1, policy_version 69722 (0.0007) +[2023-10-14 07:59:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 142671872. Throughput: 0: 1654.8, 1: 1652.2. Samples: 35669336. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:59:28,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 07:59:30,716][100936] Updated weights for policy 0, policy_version 69610 (0.0007) +[2023-10-14 07:59:31,097][100936] Updated weights for policy 0, policy_version 69620 (0.0008) +[2023-10-14 07:59:31,455][100936] Updated weights for policy 0, policy_version 69630 (0.0009) +[2023-10-14 07:59:32,006][100917] Updated weights for policy 1, policy_version 69732 (0.0008) +[2023-10-14 07:59:32,389][100917] Updated weights for policy 1, policy_version 69742 (0.0007) +[2023-10-14 07:59:32,756][100917] Updated weights for policy 1, policy_version 69752 (0.0009) +[2023-10-14 07:59:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 142737408. Throughput: 0: 1662.0, 1: 1652.4. Samples: 35689460. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:59:33,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 07:59:35,625][100936] Updated weights for policy 0, policy_version 69640 (0.0007) +[2023-10-14 07:59:35,994][100936] Updated weights for policy 0, policy_version 69650 (0.0008) +[2023-10-14 07:59:36,370][100936] Updated weights for policy 0, policy_version 69660 (0.0011) +[2023-10-14 07:59:36,869][100917] Updated weights for policy 1, policy_version 69762 (0.0008) +[2023-10-14 07:59:37,243][100917] Updated weights for policy 1, policy_version 69772 (0.0010) +[2023-10-14 07:59:37,620][100917] Updated weights for policy 1, policy_version 69782 (0.0011) +[2023-10-14 07:59:37,995][100917] Updated weights for policy 1, policy_version 69792 (0.0009) +[2023-10-14 07:59:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 142802944. Throughput: 0: 1662.8, 1: 1643.7. Samples: 35708808. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:59:38,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 07:59:38,521][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000069664_71335936.pth... +[2023-10-14 07:59:38,522][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000069792_71467008.pth... +[2023-10-14 07:59:38,551][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000068128_69763072.pth +[2023-10-14 07:59:38,555][100560] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p0/milestones/checkpoint_000069664_71335936.pth +[2023-10-14 07:59:38,562][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000068224_69861376.pth +[2023-10-14 07:59:38,568][100681] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p1/milestones/checkpoint_000069792_71467008.pth +[2023-10-14 07:59:40,427][100936] Updated weights for policy 0, policy_version 69670 (0.0009) +[2023-10-14 07:59:40,801][100936] Updated weights for policy 0, policy_version 69680 (0.0011) +[2023-10-14 07:59:41,169][100936] Updated weights for policy 0, policy_version 69690 (0.0009) +[2023-10-14 07:59:42,218][100917] Updated weights for policy 1, policy_version 69802 (0.0008) +[2023-10-14 07:59:42,592][100917] Updated weights for policy 1, policy_version 69812 (0.0007) +[2023-10-14 07:59:42,957][100917] Updated weights for policy 1, policy_version 69822 (0.0007) +[2023-10-14 07:59:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 142868480. Throughput: 0: 1650.6, 1: 1656.2. Samples: 35718828. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:59:43,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 07:59:45,240][100936] Updated weights for policy 0, policy_version 69700 (0.0009) +[2023-10-14 07:59:45,613][100936] Updated weights for policy 0, policy_version 69710 (0.0007) +[2023-10-14 07:59:45,979][100936] Updated weights for policy 0, policy_version 69720 (0.0008) +[2023-10-14 07:59:47,040][100917] Updated weights for policy 1, policy_version 69832 (0.0009) +[2023-10-14 07:59:47,419][100917] Updated weights for policy 1, policy_version 69842 (0.0011) +[2023-10-14 07:59:47,796][100917] Updated weights for policy 1, policy_version 69852 (0.0008) +[2023-10-14 07:59:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 142934016. Throughput: 0: 1660.0, 1: 1644.0. Samples: 35738906. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:59:48,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 07:59:50,154][100936] Updated weights for policy 0, policy_version 69730 (0.0007) +[2023-10-14 07:59:50,513][100936] Updated weights for policy 0, policy_version 69740 (0.0009) +[2023-10-14 07:59:50,884][100936] Updated weights for policy 0, policy_version 69750 (0.0009) +[2023-10-14 07:59:51,253][100936] Updated weights for policy 0, policy_version 69760 (0.0008) +[2023-10-14 07:59:51,865][100917] Updated weights for policy 1, policy_version 69862 (0.0009) +[2023-10-14 07:59:52,242][100917] Updated weights for policy 1, policy_version 69872 (0.0011) +[2023-10-14 07:59:52,616][100917] Updated weights for policy 1, policy_version 69882 (0.0008) +[2023-10-14 07:59:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 142999552. Throughput: 0: 1663.9, 1: 1644.6. Samples: 35758564. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:59:53,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 07:59:55,336][100936] Updated weights for policy 0, policy_version 69770 (0.0009) +[2023-10-14 07:59:55,708][100936] Updated weights for policy 0, policy_version 69780 (0.0007) +[2023-10-14 07:59:56,072][100936] Updated weights for policy 0, policy_version 69790 (0.0009) +[2023-10-14 07:59:56,642][100917] Updated weights for policy 1, policy_version 69892 (0.0009) +[2023-10-14 07:59:57,025][100917] Updated weights for policy 1, policy_version 69902 (0.0008) +[2023-10-14 07:59:57,398][100917] Updated weights for policy 1, policy_version 69912 (0.0008) +[2023-10-14 07:59:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 143065088. Throughput: 0: 1654.0, 1: 1657.3. Samples: 35768956. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 07:59:58,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 08:00:00,164][100936] Updated weights for policy 0, policy_version 69800 (0.0008) +[2023-10-14 08:00:00,534][100936] Updated weights for policy 0, policy_version 69810 (0.0008) +[2023-10-14 08:00:00,898][100936] Updated weights for policy 0, policy_version 69820 (0.0008) +[2023-10-14 08:00:01,453][100917] Updated weights for policy 1, policy_version 69922 (0.0009) +[2023-10-14 08:00:01,828][100917] Updated weights for policy 1, policy_version 69932 (0.0009) +[2023-10-14 08:00:02,207][100917] Updated weights for policy 1, policy_version 69942 (0.0010) +[2023-10-14 08:00:02,584][100917] Updated weights for policy 1, policy_version 69952 (0.0007) +[2023-10-14 08:00:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 143130624. Throughput: 0: 1666.3, 1: 1651.4. Samples: 35789132. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:00:03,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 08:00:04,980][100936] Updated weights for policy 0, policy_version 69830 (0.0009) +[2023-10-14 08:00:05,340][100936] Updated weights for policy 0, policy_version 69840 (0.0008) +[2023-10-14 08:00:05,716][100936] Updated weights for policy 0, policy_version 69850 (0.0007) +[2023-10-14 08:00:06,621][100917] Updated weights for policy 1, policy_version 69962 (0.0009) +[2023-10-14 08:00:06,988][100917] Updated weights for policy 1, policy_version 69972 (0.0011) +[2023-10-14 08:00:07,360][100917] Updated weights for policy 1, policy_version 69982 (0.0007) +[2023-10-14 08:00:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 143196160. Throughput: 0: 1662.9, 1: 1660.5. Samples: 35808824. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:00:08,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 08:00:09,738][100936] Updated weights for policy 0, policy_version 69860 (0.0007) +[2023-10-14 08:00:10,110][100936] Updated weights for policy 0, policy_version 69870 (0.0009) +[2023-10-14 08:00:10,491][100936] Updated weights for policy 0, policy_version 69880 (0.0009) +[2023-10-14 08:00:11,625][100917] Updated weights for policy 1, policy_version 69992 (0.0008) +[2023-10-14 08:00:12,007][100917] Updated weights for policy 1, policy_version 70002 (0.0010) +[2023-10-14 08:00:12,392][100917] Updated weights for policy 1, policy_version 70012 (0.0010) +[2023-10-14 08:00:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 143261696. Throughput: 0: 1659.5, 1: 1667.7. Samples: 35819058. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:00:13,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 08:00:14,666][100936] Updated weights for policy 0, policy_version 69890 (0.0009) +[2023-10-14 08:00:15,041][100936] Updated weights for policy 0, policy_version 69900 (0.0008) +[2023-10-14 08:00:15,406][100936] Updated weights for policy 0, policy_version 69910 (0.0009) +[2023-10-14 08:00:15,776][100936] Updated weights for policy 0, policy_version 69920 (0.0008) +[2023-10-14 08:00:16,598][100917] Updated weights for policy 1, policy_version 70022 (0.0009) +[2023-10-14 08:00:16,971][100917] Updated weights for policy 1, policy_version 70032 (0.0009) +[2023-10-14 08:00:17,348][100917] Updated weights for policy 1, policy_version 70042 (0.0010) +[2023-10-14 08:00:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 143327232. Throughput: 0: 1668.7, 1: 1651.7. Samples: 35838880. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:00:18,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 08:00:19,913][100936] Updated weights for policy 0, policy_version 69930 (0.0008) +[2023-10-14 08:00:20,277][100936] Updated weights for policy 0, policy_version 69940 (0.0010) +[2023-10-14 08:00:20,649][100936] Updated weights for policy 0, policy_version 69950 (0.0010) +[2023-10-14 08:00:21,344][100917] Updated weights for policy 1, policy_version 70052 (0.0010) +[2023-10-14 08:00:21,714][100917] Updated weights for policy 1, policy_version 70062 (0.0009) +[2023-10-14 08:00:22,079][100917] Updated weights for policy 1, policy_version 70072 (0.0009) +[2023-10-14 08:00:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 143392768. Throughput: 0: 1669.2, 1: 1662.0. Samples: 35858712. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:00:23,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 08:00:24,837][100936] Updated weights for policy 0, policy_version 69960 (0.0011) +[2023-10-14 08:00:25,203][100936] Updated weights for policy 0, policy_version 69970 (0.0009) +[2023-10-14 08:00:25,571][100936] Updated weights for policy 0, policy_version 69980 (0.0007) +[2023-10-14 08:00:26,116][100917] Updated weights for policy 1, policy_version 70082 (0.0009) +[2023-10-14 08:00:26,498][100917] Updated weights for policy 1, policy_version 70092 (0.0010) +[2023-10-14 08:00:26,869][100917] Updated weights for policy 1, policy_version 70102 (0.0009) +[2023-10-14 08:00:27,241][100917] Updated weights for policy 1, policy_version 70112 (0.0007) +[2023-10-14 08:00:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 143458304. Throughput: 0: 1671.2, 1: 1670.4. Samples: 35869202. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:00:28,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 08:00:29,667][100936] Updated weights for policy 0, policy_version 69990 (0.0008) +[2023-10-14 08:00:30,034][100936] Updated weights for policy 0, policy_version 70000 (0.0011) +[2023-10-14 08:00:30,413][100936] Updated weights for policy 0, policy_version 70010 (0.0007) +[2023-10-14 08:00:31,343][100917] Updated weights for policy 1, policy_version 70122 (0.0011) +[2023-10-14 08:00:31,715][100917] Updated weights for policy 1, policy_version 70132 (0.0009) +[2023-10-14 08:00:32,093][100917] Updated weights for policy 1, policy_version 70142 (0.0010) +[2023-10-14 08:00:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 143523840. Throughput: 0: 1677.8, 1: 1655.2. Samples: 35888892. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:00:33,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 08:00:34,401][100936] Updated weights for policy 0, policy_version 70020 (0.0009) +[2023-10-14 08:00:34,762][100936] Updated weights for policy 0, policy_version 70030 (0.0008) +[2023-10-14 08:00:35,134][100936] Updated weights for policy 0, policy_version 70040 (0.0010) +[2023-10-14 08:00:36,241][100917] Updated weights for policy 1, policy_version 70152 (0.0009) +[2023-10-14 08:00:36,618][100917] Updated weights for policy 1, policy_version 70162 (0.0009) +[2023-10-14 08:00:36,988][100917] Updated weights for policy 1, policy_version 70172 (0.0007) +[2023-10-14 08:00:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 143589376. Throughput: 0: 1667.4, 1: 1671.6. Samples: 35908820. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:00:38,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 08:00:39,330][100936] Updated weights for policy 0, policy_version 70050 (0.0010) +[2023-10-14 08:00:39,700][100936] Updated weights for policy 0, policy_version 70060 (0.0009) +[2023-10-14 08:00:40,076][100936] Updated weights for policy 0, policy_version 70070 (0.0009) +[2023-10-14 08:00:40,445][100936] Updated weights for policy 0, policy_version 70080 (0.0008) +[2023-10-14 08:00:41,044][100917] Updated weights for policy 1, policy_version 70182 (0.0007) +[2023-10-14 08:00:41,418][100917] Updated weights for policy 1, policy_version 70192 (0.0009) +[2023-10-14 08:00:41,794][100917] Updated weights for policy 1, policy_version 70202 (0.0011) +[2023-10-14 08:00:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 143654912. Throughput: 0: 1666.0, 1: 1664.2. Samples: 35918816. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:00:43,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 08:00:44,721][100936] Updated weights for policy 0, policy_version 70090 (0.0007) +[2023-10-14 08:00:45,080][100936] Updated weights for policy 0, policy_version 70100 (0.0008) +[2023-10-14 08:00:45,461][100936] Updated weights for policy 0, policy_version 70110 (0.0008) +[2023-10-14 08:00:46,013][100917] Updated weights for policy 1, policy_version 70212 (0.0009) +[2023-10-14 08:00:46,386][100917] Updated weights for policy 1, policy_version 70222 (0.0007) +[2023-10-14 08:00:46,763][100917] Updated weights for policy 1, policy_version 70232 (0.0007) +[2023-10-14 08:00:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 143720448. Throughput: 0: 1664.2, 1: 1650.5. Samples: 35938296. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:00:48,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.630')] +[2023-10-14 08:00:49,479][100936] Updated weights for policy 0, policy_version 70120 (0.0008) +[2023-10-14 08:00:49,850][100936] Updated weights for policy 0, policy_version 70130 (0.0008) +[2023-10-14 08:00:50,211][100936] Updated weights for policy 0, policy_version 70140 (0.0007) +[2023-10-14 08:00:50,769][100917] Updated weights for policy 1, policy_version 70242 (0.0009) +[2023-10-14 08:00:51,137][100917] Updated weights for policy 1, policy_version 70252 (0.0010) +[2023-10-14 08:00:51,515][100917] Updated weights for policy 1, policy_version 70262 (0.0010) +[2023-10-14 08:00:51,891][100917] Updated weights for policy 1, policy_version 70272 (0.0010) +[2023-10-14 08:00:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 143785984. Throughput: 0: 1663.7, 1: 1655.6. Samples: 35958194. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 08:00:53,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.100')] +[2023-10-14 08:00:54,297][100936] Updated weights for policy 0, policy_version 70150 (0.0008) +[2023-10-14 08:00:54,678][100936] Updated weights for policy 0, policy_version 70160 (0.0008) +[2023-10-14 08:00:55,052][100936] Updated weights for policy 0, policy_version 70170 (0.0008) +[2023-10-14 08:00:56,056][100917] Updated weights for policy 1, policy_version 70282 (0.0010) +[2023-10-14 08:00:56,430][100917] Updated weights for policy 1, policy_version 70292 (0.0010) +[2023-10-14 08:00:56,798][100917] Updated weights for policy 1, policy_version 70302 (0.0009) +[2023-10-14 08:00:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 143851520. Throughput: 0: 1664.2, 1: 1650.4. Samples: 35968212. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 08:00:58,512][99942] Avg episode reward: [(0, '0.970'), (1, '0.100')] +[2023-10-14 08:00:59,352][100936] Updated weights for policy 0, policy_version 70180 (0.0008) +[2023-10-14 08:00:59,724][100936] Updated weights for policy 0, policy_version 70190 (0.0008) +[2023-10-14 08:01:00,092][100936] Updated weights for policy 0, policy_version 70200 (0.0011) +[2023-10-14 08:01:00,885][100917] Updated weights for policy 1, policy_version 70312 (0.0008) +[2023-10-14 08:01:01,250][100917] Updated weights for policy 1, policy_version 70322 (0.0011) +[2023-10-14 08:01:01,628][100917] Updated weights for policy 1, policy_version 70332 (0.0011) +[2023-10-14 08:01:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 143917056. Throughput: 0: 1654.1, 1: 1645.9. Samples: 35987380. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 08:01:03,513][99942] Avg episode reward: [(0, '0.970'), (1, '0.100')] +[2023-10-14 08:01:04,343][100936] Updated weights for policy 0, policy_version 70210 (0.0007) +[2023-10-14 08:01:04,721][100936] Updated weights for policy 0, policy_version 70220 (0.0008) +[2023-10-14 08:01:05,079][100936] Updated weights for policy 0, policy_version 70230 (0.0009) +[2023-10-14 08:01:05,454][100936] Updated weights for policy 0, policy_version 70240 (0.0008) +[2023-10-14 08:01:05,737][100917] Updated weights for policy 1, policy_version 70342 (0.0009) +[2023-10-14 08:01:06,115][100917] Updated weights for policy 1, policy_version 70352 (0.0011) +[2023-10-14 08:01:06,486][100917] Updated weights for policy 1, policy_version 70362 (0.0011) +[2023-10-14 08:01:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 143982592. Throughput: 0: 1656.4, 1: 1659.5. Samples: 36007928. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 08:01:08,513][99942] Avg episode reward: [(0, '0.970'), (1, '0.100')] +[2023-10-14 08:01:09,516][100936] Updated weights for policy 0, policy_version 70250 (0.0011) +[2023-10-14 08:01:09,885][100936] Updated weights for policy 0, policy_version 70260 (0.0010) +[2023-10-14 08:01:10,241][100936] Updated weights for policy 0, policy_version 70270 (0.0009) +[2023-10-14 08:01:10,609][100917] Updated weights for policy 1, policy_version 70372 (0.0009) +[2023-10-14 08:01:10,986][100917] Updated weights for policy 1, policy_version 70382 (0.0010) +[2023-10-14 08:01:11,361][100917] Updated weights for policy 1, policy_version 70392 (0.0010) +[2023-10-14 08:01:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 144048128. Throughput: 0: 1653.2, 1: 1647.5. Samples: 36017730. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 08:01:13,513][99942] Avg episode reward: [(0, '0.970'), (1, '0.100')] +[2023-10-14 08:01:14,328][100936] Updated weights for policy 0, policy_version 70280 (0.0007) +[2023-10-14 08:01:14,697][100936] Updated weights for policy 0, policy_version 70290 (0.0009) +[2023-10-14 08:01:15,071][100936] Updated weights for policy 0, policy_version 70300 (0.0007) +[2023-10-14 08:01:15,595][100917] Updated weights for policy 1, policy_version 70402 (0.0010) +[2023-10-14 08:01:15,970][100917] Updated weights for policy 1, policy_version 70412 (0.0009) +[2023-10-14 08:01:16,344][100917] Updated weights for policy 1, policy_version 70422 (0.0010) +[2023-10-14 08:01:16,714][100917] Updated weights for policy 1, policy_version 70432 (0.0011) +[2023-10-14 08:01:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 144113664. Throughput: 0: 1646.3, 1: 1649.7. Samples: 36037214. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 08:01:18,513][99942] Avg episode reward: [(0, '0.970'), (1, '0.100')] +[2023-10-14 08:01:19,085][100936] Updated weights for policy 0, policy_version 70310 (0.0010) +[2023-10-14 08:01:19,449][100936] Updated weights for policy 0, policy_version 70320 (0.0007) +[2023-10-14 08:01:19,820][100936] Updated weights for policy 0, policy_version 70330 (0.0008) +[2023-10-14 08:01:20,680][100917] Updated weights for policy 1, policy_version 70442 (0.0007) +[2023-10-14 08:01:21,057][100917] Updated weights for policy 1, policy_version 70452 (0.0009) +[2023-10-14 08:01:21,414][100917] Updated weights for policy 1, policy_version 70462 (0.0009) +[2023-10-14 08:01:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 144179200. Throughput: 0: 1648.3, 1: 1656.4. Samples: 36057532. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 08:01:23,513][99942] Avg episode reward: [(0, '0.970'), (1, '0.100')] +[2023-10-14 08:01:23,966][100936] Updated weights for policy 0, policy_version 70340 (0.0011) +[2023-10-14 08:01:24,334][100936] Updated weights for policy 0, policy_version 70350 (0.0010) +[2023-10-14 08:01:24,705][100936] Updated weights for policy 0, policy_version 70360 (0.0008) +[2023-10-14 08:01:25,596][100917] Updated weights for policy 1, policy_version 70472 (0.0007) +[2023-10-14 08:01:25,965][100917] Updated weights for policy 1, policy_version 70482 (0.0007) +[2023-10-14 08:01:26,338][100917] Updated weights for policy 1, policy_version 70492 (0.0010) +[2023-10-14 08:01:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 144244736. Throughput: 0: 1649.7, 1: 1648.0. Samples: 36067216. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 08:01:28,513][99942] Avg episode reward: [(0, '0.970'), (1, '0.100')] +[2023-10-14 08:01:28,967][100936] Updated weights for policy 0, policy_version 70370 (0.0010) +[2023-10-14 08:01:29,335][100936] Updated weights for policy 0, policy_version 70380 (0.0009) +[2023-10-14 08:01:29,702][100936] Updated weights for policy 0, policy_version 70390 (0.0008) +[2023-10-14 08:01:30,067][100936] Updated weights for policy 0, policy_version 70400 (0.0008) +[2023-10-14 08:01:30,457][100917] Updated weights for policy 1, policy_version 70502 (0.0011) +[2023-10-14 08:01:30,835][100917] Updated weights for policy 1, policy_version 70512 (0.0010) +[2023-10-14 08:01:31,199][100917] Updated weights for policy 1, policy_version 70522 (0.0008) +[2023-10-14 08:01:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 144310272. Throughput: 0: 1649.2, 1: 1653.7. Samples: 36086928. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) +[2023-10-14 08:01:33,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:01:34,228][100936] Updated weights for policy 0, policy_version 70410 (0.0007) +[2023-10-14 08:01:34,601][100936] Updated weights for policy 0, policy_version 70420 (0.0008) +[2023-10-14 08:01:34,971][100936] Updated weights for policy 0, policy_version 70430 (0.0008) +[2023-10-14 08:01:35,379][100917] Updated weights for policy 1, policy_version 70532 (0.0011) +[2023-10-14 08:01:35,754][100917] Updated weights for policy 1, policy_version 70542 (0.0007) +[2023-10-14 08:01:36,120][100917] Updated weights for policy 1, policy_version 70552 (0.0008) +[2023-10-14 08:01:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 144375808. Throughput: 0: 1651.0, 1: 1664.0. Samples: 36107368. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) +[2023-10-14 08:01:38,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:01:38,523][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000070432_72122368.pth... +[2023-10-14 08:01:38,524][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000070560_72253440.pth... +[2023-10-14 08:01:38,552][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000068896_70549504.pth +[2023-10-14 08:01:38,555][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000069024_70680576.pth +[2023-10-14 08:01:39,054][100936] Updated weights for policy 0, policy_version 70440 (0.0009) +[2023-10-14 08:01:39,415][100936] Updated weights for policy 0, policy_version 70450 (0.0007) +[2023-10-14 08:01:39,788][100936] Updated weights for policy 0, policy_version 70460 (0.0007) +[2023-10-14 08:01:40,059][100917] Updated weights for policy 1, policy_version 70562 (0.0010) +[2023-10-14 08:01:40,432][100917] Updated weights for policy 1, policy_version 70572 (0.0011) +[2023-10-14 08:01:40,808][100917] Updated weights for policy 1, policy_version 70582 (0.0010) +[2023-10-14 08:01:41,187][100917] Updated weights for policy 1, policy_version 70592 (0.0009) +[2023-10-14 08:01:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 144441344. Throughput: 0: 1654.1, 1: 1646.5. Samples: 36116740. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) +[2023-10-14 08:01:43,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:01:43,900][100936] Updated weights for policy 0, policy_version 70470 (0.0010) +[2023-10-14 08:01:44,281][100936] Updated weights for policy 0, policy_version 70480 (0.0008) +[2023-10-14 08:01:44,651][100936] Updated weights for policy 0, policy_version 70490 (0.0009) +[2023-10-14 08:01:45,346][100917] Updated weights for policy 1, policy_version 70602 (0.0008) +[2023-10-14 08:01:45,716][100917] Updated weights for policy 1, policy_version 70612 (0.0008) +[2023-10-14 08:01:46,093][100917] Updated weights for policy 1, policy_version 70622 (0.0009) +[2023-10-14 08:01:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 144506880. Throughput: 0: 1662.5, 1: 1659.2. Samples: 36136856. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) +[2023-10-14 08:01:48,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:01:48,799][100936] Updated weights for policy 0, policy_version 70500 (0.0007) +[2023-10-14 08:01:49,171][100936] Updated weights for policy 0, policy_version 70510 (0.0007) +[2023-10-14 08:01:49,530][100936] Updated weights for policy 0, policy_version 70520 (0.0007) +[2023-10-14 08:01:50,299][100917] Updated weights for policy 1, policy_version 70632 (0.0010) +[2023-10-14 08:01:50,678][100917] Updated weights for policy 1, policy_version 70642 (0.0009) +[2023-10-14 08:01:51,045][100917] Updated weights for policy 1, policy_version 70652 (0.0010) +[2023-10-14 08:01:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 144572416. Throughput: 0: 1661.5, 1: 1659.9. Samples: 36157392. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) +[2023-10-14 08:01:53,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:01:53,893][100936] Updated weights for policy 0, policy_version 70530 (0.0008) +[2023-10-14 08:01:54,277][100936] Updated weights for policy 0, policy_version 70540 (0.0009) +[2023-10-14 08:01:54,644][100936] Updated weights for policy 0, policy_version 70550 (0.0010) +[2023-10-14 08:01:55,014][100936] Updated weights for policy 0, policy_version 70560 (0.0010) +[2023-10-14 08:01:55,157][100917] Updated weights for policy 1, policy_version 70662 (0.0009) +[2023-10-14 08:01:55,528][100917] Updated weights for policy 1, policy_version 70672 (0.0010) +[2023-10-14 08:01:55,904][100917] Updated weights for policy 1, policy_version 70682 (0.0007) +[2023-10-14 08:01:58,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 144637952. Throughput: 0: 1660.3, 1: 1644.9. Samples: 36166464. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) +[2023-10-14 08:01:58,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:01:59,034][100936] Updated weights for policy 0, policy_version 70570 (0.0008) +[2023-10-14 08:01:59,401][100936] Updated weights for policy 0, policy_version 70580 (0.0009) +[2023-10-14 08:01:59,766][100936] Updated weights for policy 0, policy_version 70590 (0.0008) +[2023-10-14 08:01:59,956][100917] Updated weights for policy 1, policy_version 70692 (0.0007) +[2023-10-14 08:02:00,333][100917] Updated weights for policy 1, policy_version 70702 (0.0009) +[2023-10-14 08:02:00,696][100917] Updated weights for policy 1, policy_version 70712 (0.0009) +[2023-10-14 08:02:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 144703488. Throughput: 0: 1658.0, 1: 1661.0. Samples: 36186568. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) +[2023-10-14 08:02:03,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:02:03,949][100936] Updated weights for policy 0, policy_version 70600 (0.0009) +[2023-10-14 08:02:04,328][100936] Updated weights for policy 0, policy_version 70610 (0.0007) +[2023-10-14 08:02:04,699][100936] Updated weights for policy 0, policy_version 70620 (0.0007) +[2023-10-14 08:02:04,870][100917] Updated weights for policy 1, policy_version 70722 (0.0010) +[2023-10-14 08:02:05,243][100917] Updated weights for policy 1, policy_version 70732 (0.0009) +[2023-10-14 08:02:05,610][100917] Updated weights for policy 1, policy_version 70742 (0.0008) +[2023-10-14 08:02:05,990][100917] Updated weights for policy 1, policy_version 70752 (0.0007) +[2023-10-14 08:02:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 144769024. Throughput: 0: 1657.2, 1: 1662.1. Samples: 36206902. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) +[2023-10-14 08:02:08,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:02:08,780][100936] Updated weights for policy 0, policy_version 70630 (0.0008) +[2023-10-14 08:02:09,141][100936] Updated weights for policy 0, policy_version 70640 (0.0009) +[2023-10-14 08:02:09,512][100936] Updated weights for policy 0, policy_version 70650 (0.0008) +[2023-10-14 08:02:10,209][100917] Updated weights for policy 1, policy_version 70762 (0.0009) +[2023-10-14 08:02:10,579][100917] Updated weights for policy 1, policy_version 70772 (0.0009) +[2023-10-14 08:02:10,953][100917] Updated weights for policy 1, policy_version 70782 (0.0009) +[2023-10-14 08:02:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 144834560. Throughput: 0: 1658.4, 1: 1647.2. Samples: 36215970. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) +[2023-10-14 08:02:13,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:02:13,645][100936] Updated weights for policy 0, policy_version 70660 (0.0008) +[2023-10-14 08:02:14,013][100936] Updated weights for policy 0, policy_version 70670 (0.0008) +[2023-10-14 08:02:14,387][100936] Updated weights for policy 0, policy_version 70680 (0.0008) +[2023-10-14 08:02:15,021][100917] Updated weights for policy 1, policy_version 70792 (0.0009) +[2023-10-14 08:02:15,393][100917] Updated weights for policy 1, policy_version 70802 (0.0009) +[2023-10-14 08:02:15,759][100917] Updated weights for policy 1, policy_version 70812 (0.0007) +[2023-10-14 08:02:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 144900096. Throughput: 0: 1656.0, 1: 1660.5. Samples: 36236172. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) +[2023-10-14 08:02:18,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:02:18,534][100936] Updated weights for policy 0, policy_version 70690 (0.0009) +[2023-10-14 08:02:18,914][100936] Updated weights for policy 0, policy_version 70700 (0.0007) +[2023-10-14 08:02:19,274][100936] Updated weights for policy 0, policy_version 70710 (0.0009) +[2023-10-14 08:02:19,643][100936] Updated weights for policy 0, policy_version 70720 (0.0009) +[2023-10-14 08:02:19,939][100917] Updated weights for policy 1, policy_version 70822 (0.0008) +[2023-10-14 08:02:20,318][100917] Updated weights for policy 1, policy_version 70832 (0.0007) +[2023-10-14 08:02:20,701][100917] Updated weights for policy 1, policy_version 70842 (0.0007) +[2023-10-14 08:02:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 144965632. Throughput: 0: 1654.8, 1: 1663.7. Samples: 36256698. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:02:23,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:02:23,766][100936] Updated weights for policy 0, policy_version 70730 (0.0007) +[2023-10-14 08:02:24,137][100936] Updated weights for policy 0, policy_version 70740 (0.0008) +[2023-10-14 08:02:24,505][100936] Updated weights for policy 0, policy_version 70750 (0.0010) +[2023-10-14 08:02:24,889][100917] Updated weights for policy 1, policy_version 70852 (0.0008) +[2023-10-14 08:02:25,253][100917] Updated weights for policy 1, policy_version 70862 (0.0007) +[2023-10-14 08:02:25,632][100917] Updated weights for policy 1, policy_version 70872 (0.0007) +[2023-10-14 08:02:28,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 145031168. Throughput: 0: 1654.6, 1: 1655.8. Samples: 36265710. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:02:28,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:02:28,525][100936] Updated weights for policy 0, policy_version 70760 (0.0010) +[2023-10-14 08:02:28,888][100936] Updated weights for policy 0, policy_version 70770 (0.0007) +[2023-10-14 08:02:29,257][100936] Updated weights for policy 0, policy_version 70780 (0.0008) +[2023-10-14 08:02:29,757][100917] Updated weights for policy 1, policy_version 70882 (0.0009) +[2023-10-14 08:02:30,125][100917] Updated weights for policy 1, policy_version 70892 (0.0008) +[2023-10-14 08:02:30,496][100917] Updated weights for policy 1, policy_version 70902 (0.0008) +[2023-10-14 08:02:30,868][100917] Updated weights for policy 1, policy_version 70912 (0.0009) +[2023-10-14 08:02:33,400][100936] Updated weights for policy 0, policy_version 70790 (0.0008) +[2023-10-14 08:02:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 145096704. Throughput: 0: 1655.5, 1: 1665.6. Samples: 36286304. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:02:33,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:02:33,765][100936] Updated weights for policy 0, policy_version 70800 (0.0009) +[2023-10-14 08:02:34,132][100936] Updated weights for policy 0, policy_version 70810 (0.0007) +[2023-10-14 08:02:34,860][100917] Updated weights for policy 1, policy_version 70922 (0.0007) +[2023-10-14 08:02:35,221][100917] Updated weights for policy 1, policy_version 70932 (0.0009) +[2023-10-14 08:02:35,598][100917] Updated weights for policy 1, policy_version 70942 (0.0007) +[2023-10-14 08:02:38,256][100936] Updated weights for policy 0, policy_version 70820 (0.0007) +[2023-10-14 08:02:38,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 145162240. Throughput: 0: 1646.8, 1: 1665.1. Samples: 36306426. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:02:38,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:02:38,630][100936] Updated weights for policy 0, policy_version 70830 (0.0011) +[2023-10-14 08:02:38,985][100936] Updated weights for policy 0, policy_version 70840 (0.0011) +[2023-10-14 08:02:39,740][100917] Updated weights for policy 1, policy_version 70952 (0.0008) +[2023-10-14 08:02:40,108][100917] Updated weights for policy 1, policy_version 70962 (0.0008) +[2023-10-14 08:02:40,475][100917] Updated weights for policy 1, policy_version 70972 (0.0008) +[2023-10-14 08:02:43,139][100936] Updated weights for policy 0, policy_version 70850 (0.0007) +[2023-10-14 08:02:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 145227776. Throughput: 0: 1659.1, 1: 1657.2. Samples: 36315700. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:02:43,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:02:43,529][100936] Updated weights for policy 0, policy_version 70860 (0.0008) +[2023-10-14 08:02:43,895][100936] Updated weights for policy 0, policy_version 70870 (0.0010) +[2023-10-14 08:02:44,262][100936] Updated weights for policy 0, policy_version 70880 (0.0010) +[2023-10-14 08:02:44,470][100917] Updated weights for policy 1, policy_version 70982 (0.0007) +[2023-10-14 08:02:44,845][100917] Updated weights for policy 1, policy_version 70992 (0.0008) +[2023-10-14 08:02:45,221][100917] Updated weights for policy 1, policy_version 71002 (0.0010) +[2023-10-14 08:02:48,460][100936] Updated weights for policy 0, policy_version 70890 (0.0009) +[2023-10-14 08:02:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 145293312. Throughput: 0: 1662.6, 1: 1660.4. Samples: 36336106. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:02:48,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:02:48,831][100936] Updated weights for policy 0, policy_version 70900 (0.0007) +[2023-10-14 08:02:49,194][100936] Updated weights for policy 0, policy_version 70910 (0.0009) +[2023-10-14 08:02:49,459][100917] Updated weights for policy 1, policy_version 71012 (0.0009) +[2023-10-14 08:02:49,829][100917] Updated weights for policy 1, policy_version 71022 (0.0007) +[2023-10-14 08:02:50,195][100917] Updated weights for policy 1, policy_version 71032 (0.0007) +[2023-10-14 08:02:53,171][100936] Updated weights for policy 0, policy_version 70920 (0.0008) +[2023-10-14 08:02:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 145358848. Throughput: 0: 1657.5, 1: 1664.9. Samples: 36356408. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:02:53,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:02:53,539][100936] Updated weights for policy 0, policy_version 70930 (0.0011) +[2023-10-14 08:02:53,903][100936] Updated weights for policy 0, policy_version 70940 (0.0010) +[2023-10-14 08:02:54,242][100917] Updated weights for policy 1, policy_version 71042 (0.0009) +[2023-10-14 08:02:54,654][100917] Updated weights for policy 1, policy_version 71052 (0.0009) +[2023-10-14 08:02:55,026][100917] Updated weights for policy 1, policy_version 71062 (0.0008) +[2023-10-14 08:02:55,396][100917] Updated weights for policy 1, policy_version 71072 (0.0008) +[2023-10-14 08:02:58,160][100936] Updated weights for policy 0, policy_version 70950 (0.0010) +[2023-10-14 08:02:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 145424384. Throughput: 0: 1667.3, 1: 1661.3. Samples: 36365754. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:02:58,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:02:58,530][100936] Updated weights for policy 0, policy_version 70960 (0.0010) +[2023-10-14 08:02:58,908][100936] Updated weights for policy 0, policy_version 70970 (0.0009) +[2023-10-14 08:02:59,523][100917] Updated weights for policy 1, policy_version 71082 (0.0008) +[2023-10-14 08:02:59,902][100917] Updated weights for policy 1, policy_version 71092 (0.0007) +[2023-10-14 08:03:00,264][100917] Updated weights for policy 1, policy_version 71102 (0.0007) +[2023-10-14 08:03:03,107][100936] Updated weights for policy 0, policy_version 70980 (0.0009) +[2023-10-14 08:03:03,470][100936] Updated weights for policy 0, policy_version 70990 (0.0010) +[2023-10-14 08:03:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 145489920. Throughput: 0: 1665.2, 1: 1665.9. Samples: 36386072. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:03:03,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:03:03,837][100936] Updated weights for policy 0, policy_version 71000 (0.0010) +[2023-10-14 08:03:04,380][100917] Updated weights for policy 1, policy_version 71112 (0.0009) +[2023-10-14 08:03:04,749][100917] Updated weights for policy 1, policy_version 71122 (0.0011) +[2023-10-14 08:03:05,117][100917] Updated weights for policy 1, policy_version 71132 (0.0009) +[2023-10-14 08:03:07,839][100936] Updated weights for policy 0, policy_version 71010 (0.0008) +[2023-10-14 08:03:08,212][100936] Updated weights for policy 0, policy_version 71020 (0.0007) +[2023-10-14 08:03:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 145555456. Throughput: 0: 1653.9, 1: 1660.6. Samples: 36405848. Policy #0 lag: (min: 30.0, avg: 31.3, max: 56.0) +[2023-10-14 08:03:08,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:03:08,576][100936] Updated weights for policy 0, policy_version 71030 (0.0007) +[2023-10-14 08:03:08,944][100936] Updated weights for policy 0, policy_version 71040 (0.0007) +[2023-10-14 08:03:09,215][100917] Updated weights for policy 1, policy_version 71142 (0.0008) +[2023-10-14 08:03:09,584][100917] Updated weights for policy 1, policy_version 71152 (0.0008) +[2023-10-14 08:03:09,953][100917] Updated weights for policy 1, policy_version 71162 (0.0009) +[2023-10-14 08:03:12,904][100936] Updated weights for policy 0, policy_version 71050 (0.0007) +[2023-10-14 08:03:13,264][100936] Updated weights for policy 0, policy_version 71060 (0.0008) +[2023-10-14 08:03:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 145620992. Throughput: 0: 1670.2, 1: 1664.1. Samples: 36415754. Policy #0 lag: (min: 30.0, avg: 31.3, max: 56.0) +[2023-10-14 08:03:13,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:03:13,636][100936] Updated weights for policy 0, policy_version 71070 (0.0009) +[2023-10-14 08:03:14,173][100917] Updated weights for policy 1, policy_version 71172 (0.0009) +[2023-10-14 08:03:14,549][100917] Updated weights for policy 1, policy_version 71182 (0.0008) +[2023-10-14 08:03:14,912][100917] Updated weights for policy 1, policy_version 71192 (0.0010) +[2023-10-14 08:03:17,513][100936] Updated weights for policy 0, policy_version 71080 (0.0009) +[2023-10-14 08:03:17,874][100936] Updated weights for policy 0, policy_version 71090 (0.0008) +[2023-10-14 08:03:18,245][100936] Updated weights for policy 0, policy_version 71100 (0.0007) +[2023-10-14 08:03:18,512][99942] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 145719296. Throughput: 0: 1668.2, 1: 1660.5. Samples: 36436096. Policy #0 lag: (min: 30.0, avg: 31.3, max: 56.0) +[2023-10-14 08:03:18,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:03:19,070][100917] Updated weights for policy 1, policy_version 71202 (0.0009) +[2023-10-14 08:03:19,439][100917] Updated weights for policy 1, policy_version 71212 (0.0011) +[2023-10-14 08:03:19,811][100917] Updated weights for policy 1, policy_version 71222 (0.0011) +[2023-10-14 08:03:20,191][100917] Updated weights for policy 1, policy_version 71232 (0.0008) +[2023-10-14 08:03:22,340][100936] Updated weights for policy 0, policy_version 71110 (0.0009) +[2023-10-14 08:03:22,706][100936] Updated weights for policy 0, policy_version 71120 (0.0009) +[2023-10-14 08:03:23,077][100936] Updated weights for policy 0, policy_version 71130 (0.0007) +[2023-10-14 08:03:23,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 145784832. Throughput: 0: 1650.7, 1: 1664.6. Samples: 36455614. Policy #0 lag: (min: 30.0, avg: 31.3, max: 56.0) +[2023-10-14 08:03:23,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:03:24,268][100917] Updated weights for policy 1, policy_version 71242 (0.0007) +[2023-10-14 08:03:24,645][100917] Updated weights for policy 1, policy_version 71252 (0.0009) +[2023-10-14 08:03:25,010][100917] Updated weights for policy 1, policy_version 71262 (0.0009) +[2023-10-14 08:03:27,156][100936] Updated weights for policy 0, policy_version 71140 (0.0008) +[2023-10-14 08:03:27,527][100936] Updated weights for policy 0, policy_version 71150 (0.0009) +[2023-10-14 08:03:27,891][100936] Updated weights for policy 0, policy_version 71160 (0.0009) +[2023-10-14 08:03:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 145850368. Throughput: 0: 1676.3, 1: 1665.3. Samples: 36466070. Policy #0 lag: (min: 30.0, avg: 31.3, max: 56.0) +[2023-10-14 08:03:28,513][99942] Avg episode reward: [(0, '0.980'), (1, '0.470')] +[2023-10-14 08:03:29,151][100917] Updated weights for policy 1, policy_version 71272 (0.0008) +[2023-10-14 08:03:29,541][100917] Updated weights for policy 1, policy_version 71282 (0.0007) +[2023-10-14 08:03:29,903][100917] Updated weights for policy 1, policy_version 71292 (0.0010) +[2023-10-14 08:03:32,183][100936] Updated weights for policy 0, policy_version 71170 (0.0009) +[2023-10-14 08:03:32,582][100936] Updated weights for policy 0, policy_version 71180 (0.0009) +[2023-10-14 08:03:32,936][100936] Updated weights for policy 0, policy_version 71190 (0.0008) +[2023-10-14 08:03:33,301][100936] Updated weights for policy 0, policy_version 71200 (0.0007) +[2023-10-14 08:03:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 145915904. Throughput: 0: 1663.3, 1: 1663.6. Samples: 36485818. Policy #0 lag: (min: 30.0, avg: 31.3, max: 56.0) +[2023-10-14 08:03:33,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 08:03:33,852][100917] Updated weights for policy 1, policy_version 71302 (0.0007) +[2023-10-14 08:03:34,232][100917] Updated weights for policy 1, policy_version 71312 (0.0008) +[2023-10-14 08:03:34,610][100917] Updated weights for policy 1, policy_version 71322 (0.0009) +[2023-10-14 08:03:37,416][100936] Updated weights for policy 0, policy_version 71210 (0.0008) +[2023-10-14 08:03:37,786][100936] Updated weights for policy 0, policy_version 71220 (0.0009) +[2023-10-14 08:03:38,163][100936] Updated weights for policy 0, policy_version 71230 (0.0008) +[2023-10-14 08:03:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 145981440. Throughput: 0: 1655.1, 1: 1662.2. Samples: 36505686. Policy #0 lag: (min: 30.0, avg: 31.3, max: 56.0) +[2023-10-14 08:03:38,513][99942] Avg episode reward: [(0, '0.980'), (1, '1.000')] +[2023-10-14 08:03:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000071232_72941568.pth... +[2023-10-14 08:03:38,560][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000069664_71335936.pth +[2023-10-14 08:03:38,610][100917] Updated weights for policy 1, policy_version 71332 (0.0009) +[2023-10-14 08:03:38,993][100917] Updated weights for policy 1, policy_version 71342 (0.0009) +[2023-10-14 08:03:39,372][100917] Updated weights for policy 1, policy_version 71352 (0.0011) +[2023-10-14 08:03:39,666][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000071360_73072640.pth... +[2023-10-14 08:03:39,705][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000069792_71467008.pth +[2023-10-14 08:03:42,308][100936] Updated weights for policy 0, policy_version 71240 (0.0009) +[2023-10-14 08:03:42,681][100936] Updated weights for policy 0, policy_version 71250 (0.0011) +[2023-10-14 08:03:43,052][100936] Updated weights for policy 0, policy_version 71260 (0.0009) +[2023-10-14 08:03:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 146046976. Throughput: 0: 1673.3, 1: 1662.7. Samples: 36515876. Policy #0 lag: (min: 30.0, avg: 31.3, max: 56.0) +[2023-10-14 08:03:43,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:03:43,532][100917] Updated weights for policy 1, policy_version 71362 (0.0010) +[2023-10-14 08:03:43,894][100917] Updated weights for policy 1, policy_version 71372 (0.0010) +[2023-10-14 08:03:44,274][100917] Updated weights for policy 1, policy_version 71382 (0.0007) +[2023-10-14 08:03:44,646][100917] Updated weights for policy 1, policy_version 71392 (0.0007) +[2023-10-14 08:03:47,223][100936] Updated weights for policy 0, policy_version 71270 (0.0008) +[2023-10-14 08:03:47,602][100936] Updated weights for policy 0, policy_version 71280 (0.0010) +[2023-10-14 08:03:47,965][100936] Updated weights for policy 0, policy_version 71290 (0.0010) +[2023-10-14 08:03:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 146112512. Throughput: 0: 1669.1, 1: 1658.9. Samples: 36535832. Policy #0 lag: (min: 30.0, avg: 31.3, max: 56.0) +[2023-10-14 08:03:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:03:48,798][100917] Updated weights for policy 1, policy_version 71402 (0.0008) +[2023-10-14 08:03:49,177][100917] Updated weights for policy 1, policy_version 71412 (0.0007) +[2023-10-14 08:03:49,538][100917] Updated weights for policy 1, policy_version 71422 (0.0009) +[2023-10-14 08:03:51,997][100936] Updated weights for policy 0, policy_version 71300 (0.0010) +[2023-10-14 08:03:52,359][100936] Updated weights for policy 0, policy_version 71310 (0.0010) +[2023-10-14 08:03:52,733][100936] Updated weights for policy 0, policy_version 71320 (0.0007) +[2023-10-14 08:03:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 146178048. Throughput: 0: 1661.3, 1: 1661.4. Samples: 36555368. Policy #0 lag: (min: 31.0, avg: 32.7, max: 59.0) +[2023-10-14 08:03:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:03:53,637][100917] Updated weights for policy 1, policy_version 71432 (0.0009) +[2023-10-14 08:03:54,008][100917] Updated weights for policy 1, policy_version 71442 (0.0008) +[2023-10-14 08:03:54,372][100917] Updated weights for policy 1, policy_version 71452 (0.0007) +[2023-10-14 08:03:56,864][100936] Updated weights for policy 0, policy_version 71330 (0.0009) +[2023-10-14 08:03:57,228][100936] Updated weights for policy 0, policy_version 71340 (0.0010) +[2023-10-14 08:03:57,591][100936] Updated weights for policy 0, policy_version 71350 (0.0011) +[2023-10-14 08:03:57,962][100936] Updated weights for policy 0, policy_version 71360 (0.0010) +[2023-10-14 08:03:58,477][100917] Updated weights for policy 1, policy_version 71462 (0.0007) +[2023-10-14 08:03:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 146243584. Throughput: 0: 1670.8, 1: 1658.0. Samples: 36565550. Policy #0 lag: (min: 31.0, avg: 32.7, max: 59.0) +[2023-10-14 08:03:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:03:58,856][100917] Updated weights for policy 1, policy_version 71472 (0.0007) +[2023-10-14 08:03:59,232][100917] Updated weights for policy 1, policy_version 71482 (0.0010) +[2023-10-14 08:04:02,141][100936] Updated weights for policy 0, policy_version 71370 (0.0009) +[2023-10-14 08:04:02,501][100936] Updated weights for policy 0, policy_version 71380 (0.0011) +[2023-10-14 08:04:02,868][100936] Updated weights for policy 0, policy_version 71390 (0.0010) +[2023-10-14 08:04:03,365][100917] Updated weights for policy 1, policy_version 71492 (0.0010) +[2023-10-14 08:04:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 146309120. Throughput: 0: 1655.8, 1: 1663.3. Samples: 36585454. Policy #0 lag: (min: 31.0, avg: 32.7, max: 59.0) +[2023-10-14 08:04:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:04:03,730][100917] Updated weights for policy 1, policy_version 71502 (0.0011) +[2023-10-14 08:04:04,099][100917] Updated weights for policy 1, policy_version 71512 (0.0010) +[2023-10-14 08:04:06,989][100936] Updated weights for policy 0, policy_version 71400 (0.0007) +[2023-10-14 08:04:07,362][100936] Updated weights for policy 0, policy_version 71410 (0.0007) +[2023-10-14 08:04:07,737][100936] Updated weights for policy 0, policy_version 71420 (0.0008) +[2023-10-14 08:04:08,154][100917] Updated weights for policy 1, policy_version 71522 (0.0009) +[2023-10-14 08:04:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 146374656. Throughput: 0: 1666.2, 1: 1664.9. Samples: 36605516. Policy #0 lag: (min: 31.0, avg: 32.7, max: 59.0) +[2023-10-14 08:04:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:04:08,516][100917] Updated weights for policy 1, policy_version 71532 (0.0007) +[2023-10-14 08:04:08,902][100917] Updated weights for policy 1, policy_version 71542 (0.0010) +[2023-10-14 08:04:09,269][100917] Updated weights for policy 1, policy_version 71552 (0.0008) +[2023-10-14 08:04:11,872][100936] Updated weights for policy 0, policy_version 71430 (0.0009) +[2023-10-14 08:04:12,244][100936] Updated weights for policy 0, policy_version 71440 (0.0007) +[2023-10-14 08:04:12,622][100936] Updated weights for policy 0, policy_version 71450 (0.0008) +[2023-10-14 08:04:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 146440192. Throughput: 0: 1661.2, 1: 1664.7. Samples: 36615734. Policy #0 lag: (min: 31.0, avg: 32.7, max: 59.0) +[2023-10-14 08:04:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:04:13,620][100917] Updated weights for policy 1, policy_version 71562 (0.0007) +[2023-10-14 08:04:13,990][100917] Updated weights for policy 1, policy_version 71572 (0.0010) +[2023-10-14 08:04:14,365][100917] Updated weights for policy 1, policy_version 71582 (0.0007) +[2023-10-14 08:04:16,710][100936] Updated weights for policy 0, policy_version 71460 (0.0008) +[2023-10-14 08:04:17,104][100936] Updated weights for policy 0, policy_version 71470 (0.0010) +[2023-10-14 08:04:17,478][100936] Updated weights for policy 0, policy_version 71480 (0.0009) +[2023-10-14 08:04:18,490][100917] Updated weights for policy 1, policy_version 71592 (0.0009) +[2023-10-14 08:04:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 146505728. Throughput: 0: 1651.2, 1: 1663.8. Samples: 36634996. Policy #0 lag: (min: 31.0, avg: 32.7, max: 59.0) +[2023-10-14 08:04:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:04:18,867][100917] Updated weights for policy 1, policy_version 71602 (0.0010) +[2023-10-14 08:04:19,246][100917] Updated weights for policy 1, policy_version 71612 (0.0011) +[2023-10-14 08:04:21,439][100936] Updated weights for policy 0, policy_version 71490 (0.0008) +[2023-10-14 08:04:21,807][100936] Updated weights for policy 0, policy_version 71500 (0.0007) +[2023-10-14 08:04:22,181][100936] Updated weights for policy 0, policy_version 71510 (0.0007) +[2023-10-14 08:04:22,545][100936] Updated weights for policy 0, policy_version 71520 (0.0008) +[2023-10-14 08:04:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 146571264. Throughput: 0: 1667.5, 1: 1660.9. Samples: 36655464. Policy #0 lag: (min: 31.0, avg: 32.7, max: 59.0) +[2023-10-14 08:04:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:04:23,518][100917] Updated weights for policy 1, policy_version 71622 (0.0010) +[2023-10-14 08:04:23,888][100917] Updated weights for policy 1, policy_version 71632 (0.0008) +[2023-10-14 08:04:24,265][100917] Updated weights for policy 1, policy_version 71642 (0.0007) +[2023-10-14 08:04:26,296][100936] Updated weights for policy 0, policy_version 71530 (0.0010) +[2023-10-14 08:04:26,670][100936] Updated weights for policy 0, policy_version 71540 (0.0009) +[2023-10-14 08:04:27,030][100936] Updated weights for policy 0, policy_version 71550 (0.0009) +[2023-10-14 08:04:28,296][100917] Updated weights for policy 1, policy_version 71652 (0.0007) +[2023-10-14 08:04:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 146636800. Throughput: 0: 1661.3, 1: 1664.0. Samples: 36665514. Policy #0 lag: (min: 31.0, avg: 32.7, max: 59.0) +[2023-10-14 08:04:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:04:28,680][100917] Updated weights for policy 1, policy_version 71662 (0.0008) +[2023-10-14 08:04:29,057][100917] Updated weights for policy 1, policy_version 71672 (0.0008) +[2023-10-14 08:04:31,258][100936] Updated weights for policy 0, policy_version 71560 (0.0010) +[2023-10-14 08:04:31,621][100936] Updated weights for policy 0, policy_version 71570 (0.0010) +[2023-10-14 08:04:31,997][100936] Updated weights for policy 0, policy_version 71580 (0.0011) +[2023-10-14 08:04:32,868][100917] Updated weights for policy 1, policy_version 71682 (0.0008) +[2023-10-14 08:04:33,248][100917] Updated weights for policy 1, policy_version 71692 (0.0011) +[2023-10-14 08:04:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 146702336. Throughput: 0: 1652.3, 1: 1671.4. Samples: 36685398. Policy #0 lag: (min: 31.0, avg: 32.7, max: 59.0) +[2023-10-14 08:04:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:04:33,618][100917] Updated weights for policy 1, policy_version 71702 (0.0009) +[2023-10-14 08:04:33,988][100917] Updated weights for policy 1, policy_version 71712 (0.0009) +[2023-10-14 08:04:36,018][100936] Updated weights for policy 0, policy_version 71590 (0.0009) +[2023-10-14 08:04:36,392][100936] Updated weights for policy 0, policy_version 71600 (0.0011) +[2023-10-14 08:04:36,754][100936] Updated weights for policy 0, policy_version 71610 (0.0009) +[2023-10-14 08:04:38,148][100917] Updated weights for policy 1, policy_version 71722 (0.0009) +[2023-10-14 08:04:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 146767872. Throughput: 0: 1675.5, 1: 1667.9. Samples: 36705820. Policy #0 lag: (min: 31.0, avg: 32.7, max: 59.0) +[2023-10-14 08:04:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:04:38,518][100917] Updated weights for policy 1, policy_version 71732 (0.0008) +[2023-10-14 08:04:38,894][100917] Updated weights for policy 1, policy_version 71742 (0.0007) +[2023-10-14 08:04:40,918][100936] Updated weights for policy 0, policy_version 71620 (0.0008) +[2023-10-14 08:04:41,286][100936] Updated weights for policy 0, policy_version 71630 (0.0008) +[2023-10-14 08:04:41,639][100936] Updated weights for policy 0, policy_version 71640 (0.0007) +[2023-10-14 08:04:43,087][100917] Updated weights for policy 1, policy_version 71752 (0.0008) +[2023-10-14 08:04:43,468][100917] Updated weights for policy 1, policy_version 71762 (0.0007) +[2023-10-14 08:04:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 146833408. Throughput: 0: 1660.4, 1: 1671.1. Samples: 36715468. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:04:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:04:43,834][100917] Updated weights for policy 1, policy_version 71772 (0.0007) +[2023-10-14 08:04:45,630][100936] Updated weights for policy 0, policy_version 71650 (0.0009) +[2023-10-14 08:04:45,998][100936] Updated weights for policy 0, policy_version 71660 (0.0008) +[2023-10-14 08:04:46,364][100936] Updated weights for policy 0, policy_version 71670 (0.0008) +[2023-10-14 08:04:46,736][100936] Updated weights for policy 0, policy_version 71680 (0.0008) +[2023-10-14 08:04:47,722][100917] Updated weights for policy 1, policy_version 71782 (0.0007) +[2023-10-14 08:04:48,096][100917] Updated weights for policy 1, policy_version 71792 (0.0010) +[2023-10-14 08:04:48,468][100917] Updated weights for policy 1, policy_version 71802 (0.0009) +[2023-10-14 08:04:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 146898944. Throughput: 0: 1660.3, 1: 1673.2. Samples: 36735460. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:04:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:04:50,874][100936] Updated weights for policy 0, policy_version 71690 (0.0008) +[2023-10-14 08:04:51,240][100936] Updated weights for policy 0, policy_version 71700 (0.0008) +[2023-10-14 08:04:51,604][100936] Updated weights for policy 0, policy_version 71710 (0.0008) +[2023-10-14 08:04:52,727][100917] Updated weights for policy 1, policy_version 71812 (0.0011) +[2023-10-14 08:04:53,102][100917] Updated weights for policy 1, policy_version 71822 (0.0008) +[2023-10-14 08:04:53,465][100917] Updated weights for policy 1, policy_version 71832 (0.0007) +[2023-10-14 08:04:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 146964480. Throughput: 0: 1683.6, 1: 1655.1. Samples: 36755760. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:04:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:04:55,586][100936] Updated weights for policy 0, policy_version 71720 (0.0007) +[2023-10-14 08:04:55,965][100936] Updated weights for policy 0, policy_version 71730 (0.0009) +[2023-10-14 08:04:56,340][100936] Updated weights for policy 0, policy_version 71740 (0.0009) +[2023-10-14 08:04:57,540][100917] Updated weights for policy 1, policy_version 71842 (0.0010) +[2023-10-14 08:04:57,913][100917] Updated weights for policy 1, policy_version 71852 (0.0010) +[2023-10-14 08:04:58,286][100917] Updated weights for policy 1, policy_version 71862 (0.0008) +[2023-10-14 08:04:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 147030016. Throughput: 0: 1659.0, 1: 1666.3. Samples: 36765374. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:04:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:04:58,660][100917] Updated weights for policy 1, policy_version 71872 (0.0010) +[2023-10-14 08:05:00,356][100936] Updated weights for policy 0, policy_version 71750 (0.0008) +[2023-10-14 08:05:00,716][100936] Updated weights for policy 0, policy_version 71760 (0.0008) +[2023-10-14 08:05:01,095][100936] Updated weights for policy 0, policy_version 71770 (0.0008) +[2023-10-14 08:05:02,704][100917] Updated weights for policy 1, policy_version 71882 (0.0008) +[2023-10-14 08:05:03,076][100917] Updated weights for policy 1, policy_version 71892 (0.0008) +[2023-10-14 08:05:03,448][100917] Updated weights for policy 1, policy_version 71902 (0.0009) +[2023-10-14 08:05:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 147095552. Throughput: 0: 1677.5, 1: 1670.1. Samples: 36785638. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:05:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:05:05,408][100936] Updated weights for policy 0, policy_version 71780 (0.0008) +[2023-10-14 08:05:05,804][100936] Updated weights for policy 0, policy_version 71790 (0.0009) +[2023-10-14 08:05:06,167][100936] Updated weights for policy 0, policy_version 71800 (0.0011) +[2023-10-14 08:05:07,632][100917] Updated weights for policy 1, policy_version 71912 (0.0009) +[2023-10-14 08:05:08,002][100917] Updated weights for policy 1, policy_version 71922 (0.0010) +[2023-10-14 08:05:08,373][100917] Updated weights for policy 1, policy_version 71932 (0.0009) +[2023-10-14 08:05:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 147161088. Throughput: 0: 1674.9, 1: 1655.8. Samples: 36805344. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:05:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:05:10,276][100936] Updated weights for policy 0, policy_version 71810 (0.0011) +[2023-10-14 08:05:10,656][100936] Updated weights for policy 0, policy_version 71820 (0.0008) +[2023-10-14 08:05:11,019][100936] Updated weights for policy 0, policy_version 71830 (0.0010) +[2023-10-14 08:05:11,391][100936] Updated weights for policy 0, policy_version 71840 (0.0009) +[2023-10-14 08:05:12,633][100917] Updated weights for policy 1, policy_version 71942 (0.0010) +[2023-10-14 08:05:13,002][100917] Updated weights for policy 1, policy_version 71952 (0.0010) +[2023-10-14 08:05:13,387][100917] Updated weights for policy 1, policy_version 71962 (0.0008) +[2023-10-14 08:05:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 147226624. Throughput: 0: 1652.7, 1: 1668.5. Samples: 36814972. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:05:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:05:15,514][100936] Updated weights for policy 0, policy_version 71850 (0.0010) +[2023-10-14 08:05:15,885][100936] Updated weights for policy 0, policy_version 71860 (0.0011) +[2023-10-14 08:05:16,247][100936] Updated weights for policy 0, policy_version 71870 (0.0011) +[2023-10-14 08:05:17,640][100917] Updated weights for policy 1, policy_version 71972 (0.0010) +[2023-10-14 08:05:18,023][100917] Updated weights for policy 1, policy_version 71982 (0.0010) +[2023-10-14 08:05:18,396][100917] Updated weights for policy 1, policy_version 71992 (0.0008) +[2023-10-14 08:05:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 147292160. Throughput: 0: 1664.4, 1: 1666.1. Samples: 36835272. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:05:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:05:20,674][100936] Updated weights for policy 0, policy_version 71880 (0.0010) +[2023-10-14 08:05:21,046][100936] Updated weights for policy 0, policy_version 71890 (0.0009) +[2023-10-14 08:05:21,421][100936] Updated weights for policy 0, policy_version 71900 (0.0007) +[2023-10-14 08:05:22,413][100917] Updated weights for policy 1, policy_version 72002 (0.0007) +[2023-10-14 08:05:22,791][100917] Updated weights for policy 1, policy_version 72012 (0.0007) +[2023-10-14 08:05:23,171][100917] Updated weights for policy 1, policy_version 72022 (0.0008) +[2023-10-14 08:05:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 147357696. Throughput: 0: 1658.5, 1: 1659.8. Samples: 36855146. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:05:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:05:23,541][100917] Updated weights for policy 1, policy_version 72032 (0.0007) +[2023-10-14 08:05:25,636][100936] Updated weights for policy 0, policy_version 71910 (0.0009) +[2023-10-14 08:05:26,003][100936] Updated weights for policy 0, policy_version 71920 (0.0009) +[2023-10-14 08:05:26,375][100936] Updated weights for policy 0, policy_version 71930 (0.0011) +[2023-10-14 08:05:27,597][100917] Updated weights for policy 1, policy_version 72042 (0.0008) +[2023-10-14 08:05:27,973][100917] Updated weights for policy 1, policy_version 72052 (0.0007) +[2023-10-14 08:05:28,345][100917] Updated weights for policy 1, policy_version 72062 (0.0007) +[2023-10-14 08:05:28,512][99942] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 147456000. Throughput: 0: 1650.9, 1: 1670.1. Samples: 36864912. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:05:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:05:30,492][100936] Updated weights for policy 0, policy_version 71940 (0.0009) +[2023-10-14 08:05:30,860][100936] Updated weights for policy 0, policy_version 71950 (0.0009) +[2023-10-14 08:05:31,229][100936] Updated weights for policy 0, policy_version 71960 (0.0008) +[2023-10-14 08:05:32,394][100917] Updated weights for policy 1, policy_version 72072 (0.0007) +[2023-10-14 08:05:32,764][100917] Updated weights for policy 1, policy_version 72082 (0.0008) +[2023-10-14 08:05:33,135][100917] Updated weights for policy 1, policy_version 72092 (0.0008) +[2023-10-14 08:05:33,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 147521536. Throughput: 0: 1659.3, 1: 1665.2. Samples: 36885064. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:05:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:05:35,425][100936] Updated weights for policy 0, policy_version 71970 (0.0009) +[2023-10-14 08:05:35,803][100936] Updated weights for policy 0, policy_version 71980 (0.0009) +[2023-10-14 08:05:36,173][100936] Updated weights for policy 0, policy_version 71990 (0.0008) +[2023-10-14 08:05:36,551][100936] Updated weights for policy 0, policy_version 72000 (0.0009) +[2023-10-14 08:05:37,224][100917] Updated weights for policy 1, policy_version 72102 (0.0008) +[2023-10-14 08:05:37,602][100917] Updated weights for policy 1, policy_version 72112 (0.0009) +[2023-10-14 08:05:37,963][100917] Updated weights for policy 1, policy_version 72122 (0.0007) +[2023-10-14 08:05:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 147587072. Throughput: 0: 1649.8, 1: 1658.6. Samples: 36904638. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:05:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:05:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000072000_73728000.pth... +[2023-10-14 08:05:38,525][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000072128_73859072.pth... +[2023-10-14 08:05:38,554][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000070432_72122368.pth +[2023-10-14 08:05:38,559][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000070560_72253440.pth +[2023-10-14 08:05:40,611][100936] Updated weights for policy 0, policy_version 72010 (0.0011) +[2023-10-14 08:05:40,984][100936] Updated weights for policy 0, policy_version 72020 (0.0009) +[2023-10-14 08:05:41,356][100936] Updated weights for policy 0, policy_version 72030 (0.0011) +[2023-10-14 08:05:42,078][100917] Updated weights for policy 1, policy_version 72132 (0.0010) +[2023-10-14 08:05:42,448][100917] Updated weights for policy 1, policy_version 72142 (0.0009) +[2023-10-14 08:05:42,823][100917] Updated weights for policy 1, policy_version 72152 (0.0009) +[2023-10-14 08:05:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 147652608. Throughput: 0: 1646.0, 1: 1671.8. Samples: 36914676. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:05:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:05:45,859][100936] Updated weights for policy 0, policy_version 72040 (0.0008) +[2023-10-14 08:05:46,235][100936] Updated weights for policy 0, policy_version 72050 (0.0007) +[2023-10-14 08:05:46,606][100936] Updated weights for policy 0, policy_version 72060 (0.0009) +[2023-10-14 08:05:46,908][100917] Updated weights for policy 1, policy_version 72162 (0.0008) +[2023-10-14 08:05:47,285][100917] Updated weights for policy 1, policy_version 72172 (0.0010) +[2023-10-14 08:05:47,662][100917] Updated weights for policy 1, policy_version 72182 (0.0010) +[2023-10-14 08:05:48,037][100917] Updated weights for policy 1, policy_version 72192 (0.0007) +[2023-10-14 08:05:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 147718144. Throughput: 0: 1644.5, 1: 1666.3. Samples: 36934626. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:05:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:05:50,609][100936] Updated weights for policy 0, policy_version 72070 (0.0010) +[2023-10-14 08:05:50,976][100936] Updated weights for policy 0, policy_version 72080 (0.0008) +[2023-10-14 08:05:51,339][100936] Updated weights for policy 0, policy_version 72090 (0.0009) +[2023-10-14 08:05:52,254][100917] Updated weights for policy 1, policy_version 72202 (0.0008) +[2023-10-14 08:05:52,623][100917] Updated weights for policy 1, policy_version 72212 (0.0010) +[2023-10-14 08:05:52,994][100917] Updated weights for policy 1, policy_version 72222 (0.0010) +[2023-10-14 08:05:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 147783680. Throughput: 0: 1645.5, 1: 1655.4. Samples: 36953882. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:05:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:05:55,421][100936] Updated weights for policy 0, policy_version 72100 (0.0008) +[2023-10-14 08:05:55,792][100936] Updated weights for policy 0, policy_version 72110 (0.0010) +[2023-10-14 08:05:56,170][100936] Updated weights for policy 0, policy_version 72120 (0.0010) +[2023-10-14 08:05:57,072][100917] Updated weights for policy 1, policy_version 72232 (0.0011) +[2023-10-14 08:05:57,440][100917] Updated weights for policy 1, policy_version 72242 (0.0010) +[2023-10-14 08:05:57,816][100917] Updated weights for policy 1, policy_version 72252 (0.0008) +[2023-10-14 08:05:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 147849216. Throughput: 0: 1647.0, 1: 1662.7. Samples: 36963906. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:05:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:06:00,416][100936] Updated weights for policy 0, policy_version 72130 (0.0008) +[2023-10-14 08:06:00,783][100936] Updated weights for policy 0, policy_version 72140 (0.0009) +[2023-10-14 08:06:01,153][100936] Updated weights for policy 0, policy_version 72150 (0.0008) +[2023-10-14 08:06:01,521][100936] Updated weights for policy 0, policy_version 72160 (0.0007) +[2023-10-14 08:06:01,868][100917] Updated weights for policy 1, policy_version 72262 (0.0007) +[2023-10-14 08:06:02,238][100917] Updated weights for policy 1, policy_version 72272 (0.0007) +[2023-10-14 08:06:02,618][100917] Updated weights for policy 1, policy_version 72282 (0.0009) +[2023-10-14 08:06:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 147914752. Throughput: 0: 1649.4, 1: 1653.6. Samples: 36983908. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:06:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:06:05,605][100936] Updated weights for policy 0, policy_version 72170 (0.0008) +[2023-10-14 08:06:05,977][100936] Updated weights for policy 0, policy_version 72180 (0.0008) +[2023-10-14 08:06:06,342][100936] Updated weights for policy 0, policy_version 72190 (0.0009) +[2023-10-14 08:06:06,817][100917] Updated weights for policy 1, policy_version 72292 (0.0008) +[2023-10-14 08:06:07,213][100917] Updated weights for policy 1, policy_version 72302 (0.0008) +[2023-10-14 08:06:07,593][100917] Updated weights for policy 1, policy_version 72312 (0.0010) +[2023-10-14 08:06:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 147980288. Throughput: 0: 1656.1, 1: 1644.3. Samples: 37003666. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:06:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:06:10,315][100936] Updated weights for policy 0, policy_version 72200 (0.0009) +[2023-10-14 08:06:10,682][100936] Updated weights for policy 0, policy_version 72210 (0.0008) +[2023-10-14 08:06:11,048][100936] Updated weights for policy 0, policy_version 72220 (0.0008) +[2023-10-14 08:06:11,559][100917] Updated weights for policy 1, policy_version 72322 (0.0008) +[2023-10-14 08:06:11,922][100917] Updated weights for policy 1, policy_version 72332 (0.0007) +[2023-10-14 08:06:12,300][100917] Updated weights for policy 1, policy_version 72342 (0.0007) +[2023-10-14 08:06:12,664][100917] Updated weights for policy 1, policy_version 72352 (0.0008) +[2023-10-14 08:06:13,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 148045824. Throughput: 0: 1653.6, 1: 1663.6. Samples: 37014190. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:06:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:06:15,239][100936] Updated weights for policy 0, policy_version 72230 (0.0009) +[2023-10-14 08:06:15,608][100936] Updated weights for policy 0, policy_version 72240 (0.0007) +[2023-10-14 08:06:15,979][100936] Updated weights for policy 0, policy_version 72250 (0.0009) +[2023-10-14 08:06:16,730][100917] Updated weights for policy 1, policy_version 72362 (0.0009) +[2023-10-14 08:06:17,095][100917] Updated weights for policy 1, policy_version 72372 (0.0010) +[2023-10-14 08:06:17,465][100917] Updated weights for policy 1, policy_version 72382 (0.0008) +[2023-10-14 08:06:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 148111360. Throughput: 0: 1661.0, 1: 1650.2. Samples: 37034068. Policy #0 lag: (min: 6.0, avg: 10.7, max: 38.0) +[2023-10-14 08:06:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:06:19,843][100936] Updated weights for policy 0, policy_version 72260 (0.0008) +[2023-10-14 08:06:20,205][100936] Updated weights for policy 0, policy_version 72270 (0.0008) +[2023-10-14 08:06:20,574][100936] Updated weights for policy 0, policy_version 72280 (0.0009) +[2023-10-14 08:06:21,586][100917] Updated weights for policy 1, policy_version 72392 (0.0010) +[2023-10-14 08:06:21,962][100917] Updated weights for policy 1, policy_version 72402 (0.0007) +[2023-10-14 08:06:22,339][100917] Updated weights for policy 1, policy_version 72412 (0.0009) +[2023-10-14 08:06:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 148176896. Throughput: 0: 1665.5, 1: 1652.0. Samples: 37053924. Policy #0 lag: (min: 6.0, avg: 10.7, max: 38.0) +[2023-10-14 08:06:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:06:24,808][100936] Updated weights for policy 0, policy_version 72290 (0.0008) +[2023-10-14 08:06:25,180][100936] Updated weights for policy 0, policy_version 72300 (0.0009) +[2023-10-14 08:06:25,551][100936] Updated weights for policy 0, policy_version 72310 (0.0009) +[2023-10-14 08:06:25,919][100936] Updated weights for policy 0, policy_version 72320 (0.0010) +[2023-10-14 08:06:26,621][100917] Updated weights for policy 1, policy_version 72422 (0.0009) +[2023-10-14 08:06:26,983][100917] Updated weights for policy 1, policy_version 72432 (0.0009) +[2023-10-14 08:06:27,361][100917] Updated weights for policy 1, policy_version 72442 (0.0009) +[2023-10-14 08:06:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 148242432. Throughput: 0: 1664.0, 1: 1657.8. Samples: 37064156. Policy #0 lag: (min: 6.0, avg: 10.7, max: 38.0) +[2023-10-14 08:06:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:06:30,146][100936] Updated weights for policy 0, policy_version 72330 (0.0011) +[2023-10-14 08:06:30,512][100936] Updated weights for policy 0, policy_version 72340 (0.0009) +[2023-10-14 08:06:30,876][100936] Updated weights for policy 0, policy_version 72350 (0.0007) +[2023-10-14 08:06:31,473][100917] Updated weights for policy 1, policy_version 72452 (0.0010) +[2023-10-14 08:06:31,843][100917] Updated weights for policy 1, policy_version 72462 (0.0008) +[2023-10-14 08:06:32,225][100917] Updated weights for policy 1, policy_version 72472 (0.0009) +[2023-10-14 08:06:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 148307968. Throughput: 0: 1669.1, 1: 1645.4. Samples: 37083782. Policy #0 lag: (min: 6.0, avg: 10.7, max: 38.0) +[2023-10-14 08:06:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:06:35,088][100936] Updated weights for policy 0, policy_version 72360 (0.0010) +[2023-10-14 08:06:35,457][100936] Updated weights for policy 0, policy_version 72370 (0.0010) +[2023-10-14 08:06:35,832][100936] Updated weights for policy 0, policy_version 72380 (0.0007) +[2023-10-14 08:06:36,165][100917] Updated weights for policy 1, policy_version 72482 (0.0009) +[2023-10-14 08:06:36,538][100917] Updated weights for policy 1, policy_version 72492 (0.0011) +[2023-10-14 08:06:36,925][100917] Updated weights for policy 1, policy_version 72502 (0.0007) +[2023-10-14 08:06:37,296][100917] Updated weights for policy 1, policy_version 72512 (0.0009) +[2023-10-14 08:06:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 148373504. Throughput: 0: 1666.7, 1: 1660.6. Samples: 37103610. Policy #0 lag: (min: 6.0, avg: 10.7, max: 38.0) +[2023-10-14 08:06:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:06:39,983][100936] Updated weights for policy 0, policy_version 72390 (0.0007) +[2023-10-14 08:06:40,370][100936] Updated weights for policy 0, policy_version 72400 (0.0008) +[2023-10-14 08:06:40,745][100936] Updated weights for policy 0, policy_version 72410 (0.0010) +[2023-10-14 08:06:41,469][100917] Updated weights for policy 1, policy_version 72522 (0.0008) +[2023-10-14 08:06:41,847][100917] Updated weights for policy 1, policy_version 72532 (0.0008) +[2023-10-14 08:06:42,228][100917] Updated weights for policy 1, policy_version 72542 (0.0008) +[2023-10-14 08:06:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 148439040. Throughput: 0: 1660.4, 1: 1668.5. Samples: 37113706. Policy #0 lag: (min: 6.0, avg: 10.7, max: 38.0) +[2023-10-14 08:06:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:06:44,943][100936] Updated weights for policy 0, policy_version 72420 (0.0008) +[2023-10-14 08:06:45,314][100936] Updated weights for policy 0, policy_version 72430 (0.0010) +[2023-10-14 08:06:45,678][100936] Updated weights for policy 0, policy_version 72440 (0.0008) +[2023-10-14 08:06:46,409][100917] Updated weights for policy 1, policy_version 72552 (0.0009) +[2023-10-14 08:06:46,778][100917] Updated weights for policy 1, policy_version 72562 (0.0010) +[2023-10-14 08:06:47,153][100917] Updated weights for policy 1, policy_version 72572 (0.0010) +[2023-10-14 08:06:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 148504576. Throughput: 0: 1664.2, 1: 1652.5. Samples: 37133158. Policy #0 lag: (min: 6.0, avg: 10.7, max: 38.0) +[2023-10-14 08:06:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:06:49,675][100936] Updated weights for policy 0, policy_version 72450 (0.0009) +[2023-10-14 08:06:50,046][100936] Updated weights for policy 0, policy_version 72460 (0.0010) +[2023-10-14 08:06:50,420][100936] Updated weights for policy 0, policy_version 72470 (0.0009) +[2023-10-14 08:06:50,787][100936] Updated weights for policy 0, policy_version 72480 (0.0010) +[2023-10-14 08:06:51,503][100917] Updated weights for policy 1, policy_version 72582 (0.0009) +[2023-10-14 08:06:51,884][100917] Updated weights for policy 1, policy_version 72592 (0.0010) +[2023-10-14 08:06:52,258][100917] Updated weights for policy 1, policy_version 72602 (0.0007) +[2023-10-14 08:06:53,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 148570112. Throughput: 0: 1663.3, 1: 1658.3. Samples: 37153140. Policy #0 lag: (min: 6.0, avg: 10.7, max: 38.0) +[2023-10-14 08:06:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:06:54,901][100936] Updated weights for policy 0, policy_version 72490 (0.0009) +[2023-10-14 08:06:55,268][100936] Updated weights for policy 0, policy_version 72500 (0.0007) +[2023-10-14 08:06:55,639][100936] Updated weights for policy 0, policy_version 72510 (0.0007) +[2023-10-14 08:06:56,399][100917] Updated weights for policy 1, policy_version 72612 (0.0009) +[2023-10-14 08:06:56,774][100917] Updated weights for policy 1, policy_version 72622 (0.0008) +[2023-10-14 08:06:57,149][100917] Updated weights for policy 1, policy_version 72632 (0.0007) +[2023-10-14 08:06:58,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 148635648. Throughput: 0: 1662.5, 1: 1655.8. Samples: 37163510. Policy #0 lag: (min: 6.0, avg: 10.7, max: 38.0) +[2023-10-14 08:06:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:06:59,683][100936] Updated weights for policy 0, policy_version 72520 (0.0009) +[2023-10-14 08:07:00,049][100936] Updated weights for policy 0, policy_version 72530 (0.0008) +[2023-10-14 08:07:00,416][100936] Updated weights for policy 0, policy_version 72540 (0.0008) +[2023-10-14 08:07:01,203][100917] Updated weights for policy 1, policy_version 72642 (0.0007) +[2023-10-14 08:07:01,571][100917] Updated weights for policy 1, policy_version 72652 (0.0007) +[2023-10-14 08:07:01,947][100917] Updated weights for policy 1, policy_version 72662 (0.0007) +[2023-10-14 08:07:02,315][100917] Updated weights for policy 1, policy_version 72672 (0.0007) +[2023-10-14 08:07:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 148701184. Throughput: 0: 1660.8, 1: 1653.8. Samples: 37183226. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:07:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:07:04,550][100936] Updated weights for policy 0, policy_version 72550 (0.0007) +[2023-10-14 08:07:04,928][100936] Updated weights for policy 0, policy_version 72560 (0.0007) +[2023-10-14 08:07:05,306][100936] Updated weights for policy 0, policy_version 72570 (0.0008) +[2023-10-14 08:07:06,146][100917] Updated weights for policy 1, policy_version 72682 (0.0009) +[2023-10-14 08:07:06,512][100917] Updated weights for policy 1, policy_version 72692 (0.0010) +[2023-10-14 08:07:06,903][100917] Updated weights for policy 1, policy_version 72702 (0.0010) +[2023-10-14 08:07:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 148766720. Throughput: 0: 1654.6, 1: 1667.8. Samples: 37203434. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:07:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:07:09,473][100936] Updated weights for policy 0, policy_version 72580 (0.0007) +[2023-10-14 08:07:09,839][100936] Updated weights for policy 0, policy_version 72590 (0.0010) +[2023-10-14 08:07:10,216][100936] Updated weights for policy 0, policy_version 72600 (0.0008) +[2023-10-14 08:07:10,868][100917] Updated weights for policy 1, policy_version 72712 (0.0008) +[2023-10-14 08:07:11,230][100917] Updated weights for policy 1, policy_version 72722 (0.0008) +[2023-10-14 08:07:11,600][100917] Updated weights for policy 1, policy_version 72732 (0.0007) +[2023-10-14 08:07:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 148832256. Throughput: 0: 1656.0, 1: 1656.2. Samples: 37213206. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:07:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:07:14,229][100936] Updated weights for policy 0, policy_version 72610 (0.0009) +[2023-10-14 08:07:14,604][100936] Updated weights for policy 0, policy_version 72620 (0.0010) +[2023-10-14 08:07:14,976][100936] Updated weights for policy 0, policy_version 72630 (0.0007) +[2023-10-14 08:07:15,344][100936] Updated weights for policy 0, policy_version 72640 (0.0010) +[2023-10-14 08:07:15,830][100917] Updated weights for policy 1, policy_version 72742 (0.0008) +[2023-10-14 08:07:16,203][100917] Updated weights for policy 1, policy_version 72752 (0.0008) +[2023-10-14 08:07:16,581][100917] Updated weights for policy 1, policy_version 72762 (0.0008) +[2023-10-14 08:07:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 148897792. Throughput: 0: 1659.2, 1: 1651.2. Samples: 37232754. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:07:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:07:19,444][100936] Updated weights for policy 0, policy_version 72650 (0.0008) +[2023-10-14 08:07:19,815][100936] Updated weights for policy 0, policy_version 72660 (0.0007) +[2023-10-14 08:07:20,183][100936] Updated weights for policy 0, policy_version 72670 (0.0007) +[2023-10-14 08:07:20,493][100917] Updated weights for policy 1, policy_version 72772 (0.0010) +[2023-10-14 08:07:20,862][100917] Updated weights for policy 1, policy_version 72782 (0.0008) +[2023-10-14 08:07:21,232][100917] Updated weights for policy 1, policy_version 72792 (0.0010) +[2023-10-14 08:07:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 148963328. Throughput: 0: 1665.1, 1: 1661.2. Samples: 37253296. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:07:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:07:24,310][100936] Updated weights for policy 0, policy_version 72680 (0.0011) +[2023-10-14 08:07:24,686][100936] Updated weights for policy 0, policy_version 72690 (0.0007) +[2023-10-14 08:07:25,047][100936] Updated weights for policy 0, policy_version 72700 (0.0011) +[2023-10-14 08:07:25,413][100917] Updated weights for policy 1, policy_version 72802 (0.0010) +[2023-10-14 08:07:25,784][100917] Updated weights for policy 1, policy_version 72812 (0.0009) +[2023-10-14 08:07:26,157][100917] Updated weights for policy 1, policy_version 72822 (0.0010) +[2023-10-14 08:07:26,521][100917] Updated weights for policy 1, policy_version 72832 (0.0009) +[2023-10-14 08:07:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 149028864. Throughput: 0: 1668.4, 1: 1645.4. Samples: 37262828. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:07:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:07:29,037][100936] Updated weights for policy 0, policy_version 72710 (0.0009) +[2023-10-14 08:07:29,414][100936] Updated weights for policy 0, policy_version 72720 (0.0008) +[2023-10-14 08:07:29,785][100936] Updated weights for policy 0, policy_version 72730 (0.0009) +[2023-10-14 08:07:30,879][100917] Updated weights for policy 1, policy_version 72842 (0.0010) +[2023-10-14 08:07:31,256][100917] Updated weights for policy 1, policy_version 72852 (0.0010) +[2023-10-14 08:07:31,626][100917] Updated weights for policy 1, policy_version 72862 (0.0008) +[2023-10-14 08:07:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 149094400. Throughput: 0: 1666.4, 1: 1650.3. Samples: 37282410. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:07:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:07:33,923][100936] Updated weights for policy 0, policy_version 72740 (0.0009) +[2023-10-14 08:07:34,290][100936] Updated weights for policy 0, policy_version 72750 (0.0008) +[2023-10-14 08:07:34,652][100936] Updated weights for policy 0, policy_version 72760 (0.0010) +[2023-10-14 08:07:35,709][100917] Updated weights for policy 1, policy_version 72872 (0.0007) +[2023-10-14 08:07:36,092][100917] Updated weights for policy 1, policy_version 72882 (0.0007) +[2023-10-14 08:07:36,463][100917] Updated weights for policy 1, policy_version 72892 (0.0009) +[2023-10-14 08:07:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 149159936. Throughput: 0: 1661.4, 1: 1664.7. Samples: 37302816. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:07:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:07:38,523][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000072896_74645504.pth... +[2023-10-14 08:07:38,523][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000072768_74514432.pth... +[2023-10-14 08:07:38,558][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000071232_72941568.pth +[2023-10-14 08:07:38,562][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000071360_73072640.pth +[2023-10-14 08:07:38,896][100936] Updated weights for policy 0, policy_version 72770 (0.0008) +[2023-10-14 08:07:39,259][100936] Updated weights for policy 0, policy_version 72780 (0.0009) +[2023-10-14 08:07:39,635][100936] Updated weights for policy 0, policy_version 72790 (0.0008) +[2023-10-14 08:07:40,002][100936] Updated weights for policy 0, policy_version 72800 (0.0007) +[2023-10-14 08:07:40,630][100917] Updated weights for policy 1, policy_version 72902 (0.0011) +[2023-10-14 08:07:41,005][100917] Updated weights for policy 1, policy_version 72912 (0.0009) +[2023-10-14 08:07:41,381][100917] Updated weights for policy 1, policy_version 72922 (0.0010) +[2023-10-14 08:07:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 149225472. Throughput: 0: 1662.3, 1: 1651.4. Samples: 37312626. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:07:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:07:44,119][100936] Updated weights for policy 0, policy_version 72810 (0.0010) +[2023-10-14 08:07:44,487][100936] Updated weights for policy 0, policy_version 72820 (0.0009) +[2023-10-14 08:07:44,853][100936] Updated weights for policy 0, policy_version 72830 (0.0009) +[2023-10-14 08:07:45,588][100917] Updated weights for policy 1, policy_version 72932 (0.0009) +[2023-10-14 08:07:45,970][100917] Updated weights for policy 1, policy_version 72942 (0.0009) +[2023-10-14 08:07:46,343][100917] Updated weights for policy 1, policy_version 72952 (0.0007) +[2023-10-14 08:07:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 149291008. Throughput: 0: 1662.9, 1: 1647.5. Samples: 37332192. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:07:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:07:48,923][100936] Updated weights for policy 0, policy_version 72840 (0.0008) +[2023-10-14 08:07:49,292][100936] Updated weights for policy 0, policy_version 72850 (0.0007) +[2023-10-14 08:07:49,666][100936] Updated weights for policy 0, policy_version 72860 (0.0007) +[2023-10-14 08:07:50,098][100917] Updated weights for policy 1, policy_version 72962 (0.0008) +[2023-10-14 08:07:50,474][100917] Updated weights for policy 1, policy_version 72972 (0.0010) +[2023-10-14 08:07:50,838][100917] Updated weights for policy 1, policy_version 72982 (0.0008) +[2023-10-14 08:07:51,214][100917] Updated weights for policy 1, policy_version 72992 (0.0008) +[2023-10-14 08:07:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 149356544. Throughput: 0: 1665.5, 1: 1654.0. Samples: 37352808. Policy #0 lag: (min: 9.0, avg: 26.6, max: 41.0) +[2023-10-14 08:07:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:07:53,832][100936] Updated weights for policy 0, policy_version 72870 (0.0008) +[2023-10-14 08:07:54,192][100936] Updated weights for policy 0, policy_version 72880 (0.0010) +[2023-10-14 08:07:54,566][100936] Updated weights for policy 0, policy_version 72890 (0.0009) +[2023-10-14 08:07:55,664][100917] Updated weights for policy 1, policy_version 73002 (0.0009) +[2023-10-14 08:07:56,040][100917] Updated weights for policy 1, policy_version 73012 (0.0011) +[2023-10-14 08:07:56,413][100917] Updated weights for policy 1, policy_version 73022 (0.0011) +[2023-10-14 08:07:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 149422080. Throughput: 0: 1666.9, 1: 1650.8. Samples: 37362502. Policy #0 lag: (min: 9.0, avg: 26.6, max: 41.0) +[2023-10-14 08:07:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:07:58,800][100936] Updated weights for policy 0, policy_version 72900 (0.0008) +[2023-10-14 08:07:59,181][100936] Updated weights for policy 0, policy_version 72910 (0.0009) +[2023-10-14 08:07:59,547][100936] Updated weights for policy 0, policy_version 72920 (0.0008) +[2023-10-14 08:08:00,567][100917] Updated weights for policy 1, policy_version 73032 (0.0008) +[2023-10-14 08:08:00,930][100917] Updated weights for policy 1, policy_version 73042 (0.0008) +[2023-10-14 08:08:01,297][100917] Updated weights for policy 1, policy_version 73052 (0.0011) +[2023-10-14 08:08:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 149487616. Throughput: 0: 1658.0, 1: 1659.5. Samples: 37382042. Policy #0 lag: (min: 9.0, avg: 26.6, max: 41.0) +[2023-10-14 08:08:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:08:03,799][100936] Updated weights for policy 0, policy_version 72930 (0.0009) +[2023-10-14 08:08:04,171][100936] Updated weights for policy 0, policy_version 72940 (0.0009) +[2023-10-14 08:08:04,537][100936] Updated weights for policy 0, policy_version 72950 (0.0011) +[2023-10-14 08:08:04,899][100936] Updated weights for policy 0, policy_version 72960 (0.0008) +[2023-10-14 08:08:05,395][100917] Updated weights for policy 1, policy_version 73062 (0.0010) +[2023-10-14 08:08:05,768][100917] Updated weights for policy 1, policy_version 73072 (0.0009) +[2023-10-14 08:08:06,142][100917] Updated weights for policy 1, policy_version 73082 (0.0010) +[2023-10-14 08:08:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 149553152. Throughput: 0: 1652.8, 1: 1654.3. Samples: 37402118. Policy #0 lag: (min: 9.0, avg: 26.6, max: 41.0) +[2023-10-14 08:08:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:08:08,999][100936] Updated weights for policy 0, policy_version 72970 (0.0008) +[2023-10-14 08:08:09,369][100936] Updated weights for policy 0, policy_version 72980 (0.0009) +[2023-10-14 08:08:09,748][100936] Updated weights for policy 0, policy_version 72990 (0.0008) +[2023-10-14 08:08:10,289][100917] Updated weights for policy 1, policy_version 73092 (0.0010) +[2023-10-14 08:08:10,649][100917] Updated weights for policy 1, policy_version 73102 (0.0008) +[2023-10-14 08:08:11,021][100917] Updated weights for policy 1, policy_version 73112 (0.0008) +[2023-10-14 08:08:13,512][99942] Fps is (10 sec: 13107.7, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 149618688. Throughput: 0: 1657.0, 1: 1651.1. Samples: 37411694. Policy #0 lag: (min: 9.0, avg: 26.6, max: 41.0) +[2023-10-14 08:08:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:08:13,813][100936] Updated weights for policy 0, policy_version 73000 (0.0007) +[2023-10-14 08:08:14,191][100936] Updated weights for policy 0, policy_version 73010 (0.0009) +[2023-10-14 08:08:14,568][100936] Updated weights for policy 0, policy_version 73020 (0.0011) +[2023-10-14 08:08:15,200][100917] Updated weights for policy 1, policy_version 73122 (0.0010) +[2023-10-14 08:08:15,578][100917] Updated weights for policy 1, policy_version 73132 (0.0011) +[2023-10-14 08:08:15,965][100917] Updated weights for policy 1, policy_version 73142 (0.0007) +[2023-10-14 08:08:16,331][100917] Updated weights for policy 1, policy_version 73152 (0.0008) +[2023-10-14 08:08:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 149684224. Throughput: 0: 1658.1, 1: 1656.9. Samples: 37431588. Policy #0 lag: (min: 9.0, avg: 26.6, max: 41.0) +[2023-10-14 08:08:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:08:18,696][100936] Updated weights for policy 0, policy_version 73030 (0.0010) +[2023-10-14 08:08:19,063][100936] Updated weights for policy 0, policy_version 73040 (0.0008) +[2023-10-14 08:08:19,438][100936] Updated weights for policy 0, policy_version 73050 (0.0008) +[2023-10-14 08:08:20,377][100917] Updated weights for policy 1, policy_version 73162 (0.0007) +[2023-10-14 08:08:20,749][100917] Updated weights for policy 1, policy_version 73172 (0.0008) +[2023-10-14 08:08:21,131][100917] Updated weights for policy 1, policy_version 73182 (0.0008) +[2023-10-14 08:08:23,446][100936] Updated weights for policy 0, policy_version 73060 (0.0007) +[2023-10-14 08:08:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 149749760. Throughput: 0: 1664.1, 1: 1655.9. Samples: 37452218. Policy #0 lag: (min: 9.0, avg: 26.6, max: 41.0) +[2023-10-14 08:08:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:08:23,811][100936] Updated weights for policy 0, policy_version 73070 (0.0008) +[2023-10-14 08:08:24,176][100936] Updated weights for policy 0, policy_version 73080 (0.0010) +[2023-10-14 08:08:25,146][100917] Updated weights for policy 1, policy_version 73192 (0.0009) +[2023-10-14 08:08:25,526][100917] Updated weights for policy 1, policy_version 73202 (0.0008) +[2023-10-14 08:08:25,907][100917] Updated weights for policy 1, policy_version 73212 (0.0008) +[2023-10-14 08:08:28,312][100936] Updated weights for policy 0, policy_version 73090 (0.0009) +[2023-10-14 08:08:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 149815296. Throughput: 0: 1663.2, 1: 1644.3. Samples: 37461464. Policy #0 lag: (min: 9.0, avg: 26.6, max: 41.0) +[2023-10-14 08:08:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:08:28,687][100936] Updated weights for policy 0, policy_version 73100 (0.0009) +[2023-10-14 08:08:29,054][100936] Updated weights for policy 0, policy_version 73110 (0.0007) +[2023-10-14 08:08:29,419][100936] Updated weights for policy 0, policy_version 73120 (0.0007) +[2023-10-14 08:08:29,990][100917] Updated weights for policy 1, policy_version 73222 (0.0008) +[2023-10-14 08:08:30,371][100917] Updated weights for policy 1, policy_version 73232 (0.0008) +[2023-10-14 08:08:30,746][100917] Updated weights for policy 1, policy_version 73242 (0.0009) +[2023-10-14 08:08:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 149880832. Throughput: 0: 1666.5, 1: 1655.2. Samples: 37481668. Policy #0 lag: (min: 9.0, avg: 26.6, max: 41.0) +[2023-10-14 08:08:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:08:33,548][100936] Updated weights for policy 0, policy_version 73130 (0.0009) +[2023-10-14 08:08:33,906][100936] Updated weights for policy 0, policy_version 73140 (0.0008) +[2023-10-14 08:08:34,287][100936] Updated weights for policy 0, policy_version 73150 (0.0011) +[2023-10-14 08:08:34,939][100917] Updated weights for policy 1, policy_version 73252 (0.0010) +[2023-10-14 08:08:35,317][100917] Updated weights for policy 1, policy_version 73262 (0.0008) +[2023-10-14 08:08:35,688][100917] Updated weights for policy 1, policy_version 73272 (0.0011) +[2023-10-14 08:08:38,300][100936] Updated weights for policy 0, policy_version 73160 (0.0009) +[2023-10-14 08:08:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 149946368. Throughput: 0: 1657.0, 1: 1653.1. Samples: 37501762. Policy #0 lag: (min: 9.0, avg: 26.6, max: 41.0) +[2023-10-14 08:08:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:08:38,672][100936] Updated weights for policy 0, policy_version 73170 (0.0008) +[2023-10-14 08:08:39,037][100936] Updated weights for policy 0, policy_version 73180 (0.0009) +[2023-10-14 08:08:39,744][100917] Updated weights for policy 1, policy_version 73282 (0.0007) +[2023-10-14 08:08:40,114][100917] Updated weights for policy 1, policy_version 73292 (0.0010) +[2023-10-14 08:08:40,483][100917] Updated weights for policy 1, policy_version 73302 (0.0011) +[2023-10-14 08:08:40,854][100917] Updated weights for policy 1, policy_version 73312 (0.0008) +[2023-10-14 08:08:43,123][100936] Updated weights for policy 0, policy_version 73190 (0.0008) +[2023-10-14 08:08:43,490][100936] Updated weights for policy 0, policy_version 73200 (0.0009) +[2023-10-14 08:08:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 150011904. Throughput: 0: 1666.8, 1: 1639.4. Samples: 37511282. Policy #0 lag: (min: 19.0, avg: 20.3, max: 43.0) +[2023-10-14 08:08:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:08:43,862][100936] Updated weights for policy 0, policy_version 73210 (0.0008) +[2023-10-14 08:08:45,026][100917] Updated weights for policy 1, policy_version 73322 (0.0008) +[2023-10-14 08:08:45,384][100917] Updated weights for policy 1, policy_version 73332 (0.0008) +[2023-10-14 08:08:45,753][100917] Updated weights for policy 1, policy_version 73342 (0.0010) +[2023-10-14 08:08:48,090][100936] Updated weights for policy 0, policy_version 73220 (0.0010) +[2023-10-14 08:08:48,460][100936] Updated weights for policy 0, policy_version 73230 (0.0009) +[2023-10-14 08:08:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 150077440. Throughput: 0: 1669.0, 1: 1653.3. Samples: 37531548. Policy #0 lag: (min: 19.0, avg: 20.3, max: 43.0) +[2023-10-14 08:08:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:08:48,825][100936] Updated weights for policy 0, policy_version 73240 (0.0007) +[2023-10-14 08:08:49,976][100917] Updated weights for policy 1, policy_version 73352 (0.0009) +[2023-10-14 08:08:50,344][100917] Updated weights for policy 1, policy_version 73362 (0.0007) +[2023-10-14 08:08:50,732][100917] Updated weights for policy 1, policy_version 73372 (0.0007) +[2023-10-14 08:08:52,876][100936] Updated weights for policy 0, policy_version 73250 (0.0007) +[2023-10-14 08:08:53,249][100936] Updated weights for policy 0, policy_version 73260 (0.0008) +[2023-10-14 08:08:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 150142976. Throughput: 0: 1660.8, 1: 1657.6. Samples: 37551446. Policy #0 lag: (min: 19.0, avg: 20.3, max: 43.0) +[2023-10-14 08:08:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:08:53,616][100936] Updated weights for policy 0, policy_version 73270 (0.0007) +[2023-10-14 08:08:53,986][100936] Updated weights for policy 0, policy_version 73280 (0.0009) +[2023-10-14 08:08:54,827][100917] Updated weights for policy 1, policy_version 73382 (0.0009) +[2023-10-14 08:08:55,206][100917] Updated weights for policy 1, policy_version 73392 (0.0009) +[2023-10-14 08:08:55,572][100917] Updated weights for policy 1, policy_version 73402 (0.0009) +[2023-10-14 08:08:58,117][100936] Updated weights for policy 0, policy_version 73290 (0.0008) +[2023-10-14 08:08:58,493][100936] Updated weights for policy 0, policy_version 73300 (0.0008) +[2023-10-14 08:08:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 150208512. Throughput: 0: 1671.0, 1: 1648.0. Samples: 37561052. Policy #0 lag: (min: 19.0, avg: 20.3, max: 43.0) +[2023-10-14 08:08:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:08:58,856][100936] Updated weights for policy 0, policy_version 73310 (0.0008) +[2023-10-14 08:08:59,676][100917] Updated weights for policy 1, policy_version 73412 (0.0008) +[2023-10-14 08:09:00,052][100917] Updated weights for policy 1, policy_version 73422 (0.0007) +[2023-10-14 08:09:00,427][100917] Updated weights for policy 1, policy_version 73432 (0.0008) +[2023-10-14 08:09:03,042][100936] Updated weights for policy 0, policy_version 73320 (0.0009) +[2023-10-14 08:09:03,414][100936] Updated weights for policy 0, policy_version 73330 (0.0009) +[2023-10-14 08:09:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 150274048. Throughput: 0: 1671.2, 1: 1660.3. Samples: 37581504. Policy #0 lag: (min: 19.0, avg: 20.3, max: 43.0) +[2023-10-14 08:09:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:09:03,785][100936] Updated weights for policy 0, policy_version 73340 (0.0009) +[2023-10-14 08:09:04,557][100917] Updated weights for policy 1, policy_version 73442 (0.0010) +[2023-10-14 08:09:04,921][100917] Updated weights for policy 1, policy_version 73452 (0.0009) +[2023-10-14 08:09:05,292][100917] Updated weights for policy 1, policy_version 73462 (0.0010) +[2023-10-14 08:09:05,663][100917] Updated weights for policy 1, policy_version 73472 (0.0011) +[2023-10-14 08:09:07,824][100936] Updated weights for policy 0, policy_version 73350 (0.0008) +[2023-10-14 08:09:08,196][100936] Updated weights for policy 0, policy_version 73360 (0.0009) +[2023-10-14 08:09:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 150339584. Throughput: 0: 1650.2, 1: 1661.7. Samples: 37601254. Policy #0 lag: (min: 19.0, avg: 20.3, max: 43.0) +[2023-10-14 08:09:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:09:08,557][100936] Updated weights for policy 0, policy_version 73370 (0.0009) +[2023-10-14 08:09:09,629][100917] Updated weights for policy 1, policy_version 73482 (0.0009) +[2023-10-14 08:09:10,016][100917] Updated weights for policy 1, policy_version 73492 (0.0009) +[2023-10-14 08:09:10,391][100917] Updated weights for policy 1, policy_version 73502 (0.0009) +[2023-10-14 08:09:12,672][100936] Updated weights for policy 0, policy_version 73380 (0.0008) +[2023-10-14 08:09:13,048][100936] Updated weights for policy 0, policy_version 73390 (0.0008) +[2023-10-14 08:09:13,421][100936] Updated weights for policy 0, policy_version 73400 (0.0009) +[2023-10-14 08:09:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 150405120. Throughput: 0: 1667.9, 1: 1656.7. Samples: 37611070. Policy #0 lag: (min: 19.0, avg: 20.3, max: 43.0) +[2023-10-14 08:09:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:09:14,631][100917] Updated weights for policy 1, policy_version 73512 (0.0009) +[2023-10-14 08:09:15,005][100917] Updated weights for policy 1, policy_version 73522 (0.0008) +[2023-10-14 08:09:15,389][100917] Updated weights for policy 1, policy_version 73532 (0.0009) +[2023-10-14 08:09:17,535][100936] Updated weights for policy 0, policy_version 73410 (0.0008) +[2023-10-14 08:09:17,906][100936] Updated weights for policy 0, policy_version 73420 (0.0009) +[2023-10-14 08:09:18,267][100936] Updated weights for policy 0, policy_version 73430 (0.0008) +[2023-10-14 08:09:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 150470656. Throughput: 0: 1665.5, 1: 1661.8. Samples: 37631394. Policy #0 lag: (min: 19.0, avg: 20.3, max: 43.0) +[2023-10-14 08:09:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:09:18,635][100936] Updated weights for policy 0, policy_version 73440 (0.0008) +[2023-10-14 08:09:19,522][100917] Updated weights for policy 1, policy_version 73542 (0.0009) +[2023-10-14 08:09:19,909][100917] Updated weights for policy 1, policy_version 73552 (0.0010) +[2023-10-14 08:09:20,291][100917] Updated weights for policy 1, policy_version 73562 (0.0010) +[2023-10-14 08:09:22,846][100936] Updated weights for policy 0, policy_version 73450 (0.0007) +[2023-10-14 08:09:23,210][100936] Updated weights for policy 0, policy_version 73460 (0.0007) +[2023-10-14 08:09:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 150536192. Throughput: 0: 1653.0, 1: 1658.4. Samples: 37650778. Policy #0 lag: (min: 19.0, avg: 20.3, max: 43.0) +[2023-10-14 08:09:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:09:23,583][100936] Updated weights for policy 0, policy_version 73470 (0.0009) +[2023-10-14 08:09:24,470][100917] Updated weights for policy 1, policy_version 73572 (0.0007) +[2023-10-14 08:09:24,851][100917] Updated weights for policy 1, policy_version 73582 (0.0009) +[2023-10-14 08:09:25,221][100917] Updated weights for policy 1, policy_version 73592 (0.0009) +[2023-10-14 08:09:27,668][100936] Updated weights for policy 0, policy_version 73480 (0.0007) +[2023-10-14 08:09:28,039][100936] Updated weights for policy 0, policy_version 73490 (0.0010) +[2023-10-14 08:09:28,420][100936] Updated weights for policy 0, policy_version 73500 (0.0010) +[2023-10-14 08:09:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 150601728. Throughput: 0: 1660.8, 1: 1659.4. Samples: 37660690. Policy #0 lag: (min: 19.0, avg: 20.3, max: 43.0) +[2023-10-14 08:09:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:09:29,378][100917] Updated weights for policy 1, policy_version 73602 (0.0008) +[2023-10-14 08:09:29,753][100917] Updated weights for policy 1, policy_version 73612 (0.0009) +[2023-10-14 08:09:30,111][100917] Updated weights for policy 1, policy_version 73622 (0.0011) +[2023-10-14 08:09:30,487][100917] Updated weights for policy 1, policy_version 73632 (0.0010) +[2023-10-14 08:09:32,409][100936] Updated weights for policy 0, policy_version 73510 (0.0008) +[2023-10-14 08:09:32,783][100936] Updated weights for policy 0, policy_version 73520 (0.0007) +[2023-10-14 08:09:33,148][100936] Updated weights for policy 0, policy_version 73530 (0.0008) +[2023-10-14 08:09:33,512][99942] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 150700032. Throughput: 0: 1660.4, 1: 1662.8. Samples: 37681090. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:09:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:09:34,561][100917] Updated weights for policy 1, policy_version 73642 (0.0011) +[2023-10-14 08:09:34,947][100917] Updated weights for policy 1, policy_version 73652 (0.0010) +[2023-10-14 08:09:35,326][100917] Updated weights for policy 1, policy_version 73662 (0.0007) +[2023-10-14 08:09:37,162][100936] Updated weights for policy 0, policy_version 73540 (0.0008) +[2023-10-14 08:09:37,527][100936] Updated weights for policy 0, policy_version 73550 (0.0008) +[2023-10-14 08:09:37,898][100936] Updated weights for policy 0, policy_version 73560 (0.0010) +[2023-10-14 08:09:38,512][99942] Fps is (10 sec: 16383.3, 60 sec: 13653.2, 300 sec: 13329.3). Total num frames: 150765568. Throughput: 0: 1651.4, 1: 1660.0. Samples: 37700458. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:09:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:09:38,527][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000073568_75333632.pth... +[2023-10-14 08:09:38,528][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000073664_75431936.pth... +[2023-10-14 08:09:38,563][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000072128_73859072.pth +[2023-10-14 08:09:38,565][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000072000_73728000.pth +[2023-10-14 08:09:39,553][100917] Updated weights for policy 1, policy_version 73672 (0.0007) +[2023-10-14 08:09:39,923][100917] Updated weights for policy 1, policy_version 73682 (0.0009) +[2023-10-14 08:09:40,311][100917] Updated weights for policy 1, policy_version 73692 (0.0010) +[2023-10-14 08:09:42,165][100936] Updated weights for policy 0, policy_version 73570 (0.0008) +[2023-10-14 08:09:42,537][100936] Updated weights for policy 0, policy_version 73580 (0.0009) +[2023-10-14 08:09:42,900][100936] Updated weights for policy 0, policy_version 73590 (0.0007) +[2023-10-14 08:09:43,270][100936] Updated weights for policy 0, policy_version 73600 (0.0007) +[2023-10-14 08:09:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 150831104. Throughput: 0: 1666.9, 1: 1659.1. Samples: 37710720. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:09:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:09:44,347][100917] Updated weights for policy 1, policy_version 73702 (0.0008) +[2023-10-14 08:09:44,724][100917] Updated weights for policy 1, policy_version 73712 (0.0008) +[2023-10-14 08:09:45,102][100917] Updated weights for policy 1, policy_version 73722 (0.0010) +[2023-10-14 08:09:47,397][100936] Updated weights for policy 0, policy_version 73610 (0.0008) +[2023-10-14 08:09:47,752][100936] Updated weights for policy 0, policy_version 73620 (0.0009) +[2023-10-14 08:09:48,124][100936] Updated weights for policy 0, policy_version 73630 (0.0009) +[2023-10-14 08:09:48,512][99942] Fps is (10 sec: 13107.9, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 150896640. Throughput: 0: 1660.0, 1: 1657.1. Samples: 37730776. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:09:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:09:49,287][100917] Updated weights for policy 1, policy_version 73732 (0.0007) +[2023-10-14 08:09:49,649][100917] Updated weights for policy 1, policy_version 73742 (0.0009) +[2023-10-14 08:09:50,024][100917] Updated weights for policy 1, policy_version 73752 (0.0008) +[2023-10-14 08:09:52,065][100936] Updated weights for policy 0, policy_version 73640 (0.0008) +[2023-10-14 08:09:52,443][100936] Updated weights for policy 0, policy_version 73650 (0.0008) +[2023-10-14 08:09:52,815][100936] Updated weights for policy 0, policy_version 73660 (0.0007) +[2023-10-14 08:09:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 150962176. Throughput: 0: 1657.2, 1: 1657.5. Samples: 37750418. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:09:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:09:54,157][100917] Updated weights for policy 1, policy_version 73762 (0.0009) +[2023-10-14 08:09:54,530][100917] Updated weights for policy 1, policy_version 73772 (0.0009) +[2023-10-14 08:09:54,899][100917] Updated weights for policy 1, policy_version 73782 (0.0007) +[2023-10-14 08:09:55,283][100917] Updated weights for policy 1, policy_version 73792 (0.0008) +[2023-10-14 08:09:57,046][100936] Updated weights for policy 0, policy_version 73670 (0.0008) +[2023-10-14 08:09:57,418][100936] Updated weights for policy 0, policy_version 73680 (0.0008) +[2023-10-14 08:09:57,790][100936] Updated weights for policy 0, policy_version 73690 (0.0008) +[2023-10-14 08:09:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 151027712. Throughput: 0: 1664.4, 1: 1655.6. Samples: 37760472. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:09:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:09:59,412][100917] Updated weights for policy 1, policy_version 73802 (0.0008) +[2023-10-14 08:09:59,786][100917] Updated weights for policy 1, policy_version 73812 (0.0010) +[2023-10-14 08:10:00,161][100917] Updated weights for policy 1, policy_version 73822 (0.0008) +[2023-10-14 08:10:01,900][100936] Updated weights for policy 0, policy_version 73700 (0.0009) +[2023-10-14 08:10:02,269][100936] Updated weights for policy 0, policy_version 73710 (0.0009) +[2023-10-14 08:10:02,634][100936] Updated weights for policy 0, policy_version 73720 (0.0008) +[2023-10-14 08:10:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 151093248. Throughput: 0: 1650.0, 1: 1656.8. Samples: 37780200. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:10:03,513][99942] Avg episode reward: [(0, '0.860'), (1, '1.000')] +[2023-10-14 08:10:04,305][100917] Updated weights for policy 1, policy_version 73832 (0.0007) +[2023-10-14 08:10:04,679][100917] Updated weights for policy 1, policy_version 73842 (0.0009) +[2023-10-14 08:10:05,050][100917] Updated weights for policy 1, policy_version 73852 (0.0011) +[2023-10-14 08:10:06,697][100936] Updated weights for policy 0, policy_version 73730 (0.0011) +[2023-10-14 08:10:07,068][100936] Updated weights for policy 0, policy_version 73740 (0.0009) +[2023-10-14 08:10:07,443][100936] Updated weights for policy 0, policy_version 73750 (0.0008) +[2023-10-14 08:10:07,813][100936] Updated weights for policy 0, policy_version 73760 (0.0008) +[2023-10-14 08:10:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 151158784. Throughput: 0: 1658.4, 1: 1654.9. Samples: 37799874. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:10:08,512][99942] Avg episode reward: [(0, '0.860'), (1, '1.000')] +[2023-10-14 08:10:09,303][100917] Updated weights for policy 1, policy_version 73862 (0.0008) +[2023-10-14 08:10:09,668][100917] Updated weights for policy 1, policy_version 73872 (0.0007) +[2023-10-14 08:10:10,048][100917] Updated weights for policy 1, policy_version 73882 (0.0008) +[2023-10-14 08:10:11,894][100936] Updated weights for policy 0, policy_version 73770 (0.0007) +[2023-10-14 08:10:12,254][100936] Updated weights for policy 0, policy_version 73780 (0.0007) +[2023-10-14 08:10:12,620][100936] Updated weights for policy 0, policy_version 73790 (0.0007) +[2023-10-14 08:10:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 151224320. Throughput: 0: 1667.2, 1: 1650.7. Samples: 37809998. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:10:13,513][99942] Avg episode reward: [(0, '0.860'), (1, '1.000')] +[2023-10-14 08:10:14,121][100917] Updated weights for policy 1, policy_version 73892 (0.0010) +[2023-10-14 08:10:14,491][100917] Updated weights for policy 1, policy_version 73902 (0.0008) +[2023-10-14 08:10:14,866][100917] Updated weights for policy 1, policy_version 73912 (0.0008) +[2023-10-14 08:10:16,791][100936] Updated weights for policy 0, policy_version 73800 (0.0008) +[2023-10-14 08:10:17,167][100936] Updated weights for policy 0, policy_version 73810 (0.0009) +[2023-10-14 08:10:17,532][100936] Updated weights for policy 0, policy_version 73820 (0.0007) +[2023-10-14 08:10:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 151289856. Throughput: 0: 1649.2, 1: 1647.2. Samples: 37829424. Policy #0 lag: (min: 30.0, avg: 38.0, max: 62.0) +[2023-10-14 08:10:18,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:10:19,084][100917] Updated weights for policy 1, policy_version 73922 (0.0010) +[2023-10-14 08:10:19,467][100917] Updated weights for policy 1, policy_version 73932 (0.0008) +[2023-10-14 08:10:19,828][100917] Updated weights for policy 1, policy_version 73942 (0.0009) +[2023-10-14 08:10:20,209][100917] Updated weights for policy 1, policy_version 73952 (0.0011) +[2023-10-14 08:10:21,554][100936] Updated weights for policy 0, policy_version 73830 (0.0008) +[2023-10-14 08:10:21,930][100936] Updated weights for policy 0, policy_version 73840 (0.0009) +[2023-10-14 08:10:22,296][100936] Updated weights for policy 0, policy_version 73850 (0.0008) +[2023-10-14 08:10:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 151355392. Throughput: 0: 1665.8, 1: 1650.9. Samples: 37849704. Policy #0 lag: (min: 30.0, avg: 38.0, max: 62.0) +[2023-10-14 08:10:23,512][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:10:24,191][100917] Updated weights for policy 1, policy_version 73962 (0.0008) +[2023-10-14 08:10:24,570][100917] Updated weights for policy 1, policy_version 73972 (0.0011) +[2023-10-14 08:10:24,949][100917] Updated weights for policy 1, policy_version 73982 (0.0010) +[2023-10-14 08:10:26,432][100936] Updated weights for policy 0, policy_version 73860 (0.0008) +[2023-10-14 08:10:26,792][100936] Updated weights for policy 0, policy_version 73870 (0.0010) +[2023-10-14 08:10:27,165][100936] Updated weights for policy 0, policy_version 73880 (0.0010) +[2023-10-14 08:10:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 151420928. Throughput: 0: 1663.3, 1: 1651.9. Samples: 37859904. Policy #0 lag: (min: 30.0, avg: 38.0, max: 62.0) +[2023-10-14 08:10:28,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:10:29,033][100917] Updated weights for policy 1, policy_version 73992 (0.0008) +[2023-10-14 08:10:29,410][100917] Updated weights for policy 1, policy_version 74002 (0.0007) +[2023-10-14 08:10:29,794][100917] Updated weights for policy 1, policy_version 74012 (0.0007) +[2023-10-14 08:10:31,533][100936] Updated weights for policy 0, policy_version 73890 (0.0010) +[2023-10-14 08:10:31,904][100936] Updated weights for policy 0, policy_version 73900 (0.0009) +[2023-10-14 08:10:32,270][100936] Updated weights for policy 0, policy_version 73910 (0.0011) +[2023-10-14 08:10:32,642][100936] Updated weights for policy 0, policy_version 73920 (0.0009) +[2023-10-14 08:10:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 151486464. Throughput: 0: 1642.2, 1: 1658.1. Samples: 37879288. Policy #0 lag: (min: 30.0, avg: 38.0, max: 62.0) +[2023-10-14 08:10:33,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:10:33,989][100917] Updated weights for policy 1, policy_version 74022 (0.0010) +[2023-10-14 08:10:34,358][100917] Updated weights for policy 1, policy_version 74032 (0.0010) +[2023-10-14 08:10:34,727][100917] Updated weights for policy 1, policy_version 74042 (0.0008) +[2023-10-14 08:10:36,837][100936] Updated weights for policy 0, policy_version 73930 (0.0007) +[2023-10-14 08:10:37,206][100936] Updated weights for policy 0, policy_version 73940 (0.0007) +[2023-10-14 08:10:37,577][100936] Updated weights for policy 0, policy_version 73950 (0.0009) +[2023-10-14 08:10:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 151552000. Throughput: 0: 1652.0, 1: 1651.0. Samples: 37899054. Policy #0 lag: (min: 30.0, avg: 38.0, max: 62.0) +[2023-10-14 08:10:38,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:10:38,996][100917] Updated weights for policy 1, policy_version 74052 (0.0008) +[2023-10-14 08:10:39,369][100917] Updated weights for policy 1, policy_version 74062 (0.0010) +[2023-10-14 08:10:39,745][100917] Updated weights for policy 1, policy_version 74072 (0.0009) +[2023-10-14 08:10:41,563][100936] Updated weights for policy 0, policy_version 73960 (0.0008) +[2023-10-14 08:10:41,934][100936] Updated weights for policy 0, policy_version 73970 (0.0008) +[2023-10-14 08:10:42,305][100936] Updated weights for policy 0, policy_version 73980 (0.0008) +[2023-10-14 08:10:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 151617536. Throughput: 0: 1654.0, 1: 1653.5. Samples: 37909308. Policy #0 lag: (min: 30.0, avg: 38.0, max: 62.0) +[2023-10-14 08:10:43,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:10:43,827][100917] Updated weights for policy 1, policy_version 74082 (0.0007) +[2023-10-14 08:10:44,197][100917] Updated weights for policy 1, policy_version 74092 (0.0009) +[2023-10-14 08:10:44,570][100917] Updated weights for policy 1, policy_version 74102 (0.0009) +[2023-10-14 08:10:44,930][100917] Updated weights for policy 1, policy_version 74112 (0.0009) +[2023-10-14 08:10:46,624][100936] Updated weights for policy 0, policy_version 73990 (0.0008) +[2023-10-14 08:10:46,996][100936] Updated weights for policy 0, policy_version 74000 (0.0008) +[2023-10-14 08:10:47,365][100936] Updated weights for policy 0, policy_version 74010 (0.0009) +[2023-10-14 08:10:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 151683072. Throughput: 0: 1645.8, 1: 1657.4. Samples: 37928846. Policy #0 lag: (min: 30.0, avg: 38.0, max: 62.0) +[2023-10-14 08:10:48,512][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:10:48,975][100917] Updated weights for policy 1, policy_version 74122 (0.0009) +[2023-10-14 08:10:49,335][100917] Updated weights for policy 1, policy_version 74132 (0.0007) +[2023-10-14 08:10:49,712][100917] Updated weights for policy 1, policy_version 74142 (0.0007) +[2023-10-14 08:10:51,629][100936] Updated weights for policy 0, policy_version 74020 (0.0010) +[2023-10-14 08:10:52,003][100936] Updated weights for policy 0, policy_version 74030 (0.0008) +[2023-10-14 08:10:52,372][100936] Updated weights for policy 0, policy_version 74040 (0.0007) +[2023-10-14 08:10:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 151748608. Throughput: 0: 1649.8, 1: 1660.4. Samples: 37948836. Policy #0 lag: (min: 30.0, avg: 38.0, max: 62.0) +[2023-10-14 08:10:53,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:10:53,757][100917] Updated weights for policy 1, policy_version 74152 (0.0009) +[2023-10-14 08:10:54,138][100917] Updated weights for policy 1, policy_version 74162 (0.0009) +[2023-10-14 08:10:54,499][100917] Updated weights for policy 1, policy_version 74172 (0.0009) +[2023-10-14 08:10:56,577][100936] Updated weights for policy 0, policy_version 74050 (0.0010) +[2023-10-14 08:10:56,949][100936] Updated weights for policy 0, policy_version 74060 (0.0009) +[2023-10-14 08:10:57,322][100936] Updated weights for policy 0, policy_version 74070 (0.0007) +[2023-10-14 08:10:57,686][100936] Updated weights for policy 0, policy_version 74080 (0.0011) +[2023-10-14 08:10:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 151814144. Throughput: 0: 1652.2, 1: 1662.6. Samples: 37959166. Policy #0 lag: (min: 30.0, avg: 38.0, max: 62.0) +[2023-10-14 08:10:58,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:10:58,575][100917] Updated weights for policy 1, policy_version 74182 (0.0008) +[2023-10-14 08:10:58,951][100917] Updated weights for policy 1, policy_version 74192 (0.0010) +[2023-10-14 08:10:59,319][100917] Updated weights for policy 1, policy_version 74202 (0.0008) +[2023-10-14 08:11:01,691][100936] Updated weights for policy 0, policy_version 74090 (0.0010) +[2023-10-14 08:11:02,056][100936] Updated weights for policy 0, policy_version 74100 (0.0007) +[2023-10-14 08:11:02,422][100936] Updated weights for policy 0, policy_version 74110 (0.0009) +[2023-10-14 08:11:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 151879680. Throughput: 0: 1649.1, 1: 1659.2. Samples: 37978298. Policy #0 lag: (min: 30.0, avg: 38.0, max: 62.0) +[2023-10-14 08:11:03,512][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:11:03,549][100917] Updated weights for policy 1, policy_version 74212 (0.0008) +[2023-10-14 08:11:03,921][100917] Updated weights for policy 1, policy_version 74222 (0.0009) +[2023-10-14 08:11:04,303][100917] Updated weights for policy 1, policy_version 74232 (0.0008) +[2023-10-14 08:11:06,473][100936] Updated weights for policy 0, policy_version 74120 (0.0009) +[2023-10-14 08:11:06,852][100936] Updated weights for policy 0, policy_version 74130 (0.0008) +[2023-10-14 08:11:07,218][100936] Updated weights for policy 0, policy_version 74140 (0.0007) +[2023-10-14 08:11:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 151945216. Throughput: 0: 1648.9, 1: 1656.3. Samples: 37998440. Policy #0 lag: (min: 8.0, avg: 34.8, max: 40.0) +[2023-10-14 08:11:08,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:11:08,570][100917] Updated weights for policy 1, policy_version 74242 (0.0008) +[2023-10-14 08:11:08,945][100917] Updated weights for policy 1, policy_version 74252 (0.0009) +[2023-10-14 08:11:09,316][100917] Updated weights for policy 1, policy_version 74262 (0.0008) +[2023-10-14 08:11:09,684][100917] Updated weights for policy 1, policy_version 74272 (0.0009) +[2023-10-14 08:11:11,427][100936] Updated weights for policy 0, policy_version 74150 (0.0010) +[2023-10-14 08:11:11,795][100936] Updated weights for policy 0, policy_version 74160 (0.0009) +[2023-10-14 08:11:12,160][100936] Updated weights for policy 0, policy_version 74170 (0.0008) +[2023-10-14 08:11:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 152010752. Throughput: 0: 1646.1, 1: 1656.6. Samples: 38008526. Policy #0 lag: (min: 8.0, avg: 34.8, max: 40.0) +[2023-10-14 08:11:13,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:11:13,849][100917] Updated weights for policy 1, policy_version 74282 (0.0007) +[2023-10-14 08:11:14,209][100917] Updated weights for policy 1, policy_version 74292 (0.0008) +[2023-10-14 08:11:14,585][100917] Updated weights for policy 1, policy_version 74302 (0.0008) +[2023-10-14 08:11:16,396][100936] Updated weights for policy 0, policy_version 74180 (0.0009) +[2023-10-14 08:11:16,767][100936] Updated weights for policy 0, policy_version 74190 (0.0008) +[2023-10-14 08:11:17,132][100936] Updated weights for policy 0, policy_version 74200 (0.0007) +[2023-10-14 08:11:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 152076288. Throughput: 0: 1651.1, 1: 1652.9. Samples: 38027968. Policy #0 lag: (min: 8.0, avg: 34.8, max: 40.0) +[2023-10-14 08:11:18,512][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:11:18,555][100917] Updated weights for policy 1, policy_version 74312 (0.0010) +[2023-10-14 08:11:18,918][100917] Updated weights for policy 1, policy_version 74322 (0.0008) +[2023-10-14 08:11:19,294][100917] Updated weights for policy 1, policy_version 74332 (0.0010) +[2023-10-14 08:11:21,174][100936] Updated weights for policy 0, policy_version 74210 (0.0008) +[2023-10-14 08:11:21,535][100936] Updated weights for policy 0, policy_version 74220 (0.0007) +[2023-10-14 08:11:21,906][100936] Updated weights for policy 0, policy_version 74230 (0.0007) +[2023-10-14 08:11:22,269][100936] Updated weights for policy 0, policy_version 74240 (0.0008) +[2023-10-14 08:11:23,249][100917] Updated weights for policy 1, policy_version 74342 (0.0010) +[2023-10-14 08:11:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 152141824. Throughput: 0: 1662.8, 1: 1663.4. Samples: 38048732. Policy #0 lag: (min: 8.0, avg: 34.8, max: 40.0) +[2023-10-14 08:11:23,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:11:23,629][100917] Updated weights for policy 1, policy_version 74352 (0.0009) +[2023-10-14 08:11:23,996][100917] Updated weights for policy 1, policy_version 74362 (0.0010) +[2023-10-14 08:11:26,547][100936] Updated weights for policy 0, policy_version 74250 (0.0009) +[2023-10-14 08:11:26,923][100936] Updated weights for policy 0, policy_version 74260 (0.0010) +[2023-10-14 08:11:27,286][100936] Updated weights for policy 0, policy_version 74270 (0.0007) +[2023-10-14 08:11:28,323][100917] Updated weights for policy 1, policy_version 74372 (0.0008) +[2023-10-14 08:11:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 152207360. Throughput: 0: 1655.0, 1: 1661.4. Samples: 38058546. Policy #0 lag: (min: 8.0, avg: 34.8, max: 40.0) +[2023-10-14 08:11:28,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:11:28,707][100917] Updated weights for policy 1, policy_version 74382 (0.0010) +[2023-10-14 08:11:29,073][100917] Updated weights for policy 1, policy_version 74392 (0.0011) +[2023-10-14 08:11:31,300][100936] Updated weights for policy 0, policy_version 74280 (0.0008) +[2023-10-14 08:11:31,668][100936] Updated weights for policy 0, policy_version 74290 (0.0009) +[2023-10-14 08:11:32,038][100936] Updated weights for policy 0, policy_version 74300 (0.0009) +[2023-10-14 08:11:33,279][100917] Updated weights for policy 1, policy_version 74402 (0.0008) +[2023-10-14 08:11:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 152272896. Throughput: 0: 1658.6, 1: 1658.7. Samples: 38078122. Policy #0 lag: (min: 8.0, avg: 34.8, max: 40.0) +[2023-10-14 08:11:33,512][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:11:33,654][100917] Updated weights for policy 1, policy_version 74412 (0.0009) +[2023-10-14 08:11:34,032][100917] Updated weights for policy 1, policy_version 74422 (0.0008) +[2023-10-14 08:11:34,404][100917] Updated weights for policy 1, policy_version 74432 (0.0008) +[2023-10-14 08:11:36,013][100936] Updated weights for policy 0, policy_version 74310 (0.0008) +[2023-10-14 08:11:36,389][100936] Updated weights for policy 0, policy_version 74320 (0.0010) +[2023-10-14 08:11:36,758][100936] Updated weights for policy 0, policy_version 74330 (0.0011) +[2023-10-14 08:11:38,505][100917] Updated weights for policy 1, policy_version 74442 (0.0009) +[2023-10-14 08:11:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 152338432. Throughput: 0: 1671.8, 1: 1658.4. Samples: 38098696. Policy #0 lag: (min: 8.0, avg: 34.8, max: 40.0) +[2023-10-14 08:11:38,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:11:38,521][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000074336_76120064.pth... +[2023-10-14 08:11:38,552][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000072768_74514432.pth +[2023-10-14 08:11:38,882][100917] Updated weights for policy 1, policy_version 74452 (0.0010) +[2023-10-14 08:11:39,257][100917] Updated weights for policy 1, policy_version 74462 (0.0011) +[2023-10-14 08:11:39,324][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000074464_76251136.pth... +[2023-10-14 08:11:39,354][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000072896_74645504.pth +[2023-10-14 08:11:40,935][100936] Updated weights for policy 0, policy_version 74340 (0.0011) +[2023-10-14 08:11:41,299][100936] Updated weights for policy 0, policy_version 74350 (0.0008) +[2023-10-14 08:11:41,669][100936] Updated weights for policy 0, policy_version 74360 (0.0007) +[2023-10-14 08:11:43,310][100917] Updated weights for policy 1, policy_version 74472 (0.0008) +[2023-10-14 08:11:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 152403968. Throughput: 0: 1654.1, 1: 1655.7. Samples: 38108102. Policy #0 lag: (min: 8.0, avg: 34.8, max: 40.0) +[2023-10-14 08:11:43,512][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:11:43,685][100917] Updated weights for policy 1, policy_version 74482 (0.0008) +[2023-10-14 08:11:44,055][100917] Updated weights for policy 1, policy_version 74492 (0.0008) +[2023-10-14 08:11:45,782][100936] Updated weights for policy 0, policy_version 74370 (0.0007) +[2023-10-14 08:11:46,149][100936] Updated weights for policy 0, policy_version 74380 (0.0007) +[2023-10-14 08:11:46,514][100936] Updated weights for policy 0, policy_version 74390 (0.0007) +[2023-10-14 08:11:46,887][100936] Updated weights for policy 0, policy_version 74400 (0.0009) +[2023-10-14 08:11:48,254][100917] Updated weights for policy 1, policy_version 74502 (0.0009) +[2023-10-14 08:11:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 152469504. Throughput: 0: 1666.9, 1: 1659.9. Samples: 38128004. Policy #0 lag: (min: 8.0, avg: 34.8, max: 40.0) +[2023-10-14 08:11:48,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:11:48,630][100917] Updated weights for policy 1, policy_version 74512 (0.0009) +[2023-10-14 08:11:49,001][100917] Updated weights for policy 1, policy_version 74522 (0.0007) +[2023-10-14 08:11:50,940][100936] Updated weights for policy 0, policy_version 74410 (0.0007) +[2023-10-14 08:11:51,305][100936] Updated weights for policy 0, policy_version 74420 (0.0009) +[2023-10-14 08:11:51,678][100936] Updated weights for policy 0, policy_version 74430 (0.0011) +[2023-10-14 08:11:53,111][100917] Updated weights for policy 1, policy_version 74532 (0.0007) +[2023-10-14 08:11:53,484][100917] Updated weights for policy 1, policy_version 74542 (0.0007) +[2023-10-14 08:11:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 152535040. Throughput: 0: 1669.9, 1: 1664.0. Samples: 38148468. Policy #0 lag: (min: 8.0, avg: 34.8, max: 40.0) +[2023-10-14 08:11:53,512][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:11:53,866][100917] Updated weights for policy 1, policy_version 74552 (0.0010) +[2023-10-14 08:11:55,666][100936] Updated weights for policy 0, policy_version 74440 (0.0008) +[2023-10-14 08:11:56,038][100936] Updated weights for policy 0, policy_version 74450 (0.0007) +[2023-10-14 08:11:56,416][100936] Updated weights for policy 0, policy_version 74460 (0.0008) +[2023-10-14 08:11:58,033][100917] Updated weights for policy 1, policy_version 74562 (0.0009) +[2023-10-14 08:11:58,402][100917] Updated weights for policy 1, policy_version 74572 (0.0008) +[2023-10-14 08:11:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 152600576. Throughput: 0: 1651.6, 1: 1661.9. Samples: 38157636. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 08:11:58,512][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:11:58,772][100917] Updated weights for policy 1, policy_version 74582 (0.0008) +[2023-10-14 08:11:59,142][100917] Updated weights for policy 1, policy_version 74592 (0.0008) +[2023-10-14 08:12:00,500][100936] Updated weights for policy 0, policy_version 74470 (0.0007) +[2023-10-14 08:12:00,879][100936] Updated weights for policy 0, policy_version 74480 (0.0007) +[2023-10-14 08:12:01,242][100936] Updated weights for policy 0, policy_version 74490 (0.0010) +[2023-10-14 08:12:03,310][100917] Updated weights for policy 1, policy_version 74602 (0.0008) +[2023-10-14 08:12:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 152666112. Throughput: 0: 1675.1, 1: 1658.1. Samples: 38177962. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 08:12:03,512][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:12:03,680][100917] Updated weights for policy 1, policy_version 74612 (0.0010) +[2023-10-14 08:12:04,062][100917] Updated weights for policy 1, policy_version 74622 (0.0008) +[2023-10-14 08:12:05,392][100936] Updated weights for policy 0, policy_version 74500 (0.0007) +[2023-10-14 08:12:05,762][100936] Updated weights for policy 0, policy_version 74510 (0.0008) +[2023-10-14 08:12:06,125][100936] Updated weights for policy 0, policy_version 74520 (0.0008) +[2023-10-14 08:12:08,219][100917] Updated weights for policy 1, policy_version 74632 (0.0007) +[2023-10-14 08:12:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 152731648. Throughput: 0: 1674.8, 1: 1654.7. Samples: 38198560. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 08:12:08,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:12:08,593][100917] Updated weights for policy 1, policy_version 74642 (0.0009) +[2023-10-14 08:12:08,976][100917] Updated weights for policy 1, policy_version 74652 (0.0010) +[2023-10-14 08:12:10,116][100936] Updated weights for policy 0, policy_version 74530 (0.0008) +[2023-10-14 08:12:10,487][100936] Updated weights for policy 0, policy_version 74540 (0.0009) +[2023-10-14 08:12:10,858][100936] Updated weights for policy 0, policy_version 74550 (0.0009) +[2023-10-14 08:12:11,225][100936] Updated weights for policy 0, policy_version 74560 (0.0008) +[2023-10-14 08:12:13,178][100917] Updated weights for policy 1, policy_version 74662 (0.0010) +[2023-10-14 08:12:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 152797184. Throughput: 0: 1652.9, 1: 1657.8. Samples: 38207528. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 08:12:13,512][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:12:13,544][100917] Updated weights for policy 1, policy_version 74672 (0.0007) +[2023-10-14 08:12:13,919][100917] Updated weights for policy 1, policy_version 74682 (0.0009) +[2023-10-14 08:12:15,414][100936] Updated weights for policy 0, policy_version 74570 (0.0008) +[2023-10-14 08:12:15,776][100936] Updated weights for policy 0, policy_version 74580 (0.0010) +[2023-10-14 08:12:16,147][100936] Updated weights for policy 0, policy_version 74590 (0.0008) +[2023-10-14 08:12:17,920][100917] Updated weights for policy 1, policy_version 74692 (0.0008) +[2023-10-14 08:12:18,292][100917] Updated weights for policy 1, policy_version 74702 (0.0009) +[2023-10-14 08:12:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 152862720. Throughput: 0: 1669.1, 1: 1660.1. Samples: 38227936. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 08:12:18,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:12:18,670][100917] Updated weights for policy 1, policy_version 74712 (0.0008) +[2023-10-14 08:12:20,419][100936] Updated weights for policy 0, policy_version 74600 (0.0007) +[2023-10-14 08:12:20,784][100936] Updated weights for policy 0, policy_version 74610 (0.0009) +[2023-10-14 08:12:21,151][100936] Updated weights for policy 0, policy_version 74620 (0.0008) +[2023-10-14 08:12:22,692][100917] Updated weights for policy 1, policy_version 74722 (0.0007) +[2023-10-14 08:12:23,057][100917] Updated weights for policy 1, policy_version 74732 (0.0007) +[2023-10-14 08:12:23,430][100917] Updated weights for policy 1, policy_version 74742 (0.0008) +[2023-10-14 08:12:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 152928256. Throughput: 0: 1661.1, 1: 1656.2. Samples: 38247972. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 08:12:23,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:12:23,813][100917] Updated weights for policy 1, policy_version 74752 (0.0010) +[2023-10-14 08:12:25,067][100936] Updated weights for policy 0, policy_version 74630 (0.0011) +[2023-10-14 08:12:25,439][100936] Updated weights for policy 0, policy_version 74640 (0.0007) +[2023-10-14 08:12:25,813][100936] Updated weights for policy 0, policy_version 74650 (0.0007) +[2023-10-14 08:12:27,984][100917] Updated weights for policy 1, policy_version 74762 (0.0007) +[2023-10-14 08:12:28,360][100917] Updated weights for policy 1, policy_version 74772 (0.0010) +[2023-10-14 08:12:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 152993792. Throughput: 0: 1648.5, 1: 1668.8. Samples: 38257382. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 08:12:28,512][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:12:28,732][100917] Updated weights for policy 1, policy_version 74782 (0.0009) +[2023-10-14 08:12:29,976][100936] Updated weights for policy 0, policy_version 74660 (0.0007) +[2023-10-14 08:12:30,344][100936] Updated weights for policy 0, policy_version 74670 (0.0007) +[2023-10-14 08:12:30,715][100936] Updated weights for policy 0, policy_version 74680 (0.0009) +[2023-10-14 08:12:32,901][100917] Updated weights for policy 1, policy_version 74792 (0.0008) +[2023-10-14 08:12:33,272][100917] Updated weights for policy 1, policy_version 74802 (0.0008) +[2023-10-14 08:12:33,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 153059328. Throughput: 0: 1660.3, 1: 1663.3. Samples: 38277566. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 08:12:33,512][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:12:33,635][100917] Updated weights for policy 1, policy_version 74812 (0.0008) +[2023-10-14 08:12:34,836][100936] Updated weights for policy 0, policy_version 74690 (0.0009) +[2023-10-14 08:12:35,208][100936] Updated weights for policy 0, policy_version 74700 (0.0009) +[2023-10-14 08:12:35,584][100936] Updated weights for policy 0, policy_version 74710 (0.0010) +[2023-10-14 08:12:35,956][100936] Updated weights for policy 0, policy_version 74720 (0.0008) +[2023-10-14 08:12:37,698][100917] Updated weights for policy 1, policy_version 74822 (0.0008) +[2023-10-14 08:12:38,064][100917] Updated weights for policy 1, policy_version 74832 (0.0010) +[2023-10-14 08:12:38,438][100917] Updated weights for policy 1, policy_version 74842 (0.0009) +[2023-10-14 08:12:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 153124864. Throughput: 0: 1662.7, 1: 1649.9. Samples: 38297538. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 08:12:38,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:12:40,053][100936] Updated weights for policy 0, policy_version 74730 (0.0009) +[2023-10-14 08:12:40,414][100936] Updated weights for policy 0, policy_version 74740 (0.0008) +[2023-10-14 08:12:40,785][100936] Updated weights for policy 0, policy_version 74750 (0.0007) +[2023-10-14 08:12:42,650][100917] Updated weights for policy 1, policy_version 74852 (0.0009) +[2023-10-14 08:12:43,030][100917] Updated weights for policy 1, policy_version 74862 (0.0007) +[2023-10-14 08:12:43,401][100917] Updated weights for policy 1, policy_version 74872 (0.0007) +[2023-10-14 08:12:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 153190400. Throughput: 0: 1657.0, 1: 1661.0. Samples: 38306948. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) +[2023-10-14 08:12:43,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:12:44,963][100936] Updated weights for policy 0, policy_version 74760 (0.0007) +[2023-10-14 08:12:45,334][100936] Updated weights for policy 0, policy_version 74770 (0.0007) +[2023-10-14 08:12:45,711][100936] Updated weights for policy 0, policy_version 74780 (0.0010) +[2023-10-14 08:12:47,608][100917] Updated weights for policy 1, policy_version 74882 (0.0010) +[2023-10-14 08:12:47,980][100917] Updated weights for policy 1, policy_version 74892 (0.0010) +[2023-10-14 08:12:48,357][100917] Updated weights for policy 1, policy_version 74902 (0.0011) +[2023-10-14 08:12:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 153255936. Throughput: 0: 1655.6, 1: 1656.8. Samples: 38327024. Policy #0 lag: (min: 21.0, avg: 23.9, max: 53.0) +[2023-10-14 08:12:48,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 08:12:48,728][100917] Updated weights for policy 1, policy_version 74912 (0.0007) +[2023-10-14 08:12:49,966][100936] Updated weights for policy 0, policy_version 74790 (0.0007) +[2023-10-14 08:12:50,333][100936] Updated weights for policy 0, policy_version 74800 (0.0008) +[2023-10-14 08:12:50,704][100936] Updated weights for policy 0, policy_version 74810 (0.0009) +[2023-10-14 08:12:52,680][100917] Updated weights for policy 1, policy_version 74922 (0.0007) +[2023-10-14 08:12:53,052][100917] Updated weights for policy 1, policy_version 74932 (0.0007) +[2023-10-14 08:12:53,420][100917] Updated weights for policy 1, policy_version 74942 (0.0009) +[2023-10-14 08:12:53,512][99942] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 153354240. Throughput: 0: 1649.1, 1: 1647.0. Samples: 38346886. Policy #0 lag: (min: 21.0, avg: 23.9, max: 53.0) +[2023-10-14 08:12:53,512][99942] Avg episode reward: [(0, '0.820'), (1, '1.000')] +[2023-10-14 08:12:54,828][100936] Updated weights for policy 0, policy_version 74820 (0.0009) +[2023-10-14 08:12:55,206][100936] Updated weights for policy 0, policy_version 74830 (0.0007) +[2023-10-14 08:12:55,580][100936] Updated weights for policy 0, policy_version 74840 (0.0008) +[2023-10-14 08:12:57,572][100917] Updated weights for policy 1, policy_version 74952 (0.0009) +[2023-10-14 08:12:57,942][100917] Updated weights for policy 1, policy_version 74962 (0.0009) +[2023-10-14 08:12:58,317][100917] Updated weights for policy 1, policy_version 74972 (0.0008) +[2023-10-14 08:12:58,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 153419776. Throughput: 0: 1650.2, 1: 1660.8. Samples: 38356524. Policy #0 lag: (min: 21.0, avg: 23.9, max: 53.0) +[2023-10-14 08:12:58,513][99942] Avg episode reward: [(0, '0.970'), (1, '1.000')] +[2023-10-14 08:12:59,875][100936] Updated weights for policy 0, policy_version 74850 (0.0009) +[2023-10-14 08:13:00,239][100936] Updated weights for policy 0, policy_version 74860 (0.0008) +[2023-10-14 08:13:00,620][100936] Updated weights for policy 0, policy_version 74870 (0.0010) +[2023-10-14 08:13:00,999][100936] Updated weights for policy 0, policy_version 74880 (0.0009) +[2023-10-14 08:13:02,434][100917] Updated weights for policy 1, policy_version 74982 (0.0009) +[2023-10-14 08:13:02,807][100917] Updated weights for policy 1, policy_version 74992 (0.0009) +[2023-10-14 08:13:03,187][100917] Updated weights for policy 1, policy_version 75002 (0.0008) +[2023-10-14 08:13:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 153485312. Throughput: 0: 1651.3, 1: 1653.8. Samples: 38376664. Policy #0 lag: (min: 21.0, avg: 23.9, max: 53.0) +[2023-10-14 08:13:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:13:05,208][100936] Updated weights for policy 0, policy_version 74890 (0.0007) +[2023-10-14 08:13:05,581][100936] Updated weights for policy 0, policy_version 74900 (0.0007) +[2023-10-14 08:13:05,951][100936] Updated weights for policy 0, policy_version 74910 (0.0008) +[2023-10-14 08:13:07,117][100917] Updated weights for policy 1, policy_version 75012 (0.0007) +[2023-10-14 08:13:07,485][100917] Updated weights for policy 1, policy_version 75022 (0.0008) +[2023-10-14 08:13:07,857][100917] Updated weights for policy 1, policy_version 75032 (0.0008) +[2023-10-14 08:13:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 153550848. Throughput: 0: 1657.9, 1: 1641.1. Samples: 38396424. Policy #0 lag: (min: 21.0, avg: 23.9, max: 53.0) +[2023-10-14 08:13:08,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:13:10,051][100936] Updated weights for policy 0, policy_version 74920 (0.0007) +[2023-10-14 08:13:10,423][100936] Updated weights for policy 0, policy_version 74930 (0.0007) +[2023-10-14 08:13:10,797][100936] Updated weights for policy 0, policy_version 74940 (0.0009) +[2023-10-14 08:13:11,992][100917] Updated weights for policy 1, policy_version 75042 (0.0008) +[2023-10-14 08:13:12,358][100917] Updated weights for policy 1, policy_version 75052 (0.0008) +[2023-10-14 08:13:12,734][100917] Updated weights for policy 1, policy_version 75062 (0.0007) +[2023-10-14 08:13:13,104][100917] Updated weights for policy 1, policy_version 75072 (0.0007) +[2023-10-14 08:13:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 153616384. Throughput: 0: 1655.0, 1: 1653.1. Samples: 38406244. Policy #0 lag: (min: 21.0, avg: 23.9, max: 53.0) +[2023-10-14 08:13:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:13:14,733][100936] Updated weights for policy 0, policy_version 74950 (0.0009) +[2023-10-14 08:13:15,109][100936] Updated weights for policy 0, policy_version 74960 (0.0007) +[2023-10-14 08:13:15,477][100936] Updated weights for policy 0, policy_version 74970 (0.0009) +[2023-10-14 08:13:17,226][100917] Updated weights for policy 1, policy_version 75082 (0.0008) +[2023-10-14 08:13:17,602][100917] Updated weights for policy 1, policy_version 75092 (0.0009) +[2023-10-14 08:13:17,984][100917] Updated weights for policy 1, policy_version 75102 (0.0010) +[2023-10-14 08:13:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 153681920. Throughput: 0: 1665.8, 1: 1656.7. Samples: 38427078. Policy #0 lag: (min: 21.0, avg: 23.9, max: 53.0) +[2023-10-14 08:13:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 08:13:19,553][100936] Updated weights for policy 0, policy_version 74980 (0.0008) +[2023-10-14 08:13:19,914][100936] Updated weights for policy 0, policy_version 74990 (0.0009) +[2023-10-14 08:13:20,288][100936] Updated weights for policy 0, policy_version 75000 (0.0010) +[2023-10-14 08:13:22,086][100917] Updated weights for policy 1, policy_version 75112 (0.0009) +[2023-10-14 08:13:22,463][100917] Updated weights for policy 1, policy_version 75122 (0.0009) +[2023-10-14 08:13:22,846][100917] Updated weights for policy 1, policy_version 75132 (0.0009) +[2023-10-14 08:13:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 153747456. Throughput: 0: 1667.4, 1: 1640.9. Samples: 38446414. Policy #0 lag: (min: 21.0, avg: 23.9, max: 53.0) +[2023-10-14 08:13:23,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 08:13:24,224][100936] Updated weights for policy 0, policy_version 75010 (0.0009) +[2023-10-14 08:13:24,597][100936] Updated weights for policy 0, policy_version 75020 (0.0010) +[2023-10-14 08:13:24,980][100936] Updated weights for policy 0, policy_version 75030 (0.0009) +[2023-10-14 08:13:25,348][100936] Updated weights for policy 0, policy_version 75040 (0.0009) +[2023-10-14 08:13:26,979][100917] Updated weights for policy 1, policy_version 75142 (0.0008) +[2023-10-14 08:13:27,357][100917] Updated weights for policy 1, policy_version 75152 (0.0009) +[2023-10-14 08:13:27,737][100917] Updated weights for policy 1, policy_version 75162 (0.0010) +[2023-10-14 08:13:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 153812992. Throughput: 0: 1665.5, 1: 1656.8. Samples: 38456450. Policy #0 lag: (min: 21.0, avg: 23.9, max: 53.0) +[2023-10-14 08:13:28,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 08:13:29,487][100936] Updated weights for policy 0, policy_version 75050 (0.0008) +[2023-10-14 08:13:29,859][100936] Updated weights for policy 0, policy_version 75060 (0.0008) +[2023-10-14 08:13:30,238][100936] Updated weights for policy 0, policy_version 75070 (0.0010) +[2023-10-14 08:13:31,873][100917] Updated weights for policy 1, policy_version 75172 (0.0009) +[2023-10-14 08:13:32,257][100917] Updated weights for policy 1, policy_version 75182 (0.0010) +[2023-10-14 08:13:32,631][100917] Updated weights for policy 1, policy_version 75192 (0.0008) +[2023-10-14 08:13:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 153878528. Throughput: 0: 1667.6, 1: 1656.1. Samples: 38476590. Policy #0 lag: (min: 21.0, avg: 23.9, max: 53.0) +[2023-10-14 08:13:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.990')] +[2023-10-14 08:13:34,494][100936] Updated weights for policy 0, policy_version 75080 (0.0009) +[2023-10-14 08:13:34,864][100936] Updated weights for policy 0, policy_version 75090 (0.0011) +[2023-10-14 08:13:35,236][100936] Updated weights for policy 0, policy_version 75100 (0.0008) +[2023-10-14 08:13:36,722][100917] Updated weights for policy 1, policy_version 75202 (0.0009) +[2023-10-14 08:13:37,095][100917] Updated weights for policy 1, policy_version 75212 (0.0008) +[2023-10-14 08:13:37,471][100917] Updated weights for policy 1, policy_version 75222 (0.0010) +[2023-10-14 08:13:37,842][100917] Updated weights for policy 1, policy_version 75232 (0.0008) +[2023-10-14 08:13:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 153944064. Throughput: 0: 1671.0, 1: 1641.2. Samples: 38495938. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:13:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '0.930')] +[2023-10-14 08:13:38,526][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000075232_77037568.pth... +[2023-10-14 08:13:38,527][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000075104_76906496.pth... +[2023-10-14 08:13:38,563][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000073664_75431936.pth +[2023-10-14 08:13:38,571][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000073568_75333632.pth +[2023-10-14 08:13:39,240][100936] Updated weights for policy 0, policy_version 75110 (0.0008) +[2023-10-14 08:13:39,603][100936] Updated weights for policy 0, policy_version 75120 (0.0010) +[2023-10-14 08:13:39,975][100936] Updated weights for policy 0, policy_version 75130 (0.0010) +[2023-10-14 08:13:41,948][100917] Updated weights for policy 1, policy_version 75242 (0.0011) +[2023-10-14 08:13:42,316][100917] Updated weights for policy 1, policy_version 75252 (0.0010) +[2023-10-14 08:13:42,689][100917] Updated weights for policy 1, policy_version 75262 (0.0009) +[2023-10-14 08:13:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 154009600. Throughput: 0: 1667.1, 1: 1657.4. Samples: 38506126. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:13:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 08:13:44,096][100936] Updated weights for policy 0, policy_version 75140 (0.0010) +[2023-10-14 08:13:44,460][100936] Updated weights for policy 0, policy_version 75150 (0.0011) +[2023-10-14 08:13:44,831][100936] Updated weights for policy 0, policy_version 75160 (0.0011) +[2023-10-14 08:13:46,851][100917] Updated weights for policy 1, policy_version 75272 (0.0007) +[2023-10-14 08:13:47,223][100917] Updated weights for policy 1, policy_version 75282 (0.0007) +[2023-10-14 08:13:47,600][100917] Updated weights for policy 1, policy_version 75292 (0.0008) +[2023-10-14 08:13:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 154075136. Throughput: 0: 1668.7, 1: 1649.2. Samples: 38525966. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:13:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 08:13:49,090][100936] Updated weights for policy 0, policy_version 75170 (0.0009) +[2023-10-14 08:13:49,458][100936] Updated weights for policy 0, policy_version 75180 (0.0008) +[2023-10-14 08:13:49,834][100936] Updated weights for policy 0, policy_version 75190 (0.0008) +[2023-10-14 08:13:50,192][100936] Updated weights for policy 0, policy_version 75200 (0.0009) +[2023-10-14 08:13:51,777][100917] Updated weights for policy 1, policy_version 75302 (0.0008) +[2023-10-14 08:13:52,139][100917] Updated weights for policy 1, policy_version 75312 (0.0009) +[2023-10-14 08:13:52,519][100917] Updated weights for policy 1, policy_version 75322 (0.0007) +[2023-10-14 08:13:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 154140672. Throughput: 0: 1667.1, 1: 1650.7. Samples: 38545730. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:13:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 08:13:54,362][100936] Updated weights for policy 0, policy_version 75210 (0.0011) +[2023-10-14 08:13:54,726][100936] Updated weights for policy 0, policy_version 75220 (0.0008) +[2023-10-14 08:13:55,098][100936] Updated weights for policy 0, policy_version 75230 (0.0007) +[2023-10-14 08:13:56,726][100917] Updated weights for policy 1, policy_version 75332 (0.0008) +[2023-10-14 08:13:57,095][100917] Updated weights for policy 1, policy_version 75342 (0.0010) +[2023-10-14 08:13:57,457][100917] Updated weights for policy 1, policy_version 75352 (0.0009) +[2023-10-14 08:13:58,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 154206208. Throughput: 0: 1668.6, 1: 1656.7. Samples: 38555882. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:13:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 08:13:59,116][100936] Updated weights for policy 0, policy_version 75240 (0.0007) +[2023-10-14 08:13:59,480][100936] Updated weights for policy 0, policy_version 75250 (0.0007) +[2023-10-14 08:13:59,858][100936] Updated weights for policy 0, policy_version 75260 (0.0008) +[2023-10-14 08:14:01,568][100917] Updated weights for policy 1, policy_version 75362 (0.0009) +[2023-10-14 08:14:01,990][100917] Updated weights for policy 1, policy_version 75372 (0.0009) +[2023-10-14 08:14:02,375][100917] Updated weights for policy 1, policy_version 75382 (0.0008) +[2023-10-14 08:14:02,745][100917] Updated weights for policy 1, policy_version 75392 (0.0008) +[2023-10-14 08:14:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 154271744. Throughput: 0: 1659.2, 1: 1647.5. Samples: 38575882. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:14:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 08:14:03,983][100936] Updated weights for policy 0, policy_version 75270 (0.0010) +[2023-10-14 08:14:04,355][100936] Updated weights for policy 0, policy_version 75280 (0.0009) +[2023-10-14 08:14:04,722][100936] Updated weights for policy 0, policy_version 75290 (0.0009) +[2023-10-14 08:14:06,817][100917] Updated weights for policy 1, policy_version 75402 (0.0010) +[2023-10-14 08:14:07,192][100917] Updated weights for policy 1, policy_version 75412 (0.0009) +[2023-10-14 08:14:07,567][100917] Updated weights for policy 1, policy_version 75422 (0.0008) +[2023-10-14 08:14:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 154337280. Throughput: 0: 1657.0, 1: 1653.3. Samples: 38595376. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:14:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.930')] +[2023-10-14 08:14:08,779][100936] Updated weights for policy 0, policy_version 75300 (0.0008) +[2023-10-14 08:14:09,163][100936] Updated weights for policy 0, policy_version 75310 (0.0009) +[2023-10-14 08:14:09,520][100936] Updated weights for policy 0, policy_version 75320 (0.0009) +[2023-10-14 08:14:11,861][100917] Updated weights for policy 1, policy_version 75432 (0.0011) +[2023-10-14 08:14:12,239][100917] Updated weights for policy 1, policy_version 75442 (0.0009) +[2023-10-14 08:14:12,603][100917] Updated weights for policy 1, policy_version 75452 (0.0010) +[2023-10-14 08:14:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 154402816. Throughput: 0: 1658.4, 1: 1655.2. Samples: 38605558. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:14:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:14:13,761][100936] Updated weights for policy 0, policy_version 75330 (0.0008) +[2023-10-14 08:14:14,129][100936] Updated weights for policy 0, policy_version 75340 (0.0008) +[2023-10-14 08:14:14,494][100936] Updated weights for policy 0, policy_version 75350 (0.0009) +[2023-10-14 08:14:14,863][100936] Updated weights for policy 0, policy_version 75360 (0.0008) +[2023-10-14 08:14:16,814][100917] Updated weights for policy 1, policy_version 75462 (0.0008) +[2023-10-14 08:14:17,197][100917] Updated weights for policy 1, policy_version 75472 (0.0008) +[2023-10-14 08:14:17,567][100917] Updated weights for policy 1, policy_version 75482 (0.0009) +[2023-10-14 08:14:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 154468352. Throughput: 0: 1656.9, 1: 1648.9. Samples: 38625352. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:14:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:14:18,976][100936] Updated weights for policy 0, policy_version 75370 (0.0009) +[2023-10-14 08:14:19,334][100936] Updated weights for policy 0, policy_version 75380 (0.0009) +[2023-10-14 08:14:19,713][100936] Updated weights for policy 0, policy_version 75390 (0.0010) +[2023-10-14 08:14:21,615][100917] Updated weights for policy 1, policy_version 75492 (0.0008) +[2023-10-14 08:14:21,976][100917] Updated weights for policy 1, policy_version 75502 (0.0008) +[2023-10-14 08:14:22,353][100917] Updated weights for policy 1, policy_version 75512 (0.0010) +[2023-10-14 08:14:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 154533888. Throughput: 0: 1661.6, 1: 1653.5. Samples: 38645114. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:14:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:14:23,880][100936] Updated weights for policy 0, policy_version 75400 (0.0008) +[2023-10-14 08:14:24,256][100936] Updated weights for policy 0, policy_version 75410 (0.0010) +[2023-10-14 08:14:24,621][100936] Updated weights for policy 0, policy_version 75420 (0.0008) +[2023-10-14 08:14:26,531][100917] Updated weights for policy 1, policy_version 75522 (0.0010) +[2023-10-14 08:14:26,900][100917] Updated weights for policy 1, policy_version 75532 (0.0008) +[2023-10-14 08:14:27,263][100917] Updated weights for policy 1, policy_version 75542 (0.0009) +[2023-10-14 08:14:27,632][100917] Updated weights for policy 1, policy_version 75552 (0.0008) +[2023-10-14 08:14:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 154599424. Throughput: 0: 1665.3, 1: 1650.3. Samples: 38655328. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 08:14:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:14:28,661][100936] Updated weights for policy 0, policy_version 75430 (0.0010) +[2023-10-14 08:14:29,017][100936] Updated weights for policy 0, policy_version 75440 (0.0011) +[2023-10-14 08:14:29,387][100936] Updated weights for policy 0, policy_version 75450 (0.0010) +[2023-10-14 08:14:31,609][100917] Updated weights for policy 1, policy_version 75562 (0.0011) +[2023-10-14 08:14:31,985][100917] Updated weights for policy 1, policy_version 75572 (0.0010) +[2023-10-14 08:14:32,348][100917] Updated weights for policy 1, policy_version 75582 (0.0009) +[2023-10-14 08:14:33,402][100936] Updated weights for policy 0, policy_version 75460 (0.0008) +[2023-10-14 08:14:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 154664960. Throughput: 0: 1668.2, 1: 1648.2. Samples: 38675204. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 08:14:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:14:33,761][100936] Updated weights for policy 0, policy_version 75470 (0.0007) +[2023-10-14 08:14:34,138][100936] Updated weights for policy 0, policy_version 75480 (0.0007) +[2023-10-14 08:14:36,467][100917] Updated weights for policy 1, policy_version 75592 (0.0009) +[2023-10-14 08:14:36,833][100917] Updated weights for policy 1, policy_version 75602 (0.0007) +[2023-10-14 08:14:37,201][100917] Updated weights for policy 1, policy_version 75612 (0.0010) +[2023-10-14 08:14:38,326][100936] Updated weights for policy 0, policy_version 75490 (0.0007) +[2023-10-14 08:14:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 154730496. Throughput: 0: 1662.8, 1: 1652.4. Samples: 38694910. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 08:14:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:14:38,692][100936] Updated weights for policy 0, policy_version 75500 (0.0007) +[2023-10-14 08:14:39,071][100936] Updated weights for policy 0, policy_version 75510 (0.0008) +[2023-10-14 08:14:39,438][100936] Updated weights for policy 0, policy_version 75520 (0.0009) +[2023-10-14 08:14:41,419][100917] Updated weights for policy 1, policy_version 75622 (0.0010) +[2023-10-14 08:14:41,795][100917] Updated weights for policy 1, policy_version 75632 (0.0008) +[2023-10-14 08:14:42,173][100917] Updated weights for policy 1, policy_version 75642 (0.0009) +[2023-10-14 08:14:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 154796032. Throughput: 0: 1665.5, 1: 1653.5. Samples: 38705238. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 08:14:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:14:43,712][100936] Updated weights for policy 0, policy_version 75530 (0.0009) +[2023-10-14 08:14:44,089][100936] Updated weights for policy 0, policy_version 75540 (0.0008) +[2023-10-14 08:14:44,461][100936] Updated weights for policy 0, policy_version 75550 (0.0009) +[2023-10-14 08:14:46,488][100917] Updated weights for policy 1, policy_version 75652 (0.0008) +[2023-10-14 08:14:46,878][100917] Updated weights for policy 1, policy_version 75662 (0.0007) +[2023-10-14 08:14:47,247][100917] Updated weights for policy 1, policy_version 75672 (0.0009) +[2023-10-14 08:14:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 154861568. Throughput: 0: 1662.3, 1: 1648.4. Samples: 38724864. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 08:14:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:14:48,520][100936] Updated weights for policy 0, policy_version 75560 (0.0009) +[2023-10-14 08:14:48,890][100936] Updated weights for policy 0, policy_version 75570 (0.0007) +[2023-10-14 08:14:49,254][100936] Updated weights for policy 0, policy_version 75580 (0.0010) +[2023-10-14 08:14:51,215][100917] Updated weights for policy 1, policy_version 75682 (0.0008) +[2023-10-14 08:14:51,588][100917] Updated weights for policy 1, policy_version 75692 (0.0009) +[2023-10-14 08:14:51,954][100917] Updated weights for policy 1, policy_version 75702 (0.0010) +[2023-10-14 08:14:52,327][100917] Updated weights for policy 1, policy_version 75712 (0.0008) +[2023-10-14 08:14:53,400][100936] Updated weights for policy 0, policy_version 75590 (0.0008) +[2023-10-14 08:14:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 154927104. Throughput: 0: 1657.9, 1: 1659.1. Samples: 38744644. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 08:14:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:14:53,770][100936] Updated weights for policy 0, policy_version 75600 (0.0007) +[2023-10-14 08:14:54,142][100936] Updated weights for policy 0, policy_version 75610 (0.0007) +[2023-10-14 08:14:56,500][100917] Updated weights for policy 1, policy_version 75722 (0.0008) +[2023-10-14 08:14:56,868][100917] Updated weights for policy 1, policy_version 75732 (0.0011) +[2023-10-14 08:14:57,243][100917] Updated weights for policy 1, policy_version 75742 (0.0011) +[2023-10-14 08:14:58,173][100936] Updated weights for policy 0, policy_version 75620 (0.0007) +[2023-10-14 08:14:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 154992640. Throughput: 0: 1660.5, 1: 1661.3. Samples: 38755042. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 08:14:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:14:58,543][100936] Updated weights for policy 0, policy_version 75630 (0.0007) +[2023-10-14 08:14:58,920][100936] Updated weights for policy 0, policy_version 75640 (0.0009) +[2023-10-14 08:15:01,191][100917] Updated weights for policy 1, policy_version 75752 (0.0010) +[2023-10-14 08:15:01,569][100917] Updated weights for policy 1, policy_version 75762 (0.0009) +[2023-10-14 08:15:01,933][100917] Updated weights for policy 1, policy_version 75772 (0.0008) +[2023-10-14 08:15:02,975][100936] Updated weights for policy 0, policy_version 75650 (0.0007) +[2023-10-14 08:15:03,354][100936] Updated weights for policy 0, policy_version 75660 (0.0008) +[2023-10-14 08:15:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 155058176. Throughput: 0: 1665.8, 1: 1654.9. Samples: 38774784. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 08:15:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:15:03,734][100936] Updated weights for policy 0, policy_version 75670 (0.0009) +[2023-10-14 08:15:04,098][100936] Updated weights for policy 0, policy_version 75680 (0.0009) +[2023-10-14 08:15:05,906][100917] Updated weights for policy 1, policy_version 75782 (0.0007) +[2023-10-14 08:15:06,283][100917] Updated weights for policy 1, policy_version 75792 (0.0009) +[2023-10-14 08:15:06,651][100917] Updated weights for policy 1, policy_version 75802 (0.0010) +[2023-10-14 08:15:08,285][100936] Updated weights for policy 0, policy_version 75690 (0.0007) +[2023-10-14 08:15:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 155123712. Throughput: 0: 1651.7, 1: 1671.9. Samples: 38794676. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 08:15:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:15:08,657][100936] Updated weights for policy 0, policy_version 75700 (0.0009) +[2023-10-14 08:15:09,031][100936] Updated weights for policy 0, policy_version 75710 (0.0007) +[2023-10-14 08:15:10,715][100917] Updated weights for policy 1, policy_version 75812 (0.0008) +[2023-10-14 08:15:11,091][100917] Updated weights for policy 1, policy_version 75822 (0.0009) +[2023-10-14 08:15:11,459][100917] Updated weights for policy 1, policy_version 75832 (0.0007) +[2023-10-14 08:15:13,048][100936] Updated weights for policy 0, policy_version 75720 (0.0008) +[2023-10-14 08:15:13,421][100936] Updated weights for policy 0, policy_version 75730 (0.0009) +[2023-10-14 08:15:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 155189248. Throughput: 0: 1664.3, 1: 1664.6. Samples: 38805126. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) +[2023-10-14 08:15:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:15:13,787][100936] Updated weights for policy 0, policy_version 75740 (0.0009) +[2023-10-14 08:15:15,548][100917] Updated weights for policy 1, policy_version 75842 (0.0010) +[2023-10-14 08:15:15,928][100917] Updated weights for policy 1, policy_version 75852 (0.0008) +[2023-10-14 08:15:16,291][100917] Updated weights for policy 1, policy_version 75862 (0.0008) +[2023-10-14 08:15:16,669][100917] Updated weights for policy 1, policy_version 75872 (0.0010) +[2023-10-14 08:15:18,101][100936] Updated weights for policy 0, policy_version 75750 (0.0009) +[2023-10-14 08:15:18,474][100936] Updated weights for policy 0, policy_version 75760 (0.0009) +[2023-10-14 08:15:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 155254784. Throughput: 0: 1660.6, 1: 1663.2. Samples: 38824774. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 08:15:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:15:18,856][100936] Updated weights for policy 0, policy_version 75770 (0.0009) +[2023-10-14 08:15:20,754][100917] Updated weights for policy 1, policy_version 75882 (0.0008) +[2023-10-14 08:15:21,123][100917] Updated weights for policy 1, policy_version 75892 (0.0008) +[2023-10-14 08:15:21,489][100917] Updated weights for policy 1, policy_version 75902 (0.0010) +[2023-10-14 08:15:22,927][100936] Updated weights for policy 0, policy_version 75780 (0.0008) +[2023-10-14 08:15:23,305][100936] Updated weights for policy 0, policy_version 75790 (0.0007) +[2023-10-14 08:15:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 155320320. Throughput: 0: 1648.8, 1: 1672.8. Samples: 38844382. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 08:15:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:15:23,677][100936] Updated weights for policy 0, policy_version 75800 (0.0008) +[2023-10-14 08:15:25,794][100917] Updated weights for policy 1, policy_version 75912 (0.0009) +[2023-10-14 08:15:26,168][100917] Updated weights for policy 1, policy_version 75922 (0.0010) +[2023-10-14 08:15:26,547][100917] Updated weights for policy 1, policy_version 75932 (0.0008) +[2023-10-14 08:15:28,049][100936] Updated weights for policy 0, policy_version 75810 (0.0009) +[2023-10-14 08:15:28,434][100936] Updated weights for policy 0, policy_version 75820 (0.0008) +[2023-10-14 08:15:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 155385856. Throughput: 0: 1658.8, 1: 1660.2. Samples: 38854594. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 08:15:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:15:28,814][100936] Updated weights for policy 0, policy_version 75830 (0.0008) +[2023-10-14 08:15:29,182][100936] Updated weights for policy 0, policy_version 75840 (0.0009) +[2023-10-14 08:15:30,767][100917] Updated weights for policy 1, policy_version 75942 (0.0008) +[2023-10-14 08:15:31,136][100917] Updated weights for policy 1, policy_version 75952 (0.0010) +[2023-10-14 08:15:31,516][100917] Updated weights for policy 1, policy_version 75962 (0.0011) +[2023-10-14 08:15:33,232][100936] Updated weights for policy 0, policy_version 75850 (0.0007) +[2023-10-14 08:15:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 155451392. Throughput: 0: 1657.3, 1: 1655.0. Samples: 38873918. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 08:15:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:15:33,596][100936] Updated weights for policy 0, policy_version 75860 (0.0009) +[2023-10-14 08:15:33,964][100936] Updated weights for policy 0, policy_version 75870 (0.0011) +[2023-10-14 08:15:35,808][100917] Updated weights for policy 1, policy_version 75972 (0.0009) +[2023-10-14 08:15:36,200][100917] Updated weights for policy 1, policy_version 75982 (0.0008) +[2023-10-14 08:15:36,578][100917] Updated weights for policy 1, policy_version 75992 (0.0009) +[2023-10-14 08:15:38,125][100936] Updated weights for policy 0, policy_version 75880 (0.0010) +[2023-10-14 08:15:38,486][100936] Updated weights for policy 0, policy_version 75890 (0.0010) +[2023-10-14 08:15:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 155516928. Throughput: 0: 1643.6, 1: 1657.9. Samples: 38893214. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 08:15:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:15:38,523][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000076000_77824000.pth... +[2023-10-14 08:15:38,562][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000074464_76251136.pth +[2023-10-14 08:15:38,857][100936] Updated weights for policy 0, policy_version 75900 (0.0010) +[2023-10-14 08:15:39,004][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000075904_77725696.pth... +[2023-10-14 08:15:39,033][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000074336_76120064.pth +[2023-10-14 08:15:40,714][100917] Updated weights for policy 1, policy_version 76002 (0.0007) +[2023-10-14 08:15:41,081][100917] Updated weights for policy 1, policy_version 76012 (0.0007) +[2023-10-14 08:15:41,451][100917] Updated weights for policy 1, policy_version 76022 (0.0010) +[2023-10-14 08:15:41,831][100917] Updated weights for policy 1, policy_version 76032 (0.0011) +[2023-10-14 08:15:43,048][100936] Updated weights for policy 0, policy_version 75910 (0.0008) +[2023-10-14 08:15:43,416][100936] Updated weights for policy 0, policy_version 75920 (0.0007) +[2023-10-14 08:15:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 155582464. Throughput: 0: 1652.8, 1: 1648.3. Samples: 38903590. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 08:15:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:15:43,782][100936] Updated weights for policy 0, policy_version 75930 (0.0008) +[2023-10-14 08:15:45,906][100917] Updated weights for policy 1, policy_version 76042 (0.0007) +[2023-10-14 08:15:46,275][100917] Updated weights for policy 1, policy_version 76052 (0.0010) +[2023-10-14 08:15:46,641][100917] Updated weights for policy 1, policy_version 76062 (0.0008) +[2023-10-14 08:15:47,818][100936] Updated weights for policy 0, policy_version 75940 (0.0010) +[2023-10-14 08:15:48,180][100936] Updated weights for policy 0, policy_version 75950 (0.0010) +[2023-10-14 08:15:48,512][99942] Fps is (10 sec: 13107.7, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 155648000. Throughput: 0: 1652.9, 1: 1648.5. Samples: 38923348. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 08:15:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:15:48,554][100936] Updated weights for policy 0, policy_version 75960 (0.0008) +[2023-10-14 08:15:50,744][100917] Updated weights for policy 1, policy_version 76072 (0.0010) +[2023-10-14 08:15:51,117][100917] Updated weights for policy 1, policy_version 76082 (0.0010) +[2023-10-14 08:15:51,483][100917] Updated weights for policy 1, policy_version 76092 (0.0007) +[2023-10-14 08:15:52,600][100936] Updated weights for policy 0, policy_version 75970 (0.0008) +[2023-10-14 08:15:52,972][100936] Updated weights for policy 0, policy_version 75980 (0.0008) +[2023-10-14 08:15:53,351][100936] Updated weights for policy 0, policy_version 75990 (0.0007) +[2023-10-14 08:15:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 155713536. Throughput: 0: 1645.2, 1: 1651.3. Samples: 38943020. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 08:15:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.720')] +[2023-10-14 08:15:53,724][100936] Updated weights for policy 0, policy_version 76000 (0.0009) +[2023-10-14 08:15:55,669][100917] Updated weights for policy 1, policy_version 76102 (0.0009) +[2023-10-14 08:15:56,039][100917] Updated weights for policy 1, policy_version 76112 (0.0008) +[2023-10-14 08:15:56,415][100917] Updated weights for policy 1, policy_version 76122 (0.0009) +[2023-10-14 08:15:57,908][100936] Updated weights for policy 0, policy_version 76010 (0.0008) +[2023-10-14 08:15:58,273][100936] Updated weights for policy 0, policy_version 76020 (0.0009) +[2023-10-14 08:15:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 155779072. Throughput: 0: 1650.6, 1: 1647.2. Samples: 38953528. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 08:15:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.730')] +[2023-10-14 08:15:58,646][100936] Updated weights for policy 0, policy_version 76030 (0.0008) +[2023-10-14 08:16:00,356][100917] Updated weights for policy 1, policy_version 76132 (0.0010) +[2023-10-14 08:16:00,727][100917] Updated weights for policy 1, policy_version 76142 (0.0009) +[2023-10-14 08:16:01,100][100917] Updated weights for policy 1, policy_version 76152 (0.0007) +[2023-10-14 08:16:02,714][100936] Updated weights for policy 0, policy_version 76040 (0.0007) +[2023-10-14 08:16:03,078][100936] Updated weights for policy 0, policy_version 76050 (0.0007) +[2023-10-14 08:16:03,441][100936] Updated weights for policy 0, policy_version 76060 (0.0008) +[2023-10-14 08:16:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 155844608. Throughput: 0: 1653.5, 1: 1649.0. Samples: 38973386. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) +[2023-10-14 08:16:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.730')] +[2023-10-14 08:16:05,259][100917] Updated weights for policy 1, policy_version 76162 (0.0008) +[2023-10-14 08:16:05,630][100917] Updated weights for policy 1, policy_version 76172 (0.0010) +[2023-10-14 08:16:06,009][100917] Updated weights for policy 1, policy_version 76182 (0.0008) +[2023-10-14 08:16:06,377][100917] Updated weights for policy 1, policy_version 76192 (0.0007) +[2023-10-14 08:16:07,630][100936] Updated weights for policy 0, policy_version 76070 (0.0010) +[2023-10-14 08:16:07,989][100936] Updated weights for policy 0, policy_version 76080 (0.0008) +[2023-10-14 08:16:08,364][100936] Updated weights for policy 0, policy_version 76090 (0.0011) +[2023-10-14 08:16:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 155910144. Throughput: 0: 1646.0, 1: 1649.5. Samples: 38992678. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:16:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.730')] +[2023-10-14 08:16:10,443][100917] Updated weights for policy 1, policy_version 76202 (0.0011) +[2023-10-14 08:16:10,811][100917] Updated weights for policy 1, policy_version 76212 (0.0009) +[2023-10-14 08:16:11,184][100917] Updated weights for policy 1, policy_version 76222 (0.0011) +[2023-10-14 08:16:12,419][100936] Updated weights for policy 0, policy_version 76100 (0.0008) +[2023-10-14 08:16:12,786][100936] Updated weights for policy 0, policy_version 76110 (0.0008) +[2023-10-14 08:16:13,155][100936] Updated weights for policy 0, policy_version 76120 (0.0007) +[2023-10-14 08:16:13,512][99942] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 156008448. Throughput: 0: 1656.8, 1: 1641.5. Samples: 39003018. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:16:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.730')] +[2023-10-14 08:16:15,342][100917] Updated weights for policy 1, policy_version 76232 (0.0010) +[2023-10-14 08:16:15,716][100917] Updated weights for policy 1, policy_version 76242 (0.0010) +[2023-10-14 08:16:16,099][100917] Updated weights for policy 1, policy_version 76252 (0.0009) +[2023-10-14 08:16:17,441][100936] Updated weights for policy 0, policy_version 76130 (0.0008) +[2023-10-14 08:16:17,826][100936] Updated weights for policy 0, policy_version 76140 (0.0009) +[2023-10-14 08:16:18,196][100936] Updated weights for policy 0, policy_version 76150 (0.0009) +[2023-10-14 08:16:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 156041216. Throughput: 0: 1656.8, 1: 1653.1. Samples: 39022864. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:16:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.790')] +[2023-10-14 08:16:18,565][100936] Updated weights for policy 0, policy_version 76160 (0.0008) +[2023-10-14 08:16:20,355][100917] Updated weights for policy 1, policy_version 76262 (0.0009) +[2023-10-14 08:16:20,739][100917] Updated weights for policy 1, policy_version 76272 (0.0009) +[2023-10-14 08:16:21,113][100917] Updated weights for policy 1, policy_version 76282 (0.0010) +[2023-10-14 08:16:22,588][100936] Updated weights for policy 0, policy_version 76170 (0.0008) +[2023-10-14 08:16:22,952][100936] Updated weights for policy 0, policy_version 76180 (0.0008) +[2023-10-14 08:16:23,317][100936] Updated weights for policy 0, policy_version 76190 (0.0009) +[2023-10-14 08:16:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 156139520. Throughput: 0: 1650.1, 1: 1657.9. Samples: 39042072. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:16:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.790')] +[2023-10-14 08:16:25,103][100917] Updated weights for policy 1, policy_version 76292 (0.0008) +[2023-10-14 08:16:25,477][100917] Updated weights for policy 1, policy_version 76302 (0.0008) +[2023-10-14 08:16:25,855][100917] Updated weights for policy 1, policy_version 76312 (0.0009) +[2023-10-14 08:16:27,468][100936] Updated weights for policy 0, policy_version 76200 (0.0007) +[2023-10-14 08:16:27,844][100936] Updated weights for policy 0, policy_version 76210 (0.0007) +[2023-10-14 08:16:28,222][100936] Updated weights for policy 0, policy_version 76220 (0.0007) +[2023-10-14 08:16:28,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 156205056. Throughput: 0: 1667.5, 1: 1645.6. Samples: 39052678. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:16:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.790')] +[2023-10-14 08:16:29,926][100917] Updated weights for policy 1, policy_version 76322 (0.0011) +[2023-10-14 08:16:30,299][100917] Updated weights for policy 1, policy_version 76332 (0.0010) +[2023-10-14 08:16:30,674][100917] Updated weights for policy 1, policy_version 76342 (0.0008) +[2023-10-14 08:16:31,038][100917] Updated weights for policy 1, policy_version 76352 (0.0008) +[2023-10-14 08:16:32,287][100936] Updated weights for policy 0, policy_version 76230 (0.0007) +[2023-10-14 08:16:32,658][100936] Updated weights for policy 0, policy_version 76240 (0.0007) +[2023-10-14 08:16:33,030][100936] Updated weights for policy 0, policy_version 76250 (0.0008) +[2023-10-14 08:16:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 156270592. Throughput: 0: 1660.4, 1: 1660.8. Samples: 39072804. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:16:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.790')] +[2023-10-14 08:16:35,146][100917] Updated weights for policy 1, policy_version 76362 (0.0008) +[2023-10-14 08:16:35,512][100917] Updated weights for policy 1, policy_version 76372 (0.0009) +[2023-10-14 08:16:35,879][100917] Updated weights for policy 1, policy_version 76382 (0.0007) +[2023-10-14 08:16:37,167][100936] Updated weights for policy 0, policy_version 76260 (0.0009) +[2023-10-14 08:16:37,541][100936] Updated weights for policy 0, policy_version 76270 (0.0010) +[2023-10-14 08:16:37,908][100936] Updated weights for policy 0, policy_version 76280 (0.0010) +[2023-10-14 08:16:38,512][99942] Fps is (10 sec: 13106.5, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 156336128. Throughput: 0: 1654.2, 1: 1662.6. Samples: 39092278. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:16:38,514][99942] Avg episode reward: [(0, '1.000'), (1, '0.790')] +[2023-10-14 08:16:39,877][100917] Updated weights for policy 1, policy_version 76392 (0.0009) +[2023-10-14 08:16:40,257][100917] Updated weights for policy 1, policy_version 76402 (0.0007) +[2023-10-14 08:16:40,630][100917] Updated weights for policy 1, policy_version 76412 (0.0010) +[2023-10-14 08:16:42,038][100936] Updated weights for policy 0, policy_version 76290 (0.0008) +[2023-10-14 08:16:42,409][100936] Updated weights for policy 0, policy_version 76300 (0.0007) +[2023-10-14 08:16:42,774][100936] Updated weights for policy 0, policy_version 76310 (0.0008) +[2023-10-14 08:16:43,139][100936] Updated weights for policy 0, policy_version 76320 (0.0008) +[2023-10-14 08:16:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 156401664. Throughput: 0: 1664.6, 1: 1649.3. Samples: 39102656. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:16:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.790')] +[2023-10-14 08:16:44,715][100917] Updated weights for policy 1, policy_version 76422 (0.0009) +[2023-10-14 08:16:45,085][100917] Updated weights for policy 1, policy_version 76432 (0.0010) +[2023-10-14 08:16:45,464][100917] Updated weights for policy 1, policy_version 76442 (0.0008) +[2023-10-14 08:16:47,208][100936] Updated weights for policy 0, policy_version 76330 (0.0009) +[2023-10-14 08:16:47,571][100936] Updated weights for policy 0, policy_version 76340 (0.0010) +[2023-10-14 08:16:47,946][100936] Updated weights for policy 0, policy_version 76350 (0.0010) +[2023-10-14 08:16:48,512][99942] Fps is (10 sec: 13107.8, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 156467200. Throughput: 0: 1649.6, 1: 1661.6. Samples: 39122392. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:16:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:16:49,784][100917] Updated weights for policy 1, policy_version 76452 (0.0009) +[2023-10-14 08:16:50,157][100917] Updated weights for policy 1, policy_version 76462 (0.0011) +[2023-10-14 08:16:50,535][100917] Updated weights for policy 1, policy_version 76472 (0.0009) +[2023-10-14 08:16:52,062][100936] Updated weights for policy 0, policy_version 76360 (0.0010) +[2023-10-14 08:16:52,436][100936] Updated weights for policy 0, policy_version 76370 (0.0011) +[2023-10-14 08:16:52,798][100936] Updated weights for policy 0, policy_version 76380 (0.0010) +[2023-10-14 08:16:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 156532736. Throughput: 0: 1657.1, 1: 1661.5. Samples: 39142014. Policy #0 lag: (min: 31.0, avg: 34.5, max: 63.0) +[2023-10-14 08:16:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:16:54,750][100917] Updated weights for policy 1, policy_version 76482 (0.0010) +[2023-10-14 08:16:55,121][100917] Updated weights for policy 1, policy_version 76492 (0.0009) +[2023-10-14 08:16:55,490][100917] Updated weights for policy 1, policy_version 76502 (0.0007) +[2023-10-14 08:16:55,854][100917] Updated weights for policy 1, policy_version 76512 (0.0007) +[2023-10-14 08:16:57,073][100936] Updated weights for policy 0, policy_version 76390 (0.0008) +[2023-10-14 08:16:57,438][100936] Updated weights for policy 0, policy_version 76400 (0.0008) +[2023-10-14 08:16:57,807][100936] Updated weights for policy 0, policy_version 76410 (0.0007) +[2023-10-14 08:16:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 156598272. Throughput: 0: 1663.6, 1: 1652.8. Samples: 39152254. Policy #0 lag: (min: 31.0, avg: 34.5, max: 63.0) +[2023-10-14 08:16:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:17:00,097][100917] Updated weights for policy 1, policy_version 76522 (0.0010) +[2023-10-14 08:17:00,472][100917] Updated weights for policy 1, policy_version 76532 (0.0011) +[2023-10-14 08:17:00,842][100917] Updated weights for policy 1, policy_version 76542 (0.0009) +[2023-10-14 08:17:01,984][100936] Updated weights for policy 0, policy_version 76420 (0.0008) +[2023-10-14 08:17:02,381][100936] Updated weights for policy 0, policy_version 76430 (0.0009) +[2023-10-14 08:17:02,742][100936] Updated weights for policy 0, policy_version 76440 (0.0009) +[2023-10-14 08:17:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 156663808. Throughput: 0: 1652.3, 1: 1659.3. Samples: 39171884. Policy #0 lag: (min: 31.0, avg: 34.5, max: 63.0) +[2023-10-14 08:17:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:17:05,037][100917] Updated weights for policy 1, policy_version 76552 (0.0009) +[2023-10-14 08:17:05,408][100917] Updated weights for policy 1, policy_version 76562 (0.0007) +[2023-10-14 08:17:05,780][100917] Updated weights for policy 1, policy_version 76572 (0.0008) +[2023-10-14 08:17:06,878][100936] Updated weights for policy 0, policy_version 76450 (0.0009) +[2023-10-14 08:17:07,234][100936] Updated weights for policy 0, policy_version 76460 (0.0009) +[2023-10-14 08:17:07,603][100936] Updated weights for policy 0, policy_version 76470 (0.0008) +[2023-10-14 08:17:07,978][100936] Updated weights for policy 0, policy_version 76480 (0.0007) +[2023-10-14 08:17:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 156729344. Throughput: 0: 1661.6, 1: 1662.5. Samples: 39191654. Policy #0 lag: (min: 31.0, avg: 34.5, max: 63.0) +[2023-10-14 08:17:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:17:09,941][100917] Updated weights for policy 1, policy_version 76582 (0.0009) +[2023-10-14 08:17:10,319][100917] Updated weights for policy 1, policy_version 76592 (0.0009) +[2023-10-14 08:17:10,709][100917] Updated weights for policy 1, policy_version 76602 (0.0009) +[2023-10-14 08:17:12,042][100936] Updated weights for policy 0, policy_version 76490 (0.0008) +[2023-10-14 08:17:12,411][100936] Updated weights for policy 0, policy_version 76500 (0.0009) +[2023-10-14 08:17:12,775][100936] Updated weights for policy 0, policy_version 76510 (0.0010) +[2023-10-14 08:17:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 156794880. Throughput: 0: 1662.4, 1: 1649.7. Samples: 39201722. Policy #0 lag: (min: 31.0, avg: 34.5, max: 63.0) +[2023-10-14 08:17:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:17:14,797][100917] Updated weights for policy 1, policy_version 76612 (0.0010) +[2023-10-14 08:17:15,166][100917] Updated weights for policy 1, policy_version 76622 (0.0008) +[2023-10-14 08:17:15,538][100917] Updated weights for policy 1, policy_version 76632 (0.0008) +[2023-10-14 08:17:17,022][100936] Updated weights for policy 0, policy_version 76520 (0.0008) +[2023-10-14 08:17:17,398][100936] Updated weights for policy 0, policy_version 76530 (0.0010) +[2023-10-14 08:17:17,763][100936] Updated weights for policy 0, policy_version 76540 (0.0012) +[2023-10-14 08:17:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 156860416. Throughput: 0: 1647.4, 1: 1650.8. Samples: 39221224. Policy #0 lag: (min: 31.0, avg: 34.5, max: 63.0) +[2023-10-14 08:17:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:17:19,579][100917] Updated weights for policy 1, policy_version 76642 (0.0011) +[2023-10-14 08:17:19,952][100917] Updated weights for policy 1, policy_version 76652 (0.0009) +[2023-10-14 08:17:20,319][100917] Updated weights for policy 1, policy_version 76662 (0.0008) +[2023-10-14 08:17:20,699][100917] Updated weights for policy 1, policy_version 76672 (0.0007) +[2023-10-14 08:17:21,771][100936] Updated weights for policy 0, policy_version 76550 (0.0008) +[2023-10-14 08:17:22,144][100936] Updated weights for policy 0, policy_version 76560 (0.0007) +[2023-10-14 08:17:22,515][100936] Updated weights for policy 0, policy_version 76570 (0.0007) +[2023-10-14 08:17:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 156925952. Throughput: 0: 1662.0, 1: 1652.5. Samples: 39241430. Policy #0 lag: (min: 31.0, avg: 34.5, max: 63.0) +[2023-10-14 08:17:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:17:24,764][100917] Updated weights for policy 1, policy_version 76682 (0.0008) +[2023-10-14 08:17:25,133][100917] Updated weights for policy 1, policy_version 76692 (0.0008) +[2023-10-14 08:17:25,512][100917] Updated weights for policy 1, policy_version 76702 (0.0009) +[2023-10-14 08:17:26,631][100936] Updated weights for policy 0, policy_version 76580 (0.0007) +[2023-10-14 08:17:26,991][100936] Updated weights for policy 0, policy_version 76590 (0.0008) +[2023-10-14 08:17:27,357][100936] Updated weights for policy 0, policy_version 76600 (0.0012) +[2023-10-14 08:17:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 156991488. Throughput: 0: 1664.0, 1: 1647.7. Samples: 39251684. Policy #0 lag: (min: 31.0, avg: 34.5, max: 63.0) +[2023-10-14 08:17:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:17:29,542][100917] Updated weights for policy 1, policy_version 76712 (0.0009) +[2023-10-14 08:17:29,920][100917] Updated weights for policy 1, policy_version 76722 (0.0008) +[2023-10-14 08:17:30,291][100917] Updated weights for policy 1, policy_version 76732 (0.0010) +[2023-10-14 08:17:31,437][100936] Updated weights for policy 0, policy_version 76610 (0.0008) +[2023-10-14 08:17:31,804][100936] Updated weights for policy 0, policy_version 76620 (0.0009) +[2023-10-14 08:17:32,173][100936] Updated weights for policy 0, policy_version 76630 (0.0008) +[2023-10-14 08:17:32,550][100936] Updated weights for policy 0, policy_version 76640 (0.0007) +[2023-10-14 08:17:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 157057024. Throughput: 0: 1654.8, 1: 1655.3. Samples: 39271344. Policy #0 lag: (min: 31.0, avg: 34.5, max: 63.0) +[2023-10-14 08:17:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:17:34,411][100917] Updated weights for policy 1, policy_version 76742 (0.0009) +[2023-10-14 08:17:34,787][100917] Updated weights for policy 1, policy_version 76752 (0.0009) +[2023-10-14 08:17:35,150][100917] Updated weights for policy 1, policy_version 76762 (0.0008) +[2023-10-14 08:17:36,664][100936] Updated weights for policy 0, policy_version 76650 (0.0008) +[2023-10-14 08:17:37,034][100936] Updated weights for policy 0, policy_version 76660 (0.0009) +[2023-10-14 08:17:37,411][100936] Updated weights for policy 0, policy_version 76670 (0.0008) +[2023-10-14 08:17:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 157122560. Throughput: 0: 1666.5, 1: 1658.4. Samples: 39291634. Policy #0 lag: (min: 31.0, avg: 34.5, max: 63.0) +[2023-10-14 08:17:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:17:38,520][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000076672_78512128.pth... +[2023-10-14 08:17:38,521][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000076768_78610432.pth... +[2023-10-14 08:17:38,561][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000075104_76906496.pth +[2023-10-14 08:17:38,563][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000075232_77037568.pth +[2023-10-14 08:17:39,214][100917] Updated weights for policy 1, policy_version 76772 (0.0009) +[2023-10-14 08:17:39,593][100917] Updated weights for policy 1, policy_version 76782 (0.0007) +[2023-10-14 08:17:39,964][100917] Updated weights for policy 1, policy_version 76792 (0.0007) +[2023-10-14 08:17:41,439][100936] Updated weights for policy 0, policy_version 76680 (0.0009) +[2023-10-14 08:17:41,809][100936] Updated weights for policy 0, policy_version 76690 (0.0007) +[2023-10-14 08:17:42,175][100936] Updated weights for policy 0, policy_version 76700 (0.0007) +[2023-10-14 08:17:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 157188096. Throughput: 0: 1659.9, 1: 1660.5. Samples: 39301670. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:17:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:17:44,004][100917] Updated weights for policy 1, policy_version 76802 (0.0010) +[2023-10-14 08:17:44,377][100917] Updated weights for policy 1, policy_version 76812 (0.0009) +[2023-10-14 08:17:44,755][100917] Updated weights for policy 1, policy_version 76822 (0.0009) +[2023-10-14 08:17:45,117][100917] Updated weights for policy 1, policy_version 76832 (0.0010) +[2023-10-14 08:17:46,412][100936] Updated weights for policy 0, policy_version 76710 (0.0008) +[2023-10-14 08:17:46,798][100936] Updated weights for policy 0, policy_version 76720 (0.0008) +[2023-10-14 08:17:47,159][100936] Updated weights for policy 0, policy_version 76730 (0.0011) +[2023-10-14 08:17:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 157253632. Throughput: 0: 1652.6, 1: 1664.9. Samples: 39321172. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:17:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:17:49,195][100917] Updated weights for policy 1, policy_version 76842 (0.0008) +[2023-10-14 08:17:49,580][100917] Updated weights for policy 1, policy_version 76852 (0.0007) +[2023-10-14 08:17:49,953][100917] Updated weights for policy 1, policy_version 76862 (0.0008) +[2023-10-14 08:17:51,356][100936] Updated weights for policy 0, policy_version 76740 (0.0010) +[2023-10-14 08:17:51,727][100936] Updated weights for policy 0, policy_version 76750 (0.0009) +[2023-10-14 08:17:52,099][100936] Updated weights for policy 0, policy_version 76760 (0.0007) +[2023-10-14 08:17:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 157319168. Throughput: 0: 1660.4, 1: 1668.8. Samples: 39341468. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:17:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:17:53,966][100917] Updated weights for policy 1, policy_version 76872 (0.0010) +[2023-10-14 08:17:54,338][100917] Updated weights for policy 1, policy_version 76882 (0.0007) +[2023-10-14 08:17:54,703][100917] Updated weights for policy 1, policy_version 76892 (0.0010) +[2023-10-14 08:17:56,163][100936] Updated weights for policy 0, policy_version 76770 (0.0010) +[2023-10-14 08:17:56,531][100936] Updated weights for policy 0, policy_version 76780 (0.0008) +[2023-10-14 08:17:56,909][100936] Updated weights for policy 0, policy_version 76790 (0.0009) +[2023-10-14 08:17:57,283][100936] Updated weights for policy 0, policy_version 76800 (0.0011) +[2023-10-14 08:17:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 157384704. Throughput: 0: 1652.6, 1: 1673.0. Samples: 39351376. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:17:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:17:58,863][100917] Updated weights for policy 1, policy_version 76902 (0.0008) +[2023-10-14 08:17:59,243][100917] Updated weights for policy 1, policy_version 76912 (0.0009) +[2023-10-14 08:17:59,613][100917] Updated weights for policy 1, policy_version 76922 (0.0009) +[2023-10-14 08:18:01,323][100936] Updated weights for policy 0, policy_version 76810 (0.0008) +[2023-10-14 08:18:01,697][100936] Updated weights for policy 0, policy_version 76820 (0.0008) +[2023-10-14 08:18:02,060][100936] Updated weights for policy 0, policy_version 76830 (0.0007) +[2023-10-14 08:18:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 157450240. Throughput: 0: 1653.0, 1: 1675.8. Samples: 39371022. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:18:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:18:03,715][100917] Updated weights for policy 1, policy_version 76932 (0.0010) +[2023-10-14 08:18:04,090][100917] Updated weights for policy 1, policy_version 76942 (0.0010) +[2023-10-14 08:18:04,457][100917] Updated weights for policy 1, policy_version 76952 (0.0009) +[2023-10-14 08:18:06,336][100936] Updated weights for policy 0, policy_version 76840 (0.0008) +[2023-10-14 08:18:06,698][100936] Updated weights for policy 0, policy_version 76850 (0.0010) +[2023-10-14 08:18:07,067][100936] Updated weights for policy 0, policy_version 76860 (0.0008) +[2023-10-14 08:18:08,483][100917] Updated weights for policy 1, policy_version 76962 (0.0011) +[2023-10-14 08:18:08,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 157515776. Throughput: 0: 1659.8, 1: 1673.5. Samples: 39391430. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:18:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:18:08,851][100917] Updated weights for policy 1, policy_version 76972 (0.0012) +[2023-10-14 08:18:09,232][100917] Updated weights for policy 1, policy_version 76982 (0.0010) +[2023-10-14 08:18:09,598][100917] Updated weights for policy 1, policy_version 76992 (0.0009) +[2023-10-14 08:18:11,175][100936] Updated weights for policy 0, policy_version 76870 (0.0008) +[2023-10-14 08:18:11,540][100936] Updated weights for policy 0, policy_version 76880 (0.0009) +[2023-10-14 08:18:11,911][100936] Updated weights for policy 0, policy_version 76890 (0.0007) +[2023-10-14 08:18:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 157581312. Throughput: 0: 1647.6, 1: 1677.6. Samples: 39401318. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:18:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:18:13,719][100917] Updated weights for policy 1, policy_version 77002 (0.0008) +[2023-10-14 08:18:14,079][100917] Updated weights for policy 1, policy_version 77012 (0.0011) +[2023-10-14 08:18:14,446][100917] Updated weights for policy 1, policy_version 77022 (0.0010) +[2023-10-14 08:18:15,960][100936] Updated weights for policy 0, policy_version 76900 (0.0010) +[2023-10-14 08:18:16,337][100936] Updated weights for policy 0, policy_version 76910 (0.0007) +[2023-10-14 08:18:16,700][100936] Updated weights for policy 0, policy_version 76920 (0.0009) +[2023-10-14 08:18:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 157646848. Throughput: 0: 1647.7, 1: 1677.2. Samples: 39420966. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:18:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:18:18,669][100917] Updated weights for policy 1, policy_version 77032 (0.0009) +[2023-10-14 08:18:19,039][100917] Updated weights for policy 1, policy_version 77042 (0.0011) +[2023-10-14 08:18:19,419][100917] Updated weights for policy 1, policy_version 77052 (0.0010) +[2023-10-14 08:18:20,921][100936] Updated weights for policy 0, policy_version 76930 (0.0009) +[2023-10-14 08:18:21,298][100936] Updated weights for policy 0, policy_version 76940 (0.0008) +[2023-10-14 08:18:21,660][100936] Updated weights for policy 0, policy_version 76950 (0.0009) +[2023-10-14 08:18:22,033][100936] Updated weights for policy 0, policy_version 76960 (0.0008) +[2023-10-14 08:18:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 157712384. Throughput: 0: 1649.2, 1: 1675.9. Samples: 39441264. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:18:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:18:23,531][100917] Updated weights for policy 1, policy_version 77062 (0.0008) +[2023-10-14 08:18:23,904][100917] Updated weights for policy 1, policy_version 77072 (0.0009) +[2023-10-14 08:18:24,280][100917] Updated weights for policy 1, policy_version 77082 (0.0011) +[2023-10-14 08:18:26,251][100936] Updated weights for policy 0, policy_version 76970 (0.0008) +[2023-10-14 08:18:26,614][100936] Updated weights for policy 0, policy_version 76980 (0.0008) +[2023-10-14 08:18:26,987][100936] Updated weights for policy 0, policy_version 76990 (0.0008) +[2023-10-14 08:18:28,428][100917] Updated weights for policy 1, policy_version 77092 (0.0007) +[2023-10-14 08:18:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 157777920. Throughput: 0: 1641.6, 1: 1673.4. Samples: 39450842. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:18:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:18:28,806][100917] Updated weights for policy 1, policy_version 77102 (0.0011) +[2023-10-14 08:18:29,178][100917] Updated weights for policy 1, policy_version 77112 (0.0010) +[2023-10-14 08:18:31,117][100936] Updated weights for policy 0, policy_version 77000 (0.0009) +[2023-10-14 08:18:31,480][100936] Updated weights for policy 0, policy_version 77010 (0.0008) +[2023-10-14 08:18:31,861][100936] Updated weights for policy 0, policy_version 77020 (0.0007) +[2023-10-14 08:18:33,310][100917] Updated weights for policy 1, policy_version 77122 (0.0008) +[2023-10-14 08:18:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 157843456. Throughput: 0: 1653.1, 1: 1671.1. Samples: 39470760. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 08:18:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:18:33,684][100917] Updated weights for policy 1, policy_version 77132 (0.0010) +[2023-10-14 08:18:34,050][100917] Updated weights for policy 1, policy_version 77142 (0.0008) +[2023-10-14 08:18:34,429][100917] Updated weights for policy 1, policy_version 77152 (0.0007) +[2023-10-14 08:18:36,090][100936] Updated weights for policy 0, policy_version 77030 (0.0007) +[2023-10-14 08:18:36,478][100936] Updated weights for policy 0, policy_version 77040 (0.0008) +[2023-10-14 08:18:36,856][100936] Updated weights for policy 0, policy_version 77050 (0.0007) +[2023-10-14 08:18:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 157908992. Throughput: 0: 1657.3, 1: 1666.0. Samples: 39491018. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 08:18:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:18:38,577][100917] Updated weights for policy 1, policy_version 77162 (0.0009) +[2023-10-14 08:18:38,947][100917] Updated weights for policy 1, policy_version 77172 (0.0010) +[2023-10-14 08:18:39,316][100917] Updated weights for policy 1, policy_version 77182 (0.0010) +[2023-10-14 08:18:40,854][100936] Updated weights for policy 0, policy_version 77060 (0.0007) +[2023-10-14 08:18:41,233][100936] Updated weights for policy 0, policy_version 77070 (0.0007) +[2023-10-14 08:18:41,596][100936] Updated weights for policy 0, policy_version 77080 (0.0011) +[2023-10-14 08:18:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 157974528. Throughput: 0: 1651.3, 1: 1662.4. Samples: 39500492. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 08:18:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:18:43,586][100917] Updated weights for policy 1, policy_version 77192 (0.0009) +[2023-10-14 08:18:43,948][100917] Updated weights for policy 1, policy_version 77202 (0.0011) +[2023-10-14 08:18:44,317][100917] Updated weights for policy 1, policy_version 77212 (0.0009) +[2023-10-14 08:18:45,650][100936] Updated weights for policy 0, policy_version 77090 (0.0007) +[2023-10-14 08:18:46,021][100936] Updated weights for policy 0, policy_version 77100 (0.0008) +[2023-10-14 08:18:46,393][100936] Updated weights for policy 0, policy_version 77110 (0.0007) +[2023-10-14 08:18:46,765][100936] Updated weights for policy 0, policy_version 77120 (0.0009) +[2023-10-14 08:18:48,364][100917] Updated weights for policy 1, policy_version 77222 (0.0009) +[2023-10-14 08:18:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 158040064. Throughput: 0: 1657.7, 1: 1667.4. Samples: 39520652. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 08:18:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:18:48,736][100917] Updated weights for policy 1, policy_version 77232 (0.0009) +[2023-10-14 08:18:49,107][100917] Updated weights for policy 1, policy_version 77242 (0.0010) +[2023-10-14 08:18:50,792][100936] Updated weights for policy 0, policy_version 77130 (0.0007) +[2023-10-14 08:18:51,153][100936] Updated weights for policy 0, policy_version 77140 (0.0009) +[2023-10-14 08:18:51,529][100936] Updated weights for policy 0, policy_version 77150 (0.0010) +[2023-10-14 08:18:53,265][100917] Updated weights for policy 1, policy_version 77252 (0.0008) +[2023-10-14 08:18:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 158105600. Throughput: 0: 1659.2, 1: 1661.8. Samples: 39540874. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 08:18:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:18:53,635][100917] Updated weights for policy 1, policy_version 77262 (0.0008) +[2023-10-14 08:18:54,016][100917] Updated weights for policy 1, policy_version 77272 (0.0007) +[2023-10-14 08:18:55,709][100936] Updated weights for policy 0, policy_version 77160 (0.0010) +[2023-10-14 08:18:56,078][100936] Updated weights for policy 0, policy_version 77170 (0.0007) +[2023-10-14 08:18:56,449][100936] Updated weights for policy 0, policy_version 77180 (0.0007) +[2023-10-14 08:18:58,109][100917] Updated weights for policy 1, policy_version 77282 (0.0011) +[2023-10-14 08:18:58,488][100917] Updated weights for policy 1, policy_version 77292 (0.0009) +[2023-10-14 08:18:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 158171136. Throughput: 0: 1647.3, 1: 1661.0. Samples: 39550192. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 08:18:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:18:58,863][100917] Updated weights for policy 1, policy_version 77302 (0.0007) +[2023-10-14 08:18:59,230][100917] Updated weights for policy 1, policy_version 77312 (0.0007) +[2023-10-14 08:19:00,474][100936] Updated weights for policy 0, policy_version 77190 (0.0008) +[2023-10-14 08:19:00,845][100936] Updated weights for policy 0, policy_version 77200 (0.0011) +[2023-10-14 08:19:01,215][100936] Updated weights for policy 0, policy_version 77210 (0.0009) +[2023-10-14 08:19:03,377][100917] Updated weights for policy 1, policy_version 77322 (0.0008) +[2023-10-14 08:19:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 158236672. Throughput: 0: 1664.6, 1: 1654.0. Samples: 39570306. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 08:19:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:19:03,745][100917] Updated weights for policy 1, policy_version 77332 (0.0010) +[2023-10-14 08:19:04,123][100917] Updated weights for policy 1, policy_version 77342 (0.0009) +[2023-10-14 08:19:05,435][100936] Updated weights for policy 0, policy_version 77220 (0.0008) +[2023-10-14 08:19:05,810][100936] Updated weights for policy 0, policy_version 77230 (0.0007) +[2023-10-14 08:19:06,176][100936] Updated weights for policy 0, policy_version 77240 (0.0008) +[2023-10-14 08:19:08,181][100917] Updated weights for policy 1, policy_version 77352 (0.0007) +[2023-10-14 08:19:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 158302208. Throughput: 0: 1666.4, 1: 1654.0. Samples: 39590678. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 08:19:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:19:08,547][100917] Updated weights for policy 1, policy_version 77362 (0.0007) +[2023-10-14 08:19:08,923][100917] Updated weights for policy 1, policy_version 77372 (0.0008) +[2023-10-14 08:19:10,339][100936] Updated weights for policy 0, policy_version 77250 (0.0010) +[2023-10-14 08:19:10,710][100936] Updated weights for policy 0, policy_version 77260 (0.0007) +[2023-10-14 08:19:11,077][100936] Updated weights for policy 0, policy_version 77270 (0.0010) +[2023-10-14 08:19:11,449][100936] Updated weights for policy 0, policy_version 77280 (0.0008) +[2023-10-14 08:19:12,937][100917] Updated weights for policy 1, policy_version 77382 (0.0008) +[2023-10-14 08:19:13,317][100917] Updated weights for policy 1, policy_version 77392 (0.0010) +[2023-10-14 08:19:13,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 158367744. Throughput: 0: 1655.5, 1: 1660.5. Samples: 39600066. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 08:19:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:19:13,685][100917] Updated weights for policy 1, policy_version 77402 (0.0008) +[2023-10-14 08:19:15,379][100936] Updated weights for policy 0, policy_version 77290 (0.0009) +[2023-10-14 08:19:15,747][100936] Updated weights for policy 0, policy_version 77300 (0.0009) +[2023-10-14 08:19:16,112][100936] Updated weights for policy 0, policy_version 77310 (0.0009) +[2023-10-14 08:19:17,804][100917] Updated weights for policy 1, policy_version 77412 (0.0008) +[2023-10-14 08:19:18,175][100917] Updated weights for policy 1, policy_version 77422 (0.0008) +[2023-10-14 08:19:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 158433280. Throughput: 0: 1663.3, 1: 1661.0. Samples: 39620352. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) +[2023-10-14 08:19:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:19:18,551][100917] Updated weights for policy 1, policy_version 77432 (0.0007) +[2023-10-14 08:19:20,099][100936] Updated weights for policy 0, policy_version 77320 (0.0008) +[2023-10-14 08:19:20,470][100936] Updated weights for policy 0, policy_version 77330 (0.0009) +[2023-10-14 08:19:20,840][100936] Updated weights for policy 0, policy_version 77340 (0.0008) +[2023-10-14 08:19:22,607][100917] Updated weights for policy 1, policy_version 77442 (0.0007) +[2023-10-14 08:19:22,981][100917] Updated weights for policy 1, policy_version 77452 (0.0009) +[2023-10-14 08:19:23,346][100917] Updated weights for policy 1, policy_version 77462 (0.0009) +[2023-10-14 08:19:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 158498816. Throughput: 0: 1664.3, 1: 1653.1. Samples: 39640300. Policy #0 lag: (min: 8.0, avg: 30.0, max: 40.0) +[2023-10-14 08:19:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:19:23,715][100917] Updated weights for policy 1, policy_version 77472 (0.0009) +[2023-10-14 08:19:25,218][100936] Updated weights for policy 0, policy_version 77350 (0.0008) +[2023-10-14 08:19:25,606][100936] Updated weights for policy 0, policy_version 77360 (0.0010) +[2023-10-14 08:19:25,983][100936] Updated weights for policy 0, policy_version 77370 (0.0008) +[2023-10-14 08:19:27,899][100917] Updated weights for policy 1, policy_version 77482 (0.0007) +[2023-10-14 08:19:28,273][100917] Updated weights for policy 1, policy_version 77492 (0.0007) +[2023-10-14 08:19:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 158564352. Throughput: 0: 1644.6, 1: 1668.2. Samples: 39649568. Policy #0 lag: (min: 8.0, avg: 30.0, max: 40.0) +[2023-10-14 08:19:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:19:28,639][100917] Updated weights for policy 1, policy_version 77502 (0.0008) +[2023-10-14 08:19:30,048][100936] Updated weights for policy 0, policy_version 77380 (0.0009) +[2023-10-14 08:19:30,417][100936] Updated weights for policy 0, policy_version 77390 (0.0007) +[2023-10-14 08:19:30,781][100936] Updated weights for policy 0, policy_version 77400 (0.0008) +[2023-10-14 08:19:32,895][100917] Updated weights for policy 1, policy_version 77512 (0.0008) +[2023-10-14 08:19:33,280][100917] Updated weights for policy 1, policy_version 77522 (0.0009) +[2023-10-14 08:19:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 158629888. Throughput: 0: 1657.8, 1: 1664.9. Samples: 39670174. Policy #0 lag: (min: 8.0, avg: 30.0, max: 40.0) +[2023-10-14 08:19:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:19:33,652][100917] Updated weights for policy 1, policy_version 77532 (0.0008) +[2023-10-14 08:19:35,008][100936] Updated weights for policy 0, policy_version 77410 (0.0009) +[2023-10-14 08:19:35,365][100936] Updated weights for policy 0, policy_version 77420 (0.0008) +[2023-10-14 08:19:35,738][100936] Updated weights for policy 0, policy_version 77430 (0.0007) +[2023-10-14 08:19:36,109][100936] Updated weights for policy 0, policy_version 77440 (0.0009) +[2023-10-14 08:19:37,607][100917] Updated weights for policy 1, policy_version 77542 (0.0011) +[2023-10-14 08:19:37,984][100917] Updated weights for policy 1, policy_version 77552 (0.0009) +[2023-10-14 08:19:38,343][100917] Updated weights for policy 1, policy_version 77562 (0.0009) +[2023-10-14 08:19:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 158695424. Throughput: 0: 1655.9, 1: 1655.2. Samples: 39689874. Policy #0 lag: (min: 8.0, avg: 30.0, max: 40.0) +[2023-10-14 08:19:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.920')] +[2023-10-14 08:19:38,521][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000077440_79298560.pth... +[2023-10-14 08:19:38,552][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000075904_77725696.pth +[2023-10-14 08:19:38,556][100560] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p0/milestones/checkpoint_000077440_79298560.pth +[2023-10-14 08:19:38,566][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000077568_79429632.pth... +[2023-10-14 08:19:38,595][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000076000_77824000.pth +[2023-10-14 08:19:38,599][100681] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p1/milestones/checkpoint_000077568_79429632.pth +[2023-10-14 08:19:40,363][100936] Updated weights for policy 0, policy_version 77450 (0.0009) +[2023-10-14 08:19:40,726][100936] Updated weights for policy 0, policy_version 77460 (0.0007) +[2023-10-14 08:19:41,096][100936] Updated weights for policy 0, policy_version 77470 (0.0009) +[2023-10-14 08:19:42,407][100917] Updated weights for policy 1, policy_version 77572 (0.0008) +[2023-10-14 08:19:42,792][100917] Updated weights for policy 1, policy_version 77582 (0.0009) +[2023-10-14 08:19:43,165][100917] Updated weights for policy 1, policy_version 77592 (0.0009) +[2023-10-14 08:19:43,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 158793728. Throughput: 0: 1648.1, 1: 1665.8. Samples: 39699316. Policy #0 lag: (min: 8.0, avg: 30.0, max: 40.0) +[2023-10-14 08:19:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:19:45,354][100936] Updated weights for policy 0, policy_version 77480 (0.0012) +[2023-10-14 08:19:45,724][100936] Updated weights for policy 0, policy_version 77490 (0.0010) +[2023-10-14 08:19:46,085][100936] Updated weights for policy 0, policy_version 77500 (0.0008) +[2023-10-14 08:19:47,339][100917] Updated weights for policy 1, policy_version 77602 (0.0008) +[2023-10-14 08:19:47,705][100917] Updated weights for policy 1, policy_version 77612 (0.0008) +[2023-10-14 08:19:48,067][100917] Updated weights for policy 1, policy_version 77622 (0.0008) +[2023-10-14 08:19:48,442][100917] Updated weights for policy 1, policy_version 77632 (0.0009) +[2023-10-14 08:19:48,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 158859264. Throughput: 0: 1649.2, 1: 1666.8. Samples: 39719522. Policy #0 lag: (min: 8.0, avg: 30.0, max: 40.0) +[2023-10-14 08:19:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:19:50,276][100936] Updated weights for policy 0, policy_version 77510 (0.0008) +[2023-10-14 08:19:50,652][100936] Updated weights for policy 0, policy_version 77520 (0.0008) +[2023-10-14 08:19:51,023][100936] Updated weights for policy 0, policy_version 77530 (0.0010) +[2023-10-14 08:19:52,663][100917] Updated weights for policy 1, policy_version 77642 (0.0007) +[2023-10-14 08:19:53,028][100917] Updated weights for policy 1, policy_version 77652 (0.0007) +[2023-10-14 08:19:53,406][100917] Updated weights for policy 1, policy_version 77662 (0.0009) +[2023-10-14 08:19:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13329.3). Total num frames: 158924800. Throughput: 0: 1649.5, 1: 1651.4. Samples: 39739220. Policy #0 lag: (min: 8.0, avg: 30.0, max: 40.0) +[2023-10-14 08:19:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:19:55,110][100936] Updated weights for policy 0, policy_version 77540 (0.0010) +[2023-10-14 08:19:55,483][100936] Updated weights for policy 0, policy_version 77550 (0.0009) +[2023-10-14 08:19:55,852][100936] Updated weights for policy 0, policy_version 77560 (0.0008) +[2023-10-14 08:19:57,398][100917] Updated weights for policy 1, policy_version 77672 (0.0010) +[2023-10-14 08:19:57,774][100917] Updated weights for policy 1, policy_version 77682 (0.0010) +[2023-10-14 08:19:58,137][100917] Updated weights for policy 1, policy_version 77692 (0.0009) +[2023-10-14 08:19:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 158990336. Throughput: 0: 1646.9, 1: 1664.1. Samples: 39749058. Policy #0 lag: (min: 8.0, avg: 30.0, max: 40.0) +[2023-10-14 08:19:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:20:00,130][100936] Updated weights for policy 0, policy_version 77570 (0.0007) +[2023-10-14 08:20:00,498][100936] Updated weights for policy 0, policy_version 77580 (0.0008) +[2023-10-14 08:20:00,861][100936] Updated weights for policy 0, policy_version 77590 (0.0011) +[2023-10-14 08:20:01,230][100936] Updated weights for policy 0, policy_version 77600 (0.0010) +[2023-10-14 08:20:02,340][100917] Updated weights for policy 1, policy_version 77702 (0.0009) +[2023-10-14 08:20:02,716][100917] Updated weights for policy 1, policy_version 77712 (0.0010) +[2023-10-14 08:20:03,087][100917] Updated weights for policy 1, policy_version 77722 (0.0009) +[2023-10-14 08:20:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 159055872. Throughput: 0: 1649.4, 1: 1664.3. Samples: 39769472. Policy #0 lag: (min: 8.0, avg: 30.0, max: 40.0) +[2023-10-14 08:20:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:20:05,439][100936] Updated weights for policy 0, policy_version 77610 (0.0007) +[2023-10-14 08:20:05,804][100936] Updated weights for policy 0, policy_version 77620 (0.0007) +[2023-10-14 08:20:06,174][100936] Updated weights for policy 0, policy_version 77630 (0.0010) +[2023-10-14 08:20:07,272][100917] Updated weights for policy 1, policy_version 77732 (0.0008) +[2023-10-14 08:20:07,650][100917] Updated weights for policy 1, policy_version 77742 (0.0010) +[2023-10-14 08:20:08,021][100917] Updated weights for policy 1, policy_version 77752 (0.0007) +[2023-10-14 08:20:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 159121408. Throughput: 0: 1649.7, 1: 1651.4. Samples: 39788852. Policy #0 lag: (min: 8.0, avg: 30.0, max: 40.0) +[2023-10-14 08:20:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:20:10,423][100936] Updated weights for policy 0, policy_version 77640 (0.0010) +[2023-10-14 08:20:10,808][100936] Updated weights for policy 0, policy_version 77650 (0.0011) +[2023-10-14 08:20:11,173][100936] Updated weights for policy 0, policy_version 77660 (0.0008) +[2023-10-14 08:20:12,149][100917] Updated weights for policy 1, policy_version 77762 (0.0008) +[2023-10-14 08:20:12,527][100917] Updated weights for policy 1, policy_version 77772 (0.0009) +[2023-10-14 08:20:12,905][100917] Updated weights for policy 1, policy_version 77782 (0.0008) +[2023-10-14 08:20:13,276][100917] Updated weights for policy 1, policy_version 77792 (0.0009) +[2023-10-14 08:20:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 159186944. Throughput: 0: 1647.4, 1: 1656.7. Samples: 39798254. Policy #0 lag: (min: 31.0, avg: 53.2, max: 63.0) +[2023-10-14 08:20:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:20:15,329][100936] Updated weights for policy 0, policy_version 77670 (0.0011) +[2023-10-14 08:20:15,701][100936] Updated weights for policy 0, policy_version 77680 (0.0011) +[2023-10-14 08:20:16,069][100936] Updated weights for policy 0, policy_version 77690 (0.0011) +[2023-10-14 08:20:17,534][100917] Updated weights for policy 1, policy_version 77802 (0.0009) +[2023-10-14 08:20:17,915][100917] Updated weights for policy 1, policy_version 77812 (0.0008) +[2023-10-14 08:20:18,287][100917] Updated weights for policy 1, policy_version 77822 (0.0009) +[2023-10-14 08:20:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 159252480. Throughput: 0: 1642.8, 1: 1653.9. Samples: 39818522. Policy #0 lag: (min: 31.0, avg: 53.2, max: 63.0) +[2023-10-14 08:20:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:20:20,118][100936] Updated weights for policy 0, policy_version 77700 (0.0008) +[2023-10-14 08:20:20,488][100936] Updated weights for policy 0, policy_version 77710 (0.0008) +[2023-10-14 08:20:20,861][100936] Updated weights for policy 0, policy_version 77720 (0.0009) +[2023-10-14 08:20:22,252][100917] Updated weights for policy 1, policy_version 77832 (0.0008) +[2023-10-14 08:20:22,622][100917] Updated weights for policy 1, policy_version 77842 (0.0011) +[2023-10-14 08:20:22,993][100917] Updated weights for policy 1, policy_version 77852 (0.0011) +[2023-10-14 08:20:23,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 159318016. Throughput: 0: 1644.6, 1: 1647.2. Samples: 39838006. Policy #0 lag: (min: 31.0, avg: 53.2, max: 63.0) +[2023-10-14 08:20:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:20:24,871][100936] Updated weights for policy 0, policy_version 77730 (0.0009) +[2023-10-14 08:20:25,241][100936] Updated weights for policy 0, policy_version 77740 (0.0007) +[2023-10-14 08:20:25,613][100936] Updated weights for policy 0, policy_version 77750 (0.0011) +[2023-10-14 08:20:25,968][100936] Updated weights for policy 0, policy_version 77760 (0.0010) +[2023-10-14 08:20:27,027][100917] Updated weights for policy 1, policy_version 77862 (0.0008) +[2023-10-14 08:20:27,390][100917] Updated weights for policy 1, policy_version 77872 (0.0009) +[2023-10-14 08:20:27,771][100917] Updated weights for policy 1, policy_version 77882 (0.0010) +[2023-10-14 08:20:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 159383552. Throughput: 0: 1645.9, 1: 1659.9. Samples: 39848078. Policy #0 lag: (min: 31.0, avg: 53.2, max: 63.0) +[2023-10-14 08:20:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:20:29,936][100936] Updated weights for policy 0, policy_version 77770 (0.0008) +[2023-10-14 08:20:30,303][100936] Updated weights for policy 0, policy_version 77780 (0.0008) +[2023-10-14 08:20:30,666][100936] Updated weights for policy 0, policy_version 77790 (0.0007) +[2023-10-14 08:20:31,849][100917] Updated weights for policy 1, policy_version 77892 (0.0009) +[2023-10-14 08:20:32,224][100917] Updated weights for policy 1, policy_version 77902 (0.0008) +[2023-10-14 08:20:32,601][100917] Updated weights for policy 1, policy_version 77912 (0.0007) +[2023-10-14 08:20:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 159449088. Throughput: 0: 1653.2, 1: 1658.4. Samples: 39868546. Policy #0 lag: (min: 31.0, avg: 53.2, max: 63.0) +[2023-10-14 08:20:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:20:34,661][100936] Updated weights for policy 0, policy_version 77800 (0.0008) +[2023-10-14 08:20:35,020][100936] Updated weights for policy 0, policy_version 77810 (0.0008) +[2023-10-14 08:20:35,393][100936] Updated weights for policy 0, policy_version 77820 (0.0007) +[2023-10-14 08:20:36,831][100917] Updated weights for policy 1, policy_version 77922 (0.0011) +[2023-10-14 08:20:37,213][100917] Updated weights for policy 1, policy_version 77932 (0.0010) +[2023-10-14 08:20:37,585][100917] Updated weights for policy 1, policy_version 77942 (0.0010) +[2023-10-14 08:20:37,955][100917] Updated weights for policy 1, policy_version 77952 (0.0011) +[2023-10-14 08:20:38,512][99942] Fps is (10 sec: 13106.6, 60 sec: 13653.2, 300 sec: 13329.3). Total num frames: 159514624. Throughput: 0: 1659.9, 1: 1647.2. Samples: 39888040. Policy #0 lag: (min: 31.0, avg: 53.2, max: 63.0) +[2023-10-14 08:20:38,514][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:20:39,638][100936] Updated weights for policy 0, policy_version 77830 (0.0008) +[2023-10-14 08:20:40,015][100936] Updated weights for policy 0, policy_version 77840 (0.0008) +[2023-10-14 08:20:40,392][100936] Updated weights for policy 0, policy_version 77850 (0.0010) +[2023-10-14 08:20:42,221][100917] Updated weights for policy 1, policy_version 77962 (0.0009) +[2023-10-14 08:20:42,594][100917] Updated weights for policy 1, policy_version 77972 (0.0009) +[2023-10-14 08:20:42,969][100917] Updated weights for policy 1, policy_version 77982 (0.0009) +[2023-10-14 08:20:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 159580160. Throughput: 0: 1658.0, 1: 1653.2. Samples: 39898064. Policy #0 lag: (min: 31.0, avg: 53.2, max: 63.0) +[2023-10-14 08:20:43,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:20:44,788][100936] Updated weights for policy 0, policy_version 77860 (0.0009) +[2023-10-14 08:20:45,163][100936] Updated weights for policy 0, policy_version 77870 (0.0007) +[2023-10-14 08:20:45,540][100936] Updated weights for policy 0, policy_version 77880 (0.0007) +[2023-10-14 08:20:47,061][100917] Updated weights for policy 1, policy_version 77992 (0.0010) +[2023-10-14 08:20:47,424][100917] Updated weights for policy 1, policy_version 78002 (0.0009) +[2023-10-14 08:20:47,794][100917] Updated weights for policy 1, policy_version 78012 (0.0010) +[2023-10-14 08:20:48,512][99942] Fps is (10 sec: 13107.8, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 159645696. Throughput: 0: 1659.8, 1: 1646.7. Samples: 39918262. Policy #0 lag: (min: 31.0, avg: 53.2, max: 63.0) +[2023-10-14 08:20:48,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:20:49,582][100936] Updated weights for policy 0, policy_version 77890 (0.0009) +[2023-10-14 08:20:49,954][100936] Updated weights for policy 0, policy_version 77900 (0.0008) +[2023-10-14 08:20:50,330][100936] Updated weights for policy 0, policy_version 77910 (0.0011) +[2023-10-14 08:20:50,709][100936] Updated weights for policy 0, policy_version 77920 (0.0007) +[2023-10-14 08:20:52,047][100917] Updated weights for policy 1, policy_version 78022 (0.0009) +[2023-10-14 08:20:52,417][100917] Updated weights for policy 1, policy_version 78032 (0.0010) +[2023-10-14 08:20:52,795][100917] Updated weights for policy 1, policy_version 78042 (0.0009) +[2023-10-14 08:20:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 159711232. Throughput: 0: 1662.2, 1: 1644.0. Samples: 39937634. Policy #0 lag: (min: 31.0, avg: 53.2, max: 63.0) +[2023-10-14 08:20:53,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:20:54,923][100936] Updated weights for policy 0, policy_version 77930 (0.0007) +[2023-10-14 08:20:55,296][100936] Updated weights for policy 0, policy_version 77940 (0.0008) +[2023-10-14 08:20:55,670][100936] Updated weights for policy 0, policy_version 77950 (0.0010) +[2023-10-14 08:20:56,705][100917] Updated weights for policy 1, policy_version 78052 (0.0009) +[2023-10-14 08:20:57,075][100917] Updated weights for policy 1, policy_version 78062 (0.0008) +[2023-10-14 08:20:57,449][100917] Updated weights for policy 1, policy_version 78072 (0.0008) +[2023-10-14 08:20:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 159776768. Throughput: 0: 1662.9, 1: 1658.6. Samples: 39947718. Policy #0 lag: (min: 31.0, avg: 53.2, max: 63.0) +[2023-10-14 08:20:58,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:20:59,641][100936] Updated weights for policy 0, policy_version 77960 (0.0008) +[2023-10-14 08:21:00,014][100936] Updated weights for policy 0, policy_version 77970 (0.0008) +[2023-10-14 08:21:00,389][100936] Updated weights for policy 0, policy_version 77980 (0.0007) +[2023-10-14 08:21:01,498][100917] Updated weights for policy 1, policy_version 78082 (0.0007) +[2023-10-14 08:21:01,873][100917] Updated weights for policy 1, policy_version 78092 (0.0008) +[2023-10-14 08:21:02,245][100917] Updated weights for policy 1, policy_version 78102 (0.0007) +[2023-10-14 08:21:02,636][100917] Updated weights for policy 1, policy_version 78112 (0.0009) +[2023-10-14 08:21:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 159842304. Throughput: 0: 1669.3, 1: 1651.2. Samples: 39967946. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:21:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:21:04,544][100936] Updated weights for policy 0, policy_version 77990 (0.0010) +[2023-10-14 08:21:04,912][100936] Updated weights for policy 0, policy_version 78000 (0.0008) +[2023-10-14 08:21:05,287][100936] Updated weights for policy 0, policy_version 78010 (0.0008) +[2023-10-14 08:21:06,732][100917] Updated weights for policy 1, policy_version 78122 (0.0009) +[2023-10-14 08:21:07,108][100917] Updated weights for policy 1, policy_version 78132 (0.0007) +[2023-10-14 08:21:07,473][100917] Updated weights for policy 1, policy_version 78142 (0.0008) +[2023-10-14 08:21:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 159907840. Throughput: 0: 1671.2, 1: 1652.3. Samples: 39987566. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:21:08,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:21:09,401][100936] Updated weights for policy 0, policy_version 78020 (0.0007) +[2023-10-14 08:21:09,765][100936] Updated weights for policy 0, policy_version 78030 (0.0008) +[2023-10-14 08:21:10,139][100936] Updated weights for policy 0, policy_version 78040 (0.0009) +[2023-10-14 08:21:11,570][100917] Updated weights for policy 1, policy_version 78152 (0.0010) +[2023-10-14 08:21:11,952][100917] Updated weights for policy 1, policy_version 78162 (0.0010) +[2023-10-14 08:21:12,332][100917] Updated weights for policy 1, policy_version 78172 (0.0008) +[2023-10-14 08:21:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 159973376. Throughput: 0: 1672.5, 1: 1657.2. Samples: 39997912. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:21:13,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:21:14,108][100936] Updated weights for policy 0, policy_version 78050 (0.0009) +[2023-10-14 08:21:14,473][100936] Updated weights for policy 0, policy_version 78060 (0.0009) +[2023-10-14 08:21:14,845][100936] Updated weights for policy 0, policy_version 78070 (0.0008) +[2023-10-14 08:21:15,213][100936] Updated weights for policy 0, policy_version 78080 (0.0008) +[2023-10-14 08:21:16,586][100917] Updated weights for policy 1, policy_version 78182 (0.0010) +[2023-10-14 08:21:16,946][100917] Updated weights for policy 1, policy_version 78192 (0.0007) +[2023-10-14 08:21:17,319][100917] Updated weights for policy 1, policy_version 78202 (0.0010) +[2023-10-14 08:21:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 160038912. Throughput: 0: 1675.6, 1: 1643.8. Samples: 40017922. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:21:18,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:21:19,169][100936] Updated weights for policy 0, policy_version 78090 (0.0007) +[2023-10-14 08:21:19,529][100936] Updated weights for policy 0, policy_version 78100 (0.0007) +[2023-10-14 08:21:19,905][100936] Updated weights for policy 0, policy_version 78110 (0.0009) +[2023-10-14 08:21:21,458][100917] Updated weights for policy 1, policy_version 78212 (0.0009) +[2023-10-14 08:21:21,839][100917] Updated weights for policy 1, policy_version 78222 (0.0009) +[2023-10-14 08:21:22,207][100917] Updated weights for policy 1, policy_version 78232 (0.0009) +[2023-10-14 08:21:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 160104448. Throughput: 0: 1669.8, 1: 1655.3. Samples: 40037668. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:21:23,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:21:23,945][100936] Updated weights for policy 0, policy_version 78120 (0.0008) +[2023-10-14 08:21:24,319][100936] Updated weights for policy 0, policy_version 78130 (0.0009) +[2023-10-14 08:21:24,681][100936] Updated weights for policy 0, policy_version 78140 (0.0009) +[2023-10-14 08:21:26,300][100917] Updated weights for policy 1, policy_version 78242 (0.0009) +[2023-10-14 08:21:26,664][100917] Updated weights for policy 1, policy_version 78252 (0.0008) +[2023-10-14 08:21:27,040][100917] Updated weights for policy 1, policy_version 78262 (0.0008) +[2023-10-14 08:21:27,402][100917] Updated weights for policy 1, policy_version 78272 (0.0007) +[2023-10-14 08:21:28,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 160169984. Throughput: 0: 1675.7, 1: 1661.9. Samples: 40048256. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:21:28,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:21:28,612][100936] Updated weights for policy 0, policy_version 78150 (0.0011) +[2023-10-14 08:21:28,987][100936] Updated weights for policy 0, policy_version 78160 (0.0007) +[2023-10-14 08:21:29,359][100936] Updated weights for policy 0, policy_version 78170 (0.0007) +[2023-10-14 08:21:31,357][100917] Updated weights for policy 1, policy_version 78282 (0.0010) +[2023-10-14 08:21:31,731][100917] Updated weights for policy 1, policy_version 78292 (0.0008) +[2023-10-14 08:21:32,108][100917] Updated weights for policy 1, policy_version 78302 (0.0009) +[2023-10-14 08:21:33,433][100936] Updated weights for policy 0, policy_version 78180 (0.0009) +[2023-10-14 08:21:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 160235520. Throughput: 0: 1676.6, 1: 1650.2. Samples: 40067966. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:21:33,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:21:33,794][100936] Updated weights for policy 0, policy_version 78190 (0.0009) +[2023-10-14 08:21:34,165][100936] Updated weights for policy 0, policy_version 78200 (0.0010) +[2023-10-14 08:21:36,174][100917] Updated weights for policy 1, policy_version 78312 (0.0009) +[2023-10-14 08:21:36,543][100917] Updated weights for policy 1, policy_version 78322 (0.0011) +[2023-10-14 08:21:36,912][100917] Updated weights for policy 1, policy_version 78332 (0.0007) +[2023-10-14 08:21:38,237][100936] Updated weights for policy 0, policy_version 78210 (0.0009) +[2023-10-14 08:21:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 160301056. Throughput: 0: 1678.1, 1: 1673.0. Samples: 40088436. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:21:38,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:21:38,526][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000078336_80216064.pth... +[2023-10-14 08:21:38,559][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000076768_78610432.pth +[2023-10-14 08:21:38,605][100936] Updated weights for policy 0, policy_version 78220 (0.0010) +[2023-10-14 08:21:38,962][100936] Updated weights for policy 0, policy_version 78230 (0.0009) +[2023-10-14 08:21:39,332][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000078240_80117760.pth... +[2023-10-14 08:21:39,332][100936] Updated weights for policy 0, policy_version 78240 (0.0009) +[2023-10-14 08:21:39,370][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000076672_78512128.pth +[2023-10-14 08:21:40,989][100917] Updated weights for policy 1, policy_version 78342 (0.0009) +[2023-10-14 08:21:41,362][100917] Updated weights for policy 1, policy_version 78352 (0.0009) +[2023-10-14 08:21:41,743][100917] Updated weights for policy 1, policy_version 78362 (0.0011) +[2023-10-14 08:21:43,440][100936] Updated weights for policy 0, policy_version 78250 (0.0007) +[2023-10-14 08:21:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 160366592. Throughput: 0: 1687.6, 1: 1663.7. Samples: 40098528. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:21:43,512][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:21:43,807][100936] Updated weights for policy 0, policy_version 78260 (0.0009) +[2023-10-14 08:21:44,175][100936] Updated weights for policy 0, policy_version 78270 (0.0010) +[2023-10-14 08:21:45,741][100917] Updated weights for policy 1, policy_version 78372 (0.0008) +[2023-10-14 08:21:46,111][100917] Updated weights for policy 1, policy_version 78382 (0.0008) +[2023-10-14 08:21:46,480][100917] Updated weights for policy 1, policy_version 78392 (0.0007) +[2023-10-14 08:21:48,328][100936] Updated weights for policy 0, policy_version 78280 (0.0008) +[2023-10-14 08:21:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 160432128. Throughput: 0: 1684.4, 1: 1651.7. Samples: 40118070. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:21:48,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:21:48,692][100936] Updated weights for policy 0, policy_version 78290 (0.0008) +[2023-10-14 08:21:49,072][100936] Updated weights for policy 0, policy_version 78300 (0.0008) +[2023-10-14 08:21:50,671][100917] Updated weights for policy 1, policy_version 78402 (0.0007) +[2023-10-14 08:21:51,043][100917] Updated weights for policy 1, policy_version 78412 (0.0009) +[2023-10-14 08:21:51,417][100917] Updated weights for policy 1, policy_version 78422 (0.0008) +[2023-10-14 08:21:51,787][100917] Updated weights for policy 1, policy_version 78432 (0.0010) +[2023-10-14 08:21:53,058][100936] Updated weights for policy 0, policy_version 78310 (0.0009) +[2023-10-14 08:21:53,423][100936] Updated weights for policy 0, policy_version 78320 (0.0007) +[2023-10-14 08:21:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 160497664. Throughput: 0: 1670.2, 1: 1670.6. Samples: 40137902. Policy #0 lag: (min: 5.0, avg: 8.1, max: 37.0) +[2023-10-14 08:21:53,512][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:21:53,808][100936] Updated weights for policy 0, policy_version 78330 (0.0008) +[2023-10-14 08:21:56,055][100917] Updated weights for policy 1, policy_version 78442 (0.0009) +[2023-10-14 08:21:56,434][100917] Updated weights for policy 1, policy_version 78452 (0.0009) +[2023-10-14 08:21:56,811][100917] Updated weights for policy 1, policy_version 78462 (0.0008) +[2023-10-14 08:21:58,044][100936] Updated weights for policy 0, policy_version 78340 (0.0009) +[2023-10-14 08:21:58,421][100936] Updated weights for policy 0, policy_version 78350 (0.0007) +[2023-10-14 08:21:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 160563200. Throughput: 0: 1680.7, 1: 1661.3. Samples: 40148300. Policy #0 lag: (min: 5.0, avg: 8.1, max: 37.0) +[2023-10-14 08:21:58,512][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:21:58,793][100936] Updated weights for policy 0, policy_version 78360 (0.0009) +[2023-10-14 08:22:00,963][100917] Updated weights for policy 1, policy_version 78472 (0.0009) +[2023-10-14 08:22:01,342][100917] Updated weights for policy 1, policy_version 78482 (0.0009) +[2023-10-14 08:22:01,717][100917] Updated weights for policy 1, policy_version 78492 (0.0011) +[2023-10-14 08:22:03,131][100936] Updated weights for policy 0, policy_version 78370 (0.0008) +[2023-10-14 08:22:03,494][100936] Updated weights for policy 0, policy_version 78380 (0.0008) +[2023-10-14 08:22:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 160628736. Throughput: 0: 1670.2, 1: 1651.4. Samples: 40167394. Policy #0 lag: (min: 5.0, avg: 8.1, max: 37.0) +[2023-10-14 08:22:03,512][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:22:03,869][100936] Updated weights for policy 0, policy_version 78390 (0.0009) +[2023-10-14 08:22:04,234][100936] Updated weights for policy 0, policy_version 78400 (0.0011) +[2023-10-14 08:22:05,850][100917] Updated weights for policy 1, policy_version 78502 (0.0009) +[2023-10-14 08:22:06,216][100917] Updated weights for policy 1, policy_version 78512 (0.0010) +[2023-10-14 08:22:06,587][100917] Updated weights for policy 1, policy_version 78522 (0.0008) +[2023-10-14 08:22:08,312][100936] Updated weights for policy 0, policy_version 78410 (0.0009) +[2023-10-14 08:22:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 160694272. Throughput: 0: 1660.9, 1: 1670.2. Samples: 40187568. Policy #0 lag: (min: 5.0, avg: 8.1, max: 37.0) +[2023-10-14 08:22:08,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:22:08,677][100936] Updated weights for policy 0, policy_version 78420 (0.0008) +[2023-10-14 08:22:09,054][100936] Updated weights for policy 0, policy_version 78430 (0.0008) +[2023-10-14 08:22:10,747][100917] Updated weights for policy 1, policy_version 78532 (0.0009) +[2023-10-14 08:22:11,129][100917] Updated weights for policy 1, policy_version 78542 (0.0008) +[2023-10-14 08:22:11,502][100917] Updated weights for policy 1, policy_version 78552 (0.0009) +[2023-10-14 08:22:13,249][100936] Updated weights for policy 0, policy_version 78440 (0.0009) +[2023-10-14 08:22:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 160759808. Throughput: 0: 1662.7, 1: 1658.3. Samples: 40197700. Policy #0 lag: (min: 5.0, avg: 8.1, max: 37.0) +[2023-10-14 08:22:13,512][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:22:13,628][100936] Updated weights for policy 0, policy_version 78450 (0.0009) +[2023-10-14 08:22:13,995][100936] Updated weights for policy 0, policy_version 78460 (0.0008) +[2023-10-14 08:22:15,468][100917] Updated weights for policy 1, policy_version 78562 (0.0008) +[2023-10-14 08:22:15,843][100917] Updated weights for policy 1, policy_version 78572 (0.0007) +[2023-10-14 08:22:16,214][100917] Updated weights for policy 1, policy_version 78582 (0.0008) +[2023-10-14 08:22:16,585][100917] Updated weights for policy 1, policy_version 78592 (0.0007) +[2023-10-14 08:22:18,125][100936] Updated weights for policy 0, policy_version 78470 (0.0009) +[2023-10-14 08:22:18,492][100936] Updated weights for policy 0, policy_version 78480 (0.0007) +[2023-10-14 08:22:18,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 160825344. Throughput: 0: 1661.4, 1: 1660.6. Samples: 40217454. Policy #0 lag: (min: 5.0, avg: 8.1, max: 37.0) +[2023-10-14 08:22:18,512][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:22:18,863][100936] Updated weights for policy 0, policy_version 78490 (0.0007) +[2023-10-14 08:22:20,608][100917] Updated weights for policy 1, policy_version 78602 (0.0007) +[2023-10-14 08:22:20,983][100917] Updated weights for policy 1, policy_version 78612 (0.0007) +[2023-10-14 08:22:21,347][100917] Updated weights for policy 1, policy_version 78622 (0.0008) +[2023-10-14 08:22:22,990][100936] Updated weights for policy 0, policy_version 78500 (0.0008) +[2023-10-14 08:22:23,367][100936] Updated weights for policy 0, policy_version 78510 (0.0009) +[2023-10-14 08:22:23,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 160890880. Throughput: 0: 1645.1, 1: 1669.4. Samples: 40237590. Policy #0 lag: (min: 5.0, avg: 8.1, max: 37.0) +[2023-10-14 08:22:23,514][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:22:23,730][100936] Updated weights for policy 0, policy_version 78520 (0.0009) +[2023-10-14 08:22:25,501][100917] Updated weights for policy 1, policy_version 78632 (0.0011) +[2023-10-14 08:22:25,887][100917] Updated weights for policy 1, policy_version 78642 (0.0009) +[2023-10-14 08:22:26,255][100917] Updated weights for policy 1, policy_version 78652 (0.0007) +[2023-10-14 08:22:27,992][100936] Updated weights for policy 0, policy_version 78530 (0.0007) +[2023-10-14 08:22:28,408][100936] Updated weights for policy 0, policy_version 78540 (0.0009) +[2023-10-14 08:22:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 160956416. Throughput: 0: 1650.9, 1: 1657.6. Samples: 40247412. Policy #0 lag: (min: 5.0, avg: 8.1, max: 37.0) +[2023-10-14 08:22:28,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:22:28,775][100936] Updated weights for policy 0, policy_version 78550 (0.0007) +[2023-10-14 08:22:29,138][100936] Updated weights for policy 0, policy_version 78560 (0.0007) +[2023-10-14 08:22:30,261][100917] Updated weights for policy 1, policy_version 78662 (0.0009) +[2023-10-14 08:22:30,645][100917] Updated weights for policy 1, policy_version 78672 (0.0011) +[2023-10-14 08:22:31,020][100917] Updated weights for policy 1, policy_version 78682 (0.0011) +[2023-10-14 08:22:33,296][100936] Updated weights for policy 0, policy_version 78570 (0.0007) +[2023-10-14 08:22:33,512][99942] Fps is (10 sec: 13107.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 161021952. Throughput: 0: 1647.6, 1: 1669.7. Samples: 40267346. Policy #0 lag: (min: 5.0, avg: 8.1, max: 37.0) +[2023-10-14 08:22:33,512][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:22:33,659][100936] Updated weights for policy 0, policy_version 78580 (0.0009) +[2023-10-14 08:22:34,025][100936] Updated weights for policy 0, policy_version 78590 (0.0011) +[2023-10-14 08:22:35,040][100917] Updated weights for policy 1, policy_version 78692 (0.0010) +[2023-10-14 08:22:35,410][100917] Updated weights for policy 1, policy_version 78702 (0.0008) +[2023-10-14 08:22:35,770][100917] Updated weights for policy 1, policy_version 78712 (0.0008) +[2023-10-14 08:22:38,106][100936] Updated weights for policy 0, policy_version 78600 (0.0009) +[2023-10-14 08:22:38,468][100936] Updated weights for policy 0, policy_version 78610 (0.0008) +[2023-10-14 08:22:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 161087488. Throughput: 0: 1651.5, 1: 1667.8. Samples: 40287272. Policy #0 lag: (min: 5.0, avg: 8.1, max: 37.0) +[2023-10-14 08:22:38,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:22:38,844][100936] Updated weights for policy 0, policy_version 78620 (0.0008) +[2023-10-14 08:22:39,911][100917] Updated weights for policy 1, policy_version 78722 (0.0008) +[2023-10-14 08:22:40,281][100917] Updated weights for policy 1, policy_version 78732 (0.0010) +[2023-10-14 08:22:40,658][100917] Updated weights for policy 1, policy_version 78742 (0.0009) +[2023-10-14 08:22:41,019][100917] Updated weights for policy 1, policy_version 78752 (0.0009) +[2023-10-14 08:22:42,900][100936] Updated weights for policy 0, policy_version 78630 (0.0011) +[2023-10-14 08:22:43,273][100936] Updated weights for policy 0, policy_version 78640 (0.0008) +[2023-10-14 08:22:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 161153024. Throughput: 0: 1652.8, 1: 1650.0. Samples: 40296924. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:22:43,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:22:43,645][100936] Updated weights for policy 0, policy_version 78650 (0.0009) +[2023-10-14 08:22:45,156][100917] Updated weights for policy 1, policy_version 78762 (0.0007) +[2023-10-14 08:22:45,531][100917] Updated weights for policy 1, policy_version 78772 (0.0007) +[2023-10-14 08:22:45,913][100917] Updated weights for policy 1, policy_version 78782 (0.0009) +[2023-10-14 08:22:47,840][100936] Updated weights for policy 0, policy_version 78660 (0.0008) +[2023-10-14 08:22:48,216][100936] Updated weights for policy 0, policy_version 78670 (0.0008) +[2023-10-14 08:22:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 161218560. Throughput: 0: 1655.5, 1: 1678.2. Samples: 40317414. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:22:48,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:22:48,583][100936] Updated weights for policy 0, policy_version 78680 (0.0010) +[2023-10-14 08:22:50,177][100917] Updated weights for policy 1, policy_version 78792 (0.0008) +[2023-10-14 08:22:50,551][100917] Updated weights for policy 1, policy_version 78802 (0.0007) +[2023-10-14 08:22:50,931][100917] Updated weights for policy 1, policy_version 78812 (0.0007) +[2023-10-14 08:22:52,543][100936] Updated weights for policy 0, policy_version 78690 (0.0009) +[2023-10-14 08:22:52,915][100936] Updated weights for policy 0, policy_version 78700 (0.0007) +[2023-10-14 08:22:53,291][100936] Updated weights for policy 0, policy_version 78710 (0.0008) +[2023-10-14 08:22:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 161284096. Throughput: 0: 1647.1, 1: 1666.5. Samples: 40336680. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:22:53,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:22:53,658][100936] Updated weights for policy 0, policy_version 78720 (0.0009) +[2023-10-14 08:22:55,079][100917] Updated weights for policy 1, policy_version 78822 (0.0008) +[2023-10-14 08:22:55,451][100917] Updated weights for policy 1, policy_version 78832 (0.0009) +[2023-10-14 08:22:55,817][100917] Updated weights for policy 1, policy_version 78842 (0.0008) +[2023-10-14 08:22:57,798][100936] Updated weights for policy 0, policy_version 78730 (0.0008) +[2023-10-14 08:22:58,168][100936] Updated weights for policy 0, policy_version 78740 (0.0009) +[2023-10-14 08:22:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 161349632. Throughput: 0: 1662.2, 1: 1649.3. Samples: 40346718. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:22:58,513][99942] Avg episode reward: [(0, '0.670'), (1, '1.000')] +[2023-10-14 08:22:58,527][100936] Updated weights for policy 0, policy_version 78750 (0.0009) +[2023-10-14 08:22:59,824][100917] Updated weights for policy 1, policy_version 78852 (0.0009) +[2023-10-14 08:23:00,200][100917] Updated weights for policy 1, policy_version 78862 (0.0008) +[2023-10-14 08:23:00,590][100917] Updated weights for policy 1, policy_version 78872 (0.0009) +[2023-10-14 08:23:02,756][100936] Updated weights for policy 0, policy_version 78760 (0.0008) +[2023-10-14 08:23:03,124][100936] Updated weights for policy 0, policy_version 78770 (0.0009) +[2023-10-14 08:23:03,505][100936] Updated weights for policy 0, policy_version 78780 (0.0008) +[2023-10-14 08:23:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 161415168. Throughput: 0: 1656.7, 1: 1667.6. Samples: 40367048. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:23:03,513][99942] Avg episode reward: [(0, '0.620'), (1, '1.000')] +[2023-10-14 08:23:04,637][100917] Updated weights for policy 1, policy_version 78882 (0.0007) +[2023-10-14 08:23:05,012][100917] Updated weights for policy 1, policy_version 78892 (0.0010) +[2023-10-14 08:23:05,391][100917] Updated weights for policy 1, policy_version 78902 (0.0008) +[2023-10-14 08:23:05,771][100917] Updated weights for policy 1, policy_version 78912 (0.0009) +[2023-10-14 08:23:07,609][100936] Updated weights for policy 0, policy_version 78790 (0.0010) +[2023-10-14 08:23:07,978][100936] Updated weights for policy 0, policy_version 78800 (0.0010) +[2023-10-14 08:23:08,362][100936] Updated weights for policy 0, policy_version 78810 (0.0010) +[2023-10-14 08:23:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 161480704. Throughput: 0: 1648.3, 1: 1660.8. Samples: 40386500. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:23:08,513][99942] Avg episode reward: [(0, '0.620'), (1, '1.000')] +[2023-10-14 08:23:09,906][100917] Updated weights for policy 1, policy_version 78922 (0.0008) +[2023-10-14 08:23:10,278][100917] Updated weights for policy 1, policy_version 78932 (0.0007) +[2023-10-14 08:23:10,652][100917] Updated weights for policy 1, policy_version 78942 (0.0011) +[2023-10-14 08:23:12,519][100936] Updated weights for policy 0, policy_version 78820 (0.0007) +[2023-10-14 08:23:12,885][100936] Updated weights for policy 0, policy_version 78830 (0.0007) +[2023-10-14 08:23:13,248][100936] Updated weights for policy 0, policy_version 78840 (0.0008) +[2023-10-14 08:23:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 161546240. Throughput: 0: 1659.2, 1: 1650.0. Samples: 40396322. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:23:13,512][99942] Avg episode reward: [(0, '0.460'), (1, '1.000')] +[2023-10-14 08:23:14,628][100917] Updated weights for policy 1, policy_version 78952 (0.0008) +[2023-10-14 08:23:14,998][100917] Updated weights for policy 1, policy_version 78962 (0.0009) +[2023-10-14 08:23:15,368][100917] Updated weights for policy 1, policy_version 78972 (0.0008) +[2023-10-14 08:23:17,210][100936] Updated weights for policy 0, policy_version 78850 (0.0009) +[2023-10-14 08:23:17,622][100936] Updated weights for policy 0, policy_version 78860 (0.0009) +[2023-10-14 08:23:17,983][100936] Updated weights for policy 0, policy_version 78870 (0.0008) +[2023-10-14 08:23:18,350][100936] Updated weights for policy 0, policy_version 78880 (0.0007) +[2023-10-14 08:23:18,512][99942] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 161644544. Throughput: 0: 1660.0, 1: 1659.5. Samples: 40416726. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:23:18,512][99942] Avg episode reward: [(0, '0.470'), (1, '1.000')] +[2023-10-14 08:23:19,528][100917] Updated weights for policy 1, policy_version 78982 (0.0009) +[2023-10-14 08:23:19,891][100917] Updated weights for policy 1, policy_version 78992 (0.0008) +[2023-10-14 08:23:20,266][100917] Updated weights for policy 1, policy_version 79002 (0.0008) +[2023-10-14 08:23:22,507][100936] Updated weights for policy 0, policy_version 78890 (0.0007) +[2023-10-14 08:23:22,874][100936] Updated weights for policy 0, policy_version 78900 (0.0009) +[2023-10-14 08:23:23,240][100936] Updated weights for policy 0, policy_version 78910 (0.0008) +[2023-10-14 08:23:23,512][99942] Fps is (10 sec: 16383.8, 60 sec: 13653.4, 300 sec: 13329.3). Total num frames: 161710080. Throughput: 0: 1644.0, 1: 1666.3. Samples: 40436234. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:23:23,513][99942] Avg episode reward: [(0, '0.470'), (1, '1.000')] +[2023-10-14 08:23:24,311][100917] Updated weights for policy 1, policy_version 79012 (0.0007) +[2023-10-14 08:23:24,684][100917] Updated weights for policy 1, policy_version 79022 (0.0008) +[2023-10-14 08:23:25,059][100917] Updated weights for policy 1, policy_version 79032 (0.0009) +[2023-10-14 08:23:27,631][100936] Updated weights for policy 0, policy_version 78920 (0.0009) +[2023-10-14 08:23:28,000][100936] Updated weights for policy 0, policy_version 78930 (0.0007) +[2023-10-14 08:23:28,380][100936] Updated weights for policy 0, policy_version 78940 (0.0008) +[2023-10-14 08:23:28,512][99942] Fps is (10 sec: 9830.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 161742848. Throughput: 0: 1657.0, 1: 1661.6. Samples: 40446264. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:23:28,512][99942] Avg episode reward: [(0, '0.470'), (1, '1.000')] +[2023-10-14 08:23:29,222][100917] Updated weights for policy 1, policy_version 79042 (0.0008) +[2023-10-14 08:23:29,583][100917] Updated weights for policy 1, policy_version 79052 (0.0008) +[2023-10-14 08:23:29,965][100917] Updated weights for policy 1, policy_version 79062 (0.0011) +[2023-10-14 08:23:30,325][100917] Updated weights for policy 1, policy_version 79072 (0.0011) +[2023-10-14 08:23:32,658][100936] Updated weights for policy 0, policy_version 78950 (0.0009) +[2023-10-14 08:23:33,031][100936] Updated weights for policy 0, policy_version 78960 (0.0009) +[2023-10-14 08:23:33,395][100936] Updated weights for policy 0, policy_version 78970 (0.0008) +[2023-10-14 08:23:33,512][99942] Fps is (10 sec: 9830.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 161808384. Throughput: 0: 1656.6, 1: 1658.9. Samples: 40466614. Policy #0 lag: (min: 1.0, avg: 7.0, max: 33.0) +[2023-10-14 08:23:33,512][99942] Avg episode reward: [(0, '0.470'), (1, '1.000')] +[2023-10-14 08:23:34,548][100917] Updated weights for policy 1, policy_version 79082 (0.0007) +[2023-10-14 08:23:34,926][100917] Updated weights for policy 1, policy_version 79092 (0.0007) +[2023-10-14 08:23:35,295][100917] Updated weights for policy 1, policy_version 79102 (0.0008) +[2023-10-14 08:23:37,304][100936] Updated weights for policy 0, policy_version 78980 (0.0008) +[2023-10-14 08:23:37,670][100936] Updated weights for policy 0, policy_version 78990 (0.0007) +[2023-10-14 08:23:38,051][100936] Updated weights for policy 0, policy_version 79000 (0.0007) +[2023-10-14 08:23:38,512][99942] Fps is (10 sec: 16383.5, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 161906688. Throughput: 0: 1650.4, 1: 1664.1. Samples: 40485836. Policy #0 lag: (min: 1.0, avg: 7.0, max: 33.0) +[2023-10-14 08:23:38,513][99942] Avg episode reward: [(0, '0.470'), (1, '1.000')] +[2023-10-14 08:23:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000079008_80904192.pth... +[2023-10-14 08:23:38,522][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000079104_81002496.pth... +[2023-10-14 08:23:38,557][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000077440_79298560.pth +[2023-10-14 08:23:38,558][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000077568_79429632.pth +[2023-10-14 08:23:39,550][100917] Updated weights for policy 1, policy_version 79112 (0.0009) +[2023-10-14 08:23:39,918][100917] Updated weights for policy 1, policy_version 79122 (0.0009) +[2023-10-14 08:23:40,299][100917] Updated weights for policy 1, policy_version 79132 (0.0010) +[2023-10-14 08:23:42,014][100936] Updated weights for policy 0, policy_version 79010 (0.0008) +[2023-10-14 08:23:42,390][100936] Updated weights for policy 0, policy_version 79020 (0.0010) +[2023-10-14 08:23:42,765][100936] Updated weights for policy 0, policy_version 79030 (0.0011) +[2023-10-14 08:23:43,132][100936] Updated weights for policy 0, policy_version 79040 (0.0007) +[2023-10-14 08:23:43,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 161972224. Throughput: 0: 1658.3, 1: 1662.4. Samples: 40496146. Policy #0 lag: (min: 1.0, avg: 7.0, max: 33.0) +[2023-10-14 08:23:43,513][99942] Avg episode reward: [(0, '0.470'), (1, '1.000')] +[2023-10-14 08:23:44,511][100917] Updated weights for policy 1, policy_version 79142 (0.0009) +[2023-10-14 08:23:44,888][100917] Updated weights for policy 1, policy_version 79152 (0.0007) +[2023-10-14 08:23:45,256][100917] Updated weights for policy 1, policy_version 79162 (0.0009) +[2023-10-14 08:23:47,302][100936] Updated weights for policy 0, policy_version 79050 (0.0007) +[2023-10-14 08:23:47,670][100936] Updated weights for policy 0, policy_version 79060 (0.0008) +[2023-10-14 08:23:48,050][100936] Updated weights for policy 0, policy_version 79070 (0.0008) +[2023-10-14 08:23:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 162037760. Throughput: 0: 1648.2, 1: 1664.0. Samples: 40516096. Policy #0 lag: (min: 1.0, avg: 7.0, max: 33.0) +[2023-10-14 08:23:48,512][99942] Avg episode reward: [(0, '0.470'), (1, '1.000')] +[2023-10-14 08:23:49,317][100917] Updated weights for policy 1, policy_version 79172 (0.0010) +[2023-10-14 08:23:49,688][100917] Updated weights for policy 1, policy_version 79182 (0.0008) +[2023-10-14 08:23:50,064][100917] Updated weights for policy 1, policy_version 79192 (0.0009) +[2023-10-14 08:23:52,238][100936] Updated weights for policy 0, policy_version 79080 (0.0009) +[2023-10-14 08:23:52,608][100936] Updated weights for policy 0, policy_version 79090 (0.0010) +[2023-10-14 08:23:52,981][100936] Updated weights for policy 0, policy_version 79100 (0.0010) +[2023-10-14 08:23:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 162103296. Throughput: 0: 1654.1, 1: 1666.9. Samples: 40535946. Policy #0 lag: (min: 1.0, avg: 7.0, max: 33.0) +[2023-10-14 08:23:53,513][99942] Avg episode reward: [(0, '0.470'), (1, '1.000')] +[2023-10-14 08:23:53,800][100917] Updated weights for policy 1, policy_version 79202 (0.0007) +[2023-10-14 08:23:54,178][100917] Updated weights for policy 1, policy_version 79212 (0.0007) +[2023-10-14 08:23:54,545][100917] Updated weights for policy 1, policy_version 79222 (0.0008) +[2023-10-14 08:23:54,906][100917] Updated weights for policy 1, policy_version 79232 (0.0007) +[2023-10-14 08:23:57,115][100936] Updated weights for policy 0, policy_version 79110 (0.0010) +[2023-10-14 08:23:57,482][100936] Updated weights for policy 0, policy_version 79120 (0.0010) +[2023-10-14 08:23:57,858][100936] Updated weights for policy 0, policy_version 79130 (0.0008) +[2023-10-14 08:23:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 162168832. Throughput: 0: 1662.7, 1: 1673.0. Samples: 40546428. Policy #0 lag: (min: 1.0, avg: 7.0, max: 33.0) +[2023-10-14 08:23:58,513][99942] Avg episode reward: [(0, '0.470'), (1, '1.000')] +[2023-10-14 08:23:59,027][100917] Updated weights for policy 1, policy_version 79242 (0.0008) +[2023-10-14 08:23:59,400][100917] Updated weights for policy 1, policy_version 79252 (0.0010) +[2023-10-14 08:23:59,777][100917] Updated weights for policy 1, policy_version 79262 (0.0010) +[2023-10-14 08:24:02,174][100936] Updated weights for policy 0, policy_version 79140 (0.0007) +[2023-10-14 08:24:02,570][100936] Updated weights for policy 0, policy_version 79150 (0.0009) +[2023-10-14 08:24:02,944][100936] Updated weights for policy 0, policy_version 79160 (0.0009) +[2023-10-14 08:24:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 162234368. Throughput: 0: 1651.3, 1: 1672.9. Samples: 40566316. Policy #0 lag: (min: 1.0, avg: 7.0, max: 33.0) +[2023-10-14 08:24:03,513][99942] Avg episode reward: [(0, '0.470'), (1, '1.000')] +[2023-10-14 08:24:03,911][100917] Updated weights for policy 1, policy_version 79272 (0.0010) +[2023-10-14 08:24:04,290][100917] Updated weights for policy 1, policy_version 79282 (0.0009) +[2023-10-14 08:24:04,658][100917] Updated weights for policy 1, policy_version 79292 (0.0007) +[2023-10-14 08:24:07,030][100936] Updated weights for policy 0, policy_version 79170 (0.0008) +[2023-10-14 08:24:07,403][100936] Updated weights for policy 0, policy_version 79180 (0.0008) +[2023-10-14 08:24:07,769][100936] Updated weights for policy 0, policy_version 79190 (0.0007) +[2023-10-14 08:24:08,139][100936] Updated weights for policy 0, policy_version 79200 (0.0008) +[2023-10-14 08:24:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 162299904. Throughput: 0: 1653.4, 1: 1671.4. Samples: 40585852. Policy #0 lag: (min: 1.0, avg: 7.0, max: 33.0) +[2023-10-14 08:24:08,513][99942] Avg episode reward: [(0, '0.470'), (1, '1.000')] +[2023-10-14 08:24:08,594][100917] Updated weights for policy 1, policy_version 79302 (0.0007) +[2023-10-14 08:24:08,978][100917] Updated weights for policy 1, policy_version 79312 (0.0007) +[2023-10-14 08:24:09,360][100917] Updated weights for policy 1, policy_version 79322 (0.0008) +[2023-10-14 08:24:12,366][100936] Updated weights for policy 0, policy_version 79210 (0.0009) +[2023-10-14 08:24:12,732][100936] Updated weights for policy 0, policy_version 79220 (0.0009) +[2023-10-14 08:24:13,097][100936] Updated weights for policy 0, policy_version 79230 (0.0009) +[2023-10-14 08:24:13,502][100917] Updated weights for policy 1, policy_version 79332 (0.0009) +[2023-10-14 08:24:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 162365440. Throughput: 0: 1655.4, 1: 1669.9. Samples: 40595906. Policy #0 lag: (min: 1.0, avg: 7.0, max: 33.0) +[2023-10-14 08:24:13,513][99942] Avg episode reward: [(0, '0.470'), (1, '1.000')] +[2023-10-14 08:24:13,879][100917] Updated weights for policy 1, policy_version 79342 (0.0008) +[2023-10-14 08:24:14,250][100917] Updated weights for policy 1, policy_version 79352 (0.0009) +[2023-10-14 08:24:17,200][100936] Updated weights for policy 0, policy_version 79240 (0.0007) +[2023-10-14 08:24:17,569][100936] Updated weights for policy 0, policy_version 79250 (0.0011) +[2023-10-14 08:24:17,947][100936] Updated weights for policy 0, policy_version 79260 (0.0008) +[2023-10-14 08:24:18,364][100917] Updated weights for policy 1, policy_version 79362 (0.0010) +[2023-10-14 08:24:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 162430976. Throughput: 0: 1646.2, 1: 1673.4. Samples: 40615996. Policy #0 lag: (min: 15.0, avg: 15.4, max: 28.0) +[2023-10-14 08:24:18,513][99942] Avg episode reward: [(0, '0.470'), (1, '1.000')] +[2023-10-14 08:24:18,728][100917] Updated weights for policy 1, policy_version 79372 (0.0007) +[2023-10-14 08:24:19,107][100917] Updated weights for policy 1, policy_version 79382 (0.0008) +[2023-10-14 08:24:19,483][100917] Updated weights for policy 1, policy_version 79392 (0.0007) +[2023-10-14 08:24:22,017][100936] Updated weights for policy 0, policy_version 79270 (0.0007) +[2023-10-14 08:24:22,383][100936] Updated weights for policy 0, policy_version 79280 (0.0008) +[2023-10-14 08:24:22,754][100936] Updated weights for policy 0, policy_version 79290 (0.0010) +[2023-10-14 08:24:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 162496512. Throughput: 0: 1652.6, 1: 1678.6. Samples: 40635740. Policy #0 lag: (min: 15.0, avg: 15.4, max: 28.0) +[2023-10-14 08:24:23,512][99942] Avg episode reward: [(0, '0.470'), (1, '1.000')] +[2023-10-14 08:24:23,734][100917] Updated weights for policy 1, policy_version 79402 (0.0009) +[2023-10-14 08:24:24,103][100917] Updated weights for policy 1, policy_version 79412 (0.0009) +[2023-10-14 08:24:24,474][100917] Updated weights for policy 1, policy_version 79422 (0.0009) +[2023-10-14 08:24:26,970][100936] Updated weights for policy 0, policy_version 79300 (0.0009) +[2023-10-14 08:24:27,346][100936] Updated weights for policy 0, policy_version 79310 (0.0010) +[2023-10-14 08:24:27,710][100936] Updated weights for policy 0, policy_version 79320 (0.0009) +[2023-10-14 08:24:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 162562048. Throughput: 0: 1651.3, 1: 1675.4. Samples: 40645850. Policy #0 lag: (min: 15.0, avg: 15.4, max: 28.0) +[2023-10-14 08:24:28,512][99942] Avg episode reward: [(0, '0.500'), (1, '1.000')] +[2023-10-14 08:24:28,555][100917] Updated weights for policy 1, policy_version 79432 (0.0008) +[2023-10-14 08:24:28,931][100917] Updated weights for policy 1, policy_version 79442 (0.0007) +[2023-10-14 08:24:29,300][100917] Updated weights for policy 1, policy_version 79452 (0.0007) +[2023-10-14 08:24:31,828][100936] Updated weights for policy 0, policy_version 79330 (0.0008) +[2023-10-14 08:24:32,206][100936] Updated weights for policy 0, policy_version 79340 (0.0009) +[2023-10-14 08:24:32,573][100936] Updated weights for policy 0, policy_version 79350 (0.0008) +[2023-10-14 08:24:32,943][100936] Updated weights for policy 0, policy_version 79360 (0.0007) +[2023-10-14 08:24:33,405][100917] Updated weights for policy 1, policy_version 79462 (0.0009) +[2023-10-14 08:24:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 162627584. Throughput: 0: 1648.4, 1: 1671.8. Samples: 40665508. Policy #0 lag: (min: 15.0, avg: 15.4, max: 28.0) +[2023-10-14 08:24:33,513][99942] Avg episode reward: [(0, '0.500'), (1, '1.000')] +[2023-10-14 08:24:33,769][100917] Updated weights for policy 1, policy_version 79472 (0.0008) +[2023-10-14 08:24:34,141][100917] Updated weights for policy 1, policy_version 79482 (0.0007) +[2023-10-14 08:24:37,011][100936] Updated weights for policy 0, policy_version 79370 (0.0009) +[2023-10-14 08:24:37,372][100936] Updated weights for policy 0, policy_version 79380 (0.0009) +[2023-10-14 08:24:37,748][100936] Updated weights for policy 0, policy_version 79390 (0.0010) +[2023-10-14 08:24:38,443][100917] Updated weights for policy 1, policy_version 79492 (0.0007) +[2023-10-14 08:24:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 162693120. Throughput: 0: 1650.9, 1: 1666.2. Samples: 40685216. Policy #0 lag: (min: 15.0, avg: 15.4, max: 28.0) +[2023-10-14 08:24:38,513][99942] Avg episode reward: [(0, '0.500'), (1, '1.000')] +[2023-10-14 08:24:38,821][100917] Updated weights for policy 1, policy_version 79502 (0.0007) +[2023-10-14 08:24:39,189][100917] Updated weights for policy 1, policy_version 79512 (0.0009) +[2023-10-14 08:24:41,706][100936] Updated weights for policy 0, policy_version 79400 (0.0008) +[2023-10-14 08:24:42,073][100936] Updated weights for policy 0, policy_version 79410 (0.0007) +[2023-10-14 08:24:42,447][100936] Updated weights for policy 0, policy_version 79420 (0.0007) +[2023-10-14 08:24:43,294][100917] Updated weights for policy 1, policy_version 79522 (0.0008) +[2023-10-14 08:24:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 162758656. Throughput: 0: 1654.2, 1: 1661.6. Samples: 40695640. Policy #0 lag: (min: 15.0, avg: 15.4, max: 28.0) +[2023-10-14 08:24:43,512][99942] Avg episode reward: [(0, '0.500'), (1, '1.000')] +[2023-10-14 08:24:43,667][100917] Updated weights for policy 1, policy_version 79532 (0.0009) +[2023-10-14 08:24:44,039][100917] Updated weights for policy 1, policy_version 79542 (0.0008) +[2023-10-14 08:24:44,406][100917] Updated weights for policy 1, policy_version 79552 (0.0010) +[2023-10-14 08:24:46,641][100936] Updated weights for policy 0, policy_version 79430 (0.0010) +[2023-10-14 08:24:47,012][100936] Updated weights for policy 0, policy_version 79440 (0.0007) +[2023-10-14 08:24:47,389][100936] Updated weights for policy 0, policy_version 79450 (0.0008) +[2023-10-14 08:24:48,508][100917] Updated weights for policy 1, policy_version 79562 (0.0009) +[2023-10-14 08:24:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 162824192. Throughput: 0: 1642.8, 1: 1660.7. Samples: 40714972. Policy #0 lag: (min: 15.0, avg: 15.4, max: 28.0) +[2023-10-14 08:24:48,512][99942] Avg episode reward: [(0, '0.500'), (1, '1.000')] +[2023-10-14 08:24:48,891][100917] Updated weights for policy 1, policy_version 79572 (0.0008) +[2023-10-14 08:24:49,257][100917] Updated weights for policy 1, policy_version 79582 (0.0008) +[2023-10-14 08:24:51,488][100936] Updated weights for policy 0, policy_version 79460 (0.0008) +[2023-10-14 08:24:51,857][100936] Updated weights for policy 0, policy_version 79470 (0.0010) +[2023-10-14 08:24:52,229][100936] Updated weights for policy 0, policy_version 79480 (0.0011) +[2023-10-14 08:24:53,318][100917] Updated weights for policy 1, policy_version 79592 (0.0008) +[2023-10-14 08:24:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 162889728. Throughput: 0: 1656.8, 1: 1657.0. Samples: 40734970. Policy #0 lag: (min: 15.0, avg: 15.4, max: 28.0) +[2023-10-14 08:24:53,513][99942] Avg episode reward: [(0, '0.500'), (1, '1.000')] +[2023-10-14 08:24:53,694][100917] Updated weights for policy 1, policy_version 79602 (0.0011) +[2023-10-14 08:24:54,060][100917] Updated weights for policy 1, policy_version 79612 (0.0009) +[2023-10-14 08:24:56,391][100936] Updated weights for policy 0, policy_version 79490 (0.0011) +[2023-10-14 08:24:56,759][100936] Updated weights for policy 0, policy_version 79500 (0.0009) +[2023-10-14 08:24:57,126][100936] Updated weights for policy 0, policy_version 79510 (0.0010) +[2023-10-14 08:24:57,501][100936] Updated weights for policy 0, policy_version 79520 (0.0010) +[2023-10-14 08:24:58,126][100917] Updated weights for policy 1, policy_version 79622 (0.0008) +[2023-10-14 08:24:58,505][100917] Updated weights for policy 1, policy_version 79632 (0.0008) +[2023-10-14 08:24:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 162955264. Throughput: 0: 1654.1, 1: 1658.3. Samples: 40744964. Policy #0 lag: (min: 15.0, avg: 15.4, max: 28.0) +[2023-10-14 08:24:58,513][99942] Avg episode reward: [(0, '0.500'), (1, '1.000')] +[2023-10-14 08:24:58,875][100917] Updated weights for policy 1, policy_version 79642 (0.0009) +[2023-10-14 08:25:01,708][100936] Updated weights for policy 0, policy_version 79530 (0.0009) +[2023-10-14 08:25:02,087][100936] Updated weights for policy 0, policy_version 79540 (0.0008) +[2023-10-14 08:25:02,460][100936] Updated weights for policy 0, policy_version 79550 (0.0009) +[2023-10-14 08:25:03,025][100917] Updated weights for policy 1, policy_version 79652 (0.0008) +[2023-10-14 08:25:03,406][100917] Updated weights for policy 1, policy_version 79662 (0.0008) +[2023-10-14 08:25:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 163020800. Throughput: 0: 1640.4, 1: 1657.9. Samples: 40764420. Policy #0 lag: (min: 15.0, avg: 15.4, max: 28.0) +[2023-10-14 08:25:03,512][99942] Avg episode reward: [(0, '0.500'), (1, '1.000')] +[2023-10-14 08:25:03,778][100917] Updated weights for policy 1, policy_version 79672 (0.0008) +[2023-10-14 08:25:06,754][100936] Updated weights for policy 0, policy_version 79560 (0.0009) +[2023-10-14 08:25:07,119][100936] Updated weights for policy 0, policy_version 79570 (0.0007) +[2023-10-14 08:25:07,492][100936] Updated weights for policy 0, policy_version 79580 (0.0008) +[2023-10-14 08:25:07,940][100917] Updated weights for policy 1, policy_version 79682 (0.0008) +[2023-10-14 08:25:08,309][100917] Updated weights for policy 1, policy_version 79692 (0.0009) +[2023-10-14 08:25:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 163086336. Throughput: 0: 1650.7, 1: 1653.3. Samples: 40784420. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:25:08,512][99942] Avg episode reward: [(0, '0.500'), (1, '1.000')] +[2023-10-14 08:25:08,688][100917] Updated weights for policy 1, policy_version 79702 (0.0010) +[2023-10-14 08:25:09,061][100917] Updated weights for policy 1, policy_version 79712 (0.0011) +[2023-10-14 08:25:11,624][100936] Updated weights for policy 0, policy_version 79590 (0.0008) +[2023-10-14 08:25:11,995][100936] Updated weights for policy 0, policy_version 79600 (0.0007) +[2023-10-14 08:25:12,362][100936] Updated weights for policy 0, policy_version 79610 (0.0008) +[2023-10-14 08:25:13,263][100917] Updated weights for policy 1, policy_version 79722 (0.0007) +[2023-10-14 08:25:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 163151872. Throughput: 0: 1648.0, 1: 1656.5. Samples: 40794556. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:25:13,515][99942] Avg episode reward: [(0, '0.500'), (1, '1.000')] +[2023-10-14 08:25:13,634][100917] Updated weights for policy 1, policy_version 79732 (0.0009) +[2023-10-14 08:25:14,013][100917] Updated weights for policy 1, policy_version 79742 (0.0010) +[2023-10-14 08:25:16,462][100936] Updated weights for policy 0, policy_version 79620 (0.0009) +[2023-10-14 08:25:16,833][100936] Updated weights for policy 0, policy_version 79630 (0.0010) +[2023-10-14 08:25:17,202][100936] Updated weights for policy 0, policy_version 79640 (0.0008) +[2023-10-14 08:25:18,291][100917] Updated weights for policy 1, policy_version 79752 (0.0009) +[2023-10-14 08:25:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 163217408. Throughput: 0: 1642.6, 1: 1653.0. Samples: 40813808. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:25:18,512][99942] Avg episode reward: [(0, '0.500'), (1, '1.000')] +[2023-10-14 08:25:18,659][100917] Updated weights for policy 1, policy_version 79762 (0.0011) +[2023-10-14 08:25:19,036][100917] Updated weights for policy 1, policy_version 79772 (0.0011) +[2023-10-14 08:25:21,386][100936] Updated weights for policy 0, policy_version 79650 (0.0007) +[2023-10-14 08:25:21,751][100936] Updated weights for policy 0, policy_version 79660 (0.0008) +[2023-10-14 08:25:22,117][100936] Updated weights for policy 0, policy_version 79670 (0.0007) +[2023-10-14 08:25:22,485][100936] Updated weights for policy 0, policy_version 79680 (0.0008) +[2023-10-14 08:25:23,179][100917] Updated weights for policy 1, policy_version 79782 (0.0010) +[2023-10-14 08:25:23,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 163282944. Throughput: 0: 1650.3, 1: 1655.1. Samples: 40833960. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:25:23,514][99942] Avg episode reward: [(0, '0.500'), (1, '1.000')] +[2023-10-14 08:25:23,558][100917] Updated weights for policy 1, policy_version 79792 (0.0009) +[2023-10-14 08:25:23,934][100917] Updated weights for policy 1, policy_version 79802 (0.0009) +[2023-10-14 08:25:26,501][100936] Updated weights for policy 0, policy_version 79690 (0.0009) +[2023-10-14 08:25:26,871][100936] Updated weights for policy 0, policy_version 79700 (0.0007) +[2023-10-14 08:25:27,246][100936] Updated weights for policy 0, policy_version 79710 (0.0007) +[2023-10-14 08:25:28,076][100917] Updated weights for policy 1, policy_version 79812 (0.0007) +[2023-10-14 08:25:28,450][100917] Updated weights for policy 1, policy_version 79822 (0.0007) +[2023-10-14 08:25:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 163348480. Throughput: 0: 1643.8, 1: 1650.9. Samples: 40843902. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:25:28,513][99942] Avg episode reward: [(0, '0.500'), (1, '1.000')] +[2023-10-14 08:25:28,817][100917] Updated weights for policy 1, policy_version 79832 (0.0009) +[2023-10-14 08:25:31,273][100936] Updated weights for policy 0, policy_version 79720 (0.0010) +[2023-10-14 08:25:31,628][100936] Updated weights for policy 0, policy_version 79730 (0.0007) +[2023-10-14 08:25:32,003][100936] Updated weights for policy 0, policy_version 79740 (0.0008) +[2023-10-14 08:25:33,053][100917] Updated weights for policy 1, policy_version 79842 (0.0007) +[2023-10-14 08:25:33,428][100917] Updated weights for policy 1, policy_version 79852 (0.0008) +[2023-10-14 08:25:33,512][99942] Fps is (10 sec: 13107.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 163414016. Throughput: 0: 1646.9, 1: 1646.3. Samples: 40863166. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:25:33,512][99942] Avg episode reward: [(0, '0.500'), (1, '1.000')] +[2023-10-14 08:25:33,796][100917] Updated weights for policy 1, policy_version 79862 (0.0008) +[2023-10-14 08:25:34,166][100917] Updated weights for policy 1, policy_version 79872 (0.0008) +[2023-10-14 08:25:36,191][100936] Updated weights for policy 0, policy_version 79750 (0.0008) +[2023-10-14 08:25:36,571][100936] Updated weights for policy 0, policy_version 79760 (0.0008) +[2023-10-14 08:25:36,938][100936] Updated weights for policy 0, policy_version 79770 (0.0009) +[2023-10-14 08:25:38,402][100917] Updated weights for policy 1, policy_version 79882 (0.0010) +[2023-10-14 08:25:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 163479552. Throughput: 0: 1657.6, 1: 1642.6. Samples: 40883478. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:25:38,512][99942] Avg episode reward: [(0, '0.500'), (1, '1.000')] +[2023-10-14 08:25:38,520][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000079776_81690624.pth... +[2023-10-14 08:25:38,557][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000078240_80117760.pth +[2023-10-14 08:25:38,778][100917] Updated weights for policy 1, policy_version 79892 (0.0008) +[2023-10-14 08:25:39,154][100917] Updated weights for policy 1, policy_version 79902 (0.0010) +[2023-10-14 08:25:39,220][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000079904_81821696.pth... +[2023-10-14 08:25:39,259][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000078336_80216064.pth +[2023-10-14 08:25:41,035][100936] Updated weights for policy 0, policy_version 79780 (0.0008) +[2023-10-14 08:25:41,396][100936] Updated weights for policy 0, policy_version 79790 (0.0007) +[2023-10-14 08:25:41,766][100936] Updated weights for policy 0, policy_version 79800 (0.0008) +[2023-10-14 08:25:42,998][100917] Updated weights for policy 1, policy_version 79912 (0.0008) +[2023-10-14 08:25:43,379][100917] Updated weights for policy 1, policy_version 79922 (0.0007) +[2023-10-14 08:25:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 163545088. Throughput: 0: 1647.2, 1: 1645.4. Samples: 40893130. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:25:43,512][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 08:25:43,746][100917] Updated weights for policy 1, policy_version 79932 (0.0007) +[2023-10-14 08:25:45,893][100936] Updated weights for policy 0, policy_version 79810 (0.0009) +[2023-10-14 08:25:46,270][100936] Updated weights for policy 0, policy_version 79820 (0.0010) +[2023-10-14 08:25:46,635][100936] Updated weights for policy 0, policy_version 79830 (0.0008) +[2023-10-14 08:25:47,003][100936] Updated weights for policy 0, policy_version 79840 (0.0007) +[2023-10-14 08:25:47,980][100917] Updated weights for policy 1, policy_version 79942 (0.0010) +[2023-10-14 08:25:48,353][100917] Updated weights for policy 1, policy_version 79952 (0.0011) +[2023-10-14 08:25:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 163610624. Throughput: 0: 1656.2, 1: 1643.1. Samples: 40912890. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:25:48,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 08:25:48,717][100917] Updated weights for policy 1, policy_version 79962 (0.0008) +[2023-10-14 08:25:51,023][100936] Updated weights for policy 0, policy_version 79850 (0.0009) +[2023-10-14 08:25:51,398][100936] Updated weights for policy 0, policy_version 79860 (0.0010) +[2023-10-14 08:25:51,761][100936] Updated weights for policy 0, policy_version 79870 (0.0007) +[2023-10-14 08:25:52,860][100917] Updated weights for policy 1, policy_version 79972 (0.0011) +[2023-10-14 08:25:53,242][100917] Updated weights for policy 1, policy_version 79982 (0.0010) +[2023-10-14 08:25:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 163676160. Throughput: 0: 1666.6, 1: 1642.3. Samples: 40933320. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:25:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:25:53,609][100917] Updated weights for policy 1, policy_version 79992 (0.0009) +[2023-10-14 08:25:55,987][100936] Updated weights for policy 0, policy_version 79880 (0.0009) +[2023-10-14 08:25:56,357][100936] Updated weights for policy 0, policy_version 79890 (0.0008) +[2023-10-14 08:25:56,730][100936] Updated weights for policy 0, policy_version 79900 (0.0008) +[2023-10-14 08:25:57,752][100917] Updated weights for policy 1, policy_version 80002 (0.0008) +[2023-10-14 08:25:58,145][100917] Updated weights for policy 1, policy_version 80012 (0.0009) +[2023-10-14 08:25:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 163741696. Throughput: 0: 1651.8, 1: 1647.8. Samples: 40943038. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) +[2023-10-14 08:25:58,512][100917] Updated weights for policy 1, policy_version 80022 (0.0010) +[2023-10-14 08:25:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:25:58,890][100917] Updated weights for policy 1, policy_version 80032 (0.0008) +[2023-10-14 08:26:00,949][100936] Updated weights for policy 0, policy_version 79910 (0.0008) +[2023-10-14 08:26:01,320][100936] Updated weights for policy 0, policy_version 79920 (0.0009) +[2023-10-14 08:26:01,693][100936] Updated weights for policy 0, policy_version 79930 (0.0009) +[2023-10-14 08:26:02,945][100917] Updated weights for policy 1, policy_version 80042 (0.0007) +[2023-10-14 08:26:03,312][100917] Updated weights for policy 1, policy_version 80052 (0.0007) +[2023-10-14 08:26:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 163807232. Throughput: 0: 1657.2, 1: 1654.6. Samples: 40962842. Policy #0 lag: (min: 27.0, avg: 27.4, max: 41.0) +[2023-10-14 08:26:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:26:03,686][100917] Updated weights for policy 1, policy_version 80062 (0.0007) +[2023-10-14 08:26:05,833][100936] Updated weights for policy 0, policy_version 79940 (0.0009) +[2023-10-14 08:26:06,202][100936] Updated weights for policy 0, policy_version 79950 (0.0009) +[2023-10-14 08:26:06,579][100936] Updated weights for policy 0, policy_version 79960 (0.0010) +[2023-10-14 08:26:07,613][100917] Updated weights for policy 1, policy_version 80072 (0.0007) +[2023-10-14 08:26:07,984][100917] Updated weights for policy 1, policy_version 80082 (0.0011) +[2023-10-14 08:26:08,361][100917] Updated weights for policy 1, policy_version 80092 (0.0008) +[2023-10-14 08:26:08,512][99942] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 163905536. Throughput: 0: 1664.0, 1: 1642.9. Samples: 40982772. Policy #0 lag: (min: 27.0, avg: 27.4, max: 41.0) +[2023-10-14 08:26:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:26:10,706][100936] Updated weights for policy 0, policy_version 79970 (0.0008) +[2023-10-14 08:26:11,076][100936] Updated weights for policy 0, policy_version 79980 (0.0007) +[2023-10-14 08:26:11,433][100936] Updated weights for policy 0, policy_version 79990 (0.0008) +[2023-10-14 08:26:11,805][100936] Updated weights for policy 0, policy_version 80000 (0.0008) +[2023-10-14 08:26:12,680][100917] Updated weights for policy 1, policy_version 80102 (0.0008) +[2023-10-14 08:26:13,057][100917] Updated weights for policy 1, policy_version 80112 (0.0008) +[2023-10-14 08:26:13,429][100917] Updated weights for policy 1, policy_version 80122 (0.0009) +[2023-10-14 08:26:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 163938304. Throughput: 0: 1652.8, 1: 1658.2. Samples: 40992898. Policy #0 lag: (min: 27.0, avg: 27.4, max: 41.0) +[2023-10-14 08:26:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:26:15,978][100936] Updated weights for policy 0, policy_version 80010 (0.0009) +[2023-10-14 08:26:16,346][100936] Updated weights for policy 0, policy_version 80020 (0.0007) +[2023-10-14 08:26:16,729][100936] Updated weights for policy 0, policy_version 80030 (0.0009) +[2023-10-14 08:26:17,759][100917] Updated weights for policy 1, policy_version 80132 (0.0008) +[2023-10-14 08:26:18,138][100917] Updated weights for policy 1, policy_version 80142 (0.0008) +[2023-10-14 08:26:18,512][100917] Updated weights for policy 1, policy_version 80152 (0.0008) +[2023-10-14 08:26:18,512][99942] Fps is (10 sec: 9830.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 164003840. Throughput: 0: 1662.3, 1: 1658.4. Samples: 41012596. Policy #0 lag: (min: 27.0, avg: 27.4, max: 41.0) +[2023-10-14 08:26:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:26:20,923][100936] Updated weights for policy 0, policy_version 80040 (0.0011) +[2023-10-14 08:26:21,296][100936] Updated weights for policy 0, policy_version 80050 (0.0010) +[2023-10-14 08:26:21,670][100936] Updated weights for policy 0, policy_version 80060 (0.0010) +[2023-10-14 08:26:22,623][100917] Updated weights for policy 1, policy_version 80162 (0.0007) +[2023-10-14 08:26:22,996][100917] Updated weights for policy 1, policy_version 80172 (0.0010) +[2023-10-14 08:26:23,364][100917] Updated weights for policy 1, policy_version 80182 (0.0009) +[2023-10-14 08:26:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 164069376. Throughput: 0: 1655.0, 1: 1654.4. Samples: 41032398. Policy #0 lag: (min: 27.0, avg: 27.4, max: 41.0) +[2023-10-14 08:26:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:26:23,734][100917] Updated weights for policy 1, policy_version 80192 (0.0007) +[2023-10-14 08:26:25,756][100936] Updated weights for policy 0, policy_version 80070 (0.0010) +[2023-10-14 08:26:26,133][100936] Updated weights for policy 0, policy_version 80080 (0.0009) +[2023-10-14 08:26:26,494][100936] Updated weights for policy 0, policy_version 80090 (0.0008) +[2023-10-14 08:26:27,897][100917] Updated weights for policy 1, policy_version 80202 (0.0008) +[2023-10-14 08:26:28,273][100917] Updated weights for policy 1, policy_version 80212 (0.0009) +[2023-10-14 08:26:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 164134912. Throughput: 0: 1648.8, 1: 1662.4. Samples: 41042132. Policy #0 lag: (min: 27.0, avg: 27.4, max: 41.0) +[2023-10-14 08:26:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:26:28,644][100917] Updated weights for policy 1, policy_version 80222 (0.0008) +[2023-10-14 08:26:30,354][100936] Updated weights for policy 0, policy_version 80100 (0.0009) +[2023-10-14 08:26:30,711][100936] Updated weights for policy 0, policy_version 80110 (0.0010) +[2023-10-14 08:26:31,079][100936] Updated weights for policy 0, policy_version 80120 (0.0009) +[2023-10-14 08:26:32,693][100917] Updated weights for policy 1, policy_version 80232 (0.0007) +[2023-10-14 08:26:33,062][100917] Updated weights for policy 1, policy_version 80242 (0.0008) +[2023-10-14 08:26:33,434][100917] Updated weights for policy 1, policy_version 80252 (0.0007) +[2023-10-14 08:26:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 164200448. Throughput: 0: 1658.8, 1: 1662.5. Samples: 41062350. Policy #0 lag: (min: 27.0, avg: 27.4, max: 41.0) +[2023-10-14 08:26:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:26:35,218][100936] Updated weights for policy 0, policy_version 80130 (0.0010) +[2023-10-14 08:26:35,581][100936] Updated weights for policy 0, policy_version 80140 (0.0008) +[2023-10-14 08:26:35,942][100936] Updated weights for policy 0, policy_version 80150 (0.0009) +[2023-10-14 08:26:36,307][100936] Updated weights for policy 0, policy_version 80160 (0.0010) +[2023-10-14 08:26:37,544][100917] Updated weights for policy 1, policy_version 80262 (0.0010) +[2023-10-14 08:26:37,908][100917] Updated weights for policy 1, policy_version 80272 (0.0009) +[2023-10-14 08:26:38,275][100917] Updated weights for policy 1, policy_version 80282 (0.0008) +[2023-10-14 08:26:38,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 164298752. Throughput: 0: 1658.8, 1: 1654.7. Samples: 41082430. Policy #0 lag: (min: 27.0, avg: 27.4, max: 41.0) +[2023-10-14 08:26:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:26:40,403][100936] Updated weights for policy 0, policy_version 80170 (0.0009) +[2023-10-14 08:26:40,776][100936] Updated weights for policy 0, policy_version 80180 (0.0007) +[2023-10-14 08:26:41,144][100936] Updated weights for policy 0, policy_version 80190 (0.0007) +[2023-10-14 08:26:42,446][100917] Updated weights for policy 1, policy_version 80292 (0.0011) +[2023-10-14 08:26:42,838][100917] Updated weights for policy 1, policy_version 80302 (0.0010) +[2023-10-14 08:26:43,211][100917] Updated weights for policy 1, policy_version 80312 (0.0008) +[2023-10-14 08:26:43,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 164364288. Throughput: 0: 1647.0, 1: 1662.7. Samples: 41091974. Policy #0 lag: (min: 27.0, avg: 27.4, max: 41.0) +[2023-10-14 08:26:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:26:45,331][100936] Updated weights for policy 0, policy_version 80200 (0.0008) +[2023-10-14 08:26:45,704][100936] Updated weights for policy 0, policy_version 80210 (0.0009) +[2023-10-14 08:26:46,069][100936] Updated weights for policy 0, policy_version 80220 (0.0007) +[2023-10-14 08:26:47,135][100917] Updated weights for policy 1, policy_version 80322 (0.0010) +[2023-10-14 08:26:47,502][100917] Updated weights for policy 1, policy_version 80332 (0.0011) +[2023-10-14 08:26:47,874][100917] Updated weights for policy 1, policy_version 80342 (0.0011) +[2023-10-14 08:26:48,253][100917] Updated weights for policy 1, policy_version 80352 (0.0008) +[2023-10-14 08:26:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 164429824. Throughput: 0: 1661.6, 1: 1656.6. Samples: 41112162. Policy #0 lag: (min: 27.0, avg: 27.4, max: 41.0) +[2023-10-14 08:26:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:26:50,157][100936] Updated weights for policy 0, policy_version 80230 (0.0008) +[2023-10-14 08:26:50,532][100936] Updated weights for policy 0, policy_version 80240 (0.0009) +[2023-10-14 08:26:50,907][100936] Updated weights for policy 0, policy_version 80250 (0.0010) +[2023-10-14 08:26:52,448][100917] Updated weights for policy 1, policy_version 80362 (0.0012) +[2023-10-14 08:26:52,834][100917] Updated weights for policy 1, policy_version 80372 (0.0011) +[2023-10-14 08:26:53,203][100917] Updated weights for policy 1, policy_version 80382 (0.0011) +[2023-10-14 08:26:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 164495360. Throughput: 0: 1659.6, 1: 1651.3. Samples: 41131760. Policy #0 lag: (min: 27.0, avg: 27.4, max: 41.0) +[2023-10-14 08:26:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:26:55,144][100936] Updated weights for policy 0, policy_version 80260 (0.0008) +[2023-10-14 08:26:55,511][100936] Updated weights for policy 0, policy_version 80270 (0.0007) +[2023-10-14 08:26:55,877][100936] Updated weights for policy 0, policy_version 80280 (0.0007) +[2023-10-14 08:26:57,220][100917] Updated weights for policy 1, policy_version 80392 (0.0010) +[2023-10-14 08:26:57,590][100917] Updated weights for policy 1, policy_version 80402 (0.0010) +[2023-10-14 08:26:57,967][100917] Updated weights for policy 1, policy_version 80412 (0.0008) +[2023-10-14 08:26:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 164560896. Throughput: 0: 1643.6, 1: 1661.4. Samples: 41141622. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:26:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:27:00,022][100936] Updated weights for policy 0, policy_version 80290 (0.0007) +[2023-10-14 08:27:00,389][100936] Updated weights for policy 0, policy_version 80300 (0.0007) +[2023-10-14 08:27:00,753][100936] Updated weights for policy 0, policy_version 80310 (0.0007) +[2023-10-14 08:27:01,123][100936] Updated weights for policy 0, policy_version 80320 (0.0009) +[2023-10-14 08:27:02,127][100917] Updated weights for policy 1, policy_version 80422 (0.0008) +[2023-10-14 08:27:02,494][100917] Updated weights for policy 1, policy_version 80432 (0.0010) +[2023-10-14 08:27:02,872][100917] Updated weights for policy 1, policy_version 80442 (0.0009) +[2023-10-14 08:27:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 164626432. Throughput: 0: 1654.3, 1: 1665.9. Samples: 41162002. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:27:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:27:05,259][100936] Updated weights for policy 0, policy_version 80330 (0.0007) +[2023-10-14 08:27:05,620][100936] Updated weights for policy 0, policy_version 80340 (0.0009) +[2023-10-14 08:27:05,988][100936] Updated weights for policy 0, policy_version 80350 (0.0010) +[2023-10-14 08:27:07,027][100917] Updated weights for policy 1, policy_version 80452 (0.0008) +[2023-10-14 08:27:07,412][100917] Updated weights for policy 1, policy_version 80462 (0.0009) +[2023-10-14 08:27:07,783][100917] Updated weights for policy 1, policy_version 80472 (0.0009) +[2023-10-14 08:27:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 164691968. Throughput: 0: 1665.3, 1: 1649.3. Samples: 41181556. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:27:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:27:10,359][100936] Updated weights for policy 0, policy_version 80360 (0.0010) +[2023-10-14 08:27:10,733][100936] Updated weights for policy 0, policy_version 80370 (0.0009) +[2023-10-14 08:27:11,109][100936] Updated weights for policy 0, policy_version 80380 (0.0009) +[2023-10-14 08:27:11,796][100917] Updated weights for policy 1, policy_version 80482 (0.0008) +[2023-10-14 08:27:12,169][100917] Updated weights for policy 1, policy_version 80492 (0.0008) +[2023-10-14 08:27:12,550][100917] Updated weights for policy 1, policy_version 80502 (0.0007) +[2023-10-14 08:27:12,927][100917] Updated weights for policy 1, policy_version 80512 (0.0007) +[2023-10-14 08:27:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 164757504. Throughput: 0: 1653.0, 1: 1669.6. Samples: 41191650. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:27:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:27:15,276][100936] Updated weights for policy 0, policy_version 80390 (0.0008) +[2023-10-14 08:27:15,642][100936] Updated weights for policy 0, policy_version 80400 (0.0008) +[2023-10-14 08:27:16,018][100936] Updated weights for policy 0, policy_version 80410 (0.0008) +[2023-10-14 08:27:16,965][100917] Updated weights for policy 1, policy_version 80522 (0.0011) +[2023-10-14 08:27:17,349][100917] Updated weights for policy 1, policy_version 80532 (0.0010) +[2023-10-14 08:27:17,727][100917] Updated weights for policy 1, policy_version 80542 (0.0012) +[2023-10-14 08:27:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 164823040. Throughput: 0: 1655.1, 1: 1663.8. Samples: 41211702. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:27:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:27:19,871][100936] Updated weights for policy 0, policy_version 80420 (0.0008) +[2023-10-14 08:27:20,240][100936] Updated weights for policy 0, policy_version 80430 (0.0007) +[2023-10-14 08:27:20,604][100936] Updated weights for policy 0, policy_version 80440 (0.0007) +[2023-10-14 08:27:21,939][100917] Updated weights for policy 1, policy_version 80552 (0.0009) +[2023-10-14 08:27:22,305][100917] Updated weights for policy 1, policy_version 80562 (0.0007) +[2023-10-14 08:27:22,669][100917] Updated weights for policy 1, policy_version 80572 (0.0007) +[2023-10-14 08:27:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 164888576. Throughput: 0: 1657.8, 1: 1653.5. Samples: 41231436. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:27:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:27:24,783][100936] Updated weights for policy 0, policy_version 80450 (0.0008) +[2023-10-14 08:27:25,154][100936] Updated weights for policy 0, policy_version 80460 (0.0008) +[2023-10-14 08:27:25,518][100936] Updated weights for policy 0, policy_version 80470 (0.0007) +[2023-10-14 08:27:25,885][100936] Updated weights for policy 0, policy_version 80480 (0.0008) +[2023-10-14 08:27:26,817][100917] Updated weights for policy 1, policy_version 80582 (0.0007) +[2023-10-14 08:27:27,186][100917] Updated weights for policy 1, policy_version 80592 (0.0007) +[2023-10-14 08:27:27,567][100917] Updated weights for policy 1, policy_version 80602 (0.0009) +[2023-10-14 08:27:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 164954112. Throughput: 0: 1655.9, 1: 1668.2. Samples: 41241558. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:27:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:27:30,165][100936] Updated weights for policy 0, policy_version 80490 (0.0008) +[2023-10-14 08:27:30,540][100936] Updated weights for policy 0, policy_version 80500 (0.0009) +[2023-10-14 08:27:30,907][100936] Updated weights for policy 0, policy_version 80510 (0.0010) +[2023-10-14 08:27:31,689][100917] Updated weights for policy 1, policy_version 80612 (0.0009) +[2023-10-14 08:27:32,083][100917] Updated weights for policy 1, policy_version 80622 (0.0008) +[2023-10-14 08:27:32,456][100917] Updated weights for policy 1, policy_version 80632 (0.0008) +[2023-10-14 08:27:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 165019648. Throughput: 0: 1660.8, 1: 1659.6. Samples: 41261580. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:27:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:27:35,131][100936] Updated weights for policy 0, policy_version 80520 (0.0008) +[2023-10-14 08:27:35,498][100936] Updated weights for policy 0, policy_version 80530 (0.0007) +[2023-10-14 08:27:35,868][100936] Updated weights for policy 0, policy_version 80540 (0.0007) +[2023-10-14 08:27:36,535][100917] Updated weights for policy 1, policy_version 80642 (0.0009) +[2023-10-14 08:27:36,913][100917] Updated weights for policy 1, policy_version 80652 (0.0008) +[2023-10-14 08:27:37,287][100917] Updated weights for policy 1, policy_version 80662 (0.0010) +[2023-10-14 08:27:37,654][100917] Updated weights for policy 1, policy_version 80672 (0.0008) +[2023-10-14 08:27:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 165085184. Throughput: 0: 1660.6, 1: 1654.3. Samples: 41280930. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:27:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:27:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000080544_82477056.pth... +[2023-10-14 08:27:38,524][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000080672_82608128.pth... +[2023-10-14 08:27:38,556][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000079008_80904192.pth +[2023-10-14 08:27:38,564][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000079104_81002496.pth +[2023-10-14 08:27:39,945][100936] Updated weights for policy 0, policy_version 80550 (0.0009) +[2023-10-14 08:27:40,316][100936] Updated weights for policy 0, policy_version 80560 (0.0007) +[2023-10-14 08:27:40,682][100936] Updated weights for policy 0, policy_version 80570 (0.0007) +[2023-10-14 08:27:41,837][100917] Updated weights for policy 1, policy_version 80682 (0.0010) +[2023-10-14 08:27:42,205][100917] Updated weights for policy 1, policy_version 80692 (0.0009) +[2023-10-14 08:27:42,581][100917] Updated weights for policy 1, policy_version 80702 (0.0007) +[2023-10-14 08:27:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 165150720. Throughput: 0: 1660.4, 1: 1659.8. Samples: 41291032. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:27:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:27:44,685][100936] Updated weights for policy 0, policy_version 80580 (0.0009) +[2023-10-14 08:27:45,058][100936] Updated weights for policy 0, policy_version 80590 (0.0009) +[2023-10-14 08:27:45,426][100936] Updated weights for policy 0, policy_version 80600 (0.0007) +[2023-10-14 08:27:46,482][100917] Updated weights for policy 1, policy_version 80712 (0.0010) +[2023-10-14 08:27:46,848][100917] Updated weights for policy 1, policy_version 80722 (0.0007) +[2023-10-14 08:27:47,225][100917] Updated weights for policy 1, policy_version 80732 (0.0010) +[2023-10-14 08:27:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 165216256. Throughput: 0: 1664.1, 1: 1645.4. Samples: 41310928. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:27:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:27:49,676][100936] Updated weights for policy 0, policy_version 80610 (0.0007) +[2023-10-14 08:27:50,054][100936] Updated weights for policy 0, policy_version 80620 (0.0008) +[2023-10-14 08:27:50,415][100936] Updated weights for policy 0, policy_version 80630 (0.0009) +[2023-10-14 08:27:50,794][100936] Updated weights for policy 0, policy_version 80640 (0.0010) +[2023-10-14 08:27:51,382][100917] Updated weights for policy 1, policy_version 80742 (0.0009) +[2023-10-14 08:27:51,759][100917] Updated weights for policy 1, policy_version 80752 (0.0009) +[2023-10-14 08:27:52,128][100917] Updated weights for policy 1, policy_version 80762 (0.0009) +[2023-10-14 08:27:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 165281792. Throughput: 0: 1661.3, 1: 1659.0. Samples: 41330972. Policy #0 lag: (min: 14.0, avg: 14.3, max: 26.0) +[2023-10-14 08:27:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:27:54,794][100936] Updated weights for policy 0, policy_version 80650 (0.0009) +[2023-10-14 08:27:55,161][100936] Updated weights for policy 0, policy_version 80660 (0.0010) +[2023-10-14 08:27:55,530][100936] Updated weights for policy 0, policy_version 80670 (0.0008) +[2023-10-14 08:27:56,243][100917] Updated weights for policy 1, policy_version 80772 (0.0010) +[2023-10-14 08:27:56,624][100917] Updated weights for policy 1, policy_version 80782 (0.0008) +[2023-10-14 08:27:56,990][100917] Updated weights for policy 1, policy_version 80792 (0.0007) +[2023-10-14 08:27:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 165347328. Throughput: 0: 1663.8, 1: 1657.6. Samples: 41341112. Policy #0 lag: (min: 14.0, avg: 14.3, max: 26.0) +[2023-10-14 08:27:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:27:59,490][100936] Updated weights for policy 0, policy_version 80680 (0.0008) +[2023-10-14 08:27:59,869][100936] Updated weights for policy 0, policy_version 80690 (0.0010) +[2023-10-14 08:28:00,236][100936] Updated weights for policy 0, policy_version 80700 (0.0010) +[2023-10-14 08:28:01,067][100917] Updated weights for policy 1, policy_version 80802 (0.0008) +[2023-10-14 08:28:01,442][100917] Updated weights for policy 1, policy_version 80812 (0.0007) +[2023-10-14 08:28:01,817][100917] Updated weights for policy 1, policy_version 80822 (0.0010) +[2023-10-14 08:28:02,184][100917] Updated weights for policy 1, policy_version 80832 (0.0011) +[2023-10-14 08:28:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 165412864. Throughput: 0: 1663.5, 1: 1643.7. Samples: 41360524. Policy #0 lag: (min: 14.0, avg: 14.3, max: 26.0) +[2023-10-14 08:28:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:28:04,598][100936] Updated weights for policy 0, policy_version 80710 (0.0008) +[2023-10-14 08:28:04,959][100936] Updated weights for policy 0, policy_version 80720 (0.0008) +[2023-10-14 08:28:05,323][100936] Updated weights for policy 0, policy_version 80730 (0.0008) +[2023-10-14 08:28:06,490][100917] Updated weights for policy 1, policy_version 80842 (0.0011) +[2023-10-14 08:28:06,860][100917] Updated weights for policy 1, policy_version 80852 (0.0010) +[2023-10-14 08:28:07,231][100917] Updated weights for policy 1, policy_version 80862 (0.0009) +[2023-10-14 08:28:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 165478400. Throughput: 0: 1655.6, 1: 1656.4. Samples: 41380478. Policy #0 lag: (min: 14.0, avg: 14.3, max: 26.0) +[2023-10-14 08:28:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:28:09,492][100936] Updated weights for policy 0, policy_version 80740 (0.0009) +[2023-10-14 08:28:09,860][100936] Updated weights for policy 0, policy_version 80750 (0.0010) +[2023-10-14 08:28:10,223][100936] Updated weights for policy 0, policy_version 80760 (0.0011) +[2023-10-14 08:28:11,137][100917] Updated weights for policy 1, policy_version 80872 (0.0010) +[2023-10-14 08:28:11,509][100917] Updated weights for policy 1, policy_version 80882 (0.0008) +[2023-10-14 08:28:11,880][100917] Updated weights for policy 1, policy_version 80892 (0.0010) +[2023-10-14 08:28:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 165543936. Throughput: 0: 1654.7, 1: 1660.5. Samples: 41390740. Policy #0 lag: (min: 14.0, avg: 14.3, max: 26.0) +[2023-10-14 08:28:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:28:14,536][100936] Updated weights for policy 0, policy_version 80770 (0.0010) +[2023-10-14 08:28:14,907][100936] Updated weights for policy 0, policy_version 80780 (0.0008) +[2023-10-14 08:28:15,268][100936] Updated weights for policy 0, policy_version 80790 (0.0009) +[2023-10-14 08:28:15,642][100936] Updated weights for policy 0, policy_version 80800 (0.0008) +[2023-10-14 08:28:15,964][100917] Updated weights for policy 1, policy_version 80902 (0.0007) +[2023-10-14 08:28:16,336][100917] Updated weights for policy 1, policy_version 80912 (0.0009) +[2023-10-14 08:28:16,709][100917] Updated weights for policy 1, policy_version 80922 (0.0009) +[2023-10-14 08:28:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 165609472. Throughput: 0: 1650.6, 1: 1649.0. Samples: 41410064. Policy #0 lag: (min: 14.0, avg: 14.3, max: 26.0) +[2023-10-14 08:28:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:28:19,740][100936] Updated weights for policy 0, policy_version 80810 (0.0008) +[2023-10-14 08:28:20,112][100936] Updated weights for policy 0, policy_version 80820 (0.0008) +[2023-10-14 08:28:20,488][100936] Updated weights for policy 0, policy_version 80830 (0.0008) +[2023-10-14 08:28:20,947][100917] Updated weights for policy 1, policy_version 80932 (0.0010) +[2023-10-14 08:28:21,333][100917] Updated weights for policy 1, policy_version 80942 (0.0010) +[2023-10-14 08:28:21,709][100917] Updated weights for policy 1, policy_version 80952 (0.0008) +[2023-10-14 08:28:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 165675008. Throughput: 0: 1650.8, 1: 1668.8. Samples: 41430312. Policy #0 lag: (min: 14.0, avg: 14.3, max: 26.0) +[2023-10-14 08:28:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:28:24,589][100936] Updated weights for policy 0, policy_version 80840 (0.0009) +[2023-10-14 08:28:24,957][100936] Updated weights for policy 0, policy_version 80850 (0.0010) +[2023-10-14 08:28:25,341][100936] Updated weights for policy 0, policy_version 80860 (0.0010) +[2023-10-14 08:28:25,829][100917] Updated weights for policy 1, policy_version 80962 (0.0009) +[2023-10-14 08:28:26,199][100917] Updated weights for policy 1, policy_version 80972 (0.0009) +[2023-10-14 08:28:26,564][100917] Updated weights for policy 1, policy_version 80982 (0.0008) +[2023-10-14 08:28:26,933][100917] Updated weights for policy 1, policy_version 80992 (0.0007) +[2023-10-14 08:28:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 165740544. Throughput: 0: 1652.4, 1: 1665.9. Samples: 41440352. Policy #0 lag: (min: 14.0, avg: 14.3, max: 26.0) +[2023-10-14 08:28:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:28:29,612][100936] Updated weights for policy 0, policy_version 80870 (0.0010) +[2023-10-14 08:28:29,986][100936] Updated weights for policy 0, policy_version 80880 (0.0009) +[2023-10-14 08:28:30,356][100936] Updated weights for policy 0, policy_version 80890 (0.0008) +[2023-10-14 08:28:31,014][100917] Updated weights for policy 1, policy_version 81002 (0.0009) +[2023-10-14 08:28:31,382][100917] Updated weights for policy 1, policy_version 81012 (0.0009) +[2023-10-14 08:28:31,751][100917] Updated weights for policy 1, policy_version 81022 (0.0008) +[2023-10-14 08:28:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 165806080. Throughput: 0: 1649.6, 1: 1657.5. Samples: 41459746. Policy #0 lag: (min: 14.0, avg: 14.3, max: 26.0) +[2023-10-14 08:28:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:28:34,366][100936] Updated weights for policy 0, policy_version 80900 (0.0009) +[2023-10-14 08:28:34,729][100936] Updated weights for policy 0, policy_version 80910 (0.0011) +[2023-10-14 08:28:35,109][100936] Updated weights for policy 0, policy_version 80920 (0.0008) +[2023-10-14 08:28:35,982][100917] Updated weights for policy 1, policy_version 81032 (0.0010) +[2023-10-14 08:28:36,351][100917] Updated weights for policy 1, policy_version 81042 (0.0012) +[2023-10-14 08:28:36,728][100917] Updated weights for policy 1, policy_version 81052 (0.0008) +[2023-10-14 08:28:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 165871616. Throughput: 0: 1645.6, 1: 1669.8. Samples: 41480164. Policy #0 lag: (min: 14.0, avg: 14.3, max: 26.0) +[2023-10-14 08:28:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:28:39,407][100936] Updated weights for policy 0, policy_version 80930 (0.0008) +[2023-10-14 08:28:39,775][100936] Updated weights for policy 0, policy_version 80940 (0.0007) +[2023-10-14 08:28:40,147][100936] Updated weights for policy 0, policy_version 80950 (0.0007) +[2023-10-14 08:28:40,521][100936] Updated weights for policy 0, policy_version 80960 (0.0008) +[2023-10-14 08:28:40,704][100917] Updated weights for policy 1, policy_version 81062 (0.0008) +[2023-10-14 08:28:41,079][100917] Updated weights for policy 1, policy_version 81072 (0.0009) +[2023-10-14 08:28:41,437][100917] Updated weights for policy 1, policy_version 81082 (0.0009) +[2023-10-14 08:28:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 165937152. Throughput: 0: 1645.6, 1: 1662.4. Samples: 41489970. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:28:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:28:44,717][100936] Updated weights for policy 0, policy_version 80970 (0.0010) +[2023-10-14 08:28:45,096][100936] Updated weights for policy 0, policy_version 80980 (0.0010) +[2023-10-14 08:28:45,466][100936] Updated weights for policy 0, policy_version 80990 (0.0008) +[2023-10-14 08:28:45,481][100917] Updated weights for policy 1, policy_version 81092 (0.0007) +[2023-10-14 08:28:45,855][100917] Updated weights for policy 1, policy_version 81102 (0.0008) +[2023-10-14 08:28:46,230][100917] Updated weights for policy 1, policy_version 81112 (0.0008) +[2023-10-14 08:28:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 166002688. Throughput: 0: 1646.3, 1: 1663.2. Samples: 41509452. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:28:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:28:49,773][100936] Updated weights for policy 0, policy_version 81000 (0.0009) +[2023-10-14 08:28:50,154][100936] Updated weights for policy 0, policy_version 81010 (0.0009) +[2023-10-14 08:28:50,354][100917] Updated weights for policy 1, policy_version 81122 (0.0008) +[2023-10-14 08:28:50,532][100936] Updated weights for policy 0, policy_version 81020 (0.0008) +[2023-10-14 08:28:50,742][100917] Updated weights for policy 1, policy_version 81132 (0.0011) +[2023-10-14 08:28:51,103][100917] Updated weights for policy 1, policy_version 81142 (0.0010) +[2023-10-14 08:28:51,484][100917] Updated weights for policy 1, policy_version 81152 (0.0008) +[2023-10-14 08:28:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 166068224. Throughput: 0: 1647.1, 1: 1671.8. Samples: 41529828. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:28:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:28:54,570][100936] Updated weights for policy 0, policy_version 81030 (0.0007) +[2023-10-14 08:28:54,945][100936] Updated weights for policy 0, policy_version 81040 (0.0009) +[2023-10-14 08:28:55,317][100936] Updated weights for policy 0, policy_version 81050 (0.0009) +[2023-10-14 08:28:55,527][100917] Updated weights for policy 1, policy_version 81162 (0.0008) +[2023-10-14 08:28:55,908][100917] Updated weights for policy 1, policy_version 81172 (0.0008) +[2023-10-14 08:28:56,282][100917] Updated weights for policy 1, policy_version 81182 (0.0008) +[2023-10-14 08:28:58,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 166133760. Throughput: 0: 1649.1, 1: 1648.3. Samples: 41539120. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:28:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:28:59,402][100936] Updated weights for policy 0, policy_version 81060 (0.0010) +[2023-10-14 08:28:59,778][100936] Updated weights for policy 0, policy_version 81070 (0.0008) +[2023-10-14 08:29:00,141][100936] Updated weights for policy 0, policy_version 81080 (0.0008) +[2023-10-14 08:29:00,405][100917] Updated weights for policy 1, policy_version 81192 (0.0008) +[2023-10-14 08:29:00,774][100917] Updated weights for policy 1, policy_version 81202 (0.0011) +[2023-10-14 08:29:01,155][100917] Updated weights for policy 1, policy_version 81212 (0.0009) +[2023-10-14 08:29:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 166199296. Throughput: 0: 1650.7, 1: 1658.9. Samples: 41558998. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:29:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:29:04,168][100936] Updated weights for policy 0, policy_version 81090 (0.0007) +[2023-10-14 08:29:04,549][100936] Updated weights for policy 0, policy_version 81100 (0.0008) +[2023-10-14 08:29:04,917][100936] Updated weights for policy 0, policy_version 81110 (0.0009) +[2023-10-14 08:29:05,296][100936] Updated weights for policy 0, policy_version 81120 (0.0008) +[2023-10-14 08:29:05,339][100917] Updated weights for policy 1, policy_version 81222 (0.0009) +[2023-10-14 08:29:05,716][100917] Updated weights for policy 1, policy_version 81232 (0.0007) +[2023-10-14 08:29:06,095][100917] Updated weights for policy 1, policy_version 81242 (0.0008) +[2023-10-14 08:29:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 166264832. Throughput: 0: 1656.2, 1: 1658.0. Samples: 41579448. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:29:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:29:09,197][100936] Updated weights for policy 0, policy_version 81130 (0.0008) +[2023-10-14 08:29:09,569][100936] Updated weights for policy 0, policy_version 81140 (0.0009) +[2023-10-14 08:29:09,932][100936] Updated weights for policy 0, policy_version 81150 (0.0008) +[2023-10-14 08:29:10,238][100917] Updated weights for policy 1, policy_version 81252 (0.0009) +[2023-10-14 08:29:10,608][100917] Updated weights for policy 1, policy_version 81262 (0.0009) +[2023-10-14 08:29:10,984][100917] Updated weights for policy 1, policy_version 81272 (0.0008) +[2023-10-14 08:29:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 166330368. Throughput: 0: 1657.6, 1: 1643.5. Samples: 41588902. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:29:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:29:14,161][100936] Updated weights for policy 0, policy_version 81160 (0.0008) +[2023-10-14 08:29:14,536][100936] Updated weights for policy 0, policy_version 81170 (0.0008) +[2023-10-14 08:29:14,910][100936] Updated weights for policy 0, policy_version 81180 (0.0008) +[2023-10-14 08:29:15,167][100917] Updated weights for policy 1, policy_version 81282 (0.0008) +[2023-10-14 08:29:15,537][100917] Updated weights for policy 1, policy_version 81292 (0.0008) +[2023-10-14 08:29:15,899][100917] Updated weights for policy 1, policy_version 81302 (0.0008) +[2023-10-14 08:29:16,271][100917] Updated weights for policy 1, policy_version 81312 (0.0009) +[2023-10-14 08:29:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 166395904. Throughput: 0: 1654.8, 1: 1658.1. Samples: 41608826. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:29:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:29:18,908][100936] Updated weights for policy 0, policy_version 81190 (0.0008) +[2023-10-14 08:29:19,278][100936] Updated weights for policy 0, policy_version 81200 (0.0008) +[2023-10-14 08:29:19,645][100936] Updated weights for policy 0, policy_version 81210 (0.0008) +[2023-10-14 08:29:20,294][100917] Updated weights for policy 1, policy_version 81322 (0.0010) +[2023-10-14 08:29:20,672][100917] Updated weights for policy 1, policy_version 81332 (0.0009) +[2023-10-14 08:29:21,042][100917] Updated weights for policy 1, policy_version 81342 (0.0007) +[2023-10-14 08:29:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 166461440. Throughput: 0: 1659.6, 1: 1662.0. Samples: 41629632. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:29:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:29:23,886][100936] Updated weights for policy 0, policy_version 81220 (0.0008) +[2023-10-14 08:29:24,253][100936] Updated weights for policy 0, policy_version 81230 (0.0010) +[2023-10-14 08:29:24,622][100936] Updated weights for policy 0, policy_version 81240 (0.0007) +[2023-10-14 08:29:25,227][100917] Updated weights for policy 1, policy_version 81352 (0.0010) +[2023-10-14 08:29:25,596][100917] Updated weights for policy 1, policy_version 81362 (0.0008) +[2023-10-14 08:29:25,969][100917] Updated weights for policy 1, policy_version 81372 (0.0007) +[2023-10-14 08:29:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 166526976. Throughput: 0: 1662.8, 1: 1644.0. Samples: 41638776. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:29:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:29:28,654][100936] Updated weights for policy 0, policy_version 81250 (0.0008) +[2023-10-14 08:29:29,024][100936] Updated weights for policy 0, policy_version 81260 (0.0009) +[2023-10-14 08:29:29,385][100936] Updated weights for policy 0, policy_version 81270 (0.0009) +[2023-10-14 08:29:29,755][100936] Updated weights for policy 0, policy_version 81280 (0.0010) +[2023-10-14 08:29:30,056][100917] Updated weights for policy 1, policy_version 81382 (0.0008) +[2023-10-14 08:29:30,433][100917] Updated weights for policy 1, policy_version 81392 (0.0007) +[2023-10-14 08:29:30,802][100917] Updated weights for policy 1, policy_version 81402 (0.0008) +[2023-10-14 08:29:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 166592512. Throughput: 0: 1664.7, 1: 1661.2. Samples: 41659116. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:29:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:29:34,012][100936] Updated weights for policy 0, policy_version 81290 (0.0010) +[2023-10-14 08:29:34,389][100936] Updated weights for policy 0, policy_version 81300 (0.0008) +[2023-10-14 08:29:34,748][100936] Updated weights for policy 0, policy_version 81310 (0.0009) +[2023-10-14 08:29:34,825][100917] Updated weights for policy 1, policy_version 81412 (0.0009) +[2023-10-14 08:29:35,194][100917] Updated weights for policy 1, policy_version 81422 (0.0009) +[2023-10-14 08:29:35,570][100917] Updated weights for policy 1, policy_version 81432 (0.0009) +[2023-10-14 08:29:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 166658048. Throughput: 0: 1664.2, 1: 1663.5. Samples: 41679576. Policy #0 lag: (min: 25.0, avg: 36.5, max: 57.0) +[2023-10-14 08:29:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:29:38,521][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000081440_83394560.pth... +[2023-10-14 08:29:38,554][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000079904_81821696.pth +[2023-10-14 08:29:38,768][100936] Updated weights for policy 0, policy_version 81320 (0.0008) +[2023-10-14 08:29:39,140][100936] Updated weights for policy 0, policy_version 81330 (0.0008) +[2023-10-14 08:29:39,486][100917] Updated weights for policy 1, policy_version 81442 (0.0009) +[2023-10-14 08:29:39,505][100936] Updated weights for policy 0, policy_version 81340 (0.0010) +[2023-10-14 08:29:39,643][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000081344_83296256.pth... +[2023-10-14 08:29:39,673][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000079776_81690624.pth +[2023-10-14 08:29:39,852][100917] Updated weights for policy 1, policy_version 81452 (0.0011) +[2023-10-14 08:29:40,231][100917] Updated weights for policy 1, policy_version 81462 (0.0010) +[2023-10-14 08:29:40,613][100917] Updated weights for policy 1, policy_version 81472 (0.0009) +[2023-10-14 08:29:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 166723584. Throughput: 0: 1665.1, 1: 1655.2. Samples: 41688536. Policy #0 lag: (min: 25.0, avg: 36.5, max: 57.0) +[2023-10-14 08:29:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:29:43,701][100936] Updated weights for policy 0, policy_version 81350 (0.0009) +[2023-10-14 08:29:44,069][100936] Updated weights for policy 0, policy_version 81360 (0.0007) +[2023-10-14 08:29:44,436][100936] Updated weights for policy 0, policy_version 81370 (0.0009) +[2023-10-14 08:29:44,935][100917] Updated weights for policy 1, policy_version 81482 (0.0010) +[2023-10-14 08:29:45,306][100917] Updated weights for policy 1, policy_version 81492 (0.0007) +[2023-10-14 08:29:45,675][100917] Updated weights for policy 1, policy_version 81502 (0.0009) +[2023-10-14 08:29:48,472][100936] Updated weights for policy 0, policy_version 81380 (0.0009) +[2023-10-14 08:29:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 166789120. Throughput: 0: 1668.3, 1: 1660.5. Samples: 41708794. Policy #0 lag: (min: 25.0, avg: 36.5, max: 57.0) +[2023-10-14 08:29:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:29:48,840][100936] Updated weights for policy 0, policy_version 81390 (0.0008) +[2023-10-14 08:29:49,203][100936] Updated weights for policy 0, policy_version 81400 (0.0009) +[2023-10-14 08:29:49,875][100917] Updated weights for policy 1, policy_version 81512 (0.0009) +[2023-10-14 08:29:50,257][100917] Updated weights for policy 1, policy_version 81522 (0.0007) +[2023-10-14 08:29:50,637][100917] Updated weights for policy 1, policy_version 81532 (0.0007) +[2023-10-14 08:29:53,412][100936] Updated weights for policy 0, policy_version 81410 (0.0009) +[2023-10-14 08:29:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 166854656. Throughput: 0: 1659.5, 1: 1664.2. Samples: 41729016. Policy #0 lag: (min: 25.0, avg: 36.5, max: 57.0) +[2023-10-14 08:29:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:29:53,784][100936] Updated weights for policy 0, policy_version 81420 (0.0008) +[2023-10-14 08:29:54,149][100936] Updated weights for policy 0, policy_version 81430 (0.0009) +[2023-10-14 08:29:54,524][100936] Updated weights for policy 0, policy_version 81440 (0.0007) +[2023-10-14 08:29:54,807][100917] Updated weights for policy 1, policy_version 81542 (0.0008) +[2023-10-14 08:29:55,196][100917] Updated weights for policy 1, policy_version 81552 (0.0007) +[2023-10-14 08:29:55,572][100917] Updated weights for policy 1, policy_version 81562 (0.0010) +[2023-10-14 08:29:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 166920192. Throughput: 0: 1659.5, 1: 1655.2. Samples: 41738064. Policy #0 lag: (min: 25.0, avg: 36.5, max: 57.0) +[2023-10-14 08:29:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:29:58,710][100936] Updated weights for policy 0, policy_version 81450 (0.0008) +[2023-10-14 08:29:59,080][100936] Updated weights for policy 0, policy_version 81460 (0.0007) +[2023-10-14 08:29:59,446][100936] Updated weights for policy 0, policy_version 81470 (0.0007) +[2023-10-14 08:29:59,624][100917] Updated weights for policy 1, policy_version 81572 (0.0008) +[2023-10-14 08:30:00,000][100917] Updated weights for policy 1, policy_version 81582 (0.0007) +[2023-10-14 08:30:00,368][100917] Updated weights for policy 1, policy_version 81592 (0.0007) +[2023-10-14 08:30:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 166985728. Throughput: 0: 1662.2, 1: 1665.1. Samples: 41758554. Policy #0 lag: (min: 25.0, avg: 36.5, max: 57.0) +[2023-10-14 08:30:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:30:03,581][100936] Updated weights for policy 0, policy_version 81480 (0.0007) +[2023-10-14 08:30:03,943][100936] Updated weights for policy 0, policy_version 81490 (0.0008) +[2023-10-14 08:30:04,320][100936] Updated weights for policy 0, policy_version 81500 (0.0009) +[2023-10-14 08:30:04,536][100917] Updated weights for policy 1, policy_version 81602 (0.0007) +[2023-10-14 08:30:04,913][100917] Updated weights for policy 1, policy_version 81612 (0.0008) +[2023-10-14 08:30:05,285][100917] Updated weights for policy 1, policy_version 81622 (0.0007) +[2023-10-14 08:30:05,666][100917] Updated weights for policy 1, policy_version 81632 (0.0010) +[2023-10-14 08:30:08,292][100936] Updated weights for policy 0, policy_version 81510 (0.0008) +[2023-10-14 08:30:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 167051264. Throughput: 0: 1655.7, 1: 1661.1. Samples: 41778890. Policy #0 lag: (min: 25.0, avg: 36.5, max: 57.0) +[2023-10-14 08:30:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.880')] +[2023-10-14 08:30:08,668][100936] Updated weights for policy 0, policy_version 81520 (0.0009) +[2023-10-14 08:30:09,025][100936] Updated weights for policy 0, policy_version 81530 (0.0008) +[2023-10-14 08:30:09,615][100917] Updated weights for policy 1, policy_version 81642 (0.0008) +[2023-10-14 08:30:09,986][100917] Updated weights for policy 1, policy_version 81652 (0.0011) +[2023-10-14 08:30:10,362][100917] Updated weights for policy 1, policy_version 81662 (0.0010) +[2023-10-14 08:30:13,035][100936] Updated weights for policy 0, policy_version 81540 (0.0007) +[2023-10-14 08:30:13,398][100936] Updated weights for policy 0, policy_version 81550 (0.0007) +[2023-10-14 08:30:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 167116800. Throughput: 0: 1662.8, 1: 1658.6. Samples: 41788240. Policy #0 lag: (min: 25.0, avg: 36.5, max: 57.0) +[2023-10-14 08:30:13,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.880')] +[2023-10-14 08:30:13,764][100936] Updated weights for policy 0, policy_version 81560 (0.0008) +[2023-10-14 08:30:14,513][100917] Updated weights for policy 1, policy_version 81672 (0.0008) +[2023-10-14 08:30:14,894][100917] Updated weights for policy 1, policy_version 81682 (0.0007) +[2023-10-14 08:30:15,266][100917] Updated weights for policy 1, policy_version 81692 (0.0007) +[2023-10-14 08:30:17,903][100936] Updated weights for policy 0, policy_version 81570 (0.0007) +[2023-10-14 08:30:18,275][100936] Updated weights for policy 0, policy_version 81580 (0.0010) +[2023-10-14 08:30:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 167182336. Throughput: 0: 1667.2, 1: 1665.3. Samples: 41809078. Policy #0 lag: (min: 25.0, avg: 36.5, max: 57.0) +[2023-10-14 08:30:18,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.880')] +[2023-10-14 08:30:18,638][100936] Updated weights for policy 0, policy_version 81590 (0.0008) +[2023-10-14 08:30:19,008][100936] Updated weights for policy 0, policy_version 81600 (0.0009) +[2023-10-14 08:30:19,348][100917] Updated weights for policy 1, policy_version 81702 (0.0010) +[2023-10-14 08:30:19,712][100917] Updated weights for policy 1, policy_version 81712 (0.0009) +[2023-10-14 08:30:20,089][100917] Updated weights for policy 1, policy_version 81722 (0.0009) +[2023-10-14 08:30:23,063][100936] Updated weights for policy 0, policy_version 81610 (0.0009) +[2023-10-14 08:30:23,429][100936] Updated weights for policy 0, policy_version 81620 (0.0009) +[2023-10-14 08:30:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 167247872. Throughput: 0: 1651.7, 1: 1665.5. Samples: 41828848. Policy #0 lag: (min: 25.0, avg: 36.5, max: 57.0) +[2023-10-14 08:30:23,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.880')] +[2023-10-14 08:30:23,810][100936] Updated weights for policy 0, policy_version 81630 (0.0010) +[2023-10-14 08:30:24,140][100917] Updated weights for policy 1, policy_version 81732 (0.0008) +[2023-10-14 08:30:24,510][100917] Updated weights for policy 1, policy_version 81742 (0.0010) +[2023-10-14 08:30:24,889][100917] Updated weights for policy 1, policy_version 81752 (0.0009) +[2023-10-14 08:30:27,928][100936] Updated weights for policy 0, policy_version 81640 (0.0009) +[2023-10-14 08:30:28,300][100936] Updated weights for policy 0, policy_version 81650 (0.0007) +[2023-10-14 08:30:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 167313408. Throughput: 0: 1667.2, 1: 1664.4. Samples: 41838456. Policy #0 lag: (min: 25.0, avg: 36.5, max: 57.0) +[2023-10-14 08:30:28,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.880')] +[2023-10-14 08:30:28,669][100936] Updated weights for policy 0, policy_version 81660 (0.0008) +[2023-10-14 08:30:29,069][100917] Updated weights for policy 1, policy_version 81762 (0.0011) +[2023-10-14 08:30:29,431][100917] Updated weights for policy 1, policy_version 81772 (0.0009) +[2023-10-14 08:30:29,808][100917] Updated weights for policy 1, policy_version 81782 (0.0009) +[2023-10-14 08:30:30,183][100917] Updated weights for policy 1, policy_version 81792 (0.0009) +[2023-10-14 08:30:32,896][100936] Updated weights for policy 0, policy_version 81670 (0.0007) +[2023-10-14 08:30:33,263][100936] Updated weights for policy 0, policy_version 81680 (0.0008) +[2023-10-14 08:30:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 167378944. Throughput: 0: 1662.7, 1: 1672.9. Samples: 41858894. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:30:33,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.880')] +[2023-10-14 08:30:33,636][100936] Updated weights for policy 0, policy_version 81690 (0.0008) +[2023-10-14 08:30:34,234][100917] Updated weights for policy 1, policy_version 81802 (0.0008) +[2023-10-14 08:30:34,612][100917] Updated weights for policy 1, policy_version 81812 (0.0008) +[2023-10-14 08:30:34,981][100917] Updated weights for policy 1, policy_version 81822 (0.0007) +[2023-10-14 08:30:37,952][100936] Updated weights for policy 0, policy_version 81700 (0.0009) +[2023-10-14 08:30:38,321][100936] Updated weights for policy 0, policy_version 81710 (0.0009) +[2023-10-14 08:30:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 167444480. Throughput: 0: 1648.3, 1: 1674.4. Samples: 41878536. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:30:38,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.880')] +[2023-10-14 08:30:38,695][100936] Updated weights for policy 0, policy_version 81720 (0.0010) +[2023-10-14 08:30:38,947][100917] Updated weights for policy 1, policy_version 81832 (0.0009) +[2023-10-14 08:30:39,306][100917] Updated weights for policy 1, policy_version 81842 (0.0010) +[2023-10-14 08:30:39,678][100917] Updated weights for policy 1, policy_version 81852 (0.0009) +[2023-10-14 08:30:42,787][100936] Updated weights for policy 0, policy_version 81730 (0.0008) +[2023-10-14 08:30:43,158][100936] Updated weights for policy 0, policy_version 81740 (0.0007) +[2023-10-14 08:30:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 167510016. Throughput: 0: 1662.8, 1: 1672.2. Samples: 41888140. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:30:43,512][99942] Avg episode reward: [(0, '0.990'), (1, '0.880')] +[2023-10-14 08:30:43,530][100936] Updated weights for policy 0, policy_version 81750 (0.0007) +[2023-10-14 08:30:43,897][100936] Updated weights for policy 0, policy_version 81760 (0.0007) +[2023-10-14 08:30:43,956][100917] Updated weights for policy 1, policy_version 81862 (0.0008) +[2023-10-14 08:30:44,332][100917] Updated weights for policy 1, policy_version 81872 (0.0009) +[2023-10-14 08:30:44,702][100917] Updated weights for policy 1, policy_version 81882 (0.0009) +[2023-10-14 08:30:48,134][100936] Updated weights for policy 0, policy_version 81770 (0.0007) +[2023-10-14 08:30:48,510][100936] Updated weights for policy 0, policy_version 81780 (0.0008) +[2023-10-14 08:30:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 167575552. Throughput: 0: 1661.6, 1: 1663.9. Samples: 41908200. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:30:48,512][99942] Avg episode reward: [(0, '0.800'), (1, '0.880')] +[2023-10-14 08:30:48,877][100936] Updated weights for policy 0, policy_version 81790 (0.0007) +[2023-10-14 08:30:48,879][100917] Updated weights for policy 1, policy_version 81892 (0.0009) +[2023-10-14 08:30:49,241][100917] Updated weights for policy 1, policy_version 81902 (0.0008) +[2023-10-14 08:30:49,613][100917] Updated weights for policy 1, policy_version 81912 (0.0009) +[2023-10-14 08:30:52,887][100936] Updated weights for policy 0, policy_version 81800 (0.0008) +[2023-10-14 08:30:53,265][100936] Updated weights for policy 0, policy_version 81810 (0.0007) +[2023-10-14 08:30:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 167641088. Throughput: 0: 1647.2, 1: 1661.2. Samples: 41927770. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:30:53,512][99942] Avg episode reward: [(0, '0.800'), (1, '0.880')] +[2023-10-14 08:30:53,637][100936] Updated weights for policy 0, policy_version 81820 (0.0008) +[2023-10-14 08:30:53,714][100917] Updated weights for policy 1, policy_version 81922 (0.0009) +[2023-10-14 08:30:54,089][100917] Updated weights for policy 1, policy_version 81932 (0.0009) +[2023-10-14 08:30:54,463][100917] Updated weights for policy 1, policy_version 81942 (0.0007) +[2023-10-14 08:30:54,845][100917] Updated weights for policy 1, policy_version 81952 (0.0009) +[2023-10-14 08:30:58,016][100936] Updated weights for policy 0, policy_version 81830 (0.0008) +[2023-10-14 08:30:58,382][100936] Updated weights for policy 0, policy_version 81840 (0.0009) +[2023-10-14 08:30:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 167706624. Throughput: 0: 1654.2, 1: 1658.5. Samples: 41937310. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:30:58,512][99942] Avg episode reward: [(0, '0.800'), (1, '0.880')] +[2023-10-14 08:30:58,751][100936] Updated weights for policy 0, policy_version 81850 (0.0009) +[2023-10-14 08:30:59,111][100917] Updated weights for policy 1, policy_version 81962 (0.0008) +[2023-10-14 08:30:59,486][100917] Updated weights for policy 1, policy_version 81972 (0.0008) +[2023-10-14 08:30:59,857][100917] Updated weights for policy 1, policy_version 81982 (0.0009) +[2023-10-14 08:31:03,021][100936] Updated weights for policy 0, policy_version 81860 (0.0008) +[2023-10-14 08:31:03,393][100936] Updated weights for policy 0, policy_version 81870 (0.0007) +[2023-10-14 08:31:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13107.2). Total num frames: 167772160. Throughput: 0: 1647.1, 1: 1649.2. Samples: 41957410. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:31:03,513][99942] Avg episode reward: [(0, '0.800'), (1, '0.880')] +[2023-10-14 08:31:03,763][100936] Updated weights for policy 0, policy_version 81880 (0.0008) +[2023-10-14 08:31:04,047][100917] Updated weights for policy 1, policy_version 81992 (0.0008) +[2023-10-14 08:31:04,421][100917] Updated weights for policy 1, policy_version 82002 (0.0007) +[2023-10-14 08:31:04,808][100917] Updated weights for policy 1, policy_version 82012 (0.0007) +[2023-10-14 08:31:07,739][100936] Updated weights for policy 0, policy_version 81890 (0.0009) +[2023-10-14 08:31:08,153][100936] Updated weights for policy 0, policy_version 81900 (0.0011) +[2023-10-14 08:31:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 167837696. Throughput: 0: 1647.8, 1: 1645.8. Samples: 41977058. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:31:08,513][99942] Avg episode reward: [(0, '0.800'), (1, '0.880')] +[2023-10-14 08:31:08,529][100936] Updated weights for policy 0, policy_version 81910 (0.0010) +[2023-10-14 08:31:08,838][100917] Updated weights for policy 1, policy_version 82022 (0.0007) +[2023-10-14 08:31:08,894][100936] Updated weights for policy 0, policy_version 81920 (0.0007) +[2023-10-14 08:31:09,207][100917] Updated weights for policy 1, policy_version 82032 (0.0008) +[2023-10-14 08:31:09,585][100917] Updated weights for policy 1, policy_version 82042 (0.0008) +[2023-10-14 08:31:13,103][100936] Updated weights for policy 0, policy_version 81930 (0.0008) +[2023-10-14 08:31:13,474][100936] Updated weights for policy 0, policy_version 81940 (0.0011) +[2023-10-14 08:31:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 167903232. Throughput: 0: 1647.7, 1: 1647.2. Samples: 41986728. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:31:13,512][99942] Avg episode reward: [(0, '0.800'), (1, '0.880')] +[2023-10-14 08:31:13,763][100917] Updated weights for policy 1, policy_version 82052 (0.0010) +[2023-10-14 08:31:13,839][100936] Updated weights for policy 0, policy_version 81950 (0.0009) +[2023-10-14 08:31:14,136][100917] Updated weights for policy 1, policy_version 82062 (0.0008) +[2023-10-14 08:31:14,509][100917] Updated weights for policy 1, policy_version 82072 (0.0009) +[2023-10-14 08:31:17,778][100936] Updated weights for policy 0, policy_version 81960 (0.0008) +[2023-10-14 08:31:18,137][100936] Updated weights for policy 0, policy_version 81970 (0.0008) +[2023-10-14 08:31:18,500][100936] Updated weights for policy 0, policy_version 81980 (0.0009) +[2023-10-14 08:31:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 167968768. Throughput: 0: 1647.2, 1: 1642.4. Samples: 42006928. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:31:18,513][99942] Avg episode reward: [(0, '0.800'), (1, '0.880')] +[2023-10-14 08:31:18,649][100917] Updated weights for policy 1, policy_version 82082 (0.0009) +[2023-10-14 08:31:19,017][100917] Updated weights for policy 1, policy_version 82092 (0.0009) +[2023-10-14 08:31:19,384][100917] Updated weights for policy 1, policy_version 82102 (0.0007) +[2023-10-14 08:31:19,754][100917] Updated weights for policy 1, policy_version 82112 (0.0008) +[2023-10-14 08:31:22,613][100936] Updated weights for policy 0, policy_version 81990 (0.0009) +[2023-10-14 08:31:22,988][100936] Updated weights for policy 0, policy_version 82000 (0.0007) +[2023-10-14 08:31:23,356][100936] Updated weights for policy 0, policy_version 82010 (0.0007) +[2023-10-14 08:31:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 168034304. Throughput: 0: 1645.8, 1: 1644.4. Samples: 42026598. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:31:23,512][99942] Avg episode reward: [(0, '0.800'), (1, '0.880')] +[2023-10-14 08:31:23,797][100917] Updated weights for policy 1, policy_version 82122 (0.0009) +[2023-10-14 08:31:24,161][100917] Updated weights for policy 1, policy_version 82132 (0.0010) +[2023-10-14 08:31:24,537][100917] Updated weights for policy 1, policy_version 82142 (0.0011) +[2023-10-14 08:31:27,490][100936] Updated weights for policy 0, policy_version 82020 (0.0008) +[2023-10-14 08:31:27,862][100936] Updated weights for policy 0, policy_version 82030 (0.0009) +[2023-10-14 08:31:28,216][100936] Updated weights for policy 0, policy_version 82040 (0.0008) +[2023-10-14 08:31:28,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 168132608. Throughput: 0: 1654.9, 1: 1646.8. Samples: 42036720. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:31:28,512][99942] Avg episode reward: [(0, '0.800'), (1, '1.000')] +[2023-10-14 08:31:28,872][100917] Updated weights for policy 1, policy_version 82152 (0.0008) +[2023-10-14 08:31:29,235][100917] Updated weights for policy 1, policy_version 82162 (0.0009) +[2023-10-14 08:31:29,615][100917] Updated weights for policy 1, policy_version 82172 (0.0010) +[2023-10-14 08:31:32,495][100936] Updated weights for policy 0, policy_version 82050 (0.0008) +[2023-10-14 08:31:32,861][100936] Updated weights for policy 0, policy_version 82060 (0.0008) +[2023-10-14 08:31:33,223][100936] Updated weights for policy 0, policy_version 82070 (0.0008) +[2023-10-14 08:31:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 168165376. Throughput: 0: 1655.1, 1: 1655.4. Samples: 42057174. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:31:33,512][99942] Avg episode reward: [(0, '0.800'), (1, '1.000')] +[2023-10-14 08:31:33,538][100917] Updated weights for policy 1, policy_version 82182 (0.0010) +[2023-10-14 08:31:33,596][100936] Updated weights for policy 0, policy_version 82080 (0.0007) +[2023-10-14 08:31:33,910][100917] Updated weights for policy 1, policy_version 82192 (0.0010) +[2023-10-14 08:31:34,268][100917] Updated weights for policy 1, policy_version 82202 (0.0007) +[2023-10-14 08:31:37,757][100936] Updated weights for policy 0, policy_version 82090 (0.0010) +[2023-10-14 08:31:38,123][100936] Updated weights for policy 0, policy_version 82100 (0.0007) +[2023-10-14 08:31:38,489][100936] Updated weights for policy 0, policy_version 82110 (0.0008) +[2023-10-14 08:31:38,512][99942] Fps is (10 sec: 9830.1, 60 sec: 13107.1, 300 sec: 13107.2). Total num frames: 168230912. Throughput: 0: 1650.5, 1: 1656.5. Samples: 42076588. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:31:38,513][100917] Updated weights for policy 1, policy_version 82212 (0.0008) +[2023-10-14 08:31:38,513][99942] Avg episode reward: [(0, '0.800'), (1, '1.000')] +[2023-10-14 08:31:38,559][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000082112_84082688.pth... +[2023-10-14 08:31:38,593][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000080544_82477056.pth +[2023-10-14 08:31:38,894][100917] Updated weights for policy 1, policy_version 82222 (0.0007) +[2023-10-14 08:31:39,266][100917] Updated weights for policy 1, policy_version 82232 (0.0007) +[2023-10-14 08:31:39,562][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000082240_84213760.pth... +[2023-10-14 08:31:39,592][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000080672_82608128.pth +[2023-10-14 08:31:42,628][100936] Updated weights for policy 0, policy_version 82120 (0.0009) +[2023-10-14 08:31:42,994][100936] Updated weights for policy 0, policy_version 82130 (0.0009) +[2023-10-14 08:31:43,359][100936] Updated weights for policy 0, policy_version 82140 (0.0009) +[2023-10-14 08:31:43,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 168329216. Throughput: 0: 1654.0, 1: 1659.7. Samples: 42086428. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:31:43,512][99942] Avg episode reward: [(0, '0.800'), (1, '1.000')] +[2023-10-14 08:31:43,529][100917] Updated weights for policy 1, policy_version 82242 (0.0008) +[2023-10-14 08:31:43,890][100917] Updated weights for policy 1, policy_version 82252 (0.0010) +[2023-10-14 08:31:44,261][100917] Updated weights for policy 1, policy_version 82262 (0.0010) +[2023-10-14 08:31:44,630][100917] Updated weights for policy 1, policy_version 82272 (0.0010) +[2023-10-14 08:31:47,401][100936] Updated weights for policy 0, policy_version 82150 (0.0009) +[2023-10-14 08:31:47,770][100936] Updated weights for policy 0, policy_version 82160 (0.0010) +[2023-10-14 08:31:48,145][100936] Updated weights for policy 0, policy_version 82170 (0.0010) +[2023-10-14 08:31:48,512][99942] Fps is (10 sec: 16384.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 168394752. Throughput: 0: 1654.8, 1: 1663.5. Samples: 42106734. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:31:48,513][99942] Avg episode reward: [(0, '0.710'), (1, '1.000')] +[2023-10-14 08:31:48,740][100917] Updated weights for policy 1, policy_version 82282 (0.0008) +[2023-10-14 08:31:49,109][100917] Updated weights for policy 1, policy_version 82292 (0.0009) +[2023-10-14 08:31:49,490][100917] Updated weights for policy 1, policy_version 82302 (0.0010) +[2023-10-14 08:31:52,191][100936] Updated weights for policy 0, policy_version 82180 (0.0009) +[2023-10-14 08:31:52,565][100936] Updated weights for policy 0, policy_version 82190 (0.0011) +[2023-10-14 08:31:52,929][100936] Updated weights for policy 0, policy_version 82200 (0.0009) +[2023-10-14 08:31:53,484][100917] Updated weights for policy 1, policy_version 82312 (0.0008) +[2023-10-14 08:31:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 168460288. Throughput: 0: 1648.4, 1: 1665.2. Samples: 42126168. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:31:53,513][99942] Avg episode reward: [(0, '0.710'), (1, '1.000')] +[2023-10-14 08:31:53,854][100917] Updated weights for policy 1, policy_version 82322 (0.0009) +[2023-10-14 08:31:54,225][100917] Updated weights for policy 1, policy_version 82332 (0.0010) +[2023-10-14 08:31:57,213][100936] Updated weights for policy 0, policy_version 82210 (0.0007) +[2023-10-14 08:31:57,615][100936] Updated weights for policy 0, policy_version 82220 (0.0010) +[2023-10-14 08:31:57,979][100936] Updated weights for policy 0, policy_version 82230 (0.0011) +[2023-10-14 08:31:58,345][100936] Updated weights for policy 0, policy_version 82240 (0.0010) +[2023-10-14 08:31:58,507][100917] Updated weights for policy 1, policy_version 82342 (0.0009) +[2023-10-14 08:31:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 168525824. Throughput: 0: 1658.9, 1: 1663.2. Samples: 42136224. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:31:58,512][99942] Avg episode reward: [(0, '0.710'), (1, '1.000')] +[2023-10-14 08:31:58,881][100917] Updated weights for policy 1, policy_version 82352 (0.0010) +[2023-10-14 08:31:59,255][100917] Updated weights for policy 1, policy_version 82362 (0.0007) +[2023-10-14 08:32:02,544][100936] Updated weights for policy 0, policy_version 82250 (0.0008) +[2023-10-14 08:32:02,927][100936] Updated weights for policy 0, policy_version 82260 (0.0008) +[2023-10-14 08:32:03,296][100936] Updated weights for policy 0, policy_version 82270 (0.0007) +[2023-10-14 08:32:03,347][100917] Updated weights for policy 1, policy_version 82372 (0.0007) +[2023-10-14 08:32:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 168591360. Throughput: 0: 1649.2, 1: 1663.8. Samples: 42156012. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:32:03,512][99942] Avg episode reward: [(0, '0.710'), (1, '1.000')] +[2023-10-14 08:32:03,719][100917] Updated weights for policy 1, policy_version 82382 (0.0008) +[2023-10-14 08:32:04,092][100917] Updated weights for policy 1, policy_version 82392 (0.0009) +[2023-10-14 08:32:07,586][100936] Updated weights for policy 0, policy_version 82280 (0.0009) +[2023-10-14 08:32:07,962][100936] Updated weights for policy 0, policy_version 82290 (0.0008) +[2023-10-14 08:32:08,054][100917] Updated weights for policy 1, policy_version 82402 (0.0008) +[2023-10-14 08:32:08,332][100936] Updated weights for policy 0, policy_version 82300 (0.0009) +[2023-10-14 08:32:08,416][100917] Updated weights for policy 1, policy_version 82412 (0.0009) +[2023-10-14 08:32:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 168656896. Throughput: 0: 1645.1, 1: 1665.8. Samples: 42175590. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:32:08,513][99942] Avg episode reward: [(0, '0.710'), (1, '1.000')] +[2023-10-14 08:32:08,786][100917] Updated weights for policy 1, policy_version 82422 (0.0008) +[2023-10-14 08:32:09,166][100917] Updated weights for policy 1, policy_version 82432 (0.0008) +[2023-10-14 08:32:12,433][100936] Updated weights for policy 0, policy_version 82310 (0.0010) +[2023-10-14 08:32:12,800][100936] Updated weights for policy 0, policy_version 82320 (0.0008) +[2023-10-14 08:32:13,179][100936] Updated weights for policy 0, policy_version 82330 (0.0007) +[2023-10-14 08:32:13,326][100917] Updated weights for policy 1, policy_version 82442 (0.0007) +[2023-10-14 08:32:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 168722432. Throughput: 0: 1644.7, 1: 1663.9. Samples: 42185606. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) +[2023-10-14 08:32:13,512][99942] Avg episode reward: [(0, '0.710'), (1, '1.000')] +[2023-10-14 08:32:13,692][100917] Updated weights for policy 1, policy_version 82452 (0.0009) +[2023-10-14 08:32:14,062][100917] Updated weights for policy 1, policy_version 82462 (0.0009) +[2023-10-14 08:32:17,206][100936] Updated weights for policy 0, policy_version 82340 (0.0009) +[2023-10-14 08:32:17,582][100936] Updated weights for policy 0, policy_version 82350 (0.0010) +[2023-10-14 08:32:17,947][100936] Updated weights for policy 0, policy_version 82360 (0.0009) +[2023-10-14 08:32:18,254][100917] Updated weights for policy 1, policy_version 82472 (0.0007) +[2023-10-14 08:32:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 168787968. Throughput: 0: 1635.8, 1: 1660.6. Samples: 42205510. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 08:32:18,513][99942] Avg episode reward: [(0, '0.710'), (1, '1.000')] +[2023-10-14 08:32:18,619][100917] Updated weights for policy 1, policy_version 82482 (0.0007) +[2023-10-14 08:32:19,000][100917] Updated weights for policy 1, policy_version 82492 (0.0008) +[2023-10-14 08:32:22,041][100936] Updated weights for policy 0, policy_version 82370 (0.0007) +[2023-10-14 08:32:22,411][100936] Updated weights for policy 0, policy_version 82380 (0.0007) +[2023-10-14 08:32:22,783][100936] Updated weights for policy 0, policy_version 82390 (0.0007) +[2023-10-14 08:32:23,155][100936] Updated weights for policy 0, policy_version 82400 (0.0007) +[2023-10-14 08:32:23,157][100917] Updated weights for policy 1, policy_version 82502 (0.0008) +[2023-10-14 08:32:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 168853504. Throughput: 0: 1639.8, 1: 1658.3. Samples: 42225000. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 08:32:23,512][99942] Avg episode reward: [(0, '0.710'), (1, '0.990')] +[2023-10-14 08:32:23,521][100917] Updated weights for policy 1, policy_version 82512 (0.0010) +[2023-10-14 08:32:23,892][100917] Updated weights for policy 1, policy_version 82522 (0.0008) +[2023-10-14 08:32:27,424][100936] Updated weights for policy 0, policy_version 82410 (0.0009) +[2023-10-14 08:32:27,791][100936] Updated weights for policy 0, policy_version 82420 (0.0009) +[2023-10-14 08:32:27,967][100917] Updated weights for policy 1, policy_version 82532 (0.0007) +[2023-10-14 08:32:28,164][100936] Updated weights for policy 0, policy_version 82430 (0.0008) +[2023-10-14 08:32:28,337][100917] Updated weights for policy 1, policy_version 82542 (0.0008) +[2023-10-14 08:32:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 168919040. Throughput: 0: 1647.0, 1: 1655.5. Samples: 42235040. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 08:32:28,512][99942] Avg episode reward: [(0, '0.710'), (1, '0.990')] +[2023-10-14 08:32:28,717][100917] Updated weights for policy 1, policy_version 82552 (0.0008) +[2023-10-14 08:32:32,421][100936] Updated weights for policy 0, policy_version 82440 (0.0009) +[2023-10-14 08:32:32,763][100917] Updated weights for policy 1, policy_version 82562 (0.0009) +[2023-10-14 08:32:32,786][100936] Updated weights for policy 0, policy_version 82450 (0.0010) +[2023-10-14 08:32:33,138][100917] Updated weights for policy 1, policy_version 82572 (0.0008) +[2023-10-14 08:32:33,159][100936] Updated weights for policy 0, policy_version 82460 (0.0009) +[2023-10-14 08:32:33,504][100917] Updated weights for policy 1, policy_version 82582 (0.0008) +[2023-10-14 08:32:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 168984576. Throughput: 0: 1640.0, 1: 1658.0. Samples: 42255146. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 08:32:33,515][99942] Avg episode reward: [(0, '0.710'), (1, '0.990')] +[2023-10-14 08:32:33,882][100917] Updated weights for policy 1, policy_version 82592 (0.0007) +[2023-10-14 08:32:37,250][100936] Updated weights for policy 0, policy_version 82470 (0.0009) +[2023-10-14 08:32:37,617][100936] Updated weights for policy 0, policy_version 82480 (0.0008) +[2023-10-14 08:32:37,982][100936] Updated weights for policy 0, policy_version 82490 (0.0008) +[2023-10-14 08:32:37,985][100917] Updated weights for policy 1, policy_version 82602 (0.0009) +[2023-10-14 08:32:38,349][100917] Updated weights for policy 1, policy_version 82612 (0.0011) +[2023-10-14 08:32:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 169050112. Throughput: 0: 1638.5, 1: 1654.0. Samples: 42274330. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 08:32:38,513][99942] Avg episode reward: [(0, '0.710'), (1, '0.990')] +[2023-10-14 08:32:38,722][100917] Updated weights for policy 1, policy_version 82622 (0.0010) +[2023-10-14 08:32:42,240][100936] Updated weights for policy 0, policy_version 82500 (0.0010) +[2023-10-14 08:32:42,634][100936] Updated weights for policy 0, policy_version 82510 (0.0009) +[2023-10-14 08:32:42,813][100917] Updated weights for policy 1, policy_version 82632 (0.0008) +[2023-10-14 08:32:42,991][100936] Updated weights for policy 0, policy_version 82520 (0.0007) +[2023-10-14 08:32:43,190][100917] Updated weights for policy 1, policy_version 82642 (0.0009) +[2023-10-14 08:32:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 169115648. Throughput: 0: 1642.0, 1: 1665.0. Samples: 42285038. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 08:32:43,513][99942] Avg episode reward: [(0, '0.710'), (1, '0.990')] +[2023-10-14 08:32:43,555][100917] Updated weights for policy 1, policy_version 82652 (0.0009) +[2023-10-14 08:32:47,112][100936] Updated weights for policy 0, policy_version 82530 (0.0008) +[2023-10-14 08:32:47,474][100936] Updated weights for policy 0, policy_version 82540 (0.0008) +[2023-10-14 08:32:47,640][100917] Updated weights for policy 1, policy_version 82662 (0.0010) +[2023-10-14 08:32:47,844][100936] Updated weights for policy 0, policy_version 82550 (0.0009) +[2023-10-14 08:32:48,003][100917] Updated weights for policy 1, policy_version 82672 (0.0008) +[2023-10-14 08:32:48,205][100936] Updated weights for policy 0, policy_version 82560 (0.0009) +[2023-10-14 08:32:48,383][100917] Updated weights for policy 1, policy_version 82682 (0.0008) +[2023-10-14 08:32:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 169181184. Throughput: 0: 1639.6, 1: 1666.8. Samples: 42304802. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 08:32:48,513][99942] Avg episode reward: [(0, '0.710'), (1, '0.990')] +[2023-10-14 08:32:52,270][100936] Updated weights for policy 0, policy_version 82570 (0.0009) +[2023-10-14 08:32:52,508][100917] Updated weights for policy 1, policy_version 82692 (0.0009) +[2023-10-14 08:32:52,627][100936] Updated weights for policy 0, policy_version 82580 (0.0010) +[2023-10-14 08:32:52,886][100917] Updated weights for policy 1, policy_version 82702 (0.0009) +[2023-10-14 08:32:52,992][100936] Updated weights for policy 0, policy_version 82590 (0.0008) +[2023-10-14 08:32:53,264][100917] Updated weights for policy 1, policy_version 82712 (0.0010) +[2023-10-14 08:32:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 169246720. Throughput: 0: 1641.6, 1: 1649.1. Samples: 42323668. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 08:32:53,512][99942] Avg episode reward: [(0, '0.710'), (1, '0.990')] +[2023-10-14 08:32:57,233][100936] Updated weights for policy 0, policy_version 82600 (0.0008) +[2023-10-14 08:32:57,377][100917] Updated weights for policy 1, policy_version 82722 (0.0010) +[2023-10-14 08:32:57,597][100936] Updated weights for policy 0, policy_version 82610 (0.0008) +[2023-10-14 08:32:57,746][100917] Updated weights for policy 1, policy_version 82732 (0.0009) +[2023-10-14 08:32:57,971][100936] Updated weights for policy 0, policy_version 82620 (0.0007) +[2023-10-14 08:32:58,120][100917] Updated weights for policy 1, policy_version 82742 (0.0010) +[2023-10-14 08:32:58,497][100917] Updated weights for policy 1, policy_version 82752 (0.0008) +[2023-10-14 08:32:58,512][99942] Fps is (10 sec: 16384.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 169345024. Throughput: 0: 1643.2, 1: 1665.1. Samples: 42334476. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 08:32:58,512][99942] Avg episode reward: [(0, '0.720'), (1, '0.990')] +[2023-10-14 08:33:02,111][100936] Updated weights for policy 0, policy_version 82630 (0.0009) +[2023-10-14 08:33:02,475][100936] Updated weights for policy 0, policy_version 82640 (0.0007) +[2023-10-14 08:33:02,505][100917] Updated weights for policy 1, policy_version 82762 (0.0009) +[2023-10-14 08:33:02,849][100936] Updated weights for policy 0, policy_version 82650 (0.0007) +[2023-10-14 08:33:02,890][100917] Updated weights for policy 1, policy_version 82772 (0.0010) +[2023-10-14 08:33:03,258][100917] Updated weights for policy 1, policy_version 82782 (0.0008) +[2023-10-14 08:33:03,512][99942] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 169410560. Throughput: 0: 1643.0, 1: 1668.0. Samples: 42354504. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 08:33:03,512][99942] Avg episode reward: [(0, '0.720'), (1, '0.990')] +[2023-10-14 08:33:07,019][100936] Updated weights for policy 0, policy_version 82660 (0.0008) +[2023-10-14 08:33:07,399][100936] Updated weights for policy 0, policy_version 82670 (0.0009) +[2023-10-14 08:33:07,435][100917] Updated weights for policy 1, policy_version 82792 (0.0007) +[2023-10-14 08:33:07,761][100936] Updated weights for policy 0, policy_version 82680 (0.0008) +[2023-10-14 08:33:07,804][100917] Updated weights for policy 1, policy_version 82802 (0.0008) +[2023-10-14 08:33:08,180][100917] Updated weights for policy 1, policy_version 82812 (0.0009) +[2023-10-14 08:33:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 169476096. Throughput: 0: 1644.9, 1: 1651.0. Samples: 42373314. Policy #0 lag: (min: 26.0, avg: 29.2, max: 58.0) +[2023-10-14 08:33:08,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.990')] +[2023-10-14 08:33:11,887][100936] Updated weights for policy 0, policy_version 82690 (0.0009) +[2023-10-14 08:33:12,245][100936] Updated weights for policy 0, policy_version 82700 (0.0008) +[2023-10-14 08:33:12,393][100917] Updated weights for policy 1, policy_version 82822 (0.0010) +[2023-10-14 08:33:12,619][100936] Updated weights for policy 0, policy_version 82710 (0.0007) +[2023-10-14 08:33:12,767][100917] Updated weights for policy 1, policy_version 82832 (0.0009) +[2023-10-14 08:33:12,981][100936] Updated weights for policy 0, policy_version 82720 (0.0007) +[2023-10-14 08:33:13,133][100917] Updated weights for policy 1, policy_version 82842 (0.0008) +[2023-10-14 08:33:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 169541632. Throughput: 0: 1647.2, 1: 1670.5. Samples: 42384338. Policy #0 lag: (min: 0.0, avg: 25.0, max: 32.0) +[2023-10-14 08:33:13,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.990')] +[2023-10-14 08:33:17,108][100936] Updated weights for policy 0, policy_version 82730 (0.0010) +[2023-10-14 08:33:17,271][100917] Updated weights for policy 1, policy_version 82852 (0.0009) +[2023-10-14 08:33:17,471][100936] Updated weights for policy 0, policy_version 82740 (0.0007) +[2023-10-14 08:33:17,638][100917] Updated weights for policy 1, policy_version 82862 (0.0009) +[2023-10-14 08:33:17,836][100936] Updated weights for policy 0, policy_version 82750 (0.0007) +[2023-10-14 08:33:18,009][100917] Updated weights for policy 1, policy_version 82872 (0.0009) +[2023-10-14 08:33:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 169607168. Throughput: 0: 1642.6, 1: 1664.1. Samples: 42403948. Policy #0 lag: (min: 0.0, avg: 25.0, max: 32.0) +[2023-10-14 08:33:18,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.990')] +[2023-10-14 08:33:21,959][100917] Updated weights for policy 1, policy_version 82882 (0.0007) +[2023-10-14 08:33:22,066][100936] Updated weights for policy 0, policy_version 82760 (0.0008) +[2023-10-14 08:33:22,335][100917] Updated weights for policy 1, policy_version 82892 (0.0008) +[2023-10-14 08:33:22,439][100936] Updated weights for policy 0, policy_version 82770 (0.0010) +[2023-10-14 08:33:22,702][100917] Updated weights for policy 1, policy_version 82902 (0.0007) +[2023-10-14 08:33:22,805][100936] Updated weights for policy 0, policy_version 82780 (0.0008) +[2023-10-14 08:33:23,076][100917] Updated weights for policy 1, policy_version 82912 (0.0007) +[2023-10-14 08:33:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 169672704. Throughput: 0: 1654.7, 1: 1648.8. Samples: 42422988. Policy #0 lag: (min: 0.0, avg: 25.0, max: 32.0) +[2023-10-14 08:33:23,513][99942] Avg episode reward: [(0, '0.720'), (1, '0.990')] +[2023-10-14 08:33:26,901][100936] Updated weights for policy 0, policy_version 82790 (0.0009) +[2023-10-14 08:33:27,268][100917] Updated weights for policy 1, policy_version 82922 (0.0008) +[2023-10-14 08:33:27,277][100936] Updated weights for policy 0, policy_version 82800 (0.0008) +[2023-10-14 08:33:27,644][100917] Updated weights for policy 1, policy_version 82932 (0.0008) +[2023-10-14 08:33:27,654][100936] Updated weights for policy 0, policy_version 82810 (0.0009) +[2023-10-14 08:33:28,008][100917] Updated weights for policy 1, policy_version 82942 (0.0008) +[2023-10-14 08:33:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 169738240. Throughput: 0: 1653.1, 1: 1662.4. Samples: 42434232. Policy #0 lag: (min: 0.0, avg: 25.0, max: 32.0) +[2023-10-14 08:33:28,512][99942] Avg episode reward: [(0, '0.910'), (1, '0.680')] +[2023-10-14 08:33:31,598][100936] Updated weights for policy 0, policy_version 82820 (0.0010) +[2023-10-14 08:33:31,971][100936] Updated weights for policy 0, policy_version 82830 (0.0008) +[2023-10-14 08:33:32,200][100917] Updated weights for policy 1, policy_version 82952 (0.0009) +[2023-10-14 08:33:32,336][100936] Updated weights for policy 0, policy_version 82840 (0.0008) +[2023-10-14 08:33:32,580][100917] Updated weights for policy 1, policy_version 82962 (0.0009) +[2023-10-14 08:33:32,959][100917] Updated weights for policy 1, policy_version 82972 (0.0007) +[2023-10-14 08:33:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 169803776. Throughput: 0: 1648.5, 1: 1658.5. Samples: 42453618. Policy #0 lag: (min: 0.0, avg: 25.0, max: 32.0) +[2023-10-14 08:33:33,512][99942] Avg episode reward: [(0, '0.910'), (1, '0.680')] +[2023-10-14 08:33:36,423][100936] Updated weights for policy 0, policy_version 82850 (0.0008) +[2023-10-14 08:33:36,798][100936] Updated weights for policy 0, policy_version 82860 (0.0009) +[2023-10-14 08:33:37,100][100917] Updated weights for policy 1, policy_version 82982 (0.0008) +[2023-10-14 08:33:37,164][100936] Updated weights for policy 0, policy_version 82870 (0.0008) +[2023-10-14 08:33:37,463][100917] Updated weights for policy 1, policy_version 82992 (0.0008) +[2023-10-14 08:33:37,537][100936] Updated weights for policy 0, policy_version 82880 (0.0007) +[2023-10-14 08:33:37,835][100917] Updated weights for policy 1, policy_version 83002 (0.0009) +[2023-10-14 08:33:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 169869312. Throughput: 0: 1666.4, 1: 1647.9. Samples: 42472816. Policy #0 lag: (min: 0.0, avg: 25.0, max: 32.0) +[2023-10-14 08:33:38,513][99942] Avg episode reward: [(0, '0.850'), (1, '0.680')] +[2023-10-14 08:33:38,524][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000082880_84869120.pth... +[2023-10-14 08:33:38,524][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000083008_85000192.pth... +[2023-10-14 08:33:38,557][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000081440_83394560.pth +[2023-10-14 08:33:38,561][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000081344_83296256.pth +[2023-10-14 08:33:41,573][100936] Updated weights for policy 0, policy_version 82890 (0.0008) +[2023-10-14 08:33:41,940][100936] Updated weights for policy 0, policy_version 82900 (0.0008) +[2023-10-14 08:33:42,226][100917] Updated weights for policy 1, policy_version 83012 (0.0008) +[2023-10-14 08:33:42,318][100936] Updated weights for policy 0, policy_version 82910 (0.0008) +[2023-10-14 08:33:42,598][100917] Updated weights for policy 1, policy_version 83022 (0.0008) +[2023-10-14 08:33:42,970][100917] Updated weights for policy 1, policy_version 83032 (0.0010) +[2023-10-14 08:33:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 169934848. Throughput: 0: 1661.7, 1: 1656.6. Samples: 42483800. Policy #0 lag: (min: 0.0, avg: 25.0, max: 32.0) +[2023-10-14 08:33:43,512][99942] Avg episode reward: [(0, '0.850'), (1, '0.680')] +[2023-10-14 08:33:46,390][100936] Updated weights for policy 0, policy_version 82920 (0.0008) +[2023-10-14 08:33:46,752][100936] Updated weights for policy 0, policy_version 82930 (0.0011) +[2023-10-14 08:33:47,110][100936] Updated weights for policy 0, policy_version 82940 (0.0010) +[2023-10-14 08:33:47,318][100917] Updated weights for policy 1, policy_version 83042 (0.0009) +[2023-10-14 08:33:47,734][100917] Updated weights for policy 1, policy_version 83052 (0.0009) +[2023-10-14 08:33:48,103][100917] Updated weights for policy 1, policy_version 83062 (0.0009) +[2023-10-14 08:33:48,477][100917] Updated weights for policy 1, policy_version 83072 (0.0010) +[2023-10-14 08:33:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 170000384. Throughput: 0: 1649.2, 1: 1653.2. Samples: 42503110. Policy #0 lag: (min: 0.0, avg: 25.0, max: 32.0) +[2023-10-14 08:33:48,512][99942] Avg episode reward: [(0, '0.850'), (1, '0.680')] +[2023-10-14 08:33:51,359][100936] Updated weights for policy 0, policy_version 82950 (0.0008) +[2023-10-14 08:33:51,734][100936] Updated weights for policy 0, policy_version 82960 (0.0007) +[2023-10-14 08:33:52,095][100936] Updated weights for policy 0, policy_version 82970 (0.0008) +[2023-10-14 08:33:52,352][100917] Updated weights for policy 1, policy_version 83082 (0.0008) +[2023-10-14 08:33:52,731][100917] Updated weights for policy 1, policy_version 83092 (0.0008) +[2023-10-14 08:33:53,094][100917] Updated weights for policy 1, policy_version 83102 (0.0009) +[2023-10-14 08:33:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 170065920. Throughput: 0: 1666.0, 1: 1651.1. Samples: 42522580. Policy #0 lag: (min: 0.0, avg: 25.0, max: 32.0) +[2023-10-14 08:33:53,513][99942] Avg episode reward: [(0, '0.850'), (1, '0.680')] +[2023-10-14 08:33:56,167][100936] Updated weights for policy 0, policy_version 82980 (0.0008) +[2023-10-14 08:33:56,525][100936] Updated weights for policy 0, policy_version 82990 (0.0008) +[2023-10-14 08:33:56,900][100936] Updated weights for policy 0, policy_version 83000 (0.0009) +[2023-10-14 08:33:56,976][100917] Updated weights for policy 1, policy_version 83112 (0.0009) +[2023-10-14 08:33:57,342][100917] Updated weights for policy 1, policy_version 83122 (0.0007) +[2023-10-14 08:33:57,713][100917] Updated weights for policy 1, policy_version 83132 (0.0010) +[2023-10-14 08:33:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 170131456. Throughput: 0: 1662.4, 1: 1658.8. Samples: 42533794. Policy #0 lag: (min: 0.0, avg: 25.0, max: 32.0) +[2023-10-14 08:33:58,513][99942] Avg episode reward: [(0, '0.850'), (1, '0.680')] +[2023-10-14 08:34:00,999][100936] Updated weights for policy 0, policy_version 83010 (0.0008) +[2023-10-14 08:34:01,359][100936] Updated weights for policy 0, policy_version 83020 (0.0009) +[2023-10-14 08:34:01,726][100936] Updated weights for policy 0, policy_version 83030 (0.0009) +[2023-10-14 08:34:01,913][100917] Updated weights for policy 1, policy_version 83142 (0.0009) +[2023-10-14 08:34:02,098][100936] Updated weights for policy 0, policy_version 83040 (0.0007) +[2023-10-14 08:34:02,289][100917] Updated weights for policy 1, policy_version 83152 (0.0010) +[2023-10-14 08:34:02,661][100917] Updated weights for policy 1, policy_version 83162 (0.0008) +[2023-10-14 08:34:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 170196992. Throughput: 0: 1655.6, 1: 1655.8. Samples: 42552964. Policy #0 lag: (min: 0.0, avg: 25.0, max: 32.0) +[2023-10-14 08:34:03,512][99942] Avg episode reward: [(0, '0.850'), (1, '0.680')] +[2023-10-14 08:34:06,233][100936] Updated weights for policy 0, policy_version 83050 (0.0009) +[2023-10-14 08:34:06,600][100936] Updated weights for policy 0, policy_version 83060 (0.0009) +[2023-10-14 08:34:06,795][100917] Updated weights for policy 1, policy_version 83172 (0.0007) +[2023-10-14 08:34:06,968][100936] Updated weights for policy 0, policy_version 83070 (0.0009) +[2023-10-14 08:34:07,163][100917] Updated weights for policy 1, policy_version 83182 (0.0009) +[2023-10-14 08:34:07,538][100917] Updated weights for policy 1, policy_version 83192 (0.0010) +[2023-10-14 08:34:08,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 170262528. Throughput: 0: 1665.2, 1: 1654.9. Samples: 42572394. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:34:08,513][99942] Avg episode reward: [(0, '0.850'), (1, '0.680')] +[2023-10-14 08:34:11,200][100936] Updated weights for policy 0, policy_version 83080 (0.0009) +[2023-10-14 08:34:11,564][100936] Updated weights for policy 0, policy_version 83090 (0.0009) +[2023-10-14 08:34:11,605][100917] Updated weights for policy 1, policy_version 83202 (0.0009) +[2023-10-14 08:34:11,929][100936] Updated weights for policy 0, policy_version 83100 (0.0009) +[2023-10-14 08:34:11,976][100917] Updated weights for policy 1, policy_version 83212 (0.0008) +[2023-10-14 08:34:12,344][100917] Updated weights for policy 1, policy_version 83222 (0.0007) +[2023-10-14 08:34:12,708][100917] Updated weights for policy 1, policy_version 83232 (0.0008) +[2023-10-14 08:34:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 170328064. Throughput: 0: 1654.4, 1: 1658.4. Samples: 42583308. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:34:13,513][99942] Avg episode reward: [(0, '0.850'), (1, '0.680')] +[2023-10-14 08:34:16,166][100936] Updated weights for policy 0, policy_version 83110 (0.0008) +[2023-10-14 08:34:16,533][100936] Updated weights for policy 0, policy_version 83120 (0.0008) +[2023-10-14 08:34:16,875][100917] Updated weights for policy 1, policy_version 83242 (0.0009) +[2023-10-14 08:34:16,904][100936] Updated weights for policy 0, policy_version 83130 (0.0008) +[2023-10-14 08:34:17,255][100917] Updated weights for policy 1, policy_version 83252 (0.0008) +[2023-10-14 08:34:17,626][100917] Updated weights for policy 1, policy_version 83262 (0.0008) +[2023-10-14 08:34:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 170393600. Throughput: 0: 1656.7, 1: 1652.7. Samples: 42602540. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:34:18,512][99942] Avg episode reward: [(0, '0.850'), (1, '0.680')] +[2023-10-14 08:34:21,062][100936] Updated weights for policy 0, policy_version 83140 (0.0010) +[2023-10-14 08:34:21,459][100936] Updated weights for policy 0, policy_version 83150 (0.0009) +[2023-10-14 08:34:21,634][100917] Updated weights for policy 1, policy_version 83272 (0.0008) +[2023-10-14 08:34:21,823][100936] Updated weights for policy 0, policy_version 83160 (0.0008) +[2023-10-14 08:34:21,994][100917] Updated weights for policy 1, policy_version 83282 (0.0007) +[2023-10-14 08:34:22,364][100917] Updated weights for policy 1, policy_version 83292 (0.0009) +[2023-10-14 08:34:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 170459136. Throughput: 0: 1657.6, 1: 1657.8. Samples: 42622010. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:34:23,513][99942] Avg episode reward: [(0, '0.850'), (1, '0.680')] +[2023-10-14 08:34:26,026][100936] Updated weights for policy 0, policy_version 83170 (0.0009) +[2023-10-14 08:34:26,396][100936] Updated weights for policy 0, policy_version 83180 (0.0009) +[2023-10-14 08:34:26,442][100917] Updated weights for policy 1, policy_version 83302 (0.0009) +[2023-10-14 08:34:26,768][100936] Updated weights for policy 0, policy_version 83190 (0.0010) +[2023-10-14 08:34:26,812][100917] Updated weights for policy 1, policy_version 83312 (0.0008) +[2023-10-14 08:34:27,136][100936] Updated weights for policy 0, policy_version 83200 (0.0009) +[2023-10-14 08:34:27,186][100917] Updated weights for policy 1, policy_version 83322 (0.0008) +[2023-10-14 08:34:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 170524672. Throughput: 0: 1652.4, 1: 1662.0. Samples: 42632950. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:34:28,512][99942] Avg episode reward: [(0, '0.940'), (1, '0.680')] +[2023-10-14 08:34:31,183][100917] Updated weights for policy 1, policy_version 83332 (0.0009) +[2023-10-14 08:34:31,235][100936] Updated weights for policy 0, policy_version 83210 (0.0007) +[2023-10-14 08:34:31,551][100917] Updated weights for policy 1, policy_version 83342 (0.0009) +[2023-10-14 08:34:31,603][100936] Updated weights for policy 0, policy_version 83220 (0.0007) +[2023-10-14 08:34:31,913][100917] Updated weights for policy 1, policy_version 83352 (0.0007) +[2023-10-14 08:34:31,977][100936] Updated weights for policy 0, policy_version 83230 (0.0008) +[2023-10-14 08:34:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 170590208. Throughput: 0: 1655.4, 1: 1647.6. Samples: 42651748. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:34:33,513][99942] Avg episode reward: [(0, '0.940'), (1, '0.680')] +[2023-10-14 08:34:36,185][100917] Updated weights for policy 1, policy_version 83362 (0.0010) +[2023-10-14 08:34:36,220][100936] Updated weights for policy 0, policy_version 83240 (0.0009) +[2023-10-14 08:34:36,591][100936] Updated weights for policy 0, policy_version 83250 (0.0008) +[2023-10-14 08:34:36,605][100917] Updated weights for policy 1, policy_version 83372 (0.0010) +[2023-10-14 08:34:36,958][100936] Updated weights for policy 0, policy_version 83260 (0.0007) +[2023-10-14 08:34:36,980][100917] Updated weights for policy 1, policy_version 83382 (0.0007) +[2023-10-14 08:34:37,355][100917] Updated weights for policy 1, policy_version 83392 (0.0007) +[2023-10-14 08:34:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 170655744. Throughput: 0: 1654.3, 1: 1656.7. Samples: 42671574. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:34:38,512][99942] Avg episode reward: [(0, '0.940'), (1, '0.680')] +[2023-10-14 08:34:41,113][100936] Updated weights for policy 0, policy_version 83270 (0.0007) +[2023-10-14 08:34:41,399][100917] Updated weights for policy 1, policy_version 83402 (0.0010) +[2023-10-14 08:34:41,481][100936] Updated weights for policy 0, policy_version 83280 (0.0007) +[2023-10-14 08:34:41,773][100917] Updated weights for policy 1, policy_version 83412 (0.0009) +[2023-10-14 08:34:41,847][100936] Updated weights for policy 0, policy_version 83290 (0.0008) +[2023-10-14 08:34:42,146][100917] Updated weights for policy 1, policy_version 83422 (0.0007) +[2023-10-14 08:34:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 170721280. Throughput: 0: 1646.0, 1: 1658.8. Samples: 42682510. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:34:43,513][99942] Avg episode reward: [(0, '0.940'), (1, '0.680')] +[2023-10-14 08:34:46,093][100936] Updated weights for policy 0, policy_version 83300 (0.0009) +[2023-10-14 08:34:46,313][100917] Updated weights for policy 1, policy_version 83432 (0.0007) +[2023-10-14 08:34:46,453][100936] Updated weights for policy 0, policy_version 83310 (0.0007) +[2023-10-14 08:34:46,690][100917] Updated weights for policy 1, policy_version 83442 (0.0009) +[2023-10-14 08:34:46,828][100936] Updated weights for policy 0, policy_version 83320 (0.0009) +[2023-10-14 08:34:47,055][100917] Updated weights for policy 1, policy_version 83452 (0.0009) +[2023-10-14 08:34:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 170786816. Throughput: 0: 1645.8, 1: 1645.3. Samples: 42701064. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:34:48,512][99942] Avg episode reward: [(0, '0.940'), (1, '0.680')] +[2023-10-14 08:34:50,859][100936] Updated weights for policy 0, policy_version 83330 (0.0008) +[2023-10-14 08:34:51,195][100917] Updated weights for policy 1, policy_version 83462 (0.0010) +[2023-10-14 08:34:51,230][100936] Updated weights for policy 0, policy_version 83340 (0.0009) +[2023-10-14 08:34:51,569][100917] Updated weights for policy 1, policy_version 83472 (0.0008) +[2023-10-14 08:34:51,592][100936] Updated weights for policy 0, policy_version 83350 (0.0008) +[2023-10-14 08:34:51,938][100917] Updated weights for policy 1, policy_version 83482 (0.0008) +[2023-10-14 08:34:51,962][100936] Updated weights for policy 0, policy_version 83360 (0.0008) +[2023-10-14 08:34:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 170852352. Throughput: 0: 1648.2, 1: 1659.1. Samples: 42721222. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:34:53,513][99942] Avg episode reward: [(0, '0.940'), (1, '0.680')] +[2023-10-14 08:34:56,141][100936] Updated weights for policy 0, policy_version 83370 (0.0008) +[2023-10-14 08:34:56,231][100917] Updated weights for policy 1, policy_version 83492 (0.0010) +[2023-10-14 08:34:56,502][100936] Updated weights for policy 0, policy_version 83380 (0.0009) +[2023-10-14 08:34:56,593][100917] Updated weights for policy 1, policy_version 83502 (0.0008) +[2023-10-14 08:34:56,882][100936] Updated weights for policy 0, policy_version 83390 (0.0008) +[2023-10-14 08:34:56,970][100917] Updated weights for policy 1, policy_version 83512 (0.0010) +[2023-10-14 08:34:58,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 170917888. Throughput: 0: 1643.9, 1: 1657.1. Samples: 42731856. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:34:58,513][99942] Avg episode reward: [(0, '0.940'), (1, '0.680')] +[2023-10-14 08:35:00,990][100936] Updated weights for policy 0, policy_version 83400 (0.0008) +[2023-10-14 08:35:01,150][100917] Updated weights for policy 1, policy_version 83522 (0.0009) +[2023-10-14 08:35:01,363][100936] Updated weights for policy 0, policy_version 83410 (0.0009) +[2023-10-14 08:35:01,517][100917] Updated weights for policy 1, policy_version 83532 (0.0007) +[2023-10-14 08:35:01,723][100936] Updated weights for policy 0, policy_version 83420 (0.0009) +[2023-10-14 08:35:01,877][100917] Updated weights for policy 1, policy_version 83542 (0.0008) +[2023-10-14 08:35:02,251][100917] Updated weights for policy 1, policy_version 83552 (0.0008) +[2023-10-14 08:35:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 170983424. Throughput: 0: 1647.7, 1: 1647.8. Samples: 42750838. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 08:35:03,512][99942] Avg episode reward: [(0, '0.940'), (1, '0.690')] +[2023-10-14 08:35:05,883][100936] Updated weights for policy 0, policy_version 83430 (0.0008) +[2023-10-14 08:35:06,273][100936] Updated weights for policy 0, policy_version 83440 (0.0008) +[2023-10-14 08:35:06,387][100917] Updated weights for policy 1, policy_version 83562 (0.0009) +[2023-10-14 08:35:06,639][100936] Updated weights for policy 0, policy_version 83450 (0.0008) +[2023-10-14 08:35:06,754][100917] Updated weights for policy 1, policy_version 83572 (0.0009) +[2023-10-14 08:35:07,133][100917] Updated weights for policy 1, policy_version 83582 (0.0009) +[2023-10-14 08:35:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 171048960. Throughput: 0: 1642.2, 1: 1655.7. Samples: 42770416. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 08:35:08,512][99942] Avg episode reward: [(0, '0.940'), (1, '0.690')] +[2023-10-14 08:35:11,008][100936] Updated weights for policy 0, policy_version 83460 (0.0008) +[2023-10-14 08:35:11,195][100917] Updated weights for policy 1, policy_version 83592 (0.0009) +[2023-10-14 08:35:11,376][100936] Updated weights for policy 0, policy_version 83470 (0.0007) +[2023-10-14 08:35:11,561][100917] Updated weights for policy 1, policy_version 83602 (0.0009) +[2023-10-14 08:35:11,746][100936] Updated weights for policy 0, policy_version 83480 (0.0011) +[2023-10-14 08:35:11,939][100917] Updated weights for policy 1, policy_version 83612 (0.0010) +[2023-10-14 08:35:13,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 171114496. Throughput: 0: 1639.7, 1: 1657.6. Samples: 42781332. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 08:35:13,514][99942] Avg episode reward: [(0, '0.940'), (1, '0.690')] +[2023-10-14 08:35:16,043][100936] Updated weights for policy 0, policy_version 83490 (0.0008) +[2023-10-14 08:35:16,355][100917] Updated weights for policy 1, policy_version 83622 (0.0010) +[2023-10-14 08:35:16,408][100936] Updated weights for policy 0, policy_version 83500 (0.0009) +[2023-10-14 08:35:16,722][100917] Updated weights for policy 1, policy_version 83632 (0.0007) +[2023-10-14 08:35:16,778][100936] Updated weights for policy 0, policy_version 83510 (0.0010) +[2023-10-14 08:35:17,095][100917] Updated weights for policy 1, policy_version 83642 (0.0007) +[2023-10-14 08:35:17,145][100936] Updated weights for policy 0, policy_version 83520 (0.0010) +[2023-10-14 08:35:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 171180032. Throughput: 0: 1641.5, 1: 1646.4. Samples: 42799702. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 08:35:18,513][99942] Avg episode reward: [(0, '0.940'), (1, '0.690')] +[2023-10-14 08:35:21,226][100917] Updated weights for policy 1, policy_version 83652 (0.0009) +[2023-10-14 08:35:21,356][100936] Updated weights for policy 0, policy_version 83530 (0.0010) +[2023-10-14 08:35:21,619][100917] Updated weights for policy 1, policy_version 83662 (0.0008) +[2023-10-14 08:35:21,712][100936] Updated weights for policy 0, policy_version 83540 (0.0009) +[2023-10-14 08:35:21,991][100917] Updated weights for policy 1, policy_version 83672 (0.0007) +[2023-10-14 08:35:22,083][100936] Updated weights for policy 0, policy_version 83550 (0.0008) +[2023-10-14 08:35:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 171245568. Throughput: 0: 1639.9, 1: 1645.2. Samples: 42819402. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 08:35:23,513][99942] Avg episode reward: [(0, '0.940'), (1, '0.690')] +[2023-10-14 08:35:26,215][100936] Updated weights for policy 0, policy_version 83560 (0.0008) +[2023-10-14 08:35:26,216][100917] Updated weights for policy 1, policy_version 83682 (0.0008) +[2023-10-14 08:35:26,575][100936] Updated weights for policy 0, policy_version 83570 (0.0008) +[2023-10-14 08:35:26,593][100917] Updated weights for policy 1, policy_version 83692 (0.0008) +[2023-10-14 08:35:26,952][100936] Updated weights for policy 0, policy_version 83580 (0.0008) +[2023-10-14 08:35:26,962][100917] Updated weights for policy 1, policy_version 83702 (0.0009) +[2023-10-14 08:35:27,334][100917] Updated weights for policy 1, policy_version 83712 (0.0010) +[2023-10-14 08:35:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 171311104. Throughput: 0: 1638.5, 1: 1641.3. Samples: 42830102. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 08:35:28,513][99942] Avg episode reward: [(0, '0.940'), (1, '0.690')] +[2023-10-14 08:35:31,037][100936] Updated weights for policy 0, policy_version 83590 (0.0008) +[2023-10-14 08:35:31,393][100936] Updated weights for policy 0, policy_version 83600 (0.0010) +[2023-10-14 08:35:31,430][100917] Updated weights for policy 1, policy_version 83722 (0.0007) +[2023-10-14 08:35:31,765][100936] Updated weights for policy 0, policy_version 83610 (0.0008) +[2023-10-14 08:35:31,798][100917] Updated weights for policy 1, policy_version 83732 (0.0010) +[2023-10-14 08:35:32,166][100917] Updated weights for policy 1, policy_version 83742 (0.0009) +[2023-10-14 08:35:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 171376640. Throughput: 0: 1643.4, 1: 1640.2. Samples: 42848826. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 08:35:33,513][99942] Avg episode reward: [(0, '0.940'), (1, '0.690')] +[2023-10-14 08:35:35,936][100936] Updated weights for policy 0, policy_version 83620 (0.0008) +[2023-10-14 08:35:36,224][100917] Updated weights for policy 1, policy_version 83752 (0.0009) +[2023-10-14 08:35:36,307][100936] Updated weights for policy 0, policy_version 83630 (0.0009) +[2023-10-14 08:35:36,600][100917] Updated weights for policy 1, policy_version 83762 (0.0008) +[2023-10-14 08:35:36,670][100936] Updated weights for policy 0, policy_version 83640 (0.0008) +[2023-10-14 08:35:36,966][100917] Updated weights for policy 1, policy_version 83772 (0.0008) +[2023-10-14 08:35:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 171442176. Throughput: 0: 1639.8, 1: 1638.0. Samples: 42868724. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 08:35:38,512][99942] Avg episode reward: [(0, '0.940'), (1, '0.690')] +[2023-10-14 08:35:38,520][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000083648_85655552.pth... +[2023-10-14 08:35:38,521][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000083776_85786624.pth... +[2023-10-14 08:35:38,551][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000082112_84082688.pth +[2023-10-14 08:35:38,557][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000082240_84213760.pth +[2023-10-14 08:35:40,740][100936] Updated weights for policy 0, policy_version 83650 (0.0008) +[2023-10-14 08:35:41,028][100917] Updated weights for policy 1, policy_version 83782 (0.0009) +[2023-10-14 08:35:41,113][100936] Updated weights for policy 0, policy_version 83660 (0.0009) +[2023-10-14 08:35:41,399][100917] Updated weights for policy 1, policy_version 83792 (0.0008) +[2023-10-14 08:35:41,476][100936] Updated weights for policy 0, policy_version 83670 (0.0009) +[2023-10-14 08:35:41,762][100917] Updated weights for policy 1, policy_version 83802 (0.0008) +[2023-10-14 08:35:41,845][100936] Updated weights for policy 0, policy_version 83680 (0.0009) +[2023-10-14 08:35:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 171507712. Throughput: 0: 1639.8, 1: 1642.9. Samples: 42879578. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 08:35:43,513][99942] Avg episode reward: [(0, '0.940'), (1, '0.690')] +[2023-10-14 08:35:46,030][100936] Updated weights for policy 0, policy_version 83690 (0.0009) +[2023-10-14 08:35:46,052][100917] Updated weights for policy 1, policy_version 83812 (0.0009) +[2023-10-14 08:35:46,393][100936] Updated weights for policy 0, policy_version 83700 (0.0008) +[2023-10-14 08:35:46,419][100917] Updated weights for policy 1, policy_version 83822 (0.0009) +[2023-10-14 08:35:46,759][100936] Updated weights for policy 0, policy_version 83710 (0.0007) +[2023-10-14 08:35:46,794][100917] Updated weights for policy 1, policy_version 83832 (0.0008) +[2023-10-14 08:35:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 171573248. Throughput: 0: 1639.8, 1: 1637.0. Samples: 42898292. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 08:35:48,512][99942] Avg episode reward: [(0, '0.940'), (1, '0.690')] +[2023-10-14 08:35:50,959][100917] Updated weights for policy 1, policy_version 83842 (0.0010) +[2023-10-14 08:35:50,986][100936] Updated weights for policy 0, policy_version 83720 (0.0008) +[2023-10-14 08:35:51,330][100917] Updated weights for policy 1, policy_version 83852 (0.0008) +[2023-10-14 08:35:51,361][100936] Updated weights for policy 0, policy_version 83730 (0.0008) +[2023-10-14 08:35:51,705][100917] Updated weights for policy 1, policy_version 83862 (0.0009) +[2023-10-14 08:35:51,731][100936] Updated weights for policy 0, policy_version 83740 (0.0008) +[2023-10-14 08:35:52,069][100917] Updated weights for policy 1, policy_version 83872 (0.0008) +[2023-10-14 08:35:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 171638784. Throughput: 0: 1647.1, 1: 1642.5. Samples: 42918446. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) +[2023-10-14 08:35:53,513][99942] Avg episode reward: [(0, '0.940'), (1, '0.690')] +[2023-10-14 08:35:55,897][100936] Updated weights for policy 0, policy_version 83750 (0.0008) +[2023-10-14 08:35:56,115][100917] Updated weights for policy 1, policy_version 83882 (0.0007) +[2023-10-14 08:35:56,264][100936] Updated weights for policy 0, policy_version 83760 (0.0007) +[2023-10-14 08:35:56,478][100917] Updated weights for policy 1, policy_version 83892 (0.0008) +[2023-10-14 08:35:56,632][100936] Updated weights for policy 0, policy_version 83770 (0.0007) +[2023-10-14 08:35:56,849][100917] Updated weights for policy 1, policy_version 83902 (0.0009) +[2023-10-14 08:35:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 171704320. Throughput: 0: 1645.0, 1: 1633.6. Samples: 42928866. Policy #0 lag: (min: 3.0, avg: 10.9, max: 35.0) +[2023-10-14 08:35:58,512][99942] Avg episode reward: [(0, '0.940'), (1, '0.690')] +[2023-10-14 08:36:00,923][100936] Updated weights for policy 0, policy_version 83780 (0.0008) +[2023-10-14 08:36:00,970][100917] Updated weights for policy 1, policy_version 83912 (0.0008) +[2023-10-14 08:36:01,283][100936] Updated weights for policy 0, policy_version 83790 (0.0008) +[2023-10-14 08:36:01,340][100917] Updated weights for policy 1, policy_version 83922 (0.0008) +[2023-10-14 08:36:01,647][100936] Updated weights for policy 0, policy_version 83800 (0.0007) +[2023-10-14 08:36:01,702][100917] Updated weights for policy 1, policy_version 83932 (0.0009) +[2023-10-14 08:36:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 171769856. Throughput: 0: 1649.3, 1: 1637.0. Samples: 42947586. Policy #0 lag: (min: 3.0, avg: 10.9, max: 35.0) +[2023-10-14 08:36:03,513][99942] Avg episode reward: [(0, '0.940'), (1, '0.690')] +[2023-10-14 08:36:05,810][100936] Updated weights for policy 0, policy_version 83810 (0.0008) +[2023-10-14 08:36:05,878][100917] Updated weights for policy 1, policy_version 83942 (0.0008) +[2023-10-14 08:36:06,182][100936] Updated weights for policy 0, policy_version 83820 (0.0007) +[2023-10-14 08:36:06,251][100917] Updated weights for policy 1, policy_version 83952 (0.0010) +[2023-10-14 08:36:06,543][100936] Updated weights for policy 0, policy_version 83830 (0.0007) +[2023-10-14 08:36:06,613][100917] Updated weights for policy 1, policy_version 83962 (0.0010) +[2023-10-14 08:36:06,909][100936] Updated weights for policy 0, policy_version 83840 (0.0008) +[2023-10-14 08:36:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 171835392. Throughput: 0: 1652.3, 1: 1655.8. Samples: 42968268. Policy #0 lag: (min: 3.0, avg: 10.9, max: 35.0) +[2023-10-14 08:36:08,512][99942] Avg episode reward: [(0, '0.940'), (1, '0.690')] +[2023-10-14 08:36:10,739][100917] Updated weights for policy 1, policy_version 83972 (0.0009) +[2023-10-14 08:36:11,134][100936] Updated weights for policy 0, policy_version 83850 (0.0008) +[2023-10-14 08:36:11,142][100917] Updated weights for policy 1, policy_version 83982 (0.0009) +[2023-10-14 08:36:11,506][100936] Updated weights for policy 0, policy_version 83860 (0.0008) +[2023-10-14 08:36:11,513][100917] Updated weights for policy 1, policy_version 83992 (0.0007) +[2023-10-14 08:36:11,884][100936] Updated weights for policy 0, policy_version 83870 (0.0007) +[2023-10-14 08:36:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 171900928. Throughput: 0: 1646.8, 1: 1652.9. Samples: 42978590. Policy #0 lag: (min: 3.0, avg: 10.9, max: 35.0) +[2023-10-14 08:36:13,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 08:36:15,761][100917] Updated weights for policy 1, policy_version 84002 (0.0009) +[2023-10-14 08:36:15,966][100936] Updated weights for policy 0, policy_version 83880 (0.0007) +[2023-10-14 08:36:16,127][100917] Updated weights for policy 1, policy_version 84012 (0.0008) +[2023-10-14 08:36:16,325][100936] Updated weights for policy 0, policy_version 83890 (0.0009) +[2023-10-14 08:36:16,503][100917] Updated weights for policy 1, policy_version 84022 (0.0009) +[2023-10-14 08:36:16,700][100936] Updated weights for policy 0, policy_version 83900 (0.0008) +[2023-10-14 08:36:16,876][100917] Updated weights for policy 1, policy_version 84032 (0.0009) +[2023-10-14 08:36:18,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 171966464. Throughput: 0: 1645.0, 1: 1650.7. Samples: 42997134. Policy #0 lag: (min: 3.0, avg: 10.9, max: 35.0) +[2023-10-14 08:36:18,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 08:36:20,895][100936] Updated weights for policy 0, policy_version 83910 (0.0008) +[2023-10-14 08:36:21,060][100917] Updated weights for policy 1, policy_version 84042 (0.0007) +[2023-10-14 08:36:21,273][100936] Updated weights for policy 0, policy_version 83920 (0.0008) +[2023-10-14 08:36:21,434][100917] Updated weights for policy 1, policy_version 84052 (0.0007) +[2023-10-14 08:36:21,640][100936] Updated weights for policy 0, policy_version 83930 (0.0008) +[2023-10-14 08:36:21,803][100917] Updated weights for policy 1, policy_version 84062 (0.0008) +[2023-10-14 08:36:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 172032000. Throughput: 0: 1646.3, 1: 1658.7. Samples: 43017446. Policy #0 lag: (min: 3.0, avg: 10.9, max: 35.0) +[2023-10-14 08:36:23,513][99942] Avg episode reward: [(0, '0.940'), (1, '1.000')] +[2023-10-14 08:36:25,714][100936] Updated weights for policy 0, policy_version 83940 (0.0010) +[2023-10-14 08:36:25,991][100917] Updated weights for policy 1, policy_version 84072 (0.0008) +[2023-10-14 08:36:26,083][100936] Updated weights for policy 0, policy_version 83950 (0.0007) +[2023-10-14 08:36:26,364][100917] Updated weights for policy 1, policy_version 84082 (0.0007) +[2023-10-14 08:36:26,461][100936] Updated weights for policy 0, policy_version 83960 (0.0009) +[2023-10-14 08:36:26,732][100917] Updated weights for policy 1, policy_version 84092 (0.0008) +[2023-10-14 08:36:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 172097536. Throughput: 0: 1642.5, 1: 1648.6. Samples: 43027678. Policy #0 lag: (min: 3.0, avg: 10.9, max: 35.0) +[2023-10-14 08:36:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:36:30,447][100936] Updated weights for policy 0, policy_version 83970 (0.0009) +[2023-10-14 08:36:30,706][100917] Updated weights for policy 1, policy_version 84102 (0.0008) +[2023-10-14 08:36:30,827][100936] Updated weights for policy 0, policy_version 83980 (0.0007) +[2023-10-14 08:36:31,071][100917] Updated weights for policy 1, policy_version 84112 (0.0008) +[2023-10-14 08:36:31,184][100936] Updated weights for policy 0, policy_version 83990 (0.0007) +[2023-10-14 08:36:31,441][100917] Updated weights for policy 1, policy_version 84122 (0.0009) +[2023-10-14 08:36:31,558][100936] Updated weights for policy 0, policy_version 84000 (0.0007) +[2023-10-14 08:36:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 172163072. Throughput: 0: 1649.4, 1: 1657.3. Samples: 43047096. Policy #0 lag: (min: 3.0, avg: 10.9, max: 35.0) +[2023-10-14 08:36:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:36:35,452][100917] Updated weights for policy 1, policy_version 84132 (0.0009) +[2023-10-14 08:36:35,617][100936] Updated weights for policy 0, policy_version 84010 (0.0008) +[2023-10-14 08:36:35,820][100917] Updated weights for policy 1, policy_version 84142 (0.0008) +[2023-10-14 08:36:35,989][100936] Updated weights for policy 0, policy_version 84020 (0.0008) +[2023-10-14 08:36:36,185][100917] Updated weights for policy 1, policy_version 84152 (0.0008) +[2023-10-14 08:36:36,358][100936] Updated weights for policy 0, policy_version 84030 (0.0008) +[2023-10-14 08:36:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 172228608. Throughput: 0: 1654.4, 1: 1663.2. Samples: 43067738. Policy #0 lag: (min: 3.0, avg: 10.9, max: 35.0) +[2023-10-14 08:36:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:36:40,195][100917] Updated weights for policy 1, policy_version 84162 (0.0008) +[2023-10-14 08:36:40,560][100917] Updated weights for policy 1, policy_version 84172 (0.0008) +[2023-10-14 08:36:40,718][100936] Updated weights for policy 0, policy_version 84040 (0.0009) +[2023-10-14 08:36:40,923][100917] Updated weights for policy 1, policy_version 84182 (0.0009) +[2023-10-14 08:36:41,098][100936] Updated weights for policy 0, policy_version 84050 (0.0008) +[2023-10-14 08:36:41,297][100917] Updated weights for policy 1, policy_version 84192 (0.0009) +[2023-10-14 08:36:41,471][100936] Updated weights for policy 0, policy_version 84060 (0.0008) +[2023-10-14 08:36:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 172294144. Throughput: 0: 1646.7, 1: 1654.0. Samples: 43077398. Policy #0 lag: (min: 3.0, avg: 10.9, max: 35.0) +[2023-10-14 08:36:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:36:45,417][100917] Updated weights for policy 1, policy_version 84202 (0.0009) +[2023-10-14 08:36:45,676][100936] Updated weights for policy 0, policy_version 84070 (0.0009) +[2023-10-14 08:36:45,793][100917] Updated weights for policy 1, policy_version 84212 (0.0009) +[2023-10-14 08:36:46,039][100936] Updated weights for policy 0, policy_version 84080 (0.0008) +[2023-10-14 08:36:46,161][100917] Updated weights for policy 1, policy_version 84222 (0.0009) +[2023-10-14 08:36:46,401][100936] Updated weights for policy 0, policy_version 84090 (0.0008) +[2023-10-14 08:36:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 172359680. Throughput: 0: 1653.6, 1: 1669.8. Samples: 43097138. Policy #0 lag: (min: 3.0, avg: 10.9, max: 35.0) +[2023-10-14 08:36:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:36:50,235][100917] Updated weights for policy 1, policy_version 84232 (0.0007) +[2023-10-14 08:36:50,573][100936] Updated weights for policy 0, policy_version 84100 (0.0008) +[2023-10-14 08:36:50,609][100917] Updated weights for policy 1, policy_version 84242 (0.0007) +[2023-10-14 08:36:50,945][100936] Updated weights for policy 0, policy_version 84110 (0.0009) +[2023-10-14 08:36:50,977][100917] Updated weights for policy 1, policy_version 84252 (0.0007) +[2023-10-14 08:36:51,309][100936] Updated weights for policy 0, policy_version 84120 (0.0010) +[2023-10-14 08:36:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 172425216. Throughput: 0: 1651.0, 1: 1664.5. Samples: 43117464. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:36:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:36:55,217][100917] Updated weights for policy 1, policy_version 84262 (0.0008) +[2023-10-14 08:36:55,343][100936] Updated weights for policy 0, policy_version 84130 (0.0009) +[2023-10-14 08:36:55,592][100917] Updated weights for policy 1, policy_version 84272 (0.0007) +[2023-10-14 08:36:55,714][100936] Updated weights for policy 0, policy_version 84140 (0.0007) +[2023-10-14 08:36:55,954][100917] Updated weights for policy 1, policy_version 84282 (0.0008) +[2023-10-14 08:36:56,087][100936] Updated weights for policy 0, policy_version 84150 (0.0008) +[2023-10-14 08:36:56,447][100936] Updated weights for policy 0, policy_version 84160 (0.0008) +[2023-10-14 08:36:58,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 172490752. Throughput: 0: 1644.9, 1: 1649.1. Samples: 43126818. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:36:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:37:00,170][100917] Updated weights for policy 1, policy_version 84292 (0.0009) +[2023-10-14 08:37:00,513][100936] Updated weights for policy 0, policy_version 84170 (0.0008) +[2023-10-14 08:37:00,569][100917] Updated weights for policy 1, policy_version 84302 (0.0008) +[2023-10-14 08:37:00,880][100936] Updated weights for policy 0, policy_version 84180 (0.0008) +[2023-10-14 08:37:00,943][100917] Updated weights for policy 1, policy_version 84312 (0.0008) +[2023-10-14 08:37:01,245][100936] Updated weights for policy 0, policy_version 84190 (0.0008) +[2023-10-14 08:37:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 172556288. Throughput: 0: 1665.9, 1: 1663.7. Samples: 43146966. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:37:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:37:04,989][100917] Updated weights for policy 1, policy_version 84322 (0.0009) +[2023-10-14 08:37:05,190][100936] Updated weights for policy 0, policy_version 84200 (0.0008) +[2023-10-14 08:37:05,354][100917] Updated weights for policy 1, policy_version 84332 (0.0009) +[2023-10-14 08:37:05,560][100936] Updated weights for policy 0, policy_version 84210 (0.0008) +[2023-10-14 08:37:05,723][100917] Updated weights for policy 1, policy_version 84342 (0.0008) +[2023-10-14 08:37:05,923][100936] Updated weights for policy 0, policy_version 84220 (0.0008) +[2023-10-14 08:37:06,095][100917] Updated weights for policy 1, policy_version 84352 (0.0009) +[2023-10-14 08:37:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 172621824. Throughput: 0: 1670.3, 1: 1665.2. Samples: 43167546. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:37:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:37:10,081][100917] Updated weights for policy 1, policy_version 84362 (0.0009) +[2023-10-14 08:37:10,093][100936] Updated weights for policy 0, policy_version 84230 (0.0009) +[2023-10-14 08:37:10,459][100917] Updated weights for policy 1, policy_version 84372 (0.0009) +[2023-10-14 08:37:10,468][100936] Updated weights for policy 0, policy_version 84240 (0.0007) +[2023-10-14 08:37:10,840][100936] Updated weights for policy 0, policy_version 84250 (0.0007) +[2023-10-14 08:37:10,841][100917] Updated weights for policy 1, policy_version 84382 (0.0007) +[2023-10-14 08:37:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 172687360. Throughput: 0: 1660.6, 1: 1647.2. Samples: 43176530. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:37:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:37:14,923][100936] Updated weights for policy 0, policy_version 84260 (0.0010) +[2023-10-14 08:37:14,985][100917] Updated weights for policy 1, policy_version 84392 (0.0007) +[2023-10-14 08:37:15,297][100936] Updated weights for policy 0, policy_version 84270 (0.0008) +[2023-10-14 08:37:15,366][100917] Updated weights for policy 1, policy_version 84402 (0.0008) +[2023-10-14 08:37:15,671][100936] Updated weights for policy 0, policy_version 84280 (0.0007) +[2023-10-14 08:37:15,736][100917] Updated weights for policy 1, policy_version 84412 (0.0010) +[2023-10-14 08:37:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 172752896. Throughput: 0: 1663.0, 1: 1660.8. Samples: 43196670. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:37:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:37:19,959][100936] Updated weights for policy 0, policy_version 84290 (0.0007) +[2023-10-14 08:37:20,007][100917] Updated weights for policy 1, policy_version 84422 (0.0009) +[2023-10-14 08:37:20,324][100936] Updated weights for policy 0, policy_version 84300 (0.0008) +[2023-10-14 08:37:20,379][100917] Updated weights for policy 1, policy_version 84432 (0.0008) +[2023-10-14 08:37:20,694][100936] Updated weights for policy 0, policy_version 84310 (0.0008) +[2023-10-14 08:37:20,758][100917] Updated weights for policy 1, policy_version 84442 (0.0009) +[2023-10-14 08:37:21,051][100936] Updated weights for policy 0, policy_version 84320 (0.0009) +[2023-10-14 08:37:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 172818432. Throughput: 0: 1660.0, 1: 1657.9. Samples: 43217044. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:37:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:37:24,970][100917] Updated weights for policy 1, policy_version 84452 (0.0007) +[2023-10-14 08:37:25,317][100936] Updated weights for policy 0, policy_version 84330 (0.0009) +[2023-10-14 08:37:25,337][100917] Updated weights for policy 1, policy_version 84462 (0.0008) +[2023-10-14 08:37:25,685][100936] Updated weights for policy 0, policy_version 84340 (0.0008) +[2023-10-14 08:37:25,702][100917] Updated weights for policy 1, policy_version 84472 (0.0009) +[2023-10-14 08:37:26,056][100936] Updated weights for policy 0, policy_version 84350 (0.0009) +[2023-10-14 08:37:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 172883968. Throughput: 0: 1652.5, 1: 1647.6. Samples: 43225900. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:37:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:37:29,701][100917] Updated weights for policy 1, policy_version 84482 (0.0007) +[2023-10-14 08:37:29,957][100936] Updated weights for policy 0, policy_version 84360 (0.0008) +[2023-10-14 08:37:30,086][100917] Updated weights for policy 1, policy_version 84492 (0.0007) +[2023-10-14 08:37:30,320][100936] Updated weights for policy 0, policy_version 84370 (0.0009) +[2023-10-14 08:37:30,449][100917] Updated weights for policy 1, policy_version 84502 (0.0009) +[2023-10-14 08:37:30,690][100936] Updated weights for policy 0, policy_version 84380 (0.0008) +[2023-10-14 08:37:30,818][100917] Updated weights for policy 1, policy_version 84512 (0.0007) +[2023-10-14 08:37:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 172949504. Throughput: 0: 1659.0, 1: 1659.0. Samples: 43246446. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:37:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:37:34,805][100917] Updated weights for policy 1, policy_version 84522 (0.0008) +[2023-10-14 08:37:34,905][100936] Updated weights for policy 0, policy_version 84390 (0.0007) +[2023-10-14 08:37:35,175][100917] Updated weights for policy 1, policy_version 84532 (0.0008) +[2023-10-14 08:37:35,282][100936] Updated weights for policy 0, policy_version 84400 (0.0007) +[2023-10-14 08:37:35,543][100917] Updated weights for policy 1, policy_version 84542 (0.0008) +[2023-10-14 08:37:35,647][100936] Updated weights for policy 0, policy_version 84410 (0.0007) +[2023-10-14 08:37:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 173015040. Throughput: 0: 1665.7, 1: 1658.8. Samples: 43267064. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:37:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:37:38,521][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000084416_86441984.pth... +[2023-10-14 08:37:38,521][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000084544_86573056.pth... +[2023-10-14 08:37:38,560][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000082880_84869120.pth +[2023-10-14 08:37:38,563][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000083008_85000192.pth +[2023-10-14 08:37:39,601][100917] Updated weights for policy 1, policy_version 84552 (0.0009) +[2023-10-14 08:37:39,691][100936] Updated weights for policy 0, policy_version 84420 (0.0007) +[2023-10-14 08:37:39,979][100917] Updated weights for policy 1, policy_version 84562 (0.0009) +[2023-10-14 08:37:40,058][100936] Updated weights for policy 0, policy_version 84430 (0.0007) +[2023-10-14 08:37:40,348][100917] Updated weights for policy 1, policy_version 84572 (0.0007) +[2023-10-14 08:37:40,437][100936] Updated weights for policy 0, policy_version 84440 (0.0007) +[2023-10-14 08:37:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 173080576. Throughput: 0: 1659.5, 1: 1655.7. Samples: 43276004. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:37:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:37:44,400][100917] Updated weights for policy 1, policy_version 84582 (0.0010) +[2023-10-14 08:37:44,642][100936] Updated weights for policy 0, policy_version 84450 (0.0009) +[2023-10-14 08:37:44,771][100917] Updated weights for policy 1, policy_version 84592 (0.0010) +[2023-10-14 08:37:45,006][100936] Updated weights for policy 0, policy_version 84460 (0.0008) +[2023-10-14 08:37:45,138][100917] Updated weights for policy 1, policy_version 84602 (0.0009) +[2023-10-14 08:37:45,377][100936] Updated weights for policy 0, policy_version 84470 (0.0009) +[2023-10-14 08:37:45,741][100936] Updated weights for policy 0, policy_version 84480 (0.0008) +[2023-10-14 08:37:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 173146112. Throughput: 0: 1657.5, 1: 1663.3. Samples: 43296404. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 08:37:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:37:49,310][100917] Updated weights for policy 1, policy_version 84612 (0.0008) +[2023-10-14 08:37:49,679][100917] Updated weights for policy 1, policy_version 84622 (0.0010) +[2023-10-14 08:37:49,963][100936] Updated weights for policy 0, policy_version 84490 (0.0010) +[2023-10-14 08:37:50,054][100917] Updated weights for policy 1, policy_version 84632 (0.0008) +[2023-10-14 08:37:50,329][100936] Updated weights for policy 0, policy_version 84500 (0.0007) +[2023-10-14 08:37:50,709][100936] Updated weights for policy 0, policy_version 84510 (0.0008) +[2023-10-14 08:37:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 173211648. Throughput: 0: 1654.4, 1: 1660.7. Samples: 43316726. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 08:37:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:37:54,296][100917] Updated weights for policy 1, policy_version 84642 (0.0009) +[2023-10-14 08:37:54,672][100917] Updated weights for policy 1, policy_version 84652 (0.0009) +[2023-10-14 08:37:54,934][100936] Updated weights for policy 0, policy_version 84520 (0.0009) +[2023-10-14 08:37:55,043][100917] Updated weights for policy 1, policy_version 84662 (0.0008) +[2023-10-14 08:37:55,302][100936] Updated weights for policy 0, policy_version 84530 (0.0008) +[2023-10-14 08:37:55,413][100917] Updated weights for policy 1, policy_version 84672 (0.0010) +[2023-10-14 08:37:55,664][100936] Updated weights for policy 0, policy_version 84540 (0.0008) +[2023-10-14 08:37:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 173277184. Throughput: 0: 1655.9, 1: 1660.6. Samples: 43325774. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 08:37:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:37:59,513][100917] Updated weights for policy 1, policy_version 84682 (0.0008) +[2023-10-14 08:37:59,737][100936] Updated weights for policy 0, policy_version 84550 (0.0009) +[2023-10-14 08:37:59,880][100917] Updated weights for policy 1, policy_version 84692 (0.0008) +[2023-10-14 08:38:00,105][100936] Updated weights for policy 0, policy_version 84560 (0.0009) +[2023-10-14 08:38:00,255][100917] Updated weights for policy 1, policy_version 84702 (0.0009) +[2023-10-14 08:38:00,475][100936] Updated weights for policy 0, policy_version 84570 (0.0008) +[2023-10-14 08:38:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 173342720. Throughput: 0: 1660.7, 1: 1663.5. Samples: 43346256. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 08:38:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:38:04,404][100917] Updated weights for policy 1, policy_version 84712 (0.0010) +[2023-10-14 08:38:04,539][100936] Updated weights for policy 0, policy_version 84580 (0.0007) +[2023-10-14 08:38:04,775][100917] Updated weights for policy 1, policy_version 84722 (0.0009) +[2023-10-14 08:38:04,912][100936] Updated weights for policy 0, policy_version 84590 (0.0007) +[2023-10-14 08:38:05,145][100917] Updated weights for policy 1, policy_version 84732 (0.0009) +[2023-10-14 08:38:05,281][100936] Updated weights for policy 0, policy_version 84600 (0.0007) +[2023-10-14 08:38:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 173408256. Throughput: 0: 1667.6, 1: 1664.7. Samples: 43366996. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 08:38:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:38:09,182][100917] Updated weights for policy 1, policy_version 84742 (0.0007) +[2023-10-14 08:38:09,296][100936] Updated weights for policy 0, policy_version 84610 (0.0007) +[2023-10-14 08:38:09,557][100917] Updated weights for policy 1, policy_version 84752 (0.0007) +[2023-10-14 08:38:09,667][100936] Updated weights for policy 0, policy_version 84620 (0.0009) +[2023-10-14 08:38:09,926][100917] Updated weights for policy 1, policy_version 84762 (0.0007) +[2023-10-14 08:38:10,026][100936] Updated weights for policy 0, policy_version 84630 (0.0010) +[2023-10-14 08:38:10,397][100936] Updated weights for policy 0, policy_version 84640 (0.0009) +[2023-10-14 08:38:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 173473792. Throughput: 0: 1667.4, 1: 1665.6. Samples: 43375882. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 08:38:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:38:14,037][100917] Updated weights for policy 1, policy_version 84772 (0.0011) +[2023-10-14 08:38:14,409][100917] Updated weights for policy 1, policy_version 84782 (0.0009) +[2023-10-14 08:38:14,659][100936] Updated weights for policy 0, policy_version 84650 (0.0008) +[2023-10-14 08:38:14,786][100917] Updated weights for policy 1, policy_version 84792 (0.0008) +[2023-10-14 08:38:15,033][100936] Updated weights for policy 0, policy_version 84660 (0.0008) +[2023-10-14 08:38:15,404][100936] Updated weights for policy 0, policy_version 84670 (0.0007) +[2023-10-14 08:38:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 173539328. Throughput: 0: 1666.7, 1: 1655.2. Samples: 43395934. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 08:38:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:38:19,085][100917] Updated weights for policy 1, policy_version 84802 (0.0009) +[2023-10-14 08:38:19,326][100936] Updated weights for policy 0, policy_version 84680 (0.0009) +[2023-10-14 08:38:19,454][100917] Updated weights for policy 1, policy_version 84812 (0.0007) +[2023-10-14 08:38:19,692][100936] Updated weights for policy 0, policy_version 84690 (0.0008) +[2023-10-14 08:38:19,813][100917] Updated weights for policy 1, policy_version 84822 (0.0008) +[2023-10-14 08:38:20,049][100936] Updated weights for policy 0, policy_version 84700 (0.0008) +[2023-10-14 08:38:20,184][100917] Updated weights for policy 1, policy_version 84832 (0.0008) +[2023-10-14 08:38:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 173604864. Throughput: 0: 1662.2, 1: 1659.0. Samples: 43416518. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 08:38:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:38:24,163][100917] Updated weights for policy 1, policy_version 84842 (0.0008) +[2023-10-14 08:38:24,204][100936] Updated weights for policy 0, policy_version 84710 (0.0008) +[2023-10-14 08:38:24,534][100917] Updated weights for policy 1, policy_version 84852 (0.0008) +[2023-10-14 08:38:24,581][100936] Updated weights for policy 0, policy_version 84720 (0.0009) +[2023-10-14 08:38:24,905][100917] Updated weights for policy 1, policy_version 84862 (0.0008) +[2023-10-14 08:38:24,951][100936] Updated weights for policy 0, policy_version 84730 (0.0008) +[2023-10-14 08:38:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 173670400. Throughput: 0: 1664.5, 1: 1658.6. Samples: 43425544. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 08:38:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:38:28,882][100936] Updated weights for policy 0, policy_version 84740 (0.0009) +[2023-10-14 08:38:29,004][100917] Updated weights for policy 1, policy_version 84872 (0.0007) +[2023-10-14 08:38:29,255][100936] Updated weights for policy 0, policy_version 84750 (0.0008) +[2023-10-14 08:38:29,379][100917] Updated weights for policy 1, policy_version 84882 (0.0007) +[2023-10-14 08:38:29,617][100936] Updated weights for policy 0, policy_version 84760 (0.0009) +[2023-10-14 08:38:29,743][100917] Updated weights for policy 1, policy_version 84892 (0.0008) +[2023-10-14 08:38:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 173735936. Throughput: 0: 1659.2, 1: 1661.1. Samples: 43445818. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 08:38:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:38:33,812][100936] Updated weights for policy 0, policy_version 84770 (0.0009) +[2023-10-14 08:38:33,924][100917] Updated weights for policy 1, policy_version 84902 (0.0009) +[2023-10-14 08:38:34,185][100936] Updated weights for policy 0, policy_version 84780 (0.0010) +[2023-10-14 08:38:34,305][100917] Updated weights for policy 1, policy_version 84912 (0.0010) +[2023-10-14 08:38:34,548][100936] Updated weights for policy 0, policy_version 84790 (0.0007) +[2023-10-14 08:38:34,665][100917] Updated weights for policy 1, policy_version 84922 (0.0010) +[2023-10-14 08:38:34,912][100936] Updated weights for policy 0, policy_version 84800 (0.0007) +[2023-10-14 08:38:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 173801472. Throughput: 0: 1662.4, 1: 1659.3. Samples: 43466204. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) +[2023-10-14 08:38:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:38:38,848][100917] Updated weights for policy 1, policy_version 84932 (0.0010) +[2023-10-14 08:38:39,124][100936] Updated weights for policy 0, policy_version 84810 (0.0007) +[2023-10-14 08:38:39,225][100917] Updated weights for policy 1, policy_version 84942 (0.0010) +[2023-10-14 08:38:39,501][100936] Updated weights for policy 0, policy_version 84820 (0.0008) +[2023-10-14 08:38:39,601][100917] Updated weights for policy 1, policy_version 84952 (0.0010) +[2023-10-14 08:38:39,862][100936] Updated weights for policy 0, policy_version 84830 (0.0009) +[2023-10-14 08:38:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 173867008. Throughput: 0: 1662.3, 1: 1657.6. Samples: 43475170. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:38:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:38:43,640][100917] Updated weights for policy 1, policy_version 84962 (0.0008) +[2023-10-14 08:38:44,013][100917] Updated weights for policy 1, policy_version 84972 (0.0008) +[2023-10-14 08:38:44,182][100936] Updated weights for policy 0, policy_version 84840 (0.0008) +[2023-10-14 08:38:44,381][100917] Updated weights for policy 1, policy_version 84982 (0.0008) +[2023-10-14 08:38:44,559][100936] Updated weights for policy 0, policy_version 84850 (0.0007) +[2023-10-14 08:38:44,745][100917] Updated weights for policy 1, policy_version 84992 (0.0007) +[2023-10-14 08:38:44,926][100936] Updated weights for policy 0, policy_version 84860 (0.0007) +[2023-10-14 08:38:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 173932544. Throughput: 0: 1657.0, 1: 1659.4. Samples: 43495496. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:38:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:38:48,989][100917] Updated weights for policy 1, policy_version 85002 (0.0007) +[2023-10-14 08:38:49,111][100936] Updated weights for policy 0, policy_version 84870 (0.0009) +[2023-10-14 08:38:49,360][100917] Updated weights for policy 1, policy_version 85012 (0.0009) +[2023-10-14 08:38:49,476][100936] Updated weights for policy 0, policy_version 84880 (0.0007) +[2023-10-14 08:38:49,731][100917] Updated weights for policy 1, policy_version 85022 (0.0009) +[2023-10-14 08:38:49,843][100936] Updated weights for policy 0, policy_version 84890 (0.0009) +[2023-10-14 08:38:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 173998080. Throughput: 0: 1649.7, 1: 1662.9. Samples: 43516058. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:38:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:38:53,792][100917] Updated weights for policy 1, policy_version 85032 (0.0010) +[2023-10-14 08:38:53,975][100936] Updated weights for policy 0, policy_version 84900 (0.0009) +[2023-10-14 08:38:54,168][100917] Updated weights for policy 1, policy_version 85042 (0.0009) +[2023-10-14 08:38:54,347][100936] Updated weights for policy 0, policy_version 84910 (0.0008) +[2023-10-14 08:38:54,538][100917] Updated weights for policy 1, policy_version 85052 (0.0009) +[2023-10-14 08:38:54,715][100936] Updated weights for policy 0, policy_version 84920 (0.0010) +[2023-10-14 08:38:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 174063616. Throughput: 0: 1654.1, 1: 1662.3. Samples: 43525120. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:38:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:38:58,638][100917] Updated weights for policy 1, policy_version 85062 (0.0010) +[2023-10-14 08:38:58,805][100936] Updated weights for policy 0, policy_version 84930 (0.0009) +[2023-10-14 08:38:59,011][100917] Updated weights for policy 1, policy_version 85072 (0.0009) +[2023-10-14 08:38:59,198][100936] Updated weights for policy 0, policy_version 84940 (0.0009) +[2023-10-14 08:38:59,384][100917] Updated weights for policy 1, policy_version 85082 (0.0008) +[2023-10-14 08:38:59,559][100936] Updated weights for policy 0, policy_version 84950 (0.0009) +[2023-10-14 08:38:59,925][100936] Updated weights for policy 0, policy_version 84960 (0.0009) +[2023-10-14 08:39:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 174129152. Throughput: 0: 1650.7, 1: 1667.5. Samples: 43545254. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:39:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:39:03,595][100917] Updated weights for policy 1, policy_version 85092 (0.0008) +[2023-10-14 08:39:03,974][100917] Updated weights for policy 1, policy_version 85102 (0.0009) +[2023-10-14 08:39:04,277][100936] Updated weights for policy 0, policy_version 84970 (0.0011) +[2023-10-14 08:39:04,335][100917] Updated weights for policy 1, policy_version 85112 (0.0007) +[2023-10-14 08:39:04,645][100936] Updated weights for policy 0, policy_version 84980 (0.0009) +[2023-10-14 08:39:05,015][100936] Updated weights for policy 0, policy_version 84990 (0.0007) +[2023-10-14 08:39:08,357][100917] Updated weights for policy 1, policy_version 85122 (0.0008) +[2023-10-14 08:39:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 174194688. Throughput: 0: 1649.5, 1: 1670.7. Samples: 43565924. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:39:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:39:08,729][100917] Updated weights for policy 1, policy_version 85132 (0.0009) +[2023-10-14 08:39:09,103][100917] Updated weights for policy 1, policy_version 85142 (0.0008) +[2023-10-14 08:39:09,190][100936] Updated weights for policy 0, policy_version 85000 (0.0008) +[2023-10-14 08:39:09,465][100917] Updated weights for policy 1, policy_version 85152 (0.0009) +[2023-10-14 08:39:09,554][100936] Updated weights for policy 0, policy_version 85010 (0.0008) +[2023-10-14 08:39:09,920][100936] Updated weights for policy 0, policy_version 85020 (0.0007) +[2023-10-14 08:39:13,426][100917] Updated weights for policy 1, policy_version 85162 (0.0008) +[2023-10-14 08:39:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 174260224. Throughput: 0: 1651.1, 1: 1668.4. Samples: 43574920. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:39:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:39:13,798][100917] Updated weights for policy 1, policy_version 85172 (0.0008) +[2023-10-14 08:39:13,909][100936] Updated weights for policy 0, policy_version 85030 (0.0008) +[2023-10-14 08:39:14,163][100917] Updated weights for policy 1, policy_version 85182 (0.0007) +[2023-10-14 08:39:14,277][100936] Updated weights for policy 0, policy_version 85040 (0.0008) +[2023-10-14 08:39:14,650][100936] Updated weights for policy 0, policy_version 85050 (0.0008) +[2023-10-14 08:39:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 174325760. Throughput: 0: 1655.8, 1: 1666.9. Samples: 43595338. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:39:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:39:18,541][100917] Updated weights for policy 1, policy_version 85192 (0.0008) +[2023-10-14 08:39:18,731][100936] Updated weights for policy 0, policy_version 85060 (0.0008) +[2023-10-14 08:39:18,934][100917] Updated weights for policy 1, policy_version 85202 (0.0007) +[2023-10-14 08:39:19,089][100936] Updated weights for policy 0, policy_version 85070 (0.0009) +[2023-10-14 08:39:19,293][100917] Updated weights for policy 1, policy_version 85212 (0.0008) +[2023-10-14 08:39:19,454][100936] Updated weights for policy 0, policy_version 85080 (0.0008) +[2023-10-14 08:39:23,325][100917] Updated weights for policy 1, policy_version 85222 (0.0010) +[2023-10-14 08:39:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 174391296. Throughput: 0: 1650.1, 1: 1667.0. Samples: 43615474. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:39:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:39:23,612][100936] Updated weights for policy 0, policy_version 85090 (0.0009) +[2023-10-14 08:39:23,700][100917] Updated weights for policy 1, policy_version 85232 (0.0009) +[2023-10-14 08:39:23,974][100936] Updated weights for policy 0, policy_version 85100 (0.0009) +[2023-10-14 08:39:24,072][100917] Updated weights for policy 1, policy_version 85242 (0.0009) +[2023-10-14 08:39:24,338][100936] Updated weights for policy 0, policy_version 85110 (0.0007) +[2023-10-14 08:39:24,700][100936] Updated weights for policy 0, policy_version 85120 (0.0007) +[2023-10-14 08:39:28,215][100917] Updated weights for policy 1, policy_version 85252 (0.0009) +[2023-10-14 08:39:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 174456832. Throughput: 0: 1653.0, 1: 1663.5. Samples: 43624414. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:39:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:39:28,588][100917] Updated weights for policy 1, policy_version 85262 (0.0008) +[2023-10-14 08:39:28,808][100936] Updated weights for policy 0, policy_version 85130 (0.0008) +[2023-10-14 08:39:28,957][100917] Updated weights for policy 1, policy_version 85272 (0.0007) +[2023-10-14 08:39:29,164][100936] Updated weights for policy 0, policy_version 85140 (0.0007) +[2023-10-14 08:39:29,540][100936] Updated weights for policy 0, policy_version 85150 (0.0009) +[2023-10-14 08:39:33,058][100917] Updated weights for policy 1, policy_version 85282 (0.0009) +[2023-10-14 08:39:33,420][100917] Updated weights for policy 1, policy_version 85292 (0.0010) +[2023-10-14 08:39:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 174522368. Throughput: 0: 1664.0, 1: 1663.3. Samples: 43645226. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:39:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:39:33,604][100936] Updated weights for policy 0, policy_version 85160 (0.0007) +[2023-10-14 08:39:33,794][100917] Updated weights for policy 1, policy_version 85302 (0.0009) +[2023-10-14 08:39:33,969][100936] Updated weights for policy 0, policy_version 85170 (0.0007) +[2023-10-14 08:39:34,157][100917] Updated weights for policy 1, policy_version 85312 (0.0007) +[2023-10-14 08:39:34,334][100936] Updated weights for policy 0, policy_version 85180 (0.0009) +[2023-10-14 08:39:38,372][100917] Updated weights for policy 1, policy_version 85322 (0.0007) +[2023-10-14 08:39:38,475][100936] Updated weights for policy 0, policy_version 85190 (0.0008) +[2023-10-14 08:39:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 174587904. Throughput: 0: 1658.7, 1: 1661.3. Samples: 43665456. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 08:39:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:39:38,733][100917] Updated weights for policy 1, policy_version 85332 (0.0008) +[2023-10-14 08:39:38,837][100936] Updated weights for policy 0, policy_version 85200 (0.0007) +[2023-10-14 08:39:39,104][100917] Updated weights for policy 1, policy_version 85342 (0.0010) +[2023-10-14 08:39:39,172][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000085344_87392256.pth... +[2023-10-14 08:39:39,201][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000083776_85786624.pth +[2023-10-14 08:39:39,204][100681] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p1/milestones/checkpoint_000085344_87392256.pth +[2023-10-14 08:39:39,211][100936] Updated weights for policy 0, policy_version 85210 (0.0007) +[2023-10-14 08:39:39,428][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000085216_87261184.pth... +[2023-10-14 08:39:39,456][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000083648_85655552.pth +[2023-10-14 08:39:39,460][100560] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p0/milestones/checkpoint_000085216_87261184.pth +[2023-10-14 08:39:43,369][100917] Updated weights for policy 1, policy_version 85352 (0.0007) +[2023-10-14 08:39:43,452][100936] Updated weights for policy 0, policy_version 85220 (0.0007) +[2023-10-14 08:39:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 174653440. Throughput: 0: 1660.7, 1: 1655.9. Samples: 43674368. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 08:39:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:39:43,739][100917] Updated weights for policy 1, policy_version 85362 (0.0007) +[2023-10-14 08:39:43,817][100936] Updated weights for policy 0, policy_version 85230 (0.0009) +[2023-10-14 08:39:44,110][100917] Updated weights for policy 1, policy_version 85372 (0.0010) +[2023-10-14 08:39:44,191][100936] Updated weights for policy 0, policy_version 85240 (0.0010) +[2023-10-14 08:39:48,285][100917] Updated weights for policy 1, policy_version 85382 (0.0008) +[2023-10-14 08:39:48,366][100936] Updated weights for policy 0, policy_version 85250 (0.0010) +[2023-10-14 08:39:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 174718976. Throughput: 0: 1661.9, 1: 1656.2. Samples: 43694570. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 08:39:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:39:48,642][100917] Updated weights for policy 1, policy_version 85392 (0.0008) +[2023-10-14 08:39:48,754][100936] Updated weights for policy 0, policy_version 85260 (0.0008) +[2023-10-14 08:39:49,013][100917] Updated weights for policy 1, policy_version 85402 (0.0009) +[2023-10-14 08:39:49,125][100936] Updated weights for policy 0, policy_version 85270 (0.0009) +[2023-10-14 08:39:49,489][100936] Updated weights for policy 0, policy_version 85280 (0.0009) +[2023-10-14 08:39:53,050][100917] Updated weights for policy 1, policy_version 85412 (0.0008) +[2023-10-14 08:39:53,424][100917] Updated weights for policy 1, policy_version 85422 (0.0008) +[2023-10-14 08:39:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 174784512. Throughput: 0: 1656.4, 1: 1650.9. Samples: 43714754. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 08:39:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:39:53,599][100936] Updated weights for policy 0, policy_version 85290 (0.0008) +[2023-10-14 08:39:53,802][100917] Updated weights for policy 1, policy_version 85432 (0.0007) +[2023-10-14 08:39:53,976][100936] Updated weights for policy 0, policy_version 85300 (0.0008) +[2023-10-14 08:39:54,348][100936] Updated weights for policy 0, policy_version 85310 (0.0008) +[2023-10-14 08:39:57,807][100917] Updated weights for policy 1, policy_version 85442 (0.0007) +[2023-10-14 08:39:58,169][100917] Updated weights for policy 1, policy_version 85452 (0.0008) +[2023-10-14 08:39:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 174850048. Throughput: 0: 1653.8, 1: 1656.0. Samples: 43723864. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 08:39:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:39:58,541][100917] Updated weights for policy 1, policy_version 85462 (0.0008) +[2023-10-14 08:39:58,604][100936] Updated weights for policy 0, policy_version 85320 (0.0008) +[2023-10-14 08:39:58,919][100917] Updated weights for policy 1, policy_version 85472 (0.0009) +[2023-10-14 08:39:58,976][100936] Updated weights for policy 0, policy_version 85330 (0.0008) +[2023-10-14 08:39:59,344][100936] Updated weights for policy 0, policy_version 85340 (0.0009) +[2023-10-14 08:40:02,950][100917] Updated weights for policy 1, policy_version 85482 (0.0007) +[2023-10-14 08:40:03,320][100917] Updated weights for policy 1, policy_version 85492 (0.0008) +[2023-10-14 08:40:03,375][100936] Updated weights for policy 0, policy_version 85350 (0.0009) +[2023-10-14 08:40:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 174915584. Throughput: 0: 1655.0, 1: 1659.1. Samples: 43744470. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 08:40:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:40:03,704][100917] Updated weights for policy 1, policy_version 85502 (0.0007) +[2023-10-14 08:40:03,742][100936] Updated weights for policy 0, policy_version 85360 (0.0007) +[2023-10-14 08:40:04,112][100936] Updated weights for policy 0, policy_version 85370 (0.0009) +[2023-10-14 08:40:08,038][100917] Updated weights for policy 1, policy_version 85512 (0.0007) +[2023-10-14 08:40:08,228][100936] Updated weights for policy 0, policy_version 85380 (0.0008) +[2023-10-14 08:40:08,413][100917] Updated weights for policy 1, policy_version 85522 (0.0007) +[2023-10-14 08:40:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 174981120. Throughput: 0: 1652.5, 1: 1650.8. Samples: 43764124. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 08:40:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:40:08,605][100936] Updated weights for policy 0, policy_version 85390 (0.0007) +[2023-10-14 08:40:08,777][100917] Updated weights for policy 1, policy_version 85532 (0.0009) +[2023-10-14 08:40:08,972][100936] Updated weights for policy 0, policy_version 85400 (0.0009) +[2023-10-14 08:40:12,819][100917] Updated weights for policy 1, policy_version 85542 (0.0009) +[2023-10-14 08:40:13,199][100917] Updated weights for policy 1, policy_version 85552 (0.0009) +[2023-10-14 08:40:13,350][100936] Updated weights for policy 0, policy_version 85410 (0.0008) +[2023-10-14 08:40:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 175046656. Throughput: 0: 1650.4, 1: 1662.4. Samples: 43773492. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 08:40:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:40:13,578][100917] Updated weights for policy 1, policy_version 85562 (0.0010) +[2023-10-14 08:40:13,727][100936] Updated weights for policy 0, policy_version 85420 (0.0009) +[2023-10-14 08:40:14,098][100936] Updated weights for policy 0, policy_version 85430 (0.0009) +[2023-10-14 08:40:14,476][100936] Updated weights for policy 0, policy_version 85440 (0.0007) +[2023-10-14 08:40:17,790][100917] Updated weights for policy 1, policy_version 85572 (0.0009) +[2023-10-14 08:40:18,165][100917] Updated weights for policy 1, policy_version 85582 (0.0008) +[2023-10-14 08:40:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 175112192. Throughput: 0: 1638.4, 1: 1660.9. Samples: 43793698. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 08:40:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:40:18,537][100917] Updated weights for policy 1, policy_version 85592 (0.0008) +[2023-10-14 08:40:18,757][100936] Updated weights for policy 0, policy_version 85450 (0.0009) +[2023-10-14 08:40:19,117][100936] Updated weights for policy 0, policy_version 85460 (0.0007) +[2023-10-14 08:40:19,489][100936] Updated weights for policy 0, policy_version 85470 (0.0010) +[2023-10-14 08:40:22,375][100917] Updated weights for policy 1, policy_version 85602 (0.0009) +[2023-10-14 08:40:22,738][100917] Updated weights for policy 1, policy_version 85612 (0.0011) +[2023-10-14 08:40:23,112][100917] Updated weights for policy 1, policy_version 85622 (0.0010) +[2023-10-14 08:40:23,483][100917] Updated weights for policy 1, policy_version 85632 (0.0008) +[2023-10-14 08:40:23,512][99942] Fps is (10 sec: 16383.5, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 175210496. Throughput: 0: 1638.6, 1: 1648.2. Samples: 43813362. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 08:40:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:40:23,529][100936] Updated weights for policy 0, policy_version 85480 (0.0007) +[2023-10-14 08:40:23,911][100936] Updated weights for policy 0, policy_version 85490 (0.0008) +[2023-10-14 08:40:24,290][100936] Updated weights for policy 0, policy_version 85500 (0.0010) +[2023-10-14 08:40:27,660][100917] Updated weights for policy 1, policy_version 85642 (0.0007) +[2023-10-14 08:40:28,033][100917] Updated weights for policy 1, policy_version 85652 (0.0007) +[2023-10-14 08:40:28,325][100936] Updated weights for policy 0, policy_version 85510 (0.0008) +[2023-10-14 08:40:28,402][100917] Updated weights for policy 1, policy_version 85662 (0.0008) +[2023-10-14 08:40:28,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 175276032. Throughput: 0: 1639.9, 1: 1669.2. Samples: 43823278. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) +[2023-10-14 08:40:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:40:28,686][100936] Updated weights for policy 0, policy_version 85520 (0.0007) +[2023-10-14 08:40:29,049][100936] Updated weights for policy 0, policy_version 85530 (0.0007) +[2023-10-14 08:40:32,651][100917] Updated weights for policy 1, policy_version 85672 (0.0008) +[2023-10-14 08:40:33,019][100917] Updated weights for policy 1, policy_version 85682 (0.0009) +[2023-10-14 08:40:33,357][100936] Updated weights for policy 0, policy_version 85540 (0.0008) +[2023-10-14 08:40:33,404][100917] Updated weights for policy 1, policy_version 85692 (0.0009) +[2023-10-14 08:40:33,512][99942] Fps is (10 sec: 9830.7, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 175308800. Throughput: 0: 1647.0, 1: 1670.6. Samples: 43843862. Policy #0 lag: (min: 22.0, avg: 26.4, max: 54.0) +[2023-10-14 08:40:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:40:33,763][100936] Updated weights for policy 0, policy_version 85550 (0.0009) +[2023-10-14 08:40:34,132][100936] Updated weights for policy 0, policy_version 85560 (0.0007) +[2023-10-14 08:40:37,490][100917] Updated weights for policy 1, policy_version 85702 (0.0009) +[2023-10-14 08:40:37,866][100917] Updated weights for policy 1, policy_version 85712 (0.0009) +[2023-10-14 08:40:38,127][100936] Updated weights for policy 0, policy_version 85570 (0.0008) +[2023-10-14 08:40:38,242][100917] Updated weights for policy 1, policy_version 85722 (0.0009) +[2023-10-14 08:40:38,493][100936] Updated weights for policy 0, policy_version 85580 (0.0009) +[2023-10-14 08:40:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 175407104. Throughput: 0: 1642.3, 1: 1651.0. Samples: 43862954. Policy #0 lag: (min: 22.0, avg: 26.4, max: 54.0) +[2023-10-14 08:40:38,513][99942] Avg episode reward: [(0, '0.790'), (1, '1.000')] +[2023-10-14 08:40:38,863][100936] Updated weights for policy 0, policy_version 85590 (0.0009) +[2023-10-14 08:40:39,234][100936] Updated weights for policy 0, policy_version 85600 (0.0008) +[2023-10-14 08:40:42,416][100917] Updated weights for policy 1, policy_version 85732 (0.0009) +[2023-10-14 08:40:42,795][100917] Updated weights for policy 1, policy_version 85742 (0.0007) +[2023-10-14 08:40:43,158][100917] Updated weights for policy 1, policy_version 85752 (0.0007) +[2023-10-14 08:40:43,326][100936] Updated weights for policy 0, policy_version 85610 (0.0008) +[2023-10-14 08:40:43,512][99942] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 175472640. Throughput: 0: 1649.2, 1: 1660.0. Samples: 43872774. Policy #0 lag: (min: 22.0, avg: 26.4, max: 54.0) +[2023-10-14 08:40:43,513][99942] Avg episode reward: [(0, '0.790'), (1, '1.000')] +[2023-10-14 08:40:43,697][100936] Updated weights for policy 0, policy_version 85620 (0.0010) +[2023-10-14 08:40:44,066][100936] Updated weights for policy 0, policy_version 85630 (0.0008) +[2023-10-14 08:40:47,334][100917] Updated weights for policy 1, policy_version 85762 (0.0008) +[2023-10-14 08:40:47,700][100917] Updated weights for policy 1, policy_version 85772 (0.0008) +[2023-10-14 08:40:48,068][100917] Updated weights for policy 1, policy_version 85782 (0.0009) +[2023-10-14 08:40:48,308][100936] Updated weights for policy 0, policy_version 85640 (0.0008) +[2023-10-14 08:40:48,443][100917] Updated weights for policy 1, policy_version 85792 (0.0009) +[2023-10-14 08:40:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 175538176. Throughput: 0: 1644.4, 1: 1657.3. Samples: 43893046. Policy #0 lag: (min: 22.0, avg: 26.4, max: 54.0) +[2023-10-14 08:40:48,513][99942] Avg episode reward: [(0, '0.790'), (1, '1.000')] +[2023-10-14 08:40:48,673][100936] Updated weights for policy 0, policy_version 85650 (0.0007) +[2023-10-14 08:40:49,048][100936] Updated weights for policy 0, policy_version 85660 (0.0007) +[2023-10-14 08:40:52,634][100917] Updated weights for policy 1, policy_version 85802 (0.0009) +[2023-10-14 08:40:53,017][100917] Updated weights for policy 1, policy_version 85812 (0.0008) +[2023-10-14 08:40:53,121][100936] Updated weights for policy 0, policy_version 85670 (0.0008) +[2023-10-14 08:40:53,384][100917] Updated weights for policy 1, policy_version 85822 (0.0009) +[2023-10-14 08:40:53,482][100936] Updated weights for policy 0, policy_version 85680 (0.0008) +[2023-10-14 08:40:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 175603712. Throughput: 0: 1638.6, 1: 1652.7. Samples: 43912232. Policy #0 lag: (min: 22.0, avg: 26.4, max: 54.0) +[2023-10-14 08:40:53,513][99942] Avg episode reward: [(0, '0.790'), (1, '1.000')] +[2023-10-14 08:40:53,845][100936] Updated weights for policy 0, policy_version 85690 (0.0007) +[2023-10-14 08:40:57,618][100917] Updated weights for policy 1, policy_version 85832 (0.0011) +[2023-10-14 08:40:57,991][100917] Updated weights for policy 1, policy_version 85842 (0.0008) +[2023-10-14 08:40:58,117][100936] Updated weights for policy 0, policy_version 85700 (0.0008) +[2023-10-14 08:40:58,363][100917] Updated weights for policy 1, policy_version 85852 (0.0008) +[2023-10-14 08:40:58,486][100936] Updated weights for policy 0, policy_version 85710 (0.0009) +[2023-10-14 08:40:58,512][99942] Fps is (10 sec: 9830.6, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 175636480. Throughput: 0: 1644.0, 1: 1657.2. Samples: 43922046. Policy #0 lag: (min: 22.0, avg: 26.4, max: 54.0) +[2023-10-14 08:40:58,512][99942] Avg episode reward: [(0, '0.790'), (1, '1.000')] +[2023-10-14 08:40:58,860][100936] Updated weights for policy 0, policy_version 85720 (0.0009) +[2023-10-14 08:41:02,432][100917] Updated weights for policy 1, policy_version 85862 (0.0009) +[2023-10-14 08:41:02,799][100917] Updated weights for policy 1, policy_version 85872 (0.0009) +[2023-10-14 08:41:02,945][100936] Updated weights for policy 0, policy_version 85730 (0.0007) +[2023-10-14 08:41:03,177][100917] Updated weights for policy 1, policy_version 85882 (0.0010) +[2023-10-14 08:41:03,322][100936] Updated weights for policy 0, policy_version 85740 (0.0009) +[2023-10-14 08:41:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 175734784. Throughput: 0: 1647.6, 1: 1659.3. Samples: 43942508. Policy #0 lag: (min: 22.0, avg: 26.4, max: 54.0) +[2023-10-14 08:41:03,513][99942] Avg episode reward: [(0, '0.790'), (1, '1.000')] +[2023-10-14 08:41:03,686][100936] Updated weights for policy 0, policy_version 85750 (0.0008) +[2023-10-14 08:41:04,049][100936] Updated weights for policy 0, policy_version 85760 (0.0009) +[2023-10-14 08:41:07,331][100917] Updated weights for policy 1, policy_version 85892 (0.0010) +[2023-10-14 08:41:07,706][100917] Updated weights for policy 1, policy_version 85902 (0.0011) +[2023-10-14 08:41:08,088][100917] Updated weights for policy 1, policy_version 85912 (0.0010) +[2023-10-14 08:41:08,260][100936] Updated weights for policy 0, policy_version 85770 (0.0007) +[2023-10-14 08:41:08,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 175800320. Throughput: 0: 1644.5, 1: 1655.2. Samples: 43961850. Policy #0 lag: (min: 22.0, avg: 26.4, max: 54.0) +[2023-10-14 08:41:08,513][99942] Avg episode reward: [(0, '0.790'), (1, '1.000')] +[2023-10-14 08:41:08,628][100936] Updated weights for policy 0, policy_version 85780 (0.0009) +[2023-10-14 08:41:09,005][100936] Updated weights for policy 0, policy_version 85790 (0.0008) +[2023-10-14 08:41:12,361][100917] Updated weights for policy 1, policy_version 85922 (0.0011) +[2023-10-14 08:41:12,736][100917] Updated weights for policy 1, policy_version 85932 (0.0009) +[2023-10-14 08:41:13,091][100936] Updated weights for policy 0, policy_version 85800 (0.0010) +[2023-10-14 08:41:13,107][100917] Updated weights for policy 1, policy_version 85942 (0.0008) +[2023-10-14 08:41:13,467][100936] Updated weights for policy 0, policy_version 85810 (0.0009) +[2023-10-14 08:41:13,480][100917] Updated weights for policy 1, policy_version 85952 (0.0008) +[2023-10-14 08:41:13,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 175865856. Throughput: 0: 1651.9, 1: 1652.0. Samples: 43971952. Policy #0 lag: (min: 22.0, avg: 26.4, max: 54.0) +[2023-10-14 08:41:13,512][99942] Avg episode reward: [(0, '0.790'), (1, '1.000')] +[2023-10-14 08:41:13,838][100936] Updated weights for policy 0, policy_version 85820 (0.0008) +[2023-10-14 08:41:17,376][100917] Updated weights for policy 1, policy_version 85962 (0.0008) +[2023-10-14 08:41:17,748][100917] Updated weights for policy 1, policy_version 85972 (0.0008) +[2023-10-14 08:41:18,060][100936] Updated weights for policy 0, policy_version 85830 (0.0007) +[2023-10-14 08:41:18,110][100917] Updated weights for policy 1, policy_version 85982 (0.0009) +[2023-10-14 08:41:18,437][100936] Updated weights for policy 0, policy_version 85840 (0.0009) +[2023-10-14 08:41:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 175931392. Throughput: 0: 1646.4, 1: 1651.5. Samples: 43992268. Policy #0 lag: (min: 22.0, avg: 26.4, max: 54.0) +[2023-10-14 08:41:18,512][99942] Avg episode reward: [(0, '0.790'), (1, '1.000')] +[2023-10-14 08:41:18,813][100936] Updated weights for policy 0, policy_version 85850 (0.0008) +[2023-10-14 08:41:22,274][100917] Updated weights for policy 1, policy_version 85992 (0.0007) +[2023-10-14 08:41:22,646][100917] Updated weights for policy 1, policy_version 86002 (0.0007) +[2023-10-14 08:41:22,936][100936] Updated weights for policy 0, policy_version 85860 (0.0008) +[2023-10-14 08:41:23,022][100917] Updated weights for policy 1, policy_version 86012 (0.0007) +[2023-10-14 08:41:23,298][100936] Updated weights for policy 0, policy_version 85870 (0.0011) +[2023-10-14 08:41:23,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 175996928. Throughput: 0: 1643.8, 1: 1647.3. Samples: 44011054. Policy #0 lag: (min: 22.0, avg: 26.4, max: 54.0) +[2023-10-14 08:41:23,513][99942] Avg episode reward: [(0, '0.790'), (1, '1.000')] +[2023-10-14 08:41:23,662][100936] Updated weights for policy 0, policy_version 85880 (0.0008) +[2023-10-14 08:41:27,152][100917] Updated weights for policy 1, policy_version 86022 (0.0007) +[2023-10-14 08:41:27,531][100917] Updated weights for policy 1, policy_version 86032 (0.0009) +[2023-10-14 08:41:27,666][100936] Updated weights for policy 0, policy_version 85890 (0.0007) +[2023-10-14 08:41:27,905][100917] Updated weights for policy 1, policy_version 86042 (0.0008) +[2023-10-14 08:41:28,031][100936] Updated weights for policy 0, policy_version 85900 (0.0007) +[2023-10-14 08:41:28,402][100936] Updated weights for policy 0, policy_version 85910 (0.0007) +[2023-10-14 08:41:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 176062464. Throughput: 0: 1653.2, 1: 1655.4. Samples: 44021660. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:41:28,513][99942] Avg episode reward: [(0, '0.790'), (1, '1.000')] +[2023-10-14 08:41:28,772][100936] Updated weights for policy 0, policy_version 85920 (0.0007) +[2023-10-14 08:41:31,920][100917] Updated weights for policy 1, policy_version 86052 (0.0009) +[2023-10-14 08:41:32,301][100917] Updated weights for policy 1, policy_version 86062 (0.0011) +[2023-10-14 08:41:32,665][100917] Updated weights for policy 1, policy_version 86072 (0.0009) +[2023-10-14 08:41:32,940][100936] Updated weights for policy 0, policy_version 85930 (0.0007) +[2023-10-14 08:41:33,311][100936] Updated weights for policy 0, policy_version 85940 (0.0007) +[2023-10-14 08:41:33,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 176128000. Throughput: 0: 1657.6, 1: 1650.9. Samples: 44041926. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:41:33,512][99942] Avg episode reward: [(0, '0.790'), (1, '1.000')] +[2023-10-14 08:41:33,676][100936] Updated weights for policy 0, policy_version 85950 (0.0007) +[2023-10-14 08:41:36,756][100917] Updated weights for policy 1, policy_version 86082 (0.0009) +[2023-10-14 08:41:37,126][100917] Updated weights for policy 1, policy_version 86092 (0.0010) +[2023-10-14 08:41:37,497][100917] Updated weights for policy 1, policy_version 86102 (0.0009) +[2023-10-14 08:41:37,868][100917] Updated weights for policy 1, policy_version 86112 (0.0009) +[2023-10-14 08:41:37,896][100936] Updated weights for policy 0, policy_version 85960 (0.0008) +[2023-10-14 08:41:38,262][100936] Updated weights for policy 0, policy_version 85970 (0.0008) +[2023-10-14 08:41:38,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 176193536. Throughput: 0: 1648.1, 1: 1645.4. Samples: 44060442. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:41:38,512][99942] Avg episode reward: [(0, '0.790'), (1, '1.000')] +[2023-10-14 08:41:38,520][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000086112_88178688.pth... +[2023-10-14 08:41:38,555][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000084544_86573056.pth +[2023-10-14 08:41:38,633][100936] Updated weights for policy 0, policy_version 85980 (0.0008) +[2023-10-14 08:41:38,778][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000085984_88047616.pth... +[2023-10-14 08:41:38,806][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000084416_86441984.pth +[2023-10-14 08:41:41,981][100917] Updated weights for policy 1, policy_version 86122 (0.0008) +[2023-10-14 08:41:42,343][100917] Updated weights for policy 1, policy_version 86132 (0.0008) +[2023-10-14 08:41:42,709][100936] Updated weights for policy 0, policy_version 85990 (0.0009) +[2023-10-14 08:41:42,726][100917] Updated weights for policy 1, policy_version 86142 (0.0008) +[2023-10-14 08:41:43,085][100936] Updated weights for policy 0, policy_version 86000 (0.0007) +[2023-10-14 08:41:43,452][100936] Updated weights for policy 0, policy_version 86010 (0.0008) +[2023-10-14 08:41:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 176259072. Throughput: 0: 1660.2, 1: 1664.4. Samples: 44071656. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:41:43,512][99942] Avg episode reward: [(0, '0.790'), (1, '1.000')] +[2023-10-14 08:41:46,853][100917] Updated weights for policy 1, policy_version 86152 (0.0010) +[2023-10-14 08:41:47,230][100917] Updated weights for policy 1, policy_version 86162 (0.0009) +[2023-10-14 08:41:47,605][100917] Updated weights for policy 1, policy_version 86172 (0.0009) +[2023-10-14 08:41:47,646][100936] Updated weights for policy 0, policy_version 86020 (0.0010) +[2023-10-14 08:41:48,015][100936] Updated weights for policy 0, policy_version 86030 (0.0009) +[2023-10-14 08:41:48,381][100936] Updated weights for policy 0, policy_version 86040 (0.0009) +[2023-10-14 08:41:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 176324608. Throughput: 0: 1658.6, 1: 1649.4. Samples: 44091366. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:41:48,512][99942] Avg episode reward: [(0, '0.790'), (1, '1.000')] +[2023-10-14 08:41:51,809][100917] Updated weights for policy 1, policy_version 86182 (0.0008) +[2023-10-14 08:41:52,187][100917] Updated weights for policy 1, policy_version 86192 (0.0007) +[2023-10-14 08:41:52,416][100936] Updated weights for policy 0, policy_version 86050 (0.0009) +[2023-10-14 08:41:52,558][100917] Updated weights for policy 1, policy_version 86202 (0.0007) +[2023-10-14 08:41:52,790][100936] Updated weights for policy 0, policy_version 86060 (0.0008) +[2023-10-14 08:41:53,162][100936] Updated weights for policy 0, policy_version 86070 (0.0007) +[2023-10-14 08:41:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 176390144. Throughput: 0: 1642.4, 1: 1646.1. Samples: 44109834. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:41:53,513][99942] Avg episode reward: [(0, '0.790'), (1, '1.000')] +[2023-10-14 08:41:53,524][100936] Updated weights for policy 0, policy_version 86080 (0.0007) +[2023-10-14 08:41:56,660][100917] Updated weights for policy 1, policy_version 86212 (0.0009) +[2023-10-14 08:41:57,029][100917] Updated weights for policy 1, policy_version 86222 (0.0010) +[2023-10-14 08:41:57,406][100917] Updated weights for policy 1, policy_version 86232 (0.0009) +[2023-10-14 08:41:57,716][100936] Updated weights for policy 0, policy_version 86090 (0.0009) +[2023-10-14 08:41:58,079][100936] Updated weights for policy 0, policy_version 86100 (0.0010) +[2023-10-14 08:41:58,452][100936] Updated weights for policy 0, policy_version 86110 (0.0008) +[2023-10-14 08:41:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 176455680. Throughput: 0: 1654.0, 1: 1663.5. Samples: 44121238. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:41:58,512][99942] Avg episode reward: [(0, '0.750'), (1, '1.000')] +[2023-10-14 08:42:01,332][100917] Updated weights for policy 1, policy_version 86242 (0.0009) +[2023-10-14 08:42:01,708][100917] Updated weights for policy 1, policy_version 86252 (0.0007) +[2023-10-14 08:42:02,069][100917] Updated weights for policy 1, policy_version 86262 (0.0008) +[2023-10-14 08:42:02,439][100917] Updated weights for policy 1, policy_version 86272 (0.0008) +[2023-10-14 08:42:02,708][100936] Updated weights for policy 0, policy_version 86120 (0.0009) +[2023-10-14 08:42:03,067][100936] Updated weights for policy 0, policy_version 86130 (0.0011) +[2023-10-14 08:42:03,431][100936] Updated weights for policy 0, policy_version 86140 (0.0011) +[2023-10-14 08:42:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 176521216. Throughput: 0: 1655.0, 1: 1652.8. Samples: 44141120. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:42:03,512][99942] Avg episode reward: [(0, '0.750'), (1, '1.000')] +[2023-10-14 08:42:06,553][100917] Updated weights for policy 1, policy_version 86282 (0.0008) +[2023-10-14 08:42:06,914][100917] Updated weights for policy 1, policy_version 86292 (0.0009) +[2023-10-14 08:42:07,295][100917] Updated weights for policy 1, policy_version 86302 (0.0008) +[2023-10-14 08:42:07,643][100936] Updated weights for policy 0, policy_version 86150 (0.0010) +[2023-10-14 08:42:08,023][100936] Updated weights for policy 0, policy_version 86160 (0.0008) +[2023-10-14 08:42:08,390][100936] Updated weights for policy 0, policy_version 86170 (0.0009) +[2023-10-14 08:42:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 176586752. Throughput: 0: 1647.4, 1: 1662.9. Samples: 44160016. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:42:08,513][99942] Avg episode reward: [(0, '0.750'), (1, '1.000')] +[2023-10-14 08:42:11,357][100917] Updated weights for policy 1, policy_version 86312 (0.0007) +[2023-10-14 08:42:11,730][100917] Updated weights for policy 1, policy_version 86322 (0.0007) +[2023-10-14 08:42:12,106][100917] Updated weights for policy 1, policy_version 86332 (0.0007) +[2023-10-14 08:42:12,590][100936] Updated weights for policy 0, policy_version 86180 (0.0009) +[2023-10-14 08:42:12,950][100936] Updated weights for policy 0, policy_version 86190 (0.0007) +[2023-10-14 08:42:13,316][100936] Updated weights for policy 0, policy_version 86200 (0.0007) +[2023-10-14 08:42:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 176652288. Throughput: 0: 1650.5, 1: 1669.7. Samples: 44171068. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:42:13,513][99942] Avg episode reward: [(0, '0.750'), (1, '1.000')] +[2023-10-14 08:42:16,226][100917] Updated weights for policy 1, policy_version 86342 (0.0009) +[2023-10-14 08:42:16,606][100917] Updated weights for policy 1, policy_version 86352 (0.0009) +[2023-10-14 08:42:16,987][100917] Updated weights for policy 1, policy_version 86362 (0.0011) +[2023-10-14 08:42:17,367][100936] Updated weights for policy 0, policy_version 86210 (0.0009) +[2023-10-14 08:42:17,740][100936] Updated weights for policy 0, policy_version 86220 (0.0007) +[2023-10-14 08:42:18,113][100936] Updated weights for policy 0, policy_version 86230 (0.0007) +[2023-10-14 08:42:18,488][100936] Updated weights for policy 0, policy_version 86240 (0.0007) +[2023-10-14 08:42:18,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 176750592. Throughput: 0: 1648.2, 1: 1654.4. Samples: 44190544. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 08:42:18,513][99942] Avg episode reward: [(0, '0.750'), (1, '1.000')] +[2023-10-14 08:42:21,065][100917] Updated weights for policy 1, policy_version 86372 (0.0010) +[2023-10-14 08:42:21,438][100917] Updated weights for policy 1, policy_version 86382 (0.0009) +[2023-10-14 08:42:21,807][100917] Updated weights for policy 1, policy_version 86392 (0.0008) +[2023-10-14 08:42:22,621][100936] Updated weights for policy 0, policy_version 86250 (0.0007) +[2023-10-14 08:42:23,007][100936] Updated weights for policy 0, policy_version 86260 (0.0008) +[2023-10-14 08:42:23,385][100936] Updated weights for policy 0, policy_version 86270 (0.0009) +[2023-10-14 08:42:23,512][99942] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 176816128. Throughput: 0: 1647.0, 1: 1676.0. Samples: 44209980. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 08:42:23,513][99942] Avg episode reward: [(0, '0.750'), (1, '1.000')] +[2023-10-14 08:42:25,898][100917] Updated weights for policy 1, policy_version 86402 (0.0009) +[2023-10-14 08:42:26,271][100917] Updated weights for policy 1, policy_version 86412 (0.0009) +[2023-10-14 08:42:26,656][100917] Updated weights for policy 1, policy_version 86422 (0.0011) +[2023-10-14 08:42:27,032][100917] Updated weights for policy 1, policy_version 86432 (0.0011) +[2023-10-14 08:42:27,463][100936] Updated weights for policy 0, policy_version 86280 (0.0009) +[2023-10-14 08:42:27,829][100936] Updated weights for policy 0, policy_version 86290 (0.0008) +[2023-10-14 08:42:28,202][100936] Updated weights for policy 0, policy_version 86300 (0.0008) +[2023-10-14 08:42:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 176881664. Throughput: 0: 1649.9, 1: 1668.3. Samples: 44220978. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 08:42:28,513][99942] Avg episode reward: [(0, '0.750'), (1, '1.000')] +[2023-10-14 08:42:31,039][100917] Updated weights for policy 1, policy_version 86442 (0.0007) +[2023-10-14 08:42:31,404][100917] Updated weights for policy 1, policy_version 86452 (0.0009) +[2023-10-14 08:42:31,783][100917] Updated weights for policy 1, policy_version 86462 (0.0011) +[2023-10-14 08:42:32,504][100936] Updated weights for policy 0, policy_version 86310 (0.0008) +[2023-10-14 08:42:32,873][100936] Updated weights for policy 0, policy_version 86320 (0.0008) +[2023-10-14 08:42:33,243][100936] Updated weights for policy 0, policy_version 86330 (0.0007) +[2023-10-14 08:42:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 176947200. Throughput: 0: 1649.7, 1: 1658.2. Samples: 44240220. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 08:42:33,513][99942] Avg episode reward: [(0, '0.750'), (1, '1.000')] +[2023-10-14 08:42:35,972][100917] Updated weights for policy 1, policy_version 86472 (0.0009) +[2023-10-14 08:42:36,349][100917] Updated weights for policy 1, policy_version 86482 (0.0007) +[2023-10-14 08:42:36,716][100917] Updated weights for policy 1, policy_version 86492 (0.0010) +[2023-10-14 08:42:37,199][100936] Updated weights for policy 0, policy_version 86340 (0.0010) +[2023-10-14 08:42:37,576][100936] Updated weights for policy 0, policy_version 86350 (0.0009) +[2023-10-14 08:42:37,947][100936] Updated weights for policy 0, policy_version 86360 (0.0009) +[2023-10-14 08:42:38,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 177012736. Throughput: 0: 1651.8, 1: 1675.4. Samples: 44259558. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 08:42:38,512][99942] Avg episode reward: [(0, '0.750'), (1, '1.000')] +[2023-10-14 08:42:40,891][100917] Updated weights for policy 1, policy_version 86502 (0.0011) +[2023-10-14 08:42:41,269][100917] Updated weights for policy 1, policy_version 86512 (0.0010) +[2023-10-14 08:42:41,645][100917] Updated weights for policy 1, policy_version 86522 (0.0008) +[2023-10-14 08:42:42,033][100936] Updated weights for policy 0, policy_version 86370 (0.0008) +[2023-10-14 08:42:42,402][100936] Updated weights for policy 0, policy_version 86380 (0.0007) +[2023-10-14 08:42:42,768][100936] Updated weights for policy 0, policy_version 86390 (0.0008) +[2023-10-14 08:42:43,136][100936] Updated weights for policy 0, policy_version 86400 (0.0010) +[2023-10-14 08:42:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 177078272. Throughput: 0: 1656.8, 1: 1663.6. Samples: 44270656. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 08:42:43,512][99942] Avg episode reward: [(0, '0.750'), (1, '1.000')] +[2023-10-14 08:42:45,671][100917] Updated weights for policy 1, policy_version 86532 (0.0007) +[2023-10-14 08:42:46,055][100917] Updated weights for policy 1, policy_version 86542 (0.0008) +[2023-10-14 08:42:46,429][100917] Updated weights for policy 1, policy_version 86552 (0.0007) +[2023-10-14 08:42:47,267][100936] Updated weights for policy 0, policy_version 86410 (0.0007) +[2023-10-14 08:42:47,631][100936] Updated weights for policy 0, policy_version 86420 (0.0010) +[2023-10-14 08:42:48,004][100936] Updated weights for policy 0, policy_version 86430 (0.0010) +[2023-10-14 08:42:48,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 177143808. Throughput: 0: 1642.8, 1: 1653.5. Samples: 44289454. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 08:42:48,513][99942] Avg episode reward: [(0, '0.750'), (1, '1.000')] +[2023-10-14 08:42:50,549][100917] Updated weights for policy 1, policy_version 86562 (0.0008) +[2023-10-14 08:42:50,927][100917] Updated weights for policy 1, policy_version 86572 (0.0010) +[2023-10-14 08:42:51,304][100917] Updated weights for policy 1, policy_version 86582 (0.0011) +[2023-10-14 08:42:51,672][100917] Updated weights for policy 1, policy_version 86592 (0.0011) +[2023-10-14 08:42:52,392][100936] Updated weights for policy 0, policy_version 86440 (0.0008) +[2023-10-14 08:42:52,758][100936] Updated weights for policy 0, policy_version 86450 (0.0007) +[2023-10-14 08:42:53,122][100936] Updated weights for policy 0, policy_version 86460 (0.0007) +[2023-10-14 08:42:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 177209344. Throughput: 0: 1645.8, 1: 1665.2. Samples: 44309014. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 08:42:53,513][99942] Avg episode reward: [(0, '0.750'), (1, '1.000')] +[2023-10-14 08:42:55,753][100917] Updated weights for policy 1, policy_version 86602 (0.0008) +[2023-10-14 08:42:56,125][100917] Updated weights for policy 1, policy_version 86612 (0.0009) +[2023-10-14 08:42:56,493][100917] Updated weights for policy 1, policy_version 86622 (0.0010) +[2023-10-14 08:42:57,210][100936] Updated weights for policy 0, policy_version 86470 (0.0009) +[2023-10-14 08:42:57,573][100936] Updated weights for policy 0, policy_version 86480 (0.0011) +[2023-10-14 08:42:57,945][100936] Updated weights for policy 0, policy_version 86490 (0.0008) +[2023-10-14 08:42:58,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 177274880. Throughput: 0: 1659.1, 1: 1654.1. Samples: 44320162. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 08:42:58,512][99942] Avg episode reward: [(0, '0.750'), (1, '1.000')] +[2023-10-14 08:43:00,485][100917] Updated weights for policy 1, policy_version 86632 (0.0008) +[2023-10-14 08:43:00,857][100917] Updated weights for policy 1, policy_version 86642 (0.0008) +[2023-10-14 08:43:01,226][100917] Updated weights for policy 1, policy_version 86652 (0.0008) +[2023-10-14 08:43:01,929][100936] Updated weights for policy 0, policy_version 86500 (0.0009) +[2023-10-14 08:43:02,296][100936] Updated weights for policy 0, policy_version 86510 (0.0008) +[2023-10-14 08:43:02,663][100936] Updated weights for policy 0, policy_version 86520 (0.0010) +[2023-10-14 08:43:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 177340416. Throughput: 0: 1650.1, 1: 1665.8. Samples: 44339758. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 08:43:03,512][99942] Avg episode reward: [(0, '0.750'), (1, '1.000')] +[2023-10-14 08:43:05,351][100917] Updated weights for policy 1, policy_version 86662 (0.0009) +[2023-10-14 08:43:05,728][100917] Updated weights for policy 1, policy_version 86672 (0.0010) +[2023-10-14 08:43:06,112][100917] Updated weights for policy 1, policy_version 86682 (0.0010) +[2023-10-14 08:43:06,611][100936] Updated weights for policy 0, policy_version 86530 (0.0009) +[2023-10-14 08:43:06,981][100936] Updated weights for policy 0, policy_version 86540 (0.0010) +[2023-10-14 08:43:07,340][100936] Updated weights for policy 0, policy_version 86550 (0.0011) +[2023-10-14 08:43:07,710][100936] Updated weights for policy 0, policy_version 86560 (0.0009) +[2023-10-14 08:43:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 177405952. Throughput: 0: 1660.6, 1: 1669.4. Samples: 44359828. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) +[2023-10-14 08:43:08,513][99942] Avg episode reward: [(0, '0.750'), (1, '1.000')] +[2023-10-14 08:43:10,243][100917] Updated weights for policy 1, policy_version 86692 (0.0010) +[2023-10-14 08:43:10,622][100917] Updated weights for policy 1, policy_version 86702 (0.0009) +[2023-10-14 08:43:11,007][100917] Updated weights for policy 1, policy_version 86712 (0.0009) +[2023-10-14 08:43:11,904][100936] Updated weights for policy 0, policy_version 86570 (0.0007) +[2023-10-14 08:43:12,277][100936] Updated weights for policy 0, policy_version 86580 (0.0007) +[2023-10-14 08:43:12,644][100936] Updated weights for policy 0, policy_version 86590 (0.0008) +[2023-10-14 08:43:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 177471488. Throughput: 0: 1669.6, 1: 1656.8. Samples: 44370664. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:43:13,513][99942] Avg episode reward: [(0, '0.750'), (1, '1.000')] +[2023-10-14 08:43:15,103][100917] Updated weights for policy 1, policy_version 86722 (0.0009) +[2023-10-14 08:43:15,476][100917] Updated weights for policy 1, policy_version 86732 (0.0009) +[2023-10-14 08:43:15,840][100917] Updated weights for policy 1, policy_version 86742 (0.0009) +[2023-10-14 08:43:16,221][100917] Updated weights for policy 1, policy_version 86752 (0.0010) +[2023-10-14 08:43:16,936][100936] Updated weights for policy 0, policy_version 86600 (0.0009) +[2023-10-14 08:43:17,300][100936] Updated weights for policy 0, policy_version 86610 (0.0008) +[2023-10-14 08:43:17,664][100936] Updated weights for policy 0, policy_version 86620 (0.0010) +[2023-10-14 08:43:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 177537024. Throughput: 0: 1653.0, 1: 1669.1. Samples: 44389714. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:43:18,512][99942] Avg episode reward: [(0, '0.820'), (1, '1.000')] +[2023-10-14 08:43:20,543][100917] Updated weights for policy 1, policy_version 86762 (0.0009) +[2023-10-14 08:43:20,913][100917] Updated weights for policy 1, policy_version 86772 (0.0008) +[2023-10-14 08:43:21,285][100917] Updated weights for policy 1, policy_version 86782 (0.0010) +[2023-10-14 08:43:21,793][100936] Updated weights for policy 0, policy_version 86630 (0.0008) +[2023-10-14 08:43:22,168][100936] Updated weights for policy 0, policy_version 86640 (0.0007) +[2023-10-14 08:43:22,527][100936] Updated weights for policy 0, policy_version 86650 (0.0007) +[2023-10-14 08:43:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 177602560. Throughput: 0: 1662.9, 1: 1669.8. Samples: 44409530. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:43:23,512][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:43:25,374][100917] Updated weights for policy 1, policy_version 86792 (0.0009) +[2023-10-14 08:43:25,749][100917] Updated weights for policy 1, policy_version 86802 (0.0009) +[2023-10-14 08:43:26,132][100917] Updated weights for policy 1, policy_version 86812 (0.0009) +[2023-10-14 08:43:26,742][100936] Updated weights for policy 0, policy_version 86660 (0.0009) +[2023-10-14 08:43:27,108][100936] Updated weights for policy 0, policy_version 86670 (0.0010) +[2023-10-14 08:43:27,475][100936] Updated weights for policy 0, policy_version 86680 (0.0008) +[2023-10-14 08:43:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 177668096. Throughput: 0: 1664.0, 1: 1657.0. Samples: 44420100. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:43:28,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:43:30,262][100917] Updated weights for policy 1, policy_version 86822 (0.0009) +[2023-10-14 08:43:30,629][100917] Updated weights for policy 1, policy_version 86832 (0.0007) +[2023-10-14 08:43:30,999][100917] Updated weights for policy 1, policy_version 86842 (0.0008) +[2023-10-14 08:43:31,441][100936] Updated weights for policy 0, policy_version 86690 (0.0010) +[2023-10-14 08:43:31,812][100936] Updated weights for policy 0, policy_version 86700 (0.0008) +[2023-10-14 08:43:32,176][100936] Updated weights for policy 0, policy_version 86710 (0.0010) +[2023-10-14 08:43:32,547][100936] Updated weights for policy 0, policy_version 86720 (0.0008) +[2023-10-14 08:43:33,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 177733632. Throughput: 0: 1660.0, 1: 1671.4. Samples: 44439368. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:43:33,514][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:43:34,998][100917] Updated weights for policy 1, policy_version 86852 (0.0008) +[2023-10-14 08:43:35,376][100917] Updated weights for policy 1, policy_version 86862 (0.0008) +[2023-10-14 08:43:35,755][100917] Updated weights for policy 1, policy_version 86872 (0.0009) +[2023-10-14 08:43:36,536][100936] Updated weights for policy 0, policy_version 86730 (0.0009) +[2023-10-14 08:43:36,915][100936] Updated weights for policy 0, policy_version 86740 (0.0010) +[2023-10-14 08:43:37,287][100936] Updated weights for policy 0, policy_version 86750 (0.0010) +[2023-10-14 08:43:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 177799168. Throughput: 0: 1673.0, 1: 1673.8. Samples: 44459620. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:43:38,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:43:38,522][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000086752_88834048.pth... +[2023-10-14 08:43:38,522][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000086880_88965120.pth... +[2023-10-14 08:43:38,561][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000085216_87261184.pth +[2023-10-14 08:43:38,563][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000085344_87392256.pth +[2023-10-14 08:43:39,783][100917] Updated weights for policy 1, policy_version 86882 (0.0009) +[2023-10-14 08:43:40,158][100917] Updated weights for policy 1, policy_version 86892 (0.0007) +[2023-10-14 08:43:40,528][100917] Updated weights for policy 1, policy_version 86902 (0.0007) +[2023-10-14 08:43:40,907][100917] Updated weights for policy 1, policy_version 86912 (0.0007) +[2023-10-14 08:43:41,395][100936] Updated weights for policy 0, policy_version 86760 (0.0010) +[2023-10-14 08:43:41,768][100936] Updated weights for policy 0, policy_version 86770 (0.0009) +[2023-10-14 08:43:42,146][100936] Updated weights for policy 0, policy_version 86780 (0.0009) +[2023-10-14 08:43:43,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 177864704. Throughput: 0: 1662.9, 1: 1657.4. Samples: 44469576. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:43:43,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:43:44,993][100917] Updated weights for policy 1, policy_version 86922 (0.0008) +[2023-10-14 08:43:45,364][100917] Updated weights for policy 1, policy_version 86932 (0.0007) +[2023-10-14 08:43:45,740][100917] Updated weights for policy 1, policy_version 86942 (0.0007) +[2023-10-14 08:43:46,158][100936] Updated weights for policy 0, policy_version 86790 (0.0009) +[2023-10-14 08:43:46,523][100936] Updated weights for policy 0, policy_version 86800 (0.0011) +[2023-10-14 08:43:46,897][100936] Updated weights for policy 0, policy_version 86810 (0.0008) +[2023-10-14 08:43:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 177930240. Throughput: 0: 1654.7, 1: 1665.1. Samples: 44489150. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:43:48,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:43:49,872][100917] Updated weights for policy 1, policy_version 86952 (0.0009) +[2023-10-14 08:43:50,247][100917] Updated weights for policy 1, policy_version 86962 (0.0010) +[2023-10-14 08:43:50,623][100917] Updated weights for policy 1, policy_version 86972 (0.0009) +[2023-10-14 08:43:51,153][100936] Updated weights for policy 0, policy_version 86820 (0.0008) +[2023-10-14 08:43:51,523][100936] Updated weights for policy 0, policy_version 86830 (0.0011) +[2023-10-14 08:43:51,891][100936] Updated weights for policy 0, policy_version 86840 (0.0008) +[2023-10-14 08:43:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 177995776. Throughput: 0: 1666.8, 1: 1666.7. Samples: 44509832. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:43:53,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:43:54,646][100917] Updated weights for policy 1, policy_version 86982 (0.0009) +[2023-10-14 08:43:55,018][100917] Updated weights for policy 1, policy_version 86992 (0.0011) +[2023-10-14 08:43:55,395][100917] Updated weights for policy 1, policy_version 87002 (0.0007) +[2023-10-14 08:43:56,087][100936] Updated weights for policy 0, policy_version 86850 (0.0008) +[2023-10-14 08:43:56,458][100936] Updated weights for policy 0, policy_version 86860 (0.0007) +[2023-10-14 08:43:56,825][100936] Updated weights for policy 0, policy_version 86870 (0.0010) +[2023-10-14 08:43:57,201][100936] Updated weights for policy 0, policy_version 86880 (0.0007) +[2023-10-14 08:43:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 178061312. Throughput: 0: 1652.0, 1: 1654.2. Samples: 44519444. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:43:58,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:43:59,560][100917] Updated weights for policy 1, policy_version 87012 (0.0011) +[2023-10-14 08:43:59,937][100917] Updated weights for policy 1, policy_version 87022 (0.0007) +[2023-10-14 08:44:00,315][100917] Updated weights for policy 1, policy_version 87032 (0.0008) +[2023-10-14 08:44:01,454][100936] Updated weights for policy 0, policy_version 86890 (0.0008) +[2023-10-14 08:44:01,817][100936] Updated weights for policy 0, policy_version 86900 (0.0009) +[2023-10-14 08:44:02,189][100936] Updated weights for policy 0, policy_version 86910 (0.0008) +[2023-10-14 08:44:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 178126848. Throughput: 0: 1655.5, 1: 1661.9. Samples: 44539000. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:44:03,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:44:04,604][100917] Updated weights for policy 1, policy_version 87042 (0.0008) +[2023-10-14 08:44:05,017][100917] Updated weights for policy 1, policy_version 87052 (0.0008) +[2023-10-14 08:44:05,390][100917] Updated weights for policy 1, policy_version 87062 (0.0007) +[2023-10-14 08:44:05,748][100917] Updated weights for policy 1, policy_version 87072 (0.0008) +[2023-10-14 08:44:06,170][100936] Updated weights for policy 0, policy_version 86920 (0.0010) +[2023-10-14 08:44:06,531][100936] Updated weights for policy 0, policy_version 86930 (0.0008) +[2023-10-14 08:44:06,906][100936] Updated weights for policy 0, policy_version 86940 (0.0010) +[2023-10-14 08:44:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 178192384. Throughput: 0: 1674.7, 1: 1655.5. Samples: 44559394. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 08:44:08,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:44:09,857][100917] Updated weights for policy 1, policy_version 87082 (0.0011) +[2023-10-14 08:44:10,239][100917] Updated weights for policy 1, policy_version 87092 (0.0009) +[2023-10-14 08:44:10,620][100917] Updated weights for policy 1, policy_version 87102 (0.0008) +[2023-10-14 08:44:10,900][100936] Updated weights for policy 0, policy_version 86950 (0.0008) +[2023-10-14 08:44:11,276][100936] Updated weights for policy 0, policy_version 86960 (0.0007) +[2023-10-14 08:44:11,645][100936] Updated weights for policy 0, policy_version 86970 (0.0008) +[2023-10-14 08:44:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 178257920. Throughput: 0: 1657.2, 1: 1650.0. Samples: 44568922. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 08:44:13,512][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:44:14,756][100917] Updated weights for policy 1, policy_version 87112 (0.0010) +[2023-10-14 08:44:15,135][100917] Updated weights for policy 1, policy_version 87122 (0.0008) +[2023-10-14 08:44:15,497][100917] Updated weights for policy 1, policy_version 87132 (0.0009) +[2023-10-14 08:44:15,515][100936] Updated weights for policy 0, policy_version 86980 (0.0009) +[2023-10-14 08:44:15,883][100936] Updated weights for policy 0, policy_version 86990 (0.0007) +[2023-10-14 08:44:16,262][100936] Updated weights for policy 0, policy_version 87000 (0.0007) +[2023-10-14 08:44:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 178323456. Throughput: 0: 1665.6, 1: 1658.0. Samples: 44588930. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 08:44:18,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:44:19,561][100917] Updated weights for policy 1, policy_version 87142 (0.0007) +[2023-10-14 08:44:19,927][100917] Updated weights for policy 1, policy_version 87152 (0.0007) +[2023-10-14 08:44:20,308][100917] Updated weights for policy 1, policy_version 87162 (0.0007) +[2023-10-14 08:44:20,490][100936] Updated weights for policy 0, policy_version 87010 (0.0009) +[2023-10-14 08:44:20,859][100936] Updated weights for policy 0, policy_version 87020 (0.0007) +[2023-10-14 08:44:21,220][100936] Updated weights for policy 0, policy_version 87030 (0.0009) +[2023-10-14 08:44:21,596][100936] Updated weights for policy 0, policy_version 87040 (0.0008) +[2023-10-14 08:44:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 178388992. Throughput: 0: 1670.4, 1: 1661.6. Samples: 44609560. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 08:44:23,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:44:24,402][100917] Updated weights for policy 1, policy_version 87172 (0.0008) +[2023-10-14 08:44:24,773][100917] Updated weights for policy 1, policy_version 87182 (0.0009) +[2023-10-14 08:44:25,137][100917] Updated weights for policy 1, policy_version 87192 (0.0010) +[2023-10-14 08:44:25,758][100936] Updated weights for policy 0, policy_version 87050 (0.0010) +[2023-10-14 08:44:26,124][100936] Updated weights for policy 0, policy_version 87060 (0.0009) +[2023-10-14 08:44:26,486][100936] Updated weights for policy 0, policy_version 87070 (0.0011) +[2023-10-14 08:44:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 178454528. Throughput: 0: 1652.3, 1: 1662.3. Samples: 44618734. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 08:44:28,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:44:29,209][100917] Updated weights for policy 1, policy_version 87202 (0.0010) +[2023-10-14 08:44:29,575][100917] Updated weights for policy 1, policy_version 87212 (0.0009) +[2023-10-14 08:44:29,940][100917] Updated weights for policy 1, policy_version 87222 (0.0010) +[2023-10-14 08:44:30,319][100917] Updated weights for policy 1, policy_version 87232 (0.0008) +[2023-10-14 08:44:30,784][100936] Updated weights for policy 0, policy_version 87080 (0.0010) +[2023-10-14 08:44:31,161][100936] Updated weights for policy 0, policy_version 87090 (0.0009) +[2023-10-14 08:44:31,535][100936] Updated weights for policy 0, policy_version 87100 (0.0009) +[2023-10-14 08:44:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 178520064. Throughput: 0: 1657.6, 1: 1664.2. Samples: 44638630. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 08:44:33,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:44:34,365][100917] Updated weights for policy 1, policy_version 87242 (0.0008) +[2023-10-14 08:44:34,745][100917] Updated weights for policy 1, policy_version 87252 (0.0008) +[2023-10-14 08:44:35,113][100917] Updated weights for policy 1, policy_version 87262 (0.0008) +[2023-10-14 08:44:35,645][100936] Updated weights for policy 0, policy_version 87110 (0.0010) +[2023-10-14 08:44:36,021][100936] Updated weights for policy 0, policy_version 87120 (0.0009) +[2023-10-14 08:44:36,399][100936] Updated weights for policy 0, policy_version 87130 (0.0009) +[2023-10-14 08:44:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 178585600. Throughput: 0: 1654.3, 1: 1662.6. Samples: 44659092. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 08:44:38,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:44:39,242][100917] Updated weights for policy 1, policy_version 87272 (0.0009) +[2023-10-14 08:44:39,618][100917] Updated weights for policy 1, policy_version 87282 (0.0009) +[2023-10-14 08:44:39,987][100917] Updated weights for policy 1, policy_version 87292 (0.0007) +[2023-10-14 08:44:40,590][100936] Updated weights for policy 0, policy_version 87140 (0.0009) +[2023-10-14 08:44:40,957][100936] Updated weights for policy 0, policy_version 87150 (0.0012) +[2023-10-14 08:44:41,329][100936] Updated weights for policy 0, policy_version 87160 (0.0010) +[2023-10-14 08:44:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 178651136. Throughput: 0: 1647.6, 1: 1662.3. Samples: 44668394. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 08:44:43,514][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:44:43,998][100917] Updated weights for policy 1, policy_version 87302 (0.0009) +[2023-10-14 08:44:44,360][100917] Updated weights for policy 1, policy_version 87312 (0.0007) +[2023-10-14 08:44:44,739][100917] Updated weights for policy 1, policy_version 87322 (0.0009) +[2023-10-14 08:44:45,569][100936] Updated weights for policy 0, policy_version 87170 (0.0009) +[2023-10-14 08:44:45,927][100936] Updated weights for policy 0, policy_version 87180 (0.0009) +[2023-10-14 08:44:46,298][100936] Updated weights for policy 0, policy_version 87190 (0.0010) +[2023-10-14 08:44:46,670][100936] Updated weights for policy 0, policy_version 87200 (0.0008) +[2023-10-14 08:44:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 178716672. Throughput: 0: 1653.8, 1: 1667.3. Samples: 44688450. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 08:44:48,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:44:48,837][100917] Updated weights for policy 1, policy_version 87332 (0.0008) +[2023-10-14 08:44:49,202][100917] Updated weights for policy 1, policy_version 87342 (0.0007) +[2023-10-14 08:44:49,563][100917] Updated weights for policy 1, policy_version 87352 (0.0007) +[2023-10-14 08:44:50,786][100936] Updated weights for policy 0, policy_version 87210 (0.0007) +[2023-10-14 08:44:51,148][100936] Updated weights for policy 0, policy_version 87220 (0.0007) +[2023-10-14 08:44:51,522][100936] Updated weights for policy 0, policy_version 87230 (0.0010) +[2023-10-14 08:44:53,512][99942] Fps is (10 sec: 13107.7, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 178782208. Throughput: 0: 1645.5, 1: 1673.3. Samples: 44708740. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 08:44:53,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:44:53,700][100917] Updated weights for policy 1, policy_version 87362 (0.0009) +[2023-10-14 08:44:54,099][100917] Updated weights for policy 1, policy_version 87372 (0.0007) +[2023-10-14 08:44:54,470][100917] Updated weights for policy 1, policy_version 87382 (0.0007) +[2023-10-14 08:44:54,835][100917] Updated weights for policy 1, policy_version 87392 (0.0007) +[2023-10-14 08:44:55,719][100936] Updated weights for policy 0, policy_version 87240 (0.0008) +[2023-10-14 08:44:56,087][100936] Updated weights for policy 0, policy_version 87250 (0.0009) +[2023-10-14 08:44:56,454][100936] Updated weights for policy 0, policy_version 87260 (0.0008) +[2023-10-14 08:44:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 178847744. Throughput: 0: 1639.4, 1: 1671.6. Samples: 44717918. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-10-14 08:44:58,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:44:58,999][100917] Updated weights for policy 1, policy_version 87402 (0.0008) +[2023-10-14 08:44:59,376][100917] Updated weights for policy 1, policy_version 87412 (0.0009) +[2023-10-14 08:44:59,747][100917] Updated weights for policy 1, policy_version 87422 (0.0010) +[2023-10-14 08:45:00,772][100936] Updated weights for policy 0, policy_version 87270 (0.0007) +[2023-10-14 08:45:01,133][100936] Updated weights for policy 0, policy_version 87280 (0.0010) +[2023-10-14 08:45:01,500][100936] Updated weights for policy 0, policy_version 87290 (0.0010) +[2023-10-14 08:45:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 178913280. Throughput: 0: 1640.7, 1: 1675.1. Samples: 44738138. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:45:03,513][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:45:03,961][100917] Updated weights for policy 1, policy_version 87432 (0.0010) +[2023-10-14 08:45:04,321][100917] Updated weights for policy 1, policy_version 87442 (0.0010) +[2023-10-14 08:45:04,694][100917] Updated weights for policy 1, policy_version 87452 (0.0009) +[2023-10-14 08:45:05,614][100936] Updated weights for policy 0, policy_version 87300 (0.0010) +[2023-10-14 08:45:05,980][100936] Updated weights for policy 0, policy_version 87310 (0.0009) +[2023-10-14 08:45:06,348][100936] Updated weights for policy 0, policy_version 87320 (0.0008) +[2023-10-14 08:45:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 178978816. Throughput: 0: 1648.1, 1: 1672.8. Samples: 44759002. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:45:08,512][99942] Avg episode reward: [(0, '0.990'), (1, '1.000')] +[2023-10-14 08:45:08,582][100917] Updated weights for policy 1, policy_version 87462 (0.0008) +[2023-10-14 08:45:08,943][100917] Updated weights for policy 1, policy_version 87472 (0.0007) +[2023-10-14 08:45:09,323][100917] Updated weights for policy 1, policy_version 87482 (0.0007) +[2023-10-14 08:45:10,299][100936] Updated weights for policy 0, policy_version 87330 (0.0008) +[2023-10-14 08:45:10,666][100936] Updated weights for policy 0, policy_version 87340 (0.0009) +[2023-10-14 08:45:11,034][100936] Updated weights for policy 0, policy_version 87350 (0.0009) +[2023-10-14 08:45:11,400][100936] Updated weights for policy 0, policy_version 87360 (0.0009) +[2023-10-14 08:45:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 179044352. Throughput: 0: 1647.6, 1: 1672.6. Samples: 44768140. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:45:13,512][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:45:13,517][100917] Updated weights for policy 1, policy_version 87492 (0.0009) +[2023-10-14 08:45:13,884][100917] Updated weights for policy 1, policy_version 87502 (0.0010) +[2023-10-14 08:45:14,258][100917] Updated weights for policy 1, policy_version 87512 (0.0009) +[2023-10-14 08:45:15,700][100936] Updated weights for policy 0, policy_version 87370 (0.0009) +[2023-10-14 08:45:16,074][100936] Updated weights for policy 0, policy_version 87380 (0.0008) +[2023-10-14 08:45:16,449][100936] Updated weights for policy 0, policy_version 87390 (0.0008) +[2023-10-14 08:45:18,335][100917] Updated weights for policy 1, policy_version 87522 (0.0009) +[2023-10-14 08:45:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 179109888. Throughput: 0: 1657.2, 1: 1671.4. Samples: 44788416. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:45:18,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:45:18,710][100917] Updated weights for policy 1, policy_version 87532 (0.0011) +[2023-10-14 08:45:19,094][100917] Updated weights for policy 1, policy_version 87542 (0.0008) +[2023-10-14 08:45:19,452][100917] Updated weights for policy 1, policy_version 87552 (0.0007) +[2023-10-14 08:45:20,406][100936] Updated weights for policy 0, policy_version 87400 (0.0008) +[2023-10-14 08:45:20,768][100936] Updated weights for policy 0, policy_version 87410 (0.0009) +[2023-10-14 08:45:21,141][100936] Updated weights for policy 0, policy_version 87420 (0.0009) +[2023-10-14 08:45:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 179175424. Throughput: 0: 1662.0, 1: 1667.0. Samples: 44808894. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:45:23,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:45:23,551][100917] Updated weights for policy 1, policy_version 87562 (0.0010) +[2023-10-14 08:45:23,911][100917] Updated weights for policy 1, policy_version 87572 (0.0010) +[2023-10-14 08:45:24,277][100917] Updated weights for policy 1, policy_version 87582 (0.0007) +[2023-10-14 08:45:25,348][100936] Updated weights for policy 0, policy_version 87430 (0.0007) +[2023-10-14 08:45:25,721][100936] Updated weights for policy 0, policy_version 87440 (0.0008) +[2023-10-14 08:45:26,099][100936] Updated weights for policy 0, policy_version 87450 (0.0009) +[2023-10-14 08:45:28,452][100917] Updated weights for policy 1, policy_version 87592 (0.0010) +[2023-10-14 08:45:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 179240960. Throughput: 0: 1653.7, 1: 1668.5. Samples: 44817890. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:45:28,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:45:28,828][100917] Updated weights for policy 1, policy_version 87602 (0.0010) +[2023-10-14 08:45:29,200][100917] Updated weights for policy 1, policy_version 87612 (0.0010) +[2023-10-14 08:45:30,271][100936] Updated weights for policy 0, policy_version 87460 (0.0009) +[2023-10-14 08:45:30,641][100936] Updated weights for policy 0, policy_version 87470 (0.0008) +[2023-10-14 08:45:31,011][100936] Updated weights for policy 0, policy_version 87480 (0.0009) +[2023-10-14 08:45:33,465][100917] Updated weights for policy 1, policy_version 87622 (0.0009) +[2023-10-14 08:45:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 179306496. Throughput: 0: 1664.1, 1: 1663.0. Samples: 44838168. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:45:33,512][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:45:33,845][100917] Updated weights for policy 1, policy_version 87632 (0.0008) +[2023-10-14 08:45:34,210][100917] Updated weights for policy 1, policy_version 87642 (0.0009) +[2023-10-14 08:45:35,114][100936] Updated weights for policy 0, policy_version 87490 (0.0009) +[2023-10-14 08:45:35,478][100936] Updated weights for policy 0, policy_version 87500 (0.0010) +[2023-10-14 08:45:35,845][100936] Updated weights for policy 0, policy_version 87510 (0.0009) +[2023-10-14 08:45:36,216][100936] Updated weights for policy 0, policy_version 87520 (0.0007) +[2023-10-14 08:45:38,166][100917] Updated weights for policy 1, policy_version 87652 (0.0009) +[2023-10-14 08:45:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 179372032. Throughput: 0: 1666.0, 1: 1664.7. Samples: 44858622. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:45:38,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:45:38,523][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000087520_89620480.pth... +[2023-10-14 08:45:38,554][100917] Updated weights for policy 1, policy_version 87662 (0.0007) +[2023-10-14 08:45:38,559][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000085984_88047616.pth +[2023-10-14 08:45:38,920][100917] Updated weights for policy 1, policy_version 87672 (0.0010) +[2023-10-14 08:45:39,217][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000087680_89784320.pth... +[2023-10-14 08:45:39,246][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000086112_88178688.pth +[2023-10-14 08:45:40,221][100936] Updated weights for policy 0, policy_version 87530 (0.0007) +[2023-10-14 08:45:40,589][100936] Updated weights for policy 0, policy_version 87540 (0.0011) +[2023-10-14 08:45:40,966][100936] Updated weights for policy 0, policy_version 87550 (0.0009) +[2023-10-14 08:45:43,076][100917] Updated weights for policy 1, policy_version 87682 (0.0009) +[2023-10-14 08:45:43,449][100917] Updated weights for policy 1, policy_version 87692 (0.0008) +[2023-10-14 08:45:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 179437568. Throughput: 0: 1662.3, 1: 1664.5. Samples: 44867626. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:45:43,512][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:45:43,824][100917] Updated weights for policy 1, policy_version 87702 (0.0008) +[2023-10-14 08:45:44,190][100917] Updated weights for policy 1, policy_version 87712 (0.0010) +[2023-10-14 08:45:45,045][100936] Updated weights for policy 0, policy_version 87560 (0.0008) +[2023-10-14 08:45:45,423][100936] Updated weights for policy 0, policy_version 87570 (0.0009) +[2023-10-14 08:45:45,788][100936] Updated weights for policy 0, policy_version 87580 (0.0009) +[2023-10-14 08:45:48,380][100917] Updated weights for policy 1, policy_version 87722 (0.0010) +[2023-10-14 08:45:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 179503104. Throughput: 0: 1668.5, 1: 1658.5. Samples: 44887856. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:45:48,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:45:48,750][100917] Updated weights for policy 1, policy_version 87732 (0.0007) +[2023-10-14 08:45:49,129][100917] Updated weights for policy 1, policy_version 87742 (0.0009) +[2023-10-14 08:45:49,769][100936] Updated weights for policy 0, policy_version 87590 (0.0009) +[2023-10-14 08:45:50,149][100936] Updated weights for policy 0, policy_version 87600 (0.0008) +[2023-10-14 08:45:50,514][100936] Updated weights for policy 0, policy_version 87610 (0.0009) +[2023-10-14 08:45:53,184][100917] Updated weights for policy 1, policy_version 87752 (0.0008) +[2023-10-14 08:45:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 179568640. Throughput: 0: 1666.7, 1: 1654.6. Samples: 44908462. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:45:53,513][99942] Avg episode reward: [(0, '0.950'), (1, '1.000')] +[2023-10-14 08:45:53,549][100917] Updated weights for policy 1, policy_version 87762 (0.0007) +[2023-10-14 08:45:53,912][100917] Updated weights for policy 1, policy_version 87772 (0.0009) +[2023-10-14 08:45:54,581][100936] Updated weights for policy 0, policy_version 87620 (0.0008) +[2023-10-14 08:45:54,958][100936] Updated weights for policy 0, policy_version 87630 (0.0010) +[2023-10-14 08:45:55,322][100936] Updated weights for policy 0, policy_version 87640 (0.0010) +[2023-10-14 08:45:58,091][100917] Updated weights for policy 1, policy_version 87782 (0.0009) +[2023-10-14 08:45:58,466][100917] Updated weights for policy 1, policy_version 87792 (0.0007) +[2023-10-14 08:45:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 179634176. Throughput: 0: 1665.4, 1: 1655.4. Samples: 44917576. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:45:58,512][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:45:58,838][100917] Updated weights for policy 1, policy_version 87802 (0.0007) +[2023-10-14 08:45:59,355][100936] Updated weights for policy 0, policy_version 87650 (0.0009) +[2023-10-14 08:45:59,732][100936] Updated weights for policy 0, policy_version 87660 (0.0010) +[2023-10-14 08:46:00,095][100936] Updated weights for policy 0, policy_version 87670 (0.0007) +[2023-10-14 08:46:00,458][100936] Updated weights for policy 0, policy_version 87680 (0.0009) +[2023-10-14 08:46:02,957][100917] Updated weights for policy 1, policy_version 87812 (0.0008) +[2023-10-14 08:46:03,327][100917] Updated weights for policy 1, policy_version 87822 (0.0010) +[2023-10-14 08:46:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 179699712. Throughput: 0: 1667.5, 1: 1656.9. Samples: 44938012. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:46:03,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:46:03,694][100917] Updated weights for policy 1, policy_version 87832 (0.0010) +[2023-10-14 08:46:04,655][100936] Updated weights for policy 0, policy_version 87690 (0.0008) +[2023-10-14 08:46:05,028][100936] Updated weights for policy 0, policy_version 87700 (0.0008) +[2023-10-14 08:46:05,404][100936] Updated weights for policy 0, policy_version 87710 (0.0010) +[2023-10-14 08:46:07,943][100917] Updated weights for policy 1, policy_version 87842 (0.0009) +[2023-10-14 08:46:08,309][100917] Updated weights for policy 1, policy_version 87852 (0.0009) +[2023-10-14 08:46:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 179765248. Throughput: 0: 1666.9, 1: 1653.9. Samples: 44958328. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:46:08,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:46:08,684][100917] Updated weights for policy 1, policy_version 87862 (0.0007) +[2023-10-14 08:46:09,052][100917] Updated weights for policy 1, policy_version 87872 (0.0008) +[2023-10-14 08:46:09,633][100936] Updated weights for policy 0, policy_version 87720 (0.0009) +[2023-10-14 08:46:10,012][100936] Updated weights for policy 0, policy_version 87730 (0.0009) +[2023-10-14 08:46:10,373][100936] Updated weights for policy 0, policy_version 87740 (0.0009) +[2023-10-14 08:46:13,118][100917] Updated weights for policy 1, policy_version 87882 (0.0010) +[2023-10-14 08:46:13,501][100917] Updated weights for policy 1, policy_version 87892 (0.0012) +[2023-10-14 08:46:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 179830784. Throughput: 0: 1664.0, 1: 1659.0. Samples: 44967426. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:46:13,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:46:13,860][100917] Updated weights for policy 1, policy_version 87902 (0.0010) +[2023-10-14 08:46:14,401][100936] Updated weights for policy 0, policy_version 87750 (0.0010) +[2023-10-14 08:46:14,768][100936] Updated weights for policy 0, policy_version 87760 (0.0007) +[2023-10-14 08:46:15,147][100936] Updated weights for policy 0, policy_version 87770 (0.0009) +[2023-10-14 08:46:17,938][100917] Updated weights for policy 1, policy_version 87912 (0.0009) +[2023-10-14 08:46:18,299][100917] Updated weights for policy 1, policy_version 87922 (0.0008) +[2023-10-14 08:46:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 179896320. Throughput: 0: 1670.4, 1: 1662.1. Samples: 44988130. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:46:18,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:46:18,675][100917] Updated weights for policy 1, policy_version 87932 (0.0009) +[2023-10-14 08:46:19,092][100936] Updated weights for policy 0, policy_version 87780 (0.0008) +[2023-10-14 08:46:19,459][100936] Updated weights for policy 0, policy_version 87790 (0.0007) +[2023-10-14 08:46:19,833][100936] Updated weights for policy 0, policy_version 87800 (0.0008) +[2023-10-14 08:46:22,754][100917] Updated weights for policy 1, policy_version 87942 (0.0010) +[2023-10-14 08:46:23,119][100917] Updated weights for policy 1, policy_version 87952 (0.0010) +[2023-10-14 08:46:23,494][100917] Updated weights for policy 1, policy_version 87962 (0.0009) +[2023-10-14 08:46:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 179961856. Throughput: 0: 1671.4, 1: 1656.9. Samples: 45008396. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:46:23,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:46:24,051][100936] Updated weights for policy 0, policy_version 87810 (0.0008) +[2023-10-14 08:46:24,421][100936] Updated weights for policy 0, policy_version 87820 (0.0007) +[2023-10-14 08:46:24,782][100936] Updated weights for policy 0, policy_version 87830 (0.0010) +[2023-10-14 08:46:25,153][100936] Updated weights for policy 0, policy_version 87840 (0.0010) +[2023-10-14 08:46:27,577][100917] Updated weights for policy 1, policy_version 87972 (0.0008) +[2023-10-14 08:46:27,973][100917] Updated weights for policy 1, policy_version 87982 (0.0009) +[2023-10-14 08:46:28,336][100917] Updated weights for policy 1, policy_version 87992 (0.0009) +[2023-10-14 08:46:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 180027392. Throughput: 0: 1669.0, 1: 1671.8. Samples: 45017964. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:46:28,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:46:29,120][100936] Updated weights for policy 0, policy_version 87850 (0.0008) +[2023-10-14 08:46:29,492][100936] Updated weights for policy 0, policy_version 87860 (0.0010) +[2023-10-14 08:46:29,859][100936] Updated weights for policy 0, policy_version 87870 (0.0008) +[2023-10-14 08:46:32,478][100917] Updated weights for policy 1, policy_version 88002 (0.0011) +[2023-10-14 08:46:32,849][100917] Updated weights for policy 1, policy_version 88012 (0.0007) +[2023-10-14 08:46:33,220][100917] Updated weights for policy 1, policy_version 88022 (0.0007) +[2023-10-14 08:46:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 180092928. Throughput: 0: 1676.0, 1: 1667.1. Samples: 45038296. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:46:33,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:46:33,590][100917] Updated weights for policy 1, policy_version 88032 (0.0007) +[2023-10-14 08:46:33,872][100936] Updated weights for policy 0, policy_version 87880 (0.0011) +[2023-10-14 08:46:34,251][100936] Updated weights for policy 0, policy_version 87890 (0.0008) +[2023-10-14 08:46:34,622][100936] Updated weights for policy 0, policy_version 87900 (0.0009) +[2023-10-14 08:46:37,631][100917] Updated weights for policy 1, policy_version 88042 (0.0011) +[2023-10-14 08:46:38,009][100917] Updated weights for policy 1, policy_version 88052 (0.0010) +[2023-10-14 08:46:38,374][100917] Updated weights for policy 1, policy_version 88062 (0.0010) +[2023-10-14 08:46:38,512][99942] Fps is (10 sec: 16384.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 180191232. Throughput: 0: 1676.8, 1: 1655.2. Samples: 45058404. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:46:38,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:46:38,813][100936] Updated weights for policy 0, policy_version 87910 (0.0010) +[2023-10-14 08:46:39,182][100936] Updated weights for policy 0, policy_version 87920 (0.0008) +[2023-10-14 08:46:39,552][100936] Updated weights for policy 0, policy_version 87930 (0.0009) +[2023-10-14 08:46:42,549][100917] Updated weights for policy 1, policy_version 88072 (0.0010) +[2023-10-14 08:46:42,934][100917] Updated weights for policy 1, policy_version 88082 (0.0009) +[2023-10-14 08:46:43,307][100917] Updated weights for policy 1, policy_version 88092 (0.0009) +[2023-10-14 08:46:43,512][99942] Fps is (10 sec: 16384.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 180256768. Throughput: 0: 1677.2, 1: 1669.6. Samples: 45068182. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:46:43,512][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:46:43,626][100936] Updated weights for policy 0, policy_version 87940 (0.0010) +[2023-10-14 08:46:44,001][100936] Updated weights for policy 0, policy_version 87950 (0.0010) +[2023-10-14 08:46:44,360][100936] Updated weights for policy 0, policy_version 87960 (0.0009) +[2023-10-14 08:46:47,462][100917] Updated weights for policy 1, policy_version 88102 (0.0009) +[2023-10-14 08:46:47,837][100917] Updated weights for policy 1, policy_version 88112 (0.0008) +[2023-10-14 08:46:48,210][100917] Updated weights for policy 1, policy_version 88122 (0.0007) +[2023-10-14 08:46:48,479][100936] Updated weights for policy 0, policy_version 87970 (0.0011) +[2023-10-14 08:46:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 180322304. Throughput: 0: 1681.7, 1: 1664.5. Samples: 45088594. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:46:48,512][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:46:48,855][100936] Updated weights for policy 0, policy_version 87980 (0.0008) +[2023-10-14 08:46:49,226][100936] Updated weights for policy 0, policy_version 87990 (0.0007) +[2023-10-14 08:46:49,583][100936] Updated weights for policy 0, policy_version 88000 (0.0008) +[2023-10-14 08:46:52,448][100917] Updated weights for policy 1, policy_version 88132 (0.0009) +[2023-10-14 08:46:52,818][100917] Updated weights for policy 1, policy_version 88142 (0.0008) +[2023-10-14 08:46:53,196][100917] Updated weights for policy 1, policy_version 88152 (0.0007) +[2023-10-14 08:46:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 180387840. Throughput: 0: 1679.7, 1: 1650.4. Samples: 45108184. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) +[2023-10-14 08:46:53,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:46:53,723][100936] Updated weights for policy 0, policy_version 88010 (0.0007) +[2023-10-14 08:46:54,091][100936] Updated weights for policy 0, policy_version 88020 (0.0008) +[2023-10-14 08:46:54,469][100936] Updated weights for policy 0, policy_version 88030 (0.0008) +[2023-10-14 08:46:57,167][100917] Updated weights for policy 1, policy_version 88162 (0.0008) +[2023-10-14 08:46:57,536][100917] Updated weights for policy 1, policy_version 88172 (0.0009) +[2023-10-14 08:46:57,912][100917] Updated weights for policy 1, policy_version 88182 (0.0007) +[2023-10-14 08:46:58,275][100917] Updated weights for policy 1, policy_version 88192 (0.0007) +[2023-10-14 08:46:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 180453376. Throughput: 0: 1682.3, 1: 1663.2. Samples: 45117974. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 08:46:58,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:46:58,680][100936] Updated weights for policy 0, policy_version 88040 (0.0008) +[2023-10-14 08:46:59,042][100936] Updated weights for policy 0, policy_version 88050 (0.0008) +[2023-10-14 08:46:59,412][100936] Updated weights for policy 0, policy_version 88060 (0.0010) +[2023-10-14 08:47:02,454][100917] Updated weights for policy 1, policy_version 88202 (0.0008) +[2023-10-14 08:47:02,825][100917] Updated weights for policy 1, policy_version 88212 (0.0007) +[2023-10-14 08:47:03,205][100917] Updated weights for policy 1, policy_version 88222 (0.0007) +[2023-10-14 08:47:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 180518912. Throughput: 0: 1674.8, 1: 1661.8. Samples: 45138278. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 08:47:03,512][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:47:03,543][100936] Updated weights for policy 0, policy_version 88070 (0.0009) +[2023-10-14 08:47:03,906][100936] Updated weights for policy 0, policy_version 88080 (0.0009) +[2023-10-14 08:47:04,270][100936] Updated weights for policy 0, policy_version 88090 (0.0009) +[2023-10-14 08:47:07,364][100917] Updated weights for policy 1, policy_version 88232 (0.0008) +[2023-10-14 08:47:07,729][100917] Updated weights for policy 1, policy_version 88242 (0.0007) +[2023-10-14 08:47:08,104][100917] Updated weights for policy 1, policy_version 88252 (0.0007) +[2023-10-14 08:47:08,278][100936] Updated weights for policy 0, policy_version 88100 (0.0008) +[2023-10-14 08:47:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 180584448. Throughput: 0: 1671.4, 1: 1644.0. Samples: 45157588. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 08:47:08,513][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:47:08,663][100936] Updated weights for policy 0, policy_version 88110 (0.0007) +[2023-10-14 08:47:09,017][100936] Updated weights for policy 0, policy_version 88120 (0.0008) +[2023-10-14 08:47:12,256][100917] Updated weights for policy 1, policy_version 88262 (0.0009) +[2023-10-14 08:47:12,630][100917] Updated weights for policy 1, policy_version 88272 (0.0007) +[2023-10-14 08:47:12,999][100917] Updated weights for policy 1, policy_version 88282 (0.0007) +[2023-10-14 08:47:13,290][100936] Updated weights for policy 0, policy_version 88130 (0.0007) +[2023-10-14 08:47:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 180649984. Throughput: 0: 1675.5, 1: 1649.6. Samples: 45167594. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 08:47:13,512][99942] Avg episode reward: [(0, '0.960'), (1, '1.000')] +[2023-10-14 08:47:13,665][100936] Updated weights for policy 0, policy_version 88140 (0.0007) +[2023-10-14 08:47:14,029][100936] Updated weights for policy 0, policy_version 88150 (0.0009) +[2023-10-14 08:47:14,404][100936] Updated weights for policy 0, policy_version 88160 (0.0008) +[2023-10-14 08:47:17,251][100917] Updated weights for policy 1, policy_version 88292 (0.0008) +[2023-10-14 08:47:17,657][100917] Updated weights for policy 1, policy_version 88302 (0.0010) +[2023-10-14 08:47:18,027][100917] Updated weights for policy 1, policy_version 88312 (0.0009) +[2023-10-14 08:47:18,418][100936] Updated weights for policy 0, policy_version 88170 (0.0008) +[2023-10-14 08:47:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 180715520. Throughput: 0: 1669.2, 1: 1654.0. Samples: 45187838. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 08:47:18,513][99942] Avg episode reward: [(0, '0.860'), (1, '1.000')] +[2023-10-14 08:47:18,791][100936] Updated weights for policy 0, policy_version 88180 (0.0007) +[2023-10-14 08:47:19,154][100936] Updated weights for policy 0, policy_version 88190 (0.0010) +[2023-10-14 08:47:22,135][100917] Updated weights for policy 1, policy_version 88322 (0.0008) +[2023-10-14 08:47:22,503][100917] Updated weights for policy 1, policy_version 88332 (0.0011) +[2023-10-14 08:47:22,884][100917] Updated weights for policy 1, policy_version 88342 (0.0009) +[2023-10-14 08:47:23,240][100936] Updated weights for policy 0, policy_version 88200 (0.0008) +[2023-10-14 08:47:23,249][100917] Updated weights for policy 1, policy_version 88352 (0.0009) +[2023-10-14 08:47:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 180781056. Throughput: 0: 1655.9, 1: 1645.0. Samples: 45206942. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 08:47:23,512][99942] Avg episode reward: [(0, '0.860'), (1, '1.000')] +[2023-10-14 08:47:23,612][100936] Updated weights for policy 0, policy_version 88210 (0.0008) +[2023-10-14 08:47:23,971][100936] Updated weights for policy 0, policy_version 88220 (0.0008) +[2023-10-14 08:47:27,366][100917] Updated weights for policy 1, policy_version 88362 (0.0010) +[2023-10-14 08:47:27,746][100917] Updated weights for policy 1, policy_version 88372 (0.0007) +[2023-10-14 08:47:28,107][100936] Updated weights for policy 0, policy_version 88230 (0.0009) +[2023-10-14 08:47:28,118][100917] Updated weights for policy 1, policy_version 88382 (0.0007) +[2023-10-14 08:47:28,476][100936] Updated weights for policy 0, policy_version 88240 (0.0008) +[2023-10-14 08:47:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 180846592. Throughput: 0: 1664.7, 1: 1650.9. Samples: 45217386. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 08:47:28,513][99942] Avg episode reward: [(0, '0.860'), (1, '1.000')] +[2023-10-14 08:47:28,848][100936] Updated weights for policy 0, policy_version 88250 (0.0010) +[2023-10-14 08:47:32,091][100917] Updated weights for policy 1, policy_version 88392 (0.0007) +[2023-10-14 08:47:32,465][100917] Updated weights for policy 1, policy_version 88402 (0.0008) +[2023-10-14 08:47:32,833][100917] Updated weights for policy 1, policy_version 88412 (0.0008) +[2023-10-14 08:47:32,879][100936] Updated weights for policy 0, policy_version 88260 (0.0009) +[2023-10-14 08:47:33,248][100936] Updated weights for policy 0, policy_version 88270 (0.0009) +[2023-10-14 08:47:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 180912128. Throughput: 0: 1665.2, 1: 1658.8. Samples: 45238172. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 08:47:33,512][99942] Avg episode reward: [(0, '0.860'), (1, '1.000')] +[2023-10-14 08:47:33,615][100936] Updated weights for policy 0, policy_version 88280 (0.0008) +[2023-10-14 08:47:36,965][100917] Updated weights for policy 1, policy_version 88422 (0.0010) +[2023-10-14 08:47:37,341][100917] Updated weights for policy 1, policy_version 88432 (0.0009) +[2023-10-14 08:47:37,670][100936] Updated weights for policy 0, policy_version 88290 (0.0008) +[2023-10-14 08:47:37,710][100917] Updated weights for policy 1, policy_version 88442 (0.0007) +[2023-10-14 08:47:38,038][100936] Updated weights for policy 0, policy_version 88300 (0.0009) +[2023-10-14 08:47:38,404][100936] Updated weights for policy 0, policy_version 88310 (0.0007) +[2023-10-14 08:47:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 180977664. Throughput: 0: 1651.7, 1: 1650.5. Samples: 45256786. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 08:47:38,513][99942] Avg episode reward: [(0, '0.860'), (1, '1.000')] +[2023-10-14 08:47:38,522][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000088448_90570752.pth... +[2023-10-14 08:47:38,561][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000086880_88965120.pth +[2023-10-14 08:47:38,774][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000088320_90439680.pth... +[2023-10-14 08:47:38,779][100936] Updated weights for policy 0, policy_version 88320 (0.0008) +[2023-10-14 08:47:38,803][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000086752_88834048.pth +[2023-10-14 08:47:41,765][100917] Updated weights for policy 1, policy_version 88452 (0.0009) +[2023-10-14 08:47:42,146][100917] Updated weights for policy 1, policy_version 88462 (0.0009) +[2023-10-14 08:47:42,508][100917] Updated weights for policy 1, policy_version 88472 (0.0008) +[2023-10-14 08:47:42,932][100936] Updated weights for policy 0, policy_version 88330 (0.0007) +[2023-10-14 08:47:43,308][100936] Updated weights for policy 0, policy_version 88340 (0.0009) +[2023-10-14 08:47:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 181043200. Throughput: 0: 1669.0, 1: 1658.4. Samples: 45267706. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 08:47:43,513][99942] Avg episode reward: [(0, '0.860'), (1, '1.000')] +[2023-10-14 08:47:43,673][100936] Updated weights for policy 0, policy_version 88350 (0.0008) +[2023-10-14 08:47:46,776][100917] Updated weights for policy 1, policy_version 88482 (0.0008) +[2023-10-14 08:47:47,151][100917] Updated weights for policy 1, policy_version 88492 (0.0008) +[2023-10-14 08:47:47,532][100917] Updated weights for policy 1, policy_version 88502 (0.0008) +[2023-10-14 08:47:47,895][100917] Updated weights for policy 1, policy_version 88512 (0.0010) +[2023-10-14 08:47:48,008][100936] Updated weights for policy 0, policy_version 88360 (0.0010) +[2023-10-14 08:47:48,384][100936] Updated weights for policy 0, policy_version 88370 (0.0009) +[2023-10-14 08:47:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 181108736. Throughput: 0: 1667.8, 1: 1649.2. Samples: 45287546. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) +[2023-10-14 08:47:48,513][99942] Avg episode reward: [(0, '0.860'), (1, '1.000')] +[2023-10-14 08:47:48,757][100936] Updated weights for policy 0, policy_version 88380 (0.0009) +[2023-10-14 08:47:52,109][100917] Updated weights for policy 1, policy_version 88522 (0.0007) +[2023-10-14 08:47:52,488][100917] Updated weights for policy 1, policy_version 88532 (0.0009) +[2023-10-14 08:47:52,697][100936] Updated weights for policy 0, policy_version 88390 (0.0008) +[2023-10-14 08:47:52,856][100917] Updated weights for policy 1, policy_version 88542 (0.0009) +[2023-10-14 08:47:53,059][100936] Updated weights for policy 0, policy_version 88400 (0.0008) +[2023-10-14 08:47:53,435][100936] Updated weights for policy 0, policy_version 88410 (0.0008) +[2023-10-14 08:47:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 181174272. Throughput: 0: 1650.9, 1: 1647.5. Samples: 45306018. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) +[2023-10-14 08:47:53,513][99942] Avg episode reward: [(0, '0.860'), (1, '1.000')] +[2023-10-14 08:47:56,949][100917] Updated weights for policy 1, policy_version 88552 (0.0009) +[2023-10-14 08:47:57,328][100917] Updated weights for policy 1, policy_version 88562 (0.0010) +[2023-10-14 08:47:57,624][100936] Updated weights for policy 0, policy_version 88420 (0.0009) +[2023-10-14 08:47:57,700][100917] Updated weights for policy 1, policy_version 88572 (0.0010) +[2023-10-14 08:47:57,985][100936] Updated weights for policy 0, policy_version 88430 (0.0009) +[2023-10-14 08:47:58,369][100936] Updated weights for policy 0, policy_version 88440 (0.0009) +[2023-10-14 08:47:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 181239808. Throughput: 0: 1668.8, 1: 1655.7. Samples: 45317198. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) +[2023-10-14 08:47:58,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 08:48:01,716][100917] Updated weights for policy 1, policy_version 88582 (0.0008) +[2023-10-14 08:48:02,088][100917] Updated weights for policy 1, policy_version 88592 (0.0007) +[2023-10-14 08:48:02,456][100917] Updated weights for policy 1, policy_version 88602 (0.0007) +[2023-10-14 08:48:02,481][100936] Updated weights for policy 0, policy_version 88450 (0.0008) +[2023-10-14 08:48:02,838][100936] Updated weights for policy 0, policy_version 88460 (0.0007) +[2023-10-14 08:48:03,203][100936] Updated weights for policy 0, policy_version 88470 (0.0009) +[2023-10-14 08:48:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 181305344. Throughput: 0: 1676.2, 1: 1645.8. Samples: 45337326. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) +[2023-10-14 08:48:03,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 08:48:03,576][100936] Updated weights for policy 0, policy_version 88480 (0.0008) +[2023-10-14 08:48:06,651][100917] Updated weights for policy 1, policy_version 88612 (0.0009) +[2023-10-14 08:48:07,016][100917] Updated weights for policy 1, policy_version 88622 (0.0010) +[2023-10-14 08:48:07,391][100917] Updated weights for policy 1, policy_version 88632 (0.0008) +[2023-10-14 08:48:07,676][100936] Updated weights for policy 0, policy_version 88490 (0.0008) +[2023-10-14 08:48:08,043][100936] Updated weights for policy 0, policy_version 88500 (0.0007) +[2023-10-14 08:48:08,407][100936] Updated weights for policy 0, policy_version 88510 (0.0007) +[2023-10-14 08:48:08,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 181403648. Throughput: 0: 1659.6, 1: 1650.8. Samples: 45355912. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) +[2023-10-14 08:48:08,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 08:48:11,516][100917] Updated weights for policy 1, policy_version 88642 (0.0008) +[2023-10-14 08:48:11,884][100917] Updated weights for policy 1, policy_version 88652 (0.0007) +[2023-10-14 08:48:12,249][100917] Updated weights for policy 1, policy_version 88662 (0.0008) +[2023-10-14 08:48:12,565][100936] Updated weights for policy 0, policy_version 88520 (0.0008) +[2023-10-14 08:48:12,625][100917] Updated weights for policy 1, policy_version 88672 (0.0009) +[2023-10-14 08:48:12,925][100936] Updated weights for policy 0, policy_version 88530 (0.0010) +[2023-10-14 08:48:13,297][100936] Updated weights for policy 0, policy_version 88540 (0.0010) +[2023-10-14 08:48:13,512][99942] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 181469184. Throughput: 0: 1671.6, 1: 1659.2. Samples: 45367272. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) +[2023-10-14 08:48:13,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 08:48:16,736][100917] Updated weights for policy 1, policy_version 88682 (0.0009) +[2023-10-14 08:48:17,099][100917] Updated weights for policy 1, policy_version 88692 (0.0008) +[2023-10-14 08:48:17,345][100936] Updated weights for policy 0, policy_version 88550 (0.0007) +[2023-10-14 08:48:17,471][100917] Updated weights for policy 1, policy_version 88702 (0.0008) +[2023-10-14 08:48:17,708][100936] Updated weights for policy 0, policy_version 88560 (0.0007) +[2023-10-14 08:48:18,075][100936] Updated weights for policy 0, policy_version 88570 (0.0010) +[2023-10-14 08:48:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 181534720. Throughput: 0: 1664.3, 1: 1640.0. Samples: 45386866. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) +[2023-10-14 08:48:18,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 08:48:21,630][100917] Updated weights for policy 1, policy_version 88712 (0.0011) +[2023-10-14 08:48:22,014][100917] Updated weights for policy 1, policy_version 88722 (0.0009) +[2023-10-14 08:48:22,194][100936] Updated weights for policy 0, policy_version 88580 (0.0009) +[2023-10-14 08:48:22,381][100917] Updated weights for policy 1, policy_version 88732 (0.0009) +[2023-10-14 08:48:22,561][100936] Updated weights for policy 0, policy_version 88590 (0.0009) +[2023-10-14 08:48:22,926][100936] Updated weights for policy 0, policy_version 88600 (0.0011) +[2023-10-14 08:48:23,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 181600256. Throughput: 0: 1655.1, 1: 1652.7. Samples: 45405638. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) +[2023-10-14 08:48:23,512][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 08:48:26,330][100917] Updated weights for policy 1, policy_version 88742 (0.0008) +[2023-10-14 08:48:26,696][100917] Updated weights for policy 1, policy_version 88752 (0.0010) +[2023-10-14 08:48:26,866][100936] Updated weights for policy 0, policy_version 88610 (0.0010) +[2023-10-14 08:48:27,068][100917] Updated weights for policy 1, policy_version 88762 (0.0008) +[2023-10-14 08:48:27,227][100936] Updated weights for policy 0, policy_version 88620 (0.0009) +[2023-10-14 08:48:27,599][100936] Updated weights for policy 0, policy_version 88630 (0.0008) +[2023-10-14 08:48:27,969][100936] Updated weights for policy 0, policy_version 88640 (0.0009) +[2023-10-14 08:48:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 181665792. Throughput: 0: 1662.2, 1: 1656.4. Samples: 45417042. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) +[2023-10-14 08:48:28,512][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 08:48:31,078][100917] Updated weights for policy 1, policy_version 88772 (0.0009) +[2023-10-14 08:48:31,449][100917] Updated weights for policy 1, policy_version 88782 (0.0011) +[2023-10-14 08:48:31,835][100917] Updated weights for policy 1, policy_version 88792 (0.0009) +[2023-10-14 08:48:32,262][100936] Updated weights for policy 0, policy_version 88650 (0.0007) +[2023-10-14 08:48:32,637][100936] Updated weights for policy 0, policy_version 88660 (0.0007) +[2023-10-14 08:48:33,015][100936] Updated weights for policy 0, policy_version 88670 (0.0008) +[2023-10-14 08:48:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 181731328. Throughput: 0: 1653.1, 1: 1649.8. Samples: 45436176. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) +[2023-10-14 08:48:33,513][99942] Avg episode reward: [(0, '0.830'), (1, '1.000')] +[2023-10-14 08:48:35,935][100917] Updated weights for policy 1, policy_version 88802 (0.0007) +[2023-10-14 08:48:36,304][100917] Updated weights for policy 1, policy_version 88812 (0.0008) +[2023-10-14 08:48:36,676][100917] Updated weights for policy 1, policy_version 88822 (0.0007) +[2023-10-14 08:48:36,908][100936] Updated weights for policy 0, policy_version 88680 (0.0009) +[2023-10-14 08:48:37,042][100917] Updated weights for policy 1, policy_version 88832 (0.0008) +[2023-10-14 08:48:37,274][100936] Updated weights for policy 0, policy_version 88690 (0.0009) +[2023-10-14 08:48:37,646][100936] Updated weights for policy 0, policy_version 88700 (0.0008) +[2023-10-14 08:48:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 181796864. Throughput: 0: 1657.1, 1: 1671.5. Samples: 45455802. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) +[2023-10-14 08:48:38,513][99942] Avg episode reward: [(0, '0.740'), (1, '1.000')] +[2023-10-14 08:48:41,226][100917] Updated weights for policy 1, policy_version 88842 (0.0010) +[2023-10-14 08:48:41,609][100917] Updated weights for policy 1, policy_version 88852 (0.0007) +[2023-10-14 08:48:41,923][100936] Updated weights for policy 0, policy_version 88710 (0.0008) +[2023-10-14 08:48:41,977][100917] Updated weights for policy 1, policy_version 88862 (0.0007) +[2023-10-14 08:48:42,295][100936] Updated weights for policy 0, policy_version 88720 (0.0008) +[2023-10-14 08:48:42,656][100936] Updated weights for policy 0, policy_version 88730 (0.0010) +[2023-10-14 08:48:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 181862400. Throughput: 0: 1662.5, 1: 1668.5. Samples: 45467096. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) +[2023-10-14 08:48:43,512][99942] Avg episode reward: [(0, '0.740'), (1, '1.000')] +[2023-10-14 08:48:46,156][100917] Updated weights for policy 1, policy_version 88872 (0.0009) +[2023-10-14 08:48:46,543][100917] Updated weights for policy 1, policy_version 88882 (0.0010) +[2023-10-14 08:48:46,904][100917] Updated weights for policy 1, policy_version 88892 (0.0008) +[2023-10-14 08:48:46,975][100936] Updated weights for policy 0, policy_version 88740 (0.0010) +[2023-10-14 08:48:47,340][100936] Updated weights for policy 0, policy_version 88750 (0.0008) +[2023-10-14 08:48:47,719][100936] Updated weights for policy 0, policy_version 88760 (0.0009) +[2023-10-14 08:48:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 181927936. Throughput: 0: 1643.9, 1: 1655.4. Samples: 45485792. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:48:48,513][99942] Avg episode reward: [(0, '0.740'), (1, '1.000')] +[2023-10-14 08:48:51,206][100917] Updated weights for policy 1, policy_version 88902 (0.0007) +[2023-10-14 08:48:51,597][100917] Updated weights for policy 1, policy_version 88912 (0.0007) +[2023-10-14 08:48:51,810][100936] Updated weights for policy 0, policy_version 88770 (0.0010) +[2023-10-14 08:48:51,975][100917] Updated weights for policy 1, policy_version 88922 (0.0007) +[2023-10-14 08:48:52,180][100936] Updated weights for policy 0, policy_version 88780 (0.0007) +[2023-10-14 08:48:52,549][100936] Updated weights for policy 0, policy_version 88790 (0.0011) +[2023-10-14 08:48:52,916][100936] Updated weights for policy 0, policy_version 88800 (0.0008) +[2023-10-14 08:48:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 181993472. Throughput: 0: 1658.0, 1: 1662.9. Samples: 45505352. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:48:53,513][99942] Avg episode reward: [(0, '0.740'), (1, '1.000')] +[2023-10-14 08:48:56,119][100917] Updated weights for policy 1, policy_version 88932 (0.0008) +[2023-10-14 08:48:56,486][100917] Updated weights for policy 1, policy_version 88942 (0.0010) +[2023-10-14 08:48:56,853][100917] Updated weights for policy 1, policy_version 88952 (0.0010) +[2023-10-14 08:48:57,013][100936] Updated weights for policy 0, policy_version 88810 (0.0007) +[2023-10-14 08:48:57,374][100936] Updated weights for policy 0, policy_version 88820 (0.0007) +[2023-10-14 08:48:57,748][100936] Updated weights for policy 0, policy_version 88830 (0.0008) +[2023-10-14 08:48:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 182059008. Throughput: 0: 1667.6, 1: 1660.0. Samples: 45517012. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:48:58,513][99942] Avg episode reward: [(0, '0.630'), (1, '1.000')] +[2023-10-14 08:49:00,929][100917] Updated weights for policy 1, policy_version 88962 (0.0008) +[2023-10-14 08:49:01,292][100917] Updated weights for policy 1, policy_version 88972 (0.0009) +[2023-10-14 08:49:01,670][100917] Updated weights for policy 1, policy_version 88982 (0.0008) +[2023-10-14 08:49:01,843][100936] Updated weights for policy 0, policy_version 88840 (0.0007) +[2023-10-14 08:49:02,031][100917] Updated weights for policy 1, policy_version 88992 (0.0009) +[2023-10-14 08:49:02,208][100936] Updated weights for policy 0, policy_version 88850 (0.0008) +[2023-10-14 08:49:02,583][100936] Updated weights for policy 0, policy_version 88860 (0.0007) +[2023-10-14 08:49:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 182124544. Throughput: 0: 1652.1, 1: 1651.2. Samples: 45535516. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:49:03,512][99942] Avg episode reward: [(0, '0.630'), (1, '1.000')] +[2023-10-14 08:49:06,008][100917] Updated weights for policy 1, policy_version 89002 (0.0007) +[2023-10-14 08:49:06,378][100917] Updated weights for policy 1, policy_version 89012 (0.0008) +[2023-10-14 08:49:06,738][100917] Updated weights for policy 1, policy_version 89022 (0.0008) +[2023-10-14 08:49:06,750][100936] Updated weights for policy 0, policy_version 88870 (0.0007) +[2023-10-14 08:49:07,117][100936] Updated weights for policy 0, policy_version 88880 (0.0008) +[2023-10-14 08:49:07,487][100936] Updated weights for policy 0, policy_version 88890 (0.0010) +[2023-10-14 08:49:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 182190080. Throughput: 0: 1660.9, 1: 1667.3. Samples: 45555408. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:49:08,512][99942] Avg episode reward: [(0, '0.630'), (1, '1.000')] +[2023-10-14 08:49:10,852][100917] Updated weights for policy 1, policy_version 89032 (0.0007) +[2023-10-14 08:49:11,235][100917] Updated weights for policy 1, policy_version 89042 (0.0010) +[2023-10-14 08:49:11,600][100917] Updated weights for policy 1, policy_version 89052 (0.0010) +[2023-10-14 08:49:11,640][100936] Updated weights for policy 0, policy_version 88900 (0.0010) +[2023-10-14 08:49:12,013][100936] Updated weights for policy 0, policy_version 88910 (0.0007) +[2023-10-14 08:49:12,382][100936] Updated weights for policy 0, policy_version 88920 (0.0009) +[2023-10-14 08:49:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 182255616. Throughput: 0: 1661.2, 1: 1660.5. Samples: 45566518. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:49:13,513][99942] Avg episode reward: [(0, '0.630'), (1, '1.000')] +[2023-10-14 08:49:15,751][100917] Updated weights for policy 1, policy_version 89062 (0.0009) +[2023-10-14 08:49:16,130][100917] Updated weights for policy 1, policy_version 89072 (0.0011) +[2023-10-14 08:49:16,485][100936] Updated weights for policy 0, policy_version 88930 (0.0008) +[2023-10-14 08:49:16,497][100917] Updated weights for policy 1, policy_version 89082 (0.0007) +[2023-10-14 08:49:16,893][100936] Updated weights for policy 0, policy_version 88940 (0.0007) +[2023-10-14 08:49:17,265][100936] Updated weights for policy 0, policy_version 88950 (0.0007) +[2023-10-14 08:49:17,636][100936] Updated weights for policy 0, policy_version 88960 (0.0010) +[2023-10-14 08:49:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 182321152. Throughput: 0: 1651.3, 1: 1655.4. Samples: 45584978. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:49:18,512][99942] Avg episode reward: [(0, '0.520'), (1, '1.000')] +[2023-10-14 08:49:20,595][100917] Updated weights for policy 1, policy_version 89092 (0.0007) +[2023-10-14 08:49:20,958][100917] Updated weights for policy 1, policy_version 89102 (0.0010) +[2023-10-14 08:49:21,333][100917] Updated weights for policy 1, policy_version 89112 (0.0010) +[2023-10-14 08:49:21,673][100936] Updated weights for policy 0, policy_version 88970 (0.0008) +[2023-10-14 08:49:22,042][100936] Updated weights for policy 0, policy_version 88980 (0.0008) +[2023-10-14 08:49:22,413][100936] Updated weights for policy 0, policy_version 88990 (0.0007) +[2023-10-14 08:49:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 182386688. Throughput: 0: 1659.6, 1: 1658.2. Samples: 45605102. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:49:23,512][99942] Avg episode reward: [(0, '0.520'), (1, '1.000')] +[2023-10-14 08:49:25,387][100917] Updated weights for policy 1, policy_version 89122 (0.0009) +[2023-10-14 08:49:25,744][100917] Updated weights for policy 1, policy_version 89132 (0.0007) +[2023-10-14 08:49:26,117][100917] Updated weights for policy 1, policy_version 89142 (0.0007) +[2023-10-14 08:49:26,386][100936] Updated weights for policy 0, policy_version 89000 (0.0008) +[2023-10-14 08:49:26,479][100917] Updated weights for policy 1, policy_version 89152 (0.0009) +[2023-10-14 08:49:26,759][100936] Updated weights for policy 0, policy_version 89010 (0.0009) +[2023-10-14 08:49:27,120][100936] Updated weights for policy 0, policy_version 89020 (0.0009) +[2023-10-14 08:49:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 182452224. Throughput: 0: 1656.6, 1: 1643.6. Samples: 45615606. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:49:28,513][99942] Avg episode reward: [(0, '0.520'), (1, '1.000')] +[2023-10-14 08:49:30,605][100917] Updated weights for policy 1, policy_version 89162 (0.0008) +[2023-10-14 08:49:30,985][100917] Updated weights for policy 1, policy_version 89172 (0.0009) +[2023-10-14 08:49:31,263][100936] Updated weights for policy 0, policy_version 89030 (0.0008) +[2023-10-14 08:49:31,359][100917] Updated weights for policy 1, policy_version 89182 (0.0008) +[2023-10-14 08:49:31,629][100936] Updated weights for policy 0, policy_version 89040 (0.0008) +[2023-10-14 08:49:31,988][100936] Updated weights for policy 0, policy_version 89050 (0.0009) +[2023-10-14 08:49:33,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 182517760. Throughput: 0: 1649.6, 1: 1656.7. Samples: 45634576. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:49:33,513][99942] Avg episode reward: [(0, '0.520'), (1, '1.000')] +[2023-10-14 08:49:35,623][100917] Updated weights for policy 1, policy_version 89192 (0.0009) +[2023-10-14 08:49:36,003][100917] Updated weights for policy 1, policy_version 89202 (0.0010) +[2023-10-14 08:49:36,224][100936] Updated weights for policy 0, policy_version 89060 (0.0008) +[2023-10-14 08:49:36,378][100917] Updated weights for policy 1, policy_version 89212 (0.0007) +[2023-10-14 08:49:36,585][100936] Updated weights for policy 0, policy_version 89070 (0.0009) +[2023-10-14 08:49:36,958][100936] Updated weights for policy 0, policy_version 89080 (0.0010) +[2023-10-14 08:49:38,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 182583296. Throughput: 0: 1663.2, 1: 1658.8. Samples: 45654844. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:49:38,513][99942] Avg episode reward: [(0, '0.520'), (1, '1.000')] +[2023-10-14 08:49:38,520][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000089088_91226112.pth... +[2023-10-14 08:49:38,520][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000089216_91357184.pth... +[2023-10-14 08:49:38,560][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000087680_89784320.pth +[2023-10-14 08:49:38,561][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000087520_89620480.pth +[2023-10-14 08:49:40,624][100917] Updated weights for policy 1, policy_version 89222 (0.0008) +[2023-10-14 08:49:41,008][100917] Updated weights for policy 1, policy_version 89232 (0.0008) +[2023-10-14 08:49:41,133][100936] Updated weights for policy 0, policy_version 89090 (0.0010) +[2023-10-14 08:49:41,371][100917] Updated weights for policy 1, policy_version 89242 (0.0009) +[2023-10-14 08:49:41,490][100936] Updated weights for policy 0, policy_version 89100 (0.0008) +[2023-10-14 08:49:41,867][100936] Updated weights for policy 0, policy_version 89110 (0.0007) +[2023-10-14 08:49:42,228][100936] Updated weights for policy 0, policy_version 89120 (0.0007) +[2023-10-14 08:49:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 182648832. Throughput: 0: 1650.5, 1: 1643.4. Samples: 45665236. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:49:43,513][99942] Avg episode reward: [(0, '0.520'), (1, '1.000')] +[2023-10-14 08:49:45,555][100917] Updated weights for policy 1, policy_version 89252 (0.0008) +[2023-10-14 08:49:45,926][100917] Updated weights for policy 1, policy_version 89262 (0.0009) +[2023-10-14 08:49:46,302][100917] Updated weights for policy 1, policy_version 89272 (0.0007) +[2023-10-14 08:49:46,423][100936] Updated weights for policy 0, policy_version 89130 (0.0008) +[2023-10-14 08:49:46,800][100936] Updated weights for policy 0, policy_version 89140 (0.0009) +[2023-10-14 08:49:47,167][100936] Updated weights for policy 0, policy_version 89150 (0.0010) +[2023-10-14 08:49:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 182714368. Throughput: 0: 1650.1, 1: 1649.4. Samples: 45683994. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:49:48,513][99942] Avg episode reward: [(0, '0.520'), (1, '1.000')] +[2023-10-14 08:49:50,289][100917] Updated weights for policy 1, policy_version 89282 (0.0009) +[2023-10-14 08:49:50,663][100917] Updated weights for policy 1, policy_version 89292 (0.0009) +[2023-10-14 08:49:51,033][100917] Updated weights for policy 1, policy_version 89302 (0.0009) +[2023-10-14 08:49:51,208][100936] Updated weights for policy 0, policy_version 89160 (0.0007) +[2023-10-14 08:49:51,407][100917] Updated weights for policy 1, policy_version 89312 (0.0009) +[2023-10-14 08:49:51,584][100936] Updated weights for policy 0, policy_version 89170 (0.0009) +[2023-10-14 08:49:51,960][100936] Updated weights for policy 0, policy_version 89180 (0.0011) +[2023-10-14 08:49:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 182779904. Throughput: 0: 1664.8, 1: 1656.0. Samples: 45704842. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:49:53,512][99942] Avg episode reward: [(0, '0.520'), (1, '1.000')] +[2023-10-14 08:49:55,480][100917] Updated weights for policy 1, policy_version 89322 (0.0009) +[2023-10-14 08:49:55,844][100917] Updated weights for policy 1, policy_version 89332 (0.0007) +[2023-10-14 08:49:56,215][100917] Updated weights for policy 1, policy_version 89342 (0.0010) +[2023-10-14 08:49:56,219][100936] Updated weights for policy 0, policy_version 89190 (0.0009) +[2023-10-14 08:49:56,585][100936] Updated weights for policy 0, policy_version 89200 (0.0008) +[2023-10-14 08:49:56,955][100936] Updated weights for policy 0, policy_version 89210 (0.0010) +[2023-10-14 08:49:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 182845440. Throughput: 0: 1657.7, 1: 1642.3. Samples: 45715016. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:49:58,513][99942] Avg episode reward: [(0, '0.620'), (1, '1.000')] +[2023-10-14 08:50:00,202][100917] Updated weights for policy 1, policy_version 89352 (0.0010) +[2023-10-14 08:50:00,588][100917] Updated weights for policy 1, policy_version 89362 (0.0008) +[2023-10-14 08:50:00,940][100936] Updated weights for policy 0, policy_version 89220 (0.0008) +[2023-10-14 08:50:00,946][100917] Updated weights for policy 1, policy_version 89372 (0.0007) +[2023-10-14 08:50:01,309][100936] Updated weights for policy 0, policy_version 89230 (0.0010) +[2023-10-14 08:50:01,673][100936] Updated weights for policy 0, policy_version 89240 (0.0011) +[2023-10-14 08:50:03,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 182910976. Throughput: 0: 1656.2, 1: 1659.3. Samples: 45734176. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:50:03,513][99942] Avg episode reward: [(0, '0.620'), (1, '1.000')] +[2023-10-14 08:50:05,136][100917] Updated weights for policy 1, policy_version 89382 (0.0007) +[2023-10-14 08:50:05,512][100917] Updated weights for policy 1, policy_version 89392 (0.0007) +[2023-10-14 08:50:05,802][100936] Updated weights for policy 0, policy_version 89250 (0.0010) +[2023-10-14 08:50:05,876][100917] Updated weights for policy 1, policy_version 89402 (0.0007) +[2023-10-14 08:50:06,230][100936] Updated weights for policy 0, policy_version 89260 (0.0010) +[2023-10-14 08:50:06,593][100936] Updated weights for policy 0, policy_version 89270 (0.0009) +[2023-10-14 08:50:06,957][100936] Updated weights for policy 0, policy_version 89280 (0.0008) +[2023-10-14 08:50:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 182976512. Throughput: 0: 1660.2, 1: 1661.4. Samples: 45754572. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:50:08,512][99942] Avg episode reward: [(0, '0.620'), (1, '1.000')] +[2023-10-14 08:50:10,002][100917] Updated weights for policy 1, policy_version 89412 (0.0009) +[2023-10-14 08:50:10,369][100917] Updated weights for policy 1, policy_version 89422 (0.0007) +[2023-10-14 08:50:10,744][100917] Updated weights for policy 1, policy_version 89432 (0.0009) +[2023-10-14 08:50:11,078][100936] Updated weights for policy 0, policy_version 89290 (0.0009) +[2023-10-14 08:50:11,448][100936] Updated weights for policy 0, policy_version 89300 (0.0009) +[2023-10-14 08:50:11,822][100936] Updated weights for policy 0, policy_version 89310 (0.0010) +[2023-10-14 08:50:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 183042048. Throughput: 0: 1650.8, 1: 1653.3. Samples: 45764290. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:50:13,513][99942] Avg episode reward: [(0, '0.620'), (1, '1.000')] +[2023-10-14 08:50:14,741][100917] Updated weights for policy 1, policy_version 89442 (0.0008) +[2023-10-14 08:50:15,116][100917] Updated weights for policy 1, policy_version 89452 (0.0009) +[2023-10-14 08:50:15,488][100917] Updated weights for policy 1, policy_version 89462 (0.0010) +[2023-10-14 08:50:15,858][100917] Updated weights for policy 1, policy_version 89472 (0.0009) +[2023-10-14 08:50:15,881][100936] Updated weights for policy 0, policy_version 89320 (0.0010) +[2023-10-14 08:50:16,259][100936] Updated weights for policy 0, policy_version 89330 (0.0011) +[2023-10-14 08:50:16,617][100936] Updated weights for policy 0, policy_version 89340 (0.0010) +[2023-10-14 08:50:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 183107584. Throughput: 0: 1657.9, 1: 1664.5. Samples: 45784084. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:50:18,512][99942] Avg episode reward: [(0, '0.620'), (1, '1.000')] +[2023-10-14 08:50:19,948][100917] Updated weights for policy 1, policy_version 89482 (0.0007) +[2023-10-14 08:50:20,320][100917] Updated weights for policy 1, policy_version 89492 (0.0008) +[2023-10-14 08:50:20,688][100917] Updated weights for policy 1, policy_version 89502 (0.0009) +[2023-10-14 08:50:20,814][100936] Updated weights for policy 0, policy_version 89350 (0.0009) +[2023-10-14 08:50:21,184][100936] Updated weights for policy 0, policy_version 89360 (0.0009) +[2023-10-14 08:50:21,546][100936] Updated weights for policy 0, policy_version 89370 (0.0008) +[2023-10-14 08:50:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 183173120. Throughput: 0: 1658.8, 1: 1673.5. Samples: 45804800. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:50:23,513][99942] Avg episode reward: [(0, '0.620'), (1, '1.000')] +[2023-10-14 08:50:24,921][100917] Updated weights for policy 1, policy_version 89512 (0.0008) +[2023-10-14 08:50:25,294][100917] Updated weights for policy 1, policy_version 89522 (0.0010) +[2023-10-14 08:50:25,579][100936] Updated weights for policy 0, policy_version 89380 (0.0008) +[2023-10-14 08:50:25,673][100917] Updated weights for policy 1, policy_version 89532 (0.0009) +[2023-10-14 08:50:25,942][100936] Updated weights for policy 0, policy_version 89390 (0.0008) +[2023-10-14 08:50:26,297][100936] Updated weights for policy 0, policy_version 89400 (0.0009) +[2023-10-14 08:50:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 183238656. Throughput: 0: 1646.7, 1: 1656.4. Samples: 45813876. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:50:28,513][99942] Avg episode reward: [(0, '0.620'), (1, '1.000')] +[2023-10-14 08:50:29,873][100917] Updated weights for policy 1, policy_version 89542 (0.0009) +[2023-10-14 08:50:30,246][100917] Updated weights for policy 1, policy_version 89552 (0.0009) +[2023-10-14 08:50:30,467][100936] Updated weights for policy 0, policy_version 89410 (0.0010) +[2023-10-14 08:50:30,618][100917] Updated weights for policy 1, policy_version 89562 (0.0008) +[2023-10-14 08:50:30,831][100936] Updated weights for policy 0, policy_version 89420 (0.0009) +[2023-10-14 08:50:31,207][100936] Updated weights for policy 0, policy_version 89430 (0.0007) +[2023-10-14 08:50:31,567][100936] Updated weights for policy 0, policy_version 89440 (0.0008) +[2023-10-14 08:50:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 183304192. Throughput: 0: 1662.0, 1: 1669.3. Samples: 45833904. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:50:33,513][99942] Avg episode reward: [(0, '0.620'), (1, '1.000')] +[2023-10-14 08:50:34,870][100917] Updated weights for policy 1, policy_version 89572 (0.0009) +[2023-10-14 08:50:35,235][100917] Updated weights for policy 1, policy_version 89582 (0.0009) +[2023-10-14 08:50:35,604][100917] Updated weights for policy 1, policy_version 89592 (0.0007) +[2023-10-14 08:50:35,638][100936] Updated weights for policy 0, policy_version 89450 (0.0008) +[2023-10-14 08:50:36,005][100936] Updated weights for policy 0, policy_version 89460 (0.0010) +[2023-10-14 08:50:36,368][100936] Updated weights for policy 0, policy_version 89470 (0.0009) +[2023-10-14 08:50:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 183369728. Throughput: 0: 1661.9, 1: 1652.6. Samples: 45853994. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:50:38,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:50:39,802][100917] Updated weights for policy 1, policy_version 89602 (0.0008) +[2023-10-14 08:50:40,174][100917] Updated weights for policy 1, policy_version 89612 (0.0008) +[2023-10-14 08:50:40,553][100917] Updated weights for policy 1, policy_version 89622 (0.0007) +[2023-10-14 08:50:40,624][100936] Updated weights for policy 0, policy_version 89480 (0.0008) +[2023-10-14 08:50:40,919][100917] Updated weights for policy 1, policy_version 89632 (0.0009) +[2023-10-14 08:50:40,988][100936] Updated weights for policy 0, policy_version 89490 (0.0009) +[2023-10-14 08:50:41,354][100936] Updated weights for policy 0, policy_version 89500 (0.0009) +[2023-10-14 08:50:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 183435264. Throughput: 0: 1647.6, 1: 1643.7. Samples: 45863122. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:50:43,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:50:45,150][100917] Updated weights for policy 1, policy_version 89642 (0.0007) +[2023-10-14 08:50:45,470][100936] Updated weights for policy 0, policy_version 89510 (0.0009) +[2023-10-14 08:50:45,528][100917] Updated weights for policy 1, policy_version 89652 (0.0008) +[2023-10-14 08:50:45,837][100936] Updated weights for policy 0, policy_version 89520 (0.0007) +[2023-10-14 08:50:45,896][100917] Updated weights for policy 1, policy_version 89662 (0.0008) +[2023-10-14 08:50:46,207][100936] Updated weights for policy 0, policy_version 89530 (0.0007) +[2023-10-14 08:50:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 183500800. Throughput: 0: 1666.7, 1: 1643.2. Samples: 45883120. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:50:48,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:50:50,005][100917] Updated weights for policy 1, policy_version 89672 (0.0008) +[2023-10-14 08:50:50,236][100936] Updated weights for policy 0, policy_version 89540 (0.0007) +[2023-10-14 08:50:50,362][100917] Updated weights for policy 1, policy_version 89682 (0.0007) +[2023-10-14 08:50:50,595][100936] Updated weights for policy 0, policy_version 89550 (0.0008) +[2023-10-14 08:50:50,736][100917] Updated weights for policy 1, policy_version 89692 (0.0009) +[2023-10-14 08:50:50,961][100936] Updated weights for policy 0, policy_version 89560 (0.0008) +[2023-10-14 08:50:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 183566336. Throughput: 0: 1667.6, 1: 1639.2. Samples: 45903376. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:50:53,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:50:54,924][100917] Updated weights for policy 1, policy_version 89702 (0.0009) +[2023-10-14 08:50:55,151][100936] Updated weights for policy 0, policy_version 89570 (0.0008) +[2023-10-14 08:50:55,291][100917] Updated weights for policy 1, policy_version 89712 (0.0008) +[2023-10-14 08:50:55,546][100936] Updated weights for policy 0, policy_version 89580 (0.0008) +[2023-10-14 08:50:55,670][100917] Updated weights for policy 1, policy_version 89722 (0.0008) +[2023-10-14 08:50:55,914][100936] Updated weights for policy 0, policy_version 89590 (0.0007) +[2023-10-14 08:50:56,272][100936] Updated weights for policy 0, policy_version 89600 (0.0009) +[2023-10-14 08:50:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 183631872. Throughput: 0: 1649.3, 1: 1635.6. Samples: 45912112. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:50:58,512][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:50:59,823][100917] Updated weights for policy 1, policy_version 89732 (0.0009) +[2023-10-14 08:51:00,192][100917] Updated weights for policy 1, policy_version 89742 (0.0007) +[2023-10-14 08:51:00,406][100936] Updated weights for policy 0, policy_version 89610 (0.0007) +[2023-10-14 08:51:00,560][100917] Updated weights for policy 1, policy_version 89752 (0.0008) +[2023-10-14 08:51:00,775][100936] Updated weights for policy 0, policy_version 89620 (0.0008) +[2023-10-14 08:51:01,151][100936] Updated weights for policy 0, policy_version 89630 (0.0007) +[2023-10-14 08:51:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 183697408. Throughput: 0: 1666.9, 1: 1636.9. Samples: 45932754. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:51:03,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:51:04,674][100917] Updated weights for policy 1, policy_version 89762 (0.0010) +[2023-10-14 08:51:05,037][100917] Updated weights for policy 1, policy_version 89772 (0.0010) +[2023-10-14 08:51:05,200][100936] Updated weights for policy 0, policy_version 89640 (0.0009) +[2023-10-14 08:51:05,398][100917] Updated weights for policy 1, policy_version 89782 (0.0010) +[2023-10-14 08:51:05,563][100936] Updated weights for policy 0, policy_version 89650 (0.0009) +[2023-10-14 08:51:05,768][100917] Updated weights for policy 1, policy_version 89792 (0.0008) +[2023-10-14 08:51:05,933][100936] Updated weights for policy 0, policy_version 89660 (0.0011) +[2023-10-14 08:51:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 183762944. Throughput: 0: 1663.8, 1: 1636.7. Samples: 45953324. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:51:08,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:51:10,002][100917] Updated weights for policy 1, policy_version 89802 (0.0010) +[2023-10-14 08:51:10,133][100936] Updated weights for policy 0, policy_version 89670 (0.0008) +[2023-10-14 08:51:10,381][100917] Updated weights for policy 1, policy_version 89812 (0.0008) +[2023-10-14 08:51:10,502][100936] Updated weights for policy 0, policy_version 89680 (0.0008) +[2023-10-14 08:51:10,767][100917] Updated weights for policy 1, policy_version 89822 (0.0010) +[2023-10-14 08:51:10,884][100936] Updated weights for policy 0, policy_version 89690 (0.0009) +[2023-10-14 08:51:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 183828480. Throughput: 0: 1654.1, 1: 1639.3. Samples: 45962082. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:51:13,513][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:51:14,764][100917] Updated weights for policy 1, policy_version 89832 (0.0007) +[2023-10-14 08:51:15,139][100917] Updated weights for policy 1, policy_version 89842 (0.0007) +[2023-10-14 08:51:15,241][100936] Updated weights for policy 0, policy_version 89700 (0.0008) +[2023-10-14 08:51:15,500][100917] Updated weights for policy 1, policy_version 89852 (0.0008) +[2023-10-14 08:51:15,623][100936] Updated weights for policy 0, policy_version 89710 (0.0007) +[2023-10-14 08:51:15,984][100936] Updated weights for policy 0, policy_version 89720 (0.0007) +[2023-10-14 08:51:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 183894016. Throughput: 0: 1655.6, 1: 1646.9. Samples: 45982518. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:51:18,512][99942] Avg episode reward: [(0, '0.690'), (1, '1.000')] +[2023-10-14 08:51:19,606][100917] Updated weights for policy 1, policy_version 89862 (0.0010) +[2023-10-14 08:51:19,971][100917] Updated weights for policy 1, policy_version 89872 (0.0007) +[2023-10-14 08:51:20,072][100936] Updated weights for policy 0, policy_version 89730 (0.0010) +[2023-10-14 08:51:20,344][100917] Updated weights for policy 1, policy_version 89882 (0.0009) +[2023-10-14 08:51:20,442][100936] Updated weights for policy 0, policy_version 89740 (0.0007) +[2023-10-14 08:51:20,810][100936] Updated weights for policy 0, policy_version 89750 (0.0007) +[2023-10-14 08:51:21,178][100936] Updated weights for policy 0, policy_version 89760 (0.0007) +[2023-10-14 08:51:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 183959552. Throughput: 0: 1657.6, 1: 1655.9. Samples: 46003098. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:51:23,512][99942] Avg episode reward: [(0, '0.780'), (1, '1.000')] +[2023-10-14 08:51:24,419][100917] Updated weights for policy 1, policy_version 89892 (0.0009) +[2023-10-14 08:51:24,793][100917] Updated weights for policy 1, policy_version 89902 (0.0007) +[2023-10-14 08:51:25,172][100917] Updated weights for policy 1, policy_version 89912 (0.0009) +[2023-10-14 08:51:25,415][100936] Updated weights for policy 0, policy_version 89770 (0.0008) +[2023-10-14 08:51:25,789][100936] Updated weights for policy 0, policy_version 89780 (0.0008) +[2023-10-14 08:51:26,152][100936] Updated weights for policy 0, policy_version 89790 (0.0008) +[2023-10-14 08:51:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 184025088. Throughput: 0: 1650.9, 1: 1656.7. Samples: 46011966. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:51:28,512][99942] Avg episode reward: [(0, '0.780'), (1, '1.000')] +[2023-10-14 08:51:29,184][100917] Updated weights for policy 1, policy_version 89922 (0.0009) +[2023-10-14 08:51:29,561][100917] Updated weights for policy 1, policy_version 89932 (0.0010) +[2023-10-14 08:51:29,938][100917] Updated weights for policy 1, policy_version 89942 (0.0009) +[2023-10-14 08:51:30,256][100936] Updated weights for policy 0, policy_version 89800 (0.0008) +[2023-10-14 08:51:30,311][100917] Updated weights for policy 1, policy_version 89952 (0.0008) +[2023-10-14 08:51:30,624][100936] Updated weights for policy 0, policy_version 89810 (0.0007) +[2023-10-14 08:51:30,996][100936] Updated weights for policy 0, policy_version 89820 (0.0007) +[2023-10-14 08:51:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 184090624. Throughput: 0: 1655.6, 1: 1660.1. Samples: 46032324. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:51:33,513][99942] Avg episode reward: [(0, '0.780'), (1, '1.000')] +[2023-10-14 08:51:34,435][100917] Updated weights for policy 1, policy_version 89962 (0.0011) +[2023-10-14 08:51:34,809][100917] Updated weights for policy 1, policy_version 89972 (0.0009) +[2023-10-14 08:51:34,926][100936] Updated weights for policy 0, policy_version 89830 (0.0009) +[2023-10-14 08:51:35,174][100917] Updated weights for policy 1, policy_version 89982 (0.0008) +[2023-10-14 08:51:35,299][100936] Updated weights for policy 0, policy_version 89840 (0.0007) +[2023-10-14 08:51:35,672][100936] Updated weights for policy 0, policy_version 89850 (0.0009) +[2023-10-14 08:51:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 184156160. Throughput: 0: 1655.9, 1: 1661.8. Samples: 46052674. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:51:38,513][99942] Avg episode reward: [(0, '0.890'), (1, '1.000')] +[2023-10-14 08:51:38,523][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000089856_92012544.pth... +[2023-10-14 08:51:38,523][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000089984_92143616.pth... +[2023-10-14 08:51:38,570][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000088320_90439680.pth +[2023-10-14 08:51:38,570][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000088448_90570752.pth +[2023-10-14 08:51:39,376][100917] Updated weights for policy 1, policy_version 89992 (0.0011) +[2023-10-14 08:51:39,742][100917] Updated weights for policy 1, policy_version 90002 (0.0009) +[2023-10-14 08:51:40,060][100936] Updated weights for policy 0, policy_version 89860 (0.0007) +[2023-10-14 08:51:40,113][100917] Updated weights for policy 1, policy_version 90012 (0.0008) +[2023-10-14 08:51:40,441][100936] Updated weights for policy 0, policy_version 89870 (0.0009) +[2023-10-14 08:51:40,819][100936] Updated weights for policy 0, policy_version 89880 (0.0012) +[2023-10-14 08:51:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 184221696. Throughput: 0: 1657.1, 1: 1664.7. Samples: 46061590. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) +[2023-10-14 08:51:43,513][99942] Avg episode reward: [(0, '0.890'), (1, '1.000')] +[2023-10-14 08:51:44,360][100917] Updated weights for policy 1, policy_version 90022 (0.0007) +[2023-10-14 08:51:44,731][100917] Updated weights for policy 1, policy_version 90032 (0.0007) +[2023-10-14 08:51:44,897][100936] Updated weights for policy 0, policy_version 89890 (0.0009) +[2023-10-14 08:51:45,100][100917] Updated weights for policy 1, policy_version 90042 (0.0009) +[2023-10-14 08:51:45,265][100936] Updated weights for policy 0, policy_version 89900 (0.0009) +[2023-10-14 08:51:45,631][100936] Updated weights for policy 0, policy_version 89910 (0.0010) +[2023-10-14 08:51:46,008][100936] Updated weights for policy 0, policy_version 89920 (0.0008) +[2023-10-14 08:51:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 184287232. Throughput: 0: 1652.0, 1: 1664.1. Samples: 46081980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 08:51:48,513][99942] Avg episode reward: [(0, '0.890'), (1, '1.000')] +[2023-10-14 08:51:49,233][100917] Updated weights for policy 1, policy_version 90052 (0.0009) +[2023-10-14 08:51:49,607][100917] Updated weights for policy 1, policy_version 90062 (0.0010) +[2023-10-14 08:51:49,972][100917] Updated weights for policy 1, policy_version 90072 (0.0009) +[2023-10-14 08:51:50,219][100936] Updated weights for policy 0, policy_version 89930 (0.0008) +[2023-10-14 08:51:50,582][100936] Updated weights for policy 0, policy_version 89940 (0.0010) +[2023-10-14 08:51:50,947][100936] Updated weights for policy 0, policy_version 89950 (0.0010) +[2023-10-14 08:51:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 184352768. Throughput: 0: 1651.8, 1: 1663.9. Samples: 46102530. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 08:51:53,513][99942] Avg episode reward: [(0, '0.890'), (1, '1.000')] +[2023-10-14 08:51:54,036][100917] Updated weights for policy 1, policy_version 90082 (0.0008) +[2023-10-14 08:51:54,403][100917] Updated weights for policy 1, policy_version 90092 (0.0009) +[2023-10-14 08:51:54,774][100917] Updated weights for policy 1, policy_version 90102 (0.0008) +[2023-10-14 08:51:55,040][100936] Updated weights for policy 0, policy_version 89960 (0.0007) +[2023-10-14 08:51:55,145][100917] Updated weights for policy 1, policy_version 90112 (0.0008) +[2023-10-14 08:51:55,409][100936] Updated weights for policy 0, policy_version 89970 (0.0007) +[2023-10-14 08:51:55,776][100936] Updated weights for policy 0, policy_version 89980 (0.0008) +[2023-10-14 08:51:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 184418304. Throughput: 0: 1654.6, 1: 1666.7. Samples: 46111540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 08:51:58,513][99942] Avg episode reward: [(0, '0.890'), (1, '1.000')] +[2023-10-14 08:51:59,225][100917] Updated weights for policy 1, policy_version 90122 (0.0007) +[2023-10-14 08:51:59,600][100917] Updated weights for policy 1, policy_version 90132 (0.0009) +[2023-10-14 08:51:59,960][100936] Updated weights for policy 0, policy_version 89990 (0.0009) +[2023-10-14 08:51:59,967][100917] Updated weights for policy 1, policy_version 90142 (0.0008) +[2023-10-14 08:52:00,329][100936] Updated weights for policy 0, policy_version 90000 (0.0007) +[2023-10-14 08:52:00,692][100936] Updated weights for policy 0, policy_version 90010 (0.0007) +[2023-10-14 08:52:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 184483840. Throughput: 0: 1657.2, 1: 1662.8. Samples: 46131914. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 08:52:03,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:52:04,158][100917] Updated weights for policy 1, policy_version 90152 (0.0011) +[2023-10-14 08:52:04,531][100917] Updated weights for policy 1, policy_version 90162 (0.0009) +[2023-10-14 08:52:04,616][100936] Updated weights for policy 0, policy_version 90020 (0.0007) +[2023-10-14 08:52:04,907][100917] Updated weights for policy 1, policy_version 90172 (0.0008) +[2023-10-14 08:52:04,989][100936] Updated weights for policy 0, policy_version 90030 (0.0008) +[2023-10-14 08:52:05,357][100936] Updated weights for policy 0, policy_version 90040 (0.0008) +[2023-10-14 08:52:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 184549376. Throughput: 0: 1660.4, 1: 1661.5. Samples: 46152586. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 08:52:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:52:09,015][100917] Updated weights for policy 1, policy_version 90182 (0.0010) +[2023-10-14 08:52:09,254][100936] Updated weights for policy 0, policy_version 90050 (0.0008) +[2023-10-14 08:52:09,381][100917] Updated weights for policy 1, policy_version 90192 (0.0010) +[2023-10-14 08:52:09,637][100936] Updated weights for policy 0, policy_version 90060 (0.0008) +[2023-10-14 08:52:09,756][100917] Updated weights for policy 1, policy_version 90202 (0.0010) +[2023-10-14 08:52:10,007][100936] Updated weights for policy 0, policy_version 90070 (0.0009) +[2023-10-14 08:52:10,369][100936] Updated weights for policy 0, policy_version 90080 (0.0008) +[2023-10-14 08:52:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 184614912. Throughput: 0: 1663.8, 1: 1659.4. Samples: 46161508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 08:52:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:52:13,773][100917] Updated weights for policy 1, policy_version 90212 (0.0008) +[2023-10-14 08:52:14,152][100917] Updated weights for policy 1, policy_version 90222 (0.0007) +[2023-10-14 08:52:14,467][100936] Updated weights for policy 0, policy_version 90090 (0.0007) +[2023-10-14 08:52:14,518][100917] Updated weights for policy 1, policy_version 90232 (0.0009) +[2023-10-14 08:52:14,841][100936] Updated weights for policy 0, policy_version 90100 (0.0008) +[2023-10-14 08:52:15,205][100936] Updated weights for policy 0, policy_version 90110 (0.0010) +[2023-10-14 08:52:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 184680448. Throughput: 0: 1662.6, 1: 1666.8. Samples: 46182150. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 08:52:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:52:18,661][100917] Updated weights for policy 1, policy_version 90242 (0.0009) +[2023-10-14 08:52:19,026][100917] Updated weights for policy 1, policy_version 90252 (0.0007) +[2023-10-14 08:52:19,247][100936] Updated weights for policy 0, policy_version 90120 (0.0007) +[2023-10-14 08:52:19,397][100917] Updated weights for policy 1, policy_version 90262 (0.0007) +[2023-10-14 08:52:19,625][100936] Updated weights for policy 0, policy_version 90130 (0.0007) +[2023-10-14 08:52:19,757][100917] Updated weights for policy 1, policy_version 90272 (0.0009) +[2023-10-14 08:52:19,995][100936] Updated weights for policy 0, policy_version 90140 (0.0009) +[2023-10-14 08:52:23,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 184745984. Throughput: 0: 1669.1, 1: 1664.4. Samples: 46202684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 08:52:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:52:23,976][100917] Updated weights for policy 1, policy_version 90282 (0.0010) +[2023-10-14 08:52:24,182][100936] Updated weights for policy 0, policy_version 90150 (0.0010) +[2023-10-14 08:52:24,347][100917] Updated weights for policy 1, policy_version 90292 (0.0009) +[2023-10-14 08:52:24,544][100936] Updated weights for policy 0, policy_version 90160 (0.0007) +[2023-10-14 08:52:24,715][100917] Updated weights for policy 1, policy_version 90302 (0.0010) +[2023-10-14 08:52:24,905][100936] Updated weights for policy 0, policy_version 90170 (0.0008) +[2023-10-14 08:52:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 184811520. Throughput: 0: 1672.6, 1: 1664.1. Samples: 46211738. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 08:52:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:52:28,795][100917] Updated weights for policy 1, policy_version 90312 (0.0010) +[2023-10-14 08:52:29,176][100917] Updated weights for policy 1, policy_version 90322 (0.0009) +[2023-10-14 08:52:29,235][100936] Updated weights for policy 0, policy_version 90180 (0.0008) +[2023-10-14 08:52:29,553][100917] Updated weights for policy 1, policy_version 90332 (0.0010) +[2023-10-14 08:52:29,624][100936] Updated weights for policy 0, policy_version 90190 (0.0007) +[2023-10-14 08:52:29,991][100936] Updated weights for policy 0, policy_version 90200 (0.0010) +[2023-10-14 08:52:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 184877056. Throughput: 0: 1671.1, 1: 1666.4. Samples: 46232166. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 08:52:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:52:33,593][100917] Updated weights for policy 1, policy_version 90342 (0.0008) +[2023-10-14 08:52:33,953][100917] Updated weights for policy 1, policy_version 90352 (0.0009) +[2023-10-14 08:52:34,139][100936] Updated weights for policy 0, policy_version 90210 (0.0009) +[2023-10-14 08:52:34,329][100917] Updated weights for policy 1, policy_version 90362 (0.0007) +[2023-10-14 08:52:34,498][100936] Updated weights for policy 0, policy_version 90220 (0.0008) +[2023-10-14 08:52:34,867][100936] Updated weights for policy 0, policy_version 90230 (0.0009) +[2023-10-14 08:52:35,234][100936] Updated weights for policy 0, policy_version 90240 (0.0008) +[2023-10-14 08:52:38,459][100917] Updated weights for policy 1, policy_version 90372 (0.0008) +[2023-10-14 08:52:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 184942592. Throughput: 0: 1668.0, 1: 1666.0. Samples: 46252558. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 08:52:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:52:38,841][100917] Updated weights for policy 1, policy_version 90382 (0.0010) +[2023-10-14 08:52:39,218][100917] Updated weights for policy 1, policy_version 90392 (0.0009) +[2023-10-14 08:52:39,370][100936] Updated weights for policy 0, policy_version 90250 (0.0010) +[2023-10-14 08:52:39,747][100936] Updated weights for policy 0, policy_version 90260 (0.0009) +[2023-10-14 08:52:40,102][100936] Updated weights for policy 0, policy_version 90270 (0.0008) +[2023-10-14 08:52:43,411][100917] Updated weights for policy 1, policy_version 90402 (0.0007) +[2023-10-14 08:52:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 185008128. Throughput: 0: 1667.5, 1: 1666.2. Samples: 46261556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 08:52:43,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:52:43,799][100917] Updated weights for policy 1, policy_version 90412 (0.0007) +[2023-10-14 08:52:44,173][100917] Updated weights for policy 1, policy_version 90422 (0.0009) +[2023-10-14 08:52:44,216][100936] Updated weights for policy 0, policy_version 90280 (0.0008) +[2023-10-14 08:52:44,546][100917] Updated weights for policy 1, policy_version 90432 (0.0010) +[2023-10-14 08:52:44,592][100936] Updated weights for policy 0, policy_version 90290 (0.0009) +[2023-10-14 08:52:44,961][100936] Updated weights for policy 0, policy_version 90300 (0.0010) +[2023-10-14 08:52:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 185073664. Throughput: 0: 1672.2, 1: 1663.8. Samples: 46282034. Policy #0 lag: (min: 3.0, avg: 11.8, max: 35.0) +[2023-10-14 08:52:48,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:52:48,747][100917] Updated weights for policy 1, policy_version 90442 (0.0009) +[2023-10-14 08:52:49,076][100936] Updated weights for policy 0, policy_version 90310 (0.0009) +[2023-10-14 08:52:49,125][100917] Updated weights for policy 1, policy_version 90452 (0.0008) +[2023-10-14 08:52:49,440][100936] Updated weights for policy 0, policy_version 90320 (0.0007) +[2023-10-14 08:52:49,491][100917] Updated weights for policy 1, policy_version 90462 (0.0007) +[2023-10-14 08:52:49,814][100936] Updated weights for policy 0, policy_version 90330 (0.0007) +[2023-10-14 08:52:53,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 185139200. Throughput: 0: 1670.7, 1: 1661.8. Samples: 46302548. Policy #0 lag: (min: 3.0, avg: 11.8, max: 35.0) +[2023-10-14 08:52:53,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:52:53,531][100917] Updated weights for policy 1, policy_version 90472 (0.0009) +[2023-10-14 08:52:53,867][100936] Updated weights for policy 0, policy_version 90340 (0.0008) +[2023-10-14 08:52:53,905][100917] Updated weights for policy 1, policy_version 90482 (0.0010) +[2023-10-14 08:52:54,240][100936] Updated weights for policy 0, policy_version 90350 (0.0008) +[2023-10-14 08:52:54,264][100917] Updated weights for policy 1, policy_version 90492 (0.0010) +[2023-10-14 08:52:54,606][100936] Updated weights for policy 0, policy_version 90360 (0.0008) +[2023-10-14 08:52:58,358][100917] Updated weights for policy 1, policy_version 90502 (0.0009) +[2023-10-14 08:52:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 185204736. Throughput: 0: 1665.7, 1: 1665.6. Samples: 46311416. Policy #0 lag: (min: 3.0, avg: 11.8, max: 35.0) +[2023-10-14 08:52:58,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:52:58,729][100917] Updated weights for policy 1, policy_version 90512 (0.0010) +[2023-10-14 08:52:58,784][100936] Updated weights for policy 0, policy_version 90370 (0.0007) +[2023-10-14 08:52:59,102][100917] Updated weights for policy 1, policy_version 90522 (0.0008) +[2023-10-14 08:52:59,148][100936] Updated weights for policy 0, policy_version 90380 (0.0008) +[2023-10-14 08:52:59,511][100936] Updated weights for policy 0, policy_version 90390 (0.0008) +[2023-10-14 08:52:59,878][100936] Updated weights for policy 0, policy_version 90400 (0.0009) +[2023-10-14 08:53:03,136][100917] Updated weights for policy 1, policy_version 90532 (0.0009) +[2023-10-14 08:53:03,504][100917] Updated weights for policy 1, policy_version 90542 (0.0009) +[2023-10-14 08:53:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 185270272. Throughput: 0: 1662.4, 1: 1659.2. Samples: 46331624. Policy #0 lag: (min: 3.0, avg: 11.8, max: 35.0) +[2023-10-14 08:53:03,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:53:03,814][100936] Updated weights for policy 0, policy_version 90410 (0.0008) +[2023-10-14 08:53:03,880][100917] Updated weights for policy 1, policy_version 90552 (0.0009) +[2023-10-14 08:53:04,178][100936] Updated weights for policy 0, policy_version 90420 (0.0010) +[2023-10-14 08:53:04,546][100936] Updated weights for policy 0, policy_version 90430 (0.0010) +[2023-10-14 08:53:07,864][100917] Updated weights for policy 1, policy_version 90562 (0.0008) +[2023-10-14 08:53:08,232][100917] Updated weights for policy 1, policy_version 90572 (0.0011) +[2023-10-14 08:53:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 185335808. Throughput: 0: 1662.7, 1: 1663.0. Samples: 46352340. Policy #0 lag: (min: 3.0, avg: 11.8, max: 35.0) +[2023-10-14 08:53:08,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:53:08,604][100917] Updated weights for policy 1, policy_version 90582 (0.0009) +[2023-10-14 08:53:08,867][100936] Updated weights for policy 0, policy_version 90440 (0.0008) +[2023-10-14 08:53:08,979][100917] Updated weights for policy 1, policy_version 90592 (0.0007) +[2023-10-14 08:53:09,239][100936] Updated weights for policy 0, policy_version 90450 (0.0010) +[2023-10-14 08:53:09,603][100936] Updated weights for policy 0, policy_version 90460 (0.0010) +[2023-10-14 08:53:13,166][100917] Updated weights for policy 1, policy_version 90602 (0.0007) +[2023-10-14 08:53:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 185401344. Throughput: 0: 1661.5, 1: 1667.8. Samples: 46361556. Policy #0 lag: (min: 3.0, avg: 11.8, max: 35.0) +[2023-10-14 08:53:13,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:53:13,539][100917] Updated weights for policy 1, policy_version 90612 (0.0008) +[2023-10-14 08:53:13,772][100936] Updated weights for policy 0, policy_version 90470 (0.0010) +[2023-10-14 08:53:13,921][100917] Updated weights for policy 1, policy_version 90622 (0.0007) +[2023-10-14 08:53:14,136][100936] Updated weights for policy 0, policy_version 90480 (0.0009) +[2023-10-14 08:53:14,503][100936] Updated weights for policy 0, policy_version 90490 (0.0010) +[2023-10-14 08:53:18,044][100917] Updated weights for policy 1, policy_version 90632 (0.0009) +[2023-10-14 08:53:18,406][100917] Updated weights for policy 1, policy_version 90642 (0.0008) +[2023-10-14 08:53:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 185466880. Throughput: 0: 1665.6, 1: 1664.9. Samples: 46382038. Policy #0 lag: (min: 3.0, avg: 11.8, max: 35.0) +[2023-10-14 08:53:18,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:53:18,517][100936] Updated weights for policy 0, policy_version 90500 (0.0010) +[2023-10-14 08:53:18,777][100917] Updated weights for policy 1, policy_version 90652 (0.0008) +[2023-10-14 08:53:18,903][100936] Updated weights for policy 0, policy_version 90510 (0.0008) +[2023-10-14 08:53:19,264][100936] Updated weights for policy 0, policy_version 90520 (0.0007) +[2023-10-14 08:53:22,887][100917] Updated weights for policy 1, policy_version 90662 (0.0009) +[2023-10-14 08:53:23,272][100917] Updated weights for policy 1, policy_version 90672 (0.0008) +[2023-10-14 08:53:23,450][100936] Updated weights for policy 0, policy_version 90530 (0.0008) +[2023-10-14 08:53:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 185532416. Throughput: 0: 1662.3, 1: 1661.6. Samples: 46402132. Policy #0 lag: (min: 3.0, avg: 11.8, max: 35.0) +[2023-10-14 08:53:23,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:53:23,647][100917] Updated weights for policy 1, policy_version 90682 (0.0007) +[2023-10-14 08:53:23,814][100936] Updated weights for policy 0, policy_version 90540 (0.0008) +[2023-10-14 08:53:24,191][100936] Updated weights for policy 0, policy_version 90550 (0.0008) +[2023-10-14 08:53:24,547][100936] Updated weights for policy 0, policy_version 90560 (0.0009) +[2023-10-14 08:53:27,656][100917] Updated weights for policy 1, policy_version 90692 (0.0009) +[2023-10-14 08:53:28,032][100917] Updated weights for policy 1, policy_version 90702 (0.0009) +[2023-10-14 08:53:28,418][100917] Updated weights for policy 1, policy_version 90712 (0.0010) +[2023-10-14 08:53:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 185597952. Throughput: 0: 1662.5, 1: 1666.9. Samples: 46411378. Policy #0 lag: (min: 3.0, avg: 11.8, max: 35.0) +[2023-10-14 08:53:28,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:53:28,746][100936] Updated weights for policy 0, policy_version 90570 (0.0009) +[2023-10-14 08:53:29,126][100936] Updated weights for policy 0, policy_version 90580 (0.0009) +[2023-10-14 08:53:29,498][100936] Updated weights for policy 0, policy_version 90590 (0.0009) +[2023-10-14 08:53:32,724][100917] Updated weights for policy 1, policy_version 90722 (0.0007) +[2023-10-14 08:53:33,153][100917] Updated weights for policy 1, policy_version 90732 (0.0007) +[2023-10-14 08:53:33,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 185663488. Throughput: 0: 1659.6, 1: 1670.2. Samples: 46431878. Policy #0 lag: (min: 3.0, avg: 11.8, max: 35.0) +[2023-10-14 08:53:33,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:53:33,518][100936] Updated weights for policy 0, policy_version 90600 (0.0008) +[2023-10-14 08:53:33,531][100917] Updated weights for policy 1, policy_version 90742 (0.0007) +[2023-10-14 08:53:33,877][100936] Updated weights for policy 0, policy_version 90610 (0.0008) +[2023-10-14 08:53:33,896][100917] Updated weights for policy 1, policy_version 90752 (0.0010) +[2023-10-14 08:53:34,256][100936] Updated weights for policy 0, policy_version 90620 (0.0009) +[2023-10-14 08:53:37,934][100917] Updated weights for policy 1, policy_version 90762 (0.0010) +[2023-10-14 08:53:38,311][100917] Updated weights for policy 1, policy_version 90772 (0.0009) +[2023-10-14 08:53:38,334][100936] Updated weights for policy 0, policy_version 90630 (0.0007) +[2023-10-14 08:53:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 185729024. Throughput: 0: 1654.8, 1: 1658.8. Samples: 46451656. Policy #0 lag: (min: 3.0, avg: 11.8, max: 35.0) +[2023-10-14 08:53:38,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:53:38,686][100917] Updated weights for policy 1, policy_version 90782 (0.0010) +[2023-10-14 08:53:38,709][100936] Updated weights for policy 0, policy_version 90640 (0.0007) +[2023-10-14 08:53:38,757][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000090784_92962816.pth... +[2023-10-14 08:53:38,794][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000089216_91357184.pth +[2023-10-14 08:53:39,082][100936] Updated weights for policy 0, policy_version 90650 (0.0008) +[2023-10-14 08:53:39,299][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000090656_92831744.pth... +[2023-10-14 08:53:39,328][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000089088_91226112.pth +[2023-10-14 08:53:42,665][100917] Updated weights for policy 1, policy_version 90792 (0.0009) +[2023-10-14 08:53:43,033][100917] Updated weights for policy 1, policy_version 90802 (0.0007) +[2023-10-14 08:53:43,264][100936] Updated weights for policy 0, policy_version 90660 (0.0008) +[2023-10-14 08:53:43,406][100917] Updated weights for policy 1, policy_version 90812 (0.0007) +[2023-10-14 08:53:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13107.2). Total num frames: 185794560. Throughput: 0: 1660.9, 1: 1668.9. Samples: 46461256. Policy #0 lag: (min: 3.0, avg: 11.8, max: 35.0) +[2023-10-14 08:53:43,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:53:43,630][100936] Updated weights for policy 0, policy_version 90670 (0.0008) +[2023-10-14 08:53:43,991][100936] Updated weights for policy 0, policy_version 90680 (0.0008) +[2023-10-14 08:53:47,652][100917] Updated weights for policy 1, policy_version 90822 (0.0008) +[2023-10-14 08:53:48,020][100917] Updated weights for policy 1, policy_version 90832 (0.0009) +[2023-10-14 08:53:48,088][100936] Updated weights for policy 0, policy_version 90690 (0.0008) +[2023-10-14 08:53:48,394][100917] Updated weights for policy 1, policy_version 90842 (0.0009) +[2023-10-14 08:53:48,453][100936] Updated weights for policy 0, policy_version 90700 (0.0010) +[2023-10-14 08:53:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 185860096. Throughput: 0: 1664.4, 1: 1670.4. Samples: 46481688. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 08:53:48,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:53:48,821][100936] Updated weights for policy 0, policy_version 90710 (0.0007) +[2023-10-14 08:53:49,190][100936] Updated weights for policy 0, policy_version 90720 (0.0011) +[2023-10-14 08:53:52,492][100917] Updated weights for policy 1, policy_version 90852 (0.0007) +[2023-10-14 08:53:52,868][100917] Updated weights for policy 1, policy_version 90862 (0.0007) +[2023-10-14 08:53:53,141][100936] Updated weights for policy 0, policy_version 90730 (0.0008) +[2023-10-14 08:53:53,242][100917] Updated weights for policy 1, policy_version 90872 (0.0007) +[2023-10-14 08:53:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 185925632. Throughput: 0: 1650.3, 1: 1656.0. Samples: 46501124. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 08:53:53,512][100936] Updated weights for policy 0, policy_version 90740 (0.0008) +[2023-10-14 08:53:53,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:53:53,880][100936] Updated weights for policy 0, policy_version 90750 (0.0009) +[2023-10-14 08:53:57,214][100917] Updated weights for policy 1, policy_version 90882 (0.0009) +[2023-10-14 08:53:57,580][100917] Updated weights for policy 1, policy_version 90892 (0.0008) +[2023-10-14 08:53:57,961][100917] Updated weights for policy 1, policy_version 90902 (0.0008) +[2023-10-14 08:53:58,158][100936] Updated weights for policy 0, policy_version 90760 (0.0009) +[2023-10-14 08:53:58,342][100917] Updated weights for policy 1, policy_version 90912 (0.0009) +[2023-10-14 08:53:58,512][99942] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 186023936. Throughput: 0: 1661.3, 1: 1663.8. Samples: 46511186. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 08:53:58,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:53:58,523][100936] Updated weights for policy 0, policy_version 90770 (0.0007) +[2023-10-14 08:53:58,902][100936] Updated weights for policy 0, policy_version 90780 (0.0008) +[2023-10-14 08:54:02,483][100917] Updated weights for policy 1, policy_version 90922 (0.0010) +[2023-10-14 08:54:02,855][100917] Updated weights for policy 1, policy_version 90932 (0.0007) +[2023-10-14 08:54:03,058][100936] Updated weights for policy 0, policy_version 90790 (0.0008) +[2023-10-14 08:54:03,219][100917] Updated weights for policy 1, policy_version 90942 (0.0009) +[2023-10-14 08:54:03,443][100936] Updated weights for policy 0, policy_version 90800 (0.0010) +[2023-10-14 08:54:03,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 186089472. Throughput: 0: 1661.2, 1: 1664.4. Samples: 46531690. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 08:54:03,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:54:03,809][100936] Updated weights for policy 0, policy_version 90810 (0.0008) +[2023-10-14 08:54:07,324][100917] Updated weights for policy 1, policy_version 90952 (0.0008) +[2023-10-14 08:54:07,683][100936] Updated weights for policy 0, policy_version 90820 (0.0007) +[2023-10-14 08:54:07,686][100917] Updated weights for policy 1, policy_version 90962 (0.0007) +[2023-10-14 08:54:08,058][100917] Updated weights for policy 1, policy_version 90972 (0.0008) +[2023-10-14 08:54:08,064][100936] Updated weights for policy 0, policy_version 90830 (0.0008) +[2023-10-14 08:54:08,426][100936] Updated weights for policy 0, policy_version 90840 (0.0008) +[2023-10-14 08:54:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 186155008. Throughput: 0: 1644.9, 1: 1644.8. Samples: 46550170. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 08:54:08,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:54:12,245][100917] Updated weights for policy 1, policy_version 90982 (0.0008) +[2023-10-14 08:54:12,611][100917] Updated weights for policy 1, policy_version 90992 (0.0007) +[2023-10-14 08:54:12,742][100936] Updated weights for policy 0, policy_version 90850 (0.0008) +[2023-10-14 08:54:12,991][100917] Updated weights for policy 1, policy_version 91002 (0.0007) +[2023-10-14 08:54:13,109][100936] Updated weights for policy 0, policy_version 90860 (0.0008) +[2023-10-14 08:54:13,471][100936] Updated weights for policy 0, policy_version 90870 (0.0009) +[2023-10-14 08:54:13,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 186220544. Throughput: 0: 1662.1, 1: 1664.2. Samples: 46561060. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 08:54:13,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:54:13,851][100936] Updated weights for policy 0, policy_version 90880 (0.0010) +[2023-10-14 08:54:17,173][100917] Updated weights for policy 1, policy_version 91012 (0.0007) +[2023-10-14 08:54:17,544][100917] Updated weights for policy 1, policy_version 91022 (0.0008) +[2023-10-14 08:54:17,886][100936] Updated weights for policy 0, policy_version 90890 (0.0009) +[2023-10-14 08:54:17,914][100917] Updated weights for policy 1, policy_version 91032 (0.0009) +[2023-10-14 08:54:18,260][100936] Updated weights for policy 0, policy_version 90900 (0.0009) +[2023-10-14 08:54:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 186286080. Throughput: 0: 1659.1, 1: 1662.5. Samples: 46581348. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 08:54:18,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:54:18,621][100936] Updated weights for policy 0, policy_version 90910 (0.0008) +[2023-10-14 08:54:21,825][100917] Updated weights for policy 1, policy_version 91042 (0.0008) +[2023-10-14 08:54:22,239][100917] Updated weights for policy 1, policy_version 91052 (0.0008) +[2023-10-14 08:54:22,602][100917] Updated weights for policy 1, policy_version 91062 (0.0008) +[2023-10-14 08:54:22,759][100936] Updated weights for policy 0, policy_version 90920 (0.0007) +[2023-10-14 08:54:22,965][100917] Updated weights for policy 1, policy_version 91072 (0.0007) +[2023-10-14 08:54:23,118][100936] Updated weights for policy 0, policy_version 90930 (0.0007) +[2023-10-14 08:54:23,482][100936] Updated weights for policy 0, policy_version 90940 (0.0009) +[2023-10-14 08:54:23,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 186351616. Throughput: 0: 1640.3, 1: 1648.2. Samples: 46599638. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 08:54:23,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:54:27,188][100917] Updated weights for policy 1, policy_version 91082 (0.0008) +[2023-10-14 08:54:27,557][100917] Updated weights for policy 1, policy_version 91092 (0.0008) +[2023-10-14 08:54:27,820][100936] Updated weights for policy 0, policy_version 90950 (0.0008) +[2023-10-14 08:54:27,933][100917] Updated weights for policy 1, policy_version 91102 (0.0008) +[2023-10-14 08:54:28,195][100936] Updated weights for policy 0, policy_version 90960 (0.0008) +[2023-10-14 08:54:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 186417152. Throughput: 0: 1653.4, 1: 1660.4. Samples: 46610380. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 08:54:28,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:54:28,572][100936] Updated weights for policy 0, policy_version 90970 (0.0008) +[2023-10-14 08:54:32,058][100917] Updated weights for policy 1, policy_version 91112 (0.0007) +[2023-10-14 08:54:32,441][100917] Updated weights for policy 1, policy_version 91122 (0.0007) +[2023-10-14 08:54:32,605][100936] Updated weights for policy 0, policy_version 90980 (0.0007) +[2023-10-14 08:54:32,809][100917] Updated weights for policy 1, policy_version 91132 (0.0007) +[2023-10-14 08:54:32,978][100936] Updated weights for policy 0, policy_version 90990 (0.0008) +[2023-10-14 08:54:33,335][100936] Updated weights for policy 0, policy_version 91000 (0.0007) +[2023-10-14 08:54:33,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 186482688. Throughput: 0: 1656.3, 1: 1656.5. Samples: 46630762. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 08:54:33,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:54:36,749][100917] Updated weights for policy 1, policy_version 91142 (0.0007) +[2023-10-14 08:54:37,128][100917] Updated weights for policy 1, policy_version 91152 (0.0008) +[2023-10-14 08:54:37,458][100936] Updated weights for policy 0, policy_version 91010 (0.0007) +[2023-10-14 08:54:37,493][100917] Updated weights for policy 1, policy_version 91162 (0.0010) +[2023-10-14 08:54:37,825][100936] Updated weights for policy 0, policy_version 91020 (0.0008) +[2023-10-14 08:54:38,196][100936] Updated weights for policy 0, policy_version 91030 (0.0008) +[2023-10-14 08:54:38,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 186548224. Throughput: 0: 1639.6, 1: 1651.4. Samples: 46649216. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 08:54:38,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:54:38,561][100936] Updated weights for policy 0, policy_version 91040 (0.0008) +[2023-10-14 08:54:41,690][100917] Updated weights for policy 1, policy_version 91172 (0.0009) +[2023-10-14 08:54:42,070][100917] Updated weights for policy 1, policy_version 91182 (0.0011) +[2023-10-14 08:54:42,429][100917] Updated weights for policy 1, policy_version 91192 (0.0007) +[2023-10-14 08:54:42,763][100936] Updated weights for policy 0, policy_version 91050 (0.0009) +[2023-10-14 08:54:43,139][100936] Updated weights for policy 0, policy_version 91060 (0.0007) +[2023-10-14 08:54:43,507][100936] Updated weights for policy 0, policy_version 91070 (0.0008) +[2023-10-14 08:54:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 186613760. Throughput: 0: 1653.3, 1: 1666.3. Samples: 46660570. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) +[2023-10-14 08:54:43,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:54:46,722][100917] Updated weights for policy 1, policy_version 91202 (0.0009) +[2023-10-14 08:54:47,101][100917] Updated weights for policy 1, policy_version 91212 (0.0007) +[2023-10-14 08:54:47,473][100917] Updated weights for policy 1, policy_version 91222 (0.0008) +[2023-10-14 08:54:47,770][100936] Updated weights for policy 0, policy_version 91080 (0.0009) +[2023-10-14 08:54:47,839][100917] Updated weights for policy 1, policy_version 91232 (0.0009) +[2023-10-14 08:54:48,136][100936] Updated weights for policy 0, policy_version 91090 (0.0010) +[2023-10-14 08:54:48,510][100936] Updated weights for policy 0, policy_version 91100 (0.0009) +[2023-10-14 08:54:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 186679296. Throughput: 0: 1650.3, 1: 1656.9. Samples: 46680512. Policy #0 lag: (min: 25.0, avg: 36.7, max: 57.0) +[2023-10-14 08:54:48,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:54:51,893][100917] Updated weights for policy 1, policy_version 91242 (0.0008) +[2023-10-14 08:54:52,253][100917] Updated weights for policy 1, policy_version 91252 (0.0009) +[2023-10-14 08:54:52,631][100917] Updated weights for policy 1, policy_version 91262 (0.0008) +[2023-10-14 08:54:52,792][100936] Updated weights for policy 0, policy_version 91110 (0.0008) +[2023-10-14 08:54:53,170][100936] Updated weights for policy 0, policy_version 91120 (0.0010) +[2023-10-14 08:54:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 186744832. Throughput: 0: 1648.1, 1: 1658.0. Samples: 46698944. Policy #0 lag: (min: 25.0, avg: 36.7, max: 57.0) +[2023-10-14 08:54:53,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:54:53,540][100936] Updated weights for policy 0, policy_version 91130 (0.0008) +[2023-10-14 08:54:56,721][100917] Updated weights for policy 1, policy_version 91272 (0.0009) +[2023-10-14 08:54:57,091][100917] Updated weights for policy 1, policy_version 91282 (0.0009) +[2023-10-14 08:54:57,471][100917] Updated weights for policy 1, policy_version 91292 (0.0007) +[2023-10-14 08:54:57,709][100936] Updated weights for policy 0, policy_version 91140 (0.0007) +[2023-10-14 08:54:58,083][100936] Updated weights for policy 0, policy_version 91150 (0.0008) +[2023-10-14 08:54:58,442][100936] Updated weights for policy 0, policy_version 91160 (0.0007) +[2023-10-14 08:54:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 186810368. Throughput: 0: 1646.0, 1: 1662.1. Samples: 46709926. Policy #0 lag: (min: 25.0, avg: 36.7, max: 57.0) +[2023-10-14 08:54:58,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:55:01,675][100917] Updated weights for policy 1, policy_version 91302 (0.0008) +[2023-10-14 08:55:02,065][100917] Updated weights for policy 1, policy_version 91312 (0.0008) +[2023-10-14 08:55:02,436][100917] Updated weights for policy 1, policy_version 91322 (0.0010) +[2023-10-14 08:55:02,503][100936] Updated weights for policy 0, policy_version 91170 (0.0008) +[2023-10-14 08:55:02,867][100936] Updated weights for policy 0, policy_version 91180 (0.0007) +[2023-10-14 08:55:03,246][100936] Updated weights for policy 0, policy_version 91190 (0.0007) +[2023-10-14 08:55:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 186875904. Throughput: 0: 1649.2, 1: 1649.5. Samples: 46729788. Policy #0 lag: (min: 25.0, avg: 36.7, max: 57.0) +[2023-10-14 08:55:03,513][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:55:03,605][100936] Updated weights for policy 0, policy_version 91200 (0.0008) +[2023-10-14 08:55:06,644][100917] Updated weights for policy 1, policy_version 91332 (0.0008) +[2023-10-14 08:55:07,029][100917] Updated weights for policy 1, policy_version 91342 (0.0008) +[2023-10-14 08:55:07,410][100917] Updated weights for policy 1, policy_version 91352 (0.0008) +[2023-10-14 08:55:07,829][100936] Updated weights for policy 0, policy_version 91210 (0.0008) +[2023-10-14 08:55:08,187][100936] Updated weights for policy 0, policy_version 91220 (0.0010) +[2023-10-14 08:55:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 186941440. Throughput: 0: 1644.9, 1: 1655.3. Samples: 46748144. Policy #0 lag: (min: 25.0, avg: 36.7, max: 57.0) +[2023-10-14 08:55:08,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:55:08,563][100936] Updated weights for policy 0, policy_version 91230 (0.0011) +[2023-10-14 08:55:11,505][100917] Updated weights for policy 1, policy_version 91362 (0.0007) +[2023-10-14 08:55:11,884][100917] Updated weights for policy 1, policy_version 91372 (0.0009) +[2023-10-14 08:55:12,251][100917] Updated weights for policy 1, policy_version 91382 (0.0010) +[2023-10-14 08:55:12,576][100936] Updated weights for policy 0, policy_version 91240 (0.0008) +[2023-10-14 08:55:12,624][100917] Updated weights for policy 1, policy_version 91392 (0.0009) +[2023-10-14 08:55:12,953][100936] Updated weights for policy 0, policy_version 91250 (0.0007) +[2023-10-14 08:55:13,320][100936] Updated weights for policy 0, policy_version 91260 (0.0009) +[2023-10-14 08:55:13,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 187039744. Throughput: 0: 1652.6, 1: 1660.4. Samples: 46759466. Policy #0 lag: (min: 25.0, avg: 36.7, max: 57.0) +[2023-10-14 08:55:13,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:55:16,744][100917] Updated weights for policy 1, policy_version 91402 (0.0009) +[2023-10-14 08:55:17,113][100917] Updated weights for policy 1, policy_version 91412 (0.0010) +[2023-10-14 08:55:17,413][100936] Updated weights for policy 0, policy_version 91270 (0.0008) +[2023-10-14 08:55:17,489][100917] Updated weights for policy 1, policy_version 91422 (0.0007) +[2023-10-14 08:55:17,789][100936] Updated weights for policy 0, policy_version 91280 (0.0011) +[2023-10-14 08:55:18,150][100936] Updated weights for policy 0, policy_version 91290 (0.0009) +[2023-10-14 08:55:18,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 187105280. Throughput: 0: 1647.6, 1: 1647.7. Samples: 46779050. Policy #0 lag: (min: 25.0, avg: 36.7, max: 57.0) +[2023-10-14 08:55:18,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:55:21,479][100917] Updated weights for policy 1, policy_version 91432 (0.0009) +[2023-10-14 08:55:21,841][100917] Updated weights for policy 1, policy_version 91442 (0.0007) +[2023-10-14 08:55:22,215][100917] Updated weights for policy 1, policy_version 91452 (0.0007) +[2023-10-14 08:55:22,288][100936] Updated weights for policy 0, policy_version 91300 (0.0007) +[2023-10-14 08:55:22,654][100936] Updated weights for policy 0, policy_version 91310 (0.0009) +[2023-10-14 08:55:23,027][100936] Updated weights for policy 0, policy_version 91320 (0.0009) +[2023-10-14 08:55:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 187170816. Throughput: 0: 1647.6, 1: 1655.9. Samples: 46797870. Policy #0 lag: (min: 25.0, avg: 36.7, max: 57.0) +[2023-10-14 08:55:23,512][99942] Avg episode reward: [(0, '0.920'), (1, '1.000')] +[2023-10-14 08:55:26,230][100917] Updated weights for policy 1, policy_version 91462 (0.0010) +[2023-10-14 08:55:26,600][100917] Updated weights for policy 1, policy_version 91472 (0.0009) +[2023-10-14 08:55:26,978][100917] Updated weights for policy 1, policy_version 91482 (0.0009) +[2023-10-14 08:55:27,119][100936] Updated weights for policy 0, policy_version 91330 (0.0009) +[2023-10-14 08:55:27,477][100936] Updated weights for policy 0, policy_version 91340 (0.0008) +[2023-10-14 08:55:27,845][100936] Updated weights for policy 0, policy_version 91350 (0.0009) +[2023-10-14 08:55:28,211][100936] Updated weights for policy 0, policy_version 91360 (0.0009) +[2023-10-14 08:55:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 187236352. Throughput: 0: 1652.0, 1: 1660.3. Samples: 46809626. Policy #0 lag: (min: 25.0, avg: 36.7, max: 57.0) +[2023-10-14 08:55:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:55:31,118][100917] Updated weights for policy 1, policy_version 91492 (0.0010) +[2023-10-14 08:55:31,489][100917] Updated weights for policy 1, policy_version 91502 (0.0010) +[2023-10-14 08:55:31,865][100917] Updated weights for policy 1, policy_version 91512 (0.0009) +[2023-10-14 08:55:32,369][100936] Updated weights for policy 0, policy_version 91370 (0.0007) +[2023-10-14 08:55:32,731][100936] Updated weights for policy 0, policy_version 91380 (0.0008) +[2023-10-14 08:55:33,105][100936] Updated weights for policy 0, policy_version 91390 (0.0007) +[2023-10-14 08:55:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 187301888. Throughput: 0: 1647.3, 1: 1646.8. Samples: 46828746. Policy #0 lag: (min: 25.0, avg: 36.7, max: 57.0) +[2023-10-14 08:55:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:55:36,027][100917] Updated weights for policy 1, policy_version 91522 (0.0008) +[2023-10-14 08:55:36,398][100917] Updated weights for policy 1, policy_version 91532 (0.0008) +[2023-10-14 08:55:36,756][100917] Updated weights for policy 1, policy_version 91542 (0.0008) +[2023-10-14 08:55:37,124][100917] Updated weights for policy 1, policy_version 91552 (0.0009) +[2023-10-14 08:55:37,267][100936] Updated weights for policy 0, policy_version 91400 (0.0007) +[2023-10-14 08:55:37,636][100936] Updated weights for policy 0, policy_version 91410 (0.0009) +[2023-10-14 08:55:38,006][100936] Updated weights for policy 0, policy_version 91420 (0.0008) +[2023-10-14 08:55:38,512][99942] Fps is (10 sec: 13106.6, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 187367424. Throughput: 0: 1653.9, 1: 1660.5. Samples: 46848094. Policy #0 lag: (min: 25.0, avg: 36.7, max: 57.0) +[2023-10-14 08:55:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:55:38,523][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000091424_93618176.pth... +[2023-10-14 08:55:38,523][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000091552_93749248.pth... +[2023-10-14 08:55:38,553][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000089856_92012544.pth +[2023-10-14 08:55:38,564][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000089984_92143616.pth +[2023-10-14 08:55:41,161][100917] Updated weights for policy 1, policy_version 91562 (0.0008) +[2023-10-14 08:55:41,533][100917] Updated weights for policy 1, policy_version 91572 (0.0009) +[2023-10-14 08:55:41,902][100917] Updated weights for policy 1, policy_version 91582 (0.0009) +[2023-10-14 08:55:41,975][100936] Updated weights for policy 0, policy_version 91430 (0.0009) +[2023-10-14 08:55:42,348][100936] Updated weights for policy 0, policy_version 91440 (0.0008) +[2023-10-14 08:55:42,723][100936] Updated weights for policy 0, policy_version 91450 (0.0009) +[2023-10-14 08:55:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 187432960. Throughput: 0: 1664.8, 1: 1658.1. Samples: 46859460. Policy #0 lag: (min: 10.0, avg: 15.2, max: 42.0) +[2023-10-14 08:55:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:55:45,803][100917] Updated weights for policy 1, policy_version 91592 (0.0007) +[2023-10-14 08:55:46,178][100917] Updated weights for policy 1, policy_version 91602 (0.0007) +[2023-10-14 08:55:46,546][100917] Updated weights for policy 1, policy_version 91612 (0.0008) +[2023-10-14 08:55:46,851][100936] Updated weights for policy 0, policy_version 91460 (0.0009) +[2023-10-14 08:55:47,222][100936] Updated weights for policy 0, policy_version 91470 (0.0007) +[2023-10-14 08:55:47,586][100936] Updated weights for policy 0, policy_version 91480 (0.0008) +[2023-10-14 08:55:48,512][99942] Fps is (10 sec: 13107.7, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 187498496. Throughput: 0: 1644.8, 1: 1651.8. Samples: 46878134. Policy #0 lag: (min: 10.0, avg: 15.2, max: 42.0) +[2023-10-14 08:55:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:55:50,745][100917] Updated weights for policy 1, policy_version 91622 (0.0009) +[2023-10-14 08:55:51,121][100917] Updated weights for policy 1, policy_version 91632 (0.0008) +[2023-10-14 08:55:51,494][100917] Updated weights for policy 1, policy_version 91642 (0.0009) +[2023-10-14 08:55:51,847][100936] Updated weights for policy 0, policy_version 91490 (0.0008) +[2023-10-14 08:55:52,226][100936] Updated weights for policy 0, policy_version 91500 (0.0009) +[2023-10-14 08:55:52,611][100936] Updated weights for policy 0, policy_version 91510 (0.0008) +[2023-10-14 08:55:52,987][100936] Updated weights for policy 0, policy_version 91520 (0.0010) +[2023-10-14 08:55:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 187564032. Throughput: 0: 1654.1, 1: 1673.7. Samples: 46897894. Policy #0 lag: (min: 10.0, avg: 15.2, max: 42.0) +[2023-10-14 08:55:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:55:55,671][100917] Updated weights for policy 1, policy_version 91652 (0.0010) +[2023-10-14 08:55:56,067][100917] Updated weights for policy 1, policy_version 91662 (0.0009) +[2023-10-14 08:55:56,429][100917] Updated weights for policy 1, policy_version 91672 (0.0009) +[2023-10-14 08:55:56,978][100936] Updated weights for policy 0, policy_version 91530 (0.0009) +[2023-10-14 08:55:57,357][100936] Updated weights for policy 0, policy_version 91540 (0.0008) +[2023-10-14 08:55:57,730][100936] Updated weights for policy 0, policy_version 91550 (0.0009) +[2023-10-14 08:55:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 187629568. Throughput: 0: 1662.4, 1: 1660.3. Samples: 46908988. Policy #0 lag: (min: 10.0, avg: 15.2, max: 42.0) +[2023-10-14 08:55:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:56:00,582][100917] Updated weights for policy 1, policy_version 91682 (0.0010) +[2023-10-14 08:56:00,954][100917] Updated weights for policy 1, policy_version 91692 (0.0009) +[2023-10-14 08:56:01,317][100917] Updated weights for policy 1, policy_version 91702 (0.0009) +[2023-10-14 08:56:01,690][100917] Updated weights for policy 1, policy_version 91712 (0.0009) +[2023-10-14 08:56:01,745][100936] Updated weights for policy 0, policy_version 91560 (0.0008) +[2023-10-14 08:56:02,117][100936] Updated weights for policy 0, policy_version 91570 (0.0009) +[2023-10-14 08:56:02,484][100936] Updated weights for policy 0, policy_version 91580 (0.0008) +[2023-10-14 08:56:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 187695104. Throughput: 0: 1648.4, 1: 1655.4. Samples: 46927720. Policy #0 lag: (min: 10.0, avg: 15.2, max: 42.0) +[2023-10-14 08:56:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:56:05,733][100917] Updated weights for policy 1, policy_version 91722 (0.0009) +[2023-10-14 08:56:06,103][100917] Updated weights for policy 1, policy_version 91732 (0.0010) +[2023-10-14 08:56:06,490][100917] Updated weights for policy 1, policy_version 91742 (0.0009) +[2023-10-14 08:56:06,527][100936] Updated weights for policy 0, policy_version 91590 (0.0008) +[2023-10-14 08:56:06,897][100936] Updated weights for policy 0, policy_version 91600 (0.0007) +[2023-10-14 08:56:07,267][100936] Updated weights for policy 0, policy_version 91610 (0.0007) +[2023-10-14 08:56:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 187760640. Throughput: 0: 1673.9, 1: 1670.2. Samples: 46948352. Policy #0 lag: (min: 10.0, avg: 15.2, max: 42.0) +[2023-10-14 08:56:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:56:10,463][100917] Updated weights for policy 1, policy_version 91752 (0.0009) +[2023-10-14 08:56:10,827][100917] Updated weights for policy 1, policy_version 91762 (0.0007) +[2023-10-14 08:56:11,129][100936] Updated weights for policy 0, policy_version 91620 (0.0008) +[2023-10-14 08:56:11,197][100917] Updated weights for policy 1, policy_version 91772 (0.0008) +[2023-10-14 08:56:11,489][100936] Updated weights for policy 0, policy_version 91630 (0.0011) +[2023-10-14 08:56:11,863][100936] Updated weights for policy 0, policy_version 91640 (0.0008) +[2023-10-14 08:56:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 187826176. Throughput: 0: 1667.0, 1: 1649.0. Samples: 46958846. Policy #0 lag: (min: 10.0, avg: 15.2, max: 42.0) +[2023-10-14 08:56:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:56:15,408][100917] Updated weights for policy 1, policy_version 91782 (0.0009) +[2023-10-14 08:56:15,787][100917] Updated weights for policy 1, policy_version 91792 (0.0007) +[2023-10-14 08:56:16,161][100917] Updated weights for policy 1, policy_version 91802 (0.0007) +[2023-10-14 08:56:16,240][100936] Updated weights for policy 0, policy_version 91650 (0.0007) +[2023-10-14 08:56:16,605][100936] Updated weights for policy 0, policy_version 91660 (0.0008) +[2023-10-14 08:56:16,970][100936] Updated weights for policy 0, policy_version 91670 (0.0010) +[2023-10-14 08:56:17,347][100936] Updated weights for policy 0, policy_version 91680 (0.0008) +[2023-10-14 08:56:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 187891712. Throughput: 0: 1652.2, 1: 1662.7. Samples: 46977918. Policy #0 lag: (min: 10.0, avg: 15.2, max: 42.0) +[2023-10-14 08:56:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:56:20,364][100917] Updated weights for policy 1, policy_version 91812 (0.0009) +[2023-10-14 08:56:20,743][100917] Updated weights for policy 1, policy_version 91822 (0.0009) +[2023-10-14 08:56:21,119][100917] Updated weights for policy 1, policy_version 91832 (0.0007) +[2023-10-14 08:56:21,505][100936] Updated weights for policy 0, policy_version 91690 (0.0010) +[2023-10-14 08:56:21,876][100936] Updated weights for policy 0, policy_version 91700 (0.0007) +[2023-10-14 08:56:22,243][100936] Updated weights for policy 0, policy_version 91710 (0.0010) +[2023-10-14 08:56:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 187957248. Throughput: 0: 1667.3, 1: 1672.1. Samples: 46998366. Policy #0 lag: (min: 10.0, avg: 15.2, max: 42.0) +[2023-10-14 08:56:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:56:25,202][100917] Updated weights for policy 1, policy_version 91842 (0.0008) +[2023-10-14 08:56:25,581][100917] Updated weights for policy 1, policy_version 91852 (0.0010) +[2023-10-14 08:56:25,957][100917] Updated weights for policy 1, policy_version 91862 (0.0009) +[2023-10-14 08:56:26,326][100917] Updated weights for policy 1, policy_version 91872 (0.0007) +[2023-10-14 08:56:26,481][100936] Updated weights for policy 0, policy_version 91720 (0.0010) +[2023-10-14 08:56:26,849][100936] Updated weights for policy 0, policy_version 91730 (0.0010) +[2023-10-14 08:56:27,222][100936] Updated weights for policy 0, policy_version 91740 (0.0008) +[2023-10-14 08:56:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 188022784. Throughput: 0: 1662.9, 1: 1656.4. Samples: 47008828. Policy #0 lag: (min: 10.0, avg: 15.2, max: 42.0) +[2023-10-14 08:56:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:56:30,613][100917] Updated weights for policy 1, policy_version 91882 (0.0008) +[2023-10-14 08:56:30,992][100917] Updated weights for policy 1, policy_version 91892 (0.0010) +[2023-10-14 08:56:31,267][100936] Updated weights for policy 0, policy_version 91750 (0.0009) +[2023-10-14 08:56:31,365][100917] Updated weights for policy 1, policy_version 91902 (0.0010) +[2023-10-14 08:56:31,638][100936] Updated weights for policy 0, policy_version 91760 (0.0007) +[2023-10-14 08:56:32,006][100936] Updated weights for policy 0, policy_version 91770 (0.0008) +[2023-10-14 08:56:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 188088320. Throughput: 0: 1659.4, 1: 1665.3. Samples: 47027746. Policy #0 lag: (min: 10.0, avg: 15.2, max: 42.0) +[2023-10-14 08:56:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:56:35,448][100917] Updated weights for policy 1, policy_version 91912 (0.0010) +[2023-10-14 08:56:35,813][100917] Updated weights for policy 1, policy_version 91922 (0.0009) +[2023-10-14 08:56:36,187][100917] Updated weights for policy 1, policy_version 91932 (0.0010) +[2023-10-14 08:56:36,291][100936] Updated weights for policy 0, policy_version 91780 (0.0009) +[2023-10-14 08:56:36,650][100936] Updated weights for policy 0, policy_version 91790 (0.0008) +[2023-10-14 08:56:37,028][100936] Updated weights for policy 0, policy_version 91800 (0.0008) +[2023-10-14 08:56:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13329.4). Total num frames: 188153856. Throughput: 0: 1673.6, 1: 1663.1. Samples: 47048044. Policy #0 lag: (min: 10.0, avg: 15.2, max: 42.0) +[2023-10-14 08:56:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:56:40,403][100917] Updated weights for policy 1, policy_version 91942 (0.0009) +[2023-10-14 08:56:40,776][100917] Updated weights for policy 1, policy_version 91952 (0.0008) +[2023-10-14 08:56:41,086][100936] Updated weights for policy 0, policy_version 91810 (0.0009) +[2023-10-14 08:56:41,145][100917] Updated weights for policy 1, policy_version 91962 (0.0009) +[2023-10-14 08:56:41,463][100936] Updated weights for policy 0, policy_version 91820 (0.0008) +[2023-10-14 08:56:41,827][100936] Updated weights for policy 0, policy_version 91830 (0.0009) +[2023-10-14 08:56:42,192][100936] Updated weights for policy 0, policy_version 91840 (0.0008) +[2023-10-14 08:56:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 188219392. Throughput: 0: 1663.2, 1: 1658.0. Samples: 47058446. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:56:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:56:45,293][100917] Updated weights for policy 1, policy_version 91972 (0.0008) +[2023-10-14 08:56:45,657][100917] Updated weights for policy 1, policy_version 91982 (0.0007) +[2023-10-14 08:56:46,035][100917] Updated weights for policy 1, policy_version 91992 (0.0008) +[2023-10-14 08:56:46,292][100936] Updated weights for policy 0, policy_version 91850 (0.0009) +[2023-10-14 08:56:46,660][100936] Updated weights for policy 0, policy_version 91860 (0.0010) +[2023-10-14 08:56:47,041][100936] Updated weights for policy 0, policy_version 91870 (0.0010) +[2023-10-14 08:56:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 188284928. Throughput: 0: 1661.8, 1: 1668.0. Samples: 47077562. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:56:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:56:50,307][100917] Updated weights for policy 1, policy_version 92002 (0.0009) +[2023-10-14 08:56:50,730][100917] Updated weights for policy 1, policy_version 92012 (0.0008) +[2023-10-14 08:56:51,104][100917] Updated weights for policy 1, policy_version 92022 (0.0008) +[2023-10-14 08:56:51,224][100936] Updated weights for policy 0, policy_version 91880 (0.0008) +[2023-10-14 08:56:51,467][100917] Updated weights for policy 1, policy_version 92032 (0.0009) +[2023-10-14 08:56:51,594][100936] Updated weights for policy 0, policy_version 91890 (0.0008) +[2023-10-14 08:56:51,969][100936] Updated weights for policy 0, policy_version 91900 (0.0008) +[2023-10-14 08:56:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 188350464. Throughput: 0: 1658.6, 1: 1661.2. Samples: 47097742. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:56:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:56:55,426][100917] Updated weights for policy 1, policy_version 92042 (0.0009) +[2023-10-14 08:56:55,793][100917] Updated weights for policy 1, policy_version 92052 (0.0010) +[2023-10-14 08:56:56,146][100936] Updated weights for policy 0, policy_version 91910 (0.0009) +[2023-10-14 08:56:56,168][100917] Updated weights for policy 1, policy_version 92062 (0.0007) +[2023-10-14 08:56:56,514][100936] Updated weights for policy 0, policy_version 91920 (0.0009) +[2023-10-14 08:56:56,887][100936] Updated weights for policy 0, policy_version 91930 (0.0009) +[2023-10-14 08:56:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 188416000. Throughput: 0: 1652.2, 1: 1656.2. Samples: 47107726. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:56:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:57:00,218][100917] Updated weights for policy 1, policy_version 92072 (0.0008) +[2023-10-14 08:57:00,588][100917] Updated weights for policy 1, policy_version 92082 (0.0008) +[2023-10-14 08:57:00,905][100936] Updated weights for policy 0, policy_version 91940 (0.0008) +[2023-10-14 08:57:00,967][100917] Updated weights for policy 1, policy_version 92092 (0.0007) +[2023-10-14 08:57:01,267][100936] Updated weights for policy 0, policy_version 91950 (0.0008) +[2023-10-14 08:57:01,645][100936] Updated weights for policy 0, policy_version 91960 (0.0011) +[2023-10-14 08:57:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 188481536. Throughput: 0: 1656.6, 1: 1657.7. Samples: 47127062. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:57:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:57:05,108][100917] Updated weights for policy 1, policy_version 92102 (0.0010) +[2023-10-14 08:57:05,484][100917] Updated weights for policy 1, policy_version 92112 (0.0007) +[2023-10-14 08:57:05,773][100936] Updated weights for policy 0, policy_version 91970 (0.0008) +[2023-10-14 08:57:05,846][100917] Updated weights for policy 1, policy_version 92122 (0.0008) +[2023-10-14 08:57:06,129][100936] Updated weights for policy 0, policy_version 91980 (0.0008) +[2023-10-14 08:57:06,501][100936] Updated weights for policy 0, policy_version 91990 (0.0009) +[2023-10-14 08:57:06,875][100936] Updated weights for policy 0, policy_version 92000 (0.0007) +[2023-10-14 08:57:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 188547072. Throughput: 0: 1658.8, 1: 1653.4. Samples: 47147414. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:57:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:57:09,798][100917] Updated weights for policy 1, policy_version 92132 (0.0009) +[2023-10-14 08:57:10,179][100917] Updated weights for policy 1, policy_version 92142 (0.0010) +[2023-10-14 08:57:10,555][100917] Updated weights for policy 1, policy_version 92152 (0.0007) +[2023-10-14 08:57:11,091][100936] Updated weights for policy 0, policy_version 92010 (0.0007) +[2023-10-14 08:57:11,450][100936] Updated weights for policy 0, policy_version 92020 (0.0009) +[2023-10-14 08:57:11,824][100936] Updated weights for policy 0, policy_version 92030 (0.0008) +[2023-10-14 08:57:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 188612608. Throughput: 0: 1647.0, 1: 1644.9. Samples: 47156962. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:57:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:57:14,715][100917] Updated weights for policy 1, policy_version 92162 (0.0008) +[2023-10-14 08:57:15,093][100917] Updated weights for policy 1, policy_version 92172 (0.0012) +[2023-10-14 08:57:15,476][100917] Updated weights for policy 1, policy_version 92182 (0.0007) +[2023-10-14 08:57:15,838][100917] Updated weights for policy 1, policy_version 92192 (0.0008) +[2023-10-14 08:57:15,862][100936] Updated weights for policy 0, policy_version 92040 (0.0009) +[2023-10-14 08:57:16,228][100936] Updated weights for policy 0, policy_version 92050 (0.0008) +[2023-10-14 08:57:16,587][100936] Updated weights for policy 0, policy_version 92060 (0.0009) +[2023-10-14 08:57:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 188678144. Throughput: 0: 1657.4, 1: 1657.6. Samples: 47176922. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:57:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:57:19,943][100917] Updated weights for policy 1, policy_version 92202 (0.0007) +[2023-10-14 08:57:20,324][100917] Updated weights for policy 1, policy_version 92212 (0.0008) +[2023-10-14 08:57:20,690][100917] Updated weights for policy 1, policy_version 92222 (0.0009) +[2023-10-14 08:57:20,812][100936] Updated weights for policy 0, policy_version 92070 (0.0008) +[2023-10-14 08:57:21,173][100936] Updated weights for policy 0, policy_version 92080 (0.0008) +[2023-10-14 08:57:21,538][100936] Updated weights for policy 0, policy_version 92090 (0.0011) +[2023-10-14 08:57:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 188743680. Throughput: 0: 1658.5, 1: 1662.9. Samples: 47197508. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:57:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:57:24,785][100917] Updated weights for policy 1, policy_version 92232 (0.0007) +[2023-10-14 08:57:25,150][100917] Updated weights for policy 1, policy_version 92242 (0.0007) +[2023-10-14 08:57:25,526][100917] Updated weights for policy 1, policy_version 92252 (0.0007) +[2023-10-14 08:57:25,653][100936] Updated weights for policy 0, policy_version 92100 (0.0009) +[2023-10-14 08:57:26,032][100936] Updated weights for policy 0, policy_version 92110 (0.0007) +[2023-10-14 08:57:26,395][100936] Updated weights for policy 0, policy_version 92120 (0.0009) +[2023-10-14 08:57:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 188809216. Throughput: 0: 1647.0, 1: 1652.0. Samples: 47206902. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:57:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:57:29,520][100917] Updated weights for policy 1, policy_version 92262 (0.0007) +[2023-10-14 08:57:29,892][100917] Updated weights for policy 1, policy_version 92272 (0.0007) +[2023-10-14 08:57:30,269][100917] Updated weights for policy 1, policy_version 92282 (0.0007) +[2023-10-14 08:57:30,572][100936] Updated weights for policy 0, policy_version 92130 (0.0008) +[2023-10-14 08:57:30,931][100936] Updated weights for policy 0, policy_version 92140 (0.0007) +[2023-10-14 08:57:31,300][100936] Updated weights for policy 0, policy_version 92150 (0.0009) +[2023-10-14 08:57:31,666][100936] Updated weights for policy 0, policy_version 92160 (0.0010) +[2023-10-14 08:57:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 188874752. Throughput: 0: 1657.0, 1: 1668.5. Samples: 47227208. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:57:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 08:57:34,355][100917] Updated weights for policy 1, policy_version 92292 (0.0009) +[2023-10-14 08:57:34,761][100917] Updated weights for policy 1, policy_version 92302 (0.0010) +[2023-10-14 08:57:35,130][100917] Updated weights for policy 1, policy_version 92312 (0.0007) +[2023-10-14 08:57:35,677][100936] Updated weights for policy 0, policy_version 92170 (0.0009) +[2023-10-14 08:57:36,048][100936] Updated weights for policy 0, policy_version 92180 (0.0011) +[2023-10-14 08:57:36,413][100936] Updated weights for policy 0, policy_version 92190 (0.0010) +[2023-10-14 08:57:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 188940288. Throughput: 0: 1661.3, 1: 1668.0. Samples: 47247560. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:57:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 08:57:38,525][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000092320_94535680.pth... +[2023-10-14 08:57:38,525][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000092192_94404608.pth... +[2023-10-14 08:57:38,556][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000090784_92962816.pth +[2023-10-14 08:57:38,560][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000090656_92831744.pth +[2023-10-14 08:57:39,309][100917] Updated weights for policy 1, policy_version 92322 (0.0008) +[2023-10-14 08:57:39,673][100917] Updated weights for policy 1, policy_version 92332 (0.0008) +[2023-10-14 08:57:40,033][100917] Updated weights for policy 1, policy_version 92342 (0.0011) +[2023-10-14 08:57:40,407][100917] Updated weights for policy 1, policy_version 92352 (0.0008) +[2023-10-14 08:57:40,573][100936] Updated weights for policy 0, policy_version 92200 (0.0009) +[2023-10-14 08:57:40,951][100936] Updated weights for policy 0, policy_version 92210 (0.0009) +[2023-10-14 08:57:41,318][100936] Updated weights for policy 0, policy_version 92220 (0.0011) +[2023-10-14 08:57:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 189005824. Throughput: 0: 1651.4, 1: 1656.9. Samples: 47256600. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:57:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 08:57:44,541][100917] Updated weights for policy 1, policy_version 92362 (0.0009) +[2023-10-14 08:57:44,920][100917] Updated weights for policy 1, policy_version 92372 (0.0008) +[2023-10-14 08:57:45,283][100917] Updated weights for policy 1, policy_version 92382 (0.0008) +[2023-10-14 08:57:45,518][100936] Updated weights for policy 0, policy_version 92230 (0.0008) +[2023-10-14 08:57:45,893][100936] Updated weights for policy 0, policy_version 92240 (0.0007) +[2023-10-14 08:57:46,260][100936] Updated weights for policy 0, policy_version 92250 (0.0008) +[2023-10-14 08:57:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 189071360. Throughput: 0: 1661.4, 1: 1666.9. Samples: 47276836. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:57:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.990')] +[2023-10-14 08:57:49,488][100917] Updated weights for policy 1, policy_version 92392 (0.0007) +[2023-10-14 08:57:49,855][100917] Updated weights for policy 1, policy_version 92402 (0.0008) +[2023-10-14 08:57:50,232][100917] Updated weights for policy 1, policy_version 92412 (0.0008) +[2023-10-14 08:57:50,349][100936] Updated weights for policy 0, policy_version 92260 (0.0008) +[2023-10-14 08:57:50,715][100936] Updated weights for policy 0, policy_version 92270 (0.0008) +[2023-10-14 08:57:51,079][100936] Updated weights for policy 0, policy_version 92280 (0.0009) +[2023-10-14 08:57:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 189136896. Throughput: 0: 1656.2, 1: 1663.3. Samples: 47296792. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:57:53,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 08:57:54,323][100917] Updated weights for policy 1, policy_version 92422 (0.0009) +[2023-10-14 08:57:54,710][100917] Updated weights for policy 1, policy_version 92432 (0.0010) +[2023-10-14 08:57:55,083][100917] Updated weights for policy 1, policy_version 92442 (0.0009) +[2023-10-14 08:57:55,232][100936] Updated weights for policy 0, policy_version 92290 (0.0007) +[2023-10-14 08:57:55,591][100936] Updated weights for policy 0, policy_version 92300 (0.0007) +[2023-10-14 08:57:55,957][100936] Updated weights for policy 0, policy_version 92310 (0.0009) +[2023-10-14 08:57:56,321][100936] Updated weights for policy 0, policy_version 92320 (0.0007) +[2023-10-14 08:57:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 189202432. Throughput: 0: 1648.0, 1: 1659.4. Samples: 47305798. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:57:58,512][99942] Avg episode reward: [(0, '0.980'), (1, '0.990')] +[2023-10-14 08:57:59,127][100917] Updated weights for policy 1, policy_version 92452 (0.0011) +[2023-10-14 08:57:59,498][100917] Updated weights for policy 1, policy_version 92462 (0.0009) +[2023-10-14 08:57:59,873][100917] Updated weights for policy 1, policy_version 92472 (0.0007) +[2023-10-14 08:58:00,590][100936] Updated weights for policy 0, policy_version 92330 (0.0009) +[2023-10-14 08:58:00,962][100936] Updated weights for policy 0, policy_version 92340 (0.0010) +[2023-10-14 08:58:01,344][100936] Updated weights for policy 0, policy_version 92350 (0.0009) +[2023-10-14 08:58:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 189267968. Throughput: 0: 1658.4, 1: 1658.0. Samples: 47326160. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:58:03,513][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:58:03,975][100917] Updated weights for policy 1, policy_version 92482 (0.0007) +[2023-10-14 08:58:04,346][100917] Updated weights for policy 1, policy_version 92492 (0.0010) +[2023-10-14 08:58:04,722][100917] Updated weights for policy 1, policy_version 92502 (0.0009) +[2023-10-14 08:58:05,083][100917] Updated weights for policy 1, policy_version 92512 (0.0009) +[2023-10-14 08:58:05,377][100936] Updated weights for policy 0, policy_version 92360 (0.0009) +[2023-10-14 08:58:05,752][100936] Updated weights for policy 0, policy_version 92370 (0.0010) +[2023-10-14 08:58:06,123][100936] Updated weights for policy 0, policy_version 92380 (0.0011) +[2023-10-14 08:58:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 189333504. Throughput: 0: 1657.2, 1: 1656.9. Samples: 47346642. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:58:08,512][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:58:09,295][100917] Updated weights for policy 1, policy_version 92522 (0.0010) +[2023-10-14 08:58:09,662][100917] Updated weights for policy 1, policy_version 92532 (0.0011) +[2023-10-14 08:58:10,031][100917] Updated weights for policy 1, policy_version 92542 (0.0007) +[2023-10-14 08:58:10,452][100936] Updated weights for policy 0, policy_version 92390 (0.0010) +[2023-10-14 08:58:10,827][100936] Updated weights for policy 0, policy_version 92400 (0.0009) +[2023-10-14 08:58:11,196][100936] Updated weights for policy 0, policy_version 92410 (0.0008) +[2023-10-14 08:58:13,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 189399040. Throughput: 0: 1652.4, 1: 1657.8. Samples: 47355862. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:58:13,512][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:58:14,262][100917] Updated weights for policy 1, policy_version 92552 (0.0010) +[2023-10-14 08:58:14,632][100917] Updated weights for policy 1, policy_version 92562 (0.0008) +[2023-10-14 08:58:15,009][100917] Updated weights for policy 1, policy_version 92572 (0.0010) +[2023-10-14 08:58:15,344][100936] Updated weights for policy 0, policy_version 92420 (0.0010) +[2023-10-14 08:58:15,710][100936] Updated weights for policy 0, policy_version 92430 (0.0007) +[2023-10-14 08:58:16,084][100936] Updated weights for policy 0, policy_version 92440 (0.0008) +[2023-10-14 08:58:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 189464576. Throughput: 0: 1656.3, 1: 1650.7. Samples: 47376022. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:58:18,513][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:58:19,207][100917] Updated weights for policy 1, policy_version 92582 (0.0009) +[2023-10-14 08:58:19,578][100917] Updated weights for policy 1, policy_version 92592 (0.0009) +[2023-10-14 08:58:19,946][100917] Updated weights for policy 1, policy_version 92602 (0.0007) +[2023-10-14 08:58:20,187][100936] Updated weights for policy 0, policy_version 92450 (0.0009) +[2023-10-14 08:58:20,561][100936] Updated weights for policy 0, policy_version 92460 (0.0008) +[2023-10-14 08:58:20,920][100936] Updated weights for policy 0, policy_version 92470 (0.0007) +[2023-10-14 08:58:21,288][100936] Updated weights for policy 0, policy_version 92480 (0.0010) +[2023-10-14 08:58:23,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 189530112. Throughput: 0: 1656.8, 1: 1655.2. Samples: 47396596. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:58:23,513][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:58:23,968][100917] Updated weights for policy 1, policy_version 92612 (0.0007) +[2023-10-14 08:58:24,367][100917] Updated weights for policy 1, policy_version 92622 (0.0010) +[2023-10-14 08:58:24,744][100917] Updated weights for policy 1, policy_version 92632 (0.0009) +[2023-10-14 08:58:25,274][100936] Updated weights for policy 0, policy_version 92490 (0.0007) +[2023-10-14 08:58:25,651][100936] Updated weights for policy 0, policy_version 92500 (0.0007) +[2023-10-14 08:58:26,018][100936] Updated weights for policy 0, policy_version 92510 (0.0010) +[2023-10-14 08:58:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 189595648. Throughput: 0: 1649.4, 1: 1660.6. Samples: 47405550. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:58:28,513][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:58:28,703][100917] Updated weights for policy 1, policy_version 92642 (0.0009) +[2023-10-14 08:58:29,077][100917] Updated weights for policy 1, policy_version 92652 (0.0009) +[2023-10-14 08:58:29,455][100917] Updated weights for policy 1, policy_version 92662 (0.0010) +[2023-10-14 08:58:29,820][100917] Updated weights for policy 1, policy_version 92672 (0.0008) +[2023-10-14 08:58:30,075][100936] Updated weights for policy 0, policy_version 92520 (0.0010) +[2023-10-14 08:58:30,433][100936] Updated weights for policy 0, policy_version 92530 (0.0008) +[2023-10-14 08:58:30,807][100936] Updated weights for policy 0, policy_version 92540 (0.0007) +[2023-10-14 08:58:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 189661184. Throughput: 0: 1666.3, 1: 1655.5. Samples: 47426318. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:58:33,512][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:58:33,956][100917] Updated weights for policy 1, policy_version 92682 (0.0010) +[2023-10-14 08:58:34,337][100917] Updated weights for policy 1, policy_version 92692 (0.0008) +[2023-10-14 08:58:34,714][100917] Updated weights for policy 1, policy_version 92702 (0.0010) +[2023-10-14 08:58:34,837][100936] Updated weights for policy 0, policy_version 92550 (0.0008) +[2023-10-14 08:58:35,205][100936] Updated weights for policy 0, policy_version 92560 (0.0009) +[2023-10-14 08:58:35,575][100936] Updated weights for policy 0, policy_version 92570 (0.0010) +[2023-10-14 08:58:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 189726720. Throughput: 0: 1667.3, 1: 1657.6. Samples: 47446410. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 08:58:38,513][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:58:38,923][100917] Updated weights for policy 1, policy_version 92712 (0.0009) +[2023-10-14 08:58:39,298][100917] Updated weights for policy 1, policy_version 92722 (0.0009) +[2023-10-14 08:58:39,669][100917] Updated weights for policy 1, policy_version 92732 (0.0008) +[2023-10-14 08:58:39,689][100936] Updated weights for policy 0, policy_version 92580 (0.0008) +[2023-10-14 08:58:40,045][100936] Updated weights for policy 0, policy_version 92590 (0.0010) +[2023-10-14 08:58:40,426][100936] Updated weights for policy 0, policy_version 92600 (0.0009) +[2023-10-14 08:58:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 189792256. Throughput: 0: 1663.8, 1: 1659.2. Samples: 47455332. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) +[2023-10-14 08:58:43,513][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:58:43,748][100917] Updated weights for policy 1, policy_version 92742 (0.0009) +[2023-10-14 08:58:44,110][100917] Updated weights for policy 1, policy_version 92752 (0.0009) +[2023-10-14 08:58:44,485][100936] Updated weights for policy 0, policy_version 92610 (0.0007) +[2023-10-14 08:58:44,485][100917] Updated weights for policy 1, policy_version 92762 (0.0010) +[2023-10-14 08:58:44,848][100936] Updated weights for policy 0, policy_version 92620 (0.0009) +[2023-10-14 08:58:45,219][100936] Updated weights for policy 0, policy_version 92630 (0.0009) +[2023-10-14 08:58:45,579][100936] Updated weights for policy 0, policy_version 92640 (0.0009) +[2023-10-14 08:58:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 189857792. Throughput: 0: 1667.0, 1: 1659.4. Samples: 47475848. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) +[2023-10-14 08:58:48,512][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:58:48,519][100917] Updated weights for policy 1, policy_version 92772 (0.0009) +[2023-10-14 08:58:48,898][100917] Updated weights for policy 1, policy_version 92782 (0.0010) +[2023-10-14 08:58:49,267][100917] Updated weights for policy 1, policy_version 92792 (0.0010) +[2023-10-14 08:58:49,900][100936] Updated weights for policy 0, policy_version 92650 (0.0007) +[2023-10-14 08:58:50,268][100936] Updated weights for policy 0, policy_version 92660 (0.0009) +[2023-10-14 08:58:50,628][100936] Updated weights for policy 0, policy_version 92670 (0.0009) +[2023-10-14 08:58:53,385][100917] Updated weights for policy 1, policy_version 92802 (0.0009) +[2023-10-14 08:58:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 189923328. Throughput: 0: 1669.7, 1: 1659.9. Samples: 47496474. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) +[2023-10-14 08:58:53,512][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:58:53,752][100917] Updated weights for policy 1, policy_version 92812 (0.0008) +[2023-10-14 08:58:54,129][100917] Updated weights for policy 1, policy_version 92822 (0.0011) +[2023-10-14 08:58:54,494][100917] Updated weights for policy 1, policy_version 92832 (0.0009) +[2023-10-14 08:58:54,637][100936] Updated weights for policy 0, policy_version 92680 (0.0009) +[2023-10-14 08:58:55,001][100936] Updated weights for policy 0, policy_version 92690 (0.0008) +[2023-10-14 08:58:55,374][100936] Updated weights for policy 0, policy_version 92700 (0.0009) +[2023-10-14 08:58:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 189988864. Throughput: 0: 1666.6, 1: 1655.3. Samples: 47505346. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) +[2023-10-14 08:58:58,513][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:58:58,714][100917] Updated weights for policy 1, policy_version 92842 (0.0008) +[2023-10-14 08:58:59,076][100917] Updated weights for policy 1, policy_version 92852 (0.0007) +[2023-10-14 08:58:59,359][100936] Updated weights for policy 0, policy_version 92710 (0.0008) +[2023-10-14 08:58:59,451][100917] Updated weights for policy 1, policy_version 92862 (0.0007) +[2023-10-14 08:58:59,724][100936] Updated weights for policy 0, policy_version 92720 (0.0007) +[2023-10-14 08:59:00,090][100936] Updated weights for policy 0, policy_version 92730 (0.0008) +[2023-10-14 08:59:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 190054400. Throughput: 0: 1669.2, 1: 1658.3. Samples: 47525760. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) +[2023-10-14 08:59:03,513][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:59:03,635][100917] Updated weights for policy 1, policy_version 92872 (0.0009) +[2023-10-14 08:59:04,006][100917] Updated weights for policy 1, policy_version 92882 (0.0009) +[2023-10-14 08:59:04,226][100936] Updated weights for policy 0, policy_version 92740 (0.0010) +[2023-10-14 08:59:04,376][100917] Updated weights for policy 1, policy_version 92892 (0.0007) +[2023-10-14 08:59:04,596][100936] Updated weights for policy 0, policy_version 92750 (0.0008) +[2023-10-14 08:59:04,966][100936] Updated weights for policy 0, policy_version 92760 (0.0008) +[2023-10-14 08:59:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 190119936. Throughput: 0: 1664.7, 1: 1656.9. Samples: 47546064. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) +[2023-10-14 08:59:08,512][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:59:08,579][100917] Updated weights for policy 1, policy_version 92902 (0.0009) +[2023-10-14 08:59:08,979][100917] Updated weights for policy 1, policy_version 92912 (0.0009) +[2023-10-14 08:59:09,266][100936] Updated weights for policy 0, policy_version 92770 (0.0007) +[2023-10-14 08:59:09,357][100917] Updated weights for policy 1, policy_version 92922 (0.0010) +[2023-10-14 08:59:09,637][100936] Updated weights for policy 0, policy_version 92780 (0.0009) +[2023-10-14 08:59:10,005][100936] Updated weights for policy 0, policy_version 92790 (0.0010) +[2023-10-14 08:59:10,380][100936] Updated weights for policy 0, policy_version 92800 (0.0008) +[2023-10-14 08:59:13,463][100917] Updated weights for policy 1, policy_version 92932 (0.0007) +[2023-10-14 08:59:13,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 190185472. Throughput: 0: 1666.8, 1: 1654.4. Samples: 47555008. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) +[2023-10-14 08:59:13,513][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:59:13,841][100917] Updated weights for policy 1, policy_version 92942 (0.0007) +[2023-10-14 08:59:14,221][100917] Updated weights for policy 1, policy_version 92952 (0.0007) +[2023-10-14 08:59:14,707][100936] Updated weights for policy 0, policy_version 92810 (0.0008) +[2023-10-14 08:59:15,076][100936] Updated weights for policy 0, policy_version 92820 (0.0008) +[2023-10-14 08:59:15,448][100936] Updated weights for policy 0, policy_version 92830 (0.0007) +[2023-10-14 08:59:18,284][100917] Updated weights for policy 1, policy_version 92962 (0.0008) +[2023-10-14 08:59:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 190251008. Throughput: 0: 1656.6, 1: 1661.0. Samples: 47575612. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) +[2023-10-14 08:59:18,512][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:59:18,658][100917] Updated weights for policy 1, policy_version 92972 (0.0011) +[2023-10-14 08:59:19,034][100917] Updated weights for policy 1, policy_version 92982 (0.0008) +[2023-10-14 08:59:19,400][100917] Updated weights for policy 1, policy_version 92992 (0.0009) +[2023-10-14 08:59:19,556][100936] Updated weights for policy 0, policy_version 92840 (0.0008) +[2023-10-14 08:59:19,924][100936] Updated weights for policy 0, policy_version 92850 (0.0010) +[2023-10-14 08:59:20,292][100936] Updated weights for policy 0, policy_version 92860 (0.0011) +[2023-10-14 08:59:23,434][100917] Updated weights for policy 1, policy_version 93002 (0.0007) +[2023-10-14 08:59:23,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 190316544. Throughput: 0: 1655.3, 1: 1666.5. Samples: 47595888. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) +[2023-10-14 08:59:23,512][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:59:23,801][100917] Updated weights for policy 1, policy_version 93012 (0.0008) +[2023-10-14 08:59:24,175][100917] Updated weights for policy 1, policy_version 93022 (0.0009) +[2023-10-14 08:59:24,577][100936] Updated weights for policy 0, policy_version 92870 (0.0011) +[2023-10-14 08:59:24,946][100936] Updated weights for policy 0, policy_version 92880 (0.0010) +[2023-10-14 08:59:25,307][100936] Updated weights for policy 0, policy_version 92890 (0.0009) +[2023-10-14 08:59:28,217][100917] Updated weights for policy 1, policy_version 93032 (0.0009) +[2023-10-14 08:59:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 190382080. Throughput: 0: 1656.6, 1: 1669.0. Samples: 47604984. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) +[2023-10-14 08:59:28,512][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:59:28,589][100917] Updated weights for policy 1, policy_version 93042 (0.0007) +[2023-10-14 08:59:28,955][100917] Updated weights for policy 1, policy_version 93052 (0.0008) +[2023-10-14 08:59:29,207][100936] Updated weights for policy 0, policy_version 92900 (0.0008) +[2023-10-14 08:59:29,579][100936] Updated weights for policy 0, policy_version 92910 (0.0007) +[2023-10-14 08:59:29,951][100936] Updated weights for policy 0, policy_version 92920 (0.0009) +[2023-10-14 08:59:32,960][100917] Updated weights for policy 1, policy_version 93062 (0.0007) +[2023-10-14 08:59:33,327][100917] Updated weights for policy 1, policy_version 93072 (0.0008) +[2023-10-14 08:59:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 190447616. Throughput: 0: 1656.1, 1: 1668.2. Samples: 47625444. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) +[2023-10-14 08:59:33,512][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:59:33,715][100917] Updated weights for policy 1, policy_version 93082 (0.0008) +[2023-10-14 08:59:34,092][100936] Updated weights for policy 0, policy_version 92930 (0.0008) +[2023-10-14 08:59:34,484][100936] Updated weights for policy 0, policy_version 92940 (0.0007) +[2023-10-14 08:59:34,847][100936] Updated weights for policy 0, policy_version 92950 (0.0008) +[2023-10-14 08:59:35,215][100936] Updated weights for policy 0, policy_version 92960 (0.0007) +[2023-10-14 08:59:37,816][100917] Updated weights for policy 1, policy_version 93092 (0.0008) +[2023-10-14 08:59:38,194][100917] Updated weights for policy 1, policy_version 93102 (0.0009) +[2023-10-14 08:59:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 190513152. Throughput: 0: 1657.5, 1: 1659.8. Samples: 47645752. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) +[2023-10-14 08:59:38,513][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:59:38,521][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000092960_95191040.pth... +[2023-10-14 08:59:38,555][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000091424_93618176.pth +[2023-10-14 08:59:38,559][100560] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p0/milestones/checkpoint_000092960_95191040.pth +[2023-10-14 08:59:38,560][100917] Updated weights for policy 1, policy_version 93112 (0.0009) +[2023-10-14 08:59:38,860][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000093120_95354880.pth... +[2023-10-14 08:59:38,893][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000091552_93749248.pth +[2023-10-14 08:59:38,897][100681] Saving a milestone ./train_atari/atari_privateye_APPO/checkpoint_p1/milestones/checkpoint_000093120_95354880.pth +[2023-10-14 08:59:39,098][100936] Updated weights for policy 0, policy_version 92970 (0.0008) +[2023-10-14 08:59:39,461][100936] Updated weights for policy 0, policy_version 92980 (0.0008) +[2023-10-14 08:59:39,834][100936] Updated weights for policy 0, policy_version 92990 (0.0009) +[2023-10-14 08:59:42,756][100917] Updated weights for policy 1, policy_version 93122 (0.0011) +[2023-10-14 08:59:43,121][100917] Updated weights for policy 1, policy_version 93132 (0.0011) +[2023-10-14 08:59:43,488][100917] Updated weights for policy 1, policy_version 93142 (0.0009) +[2023-10-14 08:59:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 190578688. Throughput: 0: 1655.7, 1: 1669.5. Samples: 47654976. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 08:59:43,512][99942] Avg episode reward: [(0, '0.970'), (1, '0.990')] +[2023-10-14 08:59:43,859][100917] Updated weights for policy 1, policy_version 93152 (0.0009) +[2023-10-14 08:59:44,013][100936] Updated weights for policy 0, policy_version 93000 (0.0007) +[2023-10-14 08:59:44,378][100936] Updated weights for policy 0, policy_version 93010 (0.0007) +[2023-10-14 08:59:44,751][100936] Updated weights for policy 0, policy_version 93020 (0.0009) +[2023-10-14 08:59:47,902][100917] Updated weights for policy 1, policy_version 93162 (0.0007) +[2023-10-14 08:59:48,266][100917] Updated weights for policy 1, policy_version 93172 (0.0007) +[2023-10-14 08:59:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 190644224. Throughput: 0: 1657.6, 1: 1669.6. Samples: 47675484. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 08:59:48,512][99942] Avg episode reward: [(0, '0.880'), (1, '0.990')] +[2023-10-14 08:59:48,636][100917] Updated weights for policy 1, policy_version 93182 (0.0008) +[2023-10-14 08:59:48,893][100936] Updated weights for policy 0, policy_version 93030 (0.0010) +[2023-10-14 08:59:49,268][100936] Updated weights for policy 0, policy_version 93040 (0.0009) +[2023-10-14 08:59:49,628][100936] Updated weights for policy 0, policy_version 93050 (0.0010) +[2023-10-14 08:59:52,947][100917] Updated weights for policy 1, policy_version 93192 (0.0009) +[2023-10-14 08:59:53,326][100917] Updated weights for policy 1, policy_version 93202 (0.0007) +[2023-10-14 08:59:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 190709760. Throughput: 0: 1662.2, 1: 1661.2. Samples: 47695616. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 08:59:53,512][99942] Avg episode reward: [(0, '0.880'), (1, '0.990')] +[2023-10-14 08:59:53,686][100917] Updated weights for policy 1, policy_version 93212 (0.0007) +[2023-10-14 08:59:53,834][100936] Updated weights for policy 0, policy_version 93060 (0.0008) +[2023-10-14 08:59:54,204][100936] Updated weights for policy 0, policy_version 93070 (0.0008) +[2023-10-14 08:59:54,572][100936] Updated weights for policy 0, policy_version 93080 (0.0007) +[2023-10-14 08:59:58,016][100917] Updated weights for policy 1, policy_version 93222 (0.0010) +[2023-10-14 08:59:58,399][100917] Updated weights for policy 1, policy_version 93232 (0.0009) +[2023-10-14 08:59:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 190775296. Throughput: 0: 1662.8, 1: 1667.9. Samples: 47704890. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 08:59:58,512][99942] Avg episode reward: [(0, '0.880'), (1, '0.990')] +[2023-10-14 08:59:58,605][100936] Updated weights for policy 0, policy_version 93090 (0.0008) +[2023-10-14 08:59:58,766][100917] Updated weights for policy 1, policy_version 93242 (0.0008) +[2023-10-14 08:59:58,973][100936] Updated weights for policy 0, policy_version 93100 (0.0008) +[2023-10-14 08:59:59,348][100936] Updated weights for policy 0, policy_version 93110 (0.0007) +[2023-10-14 08:59:59,716][100936] Updated weights for policy 0, policy_version 93120 (0.0008) +[2023-10-14 09:00:02,783][100917] Updated weights for policy 1, policy_version 93252 (0.0008) +[2023-10-14 09:00:03,162][100917] Updated weights for policy 1, policy_version 93262 (0.0007) +[2023-10-14 09:00:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 190840832. Throughput: 0: 1663.6, 1: 1659.5. Samples: 47725154. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 09:00:03,513][99942] Avg episode reward: [(0, '0.880'), (1, '0.990')] +[2023-10-14 09:00:03,538][100917] Updated weights for policy 1, policy_version 93272 (0.0007) +[2023-10-14 09:00:03,774][100936] Updated weights for policy 0, policy_version 93130 (0.0008) +[2023-10-14 09:00:04,149][100936] Updated weights for policy 0, policy_version 93140 (0.0008) +[2023-10-14 09:00:04,522][100936] Updated weights for policy 0, policy_version 93150 (0.0007) +[2023-10-14 09:00:07,682][100917] Updated weights for policy 1, policy_version 93282 (0.0010) +[2023-10-14 09:00:08,065][100917] Updated weights for policy 1, policy_version 93292 (0.0008) +[2023-10-14 09:00:08,428][100917] Updated weights for policy 1, policy_version 93302 (0.0010) +[2023-10-14 09:00:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 190906368. Throughput: 0: 1669.9, 1: 1653.6. Samples: 47745442. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 09:00:08,512][99942] Avg episode reward: [(0, '0.880'), (1, '0.990')] +[2023-10-14 09:00:08,684][100936] Updated weights for policy 0, policy_version 93160 (0.0007) +[2023-10-14 09:00:08,807][100917] Updated weights for policy 1, policy_version 93312 (0.0008) +[2023-10-14 09:00:09,062][100936] Updated weights for policy 0, policy_version 93170 (0.0007) +[2023-10-14 09:00:09,428][100936] Updated weights for policy 0, policy_version 93180 (0.0007) +[2023-10-14 09:00:13,120][100917] Updated weights for policy 1, policy_version 93322 (0.0010) +[2023-10-14 09:00:13,485][100917] Updated weights for policy 1, policy_version 93332 (0.0009) +[2023-10-14 09:00:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 190971904. Throughput: 0: 1668.0, 1: 1653.4. Samples: 47754448. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 09:00:13,513][99942] Avg episode reward: [(0, '0.880'), (1, '0.990')] +[2023-10-14 09:00:13,615][100936] Updated weights for policy 0, policy_version 93190 (0.0010) +[2023-10-14 09:00:13,855][100917] Updated weights for policy 1, policy_version 93342 (0.0008) +[2023-10-14 09:00:13,979][100936] Updated weights for policy 0, policy_version 93200 (0.0008) +[2023-10-14 09:00:14,353][100936] Updated weights for policy 0, policy_version 93210 (0.0008) +[2023-10-14 09:00:17,977][100917] Updated weights for policy 1, policy_version 93352 (0.0009) +[2023-10-14 09:00:18,353][100917] Updated weights for policy 1, policy_version 93362 (0.0011) +[2023-10-14 09:00:18,453][100936] Updated weights for policy 0, policy_version 93220 (0.0008) +[2023-10-14 09:00:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 191037440. Throughput: 0: 1664.3, 1: 1650.3. Samples: 47774606. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 09:00:18,513][99942] Avg episode reward: [(0, '0.880'), (1, '0.990')] +[2023-10-14 09:00:18,721][100917] Updated weights for policy 1, policy_version 93372 (0.0008) +[2023-10-14 09:00:18,821][100936] Updated weights for policy 0, policy_version 93230 (0.0007) +[2023-10-14 09:00:19,192][100936] Updated weights for policy 0, policy_version 93240 (0.0008) +[2023-10-14 09:00:22,756][100917] Updated weights for policy 1, policy_version 93382 (0.0009) +[2023-10-14 09:00:23,139][100917] Updated weights for policy 1, policy_version 93392 (0.0009) +[2023-10-14 09:00:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 191102976. Throughput: 0: 1660.5, 1: 1645.0. Samples: 47794500. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 09:00:23,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 09:00:23,516][100917] Updated weights for policy 1, policy_version 93402 (0.0008) +[2023-10-14 09:00:23,554][100936] Updated weights for policy 0, policy_version 93250 (0.0008) +[2023-10-14 09:00:23,956][100936] Updated weights for policy 0, policy_version 93260 (0.0010) +[2023-10-14 09:00:24,318][100936] Updated weights for policy 0, policy_version 93270 (0.0011) +[2023-10-14 09:00:24,683][100936] Updated weights for policy 0, policy_version 93280 (0.0009) +[2023-10-14 09:00:27,408][100917] Updated weights for policy 1, policy_version 93412 (0.0008) +[2023-10-14 09:00:27,783][100917] Updated weights for policy 1, policy_version 93422 (0.0008) +[2023-10-14 09:00:28,160][100917] Updated weights for policy 1, policy_version 93432 (0.0007) +[2023-10-14 09:00:28,512][99942] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 191201280. Throughput: 0: 1657.5, 1: 1654.8. Samples: 47804028. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 09:00:28,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 09:00:28,794][100936] Updated weights for policy 0, policy_version 93290 (0.0009) +[2023-10-14 09:00:29,162][100936] Updated weights for policy 0, policy_version 93300 (0.0010) +[2023-10-14 09:00:29,530][100936] Updated weights for policy 0, policy_version 93310 (0.0009) +[2023-10-14 09:00:32,161][100917] Updated weights for policy 1, policy_version 93442 (0.0009) +[2023-10-14 09:00:32,543][100917] Updated weights for policy 1, policy_version 93452 (0.0010) +[2023-10-14 09:00:32,908][100917] Updated weights for policy 1, policy_version 93462 (0.0010) +[2023-10-14 09:00:33,280][100917] Updated weights for policy 1, policy_version 93472 (0.0008) +[2023-10-14 09:00:33,507][100936] Updated weights for policy 0, policy_version 93320 (0.0008) +[2023-10-14 09:00:33,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 191266816. Throughput: 0: 1659.9, 1: 1656.5. Samples: 47824724. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 09:00:33,513][99942] Avg episode reward: [(0, '0.900'), (1, '1.000')] +[2023-10-14 09:00:33,887][100936] Updated weights for policy 0, policy_version 93330 (0.0007) +[2023-10-14 09:00:34,253][100936] Updated weights for policy 0, policy_version 93340 (0.0008) +[2023-10-14 09:00:37,525][100917] Updated weights for policy 1, policy_version 93482 (0.0008) +[2023-10-14 09:00:37,898][100917] Updated weights for policy 1, policy_version 93492 (0.0008) +[2023-10-14 09:00:38,273][100917] Updated weights for policy 1, policy_version 93502 (0.0007) +[2023-10-14 09:00:38,406][100936] Updated weights for policy 0, policy_version 93350 (0.0008) +[2023-10-14 09:00:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 191332352. Throughput: 0: 1655.5, 1: 1642.1. Samples: 47844008. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) +[2023-10-14 09:00:38,513][99942] Avg episode reward: [(0, '0.900'), (1, '1.000')] +[2023-10-14 09:00:38,770][100936] Updated weights for policy 0, policy_version 93360 (0.0008) +[2023-10-14 09:00:39,143][100936] Updated weights for policy 0, policy_version 93370 (0.0010) +[2023-10-14 09:00:42,422][100917] Updated weights for policy 1, policy_version 93512 (0.0008) +[2023-10-14 09:00:42,794][100917] Updated weights for policy 1, policy_version 93522 (0.0011) +[2023-10-14 09:00:43,164][100917] Updated weights for policy 1, policy_version 93532 (0.0011) +[2023-10-14 09:00:43,389][100936] Updated weights for policy 0, policy_version 93380 (0.0011) +[2023-10-14 09:00:43,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 191397888. Throughput: 0: 1657.5, 1: 1659.5. Samples: 47854156. Policy #0 lag: (min: 3.0, avg: 9.9, max: 35.0) +[2023-10-14 09:00:43,512][99942] Avg episode reward: [(0, '0.900'), (1, '1.000')] +[2023-10-14 09:00:43,753][100936] Updated weights for policy 0, policy_version 93390 (0.0011) +[2023-10-14 09:00:44,122][100936] Updated weights for policy 0, policy_version 93400 (0.0008) +[2023-10-14 09:00:47,520][100917] Updated weights for policy 1, policy_version 93542 (0.0009) +[2023-10-14 09:00:47,904][100917] Updated weights for policy 1, policy_version 93552 (0.0010) +[2023-10-14 09:00:48,169][100936] Updated weights for policy 0, policy_version 93410 (0.0007) +[2023-10-14 09:00:48,278][100917] Updated weights for policy 1, policy_version 93562 (0.0008) +[2023-10-14 09:00:48,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 191463424. Throughput: 0: 1655.0, 1: 1664.6. Samples: 47874536. Policy #0 lag: (min: 3.0, avg: 9.9, max: 35.0) +[2023-10-14 09:00:48,513][99942] Avg episode reward: [(0, '0.910'), (1, '1.000')] +[2023-10-14 09:00:48,537][100936] Updated weights for policy 0, policy_version 93420 (0.0009) +[2023-10-14 09:00:48,903][100936] Updated weights for policy 0, policy_version 93430 (0.0010) +[2023-10-14 09:00:49,277][100936] Updated weights for policy 0, policy_version 93440 (0.0008) +[2023-10-14 09:00:52,204][100917] Updated weights for policy 1, policy_version 93572 (0.0008) +[2023-10-14 09:00:52,581][100917] Updated weights for policy 1, policy_version 93582 (0.0007) +[2023-10-14 09:00:52,944][100917] Updated weights for policy 1, policy_version 93592 (0.0009) +[2023-10-14 09:00:53,367][100936] Updated weights for policy 0, policy_version 93450 (0.0009) +[2023-10-14 09:00:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 191528960. Throughput: 0: 1646.7, 1: 1646.7. Samples: 47893646. Policy #0 lag: (min: 3.0, avg: 9.9, max: 35.0) +[2023-10-14 09:00:53,513][99942] Avg episode reward: [(0, '0.910'), (1, '1.000')] +[2023-10-14 09:00:53,732][100936] Updated weights for policy 0, policy_version 93460 (0.0008) +[2023-10-14 09:00:54,100][100936] Updated weights for policy 0, policy_version 93470 (0.0008) +[2023-10-14 09:00:57,123][100917] Updated weights for policy 1, policy_version 93602 (0.0011) +[2023-10-14 09:00:57,498][100917] Updated weights for policy 1, policy_version 93612 (0.0011) +[2023-10-14 09:00:57,875][100917] Updated weights for policy 1, policy_version 93622 (0.0009) +[2023-10-14 09:00:58,124][100936] Updated weights for policy 0, policy_version 93480 (0.0007) +[2023-10-14 09:00:58,241][100917] Updated weights for policy 1, policy_version 93632 (0.0009) +[2023-10-14 09:00:58,490][100936] Updated weights for policy 0, policy_version 93490 (0.0009) +[2023-10-14 09:00:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 191594496. Throughput: 0: 1657.1, 1: 1660.2. Samples: 47903726. Policy #0 lag: (min: 3.0, avg: 9.9, max: 35.0) +[2023-10-14 09:00:58,512][99942] Avg episode reward: [(0, '0.910'), (1, '1.000')] +[2023-10-14 09:00:58,865][100936] Updated weights for policy 0, policy_version 93500 (0.0010) +[2023-10-14 09:01:02,279][100917] Updated weights for policy 1, policy_version 93642 (0.0009) +[2023-10-14 09:01:02,660][100917] Updated weights for policy 1, policy_version 93652 (0.0008) +[2023-10-14 09:01:03,022][100917] Updated weights for policy 1, policy_version 93662 (0.0007) +[2023-10-14 09:01:03,039][100936] Updated weights for policy 0, policy_version 93510 (0.0008) +[2023-10-14 09:01:03,422][100936] Updated weights for policy 0, policy_version 93520 (0.0010) +[2023-10-14 09:01:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 191660032. Throughput: 0: 1656.2, 1: 1662.3. Samples: 47923940. Policy #0 lag: (min: 3.0, avg: 9.9, max: 35.0) +[2023-10-14 09:01:03,512][99942] Avg episode reward: [(0, '0.910'), (1, '1.000')] +[2023-10-14 09:01:03,793][100936] Updated weights for policy 0, policy_version 93530 (0.0008) +[2023-10-14 09:01:07,206][100917] Updated weights for policy 1, policy_version 93672 (0.0008) +[2023-10-14 09:01:07,570][100917] Updated weights for policy 1, policy_version 93682 (0.0008) +[2023-10-14 09:01:07,933][100917] Updated weights for policy 1, policy_version 93692 (0.0008) +[2023-10-14 09:01:07,980][100936] Updated weights for policy 0, policy_version 93540 (0.0008) +[2023-10-14 09:01:08,371][100936] Updated weights for policy 0, policy_version 93550 (0.0007) +[2023-10-14 09:01:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 191725568. Throughput: 0: 1644.7, 1: 1652.8. Samples: 47942886. Policy #0 lag: (min: 3.0, avg: 9.9, max: 35.0) +[2023-10-14 09:01:08,513][99942] Avg episode reward: [(0, '0.910'), (1, '1.000')] +[2023-10-14 09:01:08,743][100936] Updated weights for policy 0, policy_version 93560 (0.0007) +[2023-10-14 09:01:12,124][100917] Updated weights for policy 1, policy_version 93702 (0.0008) +[2023-10-14 09:01:12,503][100917] Updated weights for policy 1, policy_version 93712 (0.0008) +[2023-10-14 09:01:12,876][100917] Updated weights for policy 1, policy_version 93722 (0.0007) +[2023-10-14 09:01:12,879][100936] Updated weights for policy 0, policy_version 93570 (0.0007) +[2023-10-14 09:01:13,244][100936] Updated weights for policy 0, policy_version 93580 (0.0007) +[2023-10-14 09:01:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13218.3). Total num frames: 191791104. Throughput: 0: 1663.5, 1: 1658.3. Samples: 47953508. Policy #0 lag: (min: 3.0, avg: 9.9, max: 35.0) +[2023-10-14 09:01:13,513][99942] Avg episode reward: [(0, '0.910'), (1, '1.000')] +[2023-10-14 09:01:13,610][100936] Updated weights for policy 0, policy_version 93590 (0.0008) +[2023-10-14 09:01:13,970][100936] Updated weights for policy 0, policy_version 93600 (0.0010) +[2023-10-14 09:01:16,929][100917] Updated weights for policy 1, policy_version 93732 (0.0009) +[2023-10-14 09:01:17,300][100917] Updated weights for policy 1, policy_version 93742 (0.0009) +[2023-10-14 09:01:17,676][100917] Updated weights for policy 1, policy_version 93752 (0.0010) +[2023-10-14 09:01:18,238][100936] Updated weights for policy 0, policy_version 93610 (0.0009) +[2023-10-14 09:01:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 191856640. Throughput: 0: 1658.4, 1: 1653.5. Samples: 47973760. Policy #0 lag: (min: 3.0, avg: 9.9, max: 35.0) +[2023-10-14 09:01:18,513][99942] Avg episode reward: [(0, '0.910'), (1, '1.000')] +[2023-10-14 09:01:18,604][100936] Updated weights for policy 0, policy_version 93620 (0.0010) +[2023-10-14 09:01:18,980][100936] Updated weights for policy 0, policy_version 93630 (0.0008) +[2023-10-14 09:01:21,770][100917] Updated weights for policy 1, policy_version 93762 (0.0010) +[2023-10-14 09:01:22,148][100917] Updated weights for policy 1, policy_version 93772 (0.0008) +[2023-10-14 09:01:22,509][100917] Updated weights for policy 1, policy_version 93782 (0.0011) +[2023-10-14 09:01:22,885][100917] Updated weights for policy 1, policy_version 93792 (0.0007) +[2023-10-14 09:01:23,263][100936] Updated weights for policy 0, policy_version 93640 (0.0008) +[2023-10-14 09:01:23,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 191922176. Throughput: 0: 1651.5, 1: 1655.7. Samples: 47992830. Policy #0 lag: (min: 3.0, avg: 9.9, max: 35.0) +[2023-10-14 09:01:23,514][99942] Avg episode reward: [(0, '0.910'), (1, '1.000')] +[2023-10-14 09:01:23,629][100936] Updated weights for policy 0, policy_version 93650 (0.0009) +[2023-10-14 09:01:24,003][100936] Updated weights for policy 0, policy_version 93660 (0.0008) +[2023-10-14 09:01:26,942][100917] Updated weights for policy 1, policy_version 93802 (0.0009) +[2023-10-14 09:01:27,315][100917] Updated weights for policy 1, policy_version 93812 (0.0008) +[2023-10-14 09:01:27,688][100917] Updated weights for policy 1, policy_version 93822 (0.0008) +[2023-10-14 09:01:28,198][100936] Updated weights for policy 0, policy_version 93670 (0.0008) +[2023-10-14 09:01:28,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 191987712. Throughput: 0: 1656.7, 1: 1661.9. Samples: 48003492. Policy #0 lag: (min: 3.0, avg: 9.9, max: 35.0) +[2023-10-14 09:01:28,512][99942] Avg episode reward: [(0, '0.910'), (1, '1.000')] +[2023-10-14 09:01:28,575][100936] Updated weights for policy 0, policy_version 93680 (0.0010) +[2023-10-14 09:01:28,941][100936] Updated weights for policy 0, policy_version 93690 (0.0009) +[2023-10-14 09:01:31,820][100917] Updated weights for policy 1, policy_version 93832 (0.0009) +[2023-10-14 09:01:32,193][100917] Updated weights for policy 1, policy_version 93842 (0.0007) +[2023-10-14 09:01:32,570][100917] Updated weights for policy 1, policy_version 93852 (0.0008) +[2023-10-14 09:01:33,008][100936] Updated weights for policy 0, policy_version 93700 (0.0009) +[2023-10-14 09:01:33,384][100936] Updated weights for policy 0, policy_version 93710 (0.0009) +[2023-10-14 09:01:33,512][99942] Fps is (10 sec: 13107.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 192053248. Throughput: 0: 1658.4, 1: 1648.6. Samples: 48023350. Policy #0 lag: (min: 3.0, avg: 9.9, max: 35.0) +[2023-10-14 09:01:33,512][99942] Avg episode reward: [(0, '0.910'), (1, '1.000')] +[2023-10-14 09:01:33,763][100936] Updated weights for policy 0, policy_version 93720 (0.0008) +[2023-10-14 09:01:36,627][100917] Updated weights for policy 1, policy_version 93862 (0.0009) +[2023-10-14 09:01:36,998][100917] Updated weights for policy 1, policy_version 93872 (0.0007) +[2023-10-14 09:01:37,370][100917] Updated weights for policy 1, policy_version 93882 (0.0008) +[2023-10-14 09:01:37,898][100936] Updated weights for policy 0, policy_version 93730 (0.0009) +[2023-10-14 09:01:38,260][100936] Updated weights for policy 0, policy_version 93740 (0.0007) +[2023-10-14 09:01:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 192118784. Throughput: 0: 1650.5, 1: 1653.7. Samples: 48042338. Policy #0 lag: (min: 3.0, avg: 9.9, max: 35.0) +[2023-10-14 09:01:38,512][99942] Avg episode reward: [(0, '0.910'), (1, '1.000')] +[2023-10-14 09:01:38,520][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000093888_96141312.pth... +[2023-10-14 09:01:38,560][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000092320_94535680.pth +[2023-10-14 09:01:38,632][100936] Updated weights for policy 0, policy_version 93750 (0.0009) +[2023-10-14 09:01:39,004][100936] Updated weights for policy 0, policy_version 93760 (0.0008) +[2023-10-14 09:01:39,004][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000093760_96010240.pth... +[2023-10-14 09:01:39,033][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000092192_94404608.pth +[2023-10-14 09:01:41,625][100917] Updated weights for policy 1, policy_version 93892 (0.0009) +[2023-10-14 09:01:41,991][100917] Updated weights for policy 1, policy_version 93902 (0.0008) +[2023-10-14 09:01:42,364][100917] Updated weights for policy 1, policy_version 93912 (0.0011) +[2023-10-14 09:01:43,116][100936] Updated weights for policy 0, policy_version 93770 (0.0009) +[2023-10-14 09:01:43,485][100936] Updated weights for policy 0, policy_version 93780 (0.0008) +[2023-10-14 09:01:43,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 192184320. Throughput: 0: 1654.6, 1: 1666.2. Samples: 48053162. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:01:43,513][99942] Avg episode reward: [(0, '0.910'), (1, '1.000')] +[2023-10-14 09:01:43,858][100936] Updated weights for policy 0, policy_version 93790 (0.0009) +[2023-10-14 09:01:46,504][100917] Updated weights for policy 1, policy_version 93922 (0.0011) +[2023-10-14 09:01:46,876][100917] Updated weights for policy 1, policy_version 93932 (0.0009) +[2023-10-14 09:01:47,253][100917] Updated weights for policy 1, policy_version 93942 (0.0007) +[2023-10-14 09:01:47,620][100917] Updated weights for policy 1, policy_version 93952 (0.0007) +[2023-10-14 09:01:47,945][100936] Updated weights for policy 0, policy_version 93800 (0.0007) +[2023-10-14 09:01:48,327][100936] Updated weights for policy 0, policy_version 93810 (0.0010) +[2023-10-14 09:01:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 192249856. Throughput: 0: 1655.6, 1: 1659.0. Samples: 48073096. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:01:48,513][99942] Avg episode reward: [(0, '0.910'), (1, '1.000')] +[2023-10-14 09:01:48,691][100936] Updated weights for policy 0, policy_version 93820 (0.0008) +[2023-10-14 09:01:51,771][100917] Updated weights for policy 1, policy_version 93962 (0.0008) +[2023-10-14 09:01:52,148][100917] Updated weights for policy 1, policy_version 93972 (0.0009) +[2023-10-14 09:01:52,517][100917] Updated weights for policy 1, policy_version 93982 (0.0008) +[2023-10-14 09:01:52,734][100936] Updated weights for policy 0, policy_version 93830 (0.0007) +[2023-10-14 09:01:53,096][100936] Updated weights for policy 0, policy_version 93840 (0.0008) +[2023-10-14 09:01:53,472][100936] Updated weights for policy 0, policy_version 93850 (0.0009) +[2023-10-14 09:01:53,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 192315392. Throughput: 0: 1647.8, 1: 1662.1. Samples: 48091830. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:01:53,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 09:01:56,563][100917] Updated weights for policy 1, policy_version 93992 (0.0010) +[2023-10-14 09:01:56,926][100917] Updated weights for policy 1, policy_version 94002 (0.0010) +[2023-10-14 09:01:57,307][100917] Updated weights for policy 1, policy_version 94012 (0.0009) +[2023-10-14 09:01:57,767][100936] Updated weights for policy 0, policy_version 93860 (0.0007) +[2023-10-14 09:01:58,152][100936] Updated weights for policy 0, policy_version 93870 (0.0007) +[2023-10-14 09:01:58,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 192380928. Throughput: 0: 1648.5, 1: 1672.6. Samples: 48102958. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:01:58,512][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 09:01:58,524][100936] Updated weights for policy 0, policy_version 93880 (0.0009) +[2023-10-14 09:02:01,273][100917] Updated weights for policy 1, policy_version 94022 (0.0010) +[2023-10-14 09:02:01,647][100917] Updated weights for policy 1, policy_version 94032 (0.0008) +[2023-10-14 09:02:02,019][100917] Updated weights for policy 1, policy_version 94042 (0.0008) +[2023-10-14 09:02:02,755][100936] Updated weights for policy 0, policy_version 93890 (0.0010) +[2023-10-14 09:02:03,131][100936] Updated weights for policy 0, policy_version 93900 (0.0009) +[2023-10-14 09:02:03,498][100936] Updated weights for policy 0, policy_version 93910 (0.0010) +[2023-10-14 09:02:03,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 192446464. Throughput: 0: 1647.6, 1: 1655.3. Samples: 48122390. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:02:03,513][99942] Avg episode reward: [(0, '0.840'), (1, '1.000')] +[2023-10-14 09:02:03,867][100936] Updated weights for policy 0, policy_version 93920 (0.0008) +[2023-10-14 09:02:06,008][100917] Updated weights for policy 1, policy_version 94052 (0.0008) +[2023-10-14 09:02:06,375][100917] Updated weights for policy 1, policy_version 94062 (0.0007) +[2023-10-14 09:02:06,742][100917] Updated weights for policy 1, policy_version 94072 (0.0008) +[2023-10-14 09:02:07,847][100936] Updated weights for policy 0, policy_version 93930 (0.0007) +[2023-10-14 09:02:08,215][100936] Updated weights for policy 0, policy_version 93940 (0.0009) +[2023-10-14 09:02:08,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 192512000. Throughput: 0: 1635.5, 1: 1671.0. Samples: 48141620. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:02:08,513][99942] Avg episode reward: [(0, '0.800'), (1, '1.000')] +[2023-10-14 09:02:08,585][100936] Updated weights for policy 0, policy_version 93950 (0.0009) +[2023-10-14 09:02:10,702][100917] Updated weights for policy 1, policy_version 94082 (0.0009) +[2023-10-14 09:02:11,079][100917] Updated weights for policy 1, policy_version 94092 (0.0007) +[2023-10-14 09:02:11,453][100917] Updated weights for policy 1, policy_version 94102 (0.0008) +[2023-10-14 09:02:11,832][100917] Updated weights for policy 1, policy_version 94112 (0.0007) +[2023-10-14 09:02:12,716][100936] Updated weights for policy 0, policy_version 93960 (0.0009) +[2023-10-14 09:02:13,100][100936] Updated weights for policy 0, policy_version 93970 (0.0008) +[2023-10-14 09:02:13,470][100936] Updated weights for policy 0, policy_version 93980 (0.0009) +[2023-10-14 09:02:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 192577536. Throughput: 0: 1646.4, 1: 1661.8. Samples: 48152360. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:02:13,512][99942] Avg episode reward: [(0, '0.800'), (1, '1.000')] +[2023-10-14 09:02:15,982][100917] Updated weights for policy 1, policy_version 94122 (0.0007) +[2023-10-14 09:02:16,359][100917] Updated weights for policy 1, policy_version 94132 (0.0007) +[2023-10-14 09:02:16,730][100917] Updated weights for policy 1, policy_version 94142 (0.0009) +[2023-10-14 09:02:17,541][100936] Updated weights for policy 0, policy_version 93990 (0.0008) +[2023-10-14 09:02:17,915][100936] Updated weights for policy 0, policy_version 94000 (0.0008) +[2023-10-14 09:02:18,277][100936] Updated weights for policy 0, policy_version 94010 (0.0007) +[2023-10-14 09:02:18,512][99942] Fps is (10 sec: 16384.4, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 192675840. Throughput: 0: 1646.0, 1: 1655.5. Samples: 48171916. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:02:18,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 09:02:21,017][100917] Updated weights for policy 1, policy_version 94152 (0.0007) +[2023-10-14 09:02:21,395][100917] Updated weights for policy 1, policy_version 94162 (0.0008) +[2023-10-14 09:02:21,763][100917] Updated weights for policy 1, policy_version 94172 (0.0008) +[2023-10-14 09:02:22,282][100936] Updated weights for policy 0, policy_version 94020 (0.0008) +[2023-10-14 09:02:22,648][100936] Updated weights for policy 0, policy_version 94030 (0.0009) +[2023-10-14 09:02:23,013][100936] Updated weights for policy 0, policy_version 94040 (0.0009) +[2023-10-14 09:02:23,512][99942] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 192741376. Throughput: 0: 1641.5, 1: 1673.1. Samples: 48191494. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:02:23,512][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 09:02:25,738][100917] Updated weights for policy 1, policy_version 94182 (0.0008) +[2023-10-14 09:02:26,108][100917] Updated weights for policy 1, policy_version 94192 (0.0009) +[2023-10-14 09:02:26,482][100917] Updated weights for policy 1, policy_version 94202 (0.0011) +[2023-10-14 09:02:27,314][100936] Updated weights for policy 0, policy_version 94050 (0.0010) +[2023-10-14 09:02:27,687][100936] Updated weights for policy 0, policy_version 94060 (0.0009) +[2023-10-14 09:02:28,059][100936] Updated weights for policy 0, policy_version 94070 (0.0009) +[2023-10-14 09:02:28,424][100936] Updated weights for policy 0, policy_version 94080 (0.0009) +[2023-10-14 09:02:28,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 192806912. Throughput: 0: 1654.9, 1: 1665.2. Samples: 48202566. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:02:28,513][99942] Avg episode reward: [(0, '0.680'), (1, '1.000')] +[2023-10-14 09:02:30,745][100917] Updated weights for policy 1, policy_version 94212 (0.0011) +[2023-10-14 09:02:31,117][100917] Updated weights for policy 1, policy_version 94222 (0.0008) +[2023-10-14 09:02:31,488][100917] Updated weights for policy 1, policy_version 94232 (0.0007) +[2023-10-14 09:02:32,574][100936] Updated weights for policy 0, policy_version 94090 (0.0008) +[2023-10-14 09:02:32,951][100936] Updated weights for policy 0, policy_version 94100 (0.0009) +[2023-10-14 09:02:33,326][100936] Updated weights for policy 0, policy_version 94110 (0.0009) +[2023-10-14 09:02:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 192872448. Throughput: 0: 1648.5, 1: 1650.4. Samples: 48221546. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:02:33,512][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:02:35,792][100917] Updated weights for policy 1, policy_version 94242 (0.0007) +[2023-10-14 09:02:36,165][100917] Updated weights for policy 1, policy_version 94252 (0.0008) +[2023-10-14 09:02:36,529][100917] Updated weights for policy 1, policy_version 94262 (0.0009) +[2023-10-14 09:02:36,905][100917] Updated weights for policy 1, policy_version 94272 (0.0010) +[2023-10-14 09:02:37,414][100936] Updated weights for policy 0, policy_version 94120 (0.0009) +[2023-10-14 09:02:37,779][100936] Updated weights for policy 0, policy_version 94130 (0.0007) +[2023-10-14 09:02:38,156][100936] Updated weights for policy 0, policy_version 94140 (0.0007) +[2023-10-14 09:02:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.3). Total num frames: 192937984. Throughput: 0: 1643.9, 1: 1667.8. Samples: 48240856. Policy #0 lag: (min: 26.0, avg: 26.1, max: 34.0) +[2023-10-14 09:02:38,513][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:02:41,010][100917] Updated weights for policy 1, policy_version 94282 (0.0010) +[2023-10-14 09:02:41,380][100917] Updated weights for policy 1, policy_version 94292 (0.0009) +[2023-10-14 09:02:41,751][100917] Updated weights for policy 1, policy_version 94302 (0.0009) +[2023-10-14 09:02:42,225][100936] Updated weights for policy 0, policy_version 94150 (0.0007) +[2023-10-14 09:02:42,607][100936] Updated weights for policy 0, policy_version 94160 (0.0008) +[2023-10-14 09:02:42,975][100936] Updated weights for policy 0, policy_version 94170 (0.0007) +[2023-10-14 09:02:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 193003520. Throughput: 0: 1660.2, 1: 1654.1. Samples: 48252102. Policy #0 lag: (min: 26.0, avg: 26.1, max: 34.0) +[2023-10-14 09:02:43,512][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:02:46,013][100917] Updated weights for policy 1, policy_version 94312 (0.0008) +[2023-10-14 09:02:46,382][100917] Updated weights for policy 1, policy_version 94322 (0.0011) +[2023-10-14 09:02:46,751][100917] Updated weights for policy 1, policy_version 94332 (0.0009) +[2023-10-14 09:02:47,221][100936] Updated weights for policy 0, policy_version 94180 (0.0009) +[2023-10-14 09:02:47,586][100936] Updated weights for policy 0, policy_version 94190 (0.0009) +[2023-10-14 09:02:47,956][100936] Updated weights for policy 0, policy_version 94200 (0.0009) +[2023-10-14 09:02:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 193069056. Throughput: 0: 1651.8, 1: 1651.6. Samples: 48271046. Policy #0 lag: (min: 26.0, avg: 26.1, max: 34.0) +[2023-10-14 09:02:48,512][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:02:50,895][100917] Updated weights for policy 1, policy_version 94342 (0.0007) +[2023-10-14 09:02:51,264][100917] Updated weights for policy 1, policy_version 94352 (0.0009) +[2023-10-14 09:02:51,628][100917] Updated weights for policy 1, policy_version 94362 (0.0008) +[2023-10-14 09:02:52,047][100936] Updated weights for policy 0, policy_version 94210 (0.0008) +[2023-10-14 09:02:52,406][100936] Updated weights for policy 0, policy_version 94220 (0.0008) +[2023-10-14 09:02:52,786][100936] Updated weights for policy 0, policy_version 94230 (0.0010) +[2023-10-14 09:02:53,158][100936] Updated weights for policy 0, policy_version 94240 (0.0010) +[2023-10-14 09:02:53,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 193134592. Throughput: 0: 1656.5, 1: 1655.9. Samples: 48290680. Policy #0 lag: (min: 26.0, avg: 26.1, max: 34.0) +[2023-10-14 09:02:53,513][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:02:55,784][100917] Updated weights for policy 1, policy_version 94372 (0.0008) +[2023-10-14 09:02:56,166][100917] Updated weights for policy 1, policy_version 94382 (0.0008) +[2023-10-14 09:02:56,536][100917] Updated weights for policy 1, policy_version 94392 (0.0008) +[2023-10-14 09:02:57,375][100936] Updated weights for policy 0, policy_version 94250 (0.0009) +[2023-10-14 09:02:57,743][100936] Updated weights for policy 0, policy_version 94260 (0.0008) +[2023-10-14 09:02:58,126][100936] Updated weights for policy 0, policy_version 94270 (0.0008) +[2023-10-14 09:02:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 193200128. Throughput: 0: 1663.4, 1: 1657.3. Samples: 48301792. Policy #0 lag: (min: 26.0, avg: 26.1, max: 34.0) +[2023-10-14 09:02:58,513][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:03:00,671][100917] Updated weights for policy 1, policy_version 94402 (0.0009) +[2023-10-14 09:03:01,043][100917] Updated weights for policy 1, policy_version 94412 (0.0009) +[2023-10-14 09:03:01,412][100917] Updated weights for policy 1, policy_version 94422 (0.0008) +[2023-10-14 09:03:01,785][100917] Updated weights for policy 1, policy_version 94432 (0.0008) +[2023-10-14 09:03:02,287][100936] Updated weights for policy 0, policy_version 94280 (0.0009) +[2023-10-14 09:03:02,654][100936] Updated weights for policy 0, policy_version 94290 (0.0007) +[2023-10-14 09:03:03,022][100936] Updated weights for policy 0, policy_version 94300 (0.0008) +[2023-10-14 09:03:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 193265664. Throughput: 0: 1653.2, 1: 1658.3. Samples: 48320934. Policy #0 lag: (min: 26.0, avg: 26.1, max: 34.0) +[2023-10-14 09:03:03,513][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:03:05,936][100917] Updated weights for policy 1, policy_version 94442 (0.0007) +[2023-10-14 09:03:06,314][100917] Updated weights for policy 1, policy_version 94452 (0.0010) +[2023-10-14 09:03:06,686][100917] Updated weights for policy 1, policy_version 94462 (0.0007) +[2023-10-14 09:03:07,199][100936] Updated weights for policy 0, policy_version 94310 (0.0009) +[2023-10-14 09:03:07,563][100936] Updated weights for policy 0, policy_version 94320 (0.0008) +[2023-10-14 09:03:07,940][100936] Updated weights for policy 0, policy_version 94330 (0.0007) +[2023-10-14 09:03:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13329.3). Total num frames: 193331200. Throughput: 0: 1652.2, 1: 1661.4. Samples: 48340606. Policy #0 lag: (min: 26.0, avg: 26.1, max: 34.0) +[2023-10-14 09:03:08,513][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:03:10,703][100917] Updated weights for policy 1, policy_version 94472 (0.0009) +[2023-10-14 09:03:11,080][100917] Updated weights for policy 1, policy_version 94482 (0.0007) +[2023-10-14 09:03:11,441][100917] Updated weights for policy 1, policy_version 94492 (0.0009) +[2023-10-14 09:03:12,111][100936] Updated weights for policy 0, policy_version 94340 (0.0008) +[2023-10-14 09:03:12,475][100936] Updated weights for policy 0, policy_version 94350 (0.0007) +[2023-10-14 09:03:12,837][100936] Updated weights for policy 0, policy_version 94360 (0.0008) +[2023-10-14 09:03:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 193396736. Throughput: 0: 1653.8, 1: 1655.2. Samples: 48351470. Policy #0 lag: (min: 26.0, avg: 26.1, max: 34.0) +[2023-10-14 09:03:13,513][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:03:15,369][100917] Updated weights for policy 1, policy_version 94502 (0.0008) +[2023-10-14 09:03:15,752][100917] Updated weights for policy 1, policy_version 94512 (0.0007) +[2023-10-14 09:03:16,119][100917] Updated weights for policy 1, policy_version 94522 (0.0008) +[2023-10-14 09:03:16,976][100936] Updated weights for policy 0, policy_version 94370 (0.0009) +[2023-10-14 09:03:17,341][100936] Updated weights for policy 0, policy_version 94380 (0.0008) +[2023-10-14 09:03:17,723][100936] Updated weights for policy 0, policy_version 94390 (0.0008) +[2023-10-14 09:03:18,084][100936] Updated weights for policy 0, policy_version 94400 (0.0007) +[2023-10-14 09:03:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 193462272. Throughput: 0: 1649.6, 1: 1665.5. Samples: 48370724. Policy #0 lag: (min: 26.0, avg: 26.1, max: 34.0) +[2023-10-14 09:03:18,512][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:03:20,017][100917] Updated weights for policy 1, policy_version 94532 (0.0009) +[2023-10-14 09:03:20,398][100917] Updated weights for policy 1, policy_version 94542 (0.0007) +[2023-10-14 09:03:20,761][100917] Updated weights for policy 1, policy_version 94552 (0.0007) +[2023-10-14 09:03:22,053][100936] Updated weights for policy 0, policy_version 94410 (0.0007) +[2023-10-14 09:03:22,423][100936] Updated weights for policy 0, policy_version 94420 (0.0010) +[2023-10-14 09:03:22,798][100936] Updated weights for policy 0, policy_version 94430 (0.0010) +[2023-10-14 09:03:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 193527808. Throughput: 0: 1660.2, 1: 1665.8. Samples: 48390526. Policy #0 lag: (min: 26.0, avg: 26.1, max: 34.0) +[2023-10-14 09:03:23,513][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:03:24,799][100917] Updated weights for policy 1, policy_version 94562 (0.0009) +[2023-10-14 09:03:25,168][100917] Updated weights for policy 1, policy_version 94572 (0.0009) +[2023-10-14 09:03:25,536][100917] Updated weights for policy 1, policy_version 94582 (0.0009) +[2023-10-14 09:03:25,912][100917] Updated weights for policy 1, policy_version 94592 (0.0010) +[2023-10-14 09:03:27,168][100936] Updated weights for policy 0, policy_version 94440 (0.0010) +[2023-10-14 09:03:27,528][100936] Updated weights for policy 0, policy_version 94450 (0.0009) +[2023-10-14 09:03:27,890][100936] Updated weights for policy 0, policy_version 94460 (0.0010) +[2023-10-14 09:03:28,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 193593344. Throughput: 0: 1658.4, 1: 1645.7. Samples: 48400788. Policy #0 lag: (min: 26.0, avg: 26.1, max: 34.0) +[2023-10-14 09:03:28,513][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:03:30,127][100917] Updated weights for policy 1, policy_version 94602 (0.0010) +[2023-10-14 09:03:30,497][100917] Updated weights for policy 1, policy_version 94612 (0.0009) +[2023-10-14 09:03:30,883][100917] Updated weights for policy 1, policy_version 94622 (0.0008) +[2023-10-14 09:03:32,044][100936] Updated weights for policy 0, policy_version 94470 (0.0010) +[2023-10-14 09:03:32,406][100936] Updated weights for policy 0, policy_version 94480 (0.0007) +[2023-10-14 09:03:32,782][100936] Updated weights for policy 0, policy_version 94490 (0.0007) +[2023-10-14 09:03:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 193658880. Throughput: 0: 1652.9, 1: 1666.2. Samples: 48420406. Policy #0 lag: (min: 26.0, avg: 26.1, max: 34.0) +[2023-10-14 09:03:33,513][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:03:34,999][100917] Updated weights for policy 1, policy_version 94632 (0.0010) +[2023-10-14 09:03:35,376][100917] Updated weights for policy 1, policy_version 94642 (0.0010) +[2023-10-14 09:03:35,755][100917] Updated weights for policy 1, policy_version 94652 (0.0009) +[2023-10-14 09:03:36,834][100936] Updated weights for policy 0, policy_version 94500 (0.0008) +[2023-10-14 09:03:37,200][100936] Updated weights for policy 0, policy_version 94510 (0.0009) +[2023-10-14 09:03:37,567][100936] Updated weights for policy 0, policy_version 94520 (0.0009) +[2023-10-14 09:03:38,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 193724416. Throughput: 0: 1651.7, 1: 1670.4. Samples: 48440176. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 09:03:38,513][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:03:38,525][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000094656_96927744.pth... +[2023-10-14 09:03:38,525][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000094528_96796672.pth... +[2023-10-14 09:03:38,561][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000092960_95191040.pth +[2023-10-14 09:03:38,565][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000093120_95354880.pth +[2023-10-14 09:03:39,985][100917] Updated weights for policy 1, policy_version 94662 (0.0009) +[2023-10-14 09:03:40,362][100917] Updated weights for policy 1, policy_version 94672 (0.0009) +[2023-10-14 09:03:40,727][100917] Updated weights for policy 1, policy_version 94682 (0.0009) +[2023-10-14 09:03:41,583][100936] Updated weights for policy 0, policy_version 94530 (0.0009) +[2023-10-14 09:03:41,954][100936] Updated weights for policy 0, policy_version 94540 (0.0007) +[2023-10-14 09:03:42,330][100936] Updated weights for policy 0, policy_version 94550 (0.0008) +[2023-10-14 09:03:42,707][100936] Updated weights for policy 0, policy_version 94560 (0.0009) +[2023-10-14 09:03:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 193789952. Throughput: 0: 1657.4, 1: 1649.1. Samples: 48450584. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 09:03:43,512][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:03:44,726][100917] Updated weights for policy 1, policy_version 94692 (0.0009) +[2023-10-14 09:03:45,089][100917] Updated weights for policy 1, policy_version 94702 (0.0009) +[2023-10-14 09:03:45,463][100917] Updated weights for policy 1, policy_version 94712 (0.0007) +[2023-10-14 09:03:46,987][100936] Updated weights for policy 0, policy_version 94570 (0.0009) +[2023-10-14 09:03:47,355][100936] Updated weights for policy 0, policy_version 94580 (0.0007) +[2023-10-14 09:03:47,731][100936] Updated weights for policy 0, policy_version 94590 (0.0009) +[2023-10-14 09:03:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 193855488. Throughput: 0: 1648.9, 1: 1667.8. Samples: 48470182. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 09:03:48,512][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:03:49,507][100917] Updated weights for policy 1, policy_version 94722 (0.0007) +[2023-10-14 09:03:49,872][100917] Updated weights for policy 1, policy_version 94732 (0.0009) +[2023-10-14 09:03:50,244][100917] Updated weights for policy 1, policy_version 94742 (0.0011) +[2023-10-14 09:03:50,618][100917] Updated weights for policy 1, policy_version 94752 (0.0007) +[2023-10-14 09:03:51,766][100936] Updated weights for policy 0, policy_version 94600 (0.0008) +[2023-10-14 09:03:52,132][100936] Updated weights for policy 0, policy_version 94610 (0.0007) +[2023-10-14 09:03:52,494][100936] Updated weights for policy 0, policy_version 94620 (0.0008) +[2023-10-14 09:03:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 193921024. Throughput: 0: 1659.1, 1: 1667.3. Samples: 48490294. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 09:03:53,512][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:03:54,730][100917] Updated weights for policy 1, policy_version 94762 (0.0010) +[2023-10-14 09:03:55,104][100917] Updated weights for policy 1, policy_version 94772 (0.0010) +[2023-10-14 09:03:55,470][100917] Updated weights for policy 1, policy_version 94782 (0.0011) +[2023-10-14 09:03:56,510][100936] Updated weights for policy 0, policy_version 94630 (0.0009) +[2023-10-14 09:03:56,878][100936] Updated weights for policy 0, policy_version 94640 (0.0007) +[2023-10-14 09:03:57,250][100936] Updated weights for policy 0, policy_version 94650 (0.0007) +[2023-10-14 09:03:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 193986560. Throughput: 0: 1660.1, 1: 1648.3. Samples: 48500346. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 09:03:58,513][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:03:59,657][100917] Updated weights for policy 1, policy_version 94792 (0.0011) +[2023-10-14 09:04:00,029][100917] Updated weights for policy 1, policy_version 94802 (0.0008) +[2023-10-14 09:04:00,399][100917] Updated weights for policy 1, policy_version 94812 (0.0010) +[2023-10-14 09:04:01,496][100936] Updated weights for policy 0, policy_version 94660 (0.0010) +[2023-10-14 09:04:01,853][100936] Updated weights for policy 0, policy_version 94670 (0.0009) +[2023-10-14 09:04:02,219][100936] Updated weights for policy 0, policy_version 94680 (0.0009) +[2023-10-14 09:04:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 194052096. Throughput: 0: 1649.7, 1: 1659.7. Samples: 48519648. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 09:04:03,513][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:04:04,499][100917] Updated weights for policy 1, policy_version 94822 (0.0009) +[2023-10-14 09:04:04,867][100917] Updated weights for policy 1, policy_version 94832 (0.0009) +[2023-10-14 09:04:05,235][100917] Updated weights for policy 1, policy_version 94842 (0.0010) +[2023-10-14 09:04:06,209][100936] Updated weights for policy 0, policy_version 94690 (0.0008) +[2023-10-14 09:04:06,577][100936] Updated weights for policy 0, policy_version 94700 (0.0009) +[2023-10-14 09:04:06,943][100936] Updated weights for policy 0, policy_version 94710 (0.0007) +[2023-10-14 09:04:07,311][100936] Updated weights for policy 0, policy_version 94720 (0.0007) +[2023-10-14 09:04:08,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 194117632. Throughput: 0: 1665.8, 1: 1668.1. Samples: 48540550. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 09:04:08,513][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:04:09,192][100917] Updated weights for policy 1, policy_version 94852 (0.0010) +[2023-10-14 09:04:09,568][100917] Updated weights for policy 1, policy_version 94862 (0.0010) +[2023-10-14 09:04:09,938][100917] Updated weights for policy 1, policy_version 94872 (0.0010) +[2023-10-14 09:04:11,369][100936] Updated weights for policy 0, policy_version 94730 (0.0008) +[2023-10-14 09:04:11,742][100936] Updated weights for policy 0, policy_version 94740 (0.0007) +[2023-10-14 09:04:12,117][100936] Updated weights for policy 0, policy_version 94750 (0.0008) +[2023-10-14 09:04:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 194183168. Throughput: 0: 1655.7, 1: 1669.4. Samples: 48550418. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 09:04:13,513][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:04:13,991][100917] Updated weights for policy 1, policy_version 94882 (0.0008) +[2023-10-14 09:04:14,360][100917] Updated weights for policy 1, policy_version 94892 (0.0008) +[2023-10-14 09:04:14,738][100917] Updated weights for policy 1, policy_version 94902 (0.0010) +[2023-10-14 09:04:15,106][100917] Updated weights for policy 1, policy_version 94912 (0.0009) +[2023-10-14 09:04:16,283][100936] Updated weights for policy 0, policy_version 94760 (0.0009) +[2023-10-14 09:04:16,662][100936] Updated weights for policy 0, policy_version 94770 (0.0010) +[2023-10-14 09:04:17,025][100936] Updated weights for policy 0, policy_version 94780 (0.0009) +[2023-10-14 09:04:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 194248704. Throughput: 0: 1647.0, 1: 1673.6. Samples: 48569834. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 09:04:18,512][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:04:19,364][100917] Updated weights for policy 1, policy_version 94922 (0.0010) +[2023-10-14 09:04:19,727][100917] Updated weights for policy 1, policy_version 94932 (0.0009) +[2023-10-14 09:04:20,101][100917] Updated weights for policy 1, policy_version 94942 (0.0009) +[2023-10-14 09:04:21,061][100936] Updated weights for policy 0, policy_version 94790 (0.0008) +[2023-10-14 09:04:21,433][100936] Updated weights for policy 0, policy_version 94800 (0.0008) +[2023-10-14 09:04:21,800][100936] Updated weights for policy 0, policy_version 94810 (0.0007) +[2023-10-14 09:04:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 194314240. Throughput: 0: 1669.8, 1: 1667.7. Samples: 48590364. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 09:04:23,512][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:04:24,398][100917] Updated weights for policy 1, policy_version 94952 (0.0008) +[2023-10-14 09:04:24,765][100917] Updated weights for policy 1, policy_version 94962 (0.0010) +[2023-10-14 09:04:25,139][100917] Updated weights for policy 1, policy_version 94972 (0.0009) +[2023-10-14 09:04:25,921][100936] Updated weights for policy 0, policy_version 94820 (0.0009) +[2023-10-14 09:04:26,286][100936] Updated weights for policy 0, policy_version 94830 (0.0009) +[2023-10-14 09:04:26,648][100936] Updated weights for policy 0, policy_version 94840 (0.0008) +[2023-10-14 09:04:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 194379776. Throughput: 0: 1651.1, 1: 1663.5. Samples: 48599742. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 09:04:28,512][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:04:29,340][100917] Updated weights for policy 1, policy_version 94982 (0.0008) +[2023-10-14 09:04:29,716][100917] Updated weights for policy 1, policy_version 94992 (0.0009) +[2023-10-14 09:04:30,097][100917] Updated weights for policy 1, policy_version 95002 (0.0012) +[2023-10-14 09:04:30,776][100936] Updated weights for policy 0, policy_version 94850 (0.0007) +[2023-10-14 09:04:31,137][100936] Updated weights for policy 0, policy_version 94860 (0.0009) +[2023-10-14 09:04:31,499][100936] Updated weights for policy 0, policy_version 94870 (0.0009) +[2023-10-14 09:04:31,859][100936] Updated weights for policy 0, policy_version 94880 (0.0007) +[2023-10-14 09:04:33,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 194445312. Throughput: 0: 1653.6, 1: 1665.8. Samples: 48619554. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) +[2023-10-14 09:04:33,513][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:04:34,319][100917] Updated weights for policy 1, policy_version 95012 (0.0010) +[2023-10-14 09:04:34,681][100917] Updated weights for policy 1, policy_version 95022 (0.0009) +[2023-10-14 09:04:35,051][100917] Updated weights for policy 1, policy_version 95032 (0.0010) +[2023-10-14 09:04:36,020][100936] Updated weights for policy 0, policy_version 94890 (0.0007) +[2023-10-14 09:04:36,390][100936] Updated weights for policy 0, policy_version 94900 (0.0007) +[2023-10-14 09:04:36,758][100936] Updated weights for policy 0, policy_version 94910 (0.0010) +[2023-10-14 09:04:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 194510848. Throughput: 0: 1664.7, 1: 1660.5. Samples: 48639926. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) +[2023-10-14 09:04:38,513][99942] Avg episode reward: [(0, '0.770'), (1, '1.000')] +[2023-10-14 09:04:39,018][100917] Updated weights for policy 1, policy_version 95042 (0.0011) +[2023-10-14 09:04:39,390][100917] Updated weights for policy 1, policy_version 95052 (0.0009) +[2023-10-14 09:04:39,766][100917] Updated weights for policy 1, policy_version 95062 (0.0007) +[2023-10-14 09:04:40,138][100917] Updated weights for policy 1, policy_version 95072 (0.0007) +[2023-10-14 09:04:40,926][100936] Updated weights for policy 0, policy_version 94920 (0.0007) +[2023-10-14 09:04:41,291][100936] Updated weights for policy 0, policy_version 94930 (0.0008) +[2023-10-14 09:04:41,664][100936] Updated weights for policy 0, policy_version 94940 (0.0010) +[2023-10-14 09:04:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.1, 300 sec: 13329.3). Total num frames: 194576384. Throughput: 0: 1648.7, 1: 1663.2. Samples: 48649382. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) +[2023-10-14 09:04:43,513][99942] Avg episode reward: [(0, '0.880'), (1, '1.000')] +[2023-10-14 09:04:44,248][100917] Updated weights for policy 1, policy_version 95082 (0.0010) +[2023-10-14 09:04:44,630][100917] Updated weights for policy 1, policy_version 95092 (0.0010) +[2023-10-14 09:04:45,006][100917] Updated weights for policy 1, policy_version 95102 (0.0010) +[2023-10-14 09:04:45,875][100936] Updated weights for policy 0, policy_version 94950 (0.0008) +[2023-10-14 09:04:46,237][100936] Updated weights for policy 0, policy_version 94960 (0.0008) +[2023-10-14 09:04:46,603][100936] Updated weights for policy 0, policy_version 94970 (0.0007) +[2023-10-14 09:04:48,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13329.3). Total num frames: 194641920. Throughput: 0: 1660.6, 1: 1663.8. Samples: 48669246. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) +[2023-10-14 09:04:48,513][99942] Avg episode reward: [(0, '0.880'), (1, '0.980')] +[2023-10-14 09:04:49,251][100917] Updated weights for policy 1, policy_version 95112 (0.0008) +[2023-10-14 09:04:49,619][100917] Updated weights for policy 1, policy_version 95122 (0.0007) +[2023-10-14 09:04:49,996][100917] Updated weights for policy 1, policy_version 95132 (0.0007) +[2023-10-14 09:04:50,774][100936] Updated weights for policy 0, policy_version 94980 (0.0008) +[2023-10-14 09:04:51,141][100936] Updated weights for policy 0, policy_version 94990 (0.0007) +[2023-10-14 09:04:51,514][100936] Updated weights for policy 0, policy_version 95000 (0.0009) +[2023-10-14 09:04:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 194707456. Throughput: 0: 1659.0, 1: 1650.8. Samples: 48689492. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) +[2023-10-14 09:04:53,513][99942] Avg episode reward: [(0, '0.880'), (1, '0.980')] +[2023-10-14 09:04:54,073][100917] Updated weights for policy 1, policy_version 95142 (0.0009) +[2023-10-14 09:04:54,453][100917] Updated weights for policy 1, policy_version 95152 (0.0008) +[2023-10-14 09:04:54,828][100917] Updated weights for policy 1, policy_version 95162 (0.0009) +[2023-10-14 09:04:55,672][100936] Updated weights for policy 0, policy_version 95010 (0.0008) +[2023-10-14 09:04:56,048][100936] Updated weights for policy 0, policy_version 95020 (0.0008) +[2023-10-14 09:04:56,423][100936] Updated weights for policy 0, policy_version 95030 (0.0007) +[2023-10-14 09:04:56,785][100936] Updated weights for policy 0, policy_version 95040 (0.0009) +[2023-10-14 09:04:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 194772992. Throughput: 0: 1647.7, 1: 1652.1. Samples: 48698910. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) +[2023-10-14 09:04:58,513][99942] Avg episode reward: [(0, '0.880'), (1, '0.980')] +[2023-10-14 09:04:58,840][100917] Updated weights for policy 1, policy_version 95172 (0.0009) +[2023-10-14 09:04:59,219][100917] Updated weights for policy 1, policy_version 95182 (0.0010) +[2023-10-14 09:04:59,592][100917] Updated weights for policy 1, policy_version 95192 (0.0008) +[2023-10-14 09:05:00,996][100936] Updated weights for policy 0, policy_version 95050 (0.0009) +[2023-10-14 09:05:01,371][100936] Updated weights for policy 0, policy_version 95060 (0.0010) +[2023-10-14 09:05:01,730][100936] Updated weights for policy 0, policy_version 95070 (0.0011) +[2023-10-14 09:05:03,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 194838528. Throughput: 0: 1658.5, 1: 1653.6. Samples: 48718882. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) +[2023-10-14 09:05:03,512][99942] Avg episode reward: [(0, '0.880'), (1, '0.980')] +[2023-10-14 09:05:03,678][100917] Updated weights for policy 1, policy_version 95202 (0.0009) +[2023-10-14 09:05:04,050][100917] Updated weights for policy 1, policy_version 95212 (0.0011) +[2023-10-14 09:05:04,428][100917] Updated weights for policy 1, policy_version 95222 (0.0010) +[2023-10-14 09:05:04,800][100917] Updated weights for policy 1, policy_version 95232 (0.0009) +[2023-10-14 09:05:06,011][100936] Updated weights for policy 0, policy_version 95080 (0.0009) +[2023-10-14 09:05:06,389][100936] Updated weights for policy 0, policy_version 95090 (0.0009) +[2023-10-14 09:05:06,761][100936] Updated weights for policy 0, policy_version 95100 (0.0011) +[2023-10-14 09:05:08,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 194904064. Throughput: 0: 1646.8, 1: 1655.2. Samples: 48738954. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) +[2023-10-14 09:05:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:05:08,896][100917] Updated weights for policy 1, policy_version 95242 (0.0009) +[2023-10-14 09:05:09,275][100917] Updated weights for policy 1, policy_version 95252 (0.0008) +[2023-10-14 09:05:09,654][100917] Updated weights for policy 1, policy_version 95262 (0.0010) +[2023-10-14 09:05:10,953][100936] Updated weights for policy 0, policy_version 95110 (0.0009) +[2023-10-14 09:05:11,325][100936] Updated weights for policy 0, policy_version 95120 (0.0009) +[2023-10-14 09:05:11,688][100936] Updated weights for policy 0, policy_version 95130 (0.0010) +[2023-10-14 09:05:13,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 194969600. Throughput: 0: 1645.9, 1: 1660.7. Samples: 48748540. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) +[2023-10-14 09:05:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:05:13,664][100917] Updated weights for policy 1, policy_version 95272 (0.0007) +[2023-10-14 09:05:14,041][100917] Updated weights for policy 1, policy_version 95282 (0.0009) +[2023-10-14 09:05:14,407][100917] Updated weights for policy 1, policy_version 95292 (0.0009) +[2023-10-14 09:05:15,834][100936] Updated weights for policy 0, policy_version 95140 (0.0012) +[2023-10-14 09:05:16,209][100936] Updated weights for policy 0, policy_version 95150 (0.0008) +[2023-10-14 09:05:16,569][100936] Updated weights for policy 0, policy_version 95160 (0.0008) +[2023-10-14 09:05:18,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13329.4). Total num frames: 195035136. Throughput: 0: 1649.1, 1: 1661.8. Samples: 48768544. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) +[2023-10-14 09:05:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:05:18,644][100917] Updated weights for policy 1, policy_version 95302 (0.0010) +[2023-10-14 09:05:19,021][100917] Updated weights for policy 1, policy_version 95312 (0.0009) +[2023-10-14 09:05:19,398][100917] Updated weights for policy 1, policy_version 95322 (0.0009) +[2023-10-14 09:05:20,511][100936] Updated weights for policy 0, policy_version 95170 (0.0009) +[2023-10-14 09:05:20,884][100936] Updated weights for policy 0, policy_version 95180 (0.0011) +[2023-10-14 09:05:21,246][100936] Updated weights for policy 0, policy_version 95190 (0.0010) +[2023-10-14 09:05:21,616][100936] Updated weights for policy 0, policy_version 95200 (0.0010) +[2023-10-14 09:05:23,441][100917] Updated weights for policy 1, policy_version 95332 (0.0008) +[2023-10-14 09:05:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 195100672. Throughput: 0: 1650.0, 1: 1665.4. Samples: 48789116. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) +[2023-10-14 09:05:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:05:23,819][100917] Updated weights for policy 1, policy_version 95342 (0.0009) +[2023-10-14 09:05:24,185][100917] Updated weights for policy 1, policy_version 95352 (0.0011) +[2023-10-14 09:05:25,983][100936] Updated weights for policy 0, policy_version 95210 (0.0007) +[2023-10-14 09:05:26,346][100936] Updated weights for policy 0, policy_version 95220 (0.0007) +[2023-10-14 09:05:26,714][100936] Updated weights for policy 0, policy_version 95230 (0.0008) +[2023-10-14 09:05:28,413][100917] Updated weights for policy 1, policy_version 95362 (0.0009) +[2023-10-14 09:05:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 195166208. Throughput: 0: 1649.8, 1: 1668.0. Samples: 48798680. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) +[2023-10-14 09:05:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:05:28,824][100917] Updated weights for policy 1, policy_version 95372 (0.0010) +[2023-10-14 09:05:29,209][100917] Updated weights for policy 1, policy_version 95382 (0.0007) +[2023-10-14 09:05:29,575][100917] Updated weights for policy 1, policy_version 95392 (0.0011) +[2023-10-14 09:05:30,813][100936] Updated weights for policy 0, policy_version 95240 (0.0007) +[2023-10-14 09:05:31,176][100936] Updated weights for policy 0, policy_version 95250 (0.0009) +[2023-10-14 09:05:31,547][100936] Updated weights for policy 0, policy_version 95260 (0.0010) +[2023-10-14 09:05:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 195231744. Throughput: 0: 1648.3, 1: 1660.7. Samples: 48818152. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) +[2023-10-14 09:05:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:05:33,831][100917] Updated weights for policy 1, policy_version 95402 (0.0010) +[2023-10-14 09:05:34,203][100917] Updated weights for policy 1, policy_version 95412 (0.0009) +[2023-10-14 09:05:34,579][100917] Updated weights for policy 1, policy_version 95422 (0.0007) +[2023-10-14 09:05:35,653][100936] Updated weights for policy 0, policy_version 95270 (0.0008) +[2023-10-14 09:05:36,024][100936] Updated weights for policy 0, policy_version 95280 (0.0008) +[2023-10-14 09:05:36,391][100936] Updated weights for policy 0, policy_version 95290 (0.0008) +[2023-10-14 09:05:38,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 195297280. Throughput: 0: 1648.6, 1: 1667.1. Samples: 48838698. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:05:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:05:38,519][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000095296_97583104.pth... +[2023-10-14 09:05:38,554][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000093760_96010240.pth +[2023-10-14 09:05:38,694][100917] Updated weights for policy 1, policy_version 95432 (0.0010) +[2023-10-14 09:05:39,066][100917] Updated weights for policy 1, policy_version 95442 (0.0010) +[2023-10-14 09:05:39,431][100917] Updated weights for policy 1, policy_version 95452 (0.0010) +[2023-10-14 09:05:39,579][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000095456_97746944.pth... +[2023-10-14 09:05:39,614][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000093888_96141312.pth +[2023-10-14 09:05:40,544][100936] Updated weights for policy 0, policy_version 95300 (0.0008) +[2023-10-14 09:05:40,912][100936] Updated weights for policy 0, policy_version 95310 (0.0009) +[2023-10-14 09:05:41,283][100936] Updated weights for policy 0, policy_version 95320 (0.0008) +[2023-10-14 09:05:43,482][100917] Updated weights for policy 1, policy_version 95462 (0.0010) +[2023-10-14 09:05:43,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 195362816. Throughput: 0: 1647.2, 1: 1665.5. Samples: 48847982. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:05:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:05:43,852][100917] Updated weights for policy 1, policy_version 95472 (0.0008) +[2023-10-14 09:05:44,235][100917] Updated weights for policy 1, policy_version 95482 (0.0010) +[2023-10-14 09:05:45,245][100936] Updated weights for policy 0, policy_version 95330 (0.0008) +[2023-10-14 09:05:45,615][100936] Updated weights for policy 0, policy_version 95340 (0.0010) +[2023-10-14 09:05:45,983][100936] Updated weights for policy 0, policy_version 95350 (0.0010) +[2023-10-14 09:05:46,363][100936] Updated weights for policy 0, policy_version 95360 (0.0008) +[2023-10-14 09:05:48,297][100917] Updated weights for policy 1, policy_version 95492 (0.0010) +[2023-10-14 09:05:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 195428352. Throughput: 0: 1659.5, 1: 1664.2. Samples: 48868448. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:05:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:05:48,673][100917] Updated weights for policy 1, policy_version 95502 (0.0007) +[2023-10-14 09:05:49,034][100917] Updated weights for policy 1, policy_version 95512 (0.0007) +[2023-10-14 09:05:50,455][100936] Updated weights for policy 0, policy_version 95370 (0.0011) +[2023-10-14 09:05:50,825][100936] Updated weights for policy 0, policy_version 95380 (0.0008) +[2023-10-14 09:05:51,200][100936] Updated weights for policy 0, policy_version 95390 (0.0011) +[2023-10-14 09:05:52,994][100917] Updated weights for policy 1, policy_version 95522 (0.0007) +[2023-10-14 09:05:53,364][100917] Updated weights for policy 1, policy_version 95532 (0.0010) +[2023-10-14 09:05:53,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 195493888. Throughput: 0: 1665.0, 1: 1669.6. Samples: 48889010. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:05:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:05:53,740][100917] Updated weights for policy 1, policy_version 95542 (0.0007) +[2023-10-14 09:05:54,122][100917] Updated weights for policy 1, policy_version 95552 (0.0007) +[2023-10-14 09:05:55,328][100936] Updated weights for policy 0, policy_version 95400 (0.0008) +[2023-10-14 09:05:55,696][100936] Updated weights for policy 0, policy_version 95410 (0.0007) +[2023-10-14 09:05:56,064][100936] Updated weights for policy 0, policy_version 95420 (0.0008) +[2023-10-14 09:05:58,268][100917] Updated weights for policy 1, policy_version 95562 (0.0008) +[2023-10-14 09:05:58,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 195559424. Throughput: 0: 1651.9, 1: 1668.6. Samples: 48897960. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:05:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:05:58,638][100917] Updated weights for policy 1, policy_version 95572 (0.0009) +[2023-10-14 09:05:59,008][100917] Updated weights for policy 1, policy_version 95582 (0.0009) +[2023-10-14 09:06:00,204][100936] Updated weights for policy 0, policy_version 95430 (0.0010) +[2023-10-14 09:06:00,575][100936] Updated weights for policy 0, policy_version 95440 (0.0009) +[2023-10-14 09:06:00,947][100936] Updated weights for policy 0, policy_version 95450 (0.0010) +[2023-10-14 09:06:03,067][100917] Updated weights for policy 1, policy_version 95592 (0.0010) +[2023-10-14 09:06:03,443][100917] Updated weights for policy 1, policy_version 95602 (0.0011) +[2023-10-14 09:06:03,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 195624960. Throughput: 0: 1661.8, 1: 1663.0. Samples: 48918158. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:06:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:06:03,821][100917] Updated weights for policy 1, policy_version 95612 (0.0009) +[2023-10-14 09:06:05,048][100936] Updated weights for policy 0, policy_version 95460 (0.0009) +[2023-10-14 09:06:05,421][100936] Updated weights for policy 0, policy_version 95470 (0.0007) +[2023-10-14 09:06:05,801][100936] Updated weights for policy 0, policy_version 95480 (0.0009) +[2023-10-14 09:06:08,060][100917] Updated weights for policy 1, policy_version 95622 (0.0007) +[2023-10-14 09:06:08,429][100917] Updated weights for policy 1, policy_version 95632 (0.0007) +[2023-10-14 09:06:08,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 195690496. Throughput: 0: 1662.9, 1: 1655.4. Samples: 48938442. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:06:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:06:08,809][100917] Updated weights for policy 1, policy_version 95642 (0.0010) +[2023-10-14 09:06:10,004][100936] Updated weights for policy 0, policy_version 95490 (0.0008) +[2023-10-14 09:06:10,371][100936] Updated weights for policy 0, policy_version 95500 (0.0007) +[2023-10-14 09:06:10,745][100936] Updated weights for policy 0, policy_version 95510 (0.0009) +[2023-10-14 09:06:11,113][100936] Updated weights for policy 0, policy_version 95520 (0.0008) +[2023-10-14 09:06:12,844][100917] Updated weights for policy 1, policy_version 95652 (0.0009) +[2023-10-14 09:06:13,214][100917] Updated weights for policy 1, policy_version 95662 (0.0007) +[2023-10-14 09:06:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 195756032. Throughput: 0: 1648.1, 1: 1659.6. Samples: 48947528. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:06:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:06:13,581][100917] Updated weights for policy 1, policy_version 95672 (0.0011) +[2023-10-14 09:06:15,339][100936] Updated weights for policy 0, policy_version 95530 (0.0010) +[2023-10-14 09:06:15,712][100936] Updated weights for policy 0, policy_version 95540 (0.0009) +[2023-10-14 09:06:16,079][100936] Updated weights for policy 0, policy_version 95550 (0.0009) +[2023-10-14 09:06:17,819][100917] Updated weights for policy 1, policy_version 95682 (0.0011) +[2023-10-14 09:06:18,244][100917] Updated weights for policy 1, policy_version 95692 (0.0009) +[2023-10-14 09:06:18,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 195821568. Throughput: 0: 1661.6, 1: 1670.4. Samples: 48968096. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:06:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:06:18,611][100917] Updated weights for policy 1, policy_version 95702 (0.0007) +[2023-10-14 09:06:18,993][100917] Updated weights for policy 1, policy_version 95712 (0.0007) +[2023-10-14 09:06:20,237][100936] Updated weights for policy 0, policy_version 95560 (0.0008) +[2023-10-14 09:06:20,609][100936] Updated weights for policy 0, policy_version 95570 (0.0009) +[2023-10-14 09:06:20,980][100936] Updated weights for policy 0, policy_version 95580 (0.0008) +[2023-10-14 09:06:22,987][100917] Updated weights for policy 1, policy_version 95722 (0.0010) +[2023-10-14 09:06:23,361][100917] Updated weights for policy 1, policy_version 95732 (0.0010) +[2023-10-14 09:06:23,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 195887104. Throughput: 0: 1658.7, 1: 1658.6. Samples: 48987976. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:06:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:06:23,738][100917] Updated weights for policy 1, policy_version 95742 (0.0010) +[2023-10-14 09:06:25,214][100936] Updated weights for policy 0, policy_version 95590 (0.0008) +[2023-10-14 09:06:25,578][100936] Updated weights for policy 0, policy_version 95600 (0.0007) +[2023-10-14 09:06:25,947][100936] Updated weights for policy 0, policy_version 95610 (0.0007) +[2023-10-14 09:06:27,795][100917] Updated weights for policy 1, policy_version 95752 (0.0010) +[2023-10-14 09:06:28,163][100917] Updated weights for policy 1, policy_version 95762 (0.0009) +[2023-10-14 09:06:28,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 195952640. Throughput: 0: 1650.5, 1: 1667.1. Samples: 48997276. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:06:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:06:28,535][100917] Updated weights for policy 1, policy_version 95772 (0.0009) +[2023-10-14 09:06:29,970][100936] Updated weights for policy 0, policy_version 95620 (0.0007) +[2023-10-14 09:06:30,333][100936] Updated weights for policy 0, policy_version 95630 (0.0008) +[2023-10-14 09:06:30,715][100936] Updated weights for policy 0, policy_version 95640 (0.0008) +[2023-10-14 09:06:32,563][100917] Updated weights for policy 1, policy_version 95782 (0.0010) +[2023-10-14 09:06:32,932][100917] Updated weights for policy 1, policy_version 95792 (0.0009) +[2023-10-14 09:06:33,307][100917] Updated weights for policy 1, policy_version 95802 (0.0009) +[2023-10-14 09:06:33,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 196018176. Throughput: 0: 1656.2, 1: 1664.6. Samples: 49017884. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:06:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:06:34,874][100936] Updated weights for policy 0, policy_version 95650 (0.0007) +[2023-10-14 09:06:35,280][100936] Updated weights for policy 0, policy_version 95660 (0.0007) +[2023-10-14 09:06:35,662][100936] Updated weights for policy 0, policy_version 95670 (0.0007) +[2023-10-14 09:06:36,027][100936] Updated weights for policy 0, policy_version 95680 (0.0007) +[2023-10-14 09:06:37,267][100917] Updated weights for policy 1, policy_version 95812 (0.0009) +[2023-10-14 09:06:37,632][100917] Updated weights for policy 1, policy_version 95822 (0.0008) +[2023-10-14 09:06:38,008][100917] Updated weights for policy 1, policy_version 95832 (0.0008) +[2023-10-14 09:06:38,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 196116480. Throughput: 0: 1655.4, 1: 1643.6. Samples: 49037466. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 09:06:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:06:40,083][100936] Updated weights for policy 0, policy_version 95690 (0.0008) +[2023-10-14 09:06:40,450][100936] Updated weights for policy 0, policy_version 95700 (0.0008) +[2023-10-14 09:06:40,817][100936] Updated weights for policy 0, policy_version 95710 (0.0008) +[2023-10-14 09:06:42,152][100917] Updated weights for policy 1, policy_version 95842 (0.0009) +[2023-10-14 09:06:42,532][100917] Updated weights for policy 1, policy_version 95852 (0.0008) +[2023-10-14 09:06:42,911][100917] Updated weights for policy 1, policy_version 95862 (0.0009) +[2023-10-14 09:06:43,289][100917] Updated weights for policy 1, policy_version 95872 (0.0008) +[2023-10-14 09:06:43,512][99942] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 196182016. Throughput: 0: 1655.5, 1: 1661.6. Samples: 49047230. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 09:06:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:06:44,765][100936] Updated weights for policy 0, policy_version 95720 (0.0009) +[2023-10-14 09:06:45,137][100936] Updated weights for policy 0, policy_version 95730 (0.0010) +[2023-10-14 09:06:45,502][100936] Updated weights for policy 0, policy_version 95740 (0.0008) +[2023-10-14 09:06:47,366][100917] Updated weights for policy 1, policy_version 95882 (0.0008) +[2023-10-14 09:06:47,742][100917] Updated weights for policy 1, policy_version 95892 (0.0007) +[2023-10-14 09:06:48,114][100917] Updated weights for policy 1, policy_version 95902 (0.0007) +[2023-10-14 09:06:48,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 196247552. Throughput: 0: 1655.5, 1: 1660.9. Samples: 49067398. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 09:06:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:06:49,825][100936] Updated weights for policy 0, policy_version 95750 (0.0007) +[2023-10-14 09:06:50,194][100936] Updated weights for policy 0, policy_version 95760 (0.0008) +[2023-10-14 09:06:50,567][100936] Updated weights for policy 0, policy_version 95770 (0.0007) +[2023-10-14 09:06:52,336][100917] Updated weights for policy 1, policy_version 95912 (0.0009) +[2023-10-14 09:06:52,704][100917] Updated weights for policy 1, policy_version 95922 (0.0007) +[2023-10-14 09:06:53,076][100917] Updated weights for policy 1, policy_version 95932 (0.0007) +[2023-10-14 09:06:53,512][99942] Fps is (10 sec: 13106.7, 60 sec: 13653.2, 300 sec: 13329.3). Total num frames: 196313088. Throughput: 0: 1652.7, 1: 1644.7. Samples: 49086826. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 09:06:53,514][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:06:54,760][100936] Updated weights for policy 0, policy_version 95780 (0.0008) +[2023-10-14 09:06:55,125][100936] Updated weights for policy 0, policy_version 95790 (0.0011) +[2023-10-14 09:06:55,508][100936] Updated weights for policy 0, policy_version 95800 (0.0010) +[2023-10-14 09:06:57,159][100917] Updated weights for policy 1, policy_version 95942 (0.0009) +[2023-10-14 09:06:57,530][100917] Updated weights for policy 1, policy_version 95952 (0.0011) +[2023-10-14 09:06:57,905][100917] Updated weights for policy 1, policy_version 95962 (0.0007) +[2023-10-14 09:06:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13329.4). Total num frames: 196378624. Throughput: 0: 1651.0, 1: 1664.7. Samples: 49096732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 09:06:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:06:59,655][100936] Updated weights for policy 0, policy_version 95810 (0.0008) +[2023-10-14 09:07:00,038][100936] Updated weights for policy 0, policy_version 95820 (0.0007) +[2023-10-14 09:07:00,400][100936] Updated weights for policy 0, policy_version 95830 (0.0010) +[2023-10-14 09:07:00,767][100936] Updated weights for policy 0, policy_version 95840 (0.0007) +[2023-10-14 09:07:01,960][100917] Updated weights for policy 1, policy_version 95972 (0.0008) +[2023-10-14 09:07:02,327][100917] Updated weights for policy 1, policy_version 95982 (0.0010) +[2023-10-14 09:07:02,697][100917] Updated weights for policy 1, policy_version 95992 (0.0009) +[2023-10-14 09:07:03,512][99942] Fps is (10 sec: 13107.8, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 196444160. Throughput: 0: 1649.2, 1: 1661.4. Samples: 49117070. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 09:07:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:07:04,992][100936] Updated weights for policy 0, policy_version 95850 (0.0007) +[2023-10-14 09:07:05,357][100936] Updated weights for policy 0, policy_version 95860 (0.0007) +[2023-10-14 09:07:05,742][100936] Updated weights for policy 0, policy_version 95870 (0.0009) +[2023-10-14 09:07:06,885][100917] Updated weights for policy 1, policy_version 96002 (0.0011) +[2023-10-14 09:07:07,297][100917] Updated weights for policy 1, policy_version 96012 (0.0010) +[2023-10-14 09:07:07,674][100917] Updated weights for policy 1, policy_version 96022 (0.0009) +[2023-10-14 09:07:08,050][100917] Updated weights for policy 1, policy_version 96032 (0.0008) +[2023-10-14 09:07:08,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13329.4). Total num frames: 196509696. Throughput: 0: 1648.7, 1: 1649.5. Samples: 49136392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 09:07:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:07:09,940][100936] Updated weights for policy 0, policy_version 95880 (0.0009) +[2023-10-14 09:07:10,308][100936] Updated weights for policy 0, policy_version 95890 (0.0008) +[2023-10-14 09:07:10,681][100936] Updated weights for policy 0, policy_version 95900 (0.0008) +[2023-10-14 09:07:12,183][100917] Updated weights for policy 1, policy_version 96042 (0.0008) +[2023-10-14 09:07:12,553][100917] Updated weights for policy 1, policy_version 96052 (0.0007) +[2023-10-14 09:07:12,932][100917] Updated weights for policy 1, policy_version 96062 (0.0007) +[2023-10-14 09:07:13,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 196575232. Throughput: 0: 1644.9, 1: 1666.4. Samples: 49146286. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 09:07:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:07:14,857][100936] Updated weights for policy 0, policy_version 95910 (0.0011) +[2023-10-14 09:07:15,220][100936] Updated weights for policy 0, policy_version 95920 (0.0011) +[2023-10-14 09:07:15,601][100936] Updated weights for policy 0, policy_version 95930 (0.0008) +[2023-10-14 09:07:17,131][100917] Updated weights for policy 1, policy_version 96072 (0.0009) +[2023-10-14 09:07:17,511][100917] Updated weights for policy 1, policy_version 96082 (0.0009) +[2023-10-14 09:07:17,880][100917] Updated weights for policy 1, policy_version 96092 (0.0010) +[2023-10-14 09:07:18,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 196640768. Throughput: 0: 1642.3, 1: 1662.5. Samples: 49166600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 09:07:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:07:20,018][100936] Updated weights for policy 0, policy_version 95940 (0.0010) +[2023-10-14 09:07:20,405][100936] Updated weights for policy 0, policy_version 95950 (0.0009) +[2023-10-14 09:07:20,789][100936] Updated weights for policy 0, policy_version 95960 (0.0009) +[2023-10-14 09:07:22,077][100917] Updated weights for policy 1, policy_version 96102 (0.0007) +[2023-10-14 09:07:22,458][100917] Updated weights for policy 1, policy_version 96112 (0.0010) +[2023-10-14 09:07:22,823][100917] Updated weights for policy 1, policy_version 96122 (0.0010) +[2023-10-14 09:07:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 196706304. Throughput: 0: 1637.2, 1: 1656.6. Samples: 49185690. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 09:07:23,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:07:24,989][100936] Updated weights for policy 0, policy_version 95970 (0.0010) +[2023-10-14 09:07:25,363][100936] Updated weights for policy 0, policy_version 95980 (0.0008) +[2023-10-14 09:07:25,731][100936] Updated weights for policy 0, policy_version 95990 (0.0008) +[2023-10-14 09:07:26,093][100936] Updated weights for policy 0, policy_version 96000 (0.0008) +[2023-10-14 09:07:26,862][100917] Updated weights for policy 1, policy_version 96132 (0.0009) +[2023-10-14 09:07:27,239][100917] Updated weights for policy 1, policy_version 96142 (0.0010) +[2023-10-14 09:07:27,613][100917] Updated weights for policy 1, policy_version 96152 (0.0010) +[2023-10-14 09:07:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 196771840. Throughput: 0: 1634.8, 1: 1666.9. Samples: 49195808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 09:07:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:07:29,996][100936] Updated weights for policy 0, policy_version 96010 (0.0007) +[2023-10-14 09:07:30,365][100936] Updated weights for policy 0, policy_version 96020 (0.0008) +[2023-10-14 09:07:30,733][100936] Updated weights for policy 0, policy_version 96030 (0.0009) +[2023-10-14 09:07:31,506][100917] Updated weights for policy 1, policy_version 96162 (0.0011) +[2023-10-14 09:07:31,882][100917] Updated weights for policy 1, policy_version 96172 (0.0011) +[2023-10-14 09:07:32,265][100917] Updated weights for policy 1, policy_version 96182 (0.0008) +[2023-10-14 09:07:32,627][100917] Updated weights for policy 1, policy_version 96192 (0.0009) +[2023-10-14 09:07:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13218.3). Total num frames: 196837376. Throughput: 0: 1642.1, 1: 1659.3. Samples: 49215958. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-10-14 09:07:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '0.980')] +[2023-10-14 09:07:34,829][100936] Updated weights for policy 0, policy_version 96040 (0.0009) +[2023-10-14 09:07:35,192][100936] Updated weights for policy 0, policy_version 96050 (0.0008) +[2023-10-14 09:07:35,558][100936] Updated weights for policy 0, policy_version 96060 (0.0010) +[2023-10-14 09:07:36,760][100917] Updated weights for policy 1, policy_version 96202 (0.0010) +[2023-10-14 09:07:37,140][100917] Updated weights for policy 1, policy_version 96212 (0.0008) +[2023-10-14 09:07:37,512][100917] Updated weights for policy 1, policy_version 96222 (0.0009) +[2023-10-14 09:07:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 196902912. Throughput: 0: 1641.5, 1: 1665.3. Samples: 49235632. Policy #0 lag: (min: 22.0, avg: 22.3, max: 34.0) +[2023-10-14 09:07:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:07:38,523][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000096064_98369536.pth... +[2023-10-14 09:07:38,523][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000096224_98533376.pth... +[2023-10-14 09:07:38,552][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000094656_96927744.pth +[2023-10-14 09:07:38,563][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000094528_96796672.pth +[2023-10-14 09:07:39,766][100936] Updated weights for policy 0, policy_version 96070 (0.0008) +[2023-10-14 09:07:40,134][100936] Updated weights for policy 0, policy_version 96080 (0.0007) +[2023-10-14 09:07:40,513][100936] Updated weights for policy 0, policy_version 96090 (0.0009) +[2023-10-14 09:07:41,624][100917] Updated weights for policy 1, policy_version 96232 (0.0007) +[2023-10-14 09:07:41,993][100917] Updated weights for policy 1, policy_version 96242 (0.0007) +[2023-10-14 09:07:42,369][100917] Updated weights for policy 1, policy_version 96252 (0.0007) +[2023-10-14 09:07:43,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 196968448. Throughput: 0: 1645.3, 1: 1668.3. Samples: 49245846. Policy #0 lag: (min: 22.0, avg: 22.3, max: 34.0) +[2023-10-14 09:07:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:07:44,557][100936] Updated weights for policy 0, policy_version 96100 (0.0008) +[2023-10-14 09:07:44,923][100936] Updated weights for policy 0, policy_version 96110 (0.0009) +[2023-10-14 09:07:45,290][100936] Updated weights for policy 0, policy_version 96120 (0.0010) +[2023-10-14 09:07:46,403][100917] Updated weights for policy 1, policy_version 96262 (0.0010) +[2023-10-14 09:07:46,771][100917] Updated weights for policy 1, policy_version 96272 (0.0011) +[2023-10-14 09:07:47,133][100917] Updated weights for policy 1, policy_version 96282 (0.0009) +[2023-10-14 09:07:48,512][99942] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 197033984. Throughput: 0: 1643.4, 1: 1654.0. Samples: 49265454. Policy #0 lag: (min: 22.0, avg: 22.3, max: 34.0) +[2023-10-14 09:07:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:07:49,499][100936] Updated weights for policy 0, policy_version 96130 (0.0010) +[2023-10-14 09:07:49,875][100936] Updated weights for policy 0, policy_version 96140 (0.0010) +[2023-10-14 09:07:50,241][100936] Updated weights for policy 0, policy_version 96150 (0.0010) +[2023-10-14 09:07:50,611][100936] Updated weights for policy 0, policy_version 96160 (0.0007) +[2023-10-14 09:07:51,285][100917] Updated weights for policy 1, policy_version 96292 (0.0009) +[2023-10-14 09:07:51,655][100917] Updated weights for policy 1, policy_version 96302 (0.0008) +[2023-10-14 09:07:52,026][100917] Updated weights for policy 1, policy_version 96312 (0.0007) +[2023-10-14 09:07:53,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 197099520. Throughput: 0: 1637.8, 1: 1666.4. Samples: 49285080. Policy #0 lag: (min: 22.0, avg: 22.3, max: 34.0) +[2023-10-14 09:07:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:07:54,792][100936] Updated weights for policy 0, policy_version 96170 (0.0009) +[2023-10-14 09:07:55,156][100936] Updated weights for policy 0, policy_version 96180 (0.0007) +[2023-10-14 09:07:55,529][100936] Updated weights for policy 0, policy_version 96190 (0.0007) +[2023-10-14 09:07:56,198][100917] Updated weights for policy 1, policy_version 96322 (0.0007) +[2023-10-14 09:07:56,626][100917] Updated weights for policy 1, policy_version 96332 (0.0008) +[2023-10-14 09:07:57,006][100917] Updated weights for policy 1, policy_version 96342 (0.0008) +[2023-10-14 09:07:57,382][100917] Updated weights for policy 1, policy_version 96352 (0.0012) +[2023-10-14 09:07:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 197165056. Throughput: 0: 1641.1, 1: 1672.4. Samples: 49295396. Policy #0 lag: (min: 22.0, avg: 22.3, max: 34.0) +[2023-10-14 09:07:58,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:07:59,652][100936] Updated weights for policy 0, policy_version 96200 (0.0010) +[2023-10-14 09:08:00,025][100936] Updated weights for policy 0, policy_version 96210 (0.0007) +[2023-10-14 09:08:00,382][100936] Updated weights for policy 0, policy_version 96220 (0.0010) +[2023-10-14 09:08:01,407][100917] Updated weights for policy 1, policy_version 96362 (0.0009) +[2023-10-14 09:08:01,770][100917] Updated weights for policy 1, policy_version 96372 (0.0009) +[2023-10-14 09:08:02,136][100917] Updated weights for policy 1, policy_version 96382 (0.0010) +[2023-10-14 09:08:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 197230592. Throughput: 0: 1637.7, 1: 1652.5. Samples: 49314658. Policy #0 lag: (min: 22.0, avg: 22.3, max: 34.0) +[2023-10-14 09:08:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:08:04,603][100936] Updated weights for policy 0, policy_version 96230 (0.0009) +[2023-10-14 09:08:04,979][100936] Updated weights for policy 0, policy_version 96240 (0.0007) +[2023-10-14 09:08:05,343][100936] Updated weights for policy 0, policy_version 96250 (0.0008) +[2023-10-14 09:08:06,270][100917] Updated weights for policy 1, policy_version 96392 (0.0007) +[2023-10-14 09:08:06,644][100917] Updated weights for policy 1, policy_version 96402 (0.0008) +[2023-10-14 09:08:07,006][100917] Updated weights for policy 1, policy_version 96412 (0.0007) +[2023-10-14 09:08:08,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 197296128. Throughput: 0: 1644.4, 1: 1667.0. Samples: 49334706. Policy #0 lag: (min: 22.0, avg: 22.3, max: 34.0) +[2023-10-14 09:08:08,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:08:09,693][100936] Updated weights for policy 0, policy_version 96260 (0.0009) +[2023-10-14 09:08:10,081][100936] Updated weights for policy 0, policy_version 96270 (0.0011) +[2023-10-14 09:08:10,454][100936] Updated weights for policy 0, policy_version 96280 (0.0009) +[2023-10-14 09:08:11,122][100917] Updated weights for policy 1, policy_version 96422 (0.0008) +[2023-10-14 09:08:11,490][100917] Updated weights for policy 1, policy_version 96432 (0.0007) +[2023-10-14 09:08:11,860][100917] Updated weights for policy 1, policy_version 96442 (0.0007) +[2023-10-14 09:08:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 197361664. Throughput: 0: 1645.0, 1: 1668.0. Samples: 49344894. Policy #0 lag: (min: 22.0, avg: 22.3, max: 34.0) +[2023-10-14 09:08:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:08:14,784][100936] Updated weights for policy 0, policy_version 96290 (0.0008) +[2023-10-14 09:08:15,151][100936] Updated weights for policy 0, policy_version 96300 (0.0008) +[2023-10-14 09:08:15,513][100936] Updated weights for policy 0, policy_version 96310 (0.0007) +[2023-10-14 09:08:15,794][100917] Updated weights for policy 1, policy_version 96452 (0.0007) +[2023-10-14 09:08:15,877][100936] Updated weights for policy 0, policy_version 96320 (0.0007) +[2023-10-14 09:08:16,156][100917] Updated weights for policy 1, policy_version 96462 (0.0009) +[2023-10-14 09:08:16,530][100917] Updated weights for policy 1, policy_version 96472 (0.0007) +[2023-10-14 09:08:18,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 197427200. Throughput: 0: 1638.0, 1: 1654.0. Samples: 49364100. Policy #0 lag: (min: 22.0, avg: 22.3, max: 34.0) +[2023-10-14 09:08:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:08:20,062][100936] Updated weights for policy 0, policy_version 96330 (0.0007) +[2023-10-14 09:08:20,436][100936] Updated weights for policy 0, policy_version 96340 (0.0009) +[2023-10-14 09:08:20,621][100917] Updated weights for policy 1, policy_version 96482 (0.0009) +[2023-10-14 09:08:20,803][100936] Updated weights for policy 0, policy_version 96350 (0.0008) +[2023-10-14 09:08:20,979][100917] Updated weights for policy 1, policy_version 96492 (0.0010) +[2023-10-14 09:08:21,349][100917] Updated weights for policy 1, policy_version 96502 (0.0011) +[2023-10-14 09:08:21,716][100917] Updated weights for policy 1, policy_version 96512 (0.0009) +[2023-10-14 09:08:23,512][99942] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13218.3). Total num frames: 197492736. Throughput: 0: 1637.1, 1: 1670.8. Samples: 49384484. Policy #0 lag: (min: 22.0, avg: 22.3, max: 34.0) +[2023-10-14 09:08:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:08:24,997][100936] Updated weights for policy 0, policy_version 96360 (0.0008) +[2023-10-14 09:08:25,379][100936] Updated weights for policy 0, policy_version 96370 (0.0007) +[2023-10-14 09:08:25,743][100936] Updated weights for policy 0, policy_version 96380 (0.0009) +[2023-10-14 09:08:25,864][100917] Updated weights for policy 1, policy_version 96522 (0.0007) +[2023-10-14 09:08:26,228][100917] Updated weights for policy 1, policy_version 96532 (0.0010) +[2023-10-14 09:08:26,607][100917] Updated weights for policy 1, policy_version 96542 (0.0011) +[2023-10-14 09:08:28,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 197558272. Throughput: 0: 1636.7, 1: 1657.9. Samples: 49394102. Policy #0 lag: (min: 22.0, avg: 22.3, max: 34.0) +[2023-10-14 09:08:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:08:29,723][100936] Updated weights for policy 0, policy_version 96390 (0.0008) +[2023-10-14 09:08:30,098][100936] Updated weights for policy 0, policy_version 96400 (0.0009) +[2023-10-14 09:08:30,466][100936] Updated weights for policy 0, policy_version 96410 (0.0009) +[2023-10-14 09:08:30,935][100917] Updated weights for policy 1, policy_version 96552 (0.0008) +[2023-10-14 09:08:31,295][100917] Updated weights for policy 1, policy_version 96562 (0.0009) +[2023-10-14 09:08:31,665][100917] Updated weights for policy 1, policy_version 96572 (0.0009) +[2023-10-14 09:08:33,512][99942] Fps is (10 sec: 13107.7, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 197623808. Throughput: 0: 1649.1, 1: 1650.7. Samples: 49413944. Policy #0 lag: (min: 22.0, avg: 22.3, max: 34.0) +[2023-10-14 09:08:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:08:34,335][100936] Updated weights for policy 0, policy_version 96420 (0.0009) +[2023-10-14 09:08:34,697][100936] Updated weights for policy 0, policy_version 96430 (0.0007) +[2023-10-14 09:08:35,064][100936] Updated weights for policy 0, policy_version 96440 (0.0008) +[2023-10-14 09:08:35,928][100917] Updated weights for policy 1, policy_version 96582 (0.0010) +[2023-10-14 09:08:36,294][100917] Updated weights for policy 1, policy_version 96592 (0.0010) +[2023-10-14 09:08:36,666][100917] Updated weights for policy 1, policy_version 96602 (0.0010) +[2023-10-14 09:08:38,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 197689344. Throughput: 0: 1659.2, 1: 1663.2. Samples: 49434588. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 09:08:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:08:39,206][100936] Updated weights for policy 0, policy_version 96450 (0.0010) +[2023-10-14 09:08:39,587][100936] Updated weights for policy 0, policy_version 96460 (0.0010) +[2023-10-14 09:08:39,943][100936] Updated weights for policy 0, policy_version 96470 (0.0009) +[2023-10-14 09:08:40,306][100936] Updated weights for policy 0, policy_version 96480 (0.0009) +[2023-10-14 09:08:40,740][100917] Updated weights for policy 1, policy_version 96612 (0.0009) +[2023-10-14 09:08:41,138][100917] Updated weights for policy 1, policy_version 96622 (0.0007) +[2023-10-14 09:08:41,499][100917] Updated weights for policy 1, policy_version 96632 (0.0007) +[2023-10-14 09:08:43,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 197754880. Throughput: 0: 1657.6, 1: 1651.8. Samples: 49444318. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 09:08:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:08:44,565][100936] Updated weights for policy 0, policy_version 96490 (0.0008) +[2023-10-14 09:08:44,928][100936] Updated weights for policy 0, policy_version 96500 (0.0008) +[2023-10-14 09:08:45,313][100936] Updated weights for policy 0, policy_version 96510 (0.0009) +[2023-10-14 09:08:45,728][100917] Updated weights for policy 1, policy_version 96642 (0.0007) +[2023-10-14 09:08:46,109][100917] Updated weights for policy 1, policy_version 96652 (0.0011) +[2023-10-14 09:08:46,479][100917] Updated weights for policy 1, policy_version 96662 (0.0011) +[2023-10-14 09:08:46,851][100917] Updated weights for policy 1, policy_version 96672 (0.0011) +[2023-10-14 09:08:48,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 197820416. Throughput: 0: 1659.3, 1: 1653.6. Samples: 49463738. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 09:08:48,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:08:49,546][100936] Updated weights for policy 0, policy_version 96520 (0.0008) +[2023-10-14 09:08:49,917][100936] Updated weights for policy 0, policy_version 96530 (0.0007) +[2023-10-14 09:08:50,281][100936] Updated weights for policy 0, policy_version 96540 (0.0007) +[2023-10-14 09:08:50,848][100917] Updated weights for policy 1, policy_version 96682 (0.0007) +[2023-10-14 09:08:51,224][100917] Updated weights for policy 1, policy_version 96692 (0.0007) +[2023-10-14 09:08:51,587][100917] Updated weights for policy 1, policy_version 96702 (0.0009) +[2023-10-14 09:08:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 197885952. Throughput: 0: 1660.1, 1: 1663.0. Samples: 49484246. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 09:08:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:08:54,286][100936] Updated weights for policy 0, policy_version 96550 (0.0008) +[2023-10-14 09:08:54,678][100936] Updated weights for policy 0, policy_version 96560 (0.0008) +[2023-10-14 09:08:55,051][100936] Updated weights for policy 0, policy_version 96570 (0.0010) +[2023-10-14 09:08:55,654][100917] Updated weights for policy 1, policy_version 96712 (0.0008) +[2023-10-14 09:08:56,025][100917] Updated weights for policy 1, policy_version 96722 (0.0007) +[2023-10-14 09:08:56,396][100917] Updated weights for policy 1, policy_version 96732 (0.0010) +[2023-10-14 09:08:58,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 197951488. Throughput: 0: 1657.7, 1: 1649.3. Samples: 49493710. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 09:08:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:08:59,202][100936] Updated weights for policy 0, policy_version 96580 (0.0009) +[2023-10-14 09:08:59,569][100936] Updated weights for policy 0, policy_version 96590 (0.0008) +[2023-10-14 09:08:59,947][100936] Updated weights for policy 0, policy_version 96600 (0.0008) +[2023-10-14 09:09:00,499][100917] Updated weights for policy 1, policy_version 96742 (0.0009) +[2023-10-14 09:09:00,876][100917] Updated weights for policy 1, policy_version 96752 (0.0009) +[2023-10-14 09:09:01,247][100917] Updated weights for policy 1, policy_version 96762 (0.0009) +[2023-10-14 09:09:03,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 198017024. Throughput: 0: 1662.8, 1: 1659.1. Samples: 49513584. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 09:09:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:09:03,919][100936] Updated weights for policy 0, policy_version 96610 (0.0008) +[2023-10-14 09:09:04,285][100936] Updated weights for policy 0, policy_version 96620 (0.0012) +[2023-10-14 09:09:04,662][100936] Updated weights for policy 0, policy_version 96630 (0.0008) +[2023-10-14 09:09:05,023][100936] Updated weights for policy 0, policy_version 96640 (0.0007) +[2023-10-14 09:09:05,482][100917] Updated weights for policy 1, policy_version 96772 (0.0010) +[2023-10-14 09:09:05,850][100917] Updated weights for policy 1, policy_version 96782 (0.0010) +[2023-10-14 09:09:06,226][100917] Updated weights for policy 1, policy_version 96792 (0.0010) +[2023-10-14 09:09:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 198082560. Throughput: 0: 1661.7, 1: 1654.1. Samples: 49533698. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 09:09:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:09:09,315][100936] Updated weights for policy 0, policy_version 96650 (0.0009) +[2023-10-14 09:09:09,690][100936] Updated weights for policy 0, policy_version 96660 (0.0007) +[2023-10-14 09:09:10,049][100936] Updated weights for policy 0, policy_version 96670 (0.0009) +[2023-10-14 09:09:10,350][100917] Updated weights for policy 1, policy_version 96802 (0.0009) +[2023-10-14 09:09:10,722][100917] Updated weights for policy 1, policy_version 96812 (0.0007) +[2023-10-14 09:09:11,098][100917] Updated weights for policy 1, policy_version 96822 (0.0008) +[2023-10-14 09:09:11,465][100917] Updated weights for policy 1, policy_version 96832 (0.0008) +[2023-10-14 09:09:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 198148096. Throughput: 0: 1664.2, 1: 1652.5. Samples: 49543354. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 09:09:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:09:14,299][100936] Updated weights for policy 0, policy_version 96680 (0.0008) +[2023-10-14 09:09:14,660][100936] Updated weights for policy 0, policy_version 96690 (0.0007) +[2023-10-14 09:09:15,040][100936] Updated weights for policy 0, policy_version 96700 (0.0007) +[2023-10-14 09:09:15,524][100917] Updated weights for policy 1, policy_version 96842 (0.0009) +[2023-10-14 09:09:15,901][100917] Updated weights for policy 1, policy_version 96852 (0.0009) +[2023-10-14 09:09:16,271][100917] Updated weights for policy 1, policy_version 96862 (0.0009) +[2023-10-14 09:09:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 198213632. Throughput: 0: 1651.6, 1: 1662.4. Samples: 49563072. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 09:09:18,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:09:19,215][100936] Updated weights for policy 0, policy_version 96710 (0.0007) +[2023-10-14 09:09:19,594][100936] Updated weights for policy 0, policy_version 96720 (0.0009) +[2023-10-14 09:09:19,967][100936] Updated weights for policy 0, policy_version 96730 (0.0008) +[2023-10-14 09:09:20,396][100917] Updated weights for policy 1, policy_version 96872 (0.0009) +[2023-10-14 09:09:20,769][100917] Updated weights for policy 1, policy_version 96882 (0.0010) +[2023-10-14 09:09:21,148][100917] Updated weights for policy 1, policy_version 96892 (0.0009) +[2023-10-14 09:09:23,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 198279168. Throughput: 0: 1644.8, 1: 1660.7. Samples: 49583336. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 09:09:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:09:24,158][100936] Updated weights for policy 0, policy_version 96740 (0.0010) +[2023-10-14 09:09:24,528][100936] Updated weights for policy 0, policy_version 96750 (0.0010) +[2023-10-14 09:09:24,895][100936] Updated weights for policy 0, policy_version 96760 (0.0010) +[2023-10-14 09:09:25,168][100917] Updated weights for policy 1, policy_version 96902 (0.0008) +[2023-10-14 09:09:25,547][100917] Updated weights for policy 1, policy_version 96912 (0.0008) +[2023-10-14 09:09:25,914][100917] Updated weights for policy 1, policy_version 96922 (0.0010) +[2023-10-14 09:09:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 198344704. Throughput: 0: 1645.1, 1: 1649.1. Samples: 49592558. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 09:09:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:09:29,071][100936] Updated weights for policy 0, policy_version 96770 (0.0009) +[2023-10-14 09:09:29,443][100936] Updated weights for policy 0, policy_version 96780 (0.0010) +[2023-10-14 09:09:29,814][100936] Updated weights for policy 0, policy_version 96790 (0.0010) +[2023-10-14 09:09:29,932][100917] Updated weights for policy 1, policy_version 96932 (0.0008) +[2023-10-14 09:09:30,179][100936] Updated weights for policy 0, policy_version 96800 (0.0008) +[2023-10-14 09:09:30,332][100917] Updated weights for policy 1, policy_version 96942 (0.0009) +[2023-10-14 09:09:30,694][100917] Updated weights for policy 1, policy_version 96952 (0.0008) +[2023-10-14 09:09:33,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 198410240. Throughput: 0: 1645.0, 1: 1665.8. Samples: 49612726. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) +[2023-10-14 09:09:33,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:09:34,378][100936] Updated weights for policy 0, policy_version 96810 (0.0008) +[2023-10-14 09:09:34,737][100917] Updated weights for policy 1, policy_version 96962 (0.0007) +[2023-10-14 09:09:34,748][100936] Updated weights for policy 0, policy_version 96820 (0.0008) +[2023-10-14 09:09:35,103][100917] Updated weights for policy 1, policy_version 96972 (0.0009) +[2023-10-14 09:09:35,113][100936] Updated weights for policy 0, policy_version 96830 (0.0008) +[2023-10-14 09:09:35,476][100917] Updated weights for policy 1, policy_version 96982 (0.0008) +[2023-10-14 09:09:35,838][100917] Updated weights for policy 1, policy_version 96992 (0.0008) +[2023-10-14 09:09:38,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 198475776. Throughput: 0: 1645.0, 1: 1662.8. Samples: 49633094. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) +[2023-10-14 09:09:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:09:38,520][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000096832_99155968.pth... +[2023-10-14 09:09:38,520][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000096992_99319808.pth... +[2023-10-14 09:09:38,551][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000095456_97746944.pth +[2023-10-14 09:09:38,558][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000095296_97583104.pth +[2023-10-14 09:09:39,380][100936] Updated weights for policy 0, policy_version 96840 (0.0008) +[2023-10-14 09:09:39,757][100936] Updated weights for policy 0, policy_version 96850 (0.0007) +[2023-10-14 09:09:40,041][100917] Updated weights for policy 1, policy_version 97002 (0.0008) +[2023-10-14 09:09:40,129][100936] Updated weights for policy 0, policy_version 96860 (0.0009) +[2023-10-14 09:09:40,410][100917] Updated weights for policy 1, policy_version 97012 (0.0008) +[2023-10-14 09:09:40,781][100917] Updated weights for policy 1, policy_version 97022 (0.0008) +[2023-10-14 09:09:43,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 198541312. Throughput: 0: 1646.2, 1: 1645.5. Samples: 49641838. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) +[2023-10-14 09:09:43,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:09:44,361][100936] Updated weights for policy 0, policy_version 96870 (0.0007) +[2023-10-14 09:09:44,726][100936] Updated weights for policy 0, policy_version 96880 (0.0009) +[2023-10-14 09:09:45,027][100917] Updated weights for policy 1, policy_version 97032 (0.0007) +[2023-10-14 09:09:45,094][100936] Updated weights for policy 0, policy_version 96890 (0.0008) +[2023-10-14 09:09:45,395][100917] Updated weights for policy 1, policy_version 97042 (0.0009) +[2023-10-14 09:09:45,766][100917] Updated weights for policy 1, policy_version 97052 (0.0007) +[2023-10-14 09:09:48,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 198606848. Throughput: 0: 1641.2, 1: 1659.0. Samples: 49662092. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) +[2023-10-14 09:09:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:09:49,247][100936] Updated weights for policy 0, policy_version 96900 (0.0009) +[2023-10-14 09:09:49,619][100936] Updated weights for policy 0, policy_version 96910 (0.0010) +[2023-10-14 09:09:49,952][100917] Updated weights for policy 1, policy_version 97062 (0.0008) +[2023-10-14 09:09:49,975][100936] Updated weights for policy 0, policy_version 96920 (0.0008) +[2023-10-14 09:09:50,319][100917] Updated weights for policy 1, policy_version 97072 (0.0007) +[2023-10-14 09:09:50,696][100917] Updated weights for policy 1, policy_version 97082 (0.0008) +[2023-10-14 09:09:53,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 198672384. Throughput: 0: 1643.3, 1: 1663.1. Samples: 49682486. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) +[2023-10-14 09:09:53,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:09:54,094][100936] Updated weights for policy 0, policy_version 96930 (0.0007) +[2023-10-14 09:09:54,460][100936] Updated weights for policy 0, policy_version 96940 (0.0008) +[2023-10-14 09:09:54,765][100917] Updated weights for policy 1, policy_version 97092 (0.0009) +[2023-10-14 09:09:54,827][100936] Updated weights for policy 0, policy_version 96950 (0.0008) +[2023-10-14 09:09:55,140][100917] Updated weights for policy 1, policy_version 97102 (0.0010) +[2023-10-14 09:09:55,196][100936] Updated weights for policy 0, policy_version 96960 (0.0008) +[2023-10-14 09:09:55,519][100917] Updated weights for policy 1, policy_version 97112 (0.0009) +[2023-10-14 09:09:58,512][99942] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 198737920. Throughput: 0: 1641.2, 1: 1648.4. Samples: 49691384. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) +[2023-10-14 09:09:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:09:59,224][100936] Updated weights for policy 0, policy_version 96970 (0.0007) +[2023-10-14 09:09:59,509][100917] Updated weights for policy 1, policy_version 97122 (0.0008) +[2023-10-14 09:09:59,595][100936] Updated weights for policy 0, policy_version 96980 (0.0007) +[2023-10-14 09:09:59,879][100917] Updated weights for policy 1, policy_version 97132 (0.0010) +[2023-10-14 09:09:59,961][100936] Updated weights for policy 0, policy_version 96990 (0.0007) +[2023-10-14 09:10:00,250][100917] Updated weights for policy 1, policy_version 97142 (0.0007) +[2023-10-14 09:10:00,620][100917] Updated weights for policy 1, policy_version 97152 (0.0008) +[2023-10-14 09:10:03,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 198803456. Throughput: 0: 1644.7, 1: 1668.5. Samples: 49712168. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) +[2023-10-14 09:10:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:10:04,077][100936] Updated weights for policy 0, policy_version 97000 (0.0009) +[2023-10-14 09:10:04,445][100936] Updated weights for policy 0, policy_version 97010 (0.0009) +[2023-10-14 09:10:04,729][100917] Updated weights for policy 1, policy_version 97162 (0.0008) +[2023-10-14 09:10:04,809][100936] Updated weights for policy 0, policy_version 97020 (0.0008) +[2023-10-14 09:10:05,111][100917] Updated weights for policy 1, policy_version 97172 (0.0009) +[2023-10-14 09:10:05,478][100917] Updated weights for policy 1, policy_version 97182 (0.0007) +[2023-10-14 09:10:08,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 198868992. Throughput: 0: 1650.5, 1: 1660.7. Samples: 49732340. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) +[2023-10-14 09:10:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:10:08,876][100936] Updated weights for policy 0, policy_version 97030 (0.0008) +[2023-10-14 09:10:09,249][100936] Updated weights for policy 0, policy_version 97040 (0.0008) +[2023-10-14 09:10:09,622][100936] Updated weights for policy 0, policy_version 97050 (0.0010) +[2023-10-14 09:10:09,873][100917] Updated weights for policy 1, policy_version 97192 (0.0009) +[2023-10-14 09:10:10,236][100917] Updated weights for policy 1, policy_version 97202 (0.0010) +[2023-10-14 09:10:10,617][100917] Updated weights for policy 1, policy_version 97212 (0.0011) +[2023-10-14 09:10:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 198934528. Throughput: 0: 1653.3, 1: 1650.7. Samples: 49741238. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) +[2023-10-14 09:10:13,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:10:13,680][100936] Updated weights for policy 0, policy_version 97060 (0.0007) +[2023-10-14 09:10:14,053][100936] Updated weights for policy 0, policy_version 97070 (0.0007) +[2023-10-14 09:10:14,415][100936] Updated weights for policy 0, policy_version 97080 (0.0009) +[2023-10-14 09:10:14,852][100917] Updated weights for policy 1, policy_version 97222 (0.0010) +[2023-10-14 09:10:15,219][100917] Updated weights for policy 1, policy_version 97232 (0.0008) +[2023-10-14 09:10:15,594][100917] Updated weights for policy 1, policy_version 97242 (0.0010) +[2023-10-14 09:10:18,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 199000064. Throughput: 0: 1654.1, 1: 1655.7. Samples: 49761668. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) +[2023-10-14 09:10:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:10:18,632][100936] Updated weights for policy 0, policy_version 97090 (0.0008) +[2023-10-14 09:10:18,997][100936] Updated weights for policy 0, policy_version 97100 (0.0009) +[2023-10-14 09:10:19,368][100936] Updated weights for policy 0, policy_version 97110 (0.0009) +[2023-10-14 09:10:19,594][100917] Updated weights for policy 1, policy_version 97252 (0.0010) +[2023-10-14 09:10:19,729][100936] Updated weights for policy 0, policy_version 97120 (0.0007) +[2023-10-14 09:10:19,968][100917] Updated weights for policy 1, policy_version 97262 (0.0009) +[2023-10-14 09:10:20,339][100917] Updated weights for policy 1, policy_version 97272 (0.0010) +[2023-10-14 09:10:23,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 199065600. Throughput: 0: 1650.2, 1: 1658.6. Samples: 49781990. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) +[2023-10-14 09:10:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:10:23,996][100936] Updated weights for policy 0, policy_version 97130 (0.0009) +[2023-10-14 09:10:24,364][100917] Updated weights for policy 1, policy_version 97282 (0.0010) +[2023-10-14 09:10:24,368][100936] Updated weights for policy 0, policy_version 97140 (0.0009) +[2023-10-14 09:10:24,742][100936] Updated weights for policy 0, policy_version 97150 (0.0007) +[2023-10-14 09:10:24,742][100917] Updated weights for policy 1, policy_version 97292 (0.0009) +[2023-10-14 09:10:25,116][100917] Updated weights for policy 1, policy_version 97302 (0.0008) +[2023-10-14 09:10:25,483][100917] Updated weights for policy 1, policy_version 97312 (0.0007) +[2023-10-14 09:10:28,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 199131136. Throughput: 0: 1651.6, 1: 1661.8. Samples: 49790942. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) +[2023-10-14 09:10:28,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:10:28,841][100936] Updated weights for policy 0, policy_version 97160 (0.0009) +[2023-10-14 09:10:29,216][100936] Updated weights for policy 0, policy_version 97170 (0.0008) +[2023-10-14 09:10:29,578][100936] Updated weights for policy 0, policy_version 97180 (0.0008) +[2023-10-14 09:10:29,626][100917] Updated weights for policy 1, policy_version 97322 (0.0009) +[2023-10-14 09:10:29,997][100917] Updated weights for policy 1, policy_version 97332 (0.0009) +[2023-10-14 09:10:30,370][100917] Updated weights for policy 1, policy_version 97342 (0.0010) +[2023-10-14 09:10:33,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 199196672. Throughput: 0: 1658.0, 1: 1662.3. Samples: 49811502. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) +[2023-10-14 09:10:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:10:33,552][100936] Updated weights for policy 0, policy_version 97190 (0.0008) +[2023-10-14 09:10:33,918][100936] Updated weights for policy 0, policy_version 97200 (0.0009) +[2023-10-14 09:10:34,288][100936] Updated weights for policy 0, policy_version 97210 (0.0009) +[2023-10-14 09:10:34,423][100917] Updated weights for policy 1, policy_version 97352 (0.0008) +[2023-10-14 09:10:34,795][100917] Updated weights for policy 1, policy_version 97362 (0.0010) +[2023-10-14 09:10:35,171][100917] Updated weights for policy 1, policy_version 97372 (0.0008) +[2023-10-14 09:10:38,424][100936] Updated weights for policy 0, policy_version 97220 (0.0007) +[2023-10-14 09:10:38,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 199262208. Throughput: 0: 1655.9, 1: 1662.1. Samples: 49831796. Policy #0 lag: (min: 28.0, avg: 35.0, max: 60.0) +[2023-10-14 09:10:38,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:10:38,783][100936] Updated weights for policy 0, policy_version 97230 (0.0008) +[2023-10-14 09:10:39,159][100936] Updated weights for policy 0, policy_version 97240 (0.0009) +[2023-10-14 09:10:39,295][100917] Updated weights for policy 1, policy_version 97382 (0.0008) +[2023-10-14 09:10:39,667][100917] Updated weights for policy 1, policy_version 97392 (0.0009) +[2023-10-14 09:10:40,042][100917] Updated weights for policy 1, policy_version 97402 (0.0008) +[2023-10-14 09:10:43,414][100936] Updated weights for policy 0, policy_version 97250 (0.0010) +[2023-10-14 09:10:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 199327744. Throughput: 0: 1657.5, 1: 1664.9. Samples: 49840892. Policy #0 lag: (min: 28.0, avg: 35.0, max: 60.0) +[2023-10-14 09:10:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:10:43,783][100936] Updated weights for policy 0, policy_version 97260 (0.0011) +[2023-10-14 09:10:44,026][100917] Updated weights for policy 1, policy_version 97412 (0.0008) +[2023-10-14 09:10:44,165][100936] Updated weights for policy 0, policy_version 97270 (0.0009) +[2023-10-14 09:10:44,401][100917] Updated weights for policy 1, policy_version 97422 (0.0009) +[2023-10-14 09:10:44,531][100936] Updated weights for policy 0, policy_version 97280 (0.0008) +[2023-10-14 09:10:44,774][100917] Updated weights for policy 1, policy_version 97432 (0.0008) +[2023-10-14 09:10:48,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 199393280. Throughput: 0: 1654.0, 1: 1659.9. Samples: 49861292. Policy #0 lag: (min: 28.0, avg: 35.0, max: 60.0) +[2023-10-14 09:10:48,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:10:48,785][100936] Updated weights for policy 0, policy_version 97290 (0.0011) +[2023-10-14 09:10:48,880][100917] Updated weights for policy 1, policy_version 97442 (0.0009) +[2023-10-14 09:10:49,159][100936] Updated weights for policy 0, policy_version 97300 (0.0008) +[2023-10-14 09:10:49,245][100917] Updated weights for policy 1, policy_version 97452 (0.0007) +[2023-10-14 09:10:49,521][100936] Updated weights for policy 0, policy_version 97310 (0.0010) +[2023-10-14 09:10:49,614][100917] Updated weights for policy 1, policy_version 97462 (0.0007) +[2023-10-14 09:10:49,984][100917] Updated weights for policy 1, policy_version 97472 (0.0009) +[2023-10-14 09:10:53,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.3, 300 sec: 13218.3). Total num frames: 199458816. Throughput: 0: 1650.3, 1: 1665.1. Samples: 49881534. Policy #0 lag: (min: 28.0, avg: 35.0, max: 60.0) +[2023-10-14 09:10:53,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:10:53,748][100936] Updated weights for policy 0, policy_version 97320 (0.0008) +[2023-10-14 09:10:54,118][100936] Updated weights for policy 0, policy_version 97330 (0.0008) +[2023-10-14 09:10:54,135][100917] Updated weights for policy 1, policy_version 97482 (0.0009) +[2023-10-14 09:10:54,490][100936] Updated weights for policy 0, policy_version 97340 (0.0007) +[2023-10-14 09:10:54,503][100917] Updated weights for policy 1, policy_version 97492 (0.0009) +[2023-10-14 09:10:54,868][100917] Updated weights for policy 1, policy_version 97502 (0.0009) +[2023-10-14 09:10:58,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 199524352. Throughput: 0: 1652.4, 1: 1667.0. Samples: 49890608. Policy #0 lag: (min: 28.0, avg: 35.0, max: 60.0) +[2023-10-14 09:10:58,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:10:58,519][100936] Updated weights for policy 0, policy_version 97350 (0.0009) +[2023-10-14 09:10:58,891][100936] Updated weights for policy 0, policy_version 97360 (0.0008) +[2023-10-14 09:10:59,091][100917] Updated weights for policy 1, policy_version 97512 (0.0009) +[2023-10-14 09:10:59,261][100936] Updated weights for policy 0, policy_version 97370 (0.0009) +[2023-10-14 09:10:59,457][100917] Updated weights for policy 1, policy_version 97522 (0.0007) +[2023-10-14 09:10:59,828][100917] Updated weights for policy 1, policy_version 97532 (0.0007) +[2023-10-14 09:11:03,467][100936] Updated weights for policy 0, policy_version 97380 (0.0008) +[2023-10-14 09:11:03,512][99942] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 199589888. Throughput: 0: 1651.7, 1: 1666.1. Samples: 49910968. Policy #0 lag: (min: 28.0, avg: 35.0, max: 60.0) +[2023-10-14 09:11:03,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:11:03,839][100936] Updated weights for policy 0, policy_version 97390 (0.0009) +[2023-10-14 09:11:03,885][100917] Updated weights for policy 1, policy_version 97542 (0.0009) +[2023-10-14 09:11:04,205][100936] Updated weights for policy 0, policy_version 97400 (0.0007) +[2023-10-14 09:11:04,272][100917] Updated weights for policy 1, policy_version 97552 (0.0008) +[2023-10-14 09:11:04,649][100917] Updated weights for policy 1, policy_version 97562 (0.0007) +[2023-10-14 09:11:08,254][100936] Updated weights for policy 0, policy_version 97410 (0.0008) +[2023-10-14 09:11:08,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 199655424. Throughput: 0: 1650.1, 1: 1665.9. Samples: 49931212. Policy #0 lag: (min: 28.0, avg: 35.0, max: 60.0) +[2023-10-14 09:11:08,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:11:08,618][100936] Updated weights for policy 0, policy_version 97420 (0.0010) +[2023-10-14 09:11:08,633][100917] Updated weights for policy 1, policy_version 97572 (0.0009) +[2023-10-14 09:11:08,983][100936] Updated weights for policy 0, policy_version 97430 (0.0008) +[2023-10-14 09:11:08,996][100917] Updated weights for policy 1, policy_version 97582 (0.0009) +[2023-10-14 09:11:09,353][100936] Updated weights for policy 0, policy_version 97440 (0.0008) +[2023-10-14 09:11:09,367][100917] Updated weights for policy 1, policy_version 97592 (0.0009) +[2023-10-14 09:11:13,507][100917] Updated weights for policy 1, policy_version 97602 (0.0008) +[2023-10-14 09:11:13,509][100936] Updated weights for policy 0, policy_version 97450 (0.0007) +[2023-10-14 09:11:13,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 199720960. Throughput: 0: 1654.7, 1: 1661.3. Samples: 49940160. Policy #0 lag: (min: 28.0, avg: 35.0, max: 60.0) +[2023-10-14 09:11:13,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:11:13,874][100917] Updated weights for policy 1, policy_version 97612 (0.0009) +[2023-10-14 09:11:13,877][100936] Updated weights for policy 0, policy_version 97460 (0.0008) +[2023-10-14 09:11:14,243][100936] Updated weights for policy 0, policy_version 97470 (0.0009) +[2023-10-14 09:11:14,245][100917] Updated weights for policy 1, policy_version 97622 (0.0008) +[2023-10-14 09:11:14,621][100917] Updated weights for policy 1, policy_version 97632 (0.0008) +[2023-10-14 09:11:18,308][100936] Updated weights for policy 0, policy_version 97480 (0.0008) +[2023-10-14 09:11:18,512][99942] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 199786496. Throughput: 0: 1653.3, 1: 1658.8. Samples: 49960548. Policy #0 lag: (min: 28.0, avg: 35.0, max: 60.0) +[2023-10-14 09:11:18,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:11:18,683][100936] Updated weights for policy 0, policy_version 97490 (0.0008) +[2023-10-14 09:11:18,835][100917] Updated weights for policy 1, policy_version 97642 (0.0008) +[2023-10-14 09:11:19,048][100936] Updated weights for policy 0, policy_version 97500 (0.0007) +[2023-10-14 09:11:19,211][100917] Updated weights for policy 1, policy_version 97652 (0.0008) +[2023-10-14 09:11:19,574][100917] Updated weights for policy 1, policy_version 97662 (0.0008) +[2023-10-14 09:11:23,300][100936] Updated weights for policy 0, policy_version 97510 (0.0007) +[2023-10-14 09:11:23,512][99942] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 199852032. Throughput: 0: 1646.8, 1: 1655.4. Samples: 49980396. Policy #0 lag: (min: 28.0, avg: 35.0, max: 60.0) +[2023-10-14 09:11:23,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:11:23,664][100936] Updated weights for policy 0, policy_version 97520 (0.0009) +[2023-10-14 09:11:23,949][100917] Updated weights for policy 1, policy_version 97672 (0.0011) +[2023-10-14 09:11:24,031][100936] Updated weights for policy 0, policy_version 97530 (0.0010) +[2023-10-14 09:11:24,325][100917] Updated weights for policy 1, policy_version 97682 (0.0010) +[2023-10-14 09:11:24,695][100917] Updated weights for policy 1, policy_version 97692 (0.0009) +[2023-10-14 09:11:28,247][100936] Updated weights for policy 0, policy_version 97540 (0.0008) +[2023-10-14 09:11:28,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13218.3). Total num frames: 199917568. Throughput: 0: 1650.6, 1: 1653.0. Samples: 49989552. Policy #0 lag: (min: 28.0, avg: 35.0, max: 60.0) +[2023-10-14 09:11:28,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:11:28,619][100936] Updated weights for policy 0, policy_version 97550 (0.0007) +[2023-10-14 09:11:28,912][100917] Updated weights for policy 1, policy_version 97702 (0.0008) +[2023-10-14 09:11:28,979][100936] Updated weights for policy 0, policy_version 97560 (0.0007) +[2023-10-14 09:11:29,288][100917] Updated weights for policy 1, policy_version 97712 (0.0009) +[2023-10-14 09:11:29,655][100917] Updated weights for policy 1, policy_version 97722 (0.0010) +[2023-10-14 09:11:33,162][100936] Updated weights for policy 0, policy_version 97570 (0.0009) +[2023-10-14 09:11:33,512][99942] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 199983104. Throughput: 0: 1652.9, 1: 1648.0. Samples: 50009832. Policy #0 lag: (min: 28.0, avg: 35.0, max: 60.0) +[2023-10-14 09:11:33,513][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:11:33,518][100936] Updated weights for policy 0, policy_version 97580 (0.0008) +[2023-10-14 09:11:33,809][100917] Updated weights for policy 1, policy_version 97732 (0.0007) +[2023-10-14 09:11:33,890][100936] Updated weights for policy 0, policy_version 97590 (0.0008) +[2023-10-14 09:11:34,166][100917] Updated weights for policy 1, policy_version 97742 (0.0008) +[2023-10-14 09:11:34,253][100936] Updated weights for policy 0, policy_version 97600 (0.0009) +[2023-10-14 09:11:34,542][100917] Updated weights for policy 1, policy_version 97752 (0.0008) +[2023-10-14 09:11:38,453][100936] Updated weights for policy 0, policy_version 97610 (0.0009) +[2023-10-14 09:11:38,512][99942] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 200048640. Throughput: 0: 1648.1, 1: 1650.1. Samples: 50029952. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:11:38,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:11:38,569][100917] Updated weights for policy 1, policy_version 97762 (0.0009) +[2023-10-14 09:11:38,828][100936] Updated weights for policy 0, policy_version 97620 (0.0008) +[2023-10-14 09:11:38,938][100917] Updated weights for policy 1, policy_version 97772 (0.0010) +[2023-10-14 09:11:39,189][100936] Updated weights for policy 0, policy_version 97630 (0.0008) +[2023-10-14 09:11:39,262][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000097632_99975168.pth... +[2023-10-14 09:11:39,292][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000096064_98369536.pth +[2023-10-14 09:11:39,300][100917] Updated weights for policy 1, policy_version 97782 (0.0009) +[2023-10-14 09:11:39,671][100917] Updated weights for policy 1, policy_version 97792 (0.0007) +[2023-10-14 09:11:39,671][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000097792_100139008.pth... +[2023-10-14 09:11:39,711][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000096224_98533376.pth +[2023-10-14 09:11:43,321][100936] Updated weights for policy 0, policy_version 97640 (0.0007) +[2023-10-14 09:11:43,512][99942] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 200114176. Throughput: 0: 1651.2, 1: 1649.8. Samples: 50039154. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) +[2023-10-14 09:11:43,512][99942] Avg episode reward: [(0, '1.000'), (1, '1.000')] +[2023-10-14 09:11:43,699][100936] Updated weights for policy 0, policy_version 97650 (0.0007) +[2023-10-14 09:11:43,958][100917] Updated weights for policy 1, policy_version 97802 (0.0009) +[2023-10-14 09:11:44,073][100936] Updated weights for policy 0, policy_version 97660 (0.0009) +[2023-10-14 09:11:44,320][100917] Updated weights for policy 1, policy_version 97812 (0.0008) +[2023-10-14 09:11:44,694][100917] Updated weights for policy 1, policy_version 97822 (0.0009) +[2023-10-14 09:11:44,756][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000097824_100171776.pth... +[2023-10-14 09:11:44,756][100961] Stopping RolloutWorker_w10... +[2023-10-14 09:11:44,756][100959] Stopping RolloutWorker_w8... +[2023-10-14 09:11:44,756][100957] Stopping RolloutWorker_w3... +[2023-10-14 09:11:44,757][100963] Stopping RolloutWorker_w11... +[2023-10-14 09:11:44,757][100560] Stopping Batcher_0... +[2023-10-14 09:11:44,757][100956] Stopping RolloutWorker_w6... +[2023-10-14 09:11:44,757][100961] Loop rollout_proc10_evt_loop terminating... +[2023-10-14 09:11:44,757][100959] Loop rollout_proc8_evt_loop terminating... +[2023-10-14 09:11:44,757][99942] Component RolloutWorker_w8 stopped! +[2023-10-14 09:11:44,757][100957] Loop rollout_proc3_evt_loop terminating... +[2023-10-14 09:11:44,757][100963] Loop rollout_proc11_evt_loop terminating... +[2023-10-14 09:11:44,757][100956] Loop rollout_proc6_evt_loop terminating... +[2023-10-14 09:11:44,757][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000097664_100007936.pth... +[2023-10-14 09:11:44,757][99942] Component RolloutWorker_w10 stopped! +[2023-10-14 09:11:44,758][100962] Stopping RolloutWorker_w12... +[2023-10-14 09:11:44,758][99942] Component RolloutWorker_w3 stopped! +[2023-10-14 09:11:44,758][101548] Stopping RolloutWorker_w14... +[2023-10-14 09:11:44,758][100962] Loop rollout_proc12_evt_loop terminating... +[2023-10-14 09:11:44,758][101548] Loop rollout_proc14_evt_loop terminating... +[2023-10-14 09:11:44,758][99942] Component RolloutWorker_w11 stopped! +[2023-10-14 09:11:44,759][99942] Component Batcher_0 stopped! +[2023-10-14 09:11:44,759][99942] Component RolloutWorker_w6 stopped! +[2023-10-14 09:11:44,760][99942] Component RolloutWorker_w12 stopped! +[2023-10-14 09:11:44,760][99942] Component RolloutWorker_w14 stopped! +[2023-10-14 09:11:44,761][100954] Stopping RolloutWorker_w4... +[2023-10-14 09:11:44,761][100960] Stopping RolloutWorker_w9... +[2023-10-14 09:11:44,761][100954] Loop rollout_proc4_evt_loop terminating... +[2023-10-14 09:11:44,761][99942] Component RolloutWorker_w4 stopped! +[2023-10-14 09:11:44,761][100955] Stopping RolloutWorker_w5... +[2023-10-14 09:11:44,762][100960] Loop rollout_proc9_evt_loop terminating... +[2023-10-14 09:11:44,762][100955] Loop rollout_proc5_evt_loop terminating... +[2023-10-14 09:11:44,762][99942] Component RolloutWorker_w9 stopped! +[2023-10-14 09:11:44,757][100560] Loop batcher_evt_loop terminating... +[2023-10-14 09:11:44,762][99942] Component RolloutWorker_w5 stopped! +[2023-10-14 09:11:44,763][100964] Stopping RolloutWorker_w13... +[2023-10-14 09:11:44,763][100964] Loop rollout_proc13_evt_loop terminating... +[2023-10-14 09:11:44,763][99942] Component RolloutWorker_w13 stopped! +[2023-10-14 09:11:44,764][101580] Stopping RolloutWorker_w15... +[2023-10-14 09:11:44,764][100958] Stopping RolloutWorker_w7... +[2023-10-14 09:11:44,764][101580] Loop rollout_proc15_evt_loop terminating... +[2023-10-14 09:11:44,764][100953] Stopping RolloutWorker_w2... +[2023-10-14 09:11:44,764][99942] Component RolloutWorker_w15 stopped! +[2023-10-14 09:11:44,764][100958] Loop rollout_proc7_evt_loop terminating... +[2023-10-14 09:11:44,764][100953] Loop rollout_proc2_evt_loop terminating... +[2023-10-14 09:11:44,764][100950] Stopping RolloutWorker_w0... +[2023-10-14 09:11:44,765][100950] Loop rollout_proc0_evt_loop terminating... +[2023-10-14 09:11:44,765][99942] Component RolloutWorker_w7 stopped! +[2023-10-14 09:11:44,765][99942] Component RolloutWorker_w2 stopped! +[2023-10-14 09:11:44,766][99942] Component RolloutWorker_w0 stopped! +[2023-10-14 09:11:44,766][100951] Stopping RolloutWorker_w1... +[2023-10-14 09:11:44,767][100951] Loop rollout_proc1_evt_loop terminating... +[2023-10-14 09:11:44,767][99942] Component RolloutWorker_w1 stopped! +[2023-10-14 09:11:44,769][99942] Component Batcher_1 stopped! +[2023-10-14 09:11:44,784][100936] Weights refcount: 2 0 +[2023-10-14 09:11:44,785][100936] Stopping InferenceWorker_p0-w0... +[2023-10-14 09:11:44,786][100936] Loop inference_proc0-0_evt_loop terminating... +[2023-10-14 09:11:44,786][99942] Component InferenceWorker_p0-w0 stopped! +[2023-10-14 09:11:44,788][100917] Weights refcount: 2 0 +[2023-10-14 09:11:44,779][100681] Stopping Batcher_1... +[2023-10-14 09:11:44,789][100917] Stopping InferenceWorker_p1-w0... +[2023-10-14 09:11:44,790][100917] Loop inference_proc1-0_evt_loop terminating... +[2023-10-14 09:11:44,790][99942] Component InferenceWorker_p1-w0 stopped! +[2023-10-14 09:11:44,790][100681] Loop batcher_evt_loop terminating... +[2023-10-14 09:11:44,791][100681] Removing ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000096992_99319808.pth +[2023-10-14 09:11:44,795][100681] Saving ./train_atari/atari_privateye_APPO/checkpoint_p1/checkpoint_000097824_100171776.pth... +[2023-10-14 09:11:44,805][100560] Removing ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000096832_99155968.pth +[2023-10-14 09:11:44,811][100560] Saving ./train_atari/atari_privateye_APPO/checkpoint_p0/checkpoint_000097664_100007936.pth... +[2023-10-14 09:11:44,839][100681] Stopping LearnerWorker_p1... +[2023-10-14 09:11:44,840][100681] Loop learner_proc1_evt_loop terminating... +[2023-10-14 09:11:44,840][99942] Component LearnerWorker_p1 stopped! +[2023-10-14 09:11:44,868][100560] Stopping LearnerWorker_p0... +[2023-10-14 09:11:44,869][100560] Loop learner_proc0_evt_loop terminating... +[2023-10-14 09:11:44,869][99942] Component LearnerWorker_p0 stopped! +[2023-10-14 09:11:44,870][99942] Waiting for process learner_proc0 to stop... +[2023-10-14 09:11:45,764][99942] Waiting for process learner_proc1 to stop... +[2023-10-14 09:11:45,765][99942] Waiting for process inference_proc0-0 to join... +[2023-10-14 09:11:45,766][99942] Waiting for process inference_proc1-0 to join... +[2023-10-14 09:11:45,766][99942] Waiting for process rollout_proc0 to join... +[2023-10-14 09:11:45,767][99942] Waiting for process rollout_proc1 to join... +[2023-10-14 09:11:45,768][99942] Waiting for process rollout_proc2 to join... +[2023-10-14 09:11:45,768][99942] Waiting for process rollout_proc3 to join... +[2023-10-14 09:11:45,769][99942] Waiting for process rollout_proc4 to join... +[2023-10-14 09:11:45,770][99942] Waiting for process rollout_proc5 to join... +[2023-10-14 09:11:45,770][99942] Waiting for process rollout_proc6 to join... +[2023-10-14 09:11:45,771][99942] Waiting for process rollout_proc7 to join... +[2023-10-14 09:11:45,771][99942] Waiting for process rollout_proc8 to join... +[2023-10-14 09:11:45,772][99942] Waiting for process rollout_proc9 to join... +[2023-10-14 09:11:45,772][99942] Waiting for process rollout_proc10 to join... +[2023-10-14 09:11:45,773][99942] Waiting for process rollout_proc11 to join... +[2023-10-14 09:11:45,773][99942] Waiting for process rollout_proc12 to join... +[2023-10-14 09:11:45,774][99942] Waiting for process rollout_proc13 to join... +[2023-10-14 09:11:45,774][99942] Waiting for process rollout_proc14 to join... +[2023-10-14 09:11:45,775][99942] Waiting for process rollout_proc15 to join... +[2023-10-14 09:11:45,775][99942] Batcher 0 profile tree view: +batching: 170.1284, releasing_batches: 0.0898 +[2023-10-14 09:11:45,776][99942] Batcher 1 profile tree view: +batching: 170.4715, releasing_batches: 0.0892 +[2023-10-14 09:11:45,776][99942] InferenceWorker_p0-w0 profile tree view: +wait_policy: 0.0001 + wait_policy_total: 2770.8998 +update_model: 207.8615 + weight_update: 0.0010 +one_step: 0.0024 + handle_policy_step: 11418.8776 + deserialize: 65.4233, stack: 194.5848, obs_to_device_normalize: 2552.3583, forward: 5165.8236, prepare_outputs: 2472.9716, send_messages: 466.7990 +[2023-10-14 09:11:45,776][99942] InferenceWorker_p1-w0 profile tree view: +wait_policy: 0.0003 + wait_policy_total: 2734.0024 +update_model: 207.7954 + weight_update: 0.0009 +one_step: 0.0027 + handle_policy_step: 11480.8959 + deserialize: 65.3978, stack: 195.2313, obs_to_device_normalize: 2582.4540, forward: 5212.0432, prepare_outputs: 2450.1799, send_messages: 475.5991 +[2023-10-14 09:11:45,777][99942] Learner 0 profile tree view: +misc: 0.0193, prepare_batch: 269.7847 +train: 3631.8324 + epoch_init: 0.1881, minibatch_init: 13.1783, losses_postprocess: 893.5110, kl_divergence: 32.5652, update: 385.9921, after_optimizer: 2117.3604 + calculate_losses: 171.8701 + losses_init: 0.3853, forward_head: 59.9134, bptt_initial: 1.4496, bptt: 2.0403, tail: 38.2428, advantages_returns: 11.3897, losses: 44.6116 +[2023-10-14 09:11:45,777][99942] Learner 1 profile tree view: +misc: 0.0182, prepare_batch: 271.3927 +train: 3663.5990 + epoch_init: 0.1885, minibatch_init: 13.0784, losses_postprocess: 899.9505, kl_divergence: 30.9372, update: 406.2971, after_optimizer: 2129.4018 + calculate_losses: 167.1410 + losses_init: 0.3853, forward_head: 56.2346, bptt_initial: 1.4429, bptt: 1.8545, tail: 38.1691, advantages_returns: 11.2823, losses: 44.2832 +[2023-10-14 09:11:45,778][99942] RolloutWorker_w0 profile tree view: +wait_for_trajectories: 1.2398, enqueue_policy_requests: 409.2845, process_policy_outputs: 190.8238, env_step: 7998.3568, finalize_trajectories: 3.5046, complete_rollouts: 2.9482 +post_env_step: 375.9650 + process_env_step: 83.8153 +[2023-10-14 09:11:45,778][99942] RolloutWorker_w15 profile tree view: +wait_for_trajectories: 1.2470, enqueue_policy_requests: 412.9833, process_policy_outputs: 190.5598, env_step: 7845.0010, finalize_trajectories: 3.5419, complete_rollouts: 2.9657 +post_env_step: 377.5984 + process_env_step: 83.6013 +[2023-10-14 09:11:45,779][99942] Loop Runner_EvtLoop terminating... +[2023-10-14 09:11:45,780][99942] Runner profile tree view: +main_loop: 15122.6587 +[2023-10-14 09:11:45,780][99942] Collected {0: 100007936, 1: 100171776}, FPS: 13237.1