diff --git "a/experiment/pearl_trainer_3/debug.log" "b/experiment/pearl_trainer_3/debug.log" new file mode 100644--- /dev/null +++ "b/experiment/pearl_trainer_3/debug.log" @@ -0,0 +1,13502 @@ +2025-04-02 13:44:41 | [pearl_trainer] Logging to /home/h2khalil/MetaRL-Assistive-Robotics/data/local/experiment/pearl_trainer_3 +2025-04-02 13:44:47 | [pearl_trainer] Obtaining samples... +2025-04-02 13:49:47 | [pearl_trainer] epoch #0 | Training... +2025-04-02 13:51:01 | [pearl_trainer] epoch #0 | Evaluating... +2025-04-02 13:51:01 | [pearl_trainer] epoch #0 | Sampling for adapation and meta-testing... +2025-04-02 13:52:42 | [pearl_trainer] epoch #0 | Finished meta-testing... +2025-04-02 13:52:42 | [pearl_trainer] epoch #0 | Saving snapshot... +2025-04-02 13:52:43 | [pearl_trainer] epoch #0 | Saved +2025-04-02 13:52:43 | [pearl_trainer] epoch #0 | Time 475.93 s +2025-04-02 13:52:43 | [pearl_trainer] epoch #0 | EpochTime 475.93 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -42.6774 +MetaTest/Average/AverageReturn -42.6774 +MetaTest/Average/Iteration 0 +MetaTest/Average/MaxReturn -36.6019 +MetaTest/Average/MinReturn -50.7819 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 5.05293 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -42.6774 +MetaTest/__unnamed_task__/AverageReturn -42.6774 +MetaTest/__unnamed_task__/Iteration 0 +MetaTest/__unnamed_task__/MaxReturn -36.6019 +MetaTest/__unnamed_task__/MinReturn -50.7819 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 5.05293 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 16600 +------------------------------------------------- ----------- +2025-04-02 13:53:10 | [pearl_trainer] epoch #1 | Training... +2025-04-02 13:54:44 | [pearl_trainer] epoch #1 | Evaluating... +2025-04-02 13:54:44 | [pearl_trainer] epoch #1 | Sampling for adapation and meta-testing... +2025-04-02 13:56:22 | [pearl_trainer] epoch #1 | Finished meta-testing... +2025-04-02 13:56:22 | [pearl_trainer] epoch #1 | Saving snapshot... +2025-04-02 13:56:23 | [pearl_trainer] epoch #1 | Saved +2025-04-02 13:56:23 | [pearl_trainer] epoch #1 | Time 696.17 s +2025-04-02 13:56:23 | [pearl_trainer] epoch #1 | EpochTime 220.24 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -21.4485 +MetaTest/Average/AverageReturn -21.4485 +MetaTest/Average/Iteration 1 +MetaTest/Average/MaxReturn -17.9062 +MetaTest/Average/MinReturn -27.9034 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.61317 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -21.4485 +MetaTest/__unnamed_task__/AverageReturn -21.4485 +MetaTest/__unnamed_task__/Iteration 1 +MetaTest/__unnamed_task__/MaxReturn -17.9062 +MetaTest/__unnamed_task__/MinReturn -27.9034 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.61317 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 18200 +------------------------------------------------- ----------- +2025-04-02 13:56:53 | [pearl_trainer] epoch #2 | Training... +2025-04-02 13:58:24 | [pearl_trainer] epoch #2 | Evaluating... +2025-04-02 13:58:24 | [pearl_trainer] epoch #2 | Sampling for adapation and meta-testing... +2025-04-02 14:00:11 | [pearl_trainer] epoch #2 | Finished meta-testing... +2025-04-02 14:00:11 | [pearl_trainer] epoch #2 | Saving snapshot... +2025-04-02 14:00:12 | [pearl_trainer] epoch #2 | Saved +2025-04-02 14:00:12 | [pearl_trainer] epoch #2 | Time 924.58 s +2025-04-02 14:00:12 | [pearl_trainer] epoch #2 | EpochTime 228.41 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -19.9799 +MetaTest/Average/AverageReturn -19.9799 +MetaTest/Average/Iteration 2 +MetaTest/Average/MaxReturn -7.15418 +MetaTest/Average/MinReturn -27.0813 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 7.28971 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -19.9799 +MetaTest/__unnamed_task__/AverageReturn -19.9799 +MetaTest/__unnamed_task__/Iteration 2 +MetaTest/__unnamed_task__/MaxReturn -7.15418 +MetaTest/__unnamed_task__/MinReturn -27.0813 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 7.28971 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 19800 +------------------------------------------------- ----------- +2025-04-02 14:00:39 | [pearl_trainer] epoch #3 | Training... +2025-04-02 14:02:06 | [pearl_trainer] epoch #3 | Evaluating... +2025-04-02 14:02:06 | [pearl_trainer] epoch #3 | Sampling for adapation and meta-testing... +2025-04-02 14:03:53 | [pearl_trainer] epoch #3 | Finished meta-testing... +2025-04-02 14:03:53 | [pearl_trainer] epoch #3 | Saving snapshot... +2025-04-02 14:03:54 | [pearl_trainer] epoch #3 | Saved +2025-04-02 14:03:54 | [pearl_trainer] epoch #3 | Time 1146.80 s +2025-04-02 14:03:54 | [pearl_trainer] epoch #3 | EpochTime 222.22 s +------------------------------------------------- ---------- +MetaTest/Average/AverageDiscountedReturn -35.2473 +MetaTest/Average/AverageReturn -35.2473 +MetaTest/Average/Iteration 3 +MetaTest/Average/MaxReturn -16.36 +MetaTest/Average/MinReturn -51.7746 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 13.2321 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -35.2473 +MetaTest/__unnamed_task__/AverageReturn -35.2473 +MetaTest/__unnamed_task__/Iteration 3 +MetaTest/__unnamed_task__/MaxReturn -16.36 +MetaTest/__unnamed_task__/MinReturn -51.7746 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 13.2321 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 21400 +------------------------------------------------- ---------- +2025-04-02 14:04:24 | [pearl_trainer] epoch #4 | Training... +2025-04-02 14:05:44 | [pearl_trainer] epoch #4 | Evaluating... +2025-04-02 14:05:44 | [pearl_trainer] epoch #4 | Sampling for adapation and meta-testing... +2025-04-02 14:07:32 | [pearl_trainer] epoch #4 | Finished meta-testing... +2025-04-02 14:07:32 | [pearl_trainer] epoch #4 | Saving snapshot... +2025-04-02 14:07:33 | [pearl_trainer] epoch #4 | Saved +2025-04-02 14:07:33 | [pearl_trainer] epoch #4 | Time 1366.06 s +2025-04-02 14:07:33 | [pearl_trainer] epoch #4 | EpochTime 219.25 s +------------------------------------------------- ---------- +MetaTest/Average/AverageDiscountedReturn -25.047 +MetaTest/Average/AverageReturn -25.047 +MetaTest/Average/Iteration 4 +MetaTest/Average/MaxReturn -19.7625 +MetaTest/Average/MinReturn -28.6404 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.7322 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -25.047 +MetaTest/__unnamed_task__/AverageReturn -25.047 +MetaTest/__unnamed_task__/Iteration 4 +MetaTest/__unnamed_task__/MaxReturn -19.7625 +MetaTest/__unnamed_task__/MinReturn -28.6404 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.7322 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 23000 +------------------------------------------------- ---------- +2025-04-02 14:08:02 | [pearl_trainer] epoch #5 | Training... +2025-04-02 14:09:40 | [pearl_trainer] epoch #5 | Evaluating... +2025-04-02 14:09:40 | [pearl_trainer] epoch #5 | Sampling for adapation and meta-testing... +2025-04-02 14:11:23 | [pearl_trainer] epoch #5 | Finished meta-testing... +2025-04-02 14:11:23 | [pearl_trainer] epoch #5 | Saving snapshot... +2025-04-02 14:11:24 | [pearl_trainer] epoch #5 | Saved +2025-04-02 14:11:24 | [pearl_trainer] epoch #5 | Time 1597.02 s +2025-04-02 14:11:24 | [pearl_trainer] epoch #5 | EpochTime 230.96 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -23.9448 +MetaTest/Average/AverageReturn -23.9448 +MetaTest/Average/Iteration 5 +MetaTest/Average/MaxReturn -20.7284 +MetaTest/Average/MinReturn -30.4779 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.53155 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -23.9448 +MetaTest/__unnamed_task__/AverageReturn -23.9448 +MetaTest/__unnamed_task__/Iteration 5 +MetaTest/__unnamed_task__/MaxReturn -20.7284 +MetaTest/__unnamed_task__/MinReturn -30.4779 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.53155 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 24600 +------------------------------------------------- ----------- +2025-04-02 14:11:56 | [pearl_trainer] epoch #6 | Training... +2025-04-02 14:13:23 | [pearl_trainer] epoch #6 | Evaluating... +2025-04-02 14:13:23 | [pearl_trainer] epoch #6 | Sampling for adapation and meta-testing... +2025-04-02 14:15:09 | [pearl_trainer] epoch #6 | Finished meta-testing... +2025-04-02 14:15:09 | [pearl_trainer] epoch #6 | Saving snapshot... +2025-04-02 14:15:10 | [pearl_trainer] epoch #6 | Saved +2025-04-02 14:15:10 | [pearl_trainer] epoch #6 | Time 1822.79 s +2025-04-02 14:15:10 | [pearl_trainer] epoch #6 | EpochTime 225.77 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -22.6776 +MetaTest/Average/AverageReturn -22.6776 +MetaTest/Average/Iteration 6 +MetaTest/Average/MaxReturn -17.1635 +MetaTest/Average/MinReturn -32.5358 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 5.29532 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -22.6776 +MetaTest/__unnamed_task__/AverageReturn -22.6776 +MetaTest/__unnamed_task__/Iteration 6 +MetaTest/__unnamed_task__/MaxReturn -17.1635 +MetaTest/__unnamed_task__/MinReturn -32.5358 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 5.29532 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 26200 +------------------------------------------------- ----------- +2025-04-02 14:15:40 | [pearl_trainer] epoch #7 | Training... +2025-04-02 14:17:05 | [pearl_trainer] epoch #7 | Evaluating... +2025-04-02 14:17:05 | [pearl_trainer] epoch #7 | Sampling for adapation and meta-testing... +2025-04-02 14:18:48 | [pearl_trainer] epoch #7 | Finished meta-testing... +2025-04-02 14:18:48 | [pearl_trainer] epoch #7 | Saving snapshot... +2025-04-02 14:18:49 | [pearl_trainer] epoch #7 | Saved +2025-04-02 14:18:49 | [pearl_trainer] epoch #7 | Time 2042.01 s +2025-04-02 14:18:49 | [pearl_trainer] epoch #7 | EpochTime 219.21 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -21.6374 +MetaTest/Average/AverageReturn -21.6374 +MetaTest/Average/Iteration 7 +MetaTest/Average/MaxReturn -9.26076 +MetaTest/Average/MinReturn -28.0957 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 6.79293 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -21.6374 +MetaTest/__unnamed_task__/AverageReturn -21.6374 +MetaTest/__unnamed_task__/Iteration 7 +MetaTest/__unnamed_task__/MaxReturn -9.26076 +MetaTest/__unnamed_task__/MinReturn -28.0957 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 6.79293 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 27800 +------------------------------------------------- ----------- +2025-04-02 14:19:17 | [pearl_trainer] epoch #8 | Training... +2025-04-02 14:20:50 | [pearl_trainer] epoch #8 | Evaluating... +2025-04-02 14:20:50 | [pearl_trainer] epoch #8 | Sampling for adapation and meta-testing... +2025-04-02 14:22:31 | [pearl_trainer] epoch #8 | Finished meta-testing... +2025-04-02 14:22:31 | [pearl_trainer] epoch #8 | Saving snapshot... +2025-04-02 14:22:32 | [pearl_trainer] epoch #8 | Saved +2025-04-02 14:22:32 | [pearl_trainer] epoch #8 | Time 2265.36 s +2025-04-02 14:22:32 | [pearl_trainer] epoch #8 | EpochTime 223.35 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -22.6704 +MetaTest/Average/AverageReturn -22.6704 +MetaTest/Average/Iteration 8 +MetaTest/Average/MaxReturn -18.7679 +MetaTest/Average/MinReturn -28.8123 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.72081 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -22.6704 +MetaTest/__unnamed_task__/AverageReturn -22.6704 +MetaTest/__unnamed_task__/Iteration 8 +MetaTest/__unnamed_task__/MaxReturn -18.7679 +MetaTest/__unnamed_task__/MinReturn -28.8123 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.72081 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 29400 +------------------------------------------------- ----------- +2025-04-02 14:23:02 | [pearl_trainer] epoch #9 | Training... +2025-04-02 14:24:33 | [pearl_trainer] epoch #9 | Evaluating... +2025-04-02 14:24:33 | [pearl_trainer] epoch #9 | Sampling for adapation and meta-testing... +2025-04-02 14:26:16 | [pearl_trainer] epoch #9 | Finished meta-testing... +2025-04-02 14:26:16 | [pearl_trainer] epoch #9 | Saving snapshot... +2025-04-02 14:26:17 | [pearl_trainer] epoch #9 | Saved +2025-04-02 14:26:17 | [pearl_trainer] epoch #9 | Time 2490.41 s +2025-04-02 14:26:17 | [pearl_trainer] epoch #9 | EpochTime 225.05 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -19.4773 +MetaTest/Average/AverageReturn -19.4773 +MetaTest/Average/Iteration 9 +MetaTest/Average/MaxReturn -17.8967 +MetaTest/Average/MinReturn -21.179 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 1.05188 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -19.4773 +MetaTest/__unnamed_task__/AverageReturn -19.4773 +MetaTest/__unnamed_task__/Iteration 9 +MetaTest/__unnamed_task__/MaxReturn -17.8967 +MetaTest/__unnamed_task__/MinReturn -21.179 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 1.05188 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 31000 +------------------------------------------------- ----------- +2025-04-02 14:26:46 | [pearl_trainer] epoch #10 | Training... +2025-04-02 14:28:11 | [pearl_trainer] epoch #10 | Evaluating... +2025-04-02 14:28:11 | [pearl_trainer] epoch #10 | Sampling for adapation and meta-testing... +2025-04-02 14:29:55 | [pearl_trainer] epoch #10 | Finished meta-testing... +2025-04-02 14:29:55 | [pearl_trainer] epoch #10 | Saving snapshot... +2025-04-02 14:29:56 | [pearl_trainer] epoch #10 | Saved +2025-04-02 14:29:56 | [pearl_trainer] epoch #10 | Time 2709.32 s +2025-04-02 14:29:56 | [pearl_trainer] epoch #10 | EpochTime 218.91 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -17.5995 +MetaTest/Average/AverageReturn -17.5995 +MetaTest/Average/Iteration 10 +MetaTest/Average/MaxReturn -10.7049 +MetaTest/Average/MinReturn -22.512 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 4.03239 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -17.5995 +MetaTest/__unnamed_task__/AverageReturn -17.5995 +MetaTest/__unnamed_task__/Iteration 10 +MetaTest/__unnamed_task__/MaxReturn -10.7049 +MetaTest/__unnamed_task__/MinReturn -22.512 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 4.03239 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 32600 +------------------------------------------------- ----------- +2025-04-02 14:30:26 | [pearl_trainer] epoch #11 | Training... +2025-04-02 14:31:55 | [pearl_trainer] epoch #11 | Evaluating... +2025-04-02 14:31:55 | [pearl_trainer] epoch #11 | Sampling for adapation and meta-testing... +2025-04-02 14:33:42 | [pearl_trainer] epoch #11 | Finished meta-testing... +2025-04-02 14:33:42 | [pearl_trainer] epoch #11 | Saving snapshot... +2025-04-02 14:33:43 | [pearl_trainer] epoch #11 | Saved +2025-04-02 14:33:43 | [pearl_trainer] epoch #11 | Time 2935.69 s +2025-04-02 14:33:43 | [pearl_trainer] epoch #11 | EpochTime 226.36 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -21.6134 +MetaTest/Average/AverageReturn -21.6134 +MetaTest/Average/Iteration 11 +MetaTest/Average/MaxReturn -17.2524 +MetaTest/Average/MinReturn -27.458 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.40059 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -21.6134 +MetaTest/__unnamed_task__/AverageReturn -21.6134 +MetaTest/__unnamed_task__/Iteration 11 +MetaTest/__unnamed_task__/MaxReturn -17.2524 +MetaTest/__unnamed_task__/MinReturn -27.458 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.40059 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 34200 +------------------------------------------------- ----------- +2025-04-02 14:34:12 | [pearl_trainer] epoch #12 | Training... +2025-04-02 14:35:34 | [pearl_trainer] epoch #12 | Evaluating... +2025-04-02 14:35:34 | [pearl_trainer] epoch #12 | Sampling for adapation and meta-testing... +2025-04-02 14:37:14 | [pearl_trainer] epoch #12 | Finished meta-testing... +2025-04-02 14:37:14 | [pearl_trainer] epoch #12 | Saving snapshot... +2025-04-02 14:37:15 | [pearl_trainer] epoch #12 | Saved +2025-04-02 14:37:15 | [pearl_trainer] epoch #12 | Time 3147.84 s +2025-04-02 14:37:15 | [pearl_trainer] epoch #12 | EpochTime 212.15 s +------------------------------------------------- ---------- +MetaTest/Average/AverageDiscountedReturn -23.7029 +MetaTest/Average/AverageReturn -23.7029 +MetaTest/Average/Iteration 12 +MetaTest/Average/MaxReturn -18.907 +MetaTest/Average/MinReturn -28.6758 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 4.0841 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -23.7029 +MetaTest/__unnamed_task__/AverageReturn -23.7029 +MetaTest/__unnamed_task__/Iteration 12 +MetaTest/__unnamed_task__/MaxReturn -18.907 +MetaTest/__unnamed_task__/MinReturn -28.6758 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 4.0841 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 35800 +------------------------------------------------- ---------- +2025-04-02 14:37:47 | [pearl_trainer] epoch #13 | Training... +2025-04-02 14:39:23 | [pearl_trainer] epoch #13 | Evaluating... +2025-04-02 14:39:23 | [pearl_trainer] epoch #13 | Sampling for adapation and meta-testing... +2025-04-02 14:41:16 | [pearl_trainer] epoch #13 | Finished meta-testing... +2025-04-02 14:41:16 | [pearl_trainer] epoch #13 | Saving snapshot... +2025-04-02 14:41:17 | [pearl_trainer] epoch #13 | Saved +2025-04-02 14:41:17 | [pearl_trainer] epoch #13 | Time 3390.10 s +2025-04-02 14:41:17 | [pearl_trainer] epoch #13 | EpochTime 242.25 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -19.2789 +MetaTest/Average/AverageReturn -19.2789 +MetaTest/Average/Iteration 13 +MetaTest/Average/MaxReturn -11.4718 +MetaTest/Average/MinReturn -32.7042 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 7.13667 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -19.2789 +MetaTest/__unnamed_task__/AverageReturn -19.2789 +MetaTest/__unnamed_task__/Iteration 13 +MetaTest/__unnamed_task__/MaxReturn -11.4718 +MetaTest/__unnamed_task__/MinReturn -32.7042 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 7.13667 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 37400 +------------------------------------------------- ----------- +2025-04-02 14:41:46 | [pearl_trainer] epoch #14 | Training... +2025-04-02 14:43:13 | [pearl_trainer] epoch #14 | Evaluating... +2025-04-02 14:43:13 | [pearl_trainer] epoch #14 | Sampling for adapation and meta-testing... +2025-04-02 14:44:54 | [pearl_trainer] epoch #14 | Finished meta-testing... +2025-04-02 14:44:54 | [pearl_trainer] epoch #14 | Saving snapshot... +2025-04-02 14:44:55 | [pearl_trainer] epoch #14 | Saved +2025-04-02 14:44:55 | [pearl_trainer] epoch #14 | Time 3608.13 s +2025-04-02 14:44:55 | [pearl_trainer] epoch #14 | EpochTime 218.03 s +------------------------------------------------- ---------- +MetaTest/Average/AverageDiscountedReturn -23.8298 +MetaTest/Average/AverageReturn -23.8298 +MetaTest/Average/Iteration 14 +MetaTest/Average/MaxReturn -10.2619 +MetaTest/Average/MinReturn -44.1763 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 11.8903 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -23.8298 +MetaTest/__unnamed_task__/AverageReturn -23.8298 +MetaTest/__unnamed_task__/Iteration 14 +MetaTest/__unnamed_task__/MaxReturn -10.2619 +MetaTest/__unnamed_task__/MinReturn -44.1763 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 11.8903 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 39000 +------------------------------------------------- ---------- +2025-04-02 14:45:27 | [pearl_trainer] epoch #15 | Training... +2025-04-02 14:47:01 | [pearl_trainer] epoch #15 | Evaluating... +2025-04-02 14:47:01 | [pearl_trainer] epoch #15 | Sampling for adapation and meta-testing... +2025-04-02 14:48:46 | [pearl_trainer] epoch #15 | Finished meta-testing... +2025-04-02 14:48:46 | [pearl_trainer] epoch #15 | Saving snapshot... +2025-04-02 14:48:47 | [pearl_trainer] epoch #15 | Saved +2025-04-02 14:48:47 | [pearl_trainer] epoch #15 | Time 3840.06 s +2025-04-02 14:48:47 | [pearl_trainer] epoch #15 | EpochTime 231.93 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -20.3511 +MetaTest/Average/AverageReturn -20.3511 +MetaTest/Average/Iteration 15 +MetaTest/Average/MaxReturn -16.531 +MetaTest/Average/MinReturn -25.8955 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.11799 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -20.3511 +MetaTest/__unnamed_task__/AverageReturn -20.3511 +MetaTest/__unnamed_task__/Iteration 15 +MetaTest/__unnamed_task__/MaxReturn -16.531 +MetaTest/__unnamed_task__/MinReturn -25.8955 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.11799 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 40600 +------------------------------------------------- ----------- +2025-04-02 14:49:16 | [pearl_trainer] epoch #16 | Training... +2025-04-02 14:50:37 | [pearl_trainer] epoch #16 | Evaluating... +2025-04-02 14:50:37 | [pearl_trainer] epoch #16 | Sampling for adapation and meta-testing... +2025-04-02 14:52:26 | [pearl_trainer] epoch #16 | Finished meta-testing... +2025-04-02 14:52:26 | [pearl_trainer] epoch #16 | Saving snapshot... +2025-04-02 14:52:28 | [pearl_trainer] epoch #16 | Saved +2025-04-02 14:52:28 | [pearl_trainer] epoch #16 | Time 4060.62 s +2025-04-02 14:52:28 | [pearl_trainer] epoch #16 | EpochTime 220.56 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -27.5376 +MetaTest/Average/AverageReturn -27.5376 +MetaTest/Average/Iteration 16 +MetaTest/Average/MaxReturn -15.1613 +MetaTest/Average/MinReturn -44.3685 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 9.99509 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -27.5376 +MetaTest/__unnamed_task__/AverageReturn -27.5376 +MetaTest/__unnamed_task__/Iteration 16 +MetaTest/__unnamed_task__/MaxReturn -15.1613 +MetaTest/__unnamed_task__/MinReturn -44.3685 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 9.99509 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 42200 +------------------------------------------------- ----------- +2025-04-02 14:52:56 | [pearl_trainer] epoch #17 | Training... +2025-04-02 14:54:23 | [pearl_trainer] epoch #17 | Evaluating... +2025-04-02 14:54:23 | [pearl_trainer] epoch #17 | Sampling for adapation and meta-testing... +2025-04-02 14:56:09 | [pearl_trainer] epoch #17 | Finished meta-testing... +2025-04-02 14:56:09 | [pearl_trainer] epoch #17 | Saving snapshot... +2025-04-02 14:56:10 | [pearl_trainer] epoch #17 | Saved +2025-04-02 14:56:10 | [pearl_trainer] epoch #17 | Time 4283.30 s +2025-04-02 14:56:10 | [pearl_trainer] epoch #17 | EpochTime 222.68 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -20.8042 +MetaTest/Average/AverageReturn -20.8042 +MetaTest/Average/Iteration 17 +MetaTest/Average/MaxReturn -5.95525 +MetaTest/Average/MinReturn -32.6257 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 8.94228 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -20.8042 +MetaTest/__unnamed_task__/AverageReturn -20.8042 +MetaTest/__unnamed_task__/Iteration 17 +MetaTest/__unnamed_task__/MaxReturn -5.95525 +MetaTest/__unnamed_task__/MinReturn -32.6257 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 8.94228 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 43800 +------------------------------------------------- ----------- +2025-04-02 14:56:41 | [pearl_trainer] epoch #18 | Training... +2025-04-02 14:58:00 | [pearl_trainer] epoch #18 | Evaluating... +2025-04-02 14:58:00 | [pearl_trainer] epoch #18 | Sampling for adapation and meta-testing... +2025-04-02 14:59:50 | [pearl_trainer] epoch #18 | Finished meta-testing... +2025-04-02 14:59:50 | [pearl_trainer] epoch #18 | Saving snapshot... +2025-04-02 14:59:51 | [pearl_trainer] epoch #18 | Saved +2025-04-02 14:59:51 | [pearl_trainer] epoch #18 | Time 4503.56 s +2025-04-02 14:59:51 | [pearl_trainer] epoch #18 | EpochTime 220.26 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -23.1971 +MetaTest/Average/AverageReturn -23.1971 +MetaTest/Average/Iteration 18 +MetaTest/Average/MaxReturn -18.3618 +MetaTest/Average/MinReturn -30.0122 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.99624 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -23.1971 +MetaTest/__unnamed_task__/AverageReturn -23.1971 +MetaTest/__unnamed_task__/Iteration 18 +MetaTest/__unnamed_task__/MaxReturn -18.3618 +MetaTest/__unnamed_task__/MinReturn -30.0122 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.99624 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 45400 +------------------------------------------------- ----------- +2025-04-02 15:00:21 | [pearl_trainer] epoch #19 | Training... +2025-04-02 15:01:51 | [pearl_trainer] epoch #19 | Evaluating... +2025-04-02 15:01:51 | [pearl_trainer] epoch #19 | Sampling for adapation and meta-testing... +2025-04-02 15:03:34 | [pearl_trainer] epoch #19 | Finished meta-testing... +2025-04-02 15:03:34 | [pearl_trainer] epoch #19 | Saving snapshot... +2025-04-02 15:03:35 | [pearl_trainer] epoch #19 | Saved +2025-04-02 15:03:35 | [pearl_trainer] epoch #19 | Time 4727.88 s +2025-04-02 15:03:35 | [pearl_trainer] epoch #19 | EpochTime 224.31 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -19.6469 +MetaTest/Average/AverageReturn -19.6469 +MetaTest/Average/Iteration 19 +MetaTest/Average/MaxReturn -9.3338 +MetaTest/Average/MinReturn -27.8311 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 5.95094 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -19.6469 +MetaTest/__unnamed_task__/AverageReturn -19.6469 +MetaTest/__unnamed_task__/Iteration 19 +MetaTest/__unnamed_task__/MaxReturn -9.3338 +MetaTest/__unnamed_task__/MinReturn -27.8311 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 5.95094 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 47000 +------------------------------------------------- ----------- +2025-04-02 15:04:09 | [pearl_trainer] epoch #20 | Training... +2025-04-02 15:05:36 | [pearl_trainer] epoch #20 | Evaluating... +2025-04-02 15:05:36 | [pearl_trainer] epoch #20 | Sampling for adapation and meta-testing... +2025-04-02 15:07:31 | [pearl_trainer] epoch #20 | Finished meta-testing... +2025-04-02 15:07:31 | [pearl_trainer] epoch #20 | Saving snapshot... +2025-04-02 15:07:32 | [pearl_trainer] epoch #20 | Saved +2025-04-02 15:07:32 | [pearl_trainer] epoch #20 | Time 4965.24 s +2025-04-02 15:07:32 | [pearl_trainer] epoch #20 | EpochTime 237.36 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -20.905 +MetaTest/Average/AverageReturn -20.905 +MetaTest/Average/Iteration 20 +MetaTest/Average/MaxReturn -19.0249 +MetaTest/Average/MinReturn -25.6457 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 2.41798 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -20.905 +MetaTest/__unnamed_task__/AverageReturn -20.905 +MetaTest/__unnamed_task__/Iteration 20 +MetaTest/__unnamed_task__/MaxReturn -19.0249 +MetaTest/__unnamed_task__/MinReturn -25.6457 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 2.41798 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 48600 +------------------------------------------------- ----------- +2025-04-02 15:08:02 | [pearl_trainer] epoch #21 | Training... +2025-04-02 15:09:39 | [pearl_trainer] epoch #21 | Evaluating... +2025-04-02 15:09:39 | [pearl_trainer] epoch #21 | Sampling for adapation and meta-testing... +2025-04-02 15:11:33 | [pearl_trainer] epoch #21 | Finished meta-testing... +2025-04-02 15:11:33 | [pearl_trainer] epoch #21 | Saving snapshot... +2025-04-02 15:11:34 | [pearl_trainer] epoch #21 | Saved +2025-04-02 15:11:34 | [pearl_trainer] epoch #21 | Time 5206.94 s +2025-04-02 15:11:34 | [pearl_trainer] epoch #21 | EpochTime 241.70 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -20.7244 +MetaTest/Average/AverageReturn -20.7244 +MetaTest/Average/Iteration 21 +MetaTest/Average/MaxReturn -4.85922 +MetaTest/Average/MinReturn -36.938 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 10.1895 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -20.7244 +MetaTest/__unnamed_task__/AverageReturn -20.7244 +MetaTest/__unnamed_task__/Iteration 21 +MetaTest/__unnamed_task__/MaxReturn -4.85922 +MetaTest/__unnamed_task__/MinReturn -36.938 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 10.1895 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 50200 +------------------------------------------------- ----------- +2025-04-02 15:12:08 | [pearl_trainer] epoch #22 | Training... +2025-04-02 15:13:35 | [pearl_trainer] epoch #22 | Evaluating... +2025-04-02 15:13:35 | [pearl_trainer] epoch #22 | Sampling for adapation and meta-testing... +2025-04-02 15:15:30 | [pearl_trainer] epoch #22 | Finished meta-testing... +2025-04-02 15:15:30 | [pearl_trainer] epoch #22 | Saving snapshot... +2025-04-02 15:15:31 | [pearl_trainer] epoch #22 | Saved +2025-04-02 15:15:31 | [pearl_trainer] epoch #22 | Time 5443.94 s +2025-04-02 15:15:31 | [pearl_trainer] epoch #22 | EpochTime 236.99 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -21.9557 +MetaTest/Average/AverageReturn -21.9557 +MetaTest/Average/Iteration 22 +MetaTest/Average/MaxReturn -16.3887 +MetaTest/Average/MinReturn -24.7628 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 2.95493 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -21.9557 +MetaTest/__unnamed_task__/AverageReturn -21.9557 +MetaTest/__unnamed_task__/Iteration 22 +MetaTest/__unnamed_task__/MaxReturn -16.3887 +MetaTest/__unnamed_task__/MinReturn -24.7628 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 2.95493 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 51800 +------------------------------------------------- ----------- +2025-04-02 15:16:00 | [pearl_trainer] epoch #23 | Training... +2025-04-02 15:17:24 | [pearl_trainer] epoch #23 | Evaluating... +2025-04-02 15:17:24 | [pearl_trainer] epoch #23 | Sampling for adapation and meta-testing... +2025-04-02 15:19:15 | [pearl_trainer] epoch #23 | Finished meta-testing... +2025-04-02 15:19:15 | [pearl_trainer] epoch #23 | Saving snapshot... +2025-04-02 15:19:16 | [pearl_trainer] epoch #23 | Saved +2025-04-02 15:19:16 | [pearl_trainer] epoch #23 | Time 5669.34 s +2025-04-02 15:19:16 | [pearl_trainer] epoch #23 | EpochTime 225.40 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -22.2629 +MetaTest/Average/AverageReturn -22.2629 +MetaTest/Average/Iteration 23 +MetaTest/Average/MaxReturn -16.5059 +MetaTest/Average/MinReturn -28.0977 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.94191 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -22.2629 +MetaTest/__unnamed_task__/AverageReturn -22.2629 +MetaTest/__unnamed_task__/Iteration 23 +MetaTest/__unnamed_task__/MaxReturn -16.5059 +MetaTest/__unnamed_task__/MinReturn -28.0977 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.94191 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 53400 +------------------------------------------------- ----------- +2025-04-02 15:19:48 | [pearl_trainer] epoch #24 | Training... +2025-04-02 15:21:15 | [pearl_trainer] epoch #24 | Evaluating... +2025-04-02 15:21:15 | [pearl_trainer] epoch #24 | Sampling for adapation and meta-testing... +2025-04-02 15:23:06 | [pearl_trainer] epoch #24 | Finished meta-testing... +2025-04-02 15:23:06 | [pearl_trainer] epoch #24 | Saving snapshot... +2025-04-02 15:23:07 | [pearl_trainer] epoch #24 | Saved +2025-04-02 15:23:07 | [pearl_trainer] epoch #24 | Time 5899.83 s +2025-04-02 15:23:07 | [pearl_trainer] epoch #24 | EpochTime 230.49 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -16.9849 +MetaTest/Average/AverageReturn -16.9849 +MetaTest/Average/Iteration 24 +MetaTest/Average/MaxReturn -0.821416 +MetaTest/Average/MinReturn -24.4387 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 8.52942 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -16.9849 +MetaTest/__unnamed_task__/AverageReturn -16.9849 +MetaTest/__unnamed_task__/Iteration 24 +MetaTest/__unnamed_task__/MaxReturn -0.821416 +MetaTest/__unnamed_task__/MinReturn -24.4387 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 8.52942 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 55000 +------------------------------------------------- ------------ +2025-04-02 15:23:37 | [pearl_trainer] epoch #25 | Training... +2025-04-02 15:25:07 | [pearl_trainer] epoch #25 | Evaluating... +2025-04-02 15:25:07 | [pearl_trainer] epoch #25 | Sampling for adapation and meta-testing... +2025-04-02 15:26:51 | [pearl_trainer] epoch #25 | Finished meta-testing... +2025-04-02 15:26:51 | [pearl_trainer] epoch #25 | Saving snapshot... +2025-04-02 15:26:51 | [pearl_trainer] epoch #25 | Saved +2025-04-02 15:26:51 | [pearl_trainer] epoch #25 | Time 6124.47 s +2025-04-02 15:26:51 | [pearl_trainer] epoch #25 | EpochTime 224.64 s +------------------------------------------------- ---------- +MetaTest/Average/AverageDiscountedReturn -17.2318 +MetaTest/Average/AverageReturn -17.2318 +MetaTest/Average/Iteration 25 +MetaTest/Average/MaxReturn -12.8127 +MetaTest/Average/MinReturn -20.6082 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 2.943 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -17.2318 +MetaTest/__unnamed_task__/AverageReturn -17.2318 +MetaTest/__unnamed_task__/Iteration 25 +MetaTest/__unnamed_task__/MaxReturn -12.8127 +MetaTest/__unnamed_task__/MinReturn -20.6082 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 2.943 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 56600 +------------------------------------------------- ---------- +2025-04-02 15:27:24 | [pearl_trainer] epoch #26 | Training... +2025-04-02 15:28:49 | [pearl_trainer] epoch #26 | Evaluating... +2025-04-02 15:28:49 | [pearl_trainer] epoch #26 | Sampling for adapation and meta-testing... +2025-04-02 15:30:37 | [pearl_trainer] epoch #26 | Finished meta-testing... +2025-04-02 15:30:37 | [pearl_trainer] epoch #26 | Saving snapshot... +2025-04-02 15:30:38 | [pearl_trainer] epoch #26 | Saved +2025-04-02 15:30:38 | [pearl_trainer] epoch #26 | Time 6351.12 s +2025-04-02 15:30:38 | [pearl_trainer] epoch #26 | EpochTime 226.64 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -23.2634 +MetaTest/Average/AverageReturn -23.2634 +MetaTest/Average/Iteration 26 +MetaTest/Average/MaxReturn -19.9607 +MetaTest/Average/MinReturn -25.5376 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 2.21214 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -23.2634 +MetaTest/__unnamed_task__/AverageReturn -23.2634 +MetaTest/__unnamed_task__/Iteration 26 +MetaTest/__unnamed_task__/MaxReturn -19.9607 +MetaTest/__unnamed_task__/MinReturn -25.5376 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 2.21214 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 58200 +------------------------------------------------- ----------- +2025-04-02 15:31:09 | [pearl_trainer] epoch #27 | Training... +2025-04-02 15:32:34 | [pearl_trainer] epoch #27 | Evaluating... +2025-04-02 15:32:34 | [pearl_trainer] epoch #27 | Sampling for adapation and meta-testing... +2025-04-02 15:34:22 | [pearl_trainer] epoch #27 | Finished meta-testing... +2025-04-02 15:34:22 | [pearl_trainer] epoch #27 | Saving snapshot... +2025-04-02 15:34:23 | [pearl_trainer] epoch #27 | Saved +2025-04-02 15:34:23 | [pearl_trainer] epoch #27 | Time 6576.26 s +2025-04-02 15:34:23 | [pearl_trainer] epoch #27 | EpochTime 225.14 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -25.7283 +MetaTest/Average/AverageReturn -25.7283 +MetaTest/Average/Iteration 27 +MetaTest/Average/MaxReturn -21.7489 +MetaTest/Average/MinReturn -31.8223 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.46328 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -25.7283 +MetaTest/__unnamed_task__/AverageReturn -25.7283 +MetaTest/__unnamed_task__/Iteration 27 +MetaTest/__unnamed_task__/MaxReturn -21.7489 +MetaTest/__unnamed_task__/MinReturn -31.8223 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.46328 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 59800 +------------------------------------------------- ----------- +2025-04-02 15:34:54 | [pearl_trainer] epoch #28 | Training... +2025-04-02 15:36:31 | [pearl_trainer] epoch #28 | Evaluating... +2025-04-02 15:36:31 | [pearl_trainer] epoch #28 | Sampling for adapation and meta-testing... +2025-04-02 15:38:18 | [pearl_trainer] epoch #28 | Finished meta-testing... +2025-04-02 15:38:18 | [pearl_trainer] epoch #28 | Saving snapshot... +2025-04-02 15:38:19 | [pearl_trainer] epoch #28 | Saved +2025-04-02 15:38:19 | [pearl_trainer] epoch #28 | Time 6812.26 s +2025-04-02 15:38:19 | [pearl_trainer] epoch #28 | EpochTime 235.99 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -24.4924 +MetaTest/Average/AverageReturn -24.4924 +MetaTest/Average/Iteration 28 +MetaTest/Average/MaxReturn -16.6407 +MetaTest/Average/MinReturn -34.4437 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 5.91712 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -24.4924 +MetaTest/__unnamed_task__/AverageReturn -24.4924 +MetaTest/__unnamed_task__/Iteration 28 +MetaTest/__unnamed_task__/MaxReturn -16.6407 +MetaTest/__unnamed_task__/MinReturn -34.4437 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 5.91712 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 61400 +------------------------------------------------- ----------- +2025-04-02 15:38:54 | [pearl_trainer] epoch #29 | Training... +2025-04-02 15:40:18 | [pearl_trainer] epoch #29 | Evaluating... +2025-04-02 15:40:18 | [pearl_trainer] epoch #29 | Sampling for adapation and meta-testing... +2025-04-02 15:42:18 | [pearl_trainer] epoch #29 | Finished meta-testing... +2025-04-02 15:42:18 | [pearl_trainer] epoch #29 | Saving snapshot... +2025-04-02 15:42:19 | [pearl_trainer] epoch #29 | Saved +2025-04-02 15:42:19 | [pearl_trainer] epoch #29 | Time 7051.64 s +2025-04-02 15:42:19 | [pearl_trainer] epoch #29 | EpochTime 239.39 s +------------------------------------------------- ---------- +MetaTest/Average/AverageDiscountedReturn -26.9995 +MetaTest/Average/AverageReturn -26.9995 +MetaTest/Average/Iteration 29 +MetaTest/Average/MaxReturn -19.7894 +MetaTest/Average/MinReturn -47.2528 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 10.2769 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -26.9995 +MetaTest/__unnamed_task__/AverageReturn -26.9995 +MetaTest/__unnamed_task__/Iteration 29 +MetaTest/__unnamed_task__/MaxReturn -19.7894 +MetaTest/__unnamed_task__/MinReturn -47.2528 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 10.2769 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 63000 +------------------------------------------------- ---------- +2025-04-02 15:42:48 | [pearl_trainer] epoch #30 | Training... +2025-04-02 15:44:15 | [pearl_trainer] epoch #30 | Evaluating... +2025-04-02 15:44:15 | [pearl_trainer] epoch #30 | Sampling for adapation and meta-testing... +2025-04-02 15:46:03 | [pearl_trainer] epoch #30 | Finished meta-testing... +2025-04-02 15:46:03 | [pearl_trainer] epoch #30 | Saving snapshot... +2025-04-02 15:46:04 | [pearl_trainer] epoch #30 | Saved +2025-04-02 15:46:04 | [pearl_trainer] epoch #30 | Time 7277.20 s +2025-04-02 15:46:04 | [pearl_trainer] epoch #30 | EpochTime 225.56 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -26.9238 +MetaTest/Average/AverageReturn -26.9238 +MetaTest/Average/Iteration 30 +MetaTest/Average/MaxReturn -18.126 +MetaTest/Average/MinReturn -40.7802 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 8.51223 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -26.9238 +MetaTest/__unnamed_task__/AverageReturn -26.9238 +MetaTest/__unnamed_task__/Iteration 30 +MetaTest/__unnamed_task__/MaxReturn -18.126 +MetaTest/__unnamed_task__/MinReturn -40.7802 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 8.51223 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 64600 +------------------------------------------------- ----------- +2025-04-02 15:46:35 | [pearl_trainer] epoch #31 | Training... +2025-04-02 15:47:48 | [pearl_trainer] epoch #31 | Evaluating... +2025-04-02 15:47:48 | [pearl_trainer] epoch #31 | Sampling for adapation and meta-testing... +2025-04-02 15:49:43 | [pearl_trainer] epoch #31 | Finished meta-testing... +2025-04-02 15:49:43 | [pearl_trainer] epoch #31 | Saving snapshot... +2025-04-02 15:49:44 | [pearl_trainer] epoch #31 | Saved +2025-04-02 15:49:44 | [pearl_trainer] epoch #31 | Time 7496.93 s +2025-04-02 15:49:44 | [pearl_trainer] epoch #31 | EpochTime 219.73 s +------------------------------------------------- ---------- +MetaTest/Average/AverageDiscountedReturn -27.8311 +MetaTest/Average/AverageReturn -27.8311 +MetaTest/Average/Iteration 31 +MetaTest/Average/MaxReturn -20.168 +MetaTest/Average/MinReturn -50.4576 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 11.4738 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -27.8311 +MetaTest/__unnamed_task__/AverageReturn -27.8311 +MetaTest/__unnamed_task__/Iteration 31 +MetaTest/__unnamed_task__/MaxReturn -20.168 +MetaTest/__unnamed_task__/MinReturn -50.4576 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 11.4738 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 66200 +------------------------------------------------- ---------- +2025-04-02 15:50:14 | [pearl_trainer] epoch #32 | Training... +2025-04-02 15:51:50 | [pearl_trainer] epoch #32 | Evaluating... +2025-04-02 15:51:50 | [pearl_trainer] epoch #32 | Sampling for adapation and meta-testing... +2025-04-02 15:53:38 | [pearl_trainer] epoch #32 | Finished meta-testing... +2025-04-02 15:53:38 | [pearl_trainer] epoch #32 | Saving snapshot... +2025-04-02 15:53:39 | [pearl_trainer] epoch #32 | Saved +2025-04-02 15:53:39 | [pearl_trainer] epoch #32 | Time 7732.09 s +2025-04-02 15:53:39 | [pearl_trainer] epoch #32 | EpochTime 235.16 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -22.0477 +MetaTest/Average/AverageReturn -22.0477 +MetaTest/Average/Iteration 32 +MetaTest/Average/MaxReturn -17.2026 +MetaTest/Average/MinReturn -27.5708 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.50762 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -22.0477 +MetaTest/__unnamed_task__/AverageReturn -22.0477 +MetaTest/__unnamed_task__/Iteration 32 +MetaTest/__unnamed_task__/MaxReturn -17.2026 +MetaTest/__unnamed_task__/MinReturn -27.5708 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.50762 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 67800 +------------------------------------------------- ----------- +2025-04-02 15:54:12 | [pearl_trainer] epoch #33 | Training... +2025-04-02 15:55:42 | [pearl_trainer] epoch #33 | Evaluating... +2025-04-02 15:55:42 | [pearl_trainer] epoch #33 | Sampling for adapation and meta-testing... +2025-04-02 15:57:33 | [pearl_trainer] epoch #33 | Finished meta-testing... +2025-04-02 15:57:33 | [pearl_trainer] epoch #33 | Saving snapshot... +2025-04-02 15:57:34 | [pearl_trainer] epoch #33 | Saved +2025-04-02 15:57:34 | [pearl_trainer] epoch #33 | Time 7967.14 s +2025-04-02 15:57:34 | [pearl_trainer] epoch #33 | EpochTime 235.05 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -11.4242 +MetaTest/Average/AverageReturn -11.4242 +MetaTest/Average/Iteration 33 +MetaTest/Average/MaxReturn -4.0269 +MetaTest/Average/MinReturn -18.6713 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 5.53629 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -11.4242 +MetaTest/__unnamed_task__/AverageReturn -11.4242 +MetaTest/__unnamed_task__/Iteration 33 +MetaTest/__unnamed_task__/MaxReturn -4.0269 +MetaTest/__unnamed_task__/MinReturn -18.6713 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 5.53629 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 69400 +------------------------------------------------- ----------- +2025-04-02 15:58:06 | [pearl_trainer] epoch #34 | Training... +2025-04-02 15:59:32 | [pearl_trainer] epoch #34 | Evaluating... +2025-04-02 15:59:32 | [pearl_trainer] epoch #34 | Sampling for adapation and meta-testing... +2025-04-02 16:01:21 | [pearl_trainer] epoch #34 | Finished meta-testing... +2025-04-02 16:01:21 | [pearl_trainer] epoch #34 | Saving snapshot... +2025-04-02 16:01:22 | [pearl_trainer] epoch #34 | Saved +2025-04-02 16:01:22 | [pearl_trainer] epoch #34 | Time 8194.55 s +2025-04-02 16:01:22 | [pearl_trainer] epoch #34 | EpochTime 227.40 s +------------------------------------------------- ---------- +MetaTest/Average/AverageDiscountedReturn -28.7716 +MetaTest/Average/AverageReturn -28.7716 +MetaTest/Average/Iteration 34 +MetaTest/Average/MaxReturn -16.8497 +MetaTest/Average/MinReturn -60.3166 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 15.9946 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -28.7716 +MetaTest/__unnamed_task__/AverageReturn -28.7716 +MetaTest/__unnamed_task__/Iteration 34 +MetaTest/__unnamed_task__/MaxReturn -16.8497 +MetaTest/__unnamed_task__/MinReturn -60.3166 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 15.9946 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 71000 +------------------------------------------------- ---------- +2025-04-02 16:01:52 | [pearl_trainer] epoch #35 | Training... +2025-04-02 16:03:07 | [pearl_trainer] epoch #35 | Evaluating... +2025-04-02 16:03:07 | [pearl_trainer] epoch #35 | Sampling for adapation and meta-testing... +2025-04-02 16:04:58 | [pearl_trainer] epoch #35 | Finished meta-testing... +2025-04-02 16:04:58 | [pearl_trainer] epoch #35 | Saving snapshot... +2025-04-02 16:04:59 | [pearl_trainer] epoch #35 | Saved +2025-04-02 16:04:59 | [pearl_trainer] epoch #35 | Time 8412.21 s +2025-04-02 16:04:59 | [pearl_trainer] epoch #35 | EpochTime 217.66 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -22.4103 +MetaTest/Average/AverageReturn -22.4103 +MetaTest/Average/Iteration 35 +MetaTest/Average/MaxReturn -11.3184 +MetaTest/Average/MinReturn -36.3447 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 8.53645 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -22.4103 +MetaTest/__unnamed_task__/AverageReturn -22.4103 +MetaTest/__unnamed_task__/Iteration 35 +MetaTest/__unnamed_task__/MaxReturn -11.3184 +MetaTest/__unnamed_task__/MinReturn -36.3447 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 8.53645 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 72600 +------------------------------------------------- ----------- +2025-04-02 16:05:28 | [pearl_trainer] epoch #36 | Training... +2025-04-02 16:06:54 | [pearl_trainer] epoch #36 | Evaluating... +2025-04-02 16:06:54 | [pearl_trainer] epoch #36 | Sampling for adapation and meta-testing... +2025-04-02 16:08:37 | [pearl_trainer] epoch #36 | Finished meta-testing... +2025-04-02 16:08:37 | [pearl_trainer] epoch #36 | Saving snapshot... +2025-04-02 16:08:38 | [pearl_trainer] epoch #36 | Saved +2025-04-02 16:08:38 | [pearl_trainer] epoch #36 | Time 8630.73 s +2025-04-02 16:08:38 | [pearl_trainer] epoch #36 | EpochTime 218.52 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -17.3184 +MetaTest/Average/AverageReturn -17.3184 +MetaTest/Average/Iteration 36 +MetaTest/Average/MaxReturn 0.370391 +MetaTest/Average/MinReturn -25.4845 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 9.21436 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -17.3184 +MetaTest/__unnamed_task__/AverageReturn -17.3184 +MetaTest/__unnamed_task__/Iteration 36 +MetaTest/__unnamed_task__/MaxReturn 0.370391 +MetaTest/__unnamed_task__/MinReturn -25.4845 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 9.21436 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 74200 +------------------------------------------------- ------------ +2025-04-02 16:09:07 | [pearl_trainer] epoch #37 | Training... +2025-04-02 16:10:42 | [pearl_trainer] epoch #37 | Evaluating... +2025-04-02 16:10:42 | [pearl_trainer] epoch #37 | Sampling for adapation and meta-testing... +2025-04-02 16:12:33 | [pearl_trainer] epoch #37 | Finished meta-testing... +2025-04-02 16:12:33 | [pearl_trainer] epoch #37 | Saving snapshot... +2025-04-02 16:12:34 | [pearl_trainer] epoch #37 | Saved +2025-04-02 16:12:34 | [pearl_trainer] epoch #37 | Time 8867.34 s +2025-04-02 16:12:34 | [pearl_trainer] epoch #37 | EpochTime 236.60 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -11.7964 +MetaTest/Average/AverageReturn -11.7964 +MetaTest/Average/Iteration 37 +MetaTest/Average/MaxReturn -3.00448 +MetaTest/Average/MinReturn -16.7134 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 4.85814 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -11.7964 +MetaTest/__unnamed_task__/AverageReturn -11.7964 +MetaTest/__unnamed_task__/Iteration 37 +MetaTest/__unnamed_task__/MaxReturn -3.00448 +MetaTest/__unnamed_task__/MinReturn -16.7134 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 4.85814 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 75800 +------------------------------------------------- ----------- +2025-04-02 16:13:08 | [pearl_trainer] epoch #38 | Training... +2025-04-02 16:14:36 | [pearl_trainer] epoch #38 | Evaluating... +2025-04-02 16:14:36 | [pearl_trainer] epoch #38 | Sampling for adapation and meta-testing... +2025-04-02 16:16:24 | [pearl_trainer] epoch #38 | Finished meta-testing... +2025-04-02 16:16:24 | [pearl_trainer] epoch #38 | Saving snapshot... +2025-04-02 16:16:25 | [pearl_trainer] epoch #38 | Saved +2025-04-02 16:16:25 | [pearl_trainer] epoch #38 | Time 9097.65 s +2025-04-02 16:16:25 | [pearl_trainer] epoch #38 | EpochTime 230.32 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -7.72792 +MetaTest/Average/AverageReturn -7.72792 +MetaTest/Average/Iteration 38 +MetaTest/Average/MaxReturn 16.8852 +MetaTest/Average/MinReturn -23.9715 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 15.9574 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -7.72792 +MetaTest/__unnamed_task__/AverageReturn -7.72792 +MetaTest/__unnamed_task__/Iteration 38 +MetaTest/__unnamed_task__/MaxReturn 16.8852 +MetaTest/__unnamed_task__/MinReturn -23.9715 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 15.9574 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 77400 +------------------------------------------------- ----------- +2025-04-02 16:16:54 | [pearl_trainer] epoch #39 | Training... +2025-04-02 16:18:14 | [pearl_trainer] epoch #39 | Evaluating... +2025-04-02 16:18:14 | [pearl_trainer] epoch #39 | Sampling for adapation and meta-testing... +2025-04-02 16:19:59 | [pearl_trainer] epoch #39 | Finished meta-testing... +2025-04-02 16:19:59 | [pearl_trainer] epoch #39 | Saving snapshot... +2025-04-02 16:20:00 | [pearl_trainer] epoch #39 | Saved +2025-04-02 16:20:00 | [pearl_trainer] epoch #39 | Time 9312.89 s +2025-04-02 16:20:00 | [pearl_trainer] epoch #39 | EpochTime 215.23 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -16.3912 +MetaTest/Average/AverageReturn -16.3912 +MetaTest/Average/Iteration 39 +MetaTest/Average/MaxReturn -15.0812 +MetaTest/Average/MinReturn -18.364 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 1.19982 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -16.3912 +MetaTest/__unnamed_task__/AverageReturn -16.3912 +MetaTest/__unnamed_task__/Iteration 39 +MetaTest/__unnamed_task__/MaxReturn -15.0812 +MetaTest/__unnamed_task__/MinReturn -18.364 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 1.19982 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 79000 +------------------------------------------------- ----------- +2025-04-02 16:20:30 | [pearl_trainer] epoch #40 | Training... +2025-04-02 16:21:51 | [pearl_trainer] epoch #40 | Evaluating... +2025-04-02 16:21:51 | [pearl_trainer] epoch #40 | Sampling for adapation and meta-testing... +2025-04-02 16:23:41 | [pearl_trainer] epoch #40 | Finished meta-testing... +2025-04-02 16:23:41 | [pearl_trainer] epoch #40 | Saving snapshot... +2025-04-02 16:23:42 | [pearl_trainer] epoch #40 | Saved +2025-04-02 16:23:42 | [pearl_trainer] epoch #40 | Time 9535.05 s +2025-04-02 16:23:42 | [pearl_trainer] epoch #40 | EpochTime 222.16 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -14.3286 +MetaTest/Average/AverageReturn -14.3286 +MetaTest/Average/Iteration 40 +MetaTest/Average/MaxReturn 6.16217 +MetaTest/Average/MinReturn -29.5854 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 11.7505 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -14.3286 +MetaTest/__unnamed_task__/AverageReturn -14.3286 +MetaTest/__unnamed_task__/Iteration 40 +MetaTest/__unnamed_task__/MaxReturn 6.16217 +MetaTest/__unnamed_task__/MinReturn -29.5854 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 11.7505 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 80600 +------------------------------------------------- ----------- +2025-04-02 16:24:11 | [pearl_trainer] epoch #41 | Training... +2025-04-02 16:25:44 | [pearl_trainer] epoch #41 | Evaluating... +2025-04-02 16:25:44 | [pearl_trainer] epoch #41 | Sampling for adapation and meta-testing... +2025-04-02 16:27:46 | [pearl_trainer] epoch #41 | Finished meta-testing... +2025-04-02 16:27:46 | [pearl_trainer] epoch #41 | Saving snapshot... +2025-04-02 16:27:47 | [pearl_trainer] epoch #41 | Saved +2025-04-02 16:27:47 | [pearl_trainer] epoch #41 | Time 9780.16 s +2025-04-02 16:27:47 | [pearl_trainer] epoch #41 | EpochTime 245.11 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -17.2561 +MetaTest/Average/AverageReturn -17.2561 +MetaTest/Average/Iteration 41 +MetaTest/Average/MaxReturn -11.6425 +MetaTest/Average/MinReturn -22.3301 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.79196 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -17.2561 +MetaTest/__unnamed_task__/AverageReturn -17.2561 +MetaTest/__unnamed_task__/Iteration 41 +MetaTest/__unnamed_task__/MaxReturn -11.6425 +MetaTest/__unnamed_task__/MinReturn -22.3301 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.79196 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 82200 +------------------------------------------------- ----------- +2025-04-02 16:28:29 | [pearl_trainer] epoch #42 | Training... +2025-04-02 16:30:37 | [pearl_trainer] epoch #42 | Evaluating... +2025-04-02 16:30:37 | [pearl_trainer] epoch #42 | Sampling for adapation and meta-testing... +2025-04-02 16:33:04 | [pearl_trainer] epoch #42 | Finished meta-testing... +2025-04-02 16:33:04 | [pearl_trainer] epoch #42 | Saving snapshot... +2025-04-02 16:33:05 | [pearl_trainer] epoch #42 | Saved +2025-04-02 16:33:05 | [pearl_trainer] epoch #42 | Time 10097.87 s +2025-04-02 16:33:05 | [pearl_trainer] epoch #42 | EpochTime 317.71 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -17.6708 +MetaTest/Average/AverageReturn -17.6708 +MetaTest/Average/Iteration 42 +MetaTest/Average/MaxReturn -9.06397 +MetaTest/Average/MinReturn -24.8941 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 5.22008 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -17.6708 +MetaTest/__unnamed_task__/AverageReturn -17.6708 +MetaTest/__unnamed_task__/Iteration 42 +MetaTest/__unnamed_task__/MaxReturn -9.06397 +MetaTest/__unnamed_task__/MinReturn -24.8941 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 5.22008 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 83800 +------------------------------------------------- ----------- +2025-04-02 16:33:35 | [pearl_trainer] epoch #43 | Training... +2025-04-02 16:35:03 | [pearl_trainer] epoch #43 | Evaluating... +2025-04-02 16:35:03 | [pearl_trainer] epoch #43 | Sampling for adapation and meta-testing... +2025-04-02 16:36:54 | [pearl_trainer] epoch #43 | Finished meta-testing... +2025-04-02 16:36:54 | [pearl_trainer] epoch #43 | Saving snapshot... +2025-04-02 16:36:55 | [pearl_trainer] epoch #43 | Saved +2025-04-02 16:36:55 | [pearl_trainer] epoch #43 | Time 10328.43 s +2025-04-02 16:36:55 | [pearl_trainer] epoch #43 | EpochTime 230.55 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -12.0834 +MetaTest/Average/AverageReturn -12.0834 +MetaTest/Average/Iteration 43 +MetaTest/Average/MaxReturn -2.58727 +MetaTest/Average/MinReturn -25.9441 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 9.70779 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -12.0834 +MetaTest/__unnamed_task__/AverageReturn -12.0834 +MetaTest/__unnamed_task__/Iteration 43 +MetaTest/__unnamed_task__/MaxReturn -2.58727 +MetaTest/__unnamed_task__/MinReturn -25.9441 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 9.70779 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 85400 +------------------------------------------------- ----------- +2025-04-02 16:37:27 | [pearl_trainer] epoch #44 | Training... +2025-04-02 16:38:53 | [pearl_trainer] epoch #44 | Evaluating... +2025-04-02 16:38:53 | [pearl_trainer] epoch #44 | Sampling for adapation and meta-testing... +2025-04-02 16:40:44 | [pearl_trainer] epoch #44 | Finished meta-testing... +2025-04-02 16:40:44 | [pearl_trainer] epoch #44 | Saving snapshot... +2025-04-02 16:40:45 | [pearl_trainer] epoch #44 | Saved +2025-04-02 16:40:45 | [pearl_trainer] epoch #44 | Time 10557.86 s +2025-04-02 16:40:45 | [pearl_trainer] epoch #44 | EpochTime 229.43 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -19.1956 +MetaTest/Average/AverageReturn -19.1956 +MetaTest/Average/Iteration 44 +MetaTest/Average/MaxReturn -13.6122 +MetaTest/Average/MinReturn -23.2155 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.83889 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -19.1956 +MetaTest/__unnamed_task__/AverageReturn -19.1956 +MetaTest/__unnamed_task__/Iteration 44 +MetaTest/__unnamed_task__/MaxReturn -13.6122 +MetaTest/__unnamed_task__/MinReturn -23.2155 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.83889 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 87000 +------------------------------------------------- ----------- +2025-04-02 16:41:15 | [pearl_trainer] epoch #45 | Training... +2025-04-02 16:42:48 | [pearl_trainer] epoch #45 | Evaluating... +2025-04-02 16:42:48 | [pearl_trainer] epoch #45 | Sampling for adapation and meta-testing... +2025-04-02 16:44:35 | [pearl_trainer] epoch #45 | Finished meta-testing... +2025-04-02 16:44:35 | [pearl_trainer] epoch #45 | Saving snapshot... +2025-04-02 16:44:36 | [pearl_trainer] epoch #45 | Saved +2025-04-02 16:44:36 | [pearl_trainer] epoch #45 | Time 10788.85 s +2025-04-02 16:44:36 | [pearl_trainer] epoch #45 | EpochTime 230.99 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -15.5721 +MetaTest/Average/AverageReturn -15.5721 +MetaTest/Average/Iteration 45 +MetaTest/Average/MaxReturn 0.198331 +MetaTest/Average/MinReturn -21.7072 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 8.06152 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -15.5721 +MetaTest/__unnamed_task__/AverageReturn -15.5721 +MetaTest/__unnamed_task__/Iteration 45 +MetaTest/__unnamed_task__/MaxReturn 0.198331 +MetaTest/__unnamed_task__/MinReturn -21.7072 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 8.06152 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 88600 +------------------------------------------------- ------------ +2025-04-02 16:45:12 | [pearl_trainer] epoch #46 | Training... +2025-04-02 16:46:55 | [pearl_trainer] epoch #46 | Evaluating... +2025-04-02 16:46:55 | [pearl_trainer] epoch #46 | Sampling for adapation and meta-testing... +2025-04-02 16:48:51 | [pearl_trainer] epoch #46 | Finished meta-testing... +2025-04-02 16:48:51 | [pearl_trainer] epoch #46 | Saving snapshot... +2025-04-02 16:48:51 | [pearl_trainer] epoch #46 | Saved +2025-04-02 16:48:51 | [pearl_trainer] epoch #46 | Time 11044.49 s +2025-04-02 16:48:51 | [pearl_trainer] epoch #46 | EpochTime 255.64 s +------------------------------------------------- ---------- +MetaTest/Average/AverageDiscountedReturn -18.1298 +MetaTest/Average/AverageReturn -18.1298 +MetaTest/Average/Iteration 46 +MetaTest/Average/MaxReturn 11.4964 +MetaTest/Average/MinReturn -31.4637 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 15.7373 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -18.1298 +MetaTest/__unnamed_task__/AverageReturn -18.1298 +MetaTest/__unnamed_task__/Iteration 46 +MetaTest/__unnamed_task__/MaxReturn 11.4964 +MetaTest/__unnamed_task__/MinReturn -31.4637 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 15.7373 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 90200 +------------------------------------------------- ---------- +2025-04-02 16:49:22 | [pearl_trainer] epoch #47 | Training... +2025-04-02 16:50:44 | [pearl_trainer] epoch #47 | Evaluating... +2025-04-02 16:50:44 | [pearl_trainer] epoch #47 | Sampling for adapation and meta-testing... +2025-04-02 16:52:38 | [pearl_trainer] epoch #47 | Finished meta-testing... +2025-04-02 16:52:38 | [pearl_trainer] epoch #47 | Saving snapshot... +2025-04-02 16:52:40 | [pearl_trainer] epoch #47 | Saved +2025-04-02 16:52:40 | [pearl_trainer] epoch #47 | Time 11272.66 s +2025-04-02 16:52:40 | [pearl_trainer] epoch #47 | EpochTime 228.17 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -7.88335 +MetaTest/Average/AverageReturn -7.88335 +MetaTest/Average/Iteration 47 +MetaTest/Average/MaxReturn 18.0602 +MetaTest/Average/MinReturn -22.4929 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 15.4556 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -7.88335 +MetaTest/__unnamed_task__/AverageReturn -7.88335 +MetaTest/__unnamed_task__/Iteration 47 +MetaTest/__unnamed_task__/MaxReturn 18.0602 +MetaTest/__unnamed_task__/MinReturn -22.4929 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 15.4556 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 91800 +------------------------------------------------- ----------- +2025-04-02 16:53:11 | [pearl_trainer] epoch #48 | Training... +2025-04-02 16:54:46 | [pearl_trainer] epoch #48 | Evaluating... +2025-04-02 16:54:46 | [pearl_trainer] epoch #48 | Sampling for adapation and meta-testing... +2025-04-02 16:56:38 | [pearl_trainer] epoch #48 | Finished meta-testing... +2025-04-02 16:56:38 | [pearl_trainer] epoch #48 | Saving snapshot... +2025-04-02 16:56:39 | [pearl_trainer] epoch #48 | Saved +2025-04-02 16:56:39 | [pearl_trainer] epoch #48 | Time 11512.20 s +2025-04-02 16:56:39 | [pearl_trainer] epoch #48 | EpochTime 239.54 s +------------------------------------------------- ---------- +MetaTest/Average/AverageDiscountedReturn -23.8101 +MetaTest/Average/AverageReturn -23.8101 +MetaTest/Average/Iteration 48 +MetaTest/Average/MaxReturn -16.8845 +MetaTest/Average/MinReturn -31.3138 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 5.04 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -23.8101 +MetaTest/__unnamed_task__/AverageReturn -23.8101 +MetaTest/__unnamed_task__/Iteration 48 +MetaTest/__unnamed_task__/MaxReturn -16.8845 +MetaTest/__unnamed_task__/MinReturn -31.3138 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 5.04 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 93400 +------------------------------------------------- ---------- +2025-04-02 16:57:12 | [pearl_trainer] epoch #49 | Training... +2025-04-02 16:58:33 | [pearl_trainer] epoch #49 | Evaluating... +2025-04-02 16:58:33 | [pearl_trainer] epoch #49 | Sampling for adapation and meta-testing... +2025-04-02 17:00:28 | [pearl_trainer] epoch #49 | Finished meta-testing... +2025-04-02 17:00:28 | [pearl_trainer] epoch #49 | Saving snapshot... +2025-04-02 17:00:29 | [pearl_trainer] epoch #49 | Saved +2025-04-02 17:00:29 | [pearl_trainer] epoch #49 | Time 11741.99 s +2025-04-02 17:00:29 | [pearl_trainer] epoch #49 | EpochTime 229.78 s +------------------------------------------------- ---------- +MetaTest/Average/AverageDiscountedReturn -21.7259 +MetaTest/Average/AverageReturn -21.7259 +MetaTest/Average/Iteration 49 +MetaTest/Average/MaxReturn -17.3001 +MetaTest/Average/MinReturn -27.3247 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.3154 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -21.7259 +MetaTest/__unnamed_task__/AverageReturn -21.7259 +MetaTest/__unnamed_task__/Iteration 49 +MetaTest/__unnamed_task__/MaxReturn -17.3001 +MetaTest/__unnamed_task__/MinReturn -27.3247 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.3154 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 95000 +------------------------------------------------- ---------- +2025-04-02 17:01:00 | [pearl_trainer] epoch #50 | Training... +2025-04-02 17:02:39 | [pearl_trainer] epoch #50 | Evaluating... +2025-04-02 17:02:39 | [pearl_trainer] epoch #50 | Sampling for adapation and meta-testing... +2025-04-02 17:04:29 | [pearl_trainer] epoch #50 | Finished meta-testing... +2025-04-02 17:04:29 | [pearl_trainer] epoch #50 | Saving snapshot... +2025-04-02 17:04:31 | [pearl_trainer] epoch #50 | Saved +2025-04-02 17:04:31 | [pearl_trainer] epoch #50 | Time 11983.64 s +2025-04-02 17:04:31 | [pearl_trainer] epoch #50 | EpochTime 241.65 s +------------------------------------------------- ---------- +MetaTest/Average/AverageDiscountedReturn -14.6048 +MetaTest/Average/AverageReturn -14.6048 +MetaTest/Average/Iteration 50 +MetaTest/Average/MaxReturn 30.7293 +MetaTest/Average/MinReturn -35.7581 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 23.2851 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -14.6048 +MetaTest/__unnamed_task__/AverageReturn -14.6048 +MetaTest/__unnamed_task__/Iteration 50 +MetaTest/__unnamed_task__/MaxReturn 30.7293 +MetaTest/__unnamed_task__/MinReturn -35.7581 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 23.2851 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 96600 +------------------------------------------------- ---------- +2025-04-02 17:05:05 | [pearl_trainer] epoch #51 | Training... +2025-04-02 17:06:30 | [pearl_trainer] epoch #51 | Evaluating... +2025-04-02 17:06:30 | [pearl_trainer] epoch #51 | Sampling for adapation and meta-testing... +2025-04-02 17:08:22 | [pearl_trainer] epoch #51 | Finished meta-testing... +2025-04-02 17:08:22 | [pearl_trainer] epoch #51 | Saving snapshot... +2025-04-02 17:08:23 | [pearl_trainer] epoch #51 | Saved +2025-04-02 17:08:23 | [pearl_trainer] epoch #51 | Time 12216.25 s +2025-04-02 17:08:23 | [pearl_trainer] epoch #51 | EpochTime 232.61 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -18.6899 +MetaTest/Average/AverageReturn -18.6899 +MetaTest/Average/Iteration 51 +MetaTest/Average/MaxReturn -10.483 +MetaTest/Average/MinReturn -27.658 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 5.82561 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -18.6899 +MetaTest/__unnamed_task__/AverageReturn -18.6899 +MetaTest/__unnamed_task__/Iteration 51 +MetaTest/__unnamed_task__/MaxReturn -10.483 +MetaTest/__unnamed_task__/MinReturn -27.658 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 5.82561 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 98200 +------------------------------------------------- ----------- +2025-04-02 17:08:52 | [pearl_trainer] epoch #52 | Training... +2025-04-02 17:10:24 | [pearl_trainer] epoch #52 | Evaluating... +2025-04-02 17:10:24 | [pearl_trainer] epoch #52 | Sampling for adapation and meta-testing... +2025-04-02 17:12:10 | [pearl_trainer] epoch #52 | Finished meta-testing... +2025-04-02 17:12:10 | [pearl_trainer] epoch #52 | Saving snapshot... +2025-04-02 17:12:11 | [pearl_trainer] epoch #52 | Saved +2025-04-02 17:12:11 | [pearl_trainer] epoch #52 | Time 12444.05 s +2025-04-02 17:12:11 | [pearl_trainer] epoch #52 | EpochTime 227.80 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -13.0592 +MetaTest/Average/AverageReturn -13.0592 +MetaTest/Average/Iteration 52 +MetaTest/Average/MaxReturn 3.1844 +MetaTest/Average/MinReturn -21.4686 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 8.64636 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -13.0592 +MetaTest/__unnamed_task__/AverageReturn -13.0592 +MetaTest/__unnamed_task__/Iteration 52 +MetaTest/__unnamed_task__/MaxReturn 3.1844 +MetaTest/__unnamed_task__/MinReturn -21.4686 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 8.64636 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 99800 +------------------------------------------------- ----------- +2025-04-02 17:12:46 | [pearl_trainer] epoch #53 | Training... +2025-04-02 17:14:12 | [pearl_trainer] epoch #53 | Evaluating... +2025-04-02 17:14:12 | [pearl_trainer] epoch #53 | Sampling for adapation and meta-testing... +2025-04-02 17:16:03 | [pearl_trainer] epoch #53 | Finished meta-testing... +2025-04-02 17:16:03 | [pearl_trainer] epoch #53 | Saving snapshot... +2025-04-02 17:16:04 | [pearl_trainer] epoch #53 | Saved +2025-04-02 17:16:04 | [pearl_trainer] epoch #53 | Time 12676.65 s +2025-04-02 17:16:04 | [pearl_trainer] epoch #53 | EpochTime 232.60 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -19.7669 +MetaTest/Average/AverageReturn -19.7669 +MetaTest/Average/Iteration 53 +MetaTest/Average/MaxReturn -13.5748 +MetaTest/Average/MinReturn -33.1753 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 6.98113 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -19.7669 +MetaTest/__unnamed_task__/AverageReturn -19.7669 +MetaTest/__unnamed_task__/Iteration 53 +MetaTest/__unnamed_task__/MaxReturn -13.5748 +MetaTest/__unnamed_task__/MinReturn -33.1753 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 6.98113 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 101400 +------------------------------------------------- ------------ +2025-04-02 17:16:34 | [pearl_trainer] epoch #54 | Training... +2025-04-02 17:18:15 | [pearl_trainer] epoch #54 | Evaluating... +2025-04-02 17:18:15 | [pearl_trainer] epoch #54 | Sampling for adapation and meta-testing... +2025-04-02 17:20:20 | [pearl_trainer] epoch #54 | Finished meta-testing... +2025-04-02 17:20:20 | [pearl_trainer] epoch #54 | Saving snapshot... +2025-04-02 17:20:21 | [pearl_trainer] epoch #54 | Saved +2025-04-02 17:20:21 | [pearl_trainer] epoch #54 | Time 12934.45 s +2025-04-02 17:20:21 | [pearl_trainer] epoch #54 | EpochTime 257.79 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -7.4869 +MetaTest/Average/AverageReturn -7.4869 +MetaTest/Average/Iteration 54 +MetaTest/Average/MaxReturn 2.28001 +MetaTest/Average/MinReturn -21.0622 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 10.0403 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -7.4869 +MetaTest/__unnamed_task__/AverageReturn -7.4869 +MetaTest/__unnamed_task__/Iteration 54 +MetaTest/__unnamed_task__/MaxReturn 2.28001 +MetaTest/__unnamed_task__/MinReturn -21.0622 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 10.0403 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 103000 +------------------------------------------------- ------------ +2025-04-02 17:20:56 | [pearl_trainer] epoch #55 | Training... +2025-04-02 17:22:21 | [pearl_trainer] epoch #55 | Evaluating... +2025-04-02 17:22:21 | [pearl_trainer] epoch #55 | Sampling for adapation and meta-testing... +2025-04-02 17:24:17 | [pearl_trainer] epoch #55 | Finished meta-testing... +2025-04-02 17:24:17 | [pearl_trainer] epoch #55 | Saving snapshot... +2025-04-02 17:24:18 | [pearl_trainer] epoch #55 | Saved +2025-04-02 17:24:18 | [pearl_trainer] epoch #55 | Time 13170.93 s +2025-04-02 17:24:18 | [pearl_trainer] epoch #55 | EpochTime 236.48 s +------------------------------------------------- ------------- +MetaTest/Average/AverageDiscountedReturn -16.0532 +MetaTest/Average/AverageReturn -16.0532 +MetaTest/Average/Iteration 55 +MetaTest/Average/MaxReturn -0.601947 +MetaTest/Average/MinReturn -26.6475 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 8.51417 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -16.0532 +MetaTest/__unnamed_task__/AverageReturn -16.0532 +MetaTest/__unnamed_task__/Iteration 55 +MetaTest/__unnamed_task__/MaxReturn -0.601947 +MetaTest/__unnamed_task__/MinReturn -26.6475 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 8.51417 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 104600 +------------------------------------------------- ------------- +2025-04-02 17:24:49 | [pearl_trainer] epoch #56 | Training... +2025-04-02 17:26:22 | [pearl_trainer] epoch #56 | Evaluating... +2025-04-02 17:26:22 | [pearl_trainer] epoch #56 | Sampling for adapation and meta-testing... +2025-04-02 17:28:10 | [pearl_trainer] epoch #56 | Finished meta-testing... +2025-04-02 17:28:10 | [pearl_trainer] epoch #56 | Saving snapshot... +2025-04-02 17:28:11 | [pearl_trainer] epoch #56 | Saved +2025-04-02 17:28:11 | [pearl_trainer] epoch #56 | Time 13404.11 s +2025-04-02 17:28:11 | [pearl_trainer] epoch #56 | EpochTime 233.19 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -19.0072 +MetaTest/Average/AverageReturn -19.0072 +MetaTest/Average/Iteration 56 +MetaTest/Average/MaxReturn -10.2165 +MetaTest/Average/MinReturn -24.3514 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 5.11215 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -19.0072 +MetaTest/__unnamed_task__/AverageReturn -19.0072 +MetaTest/__unnamed_task__/Iteration 56 +MetaTest/__unnamed_task__/MaxReturn -10.2165 +MetaTest/__unnamed_task__/MinReturn -24.3514 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 5.11215 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 106200 +------------------------------------------------- ------------ +2025-04-02 17:28:43 | [pearl_trainer] epoch #57 | Training... +2025-04-02 17:30:15 | [pearl_trainer] epoch #57 | Evaluating... +2025-04-02 17:30:15 | [pearl_trainer] epoch #57 | Sampling for adapation and meta-testing... +2025-04-02 17:32:12 | [pearl_trainer] epoch #57 | Finished meta-testing... +2025-04-02 17:32:12 | [pearl_trainer] epoch #57 | Saving snapshot... +2025-04-02 17:32:13 | [pearl_trainer] epoch #57 | Saved +2025-04-02 17:32:13 | [pearl_trainer] epoch #57 | Time 13646.38 s +2025-04-02 17:32:13 | [pearl_trainer] epoch #57 | EpochTime 242.26 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -16.7068 +MetaTest/Average/AverageReturn -16.7068 +MetaTest/Average/Iteration 57 +MetaTest/Average/MaxReturn -12.8559 +MetaTest/Average/MinReturn -20.262 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 2.57581 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -16.7068 +MetaTest/__unnamed_task__/AverageReturn -16.7068 +MetaTest/__unnamed_task__/Iteration 57 +MetaTest/__unnamed_task__/MaxReturn -12.8559 +MetaTest/__unnamed_task__/MinReturn -20.262 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 2.57581 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 107800 +------------------------------------------------- ------------ +2025-04-02 17:32:44 | [pearl_trainer] epoch #58 | Training... +2025-04-02 17:34:16 | [pearl_trainer] epoch #58 | Evaluating... +2025-04-02 17:34:16 | [pearl_trainer] epoch #58 | Sampling for adapation and meta-testing... +2025-04-02 17:36:03 | [pearl_trainer] epoch #58 | Finished meta-testing... +2025-04-02 17:36:03 | [pearl_trainer] epoch #58 | Saving snapshot... +2025-04-02 17:36:04 | [pearl_trainer] epoch #58 | Saved +2025-04-02 17:36:04 | [pearl_trainer] epoch #58 | Time 13876.97 s +2025-04-02 17:36:04 | [pearl_trainer] epoch #58 | EpochTime 230.58 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -16.4972 +MetaTest/Average/AverageReturn -16.4972 +MetaTest/Average/Iteration 58 +MetaTest/Average/MaxReturn 6.42197 +MetaTest/Average/MinReturn -44.0633 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 16.7877 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -16.4972 +MetaTest/__unnamed_task__/AverageReturn -16.4972 +MetaTest/__unnamed_task__/Iteration 58 +MetaTest/__unnamed_task__/MaxReturn 6.42197 +MetaTest/__unnamed_task__/MinReturn -44.0633 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 16.7877 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 109400 +------------------------------------------------- ------------ +2025-04-02 17:36:34 | [pearl_trainer] epoch #59 | Training... +2025-04-02 17:38:04 | [pearl_trainer] epoch #59 | Evaluating... +2025-04-02 17:38:04 | [pearl_trainer] epoch #59 | Sampling for adapation and meta-testing... +2025-04-02 17:40:00 | [pearl_trainer] epoch #59 | Finished meta-testing... +2025-04-02 17:40:00 | [pearl_trainer] epoch #59 | Saving snapshot... +2025-04-02 17:40:01 | [pearl_trainer] epoch #59 | Saved +2025-04-02 17:40:01 | [pearl_trainer] epoch #59 | Time 14113.90 s +2025-04-02 17:40:01 | [pearl_trainer] epoch #59 | EpochTime 236.92 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -20.6392 +MetaTest/Average/AverageReturn -20.6392 +MetaTest/Average/Iteration 59 +MetaTest/Average/MaxReturn -15.9933 +MetaTest/Average/MinReturn -26.7605 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 4.23911 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -20.6392 +MetaTest/__unnamed_task__/AverageReturn -20.6392 +MetaTest/__unnamed_task__/Iteration 59 +MetaTest/__unnamed_task__/MaxReturn -15.9933 +MetaTest/__unnamed_task__/MinReturn -26.7605 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 4.23911 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 111000 +------------------------------------------------- ------------ +2025-04-02 17:40:32 | [pearl_trainer] epoch #60 | Training... +2025-04-02 17:41:58 | [pearl_trainer] epoch #60 | Evaluating... +2025-04-02 17:41:58 | [pearl_trainer] epoch #60 | Sampling for adapation and meta-testing... +2025-04-02 17:43:51 | [pearl_trainer] epoch #60 | Finished meta-testing... +2025-04-02 17:43:51 | [pearl_trainer] epoch #60 | Saving snapshot... +2025-04-02 17:43:52 | [pearl_trainer] epoch #60 | Saved +2025-04-02 17:43:52 | [pearl_trainer] epoch #60 | Time 14345.30 s +2025-04-02 17:43:52 | [pearl_trainer] epoch #60 | EpochTime 231.40 s +------------------------------------------------- ------------- +MetaTest/Average/AverageDiscountedReturn -12.3364 +MetaTest/Average/AverageReturn -12.3364 +MetaTest/Average/Iteration 60 +MetaTest/Average/MaxReturn 0.762693 +MetaTest/Average/MinReturn -30.1815 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 10.3763 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -12.3364 +MetaTest/__unnamed_task__/AverageReturn -12.3364 +MetaTest/__unnamed_task__/Iteration 60 +MetaTest/__unnamed_task__/MaxReturn 0.762693 +MetaTest/__unnamed_task__/MinReturn -30.1815 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 10.3763 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 112600 +------------------------------------------------- ------------- +2025-04-02 17:44:24 | [pearl_trainer] epoch #61 | Training... +2025-04-02 17:45:51 | [pearl_trainer] epoch #61 | Evaluating... +2025-04-02 17:45:51 | [pearl_trainer] epoch #61 | Sampling for adapation and meta-testing... +2025-04-02 17:47:45 | [pearl_trainer] epoch #61 | Finished meta-testing... +2025-04-02 17:47:45 | [pearl_trainer] epoch #61 | Saving snapshot... +2025-04-02 17:47:46 | [pearl_trainer] epoch #61 | Saved +2025-04-02 17:47:46 | [pearl_trainer] epoch #61 | Time 14578.89 s +2025-04-02 17:47:46 | [pearl_trainer] epoch #61 | EpochTime 233.59 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -9.44381 +MetaTest/Average/AverageReturn -9.44381 +MetaTest/Average/Iteration 61 +MetaTest/Average/MaxReturn 26.2316 +MetaTest/Average/MinReturn -20.511 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 18.0568 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -9.44381 +MetaTest/__unnamed_task__/AverageReturn -9.44381 +MetaTest/__unnamed_task__/Iteration 61 +MetaTest/__unnamed_task__/MaxReturn 26.2316 +MetaTest/__unnamed_task__/MinReturn -20.511 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 18.0568 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 114200 +------------------------------------------------- ------------ +2025-04-02 17:48:17 | [pearl_trainer] epoch #62 | Training... +2025-04-02 17:49:49 | [pearl_trainer] epoch #62 | Evaluating... +2025-04-02 17:49:49 | [pearl_trainer] epoch #62 | Sampling for adapation and meta-testing... +2025-04-02 17:51:46 | [pearl_trainer] epoch #62 | Finished meta-testing... +2025-04-02 17:51:46 | [pearl_trainer] epoch #62 | Saving snapshot... +2025-04-02 17:51:47 | [pearl_trainer] epoch #62 | Saved +2025-04-02 17:51:47 | [pearl_trainer] epoch #62 | Time 14819.99 s +2025-04-02 17:51:47 | [pearl_trainer] epoch #62 | EpochTime 241.10 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -14.4411 +MetaTest/Average/AverageReturn -14.4411 +MetaTest/Average/Iteration 62 +MetaTest/Average/MaxReturn -4.64078 +MetaTest/Average/MinReturn -25.5 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 8.26857 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -14.4411 +MetaTest/__unnamed_task__/AverageReturn -14.4411 +MetaTest/__unnamed_task__/Iteration 62 +MetaTest/__unnamed_task__/MaxReturn -4.64078 +MetaTest/__unnamed_task__/MinReturn -25.5 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 8.26857 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 115800 +------------------------------------------------- ------------ +2025-04-02 17:52:19 | [pearl_trainer] epoch #63 | Training... +2025-04-02 17:53:53 | [pearl_trainer] epoch #63 | Evaluating... +2025-04-02 17:53:53 | [pearl_trainer] epoch #63 | Sampling for adapation and meta-testing... +2025-04-02 17:55:43 | [pearl_trainer] epoch #63 | Finished meta-testing... +2025-04-02 17:55:43 | [pearl_trainer] epoch #63 | Saving snapshot... +2025-04-02 17:55:44 | [pearl_trainer] epoch #63 | Saved +2025-04-02 17:55:44 | [pearl_trainer] epoch #63 | Time 15057.08 s +2025-04-02 17:55:44 | [pearl_trainer] epoch #63 | EpochTime 237.09 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -12.1931 +MetaTest/Average/AverageReturn -12.1931 +MetaTest/Average/Iteration 63 +MetaTest/Average/MaxReturn 12.4146 +MetaTest/Average/MinReturn -20.4136 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 12.5806 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -12.1931 +MetaTest/__unnamed_task__/AverageReturn -12.1931 +MetaTest/__unnamed_task__/Iteration 63 +MetaTest/__unnamed_task__/MaxReturn 12.4146 +MetaTest/__unnamed_task__/MinReturn -20.4136 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 12.5806 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 117400 +------------------------------------------------- ----------- +2025-04-02 17:56:16 | [pearl_trainer] epoch #64 | Training... +2025-04-02 17:57:46 | [pearl_trainer] epoch #64 | Evaluating... +2025-04-02 17:57:46 | [pearl_trainer] epoch #64 | Sampling for adapation and meta-testing... +2025-04-02 17:59:37 | [pearl_trainer] epoch #64 | Finished meta-testing... +2025-04-02 17:59:37 | [pearl_trainer] epoch #64 | Saving snapshot... +2025-04-02 17:59:38 | [pearl_trainer] epoch #64 | Saved +2025-04-02 17:59:38 | [pearl_trainer] epoch #64 | Time 15291.30 s +2025-04-02 17:59:38 | [pearl_trainer] epoch #64 | EpochTime 234.21 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -22.4497 +MetaTest/Average/AverageReturn -22.4497 +MetaTest/Average/Iteration 64 +MetaTest/Average/MaxReturn -16.6925 +MetaTest/Average/MinReturn -27.0379 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.59367 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -22.4497 +MetaTest/__unnamed_task__/AverageReturn -22.4497 +MetaTest/__unnamed_task__/Iteration 64 +MetaTest/__unnamed_task__/MaxReturn -16.6925 +MetaTest/__unnamed_task__/MinReturn -27.0379 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.59367 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 119000 +------------------------------------------------- ------------ +2025-04-02 18:00:11 | [pearl_trainer] epoch #65 | Training... +2025-04-02 18:01:38 | [pearl_trainer] epoch #65 | Evaluating... +2025-04-02 18:01:38 | [pearl_trainer] epoch #65 | Sampling for adapation and meta-testing... +2025-04-02 18:03:31 | [pearl_trainer] epoch #65 | Finished meta-testing... +2025-04-02 18:03:31 | [pearl_trainer] epoch #65 | Saving snapshot... +2025-04-02 18:03:33 | [pearl_trainer] epoch #65 | Saved +2025-04-02 18:03:33 | [pearl_trainer] epoch #65 | Time 15525.54 s +2025-04-02 18:03:33 | [pearl_trainer] epoch #65 | EpochTime 234.24 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -15.3694 +MetaTest/Average/AverageReturn -15.3694 +MetaTest/Average/Iteration 65 +MetaTest/Average/MaxReturn -13.4706 +MetaTest/Average/MinReturn -18.0059 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 1.7272 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -15.3694 +MetaTest/__unnamed_task__/AverageReturn -15.3694 +MetaTest/__unnamed_task__/Iteration 65 +MetaTest/__unnamed_task__/MaxReturn -13.4706 +MetaTest/__unnamed_task__/MinReturn -18.0059 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 1.7272 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 120600 +------------------------------------------------- ----------- +2025-04-02 18:04:05 | [pearl_trainer] epoch #66 | Training... +2025-04-02 18:05:32 | [pearl_trainer] epoch #66 | Evaluating... +2025-04-02 18:05:32 | [pearl_trainer] epoch #66 | Sampling for adapation and meta-testing... +2025-04-02 18:07:28 | [pearl_trainer] epoch #66 | Finished meta-testing... +2025-04-02 18:07:28 | [pearl_trainer] epoch #66 | Saving snapshot... +2025-04-02 18:07:29 | [pearl_trainer] epoch #66 | Saved +2025-04-02 18:07:29 | [pearl_trainer] epoch #66 | Time 15762.08 s +2025-04-02 18:07:29 | [pearl_trainer] epoch #66 | EpochTime 236.53 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -13.691 +MetaTest/Average/AverageReturn -13.691 +MetaTest/Average/Iteration 66 +MetaTest/Average/MaxReturn 11.8004 +MetaTest/Average/MinReturn -33.1414 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 17.2703 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -13.691 +MetaTest/__unnamed_task__/AverageReturn -13.691 +MetaTest/__unnamed_task__/Iteration 66 +MetaTest/__unnamed_task__/MaxReturn 11.8004 +MetaTest/__unnamed_task__/MinReturn -33.1414 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 17.2703 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 122200 +------------------------------------------------- ----------- +2025-04-02 18:08:00 | [pearl_trainer] epoch #67 | Training... +2025-04-02 18:09:24 | [pearl_trainer] epoch #67 | Evaluating... +2025-04-02 18:09:24 | [pearl_trainer] epoch #67 | Sampling for adapation and meta-testing... +2025-04-02 18:11:13 | [pearl_trainer] epoch #67 | Finished meta-testing... +2025-04-02 18:11:13 | [pearl_trainer] epoch #67 | Saving snapshot... +2025-04-02 18:11:14 | [pearl_trainer] epoch #67 | Saved +2025-04-02 18:11:14 | [pearl_trainer] epoch #67 | Time 15986.94 s +2025-04-02 18:11:14 | [pearl_trainer] epoch #67 | EpochTime 224.86 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -17.4585 +MetaTest/Average/AverageReturn -17.4585 +MetaTest/Average/Iteration 67 +MetaTest/Average/MaxReturn 5.65334 +MetaTest/Average/MinReturn -35.3884 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 14.271 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -17.4585 +MetaTest/__unnamed_task__/AverageReturn -17.4585 +MetaTest/__unnamed_task__/Iteration 67 +MetaTest/__unnamed_task__/MaxReturn 5.65334 +MetaTest/__unnamed_task__/MinReturn -35.3884 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 14.271 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 123800 +------------------------------------------------- ------------ +2025-04-02 18:11:49 | [pearl_trainer] epoch #68 | Training... +2025-04-02 18:13:20 | [pearl_trainer] epoch #68 | Evaluating... +2025-04-02 18:13:20 | [pearl_trainer] epoch #68 | Sampling for adapation and meta-testing... +2025-04-02 18:15:17 | [pearl_trainer] epoch #68 | Finished meta-testing... +2025-04-02 18:15:17 | [pearl_trainer] epoch #68 | Saving snapshot... +2025-04-02 18:15:18 | [pearl_trainer] epoch #68 | Saved +2025-04-02 18:15:18 | [pearl_trainer] epoch #68 | Time 16231.11 s +2025-04-02 18:15:18 | [pearl_trainer] epoch #68 | EpochTime 244.17 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -15.6053 +MetaTest/Average/AverageReturn -15.6053 +MetaTest/Average/Iteration 68 +MetaTest/Average/MaxReturn 2.25108 +MetaTest/Average/MinReturn -25.4292 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 9.45388 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -15.6053 +MetaTest/__unnamed_task__/AverageReturn -15.6053 +MetaTest/__unnamed_task__/Iteration 68 +MetaTest/__unnamed_task__/MaxReturn 2.25108 +MetaTest/__unnamed_task__/MinReturn -25.4292 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 9.45388 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 125400 +------------------------------------------------- ------------ +2025-04-02 18:15:49 | [pearl_trainer] epoch #69 | Training... +2025-04-02 18:17:24 | [pearl_trainer] epoch #69 | Evaluating... +2025-04-02 18:17:24 | [pearl_trainer] epoch #69 | Sampling for adapation and meta-testing... +2025-04-02 18:19:13 | [pearl_trainer] epoch #69 | Finished meta-testing... +2025-04-02 18:19:13 | [pearl_trainer] epoch #69 | Saving snapshot... +2025-04-02 18:19:14 | [pearl_trainer] epoch #69 | Saved +2025-04-02 18:19:14 | [pearl_trainer] epoch #69 | Time 16466.89 s +2025-04-02 18:19:14 | [pearl_trainer] epoch #69 | EpochTime 235.77 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -27.4205 +MetaTest/Average/AverageReturn -27.4205 +MetaTest/Average/Iteration 69 +MetaTest/Average/MaxReturn -14.4395 +MetaTest/Average/MinReturn -61.1165 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 17.0167 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -27.4205 +MetaTest/__unnamed_task__/AverageReturn -27.4205 +MetaTest/__unnamed_task__/Iteration 69 +MetaTest/__unnamed_task__/MaxReturn -14.4395 +MetaTest/__unnamed_task__/MinReturn -61.1165 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 17.0167 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 127000 +------------------------------------------------- ----------- +2025-04-02 18:19:46 | [pearl_trainer] epoch #70 | Training... +2025-04-02 18:21:14 | [pearl_trainer] epoch #70 | Evaluating... +2025-04-02 18:21:14 | [pearl_trainer] epoch #70 | Sampling for adapation and meta-testing... +2025-04-02 18:23:10 | [pearl_trainer] epoch #70 | Finished meta-testing... +2025-04-02 18:23:10 | [pearl_trainer] epoch #70 | Saving snapshot... +2025-04-02 18:23:11 | [pearl_trainer] epoch #70 | Saved +2025-04-02 18:23:11 | [pearl_trainer] epoch #70 | Time 16704.09 s +2025-04-02 18:23:11 | [pearl_trainer] epoch #70 | EpochTime 237.20 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -21.1282 +MetaTest/Average/AverageReturn -21.1282 +MetaTest/Average/Iteration 70 +MetaTest/Average/MaxReturn -14.0452 +MetaTest/Average/MinReturn -26.9983 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 4.15635 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -21.1282 +MetaTest/__unnamed_task__/AverageReturn -21.1282 +MetaTest/__unnamed_task__/Iteration 70 +MetaTest/__unnamed_task__/MaxReturn -14.0452 +MetaTest/__unnamed_task__/MinReturn -26.9983 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 4.15635 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 128600 +------------------------------------------------- ------------ +2025-04-02 18:23:43 | [pearl_trainer] epoch #71 | Training... +2025-04-02 18:25:23 | [pearl_trainer] epoch #71 | Evaluating... +2025-04-02 18:25:23 | [pearl_trainer] epoch #71 | Sampling for adapation and meta-testing... +2025-04-02 18:27:08 | [pearl_trainer] epoch #71 | Finished meta-testing... +2025-04-02 18:27:08 | [pearl_trainer] epoch #71 | Saving snapshot... +2025-04-02 18:27:09 | [pearl_trainer] epoch #71 | Saved +2025-04-02 18:27:09 | [pearl_trainer] epoch #71 | Time 16942.17 s +2025-04-02 18:27:09 | [pearl_trainer] epoch #71 | EpochTime 238.08 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -16.4811 +MetaTest/Average/AverageReturn -16.4811 +MetaTest/Average/Iteration 71 +MetaTest/Average/MaxReturn -4.47035 +MetaTest/Average/MinReturn -22.2282 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 6.2117 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -16.4811 +MetaTest/__unnamed_task__/AverageReturn -16.4811 +MetaTest/__unnamed_task__/Iteration 71 +MetaTest/__unnamed_task__/MaxReturn -4.47035 +MetaTest/__unnamed_task__/MinReturn -22.2282 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 6.2117 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 130200 +------------------------------------------------- ------------ +2025-04-02 18:27:44 | [pearl_trainer] epoch #72 | Training... +2025-04-02 18:29:13 | [pearl_trainer] epoch #72 | Evaluating... +2025-04-02 18:29:13 | [pearl_trainer] epoch #72 | Sampling for adapation and meta-testing... +2025-04-02 18:31:11 | [pearl_trainer] epoch #72 | Finished meta-testing... +2025-04-02 18:31:11 | [pearl_trainer] epoch #72 | Saving snapshot... +2025-04-02 18:31:12 | [pearl_trainer] epoch #72 | Saved +2025-04-02 18:31:12 | [pearl_trainer] epoch #72 | Time 17184.82 s +2025-04-02 18:31:12 | [pearl_trainer] epoch #72 | EpochTime 242.66 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -15.3214 +MetaTest/Average/AverageReturn -15.3214 +MetaTest/Average/Iteration 72 +MetaTest/Average/MaxReturn -4.36592 +MetaTest/Average/MinReturn -24.3084 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 6.78318 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -15.3214 +MetaTest/__unnamed_task__/AverageReturn -15.3214 +MetaTest/__unnamed_task__/Iteration 72 +MetaTest/__unnamed_task__/MaxReturn -4.36592 +MetaTest/__unnamed_task__/MinReturn -24.3084 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 6.78318 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 131800 +------------------------------------------------- ------------ +2025-04-02 18:31:43 | [pearl_trainer] epoch #73 | Training... +2025-04-02 18:33:16 | [pearl_trainer] epoch #73 | Evaluating... +2025-04-02 18:33:16 | [pearl_trainer] epoch #73 | Sampling for adapation and meta-testing... +2025-04-02 18:35:06 | [pearl_trainer] epoch #73 | Finished meta-testing... +2025-04-02 18:35:06 | [pearl_trainer] epoch #73 | Saving snapshot... +2025-04-02 18:35:07 | [pearl_trainer] epoch #73 | Saved +2025-04-02 18:35:07 | [pearl_trainer] epoch #73 | Time 17420.51 s +2025-04-02 18:35:07 | [pearl_trainer] epoch #73 | EpochTime 235.68 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -8.29037 +MetaTest/Average/AverageReturn -8.29037 +MetaTest/Average/Iteration 73 +MetaTest/Average/MaxReturn 5.10001 +MetaTest/Average/MinReturn -21.2868 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 11.0865 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -8.29037 +MetaTest/__unnamed_task__/AverageReturn -8.29037 +MetaTest/__unnamed_task__/Iteration 73 +MetaTest/__unnamed_task__/MaxReturn 5.10001 +MetaTest/__unnamed_task__/MinReturn -21.2868 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 11.0865 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 133400 +------------------------------------------------- ------------ +2025-04-02 18:35:42 | [pearl_trainer] epoch #74 | Training... +2025-04-02 18:37:08 | [pearl_trainer] epoch #74 | Evaluating... +2025-04-02 18:37:08 | [pearl_trainer] epoch #74 | Sampling for adapation and meta-testing... +2025-04-02 18:38:59 | [pearl_trainer] epoch #74 | Finished meta-testing... +2025-04-02 18:38:59 | [pearl_trainer] epoch #74 | Saving snapshot... +2025-04-02 18:39:00 | [pearl_trainer] epoch #74 | Saved +2025-04-02 18:39:00 | [pearl_trainer] epoch #74 | Time 17652.57 s +2025-04-02 18:39:00 | [pearl_trainer] epoch #74 | EpochTime 232.06 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -23.2948 +MetaTest/Average/AverageReturn -23.2948 +MetaTest/Average/Iteration 74 +MetaTest/Average/MaxReturn -11.4082 +MetaTest/Average/MinReturn -52.5046 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 14.9541 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -23.2948 +MetaTest/__unnamed_task__/AverageReturn -23.2948 +MetaTest/__unnamed_task__/Iteration 74 +MetaTest/__unnamed_task__/MaxReturn -11.4082 +MetaTest/__unnamed_task__/MinReturn -52.5046 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 14.9541 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 135000 +------------------------------------------------- ----------- +2025-04-02 18:39:31 | [pearl_trainer] epoch #75 | Training... +2025-04-02 18:40:54 | [pearl_trainer] epoch #75 | Evaluating... +2025-04-02 18:40:54 | [pearl_trainer] epoch #75 | Sampling for adapation and meta-testing... +2025-04-02 18:42:47 | [pearl_trainer] epoch #75 | Finished meta-testing... +2025-04-02 18:42:47 | [pearl_trainer] epoch #75 | Saving snapshot... +2025-04-02 18:42:48 | [pearl_trainer] epoch #75 | Saved +2025-04-02 18:42:48 | [pearl_trainer] epoch #75 | Time 17881.02 s +2025-04-02 18:42:48 | [pearl_trainer] epoch #75 | EpochTime 228.45 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -10.9425 +MetaTest/Average/AverageReturn -10.9425 +MetaTest/Average/Iteration 75 +MetaTest/Average/MaxReturn 41.8836 +MetaTest/Average/MinReturn -57.3222 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 38.434 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -10.9425 +MetaTest/__unnamed_task__/AverageReturn -10.9425 +MetaTest/__unnamed_task__/Iteration 75 +MetaTest/__unnamed_task__/MaxReturn 41.8836 +MetaTest/__unnamed_task__/MinReturn -57.3222 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 38.434 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 136600 +------------------------------------------------- ----------- +2025-04-02 18:43:20 | [pearl_trainer] epoch #76 | Training... +2025-04-02 18:44:50 | [pearl_trainer] epoch #76 | Evaluating... +2025-04-02 18:44:50 | [pearl_trainer] epoch #76 | Sampling for adapation and meta-testing... +2025-04-02 18:46:41 | [pearl_trainer] epoch #76 | Finished meta-testing... +2025-04-02 18:46:41 | [pearl_trainer] epoch #76 | Saving snapshot... +2025-04-02 18:46:42 | [pearl_trainer] epoch #76 | Saved +2025-04-02 18:46:42 | [pearl_trainer] epoch #76 | Time 18114.75 s +2025-04-02 18:46:42 | [pearl_trainer] epoch #76 | EpochTime 233.73 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -30.7361 +MetaTest/Average/AverageReturn -30.7361 +MetaTest/Average/Iteration 76 +MetaTest/Average/MaxReturn -16.1194 +MetaTest/Average/MinReturn -54.8263 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 13.251 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -30.7361 +MetaTest/__unnamed_task__/AverageReturn -30.7361 +MetaTest/__unnamed_task__/Iteration 76 +MetaTest/__unnamed_task__/MaxReturn -16.1194 +MetaTest/__unnamed_task__/MinReturn -54.8263 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 13.251 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 138200 +------------------------------------------------- ----------- +2025-04-02 18:47:15 | [pearl_trainer] epoch #77 | Training... +2025-04-02 18:48:41 | [pearl_trainer] epoch #77 | Evaluating... +2025-04-02 18:48:41 | [pearl_trainer] epoch #77 | Sampling for adapation and meta-testing... +2025-04-02 18:50:35 | [pearl_trainer] epoch #77 | Finished meta-testing... +2025-04-02 18:50:35 | [pearl_trainer] epoch #77 | Saving snapshot... +2025-04-02 18:50:36 | [pearl_trainer] epoch #77 | Saved +2025-04-02 18:50:36 | [pearl_trainer] epoch #77 | Time 18349.46 s +2025-04-02 18:50:36 | [pearl_trainer] epoch #77 | EpochTime 234.70 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -29.5628 +MetaTest/Average/AverageReturn -29.5628 +MetaTest/Average/Iteration 77 +MetaTest/Average/MaxReturn -10.3618 +MetaTest/Average/MinReturn -72.3682 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 22.1515 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -29.5628 +MetaTest/__unnamed_task__/AverageReturn -29.5628 +MetaTest/__unnamed_task__/Iteration 77 +MetaTest/__unnamed_task__/MaxReturn -10.3618 +MetaTest/__unnamed_task__/MinReturn -72.3682 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 22.1515 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 139800 +------------------------------------------------- ----------- +2025-04-02 18:51:08 | [pearl_trainer] epoch #78 | Training... +2025-04-02 18:52:41 | [pearl_trainer] epoch #78 | Evaluating... +2025-04-02 18:52:41 | [pearl_trainer] epoch #78 | Sampling for adapation and meta-testing... +2025-04-02 18:54:32 | [pearl_trainer] epoch #78 | Finished meta-testing... +2025-04-02 18:54:32 | [pearl_trainer] epoch #78 | Saving snapshot... +2025-04-02 18:54:33 | [pearl_trainer] epoch #78 | Saved +2025-04-02 18:54:33 | [pearl_trainer] epoch #78 | Time 18586.29 s +2025-04-02 18:54:33 | [pearl_trainer] epoch #78 | EpochTime 236.83 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -13.0422 +MetaTest/Average/AverageReturn -13.0422 +MetaTest/Average/Iteration 78 +MetaTest/Average/MaxReturn 15.2914 +MetaTest/Average/MinReturn -24.5531 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 15.1265 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -13.0422 +MetaTest/__unnamed_task__/AverageReturn -13.0422 +MetaTest/__unnamed_task__/Iteration 78 +MetaTest/__unnamed_task__/MaxReturn 15.2914 +MetaTest/__unnamed_task__/MinReturn -24.5531 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 15.1265 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 141400 +------------------------------------------------- ----------- +2025-04-02 18:55:05 | [pearl_trainer] epoch #79 | Training... +2025-04-02 18:56:33 | [pearl_trainer] epoch #79 | Evaluating... +2025-04-02 18:56:33 | [pearl_trainer] epoch #79 | Sampling for adapation and meta-testing... +2025-04-02 18:58:37 | [pearl_trainer] epoch #79 | Finished meta-testing... +2025-04-02 18:58:37 | [pearl_trainer] epoch #79 | Saving snapshot... +2025-04-02 18:58:38 | [pearl_trainer] epoch #79 | Saved +2025-04-02 18:58:38 | [pearl_trainer] epoch #79 | Time 18830.75 s +2025-04-02 18:58:38 | [pearl_trainer] epoch #79 | EpochTime 244.45 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -30.2633 +MetaTest/Average/AverageReturn -30.2633 +MetaTest/Average/Iteration 79 +MetaTest/Average/MaxReturn -15.9081 +MetaTest/Average/MinReturn -65.7903 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 18.3192 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -30.2633 +MetaTest/__unnamed_task__/AverageReturn -30.2633 +MetaTest/__unnamed_task__/Iteration 79 +MetaTest/__unnamed_task__/MaxReturn -15.9081 +MetaTest/__unnamed_task__/MinReturn -65.7903 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 18.3192 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 143000 +------------------------------------------------- ----------- +2025-04-02 18:59:09 | [pearl_trainer] epoch #80 | Training... +2025-04-02 19:00:46 | [pearl_trainer] epoch #80 | Evaluating... +2025-04-02 19:00:46 | [pearl_trainer] epoch #80 | Sampling for adapation and meta-testing... +2025-04-02 19:02:35 | [pearl_trainer] epoch #80 | Finished meta-testing... +2025-04-02 19:02:35 | [pearl_trainer] epoch #80 | Saving snapshot... +2025-04-02 19:02:36 | [pearl_trainer] epoch #80 | Saved +2025-04-02 19:02:36 | [pearl_trainer] epoch #80 | Time 19069.46 s +2025-04-02 19:02:36 | [pearl_trainer] epoch #80 | EpochTime 238.71 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -22.8721 +MetaTest/Average/AverageReturn -22.8721 +MetaTest/Average/Iteration 80 +MetaTest/Average/MaxReturn -7.95111 +MetaTest/Average/MinReturn -39.8049 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 10.2708 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -22.8721 +MetaTest/__unnamed_task__/AverageReturn -22.8721 +MetaTest/__unnamed_task__/Iteration 80 +MetaTest/__unnamed_task__/MaxReturn -7.95111 +MetaTest/__unnamed_task__/MinReturn -39.8049 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 10.2708 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 144600 +------------------------------------------------- ------------ +2025-04-02 19:03:09 | [pearl_trainer] epoch #81 | Training... +2025-04-02 19:04:36 | [pearl_trainer] epoch #81 | Evaluating... +2025-04-02 19:04:36 | [pearl_trainer] epoch #81 | Sampling for adapation and meta-testing... +2025-04-02 19:06:29 | [pearl_trainer] epoch #81 | Finished meta-testing... +2025-04-02 19:06:29 | [pearl_trainer] epoch #81 | Saving snapshot... +2025-04-02 19:06:30 | [pearl_trainer] epoch #81 | Saved +2025-04-02 19:06:30 | [pearl_trainer] epoch #81 | Time 19302.89 s +2025-04-02 19:06:30 | [pearl_trainer] epoch #81 | EpochTime 233.43 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -18.9416 +MetaTest/Average/AverageReturn -18.9416 +MetaTest/Average/Iteration 81 +MetaTest/Average/MaxReturn -2.15714 +MetaTest/Average/MinReturn -30.9748 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 9.35309 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -18.9416 +MetaTest/__unnamed_task__/AverageReturn -18.9416 +MetaTest/__unnamed_task__/Iteration 81 +MetaTest/__unnamed_task__/MaxReturn -2.15714 +MetaTest/__unnamed_task__/MinReturn -30.9748 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 9.35309 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 146200 +------------------------------------------------- ------------ +2025-04-02 19:07:02 | [pearl_trainer] epoch #82 | Training... +2025-04-02 19:08:34 | [pearl_trainer] epoch #82 | Evaluating... +2025-04-02 19:08:34 | [pearl_trainer] epoch #82 | Sampling for adapation and meta-testing... +2025-04-02 19:10:22 | [pearl_trainer] epoch #82 | Finished meta-testing... +2025-04-02 19:10:22 | [pearl_trainer] epoch #82 | Saving snapshot... +2025-04-02 19:10:23 | [pearl_trainer] epoch #82 | Saved +2025-04-02 19:10:23 | [pearl_trainer] epoch #82 | Time 19536.12 s +2025-04-02 19:10:23 | [pearl_trainer] epoch #82 | EpochTime 233.23 s +------------------------------------------------- ------------- +MetaTest/Average/AverageDiscountedReturn -19.3101 +MetaTest/Average/AverageReturn -19.3101 +MetaTest/Average/Iteration 82 +MetaTest/Average/MaxReturn -0.185232 +MetaTest/Average/MinReturn -35.0578 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 11.6972 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -19.3101 +MetaTest/__unnamed_task__/AverageReturn -19.3101 +MetaTest/__unnamed_task__/Iteration 82 +MetaTest/__unnamed_task__/MaxReturn -0.185232 +MetaTest/__unnamed_task__/MinReturn -35.0578 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 11.6972 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 147800 +------------------------------------------------- ------------- +2025-04-02 19:10:54 | [pearl_trainer] epoch #83 | Training... +2025-04-02 19:12:13 | [pearl_trainer] epoch #83 | Evaluating... +2025-04-02 19:12:13 | [pearl_trainer] epoch #83 | Sampling for adapation and meta-testing... +2025-04-02 19:14:07 | [pearl_trainer] epoch #83 | Finished meta-testing... +2025-04-02 19:14:07 | [pearl_trainer] epoch #83 | Saving snapshot... +2025-04-02 19:14:08 | [pearl_trainer] epoch #83 | Saved +2025-04-02 19:14:08 | [pearl_trainer] epoch #83 | Time 19761.22 s +2025-04-02 19:14:08 | [pearl_trainer] epoch #83 | EpochTime 225.09 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -24.835 +MetaTest/Average/AverageReturn -24.835 +MetaTest/Average/Iteration 83 +MetaTest/Average/MaxReturn -19.5619 +MetaTest/Average/MinReturn -37.8258 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 6.61806 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -24.835 +MetaTest/__unnamed_task__/AverageReturn -24.835 +MetaTest/__unnamed_task__/Iteration 83 +MetaTest/__unnamed_task__/MaxReturn -19.5619 +MetaTest/__unnamed_task__/MinReturn -37.8258 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 6.61806 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 149400 +------------------------------------------------- ------------ +2025-04-02 19:14:37 | [pearl_trainer] epoch #84 | Training... +2025-04-02 19:16:06 | [pearl_trainer] epoch #84 | Evaluating... +2025-04-02 19:16:06 | [pearl_trainer] epoch #84 | Sampling for adapation and meta-testing... +2025-04-02 19:17:53 | [pearl_trainer] epoch #84 | Finished meta-testing... +2025-04-02 19:17:53 | [pearl_trainer] epoch #84 | Saving snapshot... +2025-04-02 19:17:54 | [pearl_trainer] epoch #84 | Saved +2025-04-02 19:17:54 | [pearl_trainer] epoch #84 | Time 19986.78 s +2025-04-02 19:17:54 | [pearl_trainer] epoch #84 | EpochTime 225.56 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -18.4091 +MetaTest/Average/AverageReturn -18.4091 +MetaTest/Average/Iteration 84 +MetaTest/Average/MaxReturn -8.73567 +MetaTest/Average/MinReturn -24.4907 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 5.19381 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -18.4091 +MetaTest/__unnamed_task__/AverageReturn -18.4091 +MetaTest/__unnamed_task__/Iteration 84 +MetaTest/__unnamed_task__/MaxReturn -8.73567 +MetaTest/__unnamed_task__/MinReturn -24.4907 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 5.19381 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 151000 +------------------------------------------------- ------------ +2025-04-02 19:18:26 | [pearl_trainer] epoch #85 | Training... +2025-04-02 19:19:54 | [pearl_trainer] epoch #85 | Evaluating... +2025-04-02 19:19:54 | [pearl_trainer] epoch #85 | Sampling for adapation and meta-testing... +2025-04-02 19:21:46 | [pearl_trainer] epoch #85 | Finished meta-testing... +2025-04-02 19:21:46 | [pearl_trainer] epoch #85 | Saving snapshot... +2025-04-02 19:21:47 | [pearl_trainer] epoch #85 | Saved +2025-04-02 19:21:47 | [pearl_trainer] epoch #85 | Time 20220.45 s +2025-04-02 19:21:47 | [pearl_trainer] epoch #85 | EpochTime 233.67 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -2.91662 +MetaTest/Average/AverageReturn -2.91662 +MetaTest/Average/Iteration 85 +MetaTest/Average/MaxReturn 26.6855 +MetaTest/Average/MinReturn -17.7474 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 16.1951 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -2.91662 +MetaTest/__unnamed_task__/AverageReturn -2.91662 +MetaTest/__unnamed_task__/Iteration 85 +MetaTest/__unnamed_task__/MaxReturn 26.6855 +MetaTest/__unnamed_task__/MinReturn -17.7474 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 16.1951 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 152600 +------------------------------------------------- ------------ +2025-04-02 19:22:19 | [pearl_trainer] epoch #86 | Training... +2025-04-02 19:23:50 | [pearl_trainer] epoch #86 | Evaluating... +2025-04-02 19:23:50 | [pearl_trainer] epoch #86 | Sampling for adapation and meta-testing... +2025-04-02 19:25:45 | [pearl_trainer] epoch #86 | Finished meta-testing... +2025-04-02 19:25:45 | [pearl_trainer] epoch #86 | Saving snapshot... +2025-04-02 19:25:46 | [pearl_trainer] epoch #86 | Saved +2025-04-02 19:25:46 | [pearl_trainer] epoch #86 | Time 20459.02 s +2025-04-02 19:25:46 | [pearl_trainer] epoch #86 | EpochTime 238.56 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -18.5537 +MetaTest/Average/AverageReturn -18.5537 +MetaTest/Average/Iteration 86 +MetaTest/Average/MaxReturn 1.03536 +MetaTest/Average/MinReturn -33.4535 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 11.2177 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -18.5537 +MetaTest/__unnamed_task__/AverageReturn -18.5537 +MetaTest/__unnamed_task__/Iteration 86 +MetaTest/__unnamed_task__/MaxReturn 1.03536 +MetaTest/__unnamed_task__/MinReturn -33.4535 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 11.2177 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 154200 +------------------------------------------------- ------------ +2025-04-02 19:26:18 | [pearl_trainer] epoch #87 | Training... +2025-04-02 19:28:09 | [pearl_trainer] epoch #87 | Evaluating... +2025-04-02 19:28:09 | [pearl_trainer] epoch #87 | Sampling for adapation and meta-testing... +2025-04-02 19:30:00 | [pearl_trainer] epoch #87 | Finished meta-testing... +2025-04-02 19:30:00 | [pearl_trainer] epoch #87 | Saving snapshot... +2025-04-02 19:30:01 | [pearl_trainer] epoch #87 | Saved +2025-04-02 19:30:01 | [pearl_trainer] epoch #87 | Time 20713.56 s +2025-04-02 19:30:01 | [pearl_trainer] epoch #87 | EpochTime 254.54 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -9.40308 +MetaTest/Average/AverageReturn -9.40308 +MetaTest/Average/Iteration 87 +MetaTest/Average/MaxReturn 32.6162 +MetaTest/Average/MinReturn -24.76 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 21.3287 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -9.40308 +MetaTest/__unnamed_task__/AverageReturn -9.40308 +MetaTest/__unnamed_task__/Iteration 87 +MetaTest/__unnamed_task__/MaxReturn 32.6162 +MetaTest/__unnamed_task__/MinReturn -24.76 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 21.3287 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 155800 +------------------------------------------------- ------------ +2025-04-02 19:30:34 | [pearl_trainer] epoch #88 | Training... +2025-04-02 19:31:59 | [pearl_trainer] epoch #88 | Evaluating... +2025-04-02 19:31:59 | [pearl_trainer] epoch #88 | Sampling for adapation and meta-testing... +2025-04-02 19:33:51 | [pearl_trainer] epoch #88 | Finished meta-testing... +2025-04-02 19:33:51 | [pearl_trainer] epoch #88 | Saving snapshot... +2025-04-02 19:33:53 | [pearl_trainer] epoch #88 | Saved +2025-04-02 19:33:53 | [pearl_trainer] epoch #88 | Time 20945.68 s +2025-04-02 19:33:53 | [pearl_trainer] epoch #88 | EpochTime 232.12 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -7.887 +MetaTest/Average/AverageReturn -7.887 +MetaTest/Average/Iteration 88 +MetaTest/Average/MaxReturn 36.6997 +MetaTest/Average/MinReturn -47.4895 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 27.9362 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -7.887 +MetaTest/__unnamed_task__/AverageReturn -7.887 +MetaTest/__unnamed_task__/Iteration 88 +MetaTest/__unnamed_task__/MaxReturn 36.6997 +MetaTest/__unnamed_task__/MinReturn -47.4895 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 27.9362 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 157400 +------------------------------------------------- ----------- +2025-04-02 19:34:24 | [pearl_trainer] epoch #89 | Training... +2025-04-02 19:36:02 | [pearl_trainer] epoch #89 | Evaluating... +2025-04-02 19:36:02 | [pearl_trainer] epoch #89 | Sampling for adapation and meta-testing... +2025-04-02 19:37:54 | [pearl_trainer] epoch #89 | Finished meta-testing... +2025-04-02 19:37:54 | [pearl_trainer] epoch #89 | Saving snapshot... +2025-04-02 19:37:55 | [pearl_trainer] epoch #89 | Saved +2025-04-02 19:37:55 | [pearl_trainer] epoch #89 | Time 21188.01 s +2025-04-02 19:37:55 | [pearl_trainer] epoch #89 | EpochTime 242.32 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -23.2098 +MetaTest/Average/AverageReturn -23.2098 +MetaTest/Average/Iteration 89 +MetaTest/Average/MaxReturn -9.27863 +MetaTest/Average/MinReturn -44.2853 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 12.632 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -23.2098 +MetaTest/__unnamed_task__/AverageReturn -23.2098 +MetaTest/__unnamed_task__/Iteration 89 +MetaTest/__unnamed_task__/MaxReturn -9.27863 +MetaTest/__unnamed_task__/MinReturn -44.2853 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 12.632 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 159000 +------------------------------------------------- ------------ +2025-04-02 19:38:30 | [pearl_trainer] epoch #90 | Training... +2025-04-02 19:39:54 | [pearl_trainer] epoch #90 | Evaluating... +2025-04-02 19:39:54 | [pearl_trainer] epoch #90 | Sampling for adapation and meta-testing... +2025-04-02 19:41:51 | [pearl_trainer] epoch #90 | Finished meta-testing... +2025-04-02 19:41:51 | [pearl_trainer] epoch #90 | Saving snapshot... +2025-04-02 19:41:53 | [pearl_trainer] epoch #90 | Saved +2025-04-02 19:41:53 | [pearl_trainer] epoch #90 | Time 21425.64 s +2025-04-02 19:41:53 | [pearl_trainer] epoch #90 | EpochTime 237.63 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -15.6959 +MetaTest/Average/AverageReturn -15.6959 +MetaTest/Average/Iteration 90 +MetaTest/Average/MaxReturn 24.5249 +MetaTest/Average/MinReturn -36.456 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 21.3972 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -15.6959 +MetaTest/__unnamed_task__/AverageReturn -15.6959 +MetaTest/__unnamed_task__/Iteration 90 +MetaTest/__unnamed_task__/MaxReturn 24.5249 +MetaTest/__unnamed_task__/MinReturn -36.456 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 21.3972 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 160600 +------------------------------------------------- ----------- +2025-04-02 19:42:22 | [pearl_trainer] epoch #91 | Training... +2025-04-02 19:43:58 | [pearl_trainer] epoch #91 | Evaluating... +2025-04-02 19:43:58 | [pearl_trainer] epoch #91 | Sampling for adapation and meta-testing... +2025-04-02 19:45:45 | [pearl_trainer] epoch #91 | Finished meta-testing... +2025-04-02 19:45:45 | [pearl_trainer] epoch #91 | Saving snapshot... +2025-04-02 19:45:46 | [pearl_trainer] epoch #91 | Saved +2025-04-02 19:45:46 | [pearl_trainer] epoch #91 | Time 21658.75 s +2025-04-02 19:45:46 | [pearl_trainer] epoch #91 | EpochTime 233.10 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -17.4907 +MetaTest/Average/AverageReturn -17.4907 +MetaTest/Average/Iteration 91 +MetaTest/Average/MaxReturn 1.25289 +MetaTest/Average/MinReturn -26.5282 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 9.78697 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -17.4907 +MetaTest/__unnamed_task__/AverageReturn -17.4907 +MetaTest/__unnamed_task__/Iteration 91 +MetaTest/__unnamed_task__/MaxReturn 1.25289 +MetaTest/__unnamed_task__/MinReturn -26.5282 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 9.78697 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 162200 +------------------------------------------------- ------------ +2025-04-02 19:46:18 | [pearl_trainer] epoch #92 | Training... +2025-04-02 19:47:47 | [pearl_trainer] epoch #92 | Evaluating... +2025-04-02 19:47:47 | [pearl_trainer] epoch #92 | Sampling for adapation and meta-testing... +2025-04-02 19:49:39 | [pearl_trainer] epoch #92 | Finished meta-testing... +2025-04-02 19:49:39 | [pearl_trainer] epoch #92 | Saving snapshot... +2025-04-02 19:49:40 | [pearl_trainer] epoch #92 | Saved +2025-04-02 19:49:40 | [pearl_trainer] epoch #92 | Time 21893.11 s +2025-04-02 19:49:40 | [pearl_trainer] epoch #92 | EpochTime 234.36 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -19.5549 +MetaTest/Average/AverageReturn -19.5549 +MetaTest/Average/Iteration 92 +MetaTest/Average/MaxReturn -10.5259 +MetaTest/Average/MinReturn -30.1052 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 6.32027 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -19.5549 +MetaTest/__unnamed_task__/AverageReturn -19.5549 +MetaTest/__unnamed_task__/Iteration 92 +MetaTest/__unnamed_task__/MaxReturn -10.5259 +MetaTest/__unnamed_task__/MinReturn -30.1052 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 6.32027 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 163800 +------------------------------------------------- ------------ +2025-04-02 19:50:11 | [pearl_trainer] epoch #93 | Training... +2025-04-02 19:51:41 | [pearl_trainer] epoch #93 | Evaluating... +2025-04-02 19:51:41 | [pearl_trainer] epoch #93 | Sampling for adapation and meta-testing... +2025-04-02 19:53:31 | [pearl_trainer] epoch #93 | Finished meta-testing... +2025-04-02 19:53:31 | [pearl_trainer] epoch #93 | Saving snapshot... +2025-04-02 19:53:32 | [pearl_trainer] epoch #93 | Saved +2025-04-02 19:53:32 | [pearl_trainer] epoch #93 | Time 22125.19 s +2025-04-02 19:53:32 | [pearl_trainer] epoch #93 | EpochTime 232.08 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -23.098 +MetaTest/Average/AverageReturn -23.098 +MetaTest/Average/Iteration 93 +MetaTest/Average/MaxReturn -13.3837 +MetaTest/Average/MinReturn -47.9151 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 12.9025 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -23.098 +MetaTest/__unnamed_task__/AverageReturn -23.098 +MetaTest/__unnamed_task__/Iteration 93 +MetaTest/__unnamed_task__/MaxReturn -13.3837 +MetaTest/__unnamed_task__/MinReturn -47.9151 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 12.9025 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 165400 +------------------------------------------------- ----------- +2025-04-02 19:54:07 | [pearl_trainer] epoch #94 | Training... +2025-04-02 19:55:38 | [pearl_trainer] epoch #94 | Evaluating... +2025-04-02 19:55:38 | [pearl_trainer] epoch #94 | Sampling for adapation and meta-testing... +2025-04-02 19:57:38 | [pearl_trainer] epoch #94 | Finished meta-testing... +2025-04-02 19:57:38 | [pearl_trainer] epoch #94 | Saving snapshot... +2025-04-02 19:57:39 | [pearl_trainer] epoch #94 | Saved +2025-04-02 19:57:39 | [pearl_trainer] epoch #94 | Time 22372.33 s +2025-04-02 19:57:39 | [pearl_trainer] epoch #94 | EpochTime 247.14 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -26.4676 +MetaTest/Average/AverageReturn -26.4676 +MetaTest/Average/Iteration 94 +MetaTest/Average/MaxReturn -10.1151 +MetaTest/Average/MinReturn -47.2181 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 12.949 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -26.4676 +MetaTest/__unnamed_task__/AverageReturn -26.4676 +MetaTest/__unnamed_task__/Iteration 94 +MetaTest/__unnamed_task__/MaxReturn -10.1151 +MetaTest/__unnamed_task__/MinReturn -47.2181 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 12.949 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 167000 +------------------------------------------------- ----------- +2025-04-02 19:58:10 | [pearl_trainer] epoch #95 | Training... +2025-04-02 19:59:40 | [pearl_trainer] epoch #95 | Evaluating... +2025-04-02 19:59:40 | [pearl_trainer] epoch #95 | Sampling for adapation and meta-testing... +2025-04-02 20:01:41 | [pearl_trainer] epoch #95 | Finished meta-testing... +2025-04-02 20:01:41 | [pearl_trainer] epoch #95 | Saving snapshot... +2025-04-02 20:01:43 | [pearl_trainer] epoch #95 | Saved +2025-04-02 20:01:43 | [pearl_trainer] epoch #95 | Time 22615.69 s +2025-04-02 20:01:43 | [pearl_trainer] epoch #95 | EpochTime 243.36 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -13.3349 +MetaTest/Average/AverageReturn -13.3349 +MetaTest/Average/Iteration 95 +MetaTest/Average/MaxReturn 27.2101 +MetaTest/Average/MinReturn -26.5986 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 20.4581 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -13.3349 +MetaTest/__unnamed_task__/AverageReturn -13.3349 +MetaTest/__unnamed_task__/Iteration 95 +MetaTest/__unnamed_task__/MaxReturn 27.2101 +MetaTest/__unnamed_task__/MinReturn -26.5986 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 20.4581 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 168600 +------------------------------------------------- ----------- +2025-04-02 20:02:15 | [pearl_trainer] epoch #96 | Training... +2025-04-02 20:03:45 | [pearl_trainer] epoch #96 | Evaluating... +2025-04-02 20:03:45 | [pearl_trainer] epoch #96 | Sampling for adapation and meta-testing... +2025-04-02 20:05:42 | [pearl_trainer] epoch #96 | Finished meta-testing... +2025-04-02 20:05:42 | [pearl_trainer] epoch #96 | Saving snapshot... +2025-04-02 20:05:43 | [pearl_trainer] epoch #96 | Saved +2025-04-02 20:05:43 | [pearl_trainer] epoch #96 | Time 22856.00 s +2025-04-02 20:05:43 | [pearl_trainer] epoch #96 | EpochTime 240.32 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -23.8541 +MetaTest/Average/AverageReturn -23.8541 +MetaTest/Average/Iteration 96 +MetaTest/Average/MaxReturn -16.0655 +MetaTest/Average/MinReturn -32.2803 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 7.154 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -23.8541 +MetaTest/__unnamed_task__/AverageReturn -23.8541 +MetaTest/__unnamed_task__/Iteration 96 +MetaTest/__unnamed_task__/MaxReturn -16.0655 +MetaTest/__unnamed_task__/MinReturn -32.2803 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 7.154 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 170200 +------------------------------------------------- ----------- +2025-04-02 20:06:15 | [pearl_trainer] epoch #97 | Training... +2025-04-02 20:07:38 | [pearl_trainer] epoch #97 | Evaluating... +2025-04-02 20:07:38 | [pearl_trainer] epoch #97 | Sampling for adapation and meta-testing... +2025-04-02 20:09:34 | [pearl_trainer] epoch #97 | Finished meta-testing... +2025-04-02 20:09:34 | [pearl_trainer] epoch #97 | Saving snapshot... +2025-04-02 20:09:35 | [pearl_trainer] epoch #97 | Saved +2025-04-02 20:09:35 | [pearl_trainer] epoch #97 | Time 23087.61 s +2025-04-02 20:09:35 | [pearl_trainer] epoch #97 | EpochTime 231.61 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -32.0342 +MetaTest/Average/AverageReturn -32.0342 +MetaTest/Average/Iteration 97 +MetaTest/Average/MaxReturn -23.6213 +MetaTest/Average/MinReturn -44.4651 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 8.25557 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -32.0342 +MetaTest/__unnamed_task__/AverageReturn -32.0342 +MetaTest/__unnamed_task__/Iteration 97 +MetaTest/__unnamed_task__/MaxReturn -23.6213 +MetaTest/__unnamed_task__/MinReturn -44.4651 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 8.25557 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 171800 +------------------------------------------------- ------------ +2025-04-02 20:10:06 | [pearl_trainer] epoch #98 | Training... +2025-04-02 20:11:36 | [pearl_trainer] epoch #98 | Evaluating... +2025-04-02 20:11:36 | [pearl_trainer] epoch #98 | Sampling for adapation and meta-testing... +2025-04-02 20:13:25 | [pearl_trainer] epoch #98 | Finished meta-testing... +2025-04-02 20:13:25 | [pearl_trainer] epoch #98 | Saving snapshot... +2025-04-02 20:13:26 | [pearl_trainer] epoch #98 | Saved +2025-04-02 20:13:26 | [pearl_trainer] epoch #98 | Time 23319.49 s +2025-04-02 20:13:26 | [pearl_trainer] epoch #98 | EpochTime 231.87 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -9.41489 +MetaTest/Average/AverageReturn -9.41489 +MetaTest/Average/Iteration 98 +MetaTest/Average/MaxReturn 8.83238 +MetaTest/Average/MinReturn -27.8818 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 12.8063 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -9.41489 +MetaTest/__unnamed_task__/AverageReturn -9.41489 +MetaTest/__unnamed_task__/Iteration 98 +MetaTest/__unnamed_task__/MaxReturn 8.83238 +MetaTest/__unnamed_task__/MinReturn -27.8818 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 12.8063 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 173400 +------------------------------------------------- ------------ +2025-04-02 20:13:57 | [pearl_trainer] epoch #99 | Training... +2025-04-02 20:15:29 | [pearl_trainer] epoch #99 | Evaluating... +2025-04-02 20:15:29 | [pearl_trainer] epoch #99 | Sampling for adapation and meta-testing... +2025-04-02 20:17:18 | [pearl_trainer] epoch #99 | Finished meta-testing... +2025-04-02 20:17:18 | [pearl_trainer] epoch #99 | Saving snapshot... +2025-04-02 20:17:19 | [pearl_trainer] epoch #99 | Saved +2025-04-02 20:17:19 | [pearl_trainer] epoch #99 | Time 23552.44 s +2025-04-02 20:17:19 | [pearl_trainer] epoch #99 | EpochTime 232.95 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -16.3435 +MetaTest/Average/AverageReturn -16.3435 +MetaTest/Average/Iteration 99 +MetaTest/Average/MaxReturn 6.3603 +MetaTest/Average/MinReturn -28.7035 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 12.2262 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -16.3435 +MetaTest/__unnamed_task__/AverageReturn -16.3435 +MetaTest/__unnamed_task__/Iteration 99 +MetaTest/__unnamed_task__/MaxReturn 6.3603 +MetaTest/__unnamed_task__/MinReturn -28.7035 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 12.2262 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 175000 +------------------------------------------------- ----------- +2025-04-02 20:17:50 | [pearl_trainer] epoch #100 | Training... +2025-04-02 20:19:26 | [pearl_trainer] epoch #100 | Evaluating... +2025-04-02 20:19:26 | [pearl_trainer] epoch #100 | Sampling for adapation and meta-testing... +2025-04-02 20:21:22 | [pearl_trainer] epoch #100 | Finished meta-testing... +2025-04-02 20:21:22 | [pearl_trainer] epoch #100 | Saving snapshot... +2025-04-02 20:21:23 | [pearl_trainer] epoch #100 | Saved +2025-04-02 20:21:23 | [pearl_trainer] epoch #100 | Time 23796.19 s +2025-04-02 20:21:23 | [pearl_trainer] epoch #100 | EpochTime 243.75 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -15.83 +MetaTest/Average/AverageReturn -15.83 +MetaTest/Average/Iteration 100 +MetaTest/Average/MaxReturn -7.85981 +MetaTest/Average/MinReturn -19.5279 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 4.52202 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -15.83 +MetaTest/__unnamed_task__/AverageReturn -15.83 +MetaTest/__unnamed_task__/Iteration 100 +MetaTest/__unnamed_task__/MaxReturn -7.85981 +MetaTest/__unnamed_task__/MinReturn -19.5279 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 4.52202 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 176600 +------------------------------------------------- ------------ +2025-04-02 20:21:56 | [pearl_trainer] epoch #101 | Training... +2025-04-02 20:23:26 | [pearl_trainer] epoch #101 | Evaluating... +2025-04-02 20:23:26 | [pearl_trainer] epoch #101 | Sampling for adapation and meta-testing... +2025-04-02 20:25:13 | [pearl_trainer] epoch #101 | Finished meta-testing... +2025-04-02 20:25:13 | [pearl_trainer] epoch #101 | Saving snapshot... +2025-04-02 20:25:13 | [pearl_trainer] epoch #101 | Saved +2025-04-02 20:25:13 | [pearl_trainer] epoch #101 | Time 24026.51 s +2025-04-02 20:25:13 | [pearl_trainer] epoch #101 | EpochTime 230.31 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -8.66101 +MetaTest/Average/AverageReturn -8.66101 +MetaTest/Average/Iteration 101 +MetaTest/Average/MaxReturn 18.8328 +MetaTest/Average/MinReturn -22.3034 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 14.5635 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -8.66101 +MetaTest/__unnamed_task__/AverageReturn -8.66101 +MetaTest/__unnamed_task__/Iteration 101 +MetaTest/__unnamed_task__/MaxReturn 18.8328 +MetaTest/__unnamed_task__/MinReturn -22.3034 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 14.5635 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 178200 +------------------------------------------------- ------------ +2025-04-02 20:25:42 | [pearl_trainer] epoch #102 | Training... +2025-04-02 20:27:18 | [pearl_trainer] epoch #102 | Evaluating... +2025-04-02 20:27:18 | [pearl_trainer] epoch #102 | Sampling for adapation and meta-testing... +2025-04-02 20:29:05 | [pearl_trainer] epoch #102 | Finished meta-testing... +2025-04-02 20:29:05 | [pearl_trainer] epoch #102 | Saving snapshot... +2025-04-02 20:29:06 | [pearl_trainer] epoch #102 | Saved +2025-04-02 20:29:06 | [pearl_trainer] epoch #102 | Time 24259.01 s +2025-04-02 20:29:06 | [pearl_trainer] epoch #102 | EpochTime 232.50 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -17.7384 +MetaTest/Average/AverageReturn -17.7384 +MetaTest/Average/Iteration 102 +MetaTest/Average/MaxReturn -11.9379 +MetaTest/Average/MinReturn -22.5971 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.57811 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -17.7384 +MetaTest/__unnamed_task__/AverageReturn -17.7384 +MetaTest/__unnamed_task__/Iteration 102 +MetaTest/__unnamed_task__/MaxReturn -11.9379 +MetaTest/__unnamed_task__/MinReturn -22.5971 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.57811 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 179800 +------------------------------------------------- ------------ +2025-04-02 20:29:40 | [pearl_trainer] epoch #103 | Training... +2025-04-02 20:31:14 | [pearl_trainer] epoch #103 | Evaluating... +2025-04-02 20:31:14 | [pearl_trainer] epoch #103 | Sampling for adapation and meta-testing... +2025-04-02 20:33:12 | [pearl_trainer] epoch #103 | Finished meta-testing... +2025-04-02 20:33:12 | [pearl_trainer] epoch #103 | Saving snapshot... +2025-04-02 20:33:14 | [pearl_trainer] epoch #103 | Saved +2025-04-02 20:33:14 | [pearl_trainer] epoch #103 | Time 24506.64 s +2025-04-02 20:33:14 | [pearl_trainer] epoch #103 | EpochTime 247.62 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 9.32442 +MetaTest/Average/AverageReturn 9.32442 +MetaTest/Average/Iteration 103 +MetaTest/Average/MaxReturn 64.3587 +MetaTest/Average/MinReturn -28.1203 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 36.0725 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 9.32442 +MetaTest/__unnamed_task__/AverageReturn 9.32442 +MetaTest/__unnamed_task__/Iteration 103 +MetaTest/__unnamed_task__/MaxReturn 64.3587 +MetaTest/__unnamed_task__/MinReturn -28.1203 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 36.0725 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 181400 +------------------------------------------------- ------------ +2025-04-02 20:33:44 | [pearl_trainer] epoch #104 | Training... +2025-04-02 20:35:13 | [pearl_trainer] epoch #104 | Evaluating... +2025-04-02 20:35:13 | [pearl_trainer] epoch #104 | Sampling for adapation and meta-testing... +2025-04-02 20:37:06 | [pearl_trainer] epoch #104 | Finished meta-testing... +2025-04-02 20:37:06 | [pearl_trainer] epoch #104 | Saving snapshot... +2025-04-02 20:37:07 | [pearl_trainer] epoch #104 | Saved +2025-04-02 20:37:07 | [pearl_trainer] epoch #104 | Time 24739.86 s +2025-04-02 20:37:07 | [pearl_trainer] epoch #104 | EpochTime 233.22 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -10.0447 +MetaTest/Average/AverageReturn -10.0447 +MetaTest/Average/Iteration 104 +MetaTest/Average/MaxReturn 22.1713 +MetaTest/Average/MinReturn -25.8116 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 16.6415 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -10.0447 +MetaTest/__unnamed_task__/AverageReturn -10.0447 +MetaTest/__unnamed_task__/Iteration 104 +MetaTest/__unnamed_task__/MaxReturn 22.1713 +MetaTest/__unnamed_task__/MinReturn -25.8116 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 16.6415 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 183000 +------------------------------------------------- ----------- +2025-04-02 20:37:39 | [pearl_trainer] epoch #105 | Training... +2025-04-02 20:39:09 | [pearl_trainer] epoch #105 | Evaluating... +2025-04-02 20:39:09 | [pearl_trainer] epoch #105 | Sampling for adapation and meta-testing... +2025-04-02 20:40:58 | [pearl_trainer] epoch #105 | Finished meta-testing... +2025-04-02 20:40:58 | [pearl_trainer] epoch #105 | Saving snapshot... +2025-04-02 20:41:00 | [pearl_trainer] epoch #105 | Saved +2025-04-02 20:41:00 | [pearl_trainer] epoch #105 | Time 24972.65 s +2025-04-02 20:41:00 | [pearl_trainer] epoch #105 | EpochTime 232.79 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -3.42879 +MetaTest/Average/AverageReturn -3.42879 +MetaTest/Average/Iteration 105 +MetaTest/Average/MaxReturn 42.1675 +MetaTest/Average/MinReturn -23.241 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 23.446 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -3.42879 +MetaTest/__unnamed_task__/AverageReturn -3.42879 +MetaTest/__unnamed_task__/Iteration 105 +MetaTest/__unnamed_task__/MaxReturn 42.1675 +MetaTest/__unnamed_task__/MinReturn -23.241 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 23.446 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 184600 +------------------------------------------------- ------------ +2025-04-02 20:41:33 | [pearl_trainer] epoch #106 | Training... +2025-04-02 20:43:07 | [pearl_trainer] epoch #106 | Evaluating... +2025-04-02 20:43:07 | [pearl_trainer] epoch #106 | Sampling for adapation and meta-testing... +2025-04-02 20:44:55 | [pearl_trainer] epoch #106 | Finished meta-testing... +2025-04-02 20:44:55 | [pearl_trainer] epoch #106 | Saving snapshot... +2025-04-02 20:44:56 | [pearl_trainer] epoch #106 | Saved +2025-04-02 20:44:56 | [pearl_trainer] epoch #106 | Time 25209.24 s +2025-04-02 20:44:56 | [pearl_trainer] epoch #106 | EpochTime 236.58 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 5.56745 +MetaTest/Average/AverageReturn 5.56745 +MetaTest/Average/Iteration 106 +MetaTest/Average/MaxReturn 54.5133 +MetaTest/Average/MinReturn -10.3552 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 24.8516 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 5.56745 +MetaTest/__unnamed_task__/AverageReturn 5.56745 +MetaTest/__unnamed_task__/Iteration 106 +MetaTest/__unnamed_task__/MaxReturn 54.5133 +MetaTest/__unnamed_task__/MinReturn -10.3552 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 24.8516 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 186200 +------------------------------------------------- ------------ +2025-04-02 20:45:32 | [pearl_trainer] epoch #107 | Training... +2025-04-02 20:47:03 | [pearl_trainer] epoch #107 | Evaluating... +2025-04-02 20:47:03 | [pearl_trainer] epoch #107 | Sampling for adapation and meta-testing... +2025-04-02 20:49:03 | [pearl_trainer] epoch #107 | Finished meta-testing... +2025-04-02 20:49:03 | [pearl_trainer] epoch #107 | Saving snapshot... +2025-04-02 20:49:04 | [pearl_trainer] epoch #107 | Saved +2025-04-02 20:49:04 | [pearl_trainer] epoch #107 | Time 25456.58 s +2025-04-02 20:49:04 | [pearl_trainer] epoch #107 | EpochTime 247.34 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -16.3246 +MetaTest/Average/AverageReturn -16.3246 +MetaTest/Average/Iteration 107 +MetaTest/Average/MaxReturn -6.7254 +MetaTest/Average/MinReturn -21.8056 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 5.30427 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -16.3246 +MetaTest/__unnamed_task__/AverageReturn -16.3246 +MetaTest/__unnamed_task__/Iteration 107 +MetaTest/__unnamed_task__/MaxReturn -6.7254 +MetaTest/__unnamed_task__/MinReturn -21.8056 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 5.30427 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 187800 +------------------------------------------------- ------------ +2025-04-02 20:49:35 | [pearl_trainer] epoch #108 | Training... +2025-04-02 20:51:04 | [pearl_trainer] epoch #108 | Evaluating... +2025-04-02 20:51:04 | [pearl_trainer] epoch #108 | Sampling for adapation and meta-testing... +2025-04-02 20:52:58 | [pearl_trainer] epoch #108 | Finished meta-testing... +2025-04-02 20:52:58 | [pearl_trainer] epoch #108 | Saving snapshot... +2025-04-02 20:53:00 | [pearl_trainer] epoch #108 | Saved +2025-04-02 20:53:00 | [pearl_trainer] epoch #108 | Time 25692.57 s +2025-04-02 20:53:00 | [pearl_trainer] epoch #108 | EpochTime 235.98 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -9.31804 +MetaTest/Average/AverageReturn -9.31804 +MetaTest/Average/Iteration 108 +MetaTest/Average/MaxReturn 11.4036 +MetaTest/Average/MinReturn -18.6535 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 10.6417 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -9.31804 +MetaTest/__unnamed_task__/AverageReturn -9.31804 +MetaTest/__unnamed_task__/Iteration 108 +MetaTest/__unnamed_task__/MaxReturn 11.4036 +MetaTest/__unnamed_task__/MinReturn -18.6535 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 10.6417 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 189400 +------------------------------------------------- ------------ +2025-04-02 20:53:32 | [pearl_trainer] epoch #109 | Training... +2025-04-02 20:55:09 | [pearl_trainer] epoch #109 | Evaluating... +2025-04-02 20:55:09 | [pearl_trainer] epoch #109 | Sampling for adapation and meta-testing... +2025-04-02 20:57:04 | [pearl_trainer] epoch #109 | Finished meta-testing... +2025-04-02 20:57:04 | [pearl_trainer] epoch #109 | Saving snapshot... +2025-04-02 20:57:05 | [pearl_trainer] epoch #109 | Saved +2025-04-02 20:57:05 | [pearl_trainer] epoch #109 | Time 25938.07 s +2025-04-02 20:57:05 | [pearl_trainer] epoch #109 | EpochTime 245.50 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -3.18374 +MetaTest/Average/AverageReturn -3.18374 +MetaTest/Average/Iteration 109 +MetaTest/Average/MaxReturn 23.5864 +MetaTest/Average/MinReturn -17.4792 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 14.081 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -3.18374 +MetaTest/__unnamed_task__/AverageReturn -3.18374 +MetaTest/__unnamed_task__/Iteration 109 +MetaTest/__unnamed_task__/MaxReturn 23.5864 +MetaTest/__unnamed_task__/MinReturn -17.4792 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 14.081 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 191000 +------------------------------------------------- ------------ +2025-04-02 20:57:37 | [pearl_trainer] epoch #110 | Training... +2025-04-02 20:59:00 | [pearl_trainer] epoch #110 | Evaluating... +2025-04-02 20:59:00 | [pearl_trainer] epoch #110 | Sampling for adapation and meta-testing... +2025-04-02 21:00:54 | [pearl_trainer] epoch #110 | Finished meta-testing... +2025-04-02 21:00:54 | [pearl_trainer] epoch #110 | Saving snapshot... +2025-04-02 21:00:55 | [pearl_trainer] epoch #110 | Saved +2025-04-02 21:00:55 | [pearl_trainer] epoch #110 | Time 26168.30 s +2025-04-02 21:00:55 | [pearl_trainer] epoch #110 | EpochTime 230.23 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 2.1067 +MetaTest/Average/AverageReturn 2.1067 +MetaTest/Average/Iteration 110 +MetaTest/Average/MaxReturn 32.1246 +MetaTest/Average/MinReturn -10.8194 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 16.0706 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 2.1067 +MetaTest/__unnamed_task__/AverageReturn 2.1067 +MetaTest/__unnamed_task__/Iteration 110 +MetaTest/__unnamed_task__/MaxReturn 32.1246 +MetaTest/__unnamed_task__/MinReturn -10.8194 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 16.0706 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 192600 +------------------------------------------------- ----------- +2025-04-02 21:01:28 | [pearl_trainer] epoch #111 | Training... +2025-04-02 21:02:59 | [pearl_trainer] epoch #111 | Evaluating... +2025-04-02 21:02:59 | [pearl_trainer] epoch #111 | Sampling for adapation and meta-testing... +2025-04-02 21:05:03 | [pearl_trainer] epoch #111 | Finished meta-testing... +2025-04-02 21:05:03 | [pearl_trainer] epoch #111 | Saving snapshot... +2025-04-02 21:05:04 | [pearl_trainer] epoch #111 | Saved +2025-04-02 21:05:04 | [pearl_trainer] epoch #111 | Time 26417.40 s +2025-04-02 21:05:04 | [pearl_trainer] epoch #111 | EpochTime 249.09 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -12.6534 +MetaTest/Average/AverageReturn -12.6534 +MetaTest/Average/Iteration 111 +MetaTest/Average/MaxReturn 6.92589 +MetaTest/Average/MinReturn -23.5231 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 11.2714 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -12.6534 +MetaTest/__unnamed_task__/AverageReturn -12.6534 +MetaTest/__unnamed_task__/Iteration 111 +MetaTest/__unnamed_task__/MaxReturn 6.92589 +MetaTest/__unnamed_task__/MinReturn -23.5231 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 11.2714 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 194200 +------------------------------------------------- ------------ +2025-04-02 21:05:36 | [pearl_trainer] epoch #112 | Training... +2025-04-02 21:07:06 | [pearl_trainer] epoch #112 | Evaluating... +2025-04-02 21:07:06 | [pearl_trainer] epoch #112 | Sampling for adapation and meta-testing... +2025-04-02 21:09:01 | [pearl_trainer] epoch #112 | Finished meta-testing... +2025-04-02 21:09:01 | [pearl_trainer] epoch #112 | Saving snapshot... +2025-04-02 21:09:02 | [pearl_trainer] epoch #112 | Saved +2025-04-02 21:09:02 | [pearl_trainer] epoch #112 | Time 26654.75 s +2025-04-02 21:09:02 | [pearl_trainer] epoch #112 | EpochTime 237.35 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 19.5436 +MetaTest/Average/AverageReturn 19.5436 +MetaTest/Average/Iteration 112 +MetaTest/Average/MaxReturn 64.1754 +MetaTest/Average/MinReturn -17.5167 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 34.4276 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 19.5436 +MetaTest/__unnamed_task__/AverageReturn 19.5436 +MetaTest/__unnamed_task__/Iteration 112 +MetaTest/__unnamed_task__/MaxReturn 64.1754 +MetaTest/__unnamed_task__/MinReturn -17.5167 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 34.4276 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 195800 +------------------------------------------------- ----------- +2025-04-02 21:09:36 | [pearl_trainer] epoch #113 | Training... +2025-04-02 21:11:03 | [pearl_trainer] epoch #113 | Evaluating... +2025-04-02 21:11:03 | [pearl_trainer] epoch #113 | Sampling for adapation and meta-testing... +2025-04-02 21:12:59 | [pearl_trainer] epoch #113 | Finished meta-testing... +2025-04-02 21:12:59 | [pearl_trainer] epoch #113 | Saving snapshot... +2025-04-02 21:13:00 | [pearl_trainer] epoch #113 | Saved +2025-04-02 21:13:00 | [pearl_trainer] epoch #113 | Time 26892.93 s +2025-04-02 21:13:00 | [pearl_trainer] epoch #113 | EpochTime 238.18 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 10.1187 +MetaTest/Average/AverageReturn 10.1187 +MetaTest/Average/Iteration 113 +MetaTest/Average/MaxReturn 39.7954 +MetaTest/Average/MinReturn -21.1606 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 22.7657 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 10.1187 +MetaTest/__unnamed_task__/AverageReturn 10.1187 +MetaTest/__unnamed_task__/Iteration 113 +MetaTest/__unnamed_task__/MaxReturn 39.7954 +MetaTest/__unnamed_task__/MinReturn -21.1606 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 22.7657 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 197400 +------------------------------------------------- ----------- +2025-04-02 21:13:32 | [pearl_trainer] epoch #114 | Training... +2025-04-02 21:15:06 | [pearl_trainer] epoch #114 | Evaluating... +2025-04-02 21:15:06 | [pearl_trainer] epoch #114 | Sampling for adapation and meta-testing... +2025-04-02 21:16:57 | [pearl_trainer] epoch #114 | Finished meta-testing... +2025-04-02 21:16:57 | [pearl_trainer] epoch #114 | Saving snapshot... +2025-04-02 21:16:58 | [pearl_trainer] epoch #114 | Saved +2025-04-02 21:16:58 | [pearl_trainer] epoch #114 | Time 27131.33 s +2025-04-02 21:16:58 | [pearl_trainer] epoch #114 | EpochTime 238.40 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -4.27096 +MetaTest/Average/AverageReturn -4.27096 +MetaTest/Average/Iteration 114 +MetaTest/Average/MaxReturn 62.1317 +MetaTest/Average/MinReturn -28.5683 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 34.1165 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -4.27096 +MetaTest/__unnamed_task__/AverageReturn -4.27096 +MetaTest/__unnamed_task__/Iteration 114 +MetaTest/__unnamed_task__/MaxReturn 62.1317 +MetaTest/__unnamed_task__/MinReturn -28.5683 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 34.1165 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 199000 +------------------------------------------------- ------------ +2025-04-02 21:17:29 | [pearl_trainer] epoch #115 | Training... +2025-04-02 21:19:06 | [pearl_trainer] epoch #115 | Evaluating... +2025-04-02 21:19:06 | [pearl_trainer] epoch #115 | Sampling for adapation and meta-testing... +2025-04-02 21:20:57 | [pearl_trainer] epoch #115 | Finished meta-testing... +2025-04-02 21:20:57 | [pearl_trainer] epoch #115 | Saving snapshot... +2025-04-02 21:20:58 | [pearl_trainer] epoch #115 | Saved +2025-04-02 21:20:58 | [pearl_trainer] epoch #115 | Time 27371.12 s +2025-04-02 21:20:58 | [pearl_trainer] epoch #115 | EpochTime 239.79 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 4.41408 +MetaTest/Average/AverageReturn 4.41408 +MetaTest/Average/Iteration 115 +MetaTest/Average/MaxReturn 38.2437 +MetaTest/Average/MinReturn -27.6079 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 29.2481 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 4.41408 +MetaTest/__unnamed_task__/AverageReturn 4.41408 +MetaTest/__unnamed_task__/Iteration 115 +MetaTest/__unnamed_task__/MaxReturn 38.2437 +MetaTest/__unnamed_task__/MinReturn -27.6079 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 29.2481 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 200600 +------------------------------------------------- ------------ +2025-04-02 21:21:31 | [pearl_trainer] epoch #116 | Training... +2025-04-02 21:23:04 | [pearl_trainer] epoch #116 | Evaluating... +2025-04-02 21:23:04 | [pearl_trainer] epoch #116 | Sampling for adapation and meta-testing... +2025-04-02 21:24:56 | [pearl_trainer] epoch #116 | Finished meta-testing... +2025-04-02 21:24:56 | [pearl_trainer] epoch #116 | Saving snapshot... +2025-04-02 21:24:57 | [pearl_trainer] epoch #116 | Saved +2025-04-02 21:24:57 | [pearl_trainer] epoch #116 | Time 27610.43 s +2025-04-02 21:24:57 | [pearl_trainer] epoch #116 | EpochTime 239.31 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 21.5467 +MetaTest/Average/AverageReturn 21.5467 +MetaTest/Average/Iteration 116 +MetaTest/Average/MaxReturn 48.529 +MetaTest/Average/MinReturn -18.7814 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 22.62 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 21.5467 +MetaTest/__unnamed_task__/AverageReturn 21.5467 +MetaTest/__unnamed_task__/Iteration 116 +MetaTest/__unnamed_task__/MaxReturn 48.529 +MetaTest/__unnamed_task__/MinReturn -18.7814 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 22.62 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 202200 +------------------------------------------------- ----------- +2025-04-02 21:25:30 | [pearl_trainer] epoch #117 | Training... +2025-04-02 21:27:07 | [pearl_trainer] epoch #117 | Evaluating... +2025-04-02 21:27:07 | [pearl_trainer] epoch #117 | Sampling for adapation and meta-testing... +2025-04-02 21:29:01 | [pearl_trainer] epoch #117 | Finished meta-testing... +2025-04-02 21:29:01 | [pearl_trainer] epoch #117 | Saving snapshot... +2025-04-02 21:29:02 | [pearl_trainer] epoch #117 | Saved +2025-04-02 21:29:02 | [pearl_trainer] epoch #117 | Time 27855.39 s +2025-04-02 21:29:02 | [pearl_trainer] epoch #117 | EpochTime 244.96 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -5.11794 +MetaTest/Average/AverageReturn -5.11794 +MetaTest/Average/Iteration 117 +MetaTest/Average/MaxReturn 46.4696 +MetaTest/Average/MinReturn -25.5441 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 26.7486 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -5.11794 +MetaTest/__unnamed_task__/AverageReturn -5.11794 +MetaTest/__unnamed_task__/Iteration 117 +MetaTest/__unnamed_task__/MaxReturn 46.4696 +MetaTest/__unnamed_task__/MinReturn -25.5441 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 26.7486 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 203800 +------------------------------------------------- ------------ +2025-04-02 21:29:35 | [pearl_trainer] epoch #118 | Training... +2025-04-02 21:30:55 | [pearl_trainer] epoch #118 | Evaluating... +2025-04-02 21:30:55 | [pearl_trainer] epoch #118 | Sampling for adapation and meta-testing... +2025-04-02 21:32:50 | [pearl_trainer] epoch #118 | Finished meta-testing... +2025-04-02 21:32:50 | [pearl_trainer] epoch #118 | Saving snapshot... +2025-04-02 21:32:51 | [pearl_trainer] epoch #118 | Saved +2025-04-02 21:32:51 | [pearl_trainer] epoch #118 | Time 28083.86 s +2025-04-02 21:32:51 | [pearl_trainer] epoch #118 | EpochTime 228.47 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -3.81942 +MetaTest/Average/AverageReturn -3.81942 +MetaTest/Average/Iteration 118 +MetaTest/Average/MaxReturn 32.1807 +MetaTest/Average/MinReturn -41.0723 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 26.4754 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -3.81942 +MetaTest/__unnamed_task__/AverageReturn -3.81942 +MetaTest/__unnamed_task__/Iteration 118 +MetaTest/__unnamed_task__/MaxReturn 32.1807 +MetaTest/__unnamed_task__/MinReturn -41.0723 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 26.4754 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 205400 +------------------------------------------------- ------------ +2025-04-02 21:33:20 | [pearl_trainer] epoch #119 | Training... +2025-04-02 21:34:57 | [pearl_trainer] epoch #119 | Evaluating... +2025-04-02 21:34:57 | [pearl_trainer] epoch #119 | Sampling for adapation and meta-testing... +2025-04-02 21:36:53 | [pearl_trainer] epoch #119 | Finished meta-testing... +2025-04-02 21:36:53 | [pearl_trainer] epoch #119 | Saving snapshot... +2025-04-02 21:36:54 | [pearl_trainer] epoch #119 | Saved +2025-04-02 21:36:54 | [pearl_trainer] epoch #119 | Time 28327.00 s +2025-04-02 21:36:54 | [pearl_trainer] epoch #119 | EpochTime 243.13 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -15.8784 +MetaTest/Average/AverageReturn -15.8784 +MetaTest/Average/Iteration 119 +MetaTest/Average/MaxReturn 27.9107 +MetaTest/Average/MinReturn -46.3395 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 24.3543 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -15.8784 +MetaTest/__unnamed_task__/AverageReturn -15.8784 +MetaTest/__unnamed_task__/Iteration 119 +MetaTest/__unnamed_task__/MaxReturn 27.9107 +MetaTest/__unnamed_task__/MinReturn -46.3395 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 24.3543 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 207000 +------------------------------------------------- ----------- +2025-04-02 21:37:24 | [pearl_trainer] epoch #120 | Training... +2025-04-02 21:38:54 | [pearl_trainer] epoch #120 | Evaluating... +2025-04-02 21:38:54 | [pearl_trainer] epoch #120 | Sampling for adapation and meta-testing... +2025-04-02 21:40:42 | [pearl_trainer] epoch #120 | Finished meta-testing... +2025-04-02 21:40:42 | [pearl_trainer] epoch #120 | Saving snapshot... +2025-04-02 21:40:43 | [pearl_trainer] epoch #120 | Saved +2025-04-02 21:40:43 | [pearl_trainer] epoch #120 | Time 28555.98 s +2025-04-02 21:40:43 | [pearl_trainer] epoch #120 | EpochTime 228.98 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -11.9456 +MetaTest/Average/AverageReturn -11.9456 +MetaTest/Average/Iteration 120 +MetaTest/Average/MaxReturn 21.7592 +MetaTest/Average/MinReturn -55.0653 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 26.1085 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -11.9456 +MetaTest/__unnamed_task__/AverageReturn -11.9456 +MetaTest/__unnamed_task__/Iteration 120 +MetaTest/__unnamed_task__/MaxReturn 21.7592 +MetaTest/__unnamed_task__/MinReturn -55.0653 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 26.1085 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 208600 +------------------------------------------------- ----------- +2025-04-02 21:41:14 | [pearl_trainer] epoch #121 | Training... +2025-04-02 21:42:47 | [pearl_trainer] epoch #121 | Evaluating... +2025-04-02 21:42:47 | [pearl_trainer] epoch #121 | Sampling for adapation and meta-testing... +2025-04-02 21:44:37 | [pearl_trainer] epoch #121 | Finished meta-testing... +2025-04-02 21:44:37 | [pearl_trainer] epoch #121 | Saving snapshot... +2025-04-02 21:44:38 | [pearl_trainer] epoch #121 | Saved +2025-04-02 21:44:38 | [pearl_trainer] epoch #121 | Time 28791.11 s +2025-04-02 21:44:38 | [pearl_trainer] epoch #121 | EpochTime 235.13 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -19.4416 +MetaTest/Average/AverageReturn -19.4416 +MetaTest/Average/Iteration 121 +MetaTest/Average/MaxReturn 1.80037 +MetaTest/Average/MinReturn -30.8725 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 11.7358 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -19.4416 +MetaTest/__unnamed_task__/AverageReturn -19.4416 +MetaTest/__unnamed_task__/Iteration 121 +MetaTest/__unnamed_task__/MaxReturn 1.80037 +MetaTest/__unnamed_task__/MinReturn -30.8725 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 11.7358 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 210200 +------------------------------------------------- ------------ +2025-04-02 21:45:09 | [pearl_trainer] epoch #122 | Training... +2025-04-02 21:46:35 | [pearl_trainer] epoch #122 | Evaluating... +2025-04-02 21:46:35 | [pearl_trainer] epoch #122 | Sampling for adapation and meta-testing... +2025-04-02 21:48:25 | [pearl_trainer] epoch #122 | Finished meta-testing... +2025-04-02 21:48:25 | [pearl_trainer] epoch #122 | Saving snapshot... +2025-04-02 21:48:26 | [pearl_trainer] epoch #122 | Saved +2025-04-02 21:48:26 | [pearl_trainer] epoch #122 | Time 29019.26 s +2025-04-02 21:48:26 | [pearl_trainer] epoch #122 | EpochTime 228.15 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -5.78029 +MetaTest/Average/AverageReturn -5.78029 +MetaTest/Average/Iteration 122 +MetaTest/Average/MaxReturn 28.3591 +MetaTest/Average/MinReturn -29.3915 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 22.0196 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -5.78029 +MetaTest/__unnamed_task__/AverageReturn -5.78029 +MetaTest/__unnamed_task__/Iteration 122 +MetaTest/__unnamed_task__/MaxReturn 28.3591 +MetaTest/__unnamed_task__/MinReturn -29.3915 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 22.0196 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 211800 +------------------------------------------------- ------------ +2025-04-02 21:48:57 | [pearl_trainer] epoch #123 | Training... +2025-04-02 21:50:23 | [pearl_trainer] epoch #123 | Evaluating... +2025-04-02 21:50:23 | [pearl_trainer] epoch #123 | Sampling for adapation and meta-testing... +2025-04-02 21:52:13 | [pearl_trainer] epoch #123 | Finished meta-testing... +2025-04-02 21:52:13 | [pearl_trainer] epoch #123 | Saving snapshot... +2025-04-02 21:52:15 | [pearl_trainer] epoch #123 | Saved +2025-04-02 21:52:15 | [pearl_trainer] epoch #123 | Time 29247.58 s +2025-04-02 21:52:15 | [pearl_trainer] epoch #123 | EpochTime 228.32 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -12.4546 +MetaTest/Average/AverageReturn -12.4546 +MetaTest/Average/Iteration 123 +MetaTest/Average/MaxReturn 21.6568 +MetaTest/Average/MinReturn -43.7272 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 22.2838 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -12.4546 +MetaTest/__unnamed_task__/AverageReturn -12.4546 +MetaTest/__unnamed_task__/Iteration 123 +MetaTest/__unnamed_task__/MaxReturn 21.6568 +MetaTest/__unnamed_task__/MinReturn -43.7272 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 22.2838 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 213400 +------------------------------------------------- ----------- +2025-04-02 21:52:45 | [pearl_trainer] epoch #124 | Training... +2025-04-02 21:54:17 | [pearl_trainer] epoch #124 | Evaluating... +2025-04-02 21:54:17 | [pearl_trainer] epoch #124 | Sampling for adapation and meta-testing... +2025-04-02 21:56:03 | [pearl_trainer] epoch #124 | Finished meta-testing... +2025-04-02 21:56:03 | [pearl_trainer] epoch #124 | Saving snapshot... +2025-04-02 21:56:04 | [pearl_trainer] epoch #124 | Saved +2025-04-02 21:56:04 | [pearl_trainer] epoch #124 | Time 29477.44 s +2025-04-02 21:56:04 | [pearl_trainer] epoch #124 | EpochTime 229.86 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 11.4428 +MetaTest/Average/AverageReturn 11.4428 +MetaTest/Average/Iteration 124 +MetaTest/Average/MaxReturn 58.6054 +MetaTest/Average/MinReturn -17.3657 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 25.5261 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 11.4428 +MetaTest/__unnamed_task__/AverageReturn 11.4428 +MetaTest/__unnamed_task__/Iteration 124 +MetaTest/__unnamed_task__/MaxReturn 58.6054 +MetaTest/__unnamed_task__/MinReturn -17.3657 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 25.5261 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 215000 +------------------------------------------------- ----------- +2025-04-02 21:56:38 | [pearl_trainer] epoch #125 | Training... +2025-04-02 21:57:52 | [pearl_trainer] epoch #125 | Evaluating... +2025-04-02 21:57:52 | [pearl_trainer] epoch #125 | Sampling for adapation and meta-testing... +2025-04-02 21:59:43 | [pearl_trainer] epoch #125 | Finished meta-testing... +2025-04-02 21:59:43 | [pearl_trainer] epoch #125 | Saving snapshot... +2025-04-02 21:59:44 | [pearl_trainer] epoch #125 | Saved +2025-04-02 21:59:44 | [pearl_trainer] epoch #125 | Time 29696.95 s +2025-04-02 21:59:44 | [pearl_trainer] epoch #125 | EpochTime 219.50 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -34.4818 +MetaTest/Average/AverageReturn -34.4818 +MetaTest/Average/Iteration 125 +MetaTest/Average/MaxReturn 11.7961 +MetaTest/Average/MinReturn -112.32 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 45.8144 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -34.4818 +MetaTest/__unnamed_task__/AverageReturn -34.4818 +MetaTest/__unnamed_task__/Iteration 125 +MetaTest/__unnamed_task__/MaxReturn 11.7961 +MetaTest/__unnamed_task__/MinReturn -112.32 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 45.8144 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 216600 +------------------------------------------------- ----------- +2025-04-02 22:00:13 | [pearl_trainer] epoch #126 | Training... +2025-04-02 22:01:47 | [pearl_trainer] epoch #126 | Evaluating... +2025-04-02 22:01:47 | [pearl_trainer] epoch #126 | Sampling for adapation and meta-testing... +2025-04-02 22:03:35 | [pearl_trainer] epoch #126 | Finished meta-testing... +2025-04-02 22:03:35 | [pearl_trainer] epoch #126 | Saving snapshot... +2025-04-02 22:03:36 | [pearl_trainer] epoch #126 | Saved +2025-04-02 22:03:36 | [pearl_trainer] epoch #126 | Time 29929.03 s +2025-04-02 22:03:36 | [pearl_trainer] epoch #126 | EpochTime 232.08 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 1.3839 +MetaTest/Average/AverageReturn 1.3839 +MetaTest/Average/Iteration 126 +MetaTest/Average/MaxReturn 80.9004 +MetaTest/Average/MinReturn -51.575 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 45.0795 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 1.3839 +MetaTest/__unnamed_task__/AverageReturn 1.3839 +MetaTest/__unnamed_task__/Iteration 126 +MetaTest/__unnamed_task__/MaxReturn 80.9004 +MetaTest/__unnamed_task__/MinReturn -51.575 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 45.0795 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 218200 +------------------------------------------------- ----------- +2025-04-02 22:04:08 | [pearl_trainer] epoch #127 | Training... +2025-04-02 22:05:58 | [pearl_trainer] epoch #127 | Evaluating... +2025-04-02 22:05:58 | [pearl_trainer] epoch #127 | Sampling for adapation and meta-testing... +2025-04-02 22:08:11 | [pearl_trainer] epoch #127 | Finished meta-testing... +2025-04-02 22:08:11 | [pearl_trainer] epoch #127 | Saving snapshot... +2025-04-02 22:08:12 | [pearl_trainer] epoch #127 | Saved +2025-04-02 22:08:12 | [pearl_trainer] epoch #127 | Time 30204.59 s +2025-04-02 22:08:12 | [pearl_trainer] epoch #127 | EpochTime 275.55 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -36.2381 +MetaTest/Average/AverageReturn -36.2381 +MetaTest/Average/Iteration 127 +MetaTest/Average/MaxReturn -7.19009 +MetaTest/Average/MinReturn -56.064 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 21.132 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -36.2381 +MetaTest/__unnamed_task__/AverageReturn -36.2381 +MetaTest/__unnamed_task__/Iteration 127 +MetaTest/__unnamed_task__/MaxReturn -7.19009 +MetaTest/__unnamed_task__/MinReturn -56.064 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 21.132 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 219800 +------------------------------------------------- ------------ +2025-04-02 22:08:41 | [pearl_trainer] epoch #128 | Training... +2025-04-02 22:10:14 | [pearl_trainer] epoch #128 | Evaluating... +2025-04-02 22:10:14 | [pearl_trainer] epoch #128 | Sampling for adapation and meta-testing... +2025-04-02 22:12:01 | [pearl_trainer] epoch #128 | Finished meta-testing... +2025-04-02 22:12:01 | [pearl_trainer] epoch #128 | Saving snapshot... +2025-04-02 22:12:02 | [pearl_trainer] epoch #128 | Saved +2025-04-02 22:12:02 | [pearl_trainer] epoch #128 | Time 30435.06 s +2025-04-02 22:12:02 | [pearl_trainer] epoch #128 | EpochTime 230.47 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -40.8515 +MetaTest/Average/AverageReturn -40.8515 +MetaTest/Average/Iteration 128 +MetaTest/Average/MaxReturn -22.1598 +MetaTest/Average/MinReturn -64.0138 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 14.4641 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -40.8515 +MetaTest/__unnamed_task__/AverageReturn -40.8515 +MetaTest/__unnamed_task__/Iteration 128 +MetaTest/__unnamed_task__/MaxReturn -22.1598 +MetaTest/__unnamed_task__/MinReturn -64.0138 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 14.4641 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 221400 +------------------------------------------------- ----------- +2025-04-02 22:12:36 | [pearl_trainer] epoch #129 | Training... +2025-04-02 22:14:05 | [pearl_trainer] epoch #129 | Evaluating... +2025-04-02 22:14:05 | [pearl_trainer] epoch #129 | Sampling for adapation and meta-testing... +2025-04-02 22:16:01 | [pearl_trainer] epoch #129 | Finished meta-testing... +2025-04-02 22:16:01 | [pearl_trainer] epoch #129 | Saving snapshot... +2025-04-02 22:16:02 | [pearl_trainer] epoch #129 | Saved +2025-04-02 22:16:02 | [pearl_trainer] epoch #129 | Time 30674.78 s +2025-04-02 22:16:02 | [pearl_trainer] epoch #129 | EpochTime 239.71 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -9.79748 +MetaTest/Average/AverageReturn -9.79748 +MetaTest/Average/Iteration 129 +MetaTest/Average/MaxReturn 69.1984 +MetaTest/Average/MinReturn -76.3032 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 51.8059 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -9.79748 +MetaTest/__unnamed_task__/AverageReturn -9.79748 +MetaTest/__unnamed_task__/Iteration 129 +MetaTest/__unnamed_task__/MaxReturn 69.1984 +MetaTest/__unnamed_task__/MinReturn -76.3032 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 51.8059 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 223000 +------------------------------------------------- ------------ +2025-04-02 22:16:31 | [pearl_trainer] epoch #130 | Training... +2025-04-02 22:17:53 | [pearl_trainer] epoch #130 | Evaluating... +2025-04-02 22:17:53 | [pearl_trainer] epoch #130 | Sampling for adapation and meta-testing... +2025-04-02 22:19:41 | [pearl_trainer] epoch #130 | Finished meta-testing... +2025-04-02 22:19:41 | [pearl_trainer] epoch #130 | Saving snapshot... +2025-04-02 22:19:42 | [pearl_trainer] epoch #130 | Saved +2025-04-02 22:19:42 | [pearl_trainer] epoch #130 | Time 30895.32 s +2025-04-02 22:19:42 | [pearl_trainer] epoch #130 | EpochTime 220.54 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -7.94964 +MetaTest/Average/AverageReturn -7.94964 +MetaTest/Average/Iteration 130 +MetaTest/Average/MaxReturn 44.2662 +MetaTest/Average/MinReturn -22.655 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 26.1368 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -7.94964 +MetaTest/__unnamed_task__/AverageReturn -7.94964 +MetaTest/__unnamed_task__/Iteration 130 +MetaTest/__unnamed_task__/MaxReturn 44.2662 +MetaTest/__unnamed_task__/MinReturn -22.655 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 26.1368 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 224600 +------------------------------------------------- ------------ +2025-04-02 22:20:13 | [pearl_trainer] epoch #131 | Training... +2025-04-02 22:21:45 | [pearl_trainer] epoch #131 | Evaluating... +2025-04-02 22:21:45 | [pearl_trainer] epoch #131 | Sampling for adapation and meta-testing... +2025-04-02 22:23:40 | [pearl_trainer] epoch #131 | Finished meta-testing... +2025-04-02 22:23:40 | [pearl_trainer] epoch #131 | Saving snapshot... +2025-04-02 22:23:41 | [pearl_trainer] epoch #131 | Saved +2025-04-02 22:23:41 | [pearl_trainer] epoch #131 | Time 31134.13 s +2025-04-02 22:23:41 | [pearl_trainer] epoch #131 | EpochTime 238.80 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -27.8955 +MetaTest/Average/AverageReturn -27.8955 +MetaTest/Average/Iteration 131 +MetaTest/Average/MaxReturn -20.0863 +MetaTest/Average/MinReturn -36.5997 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 6.15202 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -27.8955 +MetaTest/__unnamed_task__/AverageReturn -27.8955 +MetaTest/__unnamed_task__/Iteration 131 +MetaTest/__unnamed_task__/MaxReturn -20.0863 +MetaTest/__unnamed_task__/MinReturn -36.5997 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 6.15202 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 226200 +------------------------------------------------- ------------ +2025-04-02 22:24:10 | [pearl_trainer] epoch #132 | Training... +2025-04-02 22:25:47 | [pearl_trainer] epoch #132 | Evaluating... +2025-04-02 22:25:47 | [pearl_trainer] epoch #132 | Sampling for adapation and meta-testing... +2025-04-02 22:27:37 | [pearl_trainer] epoch #132 | Finished meta-testing... +2025-04-02 22:27:37 | [pearl_trainer] epoch #132 | Saving snapshot... +2025-04-02 22:27:38 | [pearl_trainer] epoch #132 | Saved +2025-04-02 22:27:38 | [pearl_trainer] epoch #132 | Time 31370.52 s +2025-04-02 22:27:38 | [pearl_trainer] epoch #132 | EpochTime 236.39 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 8.91054 +MetaTest/Average/AverageReturn 8.91054 +MetaTest/Average/Iteration 132 +MetaTest/Average/MaxReturn 31.653 +MetaTest/Average/MinReturn -21.2931 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 20.7776 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 8.91054 +MetaTest/__unnamed_task__/AverageReturn 8.91054 +MetaTest/__unnamed_task__/Iteration 132 +MetaTest/__unnamed_task__/MaxReturn 31.653 +MetaTest/__unnamed_task__/MinReturn -21.2931 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 20.7776 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 227800 +------------------------------------------------- ------------ +2025-04-02 22:28:10 | [pearl_trainer] epoch #133 | Training... +2025-04-02 22:29:35 | [pearl_trainer] epoch #133 | Evaluating... +2025-04-02 22:29:35 | [pearl_trainer] epoch #133 | Sampling for adapation and meta-testing... +2025-04-02 22:31:31 | [pearl_trainer] epoch #133 | Finished meta-testing... +2025-04-02 22:31:31 | [pearl_trainer] epoch #133 | Saving snapshot... +2025-04-02 22:31:33 | [pearl_trainer] epoch #133 | Saved +2025-04-02 22:31:33 | [pearl_trainer] epoch #133 | Time 31605.53 s +2025-04-02 22:31:33 | [pearl_trainer] epoch #133 | EpochTime 235.01 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -9.32277 +MetaTest/Average/AverageReturn -9.32277 +MetaTest/Average/Iteration 133 +MetaTest/Average/MaxReturn 7.15168 +MetaTest/Average/MinReturn -25.308 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 13.9138 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -9.32277 +MetaTest/__unnamed_task__/AverageReturn -9.32277 +MetaTest/__unnamed_task__/Iteration 133 +MetaTest/__unnamed_task__/MaxReturn 7.15168 +MetaTest/__unnamed_task__/MinReturn -25.308 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 13.9138 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 229400 +------------------------------------------------- ------------ +2025-04-02 22:32:03 | [pearl_trainer] epoch #134 | Training... +2025-04-02 22:33:29 | [pearl_trainer] epoch #134 | Evaluating... +2025-04-02 22:33:29 | [pearl_trainer] epoch #134 | Sampling for adapation and meta-testing... +2025-04-02 22:35:22 | [pearl_trainer] epoch #134 | Finished meta-testing... +2025-04-02 22:35:22 | [pearl_trainer] epoch #134 | Saving snapshot... +2025-04-02 22:35:23 | [pearl_trainer] epoch #134 | Saved +2025-04-02 22:35:23 | [pearl_trainer] epoch #134 | Time 31836.34 s +2025-04-02 22:35:23 | [pearl_trainer] epoch #134 | EpochTime 230.81 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -16.6005 +MetaTest/Average/AverageReturn -16.6005 +MetaTest/Average/Iteration 134 +MetaTest/Average/MaxReturn 12.9052 +MetaTest/Average/MinReturn -33.6154 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 15.7655 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -16.6005 +MetaTest/__unnamed_task__/AverageReturn -16.6005 +MetaTest/__unnamed_task__/Iteration 134 +MetaTest/__unnamed_task__/MaxReturn 12.9052 +MetaTest/__unnamed_task__/MinReturn -33.6154 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 15.7655 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 231000 +------------------------------------------------- ----------- +2025-04-02 22:35:53 | [pearl_trainer] epoch #135 | Training... +2025-04-02 22:37:26 | [pearl_trainer] epoch #135 | Evaluating... +2025-04-02 22:37:26 | [pearl_trainer] epoch #135 | Sampling for adapation and meta-testing... +2025-04-02 22:39:26 | [pearl_trainer] epoch #135 | Finished meta-testing... +2025-04-02 22:39:26 | [pearl_trainer] epoch #135 | Saving snapshot... +2025-04-02 22:39:27 | [pearl_trainer] epoch #135 | Saved +2025-04-02 22:39:27 | [pearl_trainer] epoch #135 | Time 32080.24 s +2025-04-02 22:39:27 | [pearl_trainer] epoch #135 | EpochTime 243.89 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -20.9035 +MetaTest/Average/AverageReturn -20.9035 +MetaTest/Average/Iteration 135 +MetaTest/Average/MaxReturn 2.24816 +MetaTest/Average/MinReturn -39.5836 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 16.5682 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -20.9035 +MetaTest/__unnamed_task__/AverageReturn -20.9035 +MetaTest/__unnamed_task__/Iteration 135 +MetaTest/__unnamed_task__/MaxReturn 2.24816 +MetaTest/__unnamed_task__/MinReturn -39.5836 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 16.5682 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 232600 +------------------------------------------------- ------------ +2025-04-02 22:39:56 | [pearl_trainer] epoch #136 | Training... +2025-04-02 22:41:26 | [pearl_trainer] epoch #136 | Evaluating... +2025-04-02 22:41:26 | [pearl_trainer] epoch #136 | Sampling for adapation and meta-testing... +2025-04-02 22:43:15 | [pearl_trainer] epoch #136 | Finished meta-testing... +2025-04-02 22:43:15 | [pearl_trainer] epoch #136 | Saving snapshot... +2025-04-02 22:43:16 | [pearl_trainer] epoch #136 | Saved +2025-04-02 22:43:16 | [pearl_trainer] epoch #136 | Time 32308.55 s +2025-04-02 22:43:16 | [pearl_trainer] epoch #136 | EpochTime 228.31 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -4.94723 +MetaTest/Average/AverageReturn -4.94723 +MetaTest/Average/Iteration 136 +MetaTest/Average/MaxReturn 49.313 +MetaTest/Average/MinReturn -27.7401 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 27.684 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -4.94723 +MetaTest/__unnamed_task__/AverageReturn -4.94723 +MetaTest/__unnamed_task__/Iteration 136 +MetaTest/__unnamed_task__/MaxReturn 49.313 +MetaTest/__unnamed_task__/MinReturn -27.7401 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 27.684 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 234200 +------------------------------------------------- ------------ +2025-04-02 22:43:47 | [pearl_trainer] epoch #137 | Training... +2025-04-02 22:45:16 | [pearl_trainer] epoch #137 | Evaluating... +2025-04-02 22:45:16 | [pearl_trainer] epoch #137 | Sampling for adapation and meta-testing... +2025-04-02 22:47:06 | [pearl_trainer] epoch #137 | Finished meta-testing... +2025-04-02 22:47:06 | [pearl_trainer] epoch #137 | Saving snapshot... +2025-04-02 22:47:07 | [pearl_trainer] epoch #137 | Saved +2025-04-02 22:47:07 | [pearl_trainer] epoch #137 | Time 32540.07 s +2025-04-02 22:47:07 | [pearl_trainer] epoch #137 | EpochTime 231.52 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -14.3569 +MetaTest/Average/AverageReturn -14.3569 +MetaTest/Average/Iteration 137 +MetaTest/Average/MaxReturn 13.369 +MetaTest/Average/MinReturn -24.3985 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 14.0971 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -14.3569 +MetaTest/__unnamed_task__/AverageReturn -14.3569 +MetaTest/__unnamed_task__/Iteration 137 +MetaTest/__unnamed_task__/MaxReturn 13.369 +MetaTest/__unnamed_task__/MinReturn -24.3985 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 14.0971 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 235800 +------------------------------------------------- ----------- +2025-04-02 22:47:39 | [pearl_trainer] epoch #138 | Training... +2025-04-02 22:49:12 | [pearl_trainer] epoch #138 | Evaluating... +2025-04-02 22:49:12 | [pearl_trainer] epoch #138 | Sampling for adapation and meta-testing... +2025-04-02 22:51:00 | [pearl_trainer] epoch #138 | Finished meta-testing... +2025-04-02 22:51:00 | [pearl_trainer] epoch #138 | Saving snapshot... +2025-04-02 22:51:01 | [pearl_trainer] epoch #138 | Saved +2025-04-02 22:51:01 | [pearl_trainer] epoch #138 | Time 32774.43 s +2025-04-02 22:51:01 | [pearl_trainer] epoch #138 | EpochTime 234.36 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 8.82534 +MetaTest/Average/AverageReturn 8.82534 +MetaTest/Average/Iteration 138 +MetaTest/Average/MaxReturn 62.7983 +MetaTest/Average/MinReturn -16.9136 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 29.6932 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 8.82534 +MetaTest/__unnamed_task__/AverageReturn 8.82534 +MetaTest/__unnamed_task__/Iteration 138 +MetaTest/__unnamed_task__/MaxReturn 62.7983 +MetaTest/__unnamed_task__/MinReturn -16.9136 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 29.6932 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 237400 +------------------------------------------------- ------------ +2025-04-02 22:51:36 | [pearl_trainer] epoch #139 | Training... +2025-04-02 22:53:03 | [pearl_trainer] epoch #139 | Evaluating... +2025-04-02 22:53:03 | [pearl_trainer] epoch #139 | Sampling for adapation and meta-testing... +2025-04-02 22:54:52 | [pearl_trainer] epoch #139 | Finished meta-testing... +2025-04-02 22:54:52 | [pearl_trainer] epoch #139 | Saving snapshot... +2025-04-02 22:54:53 | [pearl_trainer] epoch #139 | Saved +2025-04-02 22:54:53 | [pearl_trainer] epoch #139 | Time 33006.05 s +2025-04-02 22:54:53 | [pearl_trainer] epoch #139 | EpochTime 231.62 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 12.7795 +MetaTest/Average/AverageReturn 12.7795 +MetaTest/Average/Iteration 139 +MetaTest/Average/MaxReturn 35.3863 +MetaTest/Average/MinReturn -17.0715 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 20.3023 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 12.7795 +MetaTest/__unnamed_task__/AverageReturn 12.7795 +MetaTest/__unnamed_task__/Iteration 139 +MetaTest/__unnamed_task__/MaxReturn 35.3863 +MetaTest/__unnamed_task__/MinReturn -17.0715 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 20.3023 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 239000 +------------------------------------------------- ----------- +2025-04-02 22:55:26 | [pearl_trainer] epoch #140 | Training... +2025-04-02 22:56:43 | [pearl_trainer] epoch #140 | Evaluating... +2025-04-02 22:56:43 | [pearl_trainer] epoch #140 | Sampling for adapation and meta-testing... +2025-04-02 22:58:35 | [pearl_trainer] epoch #140 | Finished meta-testing... +2025-04-02 22:58:35 | [pearl_trainer] epoch #140 | Saving snapshot... +2025-04-02 22:58:37 | [pearl_trainer] epoch #140 | Saved +2025-04-02 22:58:37 | [pearl_trainer] epoch #140 | Time 33229.77 s +2025-04-02 22:58:37 | [pearl_trainer] epoch #140 | EpochTime 223.72 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 5.57404 +MetaTest/Average/AverageReturn 5.57404 +MetaTest/Average/Iteration 140 +MetaTest/Average/MaxReturn 47.4507 +MetaTest/Average/MinReturn -29.4681 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 30.9312 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 5.57404 +MetaTest/__unnamed_task__/AverageReturn 5.57404 +MetaTest/__unnamed_task__/Iteration 140 +MetaTest/__unnamed_task__/MaxReturn 47.4507 +MetaTest/__unnamed_task__/MinReturn -29.4681 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 30.9312 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 240600 +------------------------------------------------- ------------ +2025-04-02 22:59:07 | [pearl_trainer] epoch #141 | Training... +2025-04-02 23:00:37 | [pearl_trainer] epoch #141 | Evaluating... +2025-04-02 23:00:37 | [pearl_trainer] epoch #141 | Sampling for adapation and meta-testing... +2025-04-02 23:02:28 | [pearl_trainer] epoch #141 | Finished meta-testing... +2025-04-02 23:02:28 | [pearl_trainer] epoch #141 | Saving snapshot... +2025-04-02 23:02:29 | [pearl_trainer] epoch #141 | Saved +2025-04-02 23:02:29 | [pearl_trainer] epoch #141 | Time 33461.81 s +2025-04-02 23:02:29 | [pearl_trainer] epoch #141 | EpochTime 232.04 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 12.0966 +MetaTest/Average/AverageReturn 12.0966 +MetaTest/Average/Iteration 141 +MetaTest/Average/MaxReturn 39.1085 +MetaTest/Average/MinReturn -19.6073 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 24.8887 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 12.0966 +MetaTest/__unnamed_task__/AverageReturn 12.0966 +MetaTest/__unnamed_task__/Iteration 141 +MetaTest/__unnamed_task__/MaxReturn 39.1085 +MetaTest/__unnamed_task__/MinReturn -19.6073 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 24.8887 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 242200 +------------------------------------------------- ----------- +2025-04-02 23:02:59 | [pearl_trainer] epoch #142 | Training... +2025-04-02 23:04:32 | [pearl_trainer] epoch #142 | Evaluating... +2025-04-02 23:04:32 | [pearl_trainer] epoch #142 | Sampling for adapation and meta-testing... +2025-04-02 23:06:24 | [pearl_trainer] epoch #142 | Finished meta-testing... +2025-04-02 23:06:24 | [pearl_trainer] epoch #142 | Saving snapshot... +2025-04-02 23:06:25 | [pearl_trainer] epoch #142 | Saved +2025-04-02 23:06:25 | [pearl_trainer] epoch #142 | Time 33697.89 s +2025-04-02 23:06:25 | [pearl_trainer] epoch #142 | EpochTime 236.08 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -5.62588 +MetaTest/Average/AverageReturn -5.62588 +MetaTest/Average/Iteration 142 +MetaTest/Average/MaxReturn 36.9462 +MetaTest/Average/MinReturn -28.9259 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 22.9646 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -5.62588 +MetaTest/__unnamed_task__/AverageReturn -5.62588 +MetaTest/__unnamed_task__/Iteration 142 +MetaTest/__unnamed_task__/MaxReturn 36.9462 +MetaTest/__unnamed_task__/MinReturn -28.9259 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 22.9646 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 243800 +------------------------------------------------- ------------ +2025-04-02 23:06:54 | [pearl_trainer] epoch #143 | Training... +2025-04-02 23:08:22 | [pearl_trainer] epoch #143 | Evaluating... +2025-04-02 23:08:22 | [pearl_trainer] epoch #143 | Sampling for adapation and meta-testing... +2025-04-02 23:10:20 | [pearl_trainer] epoch #143 | Finished meta-testing... +2025-04-02 23:10:20 | [pearl_trainer] epoch #143 | Saving snapshot... +2025-04-02 23:10:22 | [pearl_trainer] epoch #143 | Saved +2025-04-02 23:10:22 | [pearl_trainer] epoch #143 | Time 33934.86 s +2025-04-02 23:10:22 | [pearl_trainer] epoch #143 | EpochTime 236.97 s +------------------------------------------------- ------------- +MetaTest/Average/AverageDiscountedReturn 0.324816 +MetaTest/Average/AverageReturn 0.324816 +MetaTest/Average/Iteration 143 +MetaTest/Average/MaxReturn 40.555 +MetaTest/Average/MinReturn -20.2773 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 23.7701 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 0.324816 +MetaTest/__unnamed_task__/AverageReturn 0.324816 +MetaTest/__unnamed_task__/Iteration 143 +MetaTest/__unnamed_task__/MaxReturn 40.555 +MetaTest/__unnamed_task__/MinReturn -20.2773 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 23.7701 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 245400 +------------------------------------------------- ------------- +2025-04-02 23:10:56 | [pearl_trainer] epoch #144 | Training... +2025-04-02 23:12:31 | [pearl_trainer] epoch #144 | Evaluating... +2025-04-02 23:12:31 | [pearl_trainer] epoch #144 | Sampling for adapation and meta-testing... +2025-04-02 23:14:23 | [pearl_trainer] epoch #144 | Finished meta-testing... +2025-04-02 23:14:23 | [pearl_trainer] epoch #144 | Saving snapshot... +2025-04-02 23:14:24 | [pearl_trainer] epoch #144 | Saved +2025-04-02 23:14:24 | [pearl_trainer] epoch #144 | Time 34177.28 s +2025-04-02 23:14:24 | [pearl_trainer] epoch #144 | EpochTime 242.42 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -13.4847 +MetaTest/Average/AverageReturn -13.4847 +MetaTest/Average/Iteration 144 +MetaTest/Average/MaxReturn -8.60691 +MetaTest/Average/MinReturn -17.4356 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.32847 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -13.4847 +MetaTest/__unnamed_task__/AverageReturn -13.4847 +MetaTest/__unnamed_task__/Iteration 144 +MetaTest/__unnamed_task__/MaxReturn -8.60691 +MetaTest/__unnamed_task__/MinReturn -17.4356 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.32847 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 247000 +------------------------------------------------- ------------ +2025-04-02 23:14:56 | [pearl_trainer] epoch #145 | Training... +2025-04-02 23:16:22 | [pearl_trainer] epoch #145 | Evaluating... +2025-04-02 23:16:22 | [pearl_trainer] epoch #145 | Sampling for adapation and meta-testing... +2025-04-02 23:18:10 | [pearl_trainer] epoch #145 | Finished meta-testing... +2025-04-02 23:18:10 | [pearl_trainer] epoch #145 | Saving snapshot... +2025-04-02 23:18:12 | [pearl_trainer] epoch #145 | Saved +2025-04-02 23:18:12 | [pearl_trainer] epoch #145 | Time 34404.79 s +2025-04-02 23:18:12 | [pearl_trainer] epoch #145 | EpochTime 227.51 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -14.465 +MetaTest/Average/AverageReturn -14.465 +MetaTest/Average/Iteration 145 +MetaTest/Average/MaxReturn -9.18624 +MetaTest/Average/MinReturn -19.0349 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 4.33878 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -14.465 +MetaTest/__unnamed_task__/AverageReturn -14.465 +MetaTest/__unnamed_task__/Iteration 145 +MetaTest/__unnamed_task__/MaxReturn -9.18624 +MetaTest/__unnamed_task__/MinReturn -19.0349 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 4.33878 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 248600 +------------------------------------------------- ------------ +2025-04-02 23:18:43 | [pearl_trainer] epoch #146 | Training... +2025-04-02 23:20:14 | [pearl_trainer] epoch #146 | Evaluating... +2025-04-02 23:20:14 | [pearl_trainer] epoch #146 | Sampling for adapation and meta-testing... +2025-04-02 23:22:06 | [pearl_trainer] epoch #146 | Finished meta-testing... +2025-04-02 23:22:06 | [pearl_trainer] epoch #146 | Saving snapshot... +2025-04-02 23:22:07 | [pearl_trainer] epoch #146 | Saved +2025-04-02 23:22:07 | [pearl_trainer] epoch #146 | Time 34640.27 s +2025-04-02 23:22:07 | [pearl_trainer] epoch #146 | EpochTime 235.47 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -14.1212 +MetaTest/Average/AverageReturn -14.1212 +MetaTest/Average/Iteration 146 +MetaTest/Average/MaxReturn 12.2521 +MetaTest/Average/MinReturn -26.3372 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 13.7705 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -14.1212 +MetaTest/__unnamed_task__/AverageReturn -14.1212 +MetaTest/__unnamed_task__/Iteration 146 +MetaTest/__unnamed_task__/MaxReturn 12.2521 +MetaTest/__unnamed_task__/MinReturn -26.3372 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 13.7705 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 250200 +------------------------------------------------- ----------- +2025-04-02 23:22:38 | [pearl_trainer] epoch #147 | Training... +2025-04-02 23:24:10 | [pearl_trainer] epoch #147 | Evaluating... +2025-04-02 23:24:10 | [pearl_trainer] epoch #147 | Sampling for adapation and meta-testing... +2025-04-02 23:26:02 | [pearl_trainer] epoch #147 | Finished meta-testing... +2025-04-02 23:26:02 | [pearl_trainer] epoch #147 | Saving snapshot... +2025-04-02 23:26:03 | [pearl_trainer] epoch #147 | Saved +2025-04-02 23:26:03 | [pearl_trainer] epoch #147 | Time 34875.58 s +2025-04-02 23:26:03 | [pearl_trainer] epoch #147 | EpochTime 235.31 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 12.7752 +MetaTest/Average/AverageReturn 12.7752 +MetaTest/Average/Iteration 147 +MetaTest/Average/MaxReturn 41.1599 +MetaTest/Average/MinReturn -19.3987 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 26.1596 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 12.7752 +MetaTest/__unnamed_task__/AverageReturn 12.7752 +MetaTest/__unnamed_task__/Iteration 147 +MetaTest/__unnamed_task__/MaxReturn 41.1599 +MetaTest/__unnamed_task__/MinReturn -19.3987 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 26.1596 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 251800 +------------------------------------------------- ----------- +2025-04-02 23:26:37 | [pearl_trainer] epoch #148 | Training... +2025-04-02 23:28:08 | [pearl_trainer] epoch #148 | Evaluating... +2025-04-02 23:28:08 | [pearl_trainer] epoch #148 | Sampling for adapation and meta-testing... +2025-04-02 23:30:05 | [pearl_trainer] epoch #148 | Finished meta-testing... +2025-04-02 23:30:05 | [pearl_trainer] epoch #148 | Saving snapshot... +2025-04-02 23:30:06 | [pearl_trainer] epoch #148 | Saved +2025-04-02 23:30:06 | [pearl_trainer] epoch #148 | Time 35118.94 s +2025-04-02 23:30:06 | [pearl_trainer] epoch #148 | EpochTime 243.36 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -33.7736 +MetaTest/Average/AverageReturn -33.7736 +MetaTest/Average/Iteration 148 +MetaTest/Average/MaxReturn -9.22271 +MetaTest/Average/MinReturn -61.7553 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 20.4205 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -33.7736 +MetaTest/__unnamed_task__/AverageReturn -33.7736 +MetaTest/__unnamed_task__/Iteration 148 +MetaTest/__unnamed_task__/MaxReturn -9.22271 +MetaTest/__unnamed_task__/MinReturn -61.7553 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 20.4205 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 253400 +------------------------------------------------- ------------ +2025-04-02 23:30:37 | [pearl_trainer] epoch #149 | Training... +2025-04-02 23:32:11 | [pearl_trainer] epoch #149 | Evaluating... +2025-04-02 23:32:11 | [pearl_trainer] epoch #149 | Sampling for adapation and meta-testing... +2025-04-02 23:34:03 | [pearl_trainer] epoch #149 | Finished meta-testing... +2025-04-02 23:34:03 | [pearl_trainer] epoch #149 | Saving snapshot... +2025-04-02 23:34:04 | [pearl_trainer] epoch #149 | Saved +2025-04-02 23:34:04 | [pearl_trainer] epoch #149 | Time 35357.11 s +2025-04-02 23:34:04 | [pearl_trainer] epoch #149 | EpochTime 238.17 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -57.9675 +MetaTest/Average/AverageReturn -57.9675 +MetaTest/Average/Iteration 149 +MetaTest/Average/MaxReturn -10.1669 +MetaTest/Average/MinReturn -81.506 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 25.5431 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -57.9675 +MetaTest/__unnamed_task__/AverageReturn -57.9675 +MetaTest/__unnamed_task__/Iteration 149 +MetaTest/__unnamed_task__/MaxReturn -10.1669 +MetaTest/__unnamed_task__/MinReturn -81.506 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 25.5431 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 255000 +------------------------------------------------- ----------- +2025-04-02 23:34:37 | [pearl_trainer] epoch #150 | Training... +2025-04-02 23:36:04 | [pearl_trainer] epoch #150 | Evaluating... +2025-04-02 23:36:04 | [pearl_trainer] epoch #150 | Sampling for adapation and meta-testing... +2025-04-02 23:37:53 | [pearl_trainer] epoch #150 | Finished meta-testing... +2025-04-02 23:37:53 | [pearl_trainer] epoch #150 | Saving snapshot... +2025-04-02 23:37:54 | [pearl_trainer] epoch #150 | Saved +2025-04-02 23:37:54 | [pearl_trainer] epoch #150 | Time 35586.91 s +2025-04-02 23:37:54 | [pearl_trainer] epoch #150 | EpochTime 229.80 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -62.8799 +MetaTest/Average/AverageReturn -62.8799 +MetaTest/Average/Iteration 150 +MetaTest/Average/MaxReturn -21.05 +MetaTest/Average/MinReturn -82.469 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 21.4933 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -62.8799 +MetaTest/__unnamed_task__/AverageReturn -62.8799 +MetaTest/__unnamed_task__/Iteration 150 +MetaTest/__unnamed_task__/MaxReturn -21.05 +MetaTest/__unnamed_task__/MinReturn -82.469 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 21.4933 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 256600 +------------------------------------------------- ----------- +2025-04-02 23:38:27 | [pearl_trainer] epoch #151 | Training... +2025-04-02 23:39:57 | [pearl_trainer] epoch #151 | Evaluating... +2025-04-02 23:39:57 | [pearl_trainer] epoch #151 | Sampling for adapation and meta-testing... +2025-04-02 23:41:51 | [pearl_trainer] epoch #151 | Finished meta-testing... +2025-04-02 23:41:51 | [pearl_trainer] epoch #151 | Saving snapshot... +2025-04-02 23:41:52 | [pearl_trainer] epoch #151 | Saved +2025-04-02 23:41:52 | [pearl_trainer] epoch #151 | Time 35824.65 s +2025-04-02 23:41:52 | [pearl_trainer] epoch #151 | EpochTime 237.74 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -48.5059 +MetaTest/Average/AverageReturn -48.5059 +MetaTest/Average/Iteration 151 +MetaTest/Average/MaxReturn -33.35 +MetaTest/Average/MinReturn -72.761 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 13.9708 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -48.5059 +MetaTest/__unnamed_task__/AverageReturn -48.5059 +MetaTest/__unnamed_task__/Iteration 151 +MetaTest/__unnamed_task__/MaxReturn -33.35 +MetaTest/__unnamed_task__/MinReturn -72.761 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 13.9708 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 258200 +------------------------------------------------- ----------- +2025-04-02 23:42:25 | [pearl_trainer] epoch #152 | Training... +2025-04-02 23:43:59 | [pearl_trainer] epoch #152 | Evaluating... +2025-04-02 23:43:59 | [pearl_trainer] epoch #152 | Sampling for adapation and meta-testing... +2025-04-02 23:45:50 | [pearl_trainer] epoch #152 | Finished meta-testing... +2025-04-02 23:45:50 | [pearl_trainer] epoch #152 | Saving snapshot... +2025-04-02 23:45:51 | [pearl_trainer] epoch #152 | Saved +2025-04-02 23:45:51 | [pearl_trainer] epoch #152 | Time 36064.32 s +2025-04-02 23:45:51 | [pearl_trainer] epoch #152 | EpochTime 239.67 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -43.1652 +MetaTest/Average/AverageReturn -43.1652 +MetaTest/Average/Iteration 152 +MetaTest/Average/MaxReturn -30.0183 +MetaTest/Average/MinReturn -75.4496 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 16.6771 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -43.1652 +MetaTest/__unnamed_task__/AverageReturn -43.1652 +MetaTest/__unnamed_task__/Iteration 152 +MetaTest/__unnamed_task__/MaxReturn -30.0183 +MetaTest/__unnamed_task__/MinReturn -75.4496 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 16.6771 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 259800 +------------------------------------------------- ----------- +2025-04-02 23:46:21 | [pearl_trainer] epoch #153 | Training... +2025-04-02 23:47:58 | [pearl_trainer] epoch #153 | Evaluating... +2025-04-02 23:47:58 | [pearl_trainer] epoch #153 | Sampling for adapation and meta-testing... +2025-04-02 23:49:48 | [pearl_trainer] epoch #153 | Finished meta-testing... +2025-04-02 23:49:48 | [pearl_trainer] epoch #153 | Saving snapshot... +2025-04-02 23:49:49 | [pearl_trainer] epoch #153 | Saved +2025-04-02 23:49:49 | [pearl_trainer] epoch #153 | Time 36302.50 s +2025-04-02 23:49:49 | [pearl_trainer] epoch #153 | EpochTime 238.18 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -19.7166 +MetaTest/Average/AverageReturn -19.7166 +MetaTest/Average/Iteration 153 +MetaTest/Average/MaxReturn 40.3249 +MetaTest/Average/MinReturn -55.1938 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 32.2603 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -19.7166 +MetaTest/__unnamed_task__/AverageReturn -19.7166 +MetaTest/__unnamed_task__/Iteration 153 +MetaTest/__unnamed_task__/MaxReturn 40.3249 +MetaTest/__unnamed_task__/MinReturn -55.1938 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 32.2603 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 261400 +------------------------------------------------- ----------- +2025-04-02 23:50:23 | [pearl_trainer] epoch #154 | Training... +2025-04-02 23:52:19 | [pearl_trainer] epoch #154 | Evaluating... +2025-04-02 23:52:19 | [pearl_trainer] epoch #154 | Sampling for adapation and meta-testing... +2025-04-02 23:54:15 | [pearl_trainer] epoch #154 | Finished meta-testing... +2025-04-02 23:54:15 | [pearl_trainer] epoch #154 | Saving snapshot... +2025-04-02 23:54:16 | [pearl_trainer] epoch #154 | Saved +2025-04-02 23:54:16 | [pearl_trainer] epoch #154 | Time 36569.27 s +2025-04-02 23:54:16 | [pearl_trainer] epoch #154 | EpochTime 266.77 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -23.617 +MetaTest/Average/AverageReturn -23.617 +MetaTest/Average/Iteration 154 +MetaTest/Average/MaxReturn -1.32739 +MetaTest/Average/MinReturn -66.4016 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 23.5889 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -23.617 +MetaTest/__unnamed_task__/AverageReturn -23.617 +MetaTest/__unnamed_task__/Iteration 154 +MetaTest/__unnamed_task__/MaxReturn -1.32739 +MetaTest/__unnamed_task__/MinReturn -66.4016 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 23.5889 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 263000 +------------------------------------------------- ------------ +2025-04-02 23:54:46 | [pearl_trainer] epoch #155 | Training... +2025-04-02 23:56:14 | [pearl_trainer] epoch #155 | Evaluating... +2025-04-02 23:56:14 | [pearl_trainer] epoch #155 | Sampling for adapation and meta-testing... +2025-04-02 23:58:10 | [pearl_trainer] epoch #155 | Finished meta-testing... +2025-04-02 23:58:10 | [pearl_trainer] epoch #155 | Saving snapshot... +2025-04-02 23:58:11 | [pearl_trainer] epoch #155 | Saved +2025-04-02 23:58:11 | [pearl_trainer] epoch #155 | Time 36804.16 s +2025-04-02 23:58:11 | [pearl_trainer] epoch #155 | EpochTime 234.88 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -9.34724 +MetaTest/Average/AverageReturn -9.34724 +MetaTest/Average/Iteration 155 +MetaTest/Average/MaxReturn 23.8305 +MetaTest/Average/MinReturn -32.7781 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 19.1728 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -9.34724 +MetaTest/__unnamed_task__/AverageReturn -9.34724 +MetaTest/__unnamed_task__/Iteration 155 +MetaTest/__unnamed_task__/MaxReturn 23.8305 +MetaTest/__unnamed_task__/MinReturn -32.7781 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 19.1728 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 264600 +------------------------------------------------- ------------ +2025-04-02 23:58:43 | [pearl_trainer] epoch #156 | Training... +2025-04-03 00:00:16 | [pearl_trainer] epoch #156 | Evaluating... +2025-04-03 00:00:16 | [pearl_trainer] epoch #156 | Sampling for adapation and meta-testing... +2025-04-03 00:02:12 | [pearl_trainer] epoch #156 | Finished meta-testing... +2025-04-03 00:02:12 | [pearl_trainer] epoch #156 | Saving snapshot... +2025-04-03 00:02:13 | [pearl_trainer] epoch #156 | Saved +2025-04-03 00:02:13 | [pearl_trainer] epoch #156 | Time 37046.01 s +2025-04-03 00:02:13 | [pearl_trainer] epoch #156 | EpochTime 241.85 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 12.7779 +MetaTest/Average/AverageReturn 12.7779 +MetaTest/Average/Iteration 156 +MetaTest/Average/MaxReturn 136.818 +MetaTest/Average/MinReturn -25.1613 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 62.3941 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 12.7779 +MetaTest/__unnamed_task__/AverageReturn 12.7779 +MetaTest/__unnamed_task__/Iteration 156 +MetaTest/__unnamed_task__/MaxReturn 136.818 +MetaTest/__unnamed_task__/MinReturn -25.1613 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 62.3941 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 266200 +------------------------------------------------- ----------- +2025-04-03 00:02:45 | [pearl_trainer] epoch #157 | Training... +2025-04-03 00:04:11 | [pearl_trainer] epoch #157 | Evaluating... +2025-04-03 00:04:11 | [pearl_trainer] epoch #157 | Sampling for adapation and meta-testing... +2025-04-03 00:06:09 | [pearl_trainer] epoch #157 | Finished meta-testing... +2025-04-03 00:06:09 | [pearl_trainer] epoch #157 | Saving snapshot... +2025-04-03 00:06:10 | [pearl_trainer] epoch #157 | Saved +2025-04-03 00:06:10 | [pearl_trainer] epoch #157 | Time 37283.36 s +2025-04-03 00:06:10 | [pearl_trainer] epoch #157 | EpochTime 237.35 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 8.18116 +MetaTest/Average/AverageReturn 8.18116 +MetaTest/Average/Iteration 157 +MetaTest/Average/MaxReturn 65.2576 +MetaTest/Average/MinReturn -21.9856 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 36.7893 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 8.18116 +MetaTest/__unnamed_task__/AverageReturn 8.18116 +MetaTest/__unnamed_task__/Iteration 157 +MetaTest/__unnamed_task__/MaxReturn 65.2576 +MetaTest/__unnamed_task__/MinReturn -21.9856 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 36.7893 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 267800 +------------------------------------------------- ------------ +2025-04-03 00:06:42 | [pearl_trainer] epoch #158 | Training... +2025-04-03 00:08:20 | [pearl_trainer] epoch #158 | Evaluating... +2025-04-03 00:08:20 | [pearl_trainer] epoch #158 | Sampling for adapation and meta-testing... +2025-04-03 00:10:06 | [pearl_trainer] epoch #158 | Finished meta-testing... +2025-04-03 00:10:06 | [pearl_trainer] epoch #158 | Saving snapshot... +2025-04-03 00:10:07 | [pearl_trainer] epoch #158 | Saved +2025-04-03 00:10:07 | [pearl_trainer] epoch #158 | Time 37520.00 s +2025-04-03 00:10:07 | [pearl_trainer] epoch #158 | EpochTime 236.64 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 10.6627 +MetaTest/Average/AverageReturn 10.6627 +MetaTest/Average/Iteration 158 +MetaTest/Average/MaxReturn 34.0188 +MetaTest/Average/MinReturn -13.6854 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 20.6491 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 10.6627 +MetaTest/__unnamed_task__/AverageReturn 10.6627 +MetaTest/__unnamed_task__/Iteration 158 +MetaTest/__unnamed_task__/MaxReturn 34.0188 +MetaTest/__unnamed_task__/MinReturn -13.6854 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 20.6491 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 269400 +------------------------------------------------- ----------- +2025-04-03 00:10:38 | [pearl_trainer] epoch #159 | Training... +2025-04-03 00:12:09 | [pearl_trainer] epoch #159 | Evaluating... +2025-04-03 00:12:09 | [pearl_trainer] epoch #159 | Sampling for adapation and meta-testing... +2025-04-03 00:14:02 | [pearl_trainer] epoch #159 | Finished meta-testing... +2025-04-03 00:14:02 | [pearl_trainer] epoch #159 | Saving snapshot... +2025-04-03 00:14:03 | [pearl_trainer] epoch #159 | Saved +2025-04-03 00:14:03 | [pearl_trainer] epoch #159 | Time 37755.78 s +2025-04-03 00:14:03 | [pearl_trainer] epoch #159 | EpochTime 235.77 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 19.8009 +MetaTest/Average/AverageReturn 19.8009 +MetaTest/Average/Iteration 159 +MetaTest/Average/MaxReturn 99.7595 +MetaTest/Average/MinReturn -60.2126 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 56.9896 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 19.8009 +MetaTest/__unnamed_task__/AverageReturn 19.8009 +MetaTest/__unnamed_task__/Iteration 159 +MetaTest/__unnamed_task__/MaxReturn 99.7595 +MetaTest/__unnamed_task__/MinReturn -60.2126 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 56.9896 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 271000 +------------------------------------------------- ----------- +2025-04-03 00:14:36 | [pearl_trainer] epoch #160 | Training... +2025-04-03 00:16:22 | [pearl_trainer] epoch #160 | Evaluating... +2025-04-03 00:16:22 | [pearl_trainer] epoch #160 | Sampling for adapation and meta-testing... +2025-04-03 00:18:12 | [pearl_trainer] epoch #160 | Finished meta-testing... +2025-04-03 00:18:12 | [pearl_trainer] epoch #160 | Saving snapshot... +2025-04-03 00:18:14 | [pearl_trainer] epoch #160 | Saved +2025-04-03 00:18:14 | [pearl_trainer] epoch #160 | Time 38006.54 s +2025-04-03 00:18:14 | [pearl_trainer] epoch #160 | EpochTime 250.76 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -12.8587 +MetaTest/Average/AverageReturn -12.8587 +MetaTest/Average/Iteration 160 +MetaTest/Average/MaxReturn 60.53 +MetaTest/Average/MinReturn -73.8275 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 50.1181 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -12.8587 +MetaTest/__unnamed_task__/AverageReturn -12.8587 +MetaTest/__unnamed_task__/Iteration 160 +MetaTest/__unnamed_task__/MaxReturn 60.53 +MetaTest/__unnamed_task__/MinReturn -73.8275 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 50.1181 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 272600 +------------------------------------------------- ----------- +2025-04-03 00:18:47 | [pearl_trainer] epoch #161 | Training... +2025-04-03 00:20:18 | [pearl_trainer] epoch #161 | Evaluating... +2025-04-03 00:20:18 | [pearl_trainer] epoch #161 | Sampling for adapation and meta-testing... +2025-04-03 00:22:10 | [pearl_trainer] epoch #161 | Finished meta-testing... +2025-04-03 00:22:10 | [pearl_trainer] epoch #161 | Saving snapshot... +2025-04-03 00:22:11 | [pearl_trainer] epoch #161 | Saved +2025-04-03 00:22:11 | [pearl_trainer] epoch #161 | Time 38244.25 s +2025-04-03 00:22:11 | [pearl_trainer] epoch #161 | EpochTime 237.71 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -18.266 +MetaTest/Average/AverageReturn -18.266 +MetaTest/Average/Iteration 161 +MetaTest/Average/MaxReturn 49.5975 +MetaTest/Average/MinReturn -37.3336 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 34.0028 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -18.266 +MetaTest/__unnamed_task__/AverageReturn -18.266 +MetaTest/__unnamed_task__/Iteration 161 +MetaTest/__unnamed_task__/MaxReturn 49.5975 +MetaTest/__unnamed_task__/MinReturn -37.3336 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 34.0028 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 274200 +------------------------------------------------- ----------- +2025-04-03 00:22:43 | [pearl_trainer] epoch #162 | Training... +2025-04-03 00:24:17 | [pearl_trainer] epoch #162 | Evaluating... +2025-04-03 00:24:17 | [pearl_trainer] epoch #162 | Sampling for adapation and meta-testing... +2025-04-03 00:26:01 | [pearl_trainer] epoch #162 | Finished meta-testing... +2025-04-03 00:26:01 | [pearl_trainer] epoch #162 | Saving snapshot... +2025-04-03 00:26:02 | [pearl_trainer] epoch #162 | Saved +2025-04-03 00:26:02 | [pearl_trainer] epoch #162 | Time 38474.57 s +2025-04-03 00:26:02 | [pearl_trainer] epoch #162 | EpochTime 230.32 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -38.1152 +MetaTest/Average/AverageReturn -38.1152 +MetaTest/Average/Iteration 162 +MetaTest/Average/MaxReturn -25.4208 +MetaTest/Average/MinReturn -53.0699 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 10.8695 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -38.1152 +MetaTest/__unnamed_task__/AverageReturn -38.1152 +MetaTest/__unnamed_task__/Iteration 162 +MetaTest/__unnamed_task__/MaxReturn -25.4208 +MetaTest/__unnamed_task__/MinReturn -53.0699 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 10.8695 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 275800 +------------------------------------------------- ----------- +2025-04-03 00:26:37 | [pearl_trainer] epoch #163 | Training... +2025-04-03 00:27:59 | [pearl_trainer] epoch #163 | Evaluating... +2025-04-03 00:27:59 | [pearl_trainer] epoch #163 | Sampling for adapation and meta-testing... +2025-04-03 00:29:55 | [pearl_trainer] epoch #163 | Finished meta-testing... +2025-04-03 00:29:55 | [pearl_trainer] epoch #163 | Saving snapshot... +2025-04-03 00:29:57 | [pearl_trainer] epoch #163 | Saved +2025-04-03 00:29:57 | [pearl_trainer] epoch #163 | Time 38709.68 s +2025-04-03 00:29:57 | [pearl_trainer] epoch #163 | EpochTime 235.11 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -29.3293 +MetaTest/Average/AverageReturn -29.3293 +MetaTest/Average/Iteration 163 +MetaTest/Average/MaxReturn -23.8084 +MetaTest/Average/MinReturn -35.0479 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.58936 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -29.3293 +MetaTest/__unnamed_task__/AverageReturn -29.3293 +MetaTest/__unnamed_task__/Iteration 163 +MetaTest/__unnamed_task__/MaxReturn -23.8084 +MetaTest/__unnamed_task__/MinReturn -35.0479 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.58936 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 277400 +------------------------------------------------- ------------ +2025-04-03 00:30:27 | [pearl_trainer] epoch #164 | Training... +2025-04-03 00:31:58 | [pearl_trainer] epoch #164 | Evaluating... +2025-04-03 00:31:58 | [pearl_trainer] epoch #164 | Sampling for adapation and meta-testing... +2025-04-03 00:33:50 | [pearl_trainer] epoch #164 | Finished meta-testing... +2025-04-03 00:33:50 | [pearl_trainer] epoch #164 | Saving snapshot... +2025-04-03 00:33:51 | [pearl_trainer] epoch #164 | Saved +2025-04-03 00:33:51 | [pearl_trainer] epoch #164 | Time 38943.84 s +2025-04-03 00:33:51 | [pearl_trainer] epoch #164 | EpochTime 234.16 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -25.3159 +MetaTest/Average/AverageReturn -25.3159 +MetaTest/Average/Iteration 164 +MetaTest/Average/MaxReturn -12.55 +MetaTest/Average/MinReturn -41.0951 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 9.71044 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -25.3159 +MetaTest/__unnamed_task__/AverageReturn -25.3159 +MetaTest/__unnamed_task__/Iteration 164 +MetaTest/__unnamed_task__/MaxReturn -12.55 +MetaTest/__unnamed_task__/MinReturn -41.0951 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 9.71044 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 279000 +------------------------------------------------- ------------ +2025-04-03 00:34:22 | [pearl_trainer] epoch #165 | Training... +2025-04-03 00:35:44 | [pearl_trainer] epoch #165 | Evaluating... +2025-04-03 00:35:44 | [pearl_trainer] epoch #165 | Sampling for adapation and meta-testing... +2025-04-03 00:37:38 | [pearl_trainer] epoch #165 | Finished meta-testing... +2025-04-03 00:37:38 | [pearl_trainer] epoch #165 | Saving snapshot... +2025-04-03 00:37:39 | [pearl_trainer] epoch #165 | Saved +2025-04-03 00:37:39 | [pearl_trainer] epoch #165 | Time 39172.23 s +2025-04-03 00:37:39 | [pearl_trainer] epoch #165 | EpochTime 228.38 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -16.259 +MetaTest/Average/AverageReturn -16.259 +MetaTest/Average/Iteration 165 +MetaTest/Average/MaxReturn 25.6497 +MetaTest/Average/MinReturn -44.6808 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 23.8933 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -16.259 +MetaTest/__unnamed_task__/AverageReturn -16.259 +MetaTest/__unnamed_task__/Iteration 165 +MetaTest/__unnamed_task__/MaxReturn 25.6497 +MetaTest/__unnamed_task__/MinReturn -44.6808 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 23.8933 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 280600 +------------------------------------------------- ----------- +2025-04-03 00:38:10 | [pearl_trainer] epoch #166 | Training... +2025-04-03 00:39:44 | [pearl_trainer] epoch #166 | Evaluating... +2025-04-03 00:39:44 | [pearl_trainer] epoch #166 | Sampling for adapation and meta-testing... +2025-04-03 00:41:31 | [pearl_trainer] epoch #166 | Finished meta-testing... +2025-04-03 00:41:31 | [pearl_trainer] epoch #166 | Saving snapshot... +2025-04-03 00:41:32 | [pearl_trainer] epoch #166 | Saved +2025-04-03 00:41:32 | [pearl_trainer] epoch #166 | Time 39404.95 s +2025-04-03 00:41:32 | [pearl_trainer] epoch #166 | EpochTime 232.72 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 25.4488 +MetaTest/Average/AverageReturn 25.4488 +MetaTest/Average/Iteration 166 +MetaTest/Average/MaxReturn 80.388 +MetaTest/Average/MinReturn 11.225 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 27.4721 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 25.4488 +MetaTest/__unnamed_task__/AverageReturn 25.4488 +MetaTest/__unnamed_task__/Iteration 166 +MetaTest/__unnamed_task__/MaxReturn 80.388 +MetaTest/__unnamed_task__/MinReturn 11.225 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 27.4721 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 282200 +------------------------------------------------- ----------- +2025-04-03 00:42:03 | [pearl_trainer] epoch #167 | Training... +2025-04-03 00:43:44 | [pearl_trainer] epoch #167 | Evaluating... +2025-04-03 00:43:44 | [pearl_trainer] epoch #167 | Sampling for adapation and meta-testing... +2025-04-03 00:45:33 | [pearl_trainer] epoch #167 | Finished meta-testing... +2025-04-03 00:45:33 | [pearl_trainer] epoch #167 | Saving snapshot... +2025-04-03 00:45:34 | [pearl_trainer] epoch #167 | Saved +2025-04-03 00:45:34 | [pearl_trainer] epoch #167 | Time 39646.85 s +2025-04-03 00:45:34 | [pearl_trainer] epoch #167 | EpochTime 241.90 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -13.8195 +MetaTest/Average/AverageReturn -13.8195 +MetaTest/Average/Iteration 167 +MetaTest/Average/MaxReturn 22.3599 +MetaTest/Average/MinReturn -36.6224 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 19.4791 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -13.8195 +MetaTest/__unnamed_task__/AverageReturn -13.8195 +MetaTest/__unnamed_task__/Iteration 167 +MetaTest/__unnamed_task__/MaxReturn 22.3599 +MetaTest/__unnamed_task__/MinReturn -36.6224 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 19.4791 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 283800 +------------------------------------------------- ----------- +2025-04-03 00:46:07 | [pearl_trainer] epoch #168 | Training... +2025-04-03 00:47:44 | [pearl_trainer] epoch #168 | Evaluating... +2025-04-03 00:47:44 | [pearl_trainer] epoch #168 | Sampling for adapation and meta-testing... +2025-04-03 00:49:38 | [pearl_trainer] epoch #168 | Finished meta-testing... +2025-04-03 00:49:38 | [pearl_trainer] epoch #168 | Saving snapshot... +2025-04-03 00:49:39 | [pearl_trainer] epoch #168 | Saved +2025-04-03 00:49:39 | [pearl_trainer] epoch #168 | Time 39891.98 s +2025-04-03 00:49:39 | [pearl_trainer] epoch #168 | EpochTime 245.13 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -14.626 +MetaTest/Average/AverageReturn -14.626 +MetaTest/Average/Iteration 168 +MetaTest/Average/MaxReturn 2.66234 +MetaTest/Average/MinReturn -35.7008 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 15.3194 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -14.626 +MetaTest/__unnamed_task__/AverageReturn -14.626 +MetaTest/__unnamed_task__/Iteration 168 +MetaTest/__unnamed_task__/MaxReturn 2.66234 +MetaTest/__unnamed_task__/MinReturn -35.7008 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 15.3194 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 285400 +------------------------------------------------- ------------ +2025-04-03 00:50:12 | [pearl_trainer] epoch #169 | Training... +2025-04-03 00:51:45 | [pearl_trainer] epoch #169 | Evaluating... +2025-04-03 00:51:45 | [pearl_trainer] epoch #169 | Sampling for adapation and meta-testing... +2025-04-03 00:53:37 | [pearl_trainer] epoch #169 | Finished meta-testing... +2025-04-03 00:53:37 | [pearl_trainer] epoch #169 | Saving snapshot... +2025-04-03 00:53:38 | [pearl_trainer] epoch #169 | Saved +2025-04-03 00:53:38 | [pearl_trainer] epoch #169 | Time 40130.63 s +2025-04-03 00:53:38 | [pearl_trainer] epoch #169 | EpochTime 238.64 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 22.3992 +MetaTest/Average/AverageReturn 22.3992 +MetaTest/Average/Iteration 169 +MetaTest/Average/MaxReturn 129.285 +MetaTest/Average/MinReturn -51.9298 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 63.0939 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 22.3992 +MetaTest/__unnamed_task__/AverageReturn 22.3992 +MetaTest/__unnamed_task__/Iteration 169 +MetaTest/__unnamed_task__/MaxReturn 129.285 +MetaTest/__unnamed_task__/MinReturn -51.9298 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 63.0939 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 287000 +------------------------------------------------- ----------- +2025-04-03 00:54:09 | [pearl_trainer] epoch #170 | Training... +2025-04-03 00:55:34 | [pearl_trainer] epoch #170 | Evaluating... +2025-04-03 00:55:34 | [pearl_trainer] epoch #170 | Sampling for adapation and meta-testing... +2025-04-03 00:57:26 | [pearl_trainer] epoch #170 | Finished meta-testing... +2025-04-03 00:57:26 | [pearl_trainer] epoch #170 | Saving snapshot... +2025-04-03 00:57:27 | [pearl_trainer] epoch #170 | Saved +2025-04-03 00:57:27 | [pearl_trainer] epoch #170 | Time 40360.01 s +2025-04-03 00:57:27 | [pearl_trainer] epoch #170 | EpochTime 229.38 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -3.97166 +MetaTest/Average/AverageReturn -3.97166 +MetaTest/Average/Iteration 170 +MetaTest/Average/MaxReturn 40.052 +MetaTest/Average/MinReturn -23.1545 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 23.7471 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -3.97166 +MetaTest/__unnamed_task__/AverageReturn -3.97166 +MetaTest/__unnamed_task__/Iteration 170 +MetaTest/__unnamed_task__/MaxReturn 40.052 +MetaTest/__unnamed_task__/MinReturn -23.1545 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 23.7471 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 288600 +------------------------------------------------- ------------ +2025-04-03 00:57:59 | [pearl_trainer] epoch #171 | Training... +2025-04-03 00:59:35 | [pearl_trainer] epoch #171 | Evaluating... +2025-04-03 00:59:35 | [pearl_trainer] epoch #171 | Sampling for adapation and meta-testing... +2025-04-03 01:01:31 | [pearl_trainer] epoch #171 | Finished meta-testing... +2025-04-03 01:01:31 | [pearl_trainer] epoch #171 | Saving snapshot... +2025-04-03 01:01:32 | [pearl_trainer] epoch #171 | Saved +2025-04-03 01:01:32 | [pearl_trainer] epoch #171 | Time 40604.99 s +2025-04-03 01:01:32 | [pearl_trainer] epoch #171 | EpochTime 244.98 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -6.53189 +MetaTest/Average/AverageReturn -6.53189 +MetaTest/Average/Iteration 171 +MetaTest/Average/MaxReturn 22.6726 +MetaTest/Average/MinReturn -17.2991 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 15.2813 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -6.53189 +MetaTest/__unnamed_task__/AverageReturn -6.53189 +MetaTest/__unnamed_task__/Iteration 171 +MetaTest/__unnamed_task__/MaxReturn 22.6726 +MetaTest/__unnamed_task__/MinReturn -17.2991 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 15.2813 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 290200 +------------------------------------------------- ------------ +2025-04-03 01:02:04 | [pearl_trainer] epoch #172 | Training... +2025-04-03 01:03:23 | [pearl_trainer] epoch #172 | Evaluating... +2025-04-03 01:03:23 | [pearl_trainer] epoch #172 | Sampling for adapation and meta-testing... +2025-04-03 01:05:16 | [pearl_trainer] epoch #172 | Finished meta-testing... +2025-04-03 01:05:16 | [pearl_trainer] epoch #172 | Saving snapshot... +2025-04-03 01:05:17 | [pearl_trainer] epoch #172 | Saved +2025-04-03 01:05:17 | [pearl_trainer] epoch #172 | Time 40830.19 s +2025-04-03 01:05:17 | [pearl_trainer] epoch #172 | EpochTime 225.20 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 8.29681 +MetaTest/Average/AverageReturn 8.29681 +MetaTest/Average/Iteration 172 +MetaTest/Average/MaxReturn 51.4122 +MetaTest/Average/MinReturn -19.3252 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 27.0974 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 8.29681 +MetaTest/__unnamed_task__/AverageReturn 8.29681 +MetaTest/__unnamed_task__/Iteration 172 +MetaTest/__unnamed_task__/MaxReturn 51.4122 +MetaTest/__unnamed_task__/MinReturn -19.3252 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 27.0974 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 291800 +------------------------------------------------- ------------ +2025-04-03 01:05:50 | [pearl_trainer] epoch #173 | Training... +2025-04-03 01:07:16 | [pearl_trainer] epoch #173 | Evaluating... +2025-04-03 01:07:16 | [pearl_trainer] epoch #173 | Sampling for adapation and meta-testing... +2025-04-03 01:09:09 | [pearl_trainer] epoch #173 | Finished meta-testing... +2025-04-03 01:09:09 | [pearl_trainer] epoch #173 | Saving snapshot... +2025-04-03 01:09:10 | [pearl_trainer] epoch #173 | Saved +2025-04-03 01:09:10 | [pearl_trainer] epoch #173 | Time 41063.03 s +2025-04-03 01:09:10 | [pearl_trainer] epoch #173 | EpochTime 232.84 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 15.523 +MetaTest/Average/AverageReturn 15.523 +MetaTest/Average/Iteration 173 +MetaTest/Average/MaxReturn 51.6908 +MetaTest/Average/MinReturn -29.3797 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 32.58 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 15.523 +MetaTest/__unnamed_task__/AverageReturn 15.523 +MetaTest/__unnamed_task__/Iteration 173 +MetaTest/__unnamed_task__/MaxReturn 51.6908 +MetaTest/__unnamed_task__/MinReturn -29.3797 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 32.58 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 293400 +------------------------------------------------- ----------- +2025-04-03 01:09:44 | [pearl_trainer] epoch #174 | Training... +2025-04-03 01:11:09 | [pearl_trainer] epoch #174 | Evaluating... +2025-04-03 01:11:09 | [pearl_trainer] epoch #174 | Sampling for adapation and meta-testing... +2025-04-03 01:13:07 | [pearl_trainer] epoch #174 | Finished meta-testing... +2025-04-03 01:13:07 | [pearl_trainer] epoch #174 | Saving snapshot... +2025-04-03 01:13:08 | [pearl_trainer] epoch #174 | Saved +2025-04-03 01:13:08 | [pearl_trainer] epoch #174 | Time 41300.81 s +2025-04-03 01:13:08 | [pearl_trainer] epoch #174 | EpochTime 237.77 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 1.45816 +MetaTest/Average/AverageReturn 1.45816 +MetaTest/Average/Iteration 174 +MetaTest/Average/MaxReturn 80.8437 +MetaTest/Average/MinReturn -41.5244 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 43.0398 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 1.45816 +MetaTest/__unnamed_task__/AverageReturn 1.45816 +MetaTest/__unnamed_task__/Iteration 174 +MetaTest/__unnamed_task__/MaxReturn 80.8437 +MetaTest/__unnamed_task__/MinReturn -41.5244 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 43.0398 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 295000 +------------------------------------------------- ------------ +2025-04-03 01:13:39 | [pearl_trainer] epoch #175 | Training... +2025-04-03 01:15:08 | [pearl_trainer] epoch #175 | Evaluating... +2025-04-03 01:15:08 | [pearl_trainer] epoch #175 | Sampling for adapation and meta-testing... +2025-04-03 01:17:00 | [pearl_trainer] epoch #175 | Finished meta-testing... +2025-04-03 01:17:00 | [pearl_trainer] epoch #175 | Saving snapshot... +2025-04-03 01:17:02 | [pearl_trainer] epoch #175 | Saved +2025-04-03 01:17:02 | [pearl_trainer] epoch #175 | Time 41534.85 s +2025-04-03 01:17:02 | [pearl_trainer] epoch #175 | EpochTime 234.04 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -6.84148 +MetaTest/Average/AverageReturn -6.84148 +MetaTest/Average/Iteration 175 +MetaTest/Average/MaxReturn 90.4176 +MetaTest/Average/MinReturn -55.3603 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 50.3704 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -6.84148 +MetaTest/__unnamed_task__/AverageReturn -6.84148 +MetaTest/__unnamed_task__/Iteration 175 +MetaTest/__unnamed_task__/MaxReturn 90.4176 +MetaTest/__unnamed_task__/MinReturn -55.3603 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 50.3704 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 296600 +------------------------------------------------- ------------ +2025-04-03 01:17:35 | [pearl_trainer] epoch #176 | Training... +2025-04-03 01:19:13 | [pearl_trainer] epoch #176 | Evaluating... +2025-04-03 01:19:13 | [pearl_trainer] epoch #176 | Sampling for adapation and meta-testing... +2025-04-03 01:21:11 | [pearl_trainer] epoch #176 | Finished meta-testing... +2025-04-03 01:21:11 | [pearl_trainer] epoch #176 | Saving snapshot... +2025-04-03 01:21:12 | [pearl_trainer] epoch #176 | Saved +2025-04-03 01:21:12 | [pearl_trainer] epoch #176 | Time 41785.26 s +2025-04-03 01:21:12 | [pearl_trainer] epoch #176 | EpochTime 250.41 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 2.00627 +MetaTest/Average/AverageReturn 2.00627 +MetaTest/Average/Iteration 176 +MetaTest/Average/MaxReturn 95.8295 +MetaTest/Average/MinReturn -44.6086 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 48.5203 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 2.00627 +MetaTest/__unnamed_task__/AverageReturn 2.00627 +MetaTest/__unnamed_task__/Iteration 176 +MetaTest/__unnamed_task__/MaxReturn 95.8295 +MetaTest/__unnamed_task__/MinReturn -44.6086 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 48.5203 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 298200 +------------------------------------------------- ------------ +2025-04-03 01:21:43 | [pearl_trainer] epoch #177 | Training... +2025-04-03 01:23:24 | [pearl_trainer] epoch #177 | Evaluating... +2025-04-03 01:23:24 | [pearl_trainer] epoch #177 | Sampling for adapation and meta-testing... +2025-04-03 01:25:15 | [pearl_trainer] epoch #177 | Finished meta-testing... +2025-04-03 01:25:15 | [pearl_trainer] epoch #177 | Saving snapshot... +2025-04-03 01:25:16 | [pearl_trainer] epoch #177 | Saved +2025-04-03 01:25:16 | [pearl_trainer] epoch #177 | Time 42028.74 s +2025-04-03 01:25:16 | [pearl_trainer] epoch #177 | EpochTime 243.48 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 10.4005 +MetaTest/Average/AverageReturn 10.4005 +MetaTest/Average/Iteration 177 +MetaTest/Average/MaxReturn 88.6095 +MetaTest/Average/MinReturn -40.0178 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 43.0194 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 10.4005 +MetaTest/__unnamed_task__/AverageReturn 10.4005 +MetaTest/__unnamed_task__/Iteration 177 +MetaTest/__unnamed_task__/MaxReturn 88.6095 +MetaTest/__unnamed_task__/MinReturn -40.0178 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 43.0194 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 299800 +------------------------------------------------- ----------- +2025-04-03 01:25:51 | [pearl_trainer] epoch #178 | Training... +2025-04-03 01:27:14 | [pearl_trainer] epoch #178 | Evaluating... +2025-04-03 01:27:14 | [pearl_trainer] epoch #178 | Sampling for adapation and meta-testing... +2025-04-03 01:29:12 | [pearl_trainer] epoch #178 | Finished meta-testing... +2025-04-03 01:29:13 | [pearl_trainer] epoch #178 | Saving snapshot... +2025-04-03 01:29:14 | [pearl_trainer] epoch #178 | Saved +2025-04-03 01:29:14 | [pearl_trainer] epoch #178 | Time 42266.92 s +2025-04-03 01:29:14 | [pearl_trainer] epoch #178 | EpochTime 238.18 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -1.5921 +MetaTest/Average/AverageReturn -1.5921 +MetaTest/Average/Iteration 178 +MetaTest/Average/MaxReturn 20.3719 +MetaTest/Average/MinReturn -15.7818 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 12.2061 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -1.5921 +MetaTest/__unnamed_task__/AverageReturn -1.5921 +MetaTest/__unnamed_task__/Iteration 178 +MetaTest/__unnamed_task__/MaxReturn 20.3719 +MetaTest/__unnamed_task__/MinReturn -15.7818 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 12.2061 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 301400 +------------------------------------------------- ----------- +2025-04-03 01:29:45 | [pearl_trainer] epoch #179 | Training... +2025-04-03 01:31:14 | [pearl_trainer] epoch #179 | Evaluating... +2025-04-03 01:31:14 | [pearl_trainer] epoch #179 | Sampling for adapation and meta-testing... +2025-04-03 01:33:09 | [pearl_trainer] epoch #179 | Finished meta-testing... +2025-04-03 01:33:09 | [pearl_trainer] epoch #179 | Saving snapshot... +2025-04-03 01:33:10 | [pearl_trainer] epoch #179 | Saved +2025-04-03 01:33:10 | [pearl_trainer] epoch #179 | Time 42502.93 s +2025-04-03 01:33:10 | [pearl_trainer] epoch #179 | EpochTime 236.00 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 20.5707 +MetaTest/Average/AverageReturn 20.5707 +MetaTest/Average/Iteration 179 +MetaTest/Average/MaxReturn 77.0532 +MetaTest/Average/MinReturn -23.3531 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 45.2339 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 20.5707 +MetaTest/__unnamed_task__/AverageReturn 20.5707 +MetaTest/__unnamed_task__/Iteration 179 +MetaTest/__unnamed_task__/MaxReturn 77.0532 +MetaTest/__unnamed_task__/MinReturn -23.3531 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 45.2339 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 303000 +------------------------------------------------- ----------- +2025-04-03 01:33:43 | [pearl_trainer] epoch #180 | Training... +2025-04-03 01:35:05 | [pearl_trainer] epoch #180 | Evaluating... +2025-04-03 01:35:05 | [pearl_trainer] epoch #180 | Sampling for adapation and meta-testing... +2025-04-03 01:37:03 | [pearl_trainer] epoch #180 | Finished meta-testing... +2025-04-03 01:37:03 | [pearl_trainer] epoch #180 | Saving snapshot... +2025-04-03 01:37:05 | [pearl_trainer] epoch #180 | Saved +2025-04-03 01:37:05 | [pearl_trainer] epoch #180 | Time 42737.64 s +2025-04-03 01:37:05 | [pearl_trainer] epoch #180 | EpochTime 234.71 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 48.9722 +MetaTest/Average/AverageReturn 48.9722 +MetaTest/Average/Iteration 180 +MetaTest/Average/MaxReturn 152.108 +MetaTest/Average/MinReturn -19.7267 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 64.1636 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 48.9722 +MetaTest/__unnamed_task__/AverageReturn 48.9722 +MetaTest/__unnamed_task__/Iteration 180 +MetaTest/__unnamed_task__/MaxReturn 152.108 +MetaTest/__unnamed_task__/MinReturn -19.7267 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 64.1636 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 304600 +------------------------------------------------- ----------- +2025-04-03 01:37:36 | [pearl_trainer] epoch #181 | Training... +2025-04-03 01:39:12 | [pearl_trainer] epoch #181 | Evaluating... +2025-04-03 01:39:12 | [pearl_trainer] epoch #181 | Sampling for adapation and meta-testing... +2025-04-03 01:41:00 | [pearl_trainer] epoch #181 | Finished meta-testing... +2025-04-03 01:41:00 | [pearl_trainer] epoch #181 | Saving snapshot... +2025-04-03 01:41:01 | [pearl_trainer] epoch #181 | Saved +2025-04-03 01:41:01 | [pearl_trainer] epoch #181 | Time 42974.49 s +2025-04-03 01:41:01 | [pearl_trainer] epoch #181 | EpochTime 236.85 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 23.7362 +MetaTest/Average/AverageReturn 23.7362 +MetaTest/Average/Iteration 181 +MetaTest/Average/MaxReturn 61.7699 +MetaTest/Average/MinReturn -14.1107 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 30.2574 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 23.7362 +MetaTest/__unnamed_task__/AverageReturn 23.7362 +MetaTest/__unnamed_task__/Iteration 181 +MetaTest/__unnamed_task__/MaxReturn 61.7699 +MetaTest/__unnamed_task__/MinReturn -14.1107 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 30.2574 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 306200 +------------------------------------------------- ----------- +2025-04-03 01:41:34 | [pearl_trainer] epoch #182 | Training... +2025-04-03 01:43:10 | [pearl_trainer] epoch #182 | Evaluating... +2025-04-03 01:43:10 | [pearl_trainer] epoch #182 | Sampling for adapation and meta-testing... +2025-04-03 01:45:08 | [pearl_trainer] epoch #182 | Finished meta-testing... +2025-04-03 01:45:08 | [pearl_trainer] epoch #182 | Saving snapshot... +2025-04-03 01:45:09 | [pearl_trainer] epoch #182 | Saved +2025-04-03 01:45:09 | [pearl_trainer] epoch #182 | Time 43222.09 s +2025-04-03 01:45:09 | [pearl_trainer] epoch #182 | EpochTime 247.60 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 5.78393 +MetaTest/Average/AverageReturn 5.78393 +MetaTest/Average/Iteration 182 +MetaTest/Average/MaxReturn 53.351 +MetaTest/Average/MinReturn -18.6151 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 25.8497 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 5.78393 +MetaTest/__unnamed_task__/AverageReturn 5.78393 +MetaTest/__unnamed_task__/Iteration 182 +MetaTest/__unnamed_task__/MaxReturn 53.351 +MetaTest/__unnamed_task__/MinReturn -18.6151 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 25.8497 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 307800 +------------------------------------------------- ------------ +2025-04-03 01:45:40 | [pearl_trainer] epoch #183 | Training... +2025-04-03 01:47:11 | [pearl_trainer] epoch #183 | Evaluating... +2025-04-03 01:47:11 | [pearl_trainer] epoch #183 | Sampling for adapation and meta-testing... +2025-04-03 01:49:03 | [pearl_trainer] epoch #183 | Finished meta-testing... +2025-04-03 01:49:03 | [pearl_trainer] epoch #183 | Saving snapshot... +2025-04-03 01:49:04 | [pearl_trainer] epoch #183 | Saved +2025-04-03 01:49:04 | [pearl_trainer] epoch #183 | Time 43457.44 s +2025-04-03 01:49:04 | [pearl_trainer] epoch #183 | EpochTime 235.35 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 16.7775 +MetaTest/Average/AverageReturn 16.7775 +MetaTest/Average/Iteration 183 +MetaTest/Average/MaxReturn 104.586 +MetaTest/Average/MinReturn -30.8417 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 46.6432 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 16.7775 +MetaTest/__unnamed_task__/AverageReturn 16.7775 +MetaTest/__unnamed_task__/Iteration 183 +MetaTest/__unnamed_task__/MaxReturn 104.586 +MetaTest/__unnamed_task__/MinReturn -30.8417 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 46.6432 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 309400 +------------------------------------------------- ----------- +2025-04-03 01:49:40 | [pearl_trainer] epoch #184 | Training... +2025-04-03 01:51:28 | [pearl_trainer] epoch #184 | Evaluating... +2025-04-03 01:51:28 | [pearl_trainer] epoch #184 | Sampling for adapation and meta-testing... +2025-04-03 01:53:26 | [pearl_trainer] epoch #184 | Finished meta-testing... +2025-04-03 01:53:26 | [pearl_trainer] epoch #184 | Saving snapshot... +2025-04-03 01:53:27 | [pearl_trainer] epoch #184 | Saved +2025-04-03 01:53:27 | [pearl_trainer] epoch #184 | Time 43720.35 s +2025-04-03 01:53:27 | [pearl_trainer] epoch #184 | EpochTime 262.90 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 15.2378 +MetaTest/Average/AverageReturn 15.2378 +MetaTest/Average/Iteration 184 +MetaTest/Average/MaxReturn 73.3553 +MetaTest/Average/MinReturn -19.7287 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 40.1222 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 15.2378 +MetaTest/__unnamed_task__/AverageReturn 15.2378 +MetaTest/__unnamed_task__/Iteration 184 +MetaTest/__unnamed_task__/MaxReturn 73.3553 +MetaTest/__unnamed_task__/MinReturn -19.7287 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 40.1222 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 311000 +------------------------------------------------- ----------- +2025-04-03 01:53:58 | [pearl_trainer] epoch #185 | Training... +2025-04-03 01:55:23 | [pearl_trainer] epoch #185 | Evaluating... +2025-04-03 01:55:23 | [pearl_trainer] epoch #185 | Sampling for adapation and meta-testing... +2025-04-03 01:57:19 | [pearl_trainer] epoch #185 | Finished meta-testing... +2025-04-03 01:57:19 | [pearl_trainer] epoch #185 | Saving snapshot... +2025-04-03 01:57:21 | [pearl_trainer] epoch #185 | Saved +2025-04-03 01:57:21 | [pearl_trainer] epoch #185 | Time 43953.62 s +2025-04-03 01:57:21 | [pearl_trainer] epoch #185 | EpochTime 233.27 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 12.7644 +MetaTest/Average/AverageReturn 12.7644 +MetaTest/Average/Iteration 185 +MetaTest/Average/MaxReturn 75.7796 +MetaTest/Average/MinReturn -21.5014 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 34.5882 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 12.7644 +MetaTest/__unnamed_task__/AverageReturn 12.7644 +MetaTest/__unnamed_task__/Iteration 185 +MetaTest/__unnamed_task__/MaxReturn 75.7796 +MetaTest/__unnamed_task__/MinReturn -21.5014 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 34.5882 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 312600 +------------------------------------------------- ----------- +2025-04-03 01:57:53 | [pearl_trainer] epoch #186 | Training... +2025-04-03 01:59:28 | [pearl_trainer] epoch #186 | Evaluating... +2025-04-03 01:59:28 | [pearl_trainer] epoch #186 | Sampling for adapation and meta-testing... +2025-04-03 02:01:20 | [pearl_trainer] epoch #186 | Finished meta-testing... +2025-04-03 02:01:20 | [pearl_trainer] epoch #186 | Saving snapshot... +2025-04-03 02:01:21 | [pearl_trainer] epoch #186 | Saved +2025-04-03 02:01:21 | [pearl_trainer] epoch #186 | Time 44194.02 s +2025-04-03 02:01:21 | [pearl_trainer] epoch #186 | EpochTime 240.40 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 11.8556 +MetaTest/Average/AverageReturn 11.8556 +MetaTest/Average/Iteration 186 +MetaTest/Average/MaxReturn 62.9491 +MetaTest/Average/MinReturn -25.8231 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 35.6785 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 11.8556 +MetaTest/__unnamed_task__/AverageReturn 11.8556 +MetaTest/__unnamed_task__/Iteration 186 +MetaTest/__unnamed_task__/MaxReturn 62.9491 +MetaTest/__unnamed_task__/MinReturn -25.8231 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 35.6785 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 314200 +------------------------------------------------- ----------- +2025-04-03 02:01:52 | [pearl_trainer] epoch #187 | Training... +2025-04-03 02:03:30 | [pearl_trainer] epoch #187 | Evaluating... +2025-04-03 02:03:30 | [pearl_trainer] epoch #187 | Sampling for adapation and meta-testing... +2025-04-03 02:05:22 | [pearl_trainer] epoch #187 | Finished meta-testing... +2025-04-03 02:05:22 | [pearl_trainer] epoch #187 | Saving snapshot... +2025-04-03 02:05:23 | [pearl_trainer] epoch #187 | Saved +2025-04-03 02:05:23 | [pearl_trainer] epoch #187 | Time 44436.29 s +2025-04-03 02:05:23 | [pearl_trainer] epoch #187 | EpochTime 242.27 s +------------------------------------------------- ------------- +MetaTest/Average/AverageDiscountedReturn 0.661777 +MetaTest/Average/AverageReturn 0.661777 +MetaTest/Average/Iteration 187 +MetaTest/Average/MaxReturn 49.6992 +MetaTest/Average/MinReturn -30.419 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 29.6505 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 0.661777 +MetaTest/__unnamed_task__/AverageReturn 0.661777 +MetaTest/__unnamed_task__/Iteration 187 +MetaTest/__unnamed_task__/MaxReturn 49.6992 +MetaTest/__unnamed_task__/MinReturn -30.419 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 29.6505 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 315800 +------------------------------------------------- ------------- +2025-04-03 02:05:56 | [pearl_trainer] epoch #188 | Training... +2025-04-03 02:07:20 | [pearl_trainer] epoch #188 | Evaluating... +2025-04-03 02:07:20 | [pearl_trainer] epoch #188 | Sampling for adapation and meta-testing... +2025-04-03 02:09:17 | [pearl_trainer] epoch #188 | Finished meta-testing... +2025-04-03 02:09:17 | [pearl_trainer] epoch #188 | Saving snapshot... +2025-04-03 02:09:18 | [pearl_trainer] epoch #188 | Saved +2025-04-03 02:09:18 | [pearl_trainer] epoch #188 | Time 44671.15 s +2025-04-03 02:09:18 | [pearl_trainer] epoch #188 | EpochTime 234.85 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -12.8994 +MetaTest/Average/AverageReturn -12.8994 +MetaTest/Average/Iteration 188 +MetaTest/Average/MaxReturn 36.4141 +MetaTest/Average/MinReturn -65.2087 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 32.4246 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -12.8994 +MetaTest/__unnamed_task__/AverageReturn -12.8994 +MetaTest/__unnamed_task__/Iteration 188 +MetaTest/__unnamed_task__/MaxReturn 36.4141 +MetaTest/__unnamed_task__/MinReturn -65.2087 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 32.4246 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 317400 +------------------------------------------------- ----------- +2025-04-03 02:09:49 | [pearl_trainer] epoch #189 | Training... +2025-04-03 02:11:17 | [pearl_trainer] epoch #189 | Evaluating... +2025-04-03 02:11:17 | [pearl_trainer] epoch #189 | Sampling for adapation and meta-testing... +2025-04-03 02:13:11 | [pearl_trainer] epoch #189 | Finished meta-testing... +2025-04-03 02:13:11 | [pearl_trainer] epoch #189 | Saving snapshot... +2025-04-03 02:13:12 | [pearl_trainer] epoch #189 | Saved +2025-04-03 02:13:12 | [pearl_trainer] epoch #189 | Time 44905.49 s +2025-04-03 02:13:12 | [pearl_trainer] epoch #189 | EpochTime 234.34 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -79.9562 +MetaTest/Average/AverageReturn -79.9562 +MetaTest/Average/Iteration 189 +MetaTest/Average/MaxReturn -46.3846 +MetaTest/Average/MinReturn -99.6909 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 19.8443 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -79.9562 +MetaTest/__unnamed_task__/AverageReturn -79.9562 +MetaTest/__unnamed_task__/Iteration 189 +MetaTest/__unnamed_task__/MaxReturn -46.3846 +MetaTest/__unnamed_task__/MinReturn -99.6909 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 19.8443 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 319000 +------------------------------------------------- ----------- +2025-04-03 02:13:45 | [pearl_trainer] epoch #190 | Training... +2025-04-03 02:15:15 | [pearl_trainer] epoch #190 | Evaluating... +2025-04-03 02:15:15 | [pearl_trainer] epoch #190 | Sampling for adapation and meta-testing... +2025-04-03 02:17:10 | [pearl_trainer] epoch #190 | Finished meta-testing... +2025-04-03 02:17:10 | [pearl_trainer] epoch #190 | Saving snapshot... +2025-04-03 02:17:11 | [pearl_trainer] epoch #190 | Saved +2025-04-03 02:17:11 | [pearl_trainer] epoch #190 | Time 45144.20 s +2025-04-03 02:17:11 | [pearl_trainer] epoch #190 | EpochTime 238.72 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -75.456 +MetaTest/Average/AverageReturn -75.456 +MetaTest/Average/Iteration 190 +MetaTest/Average/MaxReturn -67.8174 +MetaTest/Average/MinReturn -84.652 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 5.92593 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -75.456 +MetaTest/__unnamed_task__/AverageReturn -75.456 +MetaTest/__unnamed_task__/Iteration 190 +MetaTest/__unnamed_task__/MaxReturn -67.8174 +MetaTest/__unnamed_task__/MinReturn -84.652 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 5.92593 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 320600 +------------------------------------------------- ------------ +2025-04-03 02:17:42 | [pearl_trainer] epoch #191 | Training... +2025-04-03 02:19:21 | [pearl_trainer] epoch #191 | Evaluating... +2025-04-03 02:19:21 | [pearl_trainer] epoch #191 | Sampling for adapation and meta-testing... +2025-04-03 02:21:22 | [pearl_trainer] epoch #191 | Finished meta-testing... +2025-04-03 02:21:22 | [pearl_trainer] epoch #191 | Saving snapshot... +2025-04-03 02:21:23 | [pearl_trainer] epoch #191 | Saved +2025-04-03 02:21:23 | [pearl_trainer] epoch #191 | Time 45395.95 s +2025-04-03 02:21:23 | [pearl_trainer] epoch #191 | EpochTime 251.75 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -51.9237 +MetaTest/Average/AverageReturn -51.9237 +MetaTest/Average/Iteration 191 +MetaTest/Average/MaxReturn -34.8819 +MetaTest/Average/MinReturn -81.4103 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 16.6977 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -51.9237 +MetaTest/__unnamed_task__/AverageReturn -51.9237 +MetaTest/__unnamed_task__/Iteration 191 +MetaTest/__unnamed_task__/MaxReturn -34.8819 +MetaTest/__unnamed_task__/MinReturn -81.4103 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 16.6977 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 322200 +------------------------------------------------- ----------- +2025-04-03 02:21:54 | [pearl_trainer] epoch #192 | Training... +2025-04-03 02:23:21 | [pearl_trainer] epoch #192 | Evaluating... +2025-04-03 02:23:21 | [pearl_trainer] epoch #192 | Sampling for adapation and meta-testing... +2025-04-03 02:25:16 | [pearl_trainer] epoch #192 | Finished meta-testing... +2025-04-03 02:25:16 | [pearl_trainer] epoch #192 | Saving snapshot... +2025-04-03 02:25:17 | [pearl_trainer] epoch #192 | Saved +2025-04-03 02:25:17 | [pearl_trainer] epoch #192 | Time 45629.53 s +2025-04-03 02:25:17 | [pearl_trainer] epoch #192 | EpochTime 233.57 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -41.3943 +MetaTest/Average/AverageReturn -41.3943 +MetaTest/Average/Iteration 192 +MetaTest/Average/MaxReturn -20.573 +MetaTest/Average/MinReturn -85.4755 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 23.0616 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -41.3943 +MetaTest/__unnamed_task__/AverageReturn -41.3943 +MetaTest/__unnamed_task__/Iteration 192 +MetaTest/__unnamed_task__/MaxReturn -20.573 +MetaTest/__unnamed_task__/MinReturn -85.4755 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 23.0616 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 323800 +------------------------------------------------- ----------- +2025-04-03 02:25:47 | [pearl_trainer] epoch #193 | Training... +2025-04-03 02:27:10 | [pearl_trainer] epoch #193 | Evaluating... +2025-04-03 02:27:10 | [pearl_trainer] epoch #193 | Sampling for adapation and meta-testing... +2025-04-03 02:29:03 | [pearl_trainer] epoch #193 | Finished meta-testing... +2025-04-03 02:29:03 | [pearl_trainer] epoch #193 | Saving snapshot... +2025-04-03 02:29:04 | [pearl_trainer] epoch #193 | Saved +2025-04-03 02:29:04 | [pearl_trainer] epoch #193 | Time 45856.93 s +2025-04-03 02:29:04 | [pearl_trainer] epoch #193 | EpochTime 227.40 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -4.22138 +MetaTest/Average/AverageReturn -4.22138 +MetaTest/Average/Iteration 193 +MetaTest/Average/MaxReturn 48.1992 +MetaTest/Average/MinReturn -23.392 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 26.9298 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -4.22138 +MetaTest/__unnamed_task__/AverageReturn -4.22138 +MetaTest/__unnamed_task__/Iteration 193 +MetaTest/__unnamed_task__/MaxReturn 48.1992 +MetaTest/__unnamed_task__/MinReturn -23.392 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 26.9298 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 325400 +------------------------------------------------- ------------ +2025-04-03 02:29:36 | [pearl_trainer] epoch #194 | Training... +2025-04-03 02:31:06 | [pearl_trainer] epoch #194 | Evaluating... +2025-04-03 02:31:06 | [pearl_trainer] epoch #194 | Sampling for adapation and meta-testing... +2025-04-03 02:32:56 | [pearl_trainer] epoch #194 | Finished meta-testing... +2025-04-03 02:32:56 | [pearl_trainer] epoch #194 | Saving snapshot... +2025-04-03 02:32:57 | [pearl_trainer] epoch #194 | Saved +2025-04-03 02:32:57 | [pearl_trainer] epoch #194 | Time 46090.14 s +2025-04-03 02:32:57 | [pearl_trainer] epoch #194 | EpochTime 233.20 s +------------------------------------------------- ------------- +MetaTest/Average/AverageDiscountedReturn -0.679536 +MetaTest/Average/AverageReturn -0.679536 +MetaTest/Average/Iteration 194 +MetaTest/Average/MaxReturn 129.691 +MetaTest/Average/MinReturn -76.9844 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 69.1909 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -0.679536 +MetaTest/__unnamed_task__/AverageReturn -0.679536 +MetaTest/__unnamed_task__/Iteration 194 +MetaTest/__unnamed_task__/MaxReturn 129.691 +MetaTest/__unnamed_task__/MinReturn -76.9844 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 69.1909 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 327000 +------------------------------------------------- ------------- +2025-04-03 02:33:32 | [pearl_trainer] epoch #195 | Training... +2025-04-03 02:34:53 | [pearl_trainer] epoch #195 | Evaluating... +2025-04-03 02:34:53 | [pearl_trainer] epoch #195 | Sampling for adapation and meta-testing... +2025-04-03 02:36:48 | [pearl_trainer] epoch #195 | Finished meta-testing... +2025-04-03 02:36:48 | [pearl_trainer] epoch #195 | Saving snapshot... +2025-04-03 02:36:49 | [pearl_trainer] epoch #195 | Saved +2025-04-03 02:36:49 | [pearl_trainer] epoch #195 | Time 46321.96 s +2025-04-03 02:36:49 | [pearl_trainer] epoch #195 | EpochTime 231.82 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -0.09025 +MetaTest/Average/AverageReturn -0.09025 +MetaTest/Average/Iteration 195 +MetaTest/Average/MaxReturn 38.5922 +MetaTest/Average/MinReturn -20.8185 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 22.7035 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -0.09025 +MetaTest/__unnamed_task__/AverageReturn -0.09025 +MetaTest/__unnamed_task__/Iteration 195 +MetaTest/__unnamed_task__/MaxReturn 38.5922 +MetaTest/__unnamed_task__/MinReturn -20.8185 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 22.7035 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 328600 +------------------------------------------------- ------------ +2025-04-03 02:37:20 | [pearl_trainer] epoch #196 | Training... +2025-04-03 02:39:00 | [pearl_trainer] epoch #196 | Evaluating... +2025-04-03 02:39:00 | [pearl_trainer] epoch #196 | Sampling for adapation and meta-testing... +2025-04-03 02:40:52 | [pearl_trainer] epoch #196 | Finished meta-testing... +2025-04-03 02:40:52 | [pearl_trainer] epoch #196 | Saving snapshot... +2025-04-03 02:40:53 | [pearl_trainer] epoch #196 | Saved +2025-04-03 02:40:53 | [pearl_trainer] epoch #196 | Time 46566.27 s +2025-04-03 02:40:53 | [pearl_trainer] epoch #196 | EpochTime 244.31 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -60.7578 +MetaTest/Average/AverageReturn -60.7578 +MetaTest/Average/Iteration 196 +MetaTest/Average/MaxReturn 54.6727 +MetaTest/Average/MinReturn -105.128 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 59.921 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -60.7578 +MetaTest/__unnamed_task__/AverageReturn -60.7578 +MetaTest/__unnamed_task__/Iteration 196 +MetaTest/__unnamed_task__/MaxReturn 54.6727 +MetaTest/__unnamed_task__/MinReturn -105.128 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 59.921 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 330200 +------------------------------------------------- ----------- +2025-04-03 02:41:28 | [pearl_trainer] epoch #197 | Training... +2025-04-03 02:42:54 | [pearl_trainer] epoch #197 | Evaluating... +2025-04-03 02:42:54 | [pearl_trainer] epoch #197 | Sampling for adapation and meta-testing... +2025-04-03 02:44:50 | [pearl_trainer] epoch #197 | Finished meta-testing... +2025-04-03 02:44:50 | [pearl_trainer] epoch #197 | Saving snapshot... +2025-04-03 02:44:52 | [pearl_trainer] epoch #197 | Saved +2025-04-03 02:44:52 | [pearl_trainer] epoch #197 | Time 46804.52 s +2025-04-03 02:44:52 | [pearl_trainer] epoch #197 | EpochTime 238.24 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -77.5646 +MetaTest/Average/AverageReturn -77.5646 +MetaTest/Average/Iteration 197 +MetaTest/Average/MaxReturn -69.2801 +MetaTest/Average/MinReturn -84.8606 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 5.85492 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -77.5646 +MetaTest/__unnamed_task__/AverageReturn -77.5646 +MetaTest/__unnamed_task__/Iteration 197 +MetaTest/__unnamed_task__/MaxReturn -69.2801 +MetaTest/__unnamed_task__/MinReturn -84.8606 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 5.85492 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 331800 +------------------------------------------------- ------------ +2025-04-03 02:45:22 | [pearl_trainer] epoch #198 | Training... +2025-04-03 02:46:49 | [pearl_trainer] epoch #198 | Evaluating... +2025-04-03 02:46:49 | [pearl_trainer] epoch #198 | Sampling for adapation and meta-testing... +2025-04-03 02:48:40 | [pearl_trainer] epoch #198 | Finished meta-testing... +2025-04-03 02:48:40 | [pearl_trainer] epoch #198 | Saving snapshot... +2025-04-03 02:48:41 | [pearl_trainer] epoch #198 | Saved +2025-04-03 02:48:41 | [pearl_trainer] epoch #198 | Time 47034.39 s +2025-04-03 02:48:41 | [pearl_trainer] epoch #198 | EpochTime 229.87 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -70.7899 +MetaTest/Average/AverageReturn -70.7899 +MetaTest/Average/Iteration 198 +MetaTest/Average/MaxReturn -58.4516 +MetaTest/Average/MinReturn -83.3238 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 9.89875 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -70.7899 +MetaTest/__unnamed_task__/AverageReturn -70.7899 +MetaTest/__unnamed_task__/Iteration 198 +MetaTest/__unnamed_task__/MaxReturn -58.4516 +MetaTest/__unnamed_task__/MinReturn -83.3238 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 9.89875 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 333400 +------------------------------------------------- ------------ +2025-04-03 02:49:14 | [pearl_trainer] epoch #199 | Training... +2025-04-03 02:50:53 | [pearl_trainer] epoch #199 | Evaluating... +2025-04-03 02:50:53 | [pearl_trainer] epoch #199 | Sampling for adapation and meta-testing... +2025-04-03 02:52:50 | [pearl_trainer] epoch #199 | Finished meta-testing... +2025-04-03 02:52:50 | [pearl_trainer] epoch #199 | Saving snapshot... +2025-04-03 02:52:51 | [pearl_trainer] epoch #199 | Saved +2025-04-03 02:52:51 | [pearl_trainer] epoch #199 | Time 47283.99 s +2025-04-03 02:52:51 | [pearl_trainer] epoch #199 | EpochTime 249.59 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -61.4313 +MetaTest/Average/AverageReturn -61.4313 +MetaTest/Average/Iteration 199 +MetaTest/Average/MaxReturn -46.805 +MetaTest/Average/MinReturn -83.3141 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 12.0995 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -61.4313 +MetaTest/__unnamed_task__/AverageReturn -61.4313 +MetaTest/__unnamed_task__/Iteration 199 +MetaTest/__unnamed_task__/MaxReturn -46.805 +MetaTest/__unnamed_task__/MinReturn -83.3141 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 12.0995 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 335000 +------------------------------------------------- ----------- +2025-04-03 02:53:23 | [pearl_trainer] epoch #200 | Training... +2025-04-03 02:54:50 | [pearl_trainer] epoch #200 | Evaluating... +2025-04-03 02:54:50 | [pearl_trainer] epoch #200 | Sampling for adapation and meta-testing... +2025-04-03 02:56:42 | [pearl_trainer] epoch #200 | Finished meta-testing... +2025-04-03 02:56:42 | [pearl_trainer] epoch #200 | Saving snapshot... +2025-04-03 02:56:44 | [pearl_trainer] epoch #200 | Saved +2025-04-03 02:56:44 | [pearl_trainer] epoch #200 | Time 47516.80 s +2025-04-03 02:56:44 | [pearl_trainer] epoch #200 | EpochTime 232.81 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -32.591 +MetaTest/Average/AverageReturn -32.591 +MetaTest/Average/Iteration 200 +MetaTest/Average/MaxReturn 21.9122 +MetaTest/Average/MinReturn -65.3591 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 29.5832 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -32.591 +MetaTest/__unnamed_task__/AverageReturn -32.591 +MetaTest/__unnamed_task__/Iteration 200 +MetaTest/__unnamed_task__/MaxReturn 21.9122 +MetaTest/__unnamed_task__/MinReturn -65.3591 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 29.5832 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 336600 +------------------------------------------------- ----------- +2025-04-03 02:57:16 | [pearl_trainer] epoch #201 | Training... +2025-04-03 02:58:47 | [pearl_trainer] epoch #201 | Evaluating... +2025-04-03 02:58:47 | [pearl_trainer] epoch #201 | Sampling for adapation and meta-testing... +2025-04-03 03:00:43 | [pearl_trainer] epoch #201 | Finished meta-testing... +2025-04-03 03:00:43 | [pearl_trainer] epoch #201 | Saving snapshot... +2025-04-03 03:00:44 | [pearl_trainer] epoch #201 | Saved +2025-04-03 03:00:44 | [pearl_trainer] epoch #201 | Time 47756.70 s +2025-04-03 03:00:44 | [pearl_trainer] epoch #201 | EpochTime 239.89 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -73.0361 +MetaTest/Average/AverageReturn -73.0361 +MetaTest/Average/Iteration 201 +MetaTest/Average/MaxReturn -41.8049 +MetaTest/Average/MinReturn -88.7916 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 18.3454 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -73.0361 +MetaTest/__unnamed_task__/AverageReturn -73.0361 +MetaTest/__unnamed_task__/Iteration 201 +MetaTest/__unnamed_task__/MaxReturn -41.8049 +MetaTest/__unnamed_task__/MinReturn -88.7916 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 18.3454 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 338200 +------------------------------------------------- ----------- +2025-04-03 03:01:16 | [pearl_trainer] epoch #202 | Training... +2025-04-03 03:02:42 | [pearl_trainer] epoch #202 | Evaluating... +2025-04-03 03:02:42 | [pearl_trainer] epoch #202 | Sampling for adapation and meta-testing... +2025-04-03 03:04:35 | [pearl_trainer] epoch #202 | Finished meta-testing... +2025-04-03 03:04:35 | [pearl_trainer] epoch #202 | Saving snapshot... +2025-04-03 03:04:36 | [pearl_trainer] epoch #202 | Saved +2025-04-03 03:04:36 | [pearl_trainer] epoch #202 | Time 47988.73 s +2025-04-03 03:04:36 | [pearl_trainer] epoch #202 | EpochTime 232.03 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -41.6624 +MetaTest/Average/AverageReturn -41.6624 +MetaTest/Average/Iteration 202 +MetaTest/Average/MaxReturn 31.4554 +MetaTest/Average/MinReturn -86.6605 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 44.4092 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -41.6624 +MetaTest/__unnamed_task__/AverageReturn -41.6624 +MetaTest/__unnamed_task__/Iteration 202 +MetaTest/__unnamed_task__/MaxReturn 31.4554 +MetaTest/__unnamed_task__/MinReturn -86.6605 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 44.4092 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 339800 +------------------------------------------------- ----------- +2025-04-03 03:05:08 | [pearl_trainer] epoch #203 | Training... +2025-04-03 03:06:37 | [pearl_trainer] epoch #203 | Evaluating... +2025-04-03 03:06:37 | [pearl_trainer] epoch #203 | Sampling for adapation and meta-testing... +2025-04-03 03:08:30 | [pearl_trainer] epoch #203 | Finished meta-testing... +2025-04-03 03:08:30 | [pearl_trainer] epoch #203 | Saving snapshot... +2025-04-03 03:08:32 | [pearl_trainer] epoch #203 | Saved +2025-04-03 03:08:32 | [pearl_trainer] epoch #203 | Time 48224.66 s +2025-04-03 03:08:32 | [pearl_trainer] epoch #203 | EpochTime 235.93 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 7.71145 +MetaTest/Average/AverageReturn 7.71145 +MetaTest/Average/Iteration 203 +MetaTest/Average/MaxReturn 59.9281 +MetaTest/Average/MinReturn -58.6368 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 45.4912 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 7.71145 +MetaTest/__unnamed_task__/AverageReturn 7.71145 +MetaTest/__unnamed_task__/Iteration 203 +MetaTest/__unnamed_task__/MaxReturn 59.9281 +MetaTest/__unnamed_task__/MinReturn -58.6368 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 45.4912 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 341400 +------------------------------------------------- ------------ +2025-04-03 03:09:03 | [pearl_trainer] epoch #204 | Training... +2025-04-03 03:10:32 | [pearl_trainer] epoch #204 | Evaluating... +2025-04-03 03:10:32 | [pearl_trainer] epoch #204 | Sampling for adapation and meta-testing... +2025-04-03 03:12:25 | [pearl_trainer] epoch #204 | Finished meta-testing... +2025-04-03 03:12:25 | [pearl_trainer] epoch #204 | Saving snapshot... +2025-04-03 03:12:26 | [pearl_trainer] epoch #204 | Saved +2025-04-03 03:12:26 | [pearl_trainer] epoch #204 | Time 48459.31 s +2025-04-03 03:12:26 | [pearl_trainer] epoch #204 | EpochTime 234.65 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 21.8079 +MetaTest/Average/AverageReturn 21.8079 +MetaTest/Average/Iteration 204 +MetaTest/Average/MaxReturn 80.4454 +MetaTest/Average/MinReturn -17.6872 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 37.8756 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 21.8079 +MetaTest/__unnamed_task__/AverageReturn 21.8079 +MetaTest/__unnamed_task__/Iteration 204 +MetaTest/__unnamed_task__/MaxReturn 80.4454 +MetaTest/__unnamed_task__/MinReturn -17.6872 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 37.8756 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 343000 +------------------------------------------------- ----------- +2025-04-03 03:12:59 | [pearl_trainer] epoch #205 | Training... +2025-04-03 03:14:36 | [pearl_trainer] epoch #205 | Evaluating... +2025-04-03 03:14:36 | [pearl_trainer] epoch #205 | Sampling for adapation and meta-testing... +2025-04-03 03:16:27 | [pearl_trainer] epoch #205 | Finished meta-testing... +2025-04-03 03:16:27 | [pearl_trainer] epoch #205 | Saving snapshot... +2025-04-03 03:16:28 | [pearl_trainer] epoch #205 | Saved +2025-04-03 03:16:28 | [pearl_trainer] epoch #205 | Time 48701.31 s +2025-04-03 03:16:28 | [pearl_trainer] epoch #205 | EpochTime 241.99 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 18.4281 +MetaTest/Average/AverageReturn 18.4281 +MetaTest/Average/Iteration 205 +MetaTest/Average/MaxReturn 65.1855 +MetaTest/Average/MinReturn -21.1644 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 33.5718 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 18.4281 +MetaTest/__unnamed_task__/AverageReturn 18.4281 +MetaTest/__unnamed_task__/Iteration 205 +MetaTest/__unnamed_task__/MaxReturn 65.1855 +MetaTest/__unnamed_task__/MinReturn -21.1644 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 33.5718 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 344600 +------------------------------------------------- ----------- +2025-04-03 03:17:00 | [pearl_trainer] epoch #206 | Training... +2025-04-03 03:18:42 | [pearl_trainer] epoch #206 | Evaluating... +2025-04-03 03:18:42 | [pearl_trainer] epoch #206 | Sampling for adapation and meta-testing... +2025-04-03 03:20:38 | [pearl_trainer] epoch #206 | Finished meta-testing... +2025-04-03 03:20:38 | [pearl_trainer] epoch #206 | Saving snapshot... +2025-04-03 03:20:39 | [pearl_trainer] epoch #206 | Saved +2025-04-03 03:20:39 | [pearl_trainer] epoch #206 | Time 48951.87 s +2025-04-03 03:20:39 | [pearl_trainer] epoch #206 | EpochTime 250.56 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 22.754 +MetaTest/Average/AverageReturn 22.754 +MetaTest/Average/Iteration 206 +MetaTest/Average/MaxReturn 96.6503 +MetaTest/Average/MinReturn -7.62737 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 37.911 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 22.754 +MetaTest/__unnamed_task__/AverageReturn 22.754 +MetaTest/__unnamed_task__/Iteration 206 +MetaTest/__unnamed_task__/MaxReturn 96.6503 +MetaTest/__unnamed_task__/MinReturn -7.62737 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 37.911 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 346200 +------------------------------------------------- ------------ +2025-04-03 03:21:13 | [pearl_trainer] epoch #207 | Training... +2025-04-03 03:22:49 | [pearl_trainer] epoch #207 | Evaluating... +2025-04-03 03:22:49 | [pearl_trainer] epoch #207 | Sampling for adapation and meta-testing... +2025-04-03 03:24:44 | [pearl_trainer] epoch #207 | Finished meta-testing... +2025-04-03 03:24:44 | [pearl_trainer] epoch #207 | Saving snapshot... +2025-04-03 03:24:45 | [pearl_trainer] epoch #207 | Saved +2025-04-03 03:24:45 | [pearl_trainer] epoch #207 | Time 49197.63 s +2025-04-03 03:24:45 | [pearl_trainer] epoch #207 | EpochTime 245.75 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 6.14271 +MetaTest/Average/AverageReturn 6.14271 +MetaTest/Average/Iteration 207 +MetaTest/Average/MaxReturn 101.996 +MetaTest/Average/MinReturn -66.0914 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 54.959 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 6.14271 +MetaTest/__unnamed_task__/AverageReturn 6.14271 +MetaTest/__unnamed_task__/Iteration 207 +MetaTest/__unnamed_task__/MaxReturn 101.996 +MetaTest/__unnamed_task__/MinReturn -66.0914 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 54.959 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 347800 +------------------------------------------------- ------------ +2025-04-03 03:25:18 | [pearl_trainer] epoch #208 | Training... +2025-04-03 03:26:43 | [pearl_trainer] epoch #208 | Evaluating... +2025-04-03 03:26:43 | [pearl_trainer] epoch #208 | Sampling for adapation and meta-testing... +2025-04-03 03:28:34 | [pearl_trainer] epoch #208 | Finished meta-testing... +2025-04-03 03:28:34 | [pearl_trainer] epoch #208 | Saving snapshot... +2025-04-03 03:28:36 | [pearl_trainer] epoch #208 | Saved +2025-04-03 03:28:36 | [pearl_trainer] epoch #208 | Time 49428.81 s +2025-04-03 03:28:36 | [pearl_trainer] epoch #208 | EpochTime 231.18 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 19.241 +MetaTest/Average/AverageReturn 19.241 +MetaTest/Average/Iteration 208 +MetaTest/Average/MaxReturn 88.5022 +MetaTest/Average/MinReturn -35.2067 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 43.9332 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 19.241 +MetaTest/__unnamed_task__/AverageReturn 19.241 +MetaTest/__unnamed_task__/Iteration 208 +MetaTest/__unnamed_task__/MaxReturn 88.5022 +MetaTest/__unnamed_task__/MinReturn -35.2067 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 43.9332 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 349400 +------------------------------------------------- ----------- +2025-04-03 03:29:10 | [pearl_trainer] epoch #209 | Training... +2025-04-03 03:30:42 | [pearl_trainer] epoch #209 | Evaluating... +2025-04-03 03:30:42 | [pearl_trainer] epoch #209 | Sampling for adapation and meta-testing... +2025-04-03 03:32:38 | [pearl_trainer] epoch #209 | Finished meta-testing... +2025-04-03 03:32:38 | [pearl_trainer] epoch #209 | Saving snapshot... +2025-04-03 03:32:39 | [pearl_trainer] epoch #209 | Saved +2025-04-03 03:32:39 | [pearl_trainer] epoch #209 | Time 49671.59 s +2025-04-03 03:32:39 | [pearl_trainer] epoch #209 | EpochTime 242.78 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -24.4654 +MetaTest/Average/AverageReturn -24.4654 +MetaTest/Average/Iteration 209 +MetaTest/Average/MaxReturn -12.8458 +MetaTest/Average/MinReturn -37.2841 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 9.65613 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -24.4654 +MetaTest/__unnamed_task__/AverageReturn -24.4654 +MetaTest/__unnamed_task__/Iteration 209 +MetaTest/__unnamed_task__/MaxReturn -12.8458 +MetaTest/__unnamed_task__/MinReturn -37.2841 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 9.65613 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 351000 +------------------------------------------------- ------------ +2025-04-03 03:33:11 | [pearl_trainer] epoch #210 | Training... +2025-04-03 03:34:39 | [pearl_trainer] epoch #210 | Evaluating... +2025-04-03 03:34:39 | [pearl_trainer] epoch #210 | Sampling for adapation and meta-testing... +2025-04-03 03:36:34 | [pearl_trainer] epoch #210 | Finished meta-testing... +2025-04-03 03:36:34 | [pearl_trainer] epoch #210 | Saving snapshot... +2025-04-03 03:36:35 | [pearl_trainer] epoch #210 | Saved +2025-04-03 03:36:35 | [pearl_trainer] epoch #210 | Time 49908.00 s +2025-04-03 03:36:35 | [pearl_trainer] epoch #210 | EpochTime 236.40 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 20.4553 +MetaTest/Average/AverageReturn 20.4553 +MetaTest/Average/Iteration 210 +MetaTest/Average/MaxReturn 96.743 +MetaTest/Average/MinReturn -21.4086 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 45.1557 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 20.4553 +MetaTest/__unnamed_task__/AverageReturn 20.4553 +MetaTest/__unnamed_task__/Iteration 210 +MetaTest/__unnamed_task__/MaxReturn 96.743 +MetaTest/__unnamed_task__/MinReturn -21.4086 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 45.1557 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 352600 +------------------------------------------------- ----------- +2025-04-03 03:37:08 | [pearl_trainer] epoch #211 | Training... +2025-04-03 03:38:44 | [pearl_trainer] epoch #211 | Evaluating... +2025-04-03 03:38:44 | [pearl_trainer] epoch #211 | Sampling for adapation and meta-testing... +2025-04-03 03:40:38 | [pearl_trainer] epoch #211 | Finished meta-testing... +2025-04-03 03:40:38 | [pearl_trainer] epoch #211 | Saving snapshot... +2025-04-03 03:40:39 | [pearl_trainer] epoch #211 | Saved +2025-04-03 03:40:39 | [pearl_trainer] epoch #211 | Time 50152.09 s +2025-04-03 03:40:39 | [pearl_trainer] epoch #211 | EpochTime 244.09 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 29.2549 +MetaTest/Average/AverageReturn 29.2549 +MetaTest/Average/Iteration 211 +MetaTest/Average/MaxReturn 80.579 +MetaTest/Average/MinReturn -21.3635 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 44.9113 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 29.2549 +MetaTest/__unnamed_task__/AverageReturn 29.2549 +MetaTest/__unnamed_task__/Iteration 211 +MetaTest/__unnamed_task__/MaxReturn 80.579 +MetaTest/__unnamed_task__/MinReturn -21.3635 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 44.9113 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 354200 +------------------------------------------------- ----------- +2025-04-03 03:41:14 | [pearl_trainer] epoch #212 | Training... +2025-04-03 03:42:40 | [pearl_trainer] epoch #212 | Evaluating... +2025-04-03 03:42:40 | [pearl_trainer] epoch #212 | Sampling for adapation and meta-testing... +2025-04-03 03:44:37 | [pearl_trainer] epoch #212 | Finished meta-testing... +2025-04-03 03:44:37 | [pearl_trainer] epoch #212 | Saving snapshot... +2025-04-03 03:44:38 | [pearl_trainer] epoch #212 | Saved +2025-04-03 03:44:38 | [pearl_trainer] epoch #212 | Time 50390.66 s +2025-04-03 03:44:38 | [pearl_trainer] epoch #212 | EpochTime 238.56 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 15.0804 +MetaTest/Average/AverageReturn 15.0804 +MetaTest/Average/Iteration 212 +MetaTest/Average/MaxReturn 77.8077 +MetaTest/Average/MinReturn -41.9908 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 47.9533 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 15.0804 +MetaTest/__unnamed_task__/AverageReturn 15.0804 +MetaTest/__unnamed_task__/Iteration 212 +MetaTest/__unnamed_task__/MaxReturn 77.8077 +MetaTest/__unnamed_task__/MinReturn -41.9908 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 47.9533 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 355800 +------------------------------------------------- ----------- +2025-04-03 03:45:09 | [pearl_trainer] epoch #213 | Training... +2025-04-03 03:46:44 | [pearl_trainer] epoch #213 | Evaluating... +2025-04-03 03:46:44 | [pearl_trainer] epoch #213 | Sampling for adapation and meta-testing... +2025-04-03 03:48:47 | [pearl_trainer] epoch #213 | Finished meta-testing... +2025-04-03 03:48:47 | [pearl_trainer] epoch #213 | Saving snapshot... +2025-04-03 03:48:49 | [pearl_trainer] epoch #213 | Saved +2025-04-03 03:48:49 | [pearl_trainer] epoch #213 | Time 50641.83 s +2025-04-03 03:48:49 | [pearl_trainer] epoch #213 | EpochTime 251.17 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 24.519 +MetaTest/Average/AverageReturn 24.519 +MetaTest/Average/Iteration 213 +MetaTest/Average/MaxReturn 100.562 +MetaTest/Average/MinReturn -32.0605 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 55.4262 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 24.519 +MetaTest/__unnamed_task__/AverageReturn 24.519 +MetaTest/__unnamed_task__/Iteration 213 +MetaTest/__unnamed_task__/MaxReturn 100.562 +MetaTest/__unnamed_task__/MinReturn -32.0605 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 55.4262 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 357400 +------------------------------------------------- ----------- +2025-04-03 03:49:27 | [pearl_trainer] epoch #214 | Training... +2025-04-03 03:50:56 | [pearl_trainer] epoch #214 | Evaluating... +2025-04-03 03:50:56 | [pearl_trainer] epoch #214 | Sampling for adapation and meta-testing... +2025-04-03 03:52:53 | [pearl_trainer] epoch #214 | Finished meta-testing... +2025-04-03 03:52:53 | [pearl_trainer] epoch #214 | Saving snapshot... +2025-04-03 03:52:54 | [pearl_trainer] epoch #214 | Saved +2025-04-03 03:52:54 | [pearl_trainer] epoch #214 | Time 50887.01 s +2025-04-03 03:52:54 | [pearl_trainer] epoch #214 | EpochTime 245.18 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -7.92876 +MetaTest/Average/AverageReturn -7.92876 +MetaTest/Average/Iteration 214 +MetaTest/Average/MaxReturn 72.5661 +MetaTest/Average/MinReturn -35.2509 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 40.5686 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -7.92876 +MetaTest/__unnamed_task__/AverageReturn -7.92876 +MetaTest/__unnamed_task__/Iteration 214 +MetaTest/__unnamed_task__/MaxReturn 72.5661 +MetaTest/__unnamed_task__/MinReturn -35.2509 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 40.5686 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 359000 +------------------------------------------------- ------------ +2025-04-03 03:53:26 | [pearl_trainer] epoch #215 | Training... +2025-04-03 03:54:58 | [pearl_trainer] epoch #215 | Evaluating... +2025-04-03 03:54:58 | [pearl_trainer] epoch #215 | Sampling for adapation and meta-testing... +2025-04-03 03:56:53 | [pearl_trainer] epoch #215 | Finished meta-testing... +2025-04-03 03:56:53 | [pearl_trainer] epoch #215 | Saving snapshot... +2025-04-03 03:56:55 | [pearl_trainer] epoch #215 | Saved +2025-04-03 03:56:55 | [pearl_trainer] epoch #215 | Time 51127.69 s +2025-04-03 03:56:55 | [pearl_trainer] epoch #215 | EpochTime 240.69 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 14.808 +MetaTest/Average/AverageReturn 14.808 +MetaTest/Average/Iteration 215 +MetaTest/Average/MaxReturn 132.005 +MetaTest/Average/MinReturn -21.9412 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 59.4856 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 14.808 +MetaTest/__unnamed_task__/AverageReturn 14.808 +MetaTest/__unnamed_task__/Iteration 215 +MetaTest/__unnamed_task__/MaxReturn 132.005 +MetaTest/__unnamed_task__/MinReturn -21.9412 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 59.4856 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 360600 +------------------------------------------------- ----------- +2025-04-03 03:57:27 | [pearl_trainer] epoch #216 | Training... +2025-04-03 03:59:00 | [pearl_trainer] epoch #216 | Evaluating... +2025-04-03 03:59:00 | [pearl_trainer] epoch #216 | Sampling for adapation and meta-testing... +2025-04-03 04:00:52 | [pearl_trainer] epoch #216 | Finished meta-testing... +2025-04-03 04:00:52 | [pearl_trainer] epoch #216 | Saving snapshot... +2025-04-03 04:00:53 | [pearl_trainer] epoch #216 | Saved +2025-04-03 04:00:53 | [pearl_trainer] epoch #216 | Time 51366.14 s +2025-04-03 04:00:53 | [pearl_trainer] epoch #216 | EpochTime 238.44 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -10.7377 +MetaTest/Average/AverageReturn -10.7377 +MetaTest/Average/Iteration 216 +MetaTest/Average/MaxReturn 19.3817 +MetaTest/Average/MinReturn -26.4059 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 16.1759 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -10.7377 +MetaTest/__unnamed_task__/AverageReturn -10.7377 +MetaTest/__unnamed_task__/Iteration 216 +MetaTest/__unnamed_task__/MaxReturn 19.3817 +MetaTest/__unnamed_task__/MinReturn -26.4059 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 16.1759 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 362200 +------------------------------------------------- ----------- +2025-04-03 04:01:27 | [pearl_trainer] epoch #217 | Training... +2025-04-03 04:03:04 | [pearl_trainer] epoch #217 | Evaluating... +2025-04-03 04:03:04 | [pearl_trainer] epoch #217 | Sampling for adapation and meta-testing... +2025-04-03 04:05:00 | [pearl_trainer] epoch #217 | Finished meta-testing... +2025-04-03 04:05:00 | [pearl_trainer] epoch #217 | Saving snapshot... +2025-04-03 04:05:01 | [pearl_trainer] epoch #217 | Saved +2025-04-03 04:05:01 | [pearl_trainer] epoch #217 | Time 51614.11 s +2025-04-03 04:05:01 | [pearl_trainer] epoch #217 | EpochTime 247.96 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 32.8805 +MetaTest/Average/AverageReturn 32.8805 +MetaTest/Average/Iteration 217 +MetaTest/Average/MaxReturn 72.9222 +MetaTest/Average/MinReturn -12.8016 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 34.1938 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 32.8805 +MetaTest/__unnamed_task__/AverageReturn 32.8805 +MetaTest/__unnamed_task__/Iteration 217 +MetaTest/__unnamed_task__/MaxReturn 72.9222 +MetaTest/__unnamed_task__/MinReturn -12.8016 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 34.1938 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 363800 +------------------------------------------------- ----------- +2025-04-03 04:05:33 | [pearl_trainer] epoch #218 | Training... +2025-04-03 04:06:56 | [pearl_trainer] epoch #218 | Evaluating... +2025-04-03 04:06:56 | [pearl_trainer] epoch #218 | Sampling for adapation and meta-testing... +2025-04-03 04:08:53 | [pearl_trainer] epoch #218 | Finished meta-testing... +2025-04-03 04:08:53 | [pearl_trainer] epoch #218 | Saving snapshot... +2025-04-03 04:08:54 | [pearl_trainer] epoch #218 | Saved +2025-04-03 04:08:54 | [pearl_trainer] epoch #218 | Time 51847.42 s +2025-04-03 04:08:54 | [pearl_trainer] epoch #218 | EpochTime 233.32 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 20.4773 +MetaTest/Average/AverageReturn 20.4773 +MetaTest/Average/Iteration 218 +MetaTest/Average/MaxReturn 115.285 +MetaTest/Average/MinReturn -77.1139 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 73.3181 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 20.4773 +MetaTest/__unnamed_task__/AverageReturn 20.4773 +MetaTest/__unnamed_task__/Iteration 218 +MetaTest/__unnamed_task__/MaxReturn 115.285 +MetaTest/__unnamed_task__/MinReturn -77.1139 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 73.3181 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 365400 +------------------------------------------------- ----------- +2025-04-03 04:09:27 | [pearl_trainer] epoch #219 | Training... +2025-04-03 04:11:05 | [pearl_trainer] epoch #219 | Evaluating... +2025-04-03 04:11:05 | [pearl_trainer] epoch #219 | Sampling for adapation and meta-testing... +2025-04-03 04:12:56 | [pearl_trainer] epoch #219 | Finished meta-testing... +2025-04-03 04:12:56 | [pearl_trainer] epoch #219 | Saving snapshot... +2025-04-03 04:12:57 | [pearl_trainer] epoch #219 | Saved +2025-04-03 04:12:57 | [pearl_trainer] epoch #219 | Time 52090.13 s +2025-04-03 04:12:57 | [pearl_trainer] epoch #219 | EpochTime 242.71 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -8.25358 +MetaTest/Average/AverageReturn -8.25358 +MetaTest/Average/Iteration 219 +MetaTest/Average/MaxReturn 110.006 +MetaTest/Average/MinReturn -105.028 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 69.1002 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -8.25358 +MetaTest/__unnamed_task__/AverageReturn -8.25358 +MetaTest/__unnamed_task__/Iteration 219 +MetaTest/__unnamed_task__/MaxReturn 110.006 +MetaTest/__unnamed_task__/MinReturn -105.028 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 69.1002 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 367000 +------------------------------------------------- ------------ +2025-04-03 04:13:32 | [pearl_trainer] epoch #220 | Training... +2025-04-03 04:14:56 | [pearl_trainer] epoch #220 | Evaluating... +2025-04-03 04:14:56 | [pearl_trainer] epoch #220 | Sampling for adapation and meta-testing... +2025-04-03 04:16:56 | [pearl_trainer] epoch #220 | Finished meta-testing... +2025-04-03 04:16:56 | [pearl_trainer] epoch #220 | Saving snapshot... +2025-04-03 04:16:58 | [pearl_trainer] epoch #220 | Saved +2025-04-03 04:16:58 | [pearl_trainer] epoch #220 | Time 52330.58 s +2025-04-03 04:16:58 | [pearl_trainer] epoch #220 | EpochTime 240.44 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 32.0682 +MetaTest/Average/AverageReturn 32.0682 +MetaTest/Average/Iteration 220 +MetaTest/Average/MaxReturn 99.6084 +MetaTest/Average/MinReturn -52.9136 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 65.6214 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 32.0682 +MetaTest/__unnamed_task__/AverageReturn 32.0682 +MetaTest/__unnamed_task__/Iteration 220 +MetaTest/__unnamed_task__/MaxReturn 99.6084 +MetaTest/__unnamed_task__/MinReturn -52.9136 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 65.6214 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 368600 +------------------------------------------------- ----------- +2025-04-03 04:17:31 | [pearl_trainer] epoch #221 | Training... +2025-04-03 04:19:06 | [pearl_trainer] epoch #221 | Evaluating... +2025-04-03 04:19:06 | [pearl_trainer] epoch #221 | Sampling for adapation and meta-testing... +2025-04-03 04:21:02 | [pearl_trainer] epoch #221 | Finished meta-testing... +2025-04-03 04:21:02 | [pearl_trainer] epoch #221 | Saving snapshot... +2025-04-03 04:21:03 | [pearl_trainer] epoch #221 | Saved +2025-04-03 04:21:03 | [pearl_trainer] epoch #221 | Time 52576.41 s +2025-04-03 04:21:03 | [pearl_trainer] epoch #221 | EpochTime 245.84 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -17.4233 +MetaTest/Average/AverageReturn -17.4233 +MetaTest/Average/Iteration 221 +MetaTest/Average/MaxReturn 64.9431 +MetaTest/Average/MinReturn -54.8253 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 43.4464 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -17.4233 +MetaTest/__unnamed_task__/AverageReturn -17.4233 +MetaTest/__unnamed_task__/Iteration 221 +MetaTest/__unnamed_task__/MaxReturn 64.9431 +MetaTest/__unnamed_task__/MinReturn -54.8253 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 43.4464 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 370200 +------------------------------------------------- ----------- +2025-04-03 04:21:36 | [pearl_trainer] epoch #222 | Training... +2025-04-03 04:23:00 | [pearl_trainer] epoch #222 | Evaluating... +2025-04-03 04:23:00 | [pearl_trainer] epoch #222 | Sampling for adapation and meta-testing... +2025-04-03 04:24:56 | [pearl_trainer] epoch #222 | Finished meta-testing... +2025-04-03 04:24:56 | [pearl_trainer] epoch #222 | Saving snapshot... +2025-04-03 04:24:57 | [pearl_trainer] epoch #222 | Saved +2025-04-03 04:24:57 | [pearl_trainer] epoch #222 | Time 52810.18 s +2025-04-03 04:24:57 | [pearl_trainer] epoch #222 | EpochTime 233.76 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 32.21 +MetaTest/Average/AverageReturn 32.21 +MetaTest/Average/Iteration 222 +MetaTest/Average/MaxReturn 75.5538 +MetaTest/Average/MinReturn -6.95281 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 28.2588 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 32.21 +MetaTest/__unnamed_task__/AverageReturn 32.21 +MetaTest/__unnamed_task__/Iteration 222 +MetaTest/__unnamed_task__/MaxReturn 75.5538 +MetaTest/__unnamed_task__/MinReturn -6.95281 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 28.2588 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 371800 +------------------------------------------------- ------------ +2025-04-03 04:25:29 | [pearl_trainer] epoch #223 | Training... +2025-04-03 04:26:57 | [pearl_trainer] epoch #223 | Evaluating... +2025-04-03 04:26:57 | [pearl_trainer] epoch #223 | Sampling for adapation and meta-testing... +2025-04-03 04:28:50 | [pearl_trainer] epoch #223 | Finished meta-testing... +2025-04-03 04:28:50 | [pearl_trainer] epoch #223 | Saving snapshot... +2025-04-03 04:28:52 | [pearl_trainer] epoch #223 | Saved +2025-04-03 04:28:52 | [pearl_trainer] epoch #223 | Time 53044.66 s +2025-04-03 04:28:52 | [pearl_trainer] epoch #223 | EpochTime 234.48 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -13.9319 +MetaTest/Average/AverageReturn -13.9319 +MetaTest/Average/Iteration 223 +MetaTest/Average/MaxReturn 18.0171 +MetaTest/Average/MinReturn -41.3533 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 20.3774 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -13.9319 +MetaTest/__unnamed_task__/AverageReturn -13.9319 +MetaTest/__unnamed_task__/Iteration 223 +MetaTest/__unnamed_task__/MaxReturn 18.0171 +MetaTest/__unnamed_task__/MinReturn -41.3533 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 20.3774 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 373400 +------------------------------------------------- ----------- +2025-04-03 04:29:25 | [pearl_trainer] epoch #224 | Training... +2025-04-03 04:30:46 | [pearl_trainer] epoch #224 | Evaluating... +2025-04-03 04:30:46 | [pearl_trainer] epoch #224 | Sampling for adapation and meta-testing... +2025-04-03 04:32:39 | [pearl_trainer] epoch #224 | Finished meta-testing... +2025-04-03 04:32:39 | [pearl_trainer] epoch #224 | Saving snapshot... +2025-04-03 04:32:40 | [pearl_trainer] epoch #224 | Saved +2025-04-03 04:32:40 | [pearl_trainer] epoch #224 | Time 53273.44 s +2025-04-03 04:32:40 | [pearl_trainer] epoch #224 | EpochTime 228.77 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 13.6444 +MetaTest/Average/AverageReturn 13.6444 +MetaTest/Average/Iteration 224 +MetaTest/Average/MaxReturn 63.9132 +MetaTest/Average/MinReturn -26.4223 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 29.0937 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 13.6444 +MetaTest/__unnamed_task__/AverageReturn 13.6444 +MetaTest/__unnamed_task__/Iteration 224 +MetaTest/__unnamed_task__/MaxReturn 63.9132 +MetaTest/__unnamed_task__/MinReturn -26.4223 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 29.0937 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 375000 +------------------------------------------------- ----------- +2025-04-03 04:33:12 | [pearl_trainer] epoch #225 | Training... +2025-04-03 04:34:44 | [pearl_trainer] epoch #225 | Evaluating... +2025-04-03 04:34:44 | [pearl_trainer] epoch #225 | Sampling for adapation and meta-testing... +2025-04-03 04:36:40 | [pearl_trainer] epoch #225 | Finished meta-testing... +2025-04-03 04:36:40 | [pearl_trainer] epoch #225 | Saving snapshot... +2025-04-03 04:36:41 | [pearl_trainer] epoch #225 | Saved +2025-04-03 04:36:41 | [pearl_trainer] epoch #225 | Time 53514.16 s +2025-04-03 04:36:41 | [pearl_trainer] epoch #225 | EpochTime 240.72 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -21.2373 +MetaTest/Average/AverageReturn -21.2373 +MetaTest/Average/Iteration 225 +MetaTest/Average/MaxReturn 58.2024 +MetaTest/Average/MinReturn -91.3463 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 47.9701 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -21.2373 +MetaTest/__unnamed_task__/AverageReturn -21.2373 +MetaTest/__unnamed_task__/Iteration 225 +MetaTest/__unnamed_task__/MaxReturn 58.2024 +MetaTest/__unnamed_task__/MinReturn -91.3463 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 47.9701 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 376600 +------------------------------------------------- ----------- +2025-04-03 04:37:14 | [pearl_trainer] epoch #226 | Training... +2025-04-03 04:38:39 | [pearl_trainer] epoch #226 | Evaluating... +2025-04-03 04:38:39 | [pearl_trainer] epoch #226 | Sampling for adapation and meta-testing... +2025-04-03 04:40:37 | [pearl_trainer] epoch #226 | Finished meta-testing... +2025-04-03 04:40:37 | [pearl_trainer] epoch #226 | Saving snapshot... +2025-04-03 04:40:38 | [pearl_trainer] epoch #226 | Saved +2025-04-03 04:40:38 | [pearl_trainer] epoch #226 | Time 53750.65 s +2025-04-03 04:40:38 | [pearl_trainer] epoch #226 | EpochTime 236.49 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -9.50977 +MetaTest/Average/AverageReturn -9.50977 +MetaTest/Average/Iteration 226 +MetaTest/Average/MaxReturn 13.5584 +MetaTest/Average/MinReturn -35.9201 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 16.3627 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -9.50977 +MetaTest/__unnamed_task__/AverageReturn -9.50977 +MetaTest/__unnamed_task__/Iteration 226 +MetaTest/__unnamed_task__/MaxReturn 13.5584 +MetaTest/__unnamed_task__/MinReturn -35.9201 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 16.3627 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 378200 +------------------------------------------------- ------------ +2025-04-03 04:41:08 | [pearl_trainer] epoch #227 | Training... +2025-04-03 04:42:40 | [pearl_trainer] epoch #227 | Evaluating... +2025-04-03 04:42:40 | [pearl_trainer] epoch #227 | Sampling for adapation and meta-testing... +2025-04-03 04:44:32 | [pearl_trainer] epoch #227 | Finished meta-testing... +2025-04-03 04:44:32 | [pearl_trainer] epoch #227 | Saving snapshot... +2025-04-03 04:44:33 | [pearl_trainer] epoch #227 | Saved +2025-04-03 04:44:33 | [pearl_trainer] epoch #227 | Time 53986.21 s +2025-04-03 04:44:33 | [pearl_trainer] epoch #227 | EpochTime 235.56 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 8.07044 +MetaTest/Average/AverageReturn 8.07044 +MetaTest/Average/Iteration 227 +MetaTest/Average/MaxReturn 50.556 +MetaTest/Average/MinReturn -30.0131 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 27.6177 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 8.07044 +MetaTest/__unnamed_task__/AverageReturn 8.07044 +MetaTest/__unnamed_task__/Iteration 227 +MetaTest/__unnamed_task__/MaxReturn 50.556 +MetaTest/__unnamed_task__/MinReturn -30.0131 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 27.6177 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 379800 +------------------------------------------------- ------------ +2025-04-03 04:45:06 | [pearl_trainer] epoch #228 | Training... +2025-04-03 04:46:38 | [pearl_trainer] epoch #228 | Evaluating... +2025-04-03 04:46:38 | [pearl_trainer] epoch #228 | Sampling for adapation and meta-testing... +2025-04-03 04:48:38 | [pearl_trainer] epoch #228 | Finished meta-testing... +2025-04-03 04:48:38 | [pearl_trainer] epoch #228 | Saving snapshot... +2025-04-03 04:48:39 | [pearl_trainer] epoch #228 | Saved +2025-04-03 04:48:39 | [pearl_trainer] epoch #228 | Time 54232.32 s +2025-04-03 04:48:39 | [pearl_trainer] epoch #228 | EpochTime 246.10 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 4.9689 +MetaTest/Average/AverageReturn 4.9689 +MetaTest/Average/Iteration 228 +MetaTest/Average/MaxReturn 106.91 +MetaTest/Average/MinReturn -84.5935 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 78.8277 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 4.9689 +MetaTest/__unnamed_task__/AverageReturn 4.9689 +MetaTest/__unnamed_task__/Iteration 228 +MetaTest/__unnamed_task__/MaxReturn 106.91 +MetaTest/__unnamed_task__/MinReturn -84.5935 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 78.8277 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 381400 +------------------------------------------------- ----------- +2025-04-03 04:49:12 | [pearl_trainer] epoch #229 | Training... +2025-04-03 04:50:42 | [pearl_trainer] epoch #229 | Evaluating... +2025-04-03 04:50:42 | [pearl_trainer] epoch #229 | Sampling for adapation and meta-testing... +2025-04-03 04:52:35 | [pearl_trainer] epoch #229 | Finished meta-testing... +2025-04-03 04:52:35 | [pearl_trainer] epoch #229 | Saving snapshot... +2025-04-03 04:52:36 | [pearl_trainer] epoch #229 | Saved +2025-04-03 04:52:36 | [pearl_trainer] epoch #229 | Time 54469.01 s +2025-04-03 04:52:36 | [pearl_trainer] epoch #229 | EpochTime 236.69 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -20.5867 +MetaTest/Average/AverageReturn -20.5867 +MetaTest/Average/Iteration 229 +MetaTest/Average/MaxReturn 90.0229 +MetaTest/Average/MinReturn -77.3924 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 67.152 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -20.5867 +MetaTest/__unnamed_task__/AverageReturn -20.5867 +MetaTest/__unnamed_task__/Iteration 229 +MetaTest/__unnamed_task__/MaxReturn 90.0229 +MetaTest/__unnamed_task__/MinReturn -77.3924 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 67.152 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 383000 +------------------------------------------------- ----------- +2025-04-03 04:53:08 | [pearl_trainer] epoch #230 | Training... +2025-04-03 04:54:42 | [pearl_trainer] epoch #230 | Evaluating... +2025-04-03 04:54:42 | [pearl_trainer] epoch #230 | Sampling for adapation and meta-testing... +2025-04-03 04:56:33 | [pearl_trainer] epoch #230 | Finished meta-testing... +2025-04-03 04:56:33 | [pearl_trainer] epoch #230 | Saving snapshot... +2025-04-03 04:56:35 | [pearl_trainer] epoch #230 | Saved +2025-04-03 04:56:35 | [pearl_trainer] epoch #230 | Time 54707.87 s +2025-04-03 04:56:35 | [pearl_trainer] epoch #230 | EpochTime 238.86 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -88.0745 +MetaTest/Average/AverageReturn -88.0745 +MetaTest/Average/Iteration 230 +MetaTest/Average/MaxReturn -72.1954 +MetaTest/Average/MinReturn -94.3677 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 8.2174 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -88.0745 +MetaTest/__unnamed_task__/AverageReturn -88.0745 +MetaTest/__unnamed_task__/Iteration 230 +MetaTest/__unnamed_task__/MaxReturn -72.1954 +MetaTest/__unnamed_task__/MinReturn -94.3677 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 8.2174 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 384600 +------------------------------------------------- ----------- +2025-04-03 04:57:06 | [pearl_trainer] epoch #231 | Training... +2025-04-03 04:58:39 | [pearl_trainer] epoch #231 | Evaluating... +2025-04-03 04:58:39 | [pearl_trainer] epoch #231 | Sampling for adapation and meta-testing... +2025-04-03 05:00:30 | [pearl_trainer] epoch #231 | Finished meta-testing... +2025-04-03 05:00:30 | [pearl_trainer] epoch #231 | Saving snapshot... +2025-04-03 05:00:31 | [pearl_trainer] epoch #231 | Saved +2025-04-03 05:00:31 | [pearl_trainer] epoch #231 | Time 54943.75 s +2025-04-03 05:00:31 | [pearl_trainer] epoch #231 | EpochTime 235.88 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -65.0364 +MetaTest/Average/AverageReturn -65.0364 +MetaTest/Average/Iteration 231 +MetaTest/Average/MaxReturn -55.056 +MetaTest/Average/MinReturn -73.1618 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 7.90654 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -65.0364 +MetaTest/__unnamed_task__/AverageReturn -65.0364 +MetaTest/__unnamed_task__/Iteration 231 +MetaTest/__unnamed_task__/MaxReturn -55.056 +MetaTest/__unnamed_task__/MinReturn -73.1618 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 7.90654 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 386200 +------------------------------------------------- ------------ +2025-04-03 05:01:03 | [pearl_trainer] epoch #232 | Training... +2025-04-03 05:02:38 | [pearl_trainer] epoch #232 | Evaluating... +2025-04-03 05:02:38 | [pearl_trainer] epoch #232 | Sampling for adapation and meta-testing... +2025-04-03 05:04:36 | [pearl_trainer] epoch #232 | Finished meta-testing... +2025-04-03 05:04:36 | [pearl_trainer] epoch #232 | Saving snapshot... +2025-04-03 05:04:37 | [pearl_trainer] epoch #232 | Saved +2025-04-03 05:04:37 | [pearl_trainer] epoch #232 | Time 55190.19 s +2025-04-03 05:04:37 | [pearl_trainer] epoch #232 | EpochTime 246.43 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -71.2021 +MetaTest/Average/AverageReturn -71.2021 +MetaTest/Average/Iteration 232 +MetaTest/Average/MaxReturn -49.1848 +MetaTest/Average/MinReturn -88.8565 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 15.2872 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -71.2021 +MetaTest/__unnamed_task__/AverageReturn -71.2021 +MetaTest/__unnamed_task__/Iteration 232 +MetaTest/__unnamed_task__/MaxReturn -49.1848 +MetaTest/__unnamed_task__/MinReturn -88.8565 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 15.2872 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 387800 +------------------------------------------------- ----------- +2025-04-03 05:05:09 | [pearl_trainer] epoch #233 | Training... +2025-04-03 05:06:34 | [pearl_trainer] epoch #233 | Evaluating... +2025-04-03 05:06:34 | [pearl_trainer] epoch #233 | Sampling for adapation and meta-testing... +2025-04-03 05:08:26 | [pearl_trainer] epoch #233 | Finished meta-testing... +2025-04-03 05:08:26 | [pearl_trainer] epoch #233 | Saving snapshot... +2025-04-03 05:08:28 | [pearl_trainer] epoch #233 | Saved +2025-04-03 05:08:28 | [pearl_trainer] epoch #233 | Time 55420.92 s +2025-04-03 05:08:28 | [pearl_trainer] epoch #233 | EpochTime 230.73 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -57.8782 +MetaTest/Average/AverageReturn -57.8782 +MetaTest/Average/Iteration 233 +MetaTest/Average/MaxReturn -44.1252 +MetaTest/Average/MinReturn -72.5995 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 11.0794 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -57.8782 +MetaTest/__unnamed_task__/AverageReturn -57.8782 +MetaTest/__unnamed_task__/Iteration 233 +MetaTest/__unnamed_task__/MaxReturn -44.1252 +MetaTest/__unnamed_task__/MinReturn -72.5995 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 11.0794 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 389400 +------------------------------------------------- ----------- +2025-04-03 05:09:00 | [pearl_trainer] epoch #234 | Training... +2025-04-03 05:10:32 | [pearl_trainer] epoch #234 | Evaluating... +2025-04-03 05:10:32 | [pearl_trainer] epoch #234 | Sampling for adapation and meta-testing... +2025-04-03 05:12:28 | [pearl_trainer] epoch #234 | Finished meta-testing... +2025-04-03 05:12:28 | [pearl_trainer] epoch #234 | Saving snapshot... +2025-04-03 05:12:29 | [pearl_trainer] epoch #234 | Saved +2025-04-03 05:12:29 | [pearl_trainer] epoch #234 | Time 55662.38 s +2025-04-03 05:12:29 | [pearl_trainer] epoch #234 | EpochTime 241.46 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -51.2427 +MetaTest/Average/AverageReturn -51.2427 +MetaTest/Average/Iteration 234 +MetaTest/Average/MaxReturn -30.484 +MetaTest/Average/MinReturn -68.9014 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 14.4842 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -51.2427 +MetaTest/__unnamed_task__/AverageReturn -51.2427 +MetaTest/__unnamed_task__/Iteration 234 +MetaTest/__unnamed_task__/MaxReturn -30.484 +MetaTest/__unnamed_task__/MinReturn -68.9014 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 14.4842 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 391000 +------------------------------------------------- ----------- +2025-04-03 05:13:01 | [pearl_trainer] epoch #235 | Training... +2025-04-03 05:14:27 | [pearl_trainer] epoch #235 | Evaluating... +2025-04-03 05:14:27 | [pearl_trainer] epoch #235 | Sampling for adapation and meta-testing... +2025-04-03 05:16:24 | [pearl_trainer] epoch #235 | Finished meta-testing... +2025-04-03 05:16:24 | [pearl_trainer] epoch #235 | Saving snapshot... +2025-04-03 05:16:25 | [pearl_trainer] epoch #235 | Saved +2025-04-03 05:16:25 | [pearl_trainer] epoch #235 | Time 55898.19 s +2025-04-03 05:16:25 | [pearl_trainer] epoch #235 | EpochTime 235.82 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -36.7749 +MetaTest/Average/AverageReturn -36.7749 +MetaTest/Average/Iteration 235 +MetaTest/Average/MaxReturn -29.1759 +MetaTest/Average/MinReturn -58.3712 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 10.9887 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -36.7749 +MetaTest/__unnamed_task__/AverageReturn -36.7749 +MetaTest/__unnamed_task__/Iteration 235 +MetaTest/__unnamed_task__/MaxReturn -29.1759 +MetaTest/__unnamed_task__/MinReturn -58.3712 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 10.9887 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 392600 +------------------------------------------------- ----------- +2025-04-03 05:17:00 | [pearl_trainer] epoch #236 | Training... +2025-04-03 05:18:52 | [pearl_trainer] epoch #236 | Evaluating... +2025-04-03 05:18:52 | [pearl_trainer] epoch #236 | Sampling for adapation and meta-testing... +2025-04-03 05:20:48 | [pearl_trainer] epoch #236 | Finished meta-testing... +2025-04-03 05:20:48 | [pearl_trainer] epoch #236 | Saving snapshot... +2025-04-03 05:20:49 | [pearl_trainer] epoch #236 | Saved +2025-04-03 05:20:49 | [pearl_trainer] epoch #236 | Time 56161.79 s +2025-04-03 05:20:49 | [pearl_trainer] epoch #236 | EpochTime 263.60 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -26.1352 +MetaTest/Average/AverageReturn -26.1352 +MetaTest/Average/Iteration 236 +MetaTest/Average/MaxReturn -21.0274 +MetaTest/Average/MinReturn -31.4245 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 4.41496 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -26.1352 +MetaTest/__unnamed_task__/AverageReturn -26.1352 +MetaTest/__unnamed_task__/Iteration 236 +MetaTest/__unnamed_task__/MaxReturn -21.0274 +MetaTest/__unnamed_task__/MinReturn -31.4245 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 4.41496 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 394200 +------------------------------------------------- ------------ +2025-04-03 05:21:22 | [pearl_trainer] epoch #237 | Training... +2025-04-03 05:22:45 | [pearl_trainer] epoch #237 | Evaluating... +2025-04-03 05:22:45 | [pearl_trainer] epoch #237 | Sampling for adapation and meta-testing... +2025-04-03 05:24:39 | [pearl_trainer] epoch #237 | Finished meta-testing... +2025-04-03 05:24:39 | [pearl_trainer] epoch #237 | Saving snapshot... +2025-04-03 05:24:40 | [pearl_trainer] epoch #237 | Saved +2025-04-03 05:24:40 | [pearl_trainer] epoch #237 | Time 56393.46 s +2025-04-03 05:24:40 | [pearl_trainer] epoch #237 | EpochTime 231.67 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -21.48 +MetaTest/Average/AverageReturn -21.48 +MetaTest/Average/Iteration 237 +MetaTest/Average/MaxReturn 3.28282 +MetaTest/Average/MinReturn -32.188 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 12.6096 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -21.48 +MetaTest/__unnamed_task__/AverageReturn -21.48 +MetaTest/__unnamed_task__/Iteration 237 +MetaTest/__unnamed_task__/MaxReturn 3.28282 +MetaTest/__unnamed_task__/MinReturn -32.188 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 12.6096 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 395800 +------------------------------------------------- ------------ +2025-04-03 05:25:11 | [pearl_trainer] epoch #238 | Training... +2025-04-03 05:26:41 | [pearl_trainer] epoch #238 | Evaluating... +2025-04-03 05:26:41 | [pearl_trainer] epoch #238 | Sampling for adapation and meta-testing... +2025-04-03 05:28:35 | [pearl_trainer] epoch #238 | Finished meta-testing... +2025-04-03 05:28:35 | [pearl_trainer] epoch #238 | Saving snapshot... +2025-04-03 05:28:36 | [pearl_trainer] epoch #238 | Saved +2025-04-03 05:28:36 | [pearl_trainer] epoch #238 | Time 56629.26 s +2025-04-03 05:28:36 | [pearl_trainer] epoch #238 | EpochTime 235.80 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 17.9586 +MetaTest/Average/AverageReturn 17.9586 +MetaTest/Average/Iteration 238 +MetaTest/Average/MaxReturn 102.091 +MetaTest/Average/MinReturn -37.9045 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 54.6846 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 17.9586 +MetaTest/__unnamed_task__/AverageReturn 17.9586 +MetaTest/__unnamed_task__/Iteration 238 +MetaTest/__unnamed_task__/MaxReturn 102.091 +MetaTest/__unnamed_task__/MinReturn -37.9045 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 54.6846 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 397400 +------------------------------------------------- ----------- +2025-04-03 05:29:09 | [pearl_trainer] epoch #239 | Training... +2025-04-03 05:30:35 | [pearl_trainer] epoch #239 | Evaluating... +2025-04-03 05:30:35 | [pearl_trainer] epoch #239 | Sampling for adapation and meta-testing... +2025-04-03 05:32:33 | [pearl_trainer] epoch #239 | Finished meta-testing... +2025-04-03 05:32:33 | [pearl_trainer] epoch #239 | Saving snapshot... +2025-04-03 05:32:34 | [pearl_trainer] epoch #239 | Saved +2025-04-03 05:32:34 | [pearl_trainer] epoch #239 | Time 56867.26 s +2025-04-03 05:32:34 | [pearl_trainer] epoch #239 | EpochTime 237.99 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -14.2641 +MetaTest/Average/AverageReturn -14.2641 +MetaTest/Average/Iteration 239 +MetaTest/Average/MaxReturn 31.8617 +MetaTest/Average/MinReturn -30.7641 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 23.2091 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -14.2641 +MetaTest/__unnamed_task__/AverageReturn -14.2641 +MetaTest/__unnamed_task__/Iteration 239 +MetaTest/__unnamed_task__/MaxReturn 31.8617 +MetaTest/__unnamed_task__/MinReturn -30.7641 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 23.2091 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 399000 +------------------------------------------------- ----------- +2025-04-03 05:33:06 | [pearl_trainer] epoch #240 | Training... +2025-04-03 05:34:43 | [pearl_trainer] epoch #240 | Evaluating... +2025-04-03 05:34:43 | [pearl_trainer] epoch #240 | Sampling for adapation and meta-testing... +2025-04-03 05:36:36 | [pearl_trainer] epoch #240 | Finished meta-testing... +2025-04-03 05:36:36 | [pearl_trainer] epoch #240 | Saving snapshot... +2025-04-03 05:36:37 | [pearl_trainer] epoch #240 | Saved +2025-04-03 05:36:37 | [pearl_trainer] epoch #240 | Time 57110.41 s +2025-04-03 05:36:37 | [pearl_trainer] epoch #240 | EpochTime 243.15 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 12.2297 +MetaTest/Average/AverageReturn 12.2297 +MetaTest/Average/Iteration 240 +MetaTest/Average/MaxReturn 65.2053 +MetaTest/Average/MinReturn -12.4389 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 27.6651 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 12.2297 +MetaTest/__unnamed_task__/AverageReturn 12.2297 +MetaTest/__unnamed_task__/Iteration 240 +MetaTest/__unnamed_task__/MaxReturn 65.2053 +MetaTest/__unnamed_task__/MinReturn -12.4389 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 27.6651 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 400600 +------------------------------------------------- ----------- +2025-04-03 05:37:12 | [pearl_trainer] epoch #241 | Training... +2025-04-03 05:38:47 | [pearl_trainer] epoch #241 | Evaluating... +2025-04-03 05:38:47 | [pearl_trainer] epoch #241 | Sampling for adapation and meta-testing... +2025-04-03 05:40:44 | [pearl_trainer] epoch #241 | Finished meta-testing... +2025-04-03 05:40:44 | [pearl_trainer] epoch #241 | Saving snapshot... +2025-04-03 05:40:45 | [pearl_trainer] epoch #241 | Saved +2025-04-03 05:40:45 | [pearl_trainer] epoch #241 | Time 57357.68 s +2025-04-03 05:40:45 | [pearl_trainer] epoch #241 | EpochTime 247.26 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -6.47682 +MetaTest/Average/AverageReturn -6.47682 +MetaTest/Average/Iteration 241 +MetaTest/Average/MaxReturn 75.2802 +MetaTest/Average/MinReturn -98.1808 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 65.5935 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -6.47682 +MetaTest/__unnamed_task__/AverageReturn -6.47682 +MetaTest/__unnamed_task__/Iteration 241 +MetaTest/__unnamed_task__/MaxReturn 75.2802 +MetaTest/__unnamed_task__/MinReturn -98.1808 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 65.5935 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 402200 +------------------------------------------------- ------------ +2025-04-03 05:41:16 | [pearl_trainer] epoch #242 | Training... +2025-04-03 05:42:53 | [pearl_trainer] epoch #242 | Evaluating... +2025-04-03 05:42:53 | [pearl_trainer] epoch #242 | Sampling for adapation and meta-testing... +2025-04-03 05:44:43 | [pearl_trainer] epoch #242 | Finished meta-testing... +2025-04-03 05:44:43 | [pearl_trainer] epoch #242 | Saving snapshot... +2025-04-03 05:44:44 | [pearl_trainer] epoch #242 | Saved +2025-04-03 05:44:44 | [pearl_trainer] epoch #242 | Time 57597.21 s +2025-04-03 05:44:44 | [pearl_trainer] epoch #242 | EpochTime 239.53 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 46.4458 +MetaTest/Average/AverageReturn 46.4458 +MetaTest/Average/Iteration 242 +MetaTest/Average/MaxReturn 87.2501 +MetaTest/Average/MinReturn -44.5674 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 46.809 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 46.4458 +MetaTest/__unnamed_task__/AverageReturn 46.4458 +MetaTest/__unnamed_task__/Iteration 242 +MetaTest/__unnamed_task__/MaxReturn 87.2501 +MetaTest/__unnamed_task__/MinReturn -44.5674 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 46.809 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 403800 +------------------------------------------------- ----------- +2025-04-03 05:45:16 | [pearl_trainer] epoch #243 | Training... +2025-04-03 05:46:43 | [pearl_trainer] epoch #243 | Evaluating... +2025-04-03 05:46:43 | [pearl_trainer] epoch #243 | Sampling for adapation and meta-testing... +2025-04-03 05:48:38 | [pearl_trainer] epoch #243 | Finished meta-testing... +2025-04-03 05:48:38 | [pearl_trainer] epoch #243 | Saving snapshot... +2025-04-03 05:48:39 | [pearl_trainer] epoch #243 | Saved +2025-04-03 05:48:39 | [pearl_trainer] epoch #243 | Time 57832.48 s +2025-04-03 05:48:39 | [pearl_trainer] epoch #243 | EpochTime 235.27 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 5.73869 +MetaTest/Average/AverageReturn 5.73869 +MetaTest/Average/Iteration 243 +MetaTest/Average/MaxReturn 102.757 +MetaTest/Average/MinReturn -86.2036 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 75.8821 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 5.73869 +MetaTest/__unnamed_task__/AverageReturn 5.73869 +MetaTest/__unnamed_task__/Iteration 243 +MetaTest/__unnamed_task__/MaxReturn 102.757 +MetaTest/__unnamed_task__/MinReturn -86.2036 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 75.8821 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 405400 +------------------------------------------------- ------------ +2025-04-03 05:49:12 | [pearl_trainer] epoch #244 | Training... +2025-04-03 05:50:51 | [pearl_trainer] epoch #244 | Evaluating... +2025-04-03 05:50:51 | [pearl_trainer] epoch #244 | Sampling for adapation and meta-testing... +2025-04-03 05:52:48 | [pearl_trainer] epoch #244 | Finished meta-testing... +2025-04-03 05:52:48 | [pearl_trainer] epoch #244 | Saving snapshot... +2025-04-03 05:52:49 | [pearl_trainer] epoch #244 | Saved +2025-04-03 05:52:49 | [pearl_trainer] epoch #244 | Time 58081.79 s +2025-04-03 05:52:49 | [pearl_trainer] epoch #244 | EpochTime 249.31 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -18.1567 +MetaTest/Average/AverageReturn -18.1567 +MetaTest/Average/Iteration 244 +MetaTest/Average/MaxReturn 64.5862 +MetaTest/Average/MinReturn -93.8826 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 54.3347 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -18.1567 +MetaTest/__unnamed_task__/AverageReturn -18.1567 +MetaTest/__unnamed_task__/Iteration 244 +MetaTest/__unnamed_task__/MaxReturn 64.5862 +MetaTest/__unnamed_task__/MinReturn -93.8826 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 54.3347 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 407000 +------------------------------------------------- ----------- +2025-04-03 05:53:22 | [pearl_trainer] epoch #245 | Training... +2025-04-03 05:54:50 | [pearl_trainer] epoch #245 | Evaluating... +2025-04-03 05:54:50 | [pearl_trainer] epoch #245 | Sampling for adapation and meta-testing... +2025-04-03 05:56:48 | [pearl_trainer] epoch #245 | Finished meta-testing... +2025-04-03 05:56:48 | [pearl_trainer] epoch #245 | Saving snapshot... +2025-04-03 05:56:49 | [pearl_trainer] epoch #245 | Saved +2025-04-03 05:56:49 | [pearl_trainer] epoch #245 | Time 58321.93 s +2025-04-03 05:56:49 | [pearl_trainer] epoch #245 | EpochTime 240.13 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -5.14519 +MetaTest/Average/AverageReturn -5.14519 +MetaTest/Average/Iteration 245 +MetaTest/Average/MaxReturn 69.9137 +MetaTest/Average/MinReturn -64.2444 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 52.4811 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -5.14519 +MetaTest/__unnamed_task__/AverageReturn -5.14519 +MetaTest/__unnamed_task__/Iteration 245 +MetaTest/__unnamed_task__/MaxReturn 69.9137 +MetaTest/__unnamed_task__/MinReturn -64.2444 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 52.4811 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 408600 +------------------------------------------------- ------------ +2025-04-03 05:57:20 | [pearl_trainer] epoch #246 | Training... +2025-04-03 05:58:52 | [pearl_trainer] epoch #246 | Evaluating... +2025-04-03 05:58:52 | [pearl_trainer] epoch #246 | Sampling for adapation and meta-testing... +2025-04-03 06:00:45 | [pearl_trainer] epoch #246 | Finished meta-testing... +2025-04-03 06:00:45 | [pearl_trainer] epoch #246 | Saving snapshot... +2025-04-03 06:00:46 | [pearl_trainer] epoch #246 | Saved +2025-04-03 06:00:46 | [pearl_trainer] epoch #246 | Time 58558.72 s +2025-04-03 06:00:46 | [pearl_trainer] epoch #246 | EpochTime 236.79 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 57.2325 +MetaTest/Average/AverageReturn 57.2325 +MetaTest/Average/Iteration 246 +MetaTest/Average/MaxReturn 85.3626 +MetaTest/Average/MinReturn -16.226 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 37.3887 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 57.2325 +MetaTest/__unnamed_task__/AverageReturn 57.2325 +MetaTest/__unnamed_task__/Iteration 246 +MetaTest/__unnamed_task__/MaxReturn 85.3626 +MetaTest/__unnamed_task__/MinReturn -16.226 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 37.3887 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 410200 +------------------------------------------------- ----------- +2025-04-03 06:01:18 | [pearl_trainer] epoch #247 | Training... +2025-04-03 06:02:43 | [pearl_trainer] epoch #247 | Evaluating... +2025-04-03 06:02:43 | [pearl_trainer] epoch #247 | Sampling for adapation and meta-testing... +2025-04-03 06:04:43 | [pearl_trainer] epoch #247 | Finished meta-testing... +2025-04-03 06:04:43 | [pearl_trainer] epoch #247 | Saving snapshot... +2025-04-03 06:04:44 | [pearl_trainer] epoch #247 | Saved +2025-04-03 06:04:44 | [pearl_trainer] epoch #247 | Time 58796.78 s +2025-04-03 06:04:44 | [pearl_trainer] epoch #247 | EpochTime 238.06 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 34.3699 +MetaTest/Average/AverageReturn 34.3699 +MetaTest/Average/Iteration 247 +MetaTest/Average/MaxReturn 91.0622 +MetaTest/Average/MinReturn -26.3217 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 39.0357 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 34.3699 +MetaTest/__unnamed_task__/AverageReturn 34.3699 +MetaTest/__unnamed_task__/Iteration 247 +MetaTest/__unnamed_task__/MaxReturn 91.0622 +MetaTest/__unnamed_task__/MinReturn -26.3217 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 39.0357 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 411800 +------------------------------------------------- ----------- +2025-04-03 06:05:15 | [pearl_trainer] epoch #248 | Training... +2025-04-03 06:06:51 | [pearl_trainer] epoch #248 | Evaluating... +2025-04-03 06:06:51 | [pearl_trainer] epoch #248 | Sampling for adapation and meta-testing... +2025-04-03 06:08:40 | [pearl_trainer] epoch #248 | Finished meta-testing... +2025-04-03 06:08:40 | [pearl_trainer] epoch #248 | Saving snapshot... +2025-04-03 06:08:42 | [pearl_trainer] epoch #248 | Saved +2025-04-03 06:08:42 | [pearl_trainer] epoch #248 | Time 59034.63 s +2025-04-03 06:08:42 | [pearl_trainer] epoch #248 | EpochTime 237.85 s +------------------------------------------------- ------------- +MetaTest/Average/AverageDiscountedReturn 0.221671 +MetaTest/Average/AverageReturn 0.221671 +MetaTest/Average/Iteration 248 +MetaTest/Average/MaxReturn 49.9859 +MetaTest/Average/MinReturn -50.3962 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 34.8465 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 0.221671 +MetaTest/__unnamed_task__/AverageReturn 0.221671 +MetaTest/__unnamed_task__/Iteration 248 +MetaTest/__unnamed_task__/MaxReturn 49.9859 +MetaTest/__unnamed_task__/MinReturn -50.3962 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 34.8465 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 413400 +------------------------------------------------- ------------- +2025-04-03 06:09:16 | [pearl_trainer] epoch #249 | Training... +2025-04-03 06:10:50 | [pearl_trainer] epoch #249 | Evaluating... +2025-04-03 06:10:50 | [pearl_trainer] epoch #249 | Sampling for adapation and meta-testing... +2025-04-03 06:12:45 | [pearl_trainer] epoch #249 | Finished meta-testing... +2025-04-03 06:12:45 | [pearl_trainer] epoch #249 | Saving snapshot... +2025-04-03 06:12:46 | [pearl_trainer] epoch #249 | Saved +2025-04-03 06:12:46 | [pearl_trainer] epoch #249 | Time 59279.17 s +2025-04-03 06:12:46 | [pearl_trainer] epoch #249 | EpochTime 244.54 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 57.6103 +MetaTest/Average/AverageReturn 57.6103 +MetaTest/Average/Iteration 249 +MetaTest/Average/MaxReturn 69.7697 +MetaTest/Average/MinReturn 39.1116 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 11.0838 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 57.6103 +MetaTest/__unnamed_task__/AverageReturn 57.6103 +MetaTest/__unnamed_task__/Iteration 249 +MetaTest/__unnamed_task__/MaxReturn 69.7697 +MetaTest/__unnamed_task__/MinReturn 39.1116 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 11.0838 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 415000 +------------------------------------------------- ----------- +2025-04-03 06:13:19 | [pearl_trainer] epoch #250 | Training... +2025-04-03 06:14:49 | [pearl_trainer] epoch #250 | Evaluating... +2025-04-03 06:14:49 | [pearl_trainer] epoch #250 | Sampling for adapation and meta-testing... +2025-04-03 06:16:43 | [pearl_trainer] epoch #250 | Finished meta-testing... +2025-04-03 06:16:43 | [pearl_trainer] epoch #250 | Saving snapshot... +2025-04-03 06:16:45 | [pearl_trainer] epoch #250 | Saved +2025-04-03 06:16:45 | [pearl_trainer] epoch #250 | Time 59517.52 s +2025-04-03 06:16:45 | [pearl_trainer] epoch #250 | EpochTime 238.35 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 35.2677 +MetaTest/Average/AverageReturn 35.2677 +MetaTest/Average/Iteration 250 +MetaTest/Average/MaxReturn 95.6817 +MetaTest/Average/MinReturn -12.8789 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 49.1152 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 35.2677 +MetaTest/__unnamed_task__/AverageReturn 35.2677 +MetaTest/__unnamed_task__/Iteration 250 +MetaTest/__unnamed_task__/MaxReturn 95.6817 +MetaTest/__unnamed_task__/MinReturn -12.8789 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 49.1152 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 416600 +------------------------------------------------- ----------- +2025-04-03 06:17:19 | [pearl_trainer] epoch #251 | Training... +2025-04-03 06:18:48 | [pearl_trainer] epoch #251 | Evaluating... +2025-04-03 06:18:48 | [pearl_trainer] epoch #251 | Sampling for adapation and meta-testing... +2025-04-03 06:20:57 | [pearl_trainer] epoch #251 | Finished meta-testing... +2025-04-03 06:20:57 | [pearl_trainer] epoch #251 | Saving snapshot... +2025-04-03 06:20:58 | [pearl_trainer] epoch #251 | Saved +2025-04-03 06:20:58 | [pearl_trainer] epoch #251 | Time 59771.10 s +2025-04-03 06:20:58 | [pearl_trainer] epoch #251 | EpochTime 253.58 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 21.8187 +MetaTest/Average/AverageReturn 21.8187 +MetaTest/Average/Iteration 251 +MetaTest/Average/MaxReturn 54.0524 +MetaTest/Average/MinReturn -48.6712 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 36.5627 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 21.8187 +MetaTest/__unnamed_task__/AverageReturn 21.8187 +MetaTest/__unnamed_task__/Iteration 251 +MetaTest/__unnamed_task__/MaxReturn 54.0524 +MetaTest/__unnamed_task__/MinReturn -48.6712 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 36.5627 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 418200 +------------------------------------------------- ----------- +2025-04-03 06:21:31 | [pearl_trainer] epoch #252 | Training... +2025-04-03 06:23:09 | [pearl_trainer] epoch #252 | Evaluating... +2025-04-03 06:23:09 | [pearl_trainer] epoch #252 | Sampling for adapation and meta-testing... +2025-04-03 06:25:04 | [pearl_trainer] epoch #252 | Finished meta-testing... +2025-04-03 06:25:04 | [pearl_trainer] epoch #252 | Saving snapshot... +2025-04-03 06:25:05 | [pearl_trainer] epoch #252 | Saved +2025-04-03 06:25:05 | [pearl_trainer] epoch #252 | Time 60018.09 s +2025-04-03 06:25:05 | [pearl_trainer] epoch #252 | EpochTime 246.99 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 63.8831 +MetaTest/Average/AverageReturn 63.8831 +MetaTest/Average/Iteration 252 +MetaTest/Average/MaxReturn 95.8424 +MetaTest/Average/MinReturn 9.70495 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 29.5623 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 63.8831 +MetaTest/__unnamed_task__/AverageReturn 63.8831 +MetaTest/__unnamed_task__/Iteration 252 +MetaTest/__unnamed_task__/MaxReturn 95.8424 +MetaTest/__unnamed_task__/MinReturn 9.70495 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 29.5623 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 419800 +------------------------------------------------- ------------ +2025-04-03 06:25:38 | [pearl_trainer] epoch #253 | Training... +2025-04-03 06:27:04 | [pearl_trainer] epoch #253 | Evaluating... +2025-04-03 06:27:04 | [pearl_trainer] epoch #253 | Sampling for adapation and meta-testing... +2025-04-03 06:29:03 | [pearl_trainer] epoch #253 | Finished meta-testing... +2025-04-03 06:29:03 | [pearl_trainer] epoch #253 | Saving snapshot... +2025-04-03 06:29:04 | [pearl_trainer] epoch #253 | Saved +2025-04-03 06:29:04 | [pearl_trainer] epoch #253 | Time 60257.09 s +2025-04-03 06:29:04 | [pearl_trainer] epoch #253 | EpochTime 238.99 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 9.36449 +MetaTest/Average/AverageReturn 9.36449 +MetaTest/Average/Iteration 253 +MetaTest/Average/MaxReturn 81.5637 +MetaTest/Average/MinReturn -31.5394 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 41.7376 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 9.36449 +MetaTest/__unnamed_task__/AverageReturn 9.36449 +MetaTest/__unnamed_task__/Iteration 253 +MetaTest/__unnamed_task__/MaxReturn 81.5637 +MetaTest/__unnamed_task__/MinReturn -31.5394 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 41.7376 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 421400 +------------------------------------------------- ------------ +2025-04-03 06:29:35 | [pearl_trainer] epoch #254 | Training... +2025-04-03 06:31:04 | [pearl_trainer] epoch #254 | Evaluating... +2025-04-03 06:31:04 | [pearl_trainer] epoch #254 | Sampling for adapation and meta-testing... +2025-04-03 06:32:57 | [pearl_trainer] epoch #254 | Finished meta-testing... +2025-04-03 06:32:57 | [pearl_trainer] epoch #254 | Saving snapshot... +2025-04-03 06:32:59 | [pearl_trainer] epoch #254 | Saved +2025-04-03 06:32:59 | [pearl_trainer] epoch #254 | Time 60491.52 s +2025-04-03 06:32:59 | [pearl_trainer] epoch #254 | EpochTime 234.43 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 1.58384 +MetaTest/Average/AverageReturn 1.58384 +MetaTest/Average/Iteration 254 +MetaTest/Average/MaxReturn 38.9174 +MetaTest/Average/MinReturn -26.2705 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 26.0531 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 1.58384 +MetaTest/__unnamed_task__/AverageReturn 1.58384 +MetaTest/__unnamed_task__/Iteration 254 +MetaTest/__unnamed_task__/MaxReturn 38.9174 +MetaTest/__unnamed_task__/MinReturn -26.2705 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 26.0531 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 423000 +------------------------------------------------- ------------ +2025-04-03 06:33:31 | [pearl_trainer] epoch #255 | Training... +2025-04-03 06:34:59 | [pearl_trainer] epoch #255 | Evaluating... +2025-04-03 06:34:59 | [pearl_trainer] epoch #255 | Sampling for adapation and meta-testing... +2025-04-03 06:36:56 | [pearl_trainer] epoch #255 | Finished meta-testing... +2025-04-03 06:36:56 | [pearl_trainer] epoch #255 | Saving snapshot... +2025-04-03 06:36:58 | [pearl_trainer] epoch #255 | Saved +2025-04-03 06:36:58 | [pearl_trainer] epoch #255 | Time 60730.68 s +2025-04-03 06:36:58 | [pearl_trainer] epoch #255 | EpochTime 239.16 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -1.60648 +MetaTest/Average/AverageReturn -1.60648 +MetaTest/Average/Iteration 255 +MetaTest/Average/MaxReturn 124.981 +MetaTest/Average/MinReturn -66.8516 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 66.4041 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -1.60648 +MetaTest/__unnamed_task__/AverageReturn -1.60648 +MetaTest/__unnamed_task__/Iteration 255 +MetaTest/__unnamed_task__/MaxReturn 124.981 +MetaTest/__unnamed_task__/MinReturn -66.8516 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 66.4041 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 424600 +------------------------------------------------- ------------ +2025-04-03 06:37:29 | [pearl_trainer] epoch #256 | Training... +2025-04-03 06:39:04 | [pearl_trainer] epoch #256 | Evaluating... +2025-04-03 06:39:04 | [pearl_trainer] epoch #256 | Sampling for adapation and meta-testing... +2025-04-03 06:40:51 | [pearl_trainer] epoch #256 | Finished meta-testing... +2025-04-03 06:40:51 | [pearl_trainer] epoch #256 | Saving snapshot... +2025-04-03 06:40:52 | [pearl_trainer] epoch #256 | Saved +2025-04-03 06:40:52 | [pearl_trainer] epoch #256 | Time 60965.11 s +2025-04-03 06:40:52 | [pearl_trainer] epoch #256 | EpochTime 234.43 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 42.3498 +MetaTest/Average/AverageReturn 42.3498 +MetaTest/Average/Iteration 256 +MetaTest/Average/MaxReturn 89.0112 +MetaTest/Average/MinReturn -32.9874 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 48.735 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 42.3498 +MetaTest/__unnamed_task__/AverageReturn 42.3498 +MetaTest/__unnamed_task__/Iteration 256 +MetaTest/__unnamed_task__/MaxReturn 89.0112 +MetaTest/__unnamed_task__/MinReturn -32.9874 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 48.735 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 426200 +------------------------------------------------- ----------- +2025-04-03 06:41:24 | [pearl_trainer] epoch #257 | Training... +2025-04-03 06:43:06 | [pearl_trainer] epoch #257 | Evaluating... +2025-04-03 06:43:06 | [pearl_trainer] epoch #257 | Sampling for adapation and meta-testing... +2025-04-03 06:44:59 | [pearl_trainer] epoch #257 | Finished meta-testing... +2025-04-03 06:44:59 | [pearl_trainer] epoch #257 | Saving snapshot... +2025-04-03 06:45:01 | [pearl_trainer] epoch #257 | Saved +2025-04-03 06:45:01 | [pearl_trainer] epoch #257 | Time 61213.53 s +2025-04-03 06:45:01 | [pearl_trainer] epoch #257 | EpochTime 248.42 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 23.3158 +MetaTest/Average/AverageReturn 23.3158 +MetaTest/Average/Iteration 257 +MetaTest/Average/MaxReturn 146.046 +MetaTest/Average/MinReturn -16.1072 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 62.3063 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 23.3158 +MetaTest/__unnamed_task__/AverageReturn 23.3158 +MetaTest/__unnamed_task__/Iteration 257 +MetaTest/__unnamed_task__/MaxReturn 146.046 +MetaTest/__unnamed_task__/MinReturn -16.1072 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 62.3063 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 427800 +------------------------------------------------- ----------- +2025-04-03 06:45:32 | [pearl_trainer] epoch #258 | Training... +2025-04-03 06:47:03 | [pearl_trainer] epoch #258 | Evaluating... +2025-04-03 06:47:03 | [pearl_trainer] epoch #258 | Sampling for adapation and meta-testing... +2025-04-03 06:48:54 | [pearl_trainer] epoch #258 | Finished meta-testing... +2025-04-03 06:48:54 | [pearl_trainer] epoch #258 | Saving snapshot... +2025-04-03 06:48:56 | [pearl_trainer] epoch #258 | Saved +2025-04-03 06:48:56 | [pearl_trainer] epoch #258 | Time 61448.75 s +2025-04-03 06:48:56 | [pearl_trainer] epoch #258 | EpochTime 235.21 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 29.4462 +MetaTest/Average/AverageReturn 29.4462 +MetaTest/Average/Iteration 258 +MetaTest/Average/MaxReturn 102.442 +MetaTest/Average/MinReturn -38.643 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 53.4382 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 29.4462 +MetaTest/__unnamed_task__/AverageReturn 29.4462 +MetaTest/__unnamed_task__/Iteration 258 +MetaTest/__unnamed_task__/MaxReturn 102.442 +MetaTest/__unnamed_task__/MinReturn -38.643 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 53.4382 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 429400 +------------------------------------------------- ----------- +2025-04-03 06:49:28 | [pearl_trainer] epoch #259 | Training... +2025-04-03 06:51:02 | [pearl_trainer] epoch #259 | Evaluating... +2025-04-03 06:51:02 | [pearl_trainer] epoch #259 | Sampling for adapation and meta-testing... +2025-04-03 06:52:56 | [pearl_trainer] epoch #259 | Finished meta-testing... +2025-04-03 06:52:56 | [pearl_trainer] epoch #259 | Saving snapshot... +2025-04-03 06:52:57 | [pearl_trainer] epoch #259 | Saved +2025-04-03 06:52:57 | [pearl_trainer] epoch #259 | Time 61689.80 s +2025-04-03 06:52:57 | [pearl_trainer] epoch #259 | EpochTime 241.05 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 10.3443 +MetaTest/Average/AverageReturn 10.3443 +MetaTest/Average/Iteration 259 +MetaTest/Average/MaxReturn 91.0498 +MetaTest/Average/MinReturn -43.4647 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 46.233 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 10.3443 +MetaTest/__unnamed_task__/AverageReturn 10.3443 +MetaTest/__unnamed_task__/Iteration 259 +MetaTest/__unnamed_task__/MaxReturn 91.0498 +MetaTest/__unnamed_task__/MinReturn -43.4647 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 46.233 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 431000 +------------------------------------------------- ----------- +2025-04-03 06:53:30 | [pearl_trainer] epoch #260 | Training... +2025-04-03 06:55:07 | [pearl_trainer] epoch #260 | Evaluating... +2025-04-03 06:55:07 | [pearl_trainer] epoch #260 | Sampling for adapation and meta-testing... +2025-04-03 06:57:04 | [pearl_trainer] epoch #260 | Finished meta-testing... +2025-04-03 06:57:04 | [pearl_trainer] epoch #260 | Saving snapshot... +2025-04-03 06:57:05 | [pearl_trainer] epoch #260 | Saved +2025-04-03 06:57:05 | [pearl_trainer] epoch #260 | Time 61937.98 s +2025-04-03 06:57:05 | [pearl_trainer] epoch #260 | EpochTime 248.18 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -9.025 +MetaTest/Average/AverageReturn -9.025 +MetaTest/Average/Iteration 260 +MetaTest/Average/MaxReturn 32.623 +MetaTest/Average/MinReturn -39.089 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 25.3793 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -9.025 +MetaTest/__unnamed_task__/AverageReturn -9.025 +MetaTest/__unnamed_task__/Iteration 260 +MetaTest/__unnamed_task__/MaxReturn 32.623 +MetaTest/__unnamed_task__/MinReturn -39.089 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 25.3793 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 432600 +------------------------------------------------- ----------- +2025-04-03 06:57:37 | [pearl_trainer] epoch #261 | Training... +2025-04-03 06:59:13 | [pearl_trainer] epoch #261 | Evaluating... +2025-04-03 06:59:13 | [pearl_trainer] epoch #261 | Sampling for adapation and meta-testing... +2025-04-03 07:01:05 | [pearl_trainer] epoch #261 | Finished meta-testing... +2025-04-03 07:01:05 | [pearl_trainer] epoch #261 | Saving snapshot... +2025-04-03 07:01:06 | [pearl_trainer] epoch #261 | Saved +2025-04-03 07:01:06 | [pearl_trainer] epoch #261 | Time 62179.12 s +2025-04-03 07:01:06 | [pearl_trainer] epoch #261 | EpochTime 241.13 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 5.71422 +MetaTest/Average/AverageReturn 5.71422 +MetaTest/Average/Iteration 261 +MetaTest/Average/MaxReturn 54.2541 +MetaTest/Average/MinReturn -21.9605 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 30.5316 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 5.71422 +MetaTest/__unnamed_task__/AverageReturn 5.71422 +MetaTest/__unnamed_task__/Iteration 261 +MetaTest/__unnamed_task__/MaxReturn 54.2541 +MetaTest/__unnamed_task__/MinReturn -21.9605 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 30.5316 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 434200 +------------------------------------------------- ------------ +2025-04-03 07:01:39 | [pearl_trainer] epoch #262 | Training... +2025-04-03 07:03:07 | [pearl_trainer] epoch #262 | Evaluating... +2025-04-03 07:03:07 | [pearl_trainer] epoch #262 | Sampling for adapation and meta-testing... +2025-04-03 07:05:02 | [pearl_trainer] epoch #262 | Finished meta-testing... +2025-04-03 07:05:02 | [pearl_trainer] epoch #262 | Saving snapshot... +2025-04-03 07:05:03 | [pearl_trainer] epoch #262 | Saved +2025-04-03 07:05:03 | [pearl_trainer] epoch #262 | Time 62415.71 s +2025-04-03 07:05:03 | [pearl_trainer] epoch #262 | EpochTime 236.59 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -12.3044 +MetaTest/Average/AverageReturn -12.3044 +MetaTest/Average/Iteration 262 +MetaTest/Average/MaxReturn 44.7118 +MetaTest/Average/MinReturn -53.1215 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 32.3035 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -12.3044 +MetaTest/__unnamed_task__/AverageReturn -12.3044 +MetaTest/__unnamed_task__/Iteration 262 +MetaTest/__unnamed_task__/MaxReturn 44.7118 +MetaTest/__unnamed_task__/MinReturn -53.1215 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 32.3035 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 435800 +------------------------------------------------- ----------- +2025-04-03 07:05:34 | [pearl_trainer] epoch #263 | Training... +2025-04-03 07:07:14 | [pearl_trainer] epoch #263 | Evaluating... +2025-04-03 07:07:14 | [pearl_trainer] epoch #263 | Sampling for adapation and meta-testing... +2025-04-03 07:09:05 | [pearl_trainer] epoch #263 | Finished meta-testing... +2025-04-03 07:09:05 | [pearl_trainer] epoch #263 | Saving snapshot... +2025-04-03 07:09:07 | [pearl_trainer] epoch #263 | Saved +2025-04-03 07:09:07 | [pearl_trainer] epoch #263 | Time 62659.86 s +2025-04-03 07:09:07 | [pearl_trainer] epoch #263 | EpochTime 244.15 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 34.7496 +MetaTest/Average/AverageReturn 34.7496 +MetaTest/Average/Iteration 263 +MetaTest/Average/MaxReturn 98.5401 +MetaTest/Average/MinReturn -13.6567 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 50.6931 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 34.7496 +MetaTest/__unnamed_task__/AverageReturn 34.7496 +MetaTest/__unnamed_task__/Iteration 263 +MetaTest/__unnamed_task__/MaxReturn 98.5401 +MetaTest/__unnamed_task__/MinReturn -13.6567 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 50.6931 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 437400 +------------------------------------------------- ----------- +2025-04-03 07:09:40 | [pearl_trainer] epoch #264 | Training... +2025-04-03 07:11:08 | [pearl_trainer] epoch #264 | Evaluating... +2025-04-03 07:11:08 | [pearl_trainer] epoch #264 | Sampling for adapation and meta-testing... +2025-04-03 07:13:01 | [pearl_trainer] epoch #264 | Finished meta-testing... +2025-04-03 07:13:01 | [pearl_trainer] epoch #264 | Saving snapshot... +2025-04-03 07:13:02 | [pearl_trainer] epoch #264 | Saved +2025-04-03 07:13:02 | [pearl_trainer] epoch #264 | Time 62895.36 s +2025-04-03 07:13:02 | [pearl_trainer] epoch #264 | EpochTime 235.49 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 34.9053 +MetaTest/Average/AverageReturn 34.9053 +MetaTest/Average/Iteration 264 +MetaTest/Average/MaxReturn 90.9074 +MetaTest/Average/MinReturn -22.9481 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 43.6088 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 34.9053 +MetaTest/__unnamed_task__/AverageReturn 34.9053 +MetaTest/__unnamed_task__/Iteration 264 +MetaTest/__unnamed_task__/MaxReturn 90.9074 +MetaTest/__unnamed_task__/MinReturn -22.9481 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 43.6088 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 439000 +------------------------------------------------- ----------- +2025-04-03 07:13:35 | [pearl_trainer] epoch #265 | Training... +2025-04-03 07:15:05 | [pearl_trainer] epoch #265 | Evaluating... +2025-04-03 07:15:05 | [pearl_trainer] epoch #265 | Sampling for adapation and meta-testing... +2025-04-03 07:17:02 | [pearl_trainer] epoch #265 | Finished meta-testing... +2025-04-03 07:17:02 | [pearl_trainer] epoch #265 | Saving snapshot... +2025-04-03 07:17:03 | [pearl_trainer] epoch #265 | Saved +2025-04-03 07:17:03 | [pearl_trainer] epoch #265 | Time 63135.92 s +2025-04-03 07:17:03 | [pearl_trainer] epoch #265 | EpochTime 240.56 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 39.7241 +MetaTest/Average/AverageReturn 39.7241 +MetaTest/Average/Iteration 265 +MetaTest/Average/MaxReturn 103.961 +MetaTest/Average/MinReturn -59.0503 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 55.0991 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 39.7241 +MetaTest/__unnamed_task__/AverageReturn 39.7241 +MetaTest/__unnamed_task__/Iteration 265 +MetaTest/__unnamed_task__/MaxReturn 103.961 +MetaTest/__unnamed_task__/MinReturn -59.0503 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 55.0991 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 440600 +------------------------------------------------- ----------- +2025-04-03 07:17:35 | [pearl_trainer] epoch #266 | Training... +2025-04-03 07:19:03 | [pearl_trainer] epoch #266 | Evaluating... +2025-04-03 07:19:03 | [pearl_trainer] epoch #266 | Sampling for adapation and meta-testing... +2025-04-03 07:21:00 | [pearl_trainer] epoch #266 | Finished meta-testing... +2025-04-03 07:21:00 | [pearl_trainer] epoch #266 | Saving snapshot... +2025-04-03 07:21:01 | [pearl_trainer] epoch #266 | Saved +2025-04-03 07:21:01 | [pearl_trainer] epoch #266 | Time 63373.97 s +2025-04-03 07:21:01 | [pearl_trainer] epoch #266 | EpochTime 238.04 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 57.3277 +MetaTest/Average/AverageReturn 57.3277 +MetaTest/Average/Iteration 266 +MetaTest/Average/MaxReturn 116.767 +MetaTest/Average/MinReturn 13.6464 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 44.0235 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 57.3277 +MetaTest/__unnamed_task__/AverageReturn 57.3277 +MetaTest/__unnamed_task__/Iteration 266 +MetaTest/__unnamed_task__/MaxReturn 116.767 +MetaTest/__unnamed_task__/MinReturn 13.6464 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 44.0235 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 442200 +------------------------------------------------- ----------- +2025-04-03 07:21:33 | [pearl_trainer] epoch #267 | Training... +2025-04-03 07:23:11 | [pearl_trainer] epoch #267 | Evaluating... +2025-04-03 07:23:11 | [pearl_trainer] epoch #267 | Sampling for adapation and meta-testing... +2025-04-03 07:25:01 | [pearl_trainer] epoch #267 | Finished meta-testing... +2025-04-03 07:25:01 | [pearl_trainer] epoch #267 | Saving snapshot... +2025-04-03 07:25:02 | [pearl_trainer] epoch #267 | Saved +2025-04-03 07:25:02 | [pearl_trainer] epoch #267 | Time 63615.23 s +2025-04-03 07:25:02 | [pearl_trainer] epoch #267 | EpochTime 241.26 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -5.98091 +MetaTest/Average/AverageReturn -5.98091 +MetaTest/Average/Iteration 267 +MetaTest/Average/MaxReturn 56.5427 +MetaTest/Average/MinReturn -40.5955 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 33.417 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -5.98091 +MetaTest/__unnamed_task__/AverageReturn -5.98091 +MetaTest/__unnamed_task__/Iteration 267 +MetaTest/__unnamed_task__/MaxReturn 56.5427 +MetaTest/__unnamed_task__/MinReturn -40.5955 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 33.417 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 443800 +------------------------------------------------- ------------ +2025-04-03 07:25:37 | [pearl_trainer] epoch #268 | Training... +2025-04-03 07:27:10 | [pearl_trainer] epoch #268 | Evaluating... +2025-04-03 07:27:10 | [pearl_trainer] epoch #268 | Sampling for adapation and meta-testing... +2025-04-03 07:29:12 | [pearl_trainer] epoch #268 | Finished meta-testing... +2025-04-03 07:29:12 | [pearl_trainer] epoch #268 | Saving snapshot... +2025-04-03 07:29:13 | [pearl_trainer] epoch #268 | Saved +2025-04-03 07:29:13 | [pearl_trainer] epoch #268 | Time 63866.23 s +2025-04-03 07:29:13 | [pearl_trainer] epoch #268 | EpochTime 251.00 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -42.6824 +MetaTest/Average/AverageReturn -42.6824 +MetaTest/Average/Iteration 268 +MetaTest/Average/MaxReturn -23.2534 +MetaTest/Average/MinReturn -78.5621 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 19.7617 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -42.6824 +MetaTest/__unnamed_task__/AverageReturn -42.6824 +MetaTest/__unnamed_task__/Iteration 268 +MetaTest/__unnamed_task__/MaxReturn -23.2534 +MetaTest/__unnamed_task__/MinReturn -78.5621 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 19.7617 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 445400 +------------------------------------------------- ----------- +2025-04-03 07:29:44 | [pearl_trainer] epoch #269 | Training... +2025-04-03 07:31:23 | [pearl_trainer] epoch #269 | Evaluating... +2025-04-03 07:31:23 | [pearl_trainer] epoch #269 | Sampling for adapation and meta-testing... +2025-04-03 07:33:12 | [pearl_trainer] epoch #269 | Finished meta-testing... +2025-04-03 07:33:12 | [pearl_trainer] epoch #269 | Saving snapshot... +2025-04-03 07:33:14 | [pearl_trainer] epoch #269 | Saved +2025-04-03 07:33:14 | [pearl_trainer] epoch #269 | Time 64106.53 s +2025-04-03 07:33:14 | [pearl_trainer] epoch #269 | EpochTime 240.29 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 2.83064 +MetaTest/Average/AverageReturn 2.83064 +MetaTest/Average/Iteration 269 +MetaTest/Average/MaxReturn 147.947 +MetaTest/Average/MinReturn -58.6668 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 73.7181 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 2.83064 +MetaTest/__unnamed_task__/AverageReturn 2.83064 +MetaTest/__unnamed_task__/Iteration 269 +MetaTest/__unnamed_task__/MaxReturn 147.947 +MetaTest/__unnamed_task__/MinReturn -58.6668 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 73.7181 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 447000 +------------------------------------------------- ------------ +2025-04-03 07:33:46 | [pearl_trainer] epoch #270 | Training... +2025-04-03 07:35:16 | [pearl_trainer] epoch #270 | Evaluating... +2025-04-03 07:35:16 | [pearl_trainer] epoch #270 | Sampling for adapation and meta-testing... +2025-04-03 07:37:08 | [pearl_trainer] epoch #270 | Finished meta-testing... +2025-04-03 07:37:08 | [pearl_trainer] epoch #270 | Saving snapshot... +2025-04-03 07:37:10 | [pearl_trainer] epoch #270 | Saved +2025-04-03 07:37:10 | [pearl_trainer] epoch #270 | Time 64342.75 s +2025-04-03 07:37:10 | [pearl_trainer] epoch #270 | EpochTime 236.22 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 32.3423 +MetaTest/Average/AverageReturn 32.3423 +MetaTest/Average/Iteration 270 +MetaTest/Average/MaxReturn 119.132 +MetaTest/Average/MinReturn -13.8664 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 49.6926 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 32.3423 +MetaTest/__unnamed_task__/AverageReturn 32.3423 +MetaTest/__unnamed_task__/Iteration 270 +MetaTest/__unnamed_task__/MaxReturn 119.132 +MetaTest/__unnamed_task__/MinReturn -13.8664 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 49.6926 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 448600 +------------------------------------------------- ----------- +2025-04-03 07:37:41 | [pearl_trainer] epoch #271 | Training... +2025-04-03 07:39:19 | [pearl_trainer] epoch #271 | Evaluating... +2025-04-03 07:39:19 | [pearl_trainer] epoch #271 | Sampling for adapation and meta-testing... +2025-04-03 07:41:12 | [pearl_trainer] epoch #271 | Finished meta-testing... +2025-04-03 07:41:12 | [pearl_trainer] epoch #271 | Saving snapshot... +2025-04-03 07:41:13 | [pearl_trainer] epoch #271 | Saved +2025-04-03 07:41:13 | [pearl_trainer] epoch #271 | Time 64585.90 s +2025-04-03 07:41:13 | [pearl_trainer] epoch #271 | EpochTime 243.14 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 24.3393 +MetaTest/Average/AverageReturn 24.3393 +MetaTest/Average/Iteration 271 +MetaTest/Average/MaxReturn 130.729 +MetaTest/Average/MinReturn -37.6724 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 67.9481 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 24.3393 +MetaTest/__unnamed_task__/AverageReturn 24.3393 +MetaTest/__unnamed_task__/Iteration 271 +MetaTest/__unnamed_task__/MaxReturn 130.729 +MetaTest/__unnamed_task__/MinReturn -37.6724 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 67.9481 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 450200 +------------------------------------------------- ----------- +2025-04-03 07:41:47 | [pearl_trainer] epoch #272 | Training... +2025-04-03 07:43:11 | [pearl_trainer] epoch #272 | Evaluating... +2025-04-03 07:43:11 | [pearl_trainer] epoch #272 | Sampling for adapation and meta-testing... +2025-04-03 07:45:07 | [pearl_trainer] epoch #272 | Finished meta-testing... +2025-04-03 07:45:07 | [pearl_trainer] epoch #272 | Saving snapshot... +2025-04-03 07:45:08 | [pearl_trainer] epoch #272 | Saved +2025-04-03 07:45:08 | [pearl_trainer] epoch #272 | Time 64820.92 s +2025-04-03 07:45:08 | [pearl_trainer] epoch #272 | EpochTime 235.03 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -15.4794 +MetaTest/Average/AverageReturn -15.4794 +MetaTest/Average/Iteration 272 +MetaTest/Average/MaxReturn 84.931 +MetaTest/Average/MinReturn -49.6002 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 50.4668 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -15.4794 +MetaTest/__unnamed_task__/AverageReturn -15.4794 +MetaTest/__unnamed_task__/Iteration 272 +MetaTest/__unnamed_task__/MaxReturn 84.931 +MetaTest/__unnamed_task__/MinReturn -49.6002 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 50.4668 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 451800 +------------------------------------------------- ----------- +2025-04-03 07:45:39 | [pearl_trainer] epoch #273 | Training... +2025-04-03 07:47:06 | [pearl_trainer] epoch #273 | Evaluating... +2025-04-03 07:47:06 | [pearl_trainer] epoch #273 | Sampling for adapation and meta-testing... +2025-04-03 07:49:00 | [pearl_trainer] epoch #273 | Finished meta-testing... +2025-04-03 07:49:00 | [pearl_trainer] epoch #273 | Saving snapshot... +2025-04-03 07:49:02 | [pearl_trainer] epoch #273 | Saved +2025-04-03 07:49:02 | [pearl_trainer] epoch #273 | Time 65054.66 s +2025-04-03 07:49:02 | [pearl_trainer] epoch #273 | EpochTime 233.73 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -27.8623 +MetaTest/Average/AverageReturn -27.8623 +MetaTest/Average/Iteration 273 +MetaTest/Average/MaxReturn 52.2663 +MetaTest/Average/MinReturn -57.581 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 41.4889 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -27.8623 +MetaTest/__unnamed_task__/AverageReturn -27.8623 +MetaTest/__unnamed_task__/Iteration 273 +MetaTest/__unnamed_task__/MaxReturn 52.2663 +MetaTest/__unnamed_task__/MinReturn -57.581 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 41.4889 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 453400 +------------------------------------------------- ----------- +2025-04-03 07:49:35 | [pearl_trainer] epoch #274 | Training... +2025-04-03 07:51:00 | [pearl_trainer] epoch #274 | Evaluating... +2025-04-03 07:51:00 | [pearl_trainer] epoch #274 | Sampling for adapation and meta-testing... +2025-04-03 07:52:59 | [pearl_trainer] epoch #274 | Finished meta-testing... +2025-04-03 07:52:59 | [pearl_trainer] epoch #274 | Saving snapshot... +2025-04-03 07:53:00 | [pearl_trainer] epoch #274 | Saved +2025-04-03 07:53:00 | [pearl_trainer] epoch #274 | Time 65293.10 s +2025-04-03 07:53:00 | [pearl_trainer] epoch #274 | EpochTime 238.44 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -10.8462 +MetaTest/Average/AverageReturn -10.8462 +MetaTest/Average/Iteration 274 +MetaTest/Average/MaxReturn 75.2447 +MetaTest/Average/MinReturn -55.6335 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 45.0279 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -10.8462 +MetaTest/__unnamed_task__/AverageReturn -10.8462 +MetaTest/__unnamed_task__/Iteration 274 +MetaTest/__unnamed_task__/MaxReturn 75.2447 +MetaTest/__unnamed_task__/MinReturn -55.6335 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 45.0279 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 455000 +------------------------------------------------- ----------- +2025-04-03 07:53:31 | [pearl_trainer] epoch #275 | Training... +2025-04-03 07:55:03 | [pearl_trainer] epoch #275 | Evaluating... +2025-04-03 07:55:03 | [pearl_trainer] epoch #275 | Sampling for adapation and meta-testing... +2025-04-03 07:56:58 | [pearl_trainer] epoch #275 | Finished meta-testing... +2025-04-03 07:56:58 | [pearl_trainer] epoch #275 | Saving snapshot... +2025-04-03 07:56:59 | [pearl_trainer] epoch #275 | Saved +2025-04-03 07:56:59 | [pearl_trainer] epoch #275 | Time 65532.43 s +2025-04-03 07:56:59 | [pearl_trainer] epoch #275 | EpochTime 239.32 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -17.6274 +MetaTest/Average/AverageReturn -17.6274 +MetaTest/Average/Iteration 275 +MetaTest/Average/MaxReturn 48.089 +MetaTest/Average/MinReturn -63.1224 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 43.3417 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -17.6274 +MetaTest/__unnamed_task__/AverageReturn -17.6274 +MetaTest/__unnamed_task__/Iteration 275 +MetaTest/__unnamed_task__/MaxReturn 48.089 +MetaTest/__unnamed_task__/MinReturn -63.1224 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 43.3417 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 456600 +------------------------------------------------- ----------- +2025-04-03 07:57:34 | [pearl_trainer] epoch #276 | Training... +2025-04-03 07:59:16 | [pearl_trainer] epoch #276 | Evaluating... +2025-04-03 07:59:16 | [pearl_trainer] epoch #276 | Sampling for adapation and meta-testing... +2025-04-03 08:01:13 | [pearl_trainer] epoch #276 | Finished meta-testing... +2025-04-03 08:01:13 | [pearl_trainer] epoch #276 | Saving snapshot... +2025-04-03 08:01:14 | [pearl_trainer] epoch #276 | Saved +2025-04-03 08:01:14 | [pearl_trainer] epoch #276 | Time 65787.49 s +2025-04-03 08:01:14 | [pearl_trainer] epoch #276 | EpochTime 255.07 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -8.52534 +MetaTest/Average/AverageReturn -8.52534 +MetaTest/Average/Iteration 276 +MetaTest/Average/MaxReturn 35.1097 +MetaTest/Average/MinReturn -45.7464 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 25.814 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -8.52534 +MetaTest/__unnamed_task__/AverageReturn -8.52534 +MetaTest/__unnamed_task__/Iteration 276 +MetaTest/__unnamed_task__/MaxReturn 35.1097 +MetaTest/__unnamed_task__/MinReturn -45.7464 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 25.814 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 458200 +------------------------------------------------- ------------ +2025-04-03 08:01:47 | [pearl_trainer] epoch #277 | Training... +2025-04-03 08:03:15 | [pearl_trainer] epoch #277 | Evaluating... +2025-04-03 08:03:15 | [pearl_trainer] epoch #277 | Sampling for adapation and meta-testing... +2025-04-03 08:05:11 | [pearl_trainer] epoch #277 | Finished meta-testing... +2025-04-03 08:05:11 | [pearl_trainer] epoch #277 | Saving snapshot... +2025-04-03 08:05:12 | [pearl_trainer] epoch #277 | Saved +2025-04-03 08:05:12 | [pearl_trainer] epoch #277 | Time 66024.84 s +2025-04-03 08:05:12 | [pearl_trainer] epoch #277 | EpochTime 237.34 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 43.7494 +MetaTest/Average/AverageReturn 43.7494 +MetaTest/Average/Iteration 277 +MetaTest/Average/MaxReturn 100.418 +MetaTest/Average/MinReturn -34.6797 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 50.7029 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 43.7494 +MetaTest/__unnamed_task__/AverageReturn 43.7494 +MetaTest/__unnamed_task__/Iteration 277 +MetaTest/__unnamed_task__/MaxReturn 100.418 +MetaTest/__unnamed_task__/MinReturn -34.6797 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 50.7029 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 459800 +------------------------------------------------- ----------- +2025-04-03 08:05:44 | [pearl_trainer] epoch #278 | Training... +2025-04-03 08:07:14 | [pearl_trainer] epoch #278 | Evaluating... +2025-04-03 08:07:14 | [pearl_trainer] epoch #278 | Sampling for adapation and meta-testing... +2025-04-03 08:09:11 | [pearl_trainer] epoch #278 | Finished meta-testing... +2025-04-03 08:09:11 | [pearl_trainer] epoch #278 | Saving snapshot... +2025-04-03 08:09:12 | [pearl_trainer] epoch #278 | Saved +2025-04-03 08:09:12 | [pearl_trainer] epoch #278 | Time 66265.16 s +2025-04-03 08:09:12 | [pearl_trainer] epoch #278 | EpochTime 240.32 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -5.29411 +MetaTest/Average/AverageReturn -5.29411 +MetaTest/Average/Iteration 278 +MetaTest/Average/MaxReturn 132.009 +MetaTest/Average/MinReturn -55.3308 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 70.0377 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -5.29411 +MetaTest/__unnamed_task__/AverageReturn -5.29411 +MetaTest/__unnamed_task__/Iteration 278 +MetaTest/__unnamed_task__/MaxReturn 132.009 +MetaTest/__unnamed_task__/MinReturn -55.3308 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 70.0377 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 461400 +------------------------------------------------- ------------ +2025-04-03 08:09:43 | [pearl_trainer] epoch #279 | Training... +2025-04-03 08:11:15 | [pearl_trainer] epoch #279 | Evaluating... +2025-04-03 08:11:15 | [pearl_trainer] epoch #279 | Sampling for adapation and meta-testing... +2025-04-03 08:13:09 | [pearl_trainer] epoch #279 | Finished meta-testing... +2025-04-03 08:13:09 | [pearl_trainer] epoch #279 | Saving snapshot... +2025-04-03 08:13:10 | [pearl_trainer] epoch #279 | Saved +2025-04-03 08:13:10 | [pearl_trainer] epoch #279 | Time 66503.33 s +2025-04-03 08:13:10 | [pearl_trainer] epoch #279 | EpochTime 238.17 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -8.55292 +MetaTest/Average/AverageReturn -8.55292 +MetaTest/Average/Iteration 279 +MetaTest/Average/MaxReturn 81.4838 +MetaTest/Average/MinReturn -80.3455 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 72.2383 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -8.55292 +MetaTest/__unnamed_task__/AverageReturn -8.55292 +MetaTest/__unnamed_task__/Iteration 279 +MetaTest/__unnamed_task__/MaxReturn 81.4838 +MetaTest/__unnamed_task__/MinReturn -80.3455 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 72.2383 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 463000 +------------------------------------------------- ------------ +2025-04-03 08:13:43 | [pearl_trainer] epoch #280 | Training... +2025-04-03 08:15:23 | [pearl_trainer] epoch #280 | Evaluating... +2025-04-03 08:15:23 | [pearl_trainer] epoch #280 | Sampling for adapation and meta-testing... +2025-04-03 08:17:21 | [pearl_trainer] epoch #280 | Finished meta-testing... +2025-04-03 08:17:21 | [pearl_trainer] epoch #280 | Saving snapshot... +2025-04-03 08:17:23 | [pearl_trainer] epoch #280 | Saved +2025-04-03 08:17:23 | [pearl_trainer] epoch #280 | Time 66755.63 s +2025-04-03 08:17:23 | [pearl_trainer] epoch #280 | EpochTime 252.30 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -14.0805 +MetaTest/Average/AverageReturn -14.0805 +MetaTest/Average/Iteration 280 +MetaTest/Average/MaxReturn 69.3329 +MetaTest/Average/MinReturn -81.0709 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 56.8348 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -14.0805 +MetaTest/__unnamed_task__/AverageReturn -14.0805 +MetaTest/__unnamed_task__/Iteration 280 +MetaTest/__unnamed_task__/MaxReturn 69.3329 +MetaTest/__unnamed_task__/MinReturn -81.0709 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 56.8348 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 464600 +------------------------------------------------- ----------- +2025-04-03 08:17:54 | [pearl_trainer] epoch #281 | Training... +2025-04-03 08:19:24 | [pearl_trainer] epoch #281 | Evaluating... +2025-04-03 08:19:24 | [pearl_trainer] epoch #281 | Sampling for adapation and meta-testing... +2025-04-03 08:21:19 | [pearl_trainer] epoch #281 | Finished meta-testing... +2025-04-03 08:21:19 | [pearl_trainer] epoch #281 | Saving snapshot... +2025-04-03 08:21:21 | [pearl_trainer] epoch #281 | Saved +2025-04-03 08:21:21 | [pearl_trainer] epoch #281 | Time 66993.52 s +2025-04-03 08:21:21 | [pearl_trainer] epoch #281 | EpochTime 237.89 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -46.3504 +MetaTest/Average/AverageReturn -46.3504 +MetaTest/Average/Iteration 281 +MetaTest/Average/MaxReturn -14.866 +MetaTest/Average/MinReturn -79.3975 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 28.1188 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -46.3504 +MetaTest/__unnamed_task__/AverageReturn -46.3504 +MetaTest/__unnamed_task__/Iteration 281 +MetaTest/__unnamed_task__/MaxReturn -14.866 +MetaTest/__unnamed_task__/MinReturn -79.3975 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 28.1188 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 466200 +------------------------------------------------- ----------- +2025-04-03 08:21:53 | [pearl_trainer] epoch #282 | Training... +2025-04-03 08:23:24 | [pearl_trainer] epoch #282 | Evaluating... +2025-04-03 08:23:24 | [pearl_trainer] epoch #282 | Sampling for adapation and meta-testing... +2025-04-03 08:25:16 | [pearl_trainer] epoch #282 | Finished meta-testing... +2025-04-03 08:25:16 | [pearl_trainer] epoch #282 | Saving snapshot... +2025-04-03 08:25:17 | [pearl_trainer] epoch #282 | Saved +2025-04-03 08:25:17 | [pearl_trainer] epoch #282 | Time 67229.93 s +2025-04-03 08:25:17 | [pearl_trainer] epoch #282 | EpochTime 236.41 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 1.914 +MetaTest/Average/AverageReturn 1.914 +MetaTest/Average/Iteration 282 +MetaTest/Average/MaxReturn 53.9741 +MetaTest/Average/MinReturn -48.8924 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 35.6001 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 1.914 +MetaTest/__unnamed_task__/AverageReturn 1.914 +MetaTest/__unnamed_task__/Iteration 282 +MetaTest/__unnamed_task__/MaxReturn 53.9741 +MetaTest/__unnamed_task__/MinReturn -48.8924 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 35.6001 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 467800 +------------------------------------------------- ----------- +2025-04-03 08:25:48 | [pearl_trainer] epoch #283 | Training... +2025-04-03 08:27:22 | [pearl_trainer] epoch #283 | Evaluating... +2025-04-03 08:27:22 | [pearl_trainer] epoch #283 | Sampling for adapation and meta-testing... +2025-04-03 08:29:12 | [pearl_trainer] epoch #283 | Finished meta-testing... +2025-04-03 08:29:12 | [pearl_trainer] epoch #283 | Saving snapshot... +2025-04-03 08:29:14 | [pearl_trainer] epoch #283 | Saved +2025-04-03 08:29:14 | [pearl_trainer] epoch #283 | Time 67466.54 s +2025-04-03 08:29:14 | [pearl_trainer] epoch #283 | EpochTime 236.61 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -16.5751 +MetaTest/Average/AverageReturn -16.5751 +MetaTest/Average/Iteration 283 +MetaTest/Average/MaxReturn 25.7035 +MetaTest/Average/MinReturn -62.5019 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 33.0761 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -16.5751 +MetaTest/__unnamed_task__/AverageReturn -16.5751 +MetaTest/__unnamed_task__/Iteration 283 +MetaTest/__unnamed_task__/MaxReturn 25.7035 +MetaTest/__unnamed_task__/MinReturn -62.5019 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 33.0761 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 469400 +------------------------------------------------- ----------- +2025-04-03 08:29:49 | [pearl_trainer] epoch #284 | Training... +2025-04-03 08:31:35 | [pearl_trainer] epoch #284 | Evaluating... +2025-04-03 08:31:35 | [pearl_trainer] epoch #284 | Sampling for adapation and meta-testing... +2025-04-03 08:33:27 | [pearl_trainer] epoch #284 | Finished meta-testing... +2025-04-03 08:33:27 | [pearl_trainer] epoch #284 | Saving snapshot... +2025-04-03 08:33:28 | [pearl_trainer] epoch #284 | Saved +2025-04-03 08:33:28 | [pearl_trainer] epoch #284 | Time 67721.38 s +2025-04-03 08:33:28 | [pearl_trainer] epoch #284 | EpochTime 254.83 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 28.8698 +MetaTest/Average/AverageReturn 28.8698 +MetaTest/Average/Iteration 284 +MetaTest/Average/MaxReturn 106.146 +MetaTest/Average/MinReturn -82.1065 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 68.1014 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 28.8698 +MetaTest/__unnamed_task__/AverageReturn 28.8698 +MetaTest/__unnamed_task__/Iteration 284 +MetaTest/__unnamed_task__/MaxReturn 106.146 +MetaTest/__unnamed_task__/MinReturn -82.1065 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 68.1014 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 471000 +------------------------------------------------- ----------- +2025-04-03 08:34:00 | [pearl_trainer] epoch #285 | Training... +2025-04-03 08:35:33 | [pearl_trainer] epoch #285 | Evaluating... +2025-04-03 08:35:33 | [pearl_trainer] epoch #285 | Sampling for adapation and meta-testing... +2025-04-03 08:37:25 | [pearl_trainer] epoch #285 | Finished meta-testing... +2025-04-03 08:37:25 | [pearl_trainer] epoch #285 | Saving snapshot... +2025-04-03 08:37:27 | [pearl_trainer] epoch #285 | Saved +2025-04-03 08:37:27 | [pearl_trainer] epoch #285 | Time 67959.83 s +2025-04-03 08:37:27 | [pearl_trainer] epoch #285 | EpochTime 238.45 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -2.22927 +MetaTest/Average/AverageReturn -2.22927 +MetaTest/Average/Iteration 285 +MetaTest/Average/MaxReturn 74.9034 +MetaTest/Average/MinReturn -85.8281 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 67.8015 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -2.22927 +MetaTest/__unnamed_task__/AverageReturn -2.22927 +MetaTest/__unnamed_task__/Iteration 285 +MetaTest/__unnamed_task__/MaxReturn 74.9034 +MetaTest/__unnamed_task__/MinReturn -85.8281 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 67.8015 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 472600 +------------------------------------------------- ------------ +2025-04-03 08:38:02 | [pearl_trainer] epoch #286 | Training... +2025-04-03 08:39:31 | [pearl_trainer] epoch #286 | Evaluating... +2025-04-03 08:39:31 | [pearl_trainer] epoch #286 | Sampling for adapation and meta-testing... +2025-04-03 08:41:28 | [pearl_trainer] epoch #286 | Finished meta-testing... +2025-04-03 08:41:28 | [pearl_trainer] epoch #286 | Saving snapshot... +2025-04-03 08:41:29 | [pearl_trainer] epoch #286 | Saved +2025-04-03 08:41:29 | [pearl_trainer] epoch #286 | Time 68201.64 s +2025-04-03 08:41:29 | [pearl_trainer] epoch #286 | EpochTime 241.81 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 12.3329 +MetaTest/Average/AverageReturn 12.3329 +MetaTest/Average/Iteration 286 +MetaTest/Average/MaxReturn 70.8143 +MetaTest/Average/MinReturn -57.3991 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 45.0273 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 12.3329 +MetaTest/__unnamed_task__/AverageReturn 12.3329 +MetaTest/__unnamed_task__/Iteration 286 +MetaTest/__unnamed_task__/MaxReturn 70.8143 +MetaTest/__unnamed_task__/MinReturn -57.3991 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 45.0273 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 474200 +------------------------------------------------- ----------- +2025-04-03 08:42:02 | [pearl_trainer] epoch #287 | Training... +2025-04-03 08:43:27 | [pearl_trainer] epoch #287 | Evaluating... +2025-04-03 08:43:27 | [pearl_trainer] epoch #287 | Sampling for adapation and meta-testing... +2025-04-03 08:45:25 | [pearl_trainer] epoch #287 | Finished meta-testing... +2025-04-03 08:45:25 | [pearl_trainer] epoch #287 | Saving snapshot... +2025-04-03 08:45:26 | [pearl_trainer] epoch #287 | Saved +2025-04-03 08:45:26 | [pearl_trainer] epoch #287 | Time 68438.66 s +2025-04-03 08:45:26 | [pearl_trainer] epoch #287 | EpochTime 237.02 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 28.9682 +MetaTest/Average/AverageReturn 28.9682 +MetaTest/Average/Iteration 287 +MetaTest/Average/MaxReturn 136.423 +MetaTest/Average/MinReturn -33.4865 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 61.3407 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 28.9682 +MetaTest/__unnamed_task__/AverageReturn 28.9682 +MetaTest/__unnamed_task__/Iteration 287 +MetaTest/__unnamed_task__/MaxReturn 136.423 +MetaTest/__unnamed_task__/MinReturn -33.4865 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 61.3407 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 475800 +------------------------------------------------- ----------- +2025-04-03 08:45:57 | [pearl_trainer] epoch #288 | Training... +2025-04-03 08:47:27 | [pearl_trainer] epoch #288 | Evaluating... +2025-04-03 08:47:27 | [pearl_trainer] epoch #288 | Sampling for adapation and meta-testing... +2025-04-03 08:49:24 | [pearl_trainer] epoch #288 | Finished meta-testing... +2025-04-03 08:49:24 | [pearl_trainer] epoch #288 | Saving snapshot... +2025-04-03 08:49:25 | [pearl_trainer] epoch #288 | Saved +2025-04-03 08:49:25 | [pearl_trainer] epoch #288 | Time 68678.01 s +2025-04-03 08:49:25 | [pearl_trainer] epoch #288 | EpochTime 239.35 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -33.3204 +MetaTest/Average/AverageReturn -33.3204 +MetaTest/Average/Iteration 288 +MetaTest/Average/MaxReturn -11.7368 +MetaTest/Average/MinReturn -68.968 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 21.2942 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -33.3204 +MetaTest/__unnamed_task__/AverageReturn -33.3204 +MetaTest/__unnamed_task__/Iteration 288 +MetaTest/__unnamed_task__/MaxReturn -11.7368 +MetaTest/__unnamed_task__/MinReturn -68.968 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 21.2942 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 477400 +------------------------------------------------- ----------- +2025-04-03 08:49:58 | [pearl_trainer] epoch #289 | Training... +2025-04-03 08:51:23 | [pearl_trainer] epoch #289 | Evaluating... +2025-04-03 08:51:23 | [pearl_trainer] epoch #289 | Sampling for adapation and meta-testing... +2025-04-03 08:53:21 | [pearl_trainer] epoch #289 | Finished meta-testing... +2025-04-03 08:53:21 | [pearl_trainer] epoch #289 | Saving snapshot... +2025-04-03 08:53:22 | [pearl_trainer] epoch #289 | Saved +2025-04-03 08:53:22 | [pearl_trainer] epoch #289 | Time 68914.79 s +2025-04-03 08:53:22 | [pearl_trainer] epoch #289 | EpochTime 236.78 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -28.6369 +MetaTest/Average/AverageReturn -28.6369 +MetaTest/Average/Iteration 289 +MetaTest/Average/MaxReturn 69.6977 +MetaTest/Average/MinReturn -77.2889 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 54.3023 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -28.6369 +MetaTest/__unnamed_task__/AverageReturn -28.6369 +MetaTest/__unnamed_task__/Iteration 289 +MetaTest/__unnamed_task__/MaxReturn 69.6977 +MetaTest/__unnamed_task__/MinReturn -77.2889 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 54.3023 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 479000 +------------------------------------------------- ----------- +2025-04-03 08:53:53 | [pearl_trainer] epoch #290 | Training... +2025-04-03 08:55:32 | [pearl_trainer] epoch #290 | Evaluating... +2025-04-03 08:55:32 | [pearl_trainer] epoch #290 | Sampling for adapation and meta-testing... +2025-04-03 08:57:17 | [pearl_trainer] epoch #290 | Finished meta-testing... +2025-04-03 08:57:17 | [pearl_trainer] epoch #290 | Saving snapshot... +2025-04-03 08:57:18 | [pearl_trainer] epoch #290 | Saved +2025-04-03 08:57:18 | [pearl_trainer] epoch #290 | Time 69151.31 s +2025-04-03 08:57:18 | [pearl_trainer] epoch #290 | EpochTime 236.52 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -27.8774 +MetaTest/Average/AverageReturn -27.8774 +MetaTest/Average/Iteration 290 +MetaTest/Average/MaxReturn 40.6828 +MetaTest/Average/MinReturn -75.9232 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 44.5562 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -27.8774 +MetaTest/__unnamed_task__/AverageReturn -27.8774 +MetaTest/__unnamed_task__/Iteration 290 +MetaTest/__unnamed_task__/MaxReturn 40.6828 +MetaTest/__unnamed_task__/MinReturn -75.9232 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 44.5562 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 480600 +------------------------------------------------- ----------- +2025-04-03 08:57:49 | [pearl_trainer] epoch #291 | Training... +2025-04-03 08:59:18 | [pearl_trainer] epoch #291 | Evaluating... +2025-04-03 08:59:18 | [pearl_trainer] epoch #291 | Sampling for adapation and meta-testing... +2025-04-03 09:01:20 | [pearl_trainer] epoch #291 | Finished meta-testing... +2025-04-03 09:01:20 | [pearl_trainer] epoch #291 | Saving snapshot... +2025-04-03 09:01:21 | [pearl_trainer] epoch #291 | Saved +2025-04-03 09:01:21 | [pearl_trainer] epoch #291 | Time 69393.94 s +2025-04-03 09:01:21 | [pearl_trainer] epoch #291 | EpochTime 242.63 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 2.21771 +MetaTest/Average/AverageReturn 2.21771 +MetaTest/Average/Iteration 291 +MetaTest/Average/MaxReturn 101.623 +MetaTest/Average/MinReturn -57.268 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 58.7101 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 2.21771 +MetaTest/__unnamed_task__/AverageReturn 2.21771 +MetaTest/__unnamed_task__/Iteration 291 +MetaTest/__unnamed_task__/MaxReturn 101.623 +MetaTest/__unnamed_task__/MinReturn -57.268 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 58.7101 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 482200 +------------------------------------------------- ------------ +2025-04-03 09:01:52 | [pearl_trainer] epoch #292 | Training... +2025-04-03 09:03:23 | [pearl_trainer] epoch #292 | Evaluating... +2025-04-03 09:03:23 | [pearl_trainer] epoch #292 | Sampling for adapation and meta-testing... +2025-04-03 09:05:11 | [pearl_trainer] epoch #292 | Finished meta-testing... +2025-04-03 09:05:11 | [pearl_trainer] epoch #292 | Saving snapshot... +2025-04-03 09:05:12 | [pearl_trainer] epoch #292 | Saved +2025-04-03 09:05:12 | [pearl_trainer] epoch #292 | Time 69624.79 s +2025-04-03 09:05:12 | [pearl_trainer] epoch #292 | EpochTime 230.84 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 2.67675 +MetaTest/Average/AverageReturn 2.67675 +MetaTest/Average/Iteration 292 +MetaTest/Average/MaxReturn 116.365 +MetaTest/Average/MinReturn -52.8964 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 59.3082 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 2.67675 +MetaTest/__unnamed_task__/AverageReturn 2.67675 +MetaTest/__unnamed_task__/Iteration 292 +MetaTest/__unnamed_task__/MaxReturn 116.365 +MetaTest/__unnamed_task__/MinReturn -52.8964 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 59.3082 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 483800 +------------------------------------------------- ------------ +2025-04-03 09:05:43 | [pearl_trainer] epoch #293 | Training... +2025-04-03 09:07:09 | [pearl_trainer] epoch #293 | Evaluating... +2025-04-03 09:07:09 | [pearl_trainer] epoch #293 | Sampling for adapation and meta-testing... +2025-04-03 09:09:03 | [pearl_trainer] epoch #293 | Finished meta-testing... +2025-04-03 09:09:03 | [pearl_trainer] epoch #293 | Saving snapshot... +2025-04-03 09:09:04 | [pearl_trainer] epoch #293 | Saved +2025-04-03 09:09:04 | [pearl_trainer] epoch #293 | Time 69857.52 s +2025-04-03 09:09:04 | [pearl_trainer] epoch #293 | EpochTime 232.73 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -16.1378 +MetaTest/Average/AverageReturn -16.1378 +MetaTest/Average/Iteration 293 +MetaTest/Average/MaxReturn 130.152 +MetaTest/Average/MinReturn -129.043 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 94.0768 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -16.1378 +MetaTest/__unnamed_task__/AverageReturn -16.1378 +MetaTest/__unnamed_task__/Iteration 293 +MetaTest/__unnamed_task__/MaxReturn 130.152 +MetaTest/__unnamed_task__/MinReturn -129.043 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 94.0768 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 485400 +------------------------------------------------- ----------- +2025-04-03 09:09:34 | [pearl_trainer] epoch #294 | Training... +2025-04-03 09:11:11 | [pearl_trainer] epoch #294 | Evaluating... +2025-04-03 09:11:11 | [pearl_trainer] epoch #294 | Sampling for adapation and meta-testing... +2025-04-03 09:12:58 | [pearl_trainer] epoch #294 | Finished meta-testing... +2025-04-03 09:12:58 | [pearl_trainer] epoch #294 | Saving snapshot... +2025-04-03 09:12:59 | [pearl_trainer] epoch #294 | Saved +2025-04-03 09:12:59 | [pearl_trainer] epoch #294 | Time 70092.05 s +2025-04-03 09:12:59 | [pearl_trainer] epoch #294 | EpochTime 234.53 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -32.0731 +MetaTest/Average/AverageReturn -32.0731 +MetaTest/Average/Iteration 294 +MetaTest/Average/MaxReturn 120.497 +MetaTest/Average/MinReturn -98.0214 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 78.0691 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -32.0731 +MetaTest/__unnamed_task__/AverageReturn -32.0731 +MetaTest/__unnamed_task__/Iteration 294 +MetaTest/__unnamed_task__/MaxReturn 120.497 +MetaTest/__unnamed_task__/MinReturn -98.0214 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 78.0691 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 487000 +------------------------------------------------- ----------- +2025-04-03 09:13:31 | [pearl_trainer] epoch #295 | Training... +2025-04-03 09:15:08 | [pearl_trainer] epoch #295 | Evaluating... +2025-04-03 09:15:08 | [pearl_trainer] epoch #295 | Sampling for adapation and meta-testing... +2025-04-03 09:16:55 | [pearl_trainer] epoch #295 | Finished meta-testing... +2025-04-03 09:16:55 | [pearl_trainer] epoch #295 | Saving snapshot... +2025-04-03 09:16:56 | [pearl_trainer] epoch #295 | Saved +2025-04-03 09:16:56 | [pearl_trainer] epoch #295 | Time 70328.94 s +2025-04-03 09:16:56 | [pearl_trainer] epoch #295 | EpochTime 236.88 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -28.0994 +MetaTest/Average/AverageReturn -28.0994 +MetaTest/Average/Iteration 295 +MetaTest/Average/MaxReturn 18.8515 +MetaTest/Average/MinReturn -66.3714 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 33.7038 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -28.0994 +MetaTest/__unnamed_task__/AverageReturn -28.0994 +MetaTest/__unnamed_task__/Iteration 295 +MetaTest/__unnamed_task__/MaxReturn 18.8515 +MetaTest/__unnamed_task__/MinReturn -66.3714 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 33.7038 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 488600 +------------------------------------------------- ----------- +2025-04-03 09:17:25 | [pearl_trainer] epoch #296 | Training... +2025-04-03 09:18:58 | [pearl_trainer] epoch #296 | Evaluating... +2025-04-03 09:18:58 | [pearl_trainer] epoch #296 | Sampling for adapation and meta-testing... +2025-04-03 09:20:47 | [pearl_trainer] epoch #296 | Finished meta-testing... +2025-04-03 09:20:47 | [pearl_trainer] epoch #296 | Saving snapshot... +2025-04-03 09:20:48 | [pearl_trainer] epoch #296 | Saved +2025-04-03 09:20:48 | [pearl_trainer] epoch #296 | Time 70560.96 s +2025-04-03 09:20:48 | [pearl_trainer] epoch #296 | EpochTime 232.02 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -22.1211 +MetaTest/Average/AverageReturn -22.1211 +MetaTest/Average/Iteration 296 +MetaTest/Average/MaxReturn 50.145 +MetaTest/Average/MinReturn -50.817 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 37.2925 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -22.1211 +MetaTest/__unnamed_task__/AverageReturn -22.1211 +MetaTest/__unnamed_task__/Iteration 296 +MetaTest/__unnamed_task__/MaxReturn 50.145 +MetaTest/__unnamed_task__/MinReturn -50.817 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 37.2925 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 490200 +------------------------------------------------- ----------- +2025-04-03 09:21:18 | [pearl_trainer] epoch #297 | Training... +2025-04-03 09:22:56 | [pearl_trainer] epoch #297 | Evaluating... +2025-04-03 09:22:56 | [pearl_trainer] epoch #297 | Sampling for adapation and meta-testing... +2025-04-03 09:24:43 | [pearl_trainer] epoch #297 | Finished meta-testing... +2025-04-03 09:24:43 | [pearl_trainer] epoch #297 | Saving snapshot... +2025-04-03 09:24:44 | [pearl_trainer] epoch #297 | Saved +2025-04-03 09:24:44 | [pearl_trainer] epoch #297 | Time 70797.00 s +2025-04-03 09:24:44 | [pearl_trainer] epoch #297 | EpochTime 236.04 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -41.1702 +MetaTest/Average/AverageReturn -41.1702 +MetaTest/Average/Iteration 297 +MetaTest/Average/MaxReturn 21.5941 +MetaTest/Average/MinReturn -68.7195 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 32.633 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -41.1702 +MetaTest/__unnamed_task__/AverageReturn -41.1702 +MetaTest/__unnamed_task__/Iteration 297 +MetaTest/__unnamed_task__/MaxReturn 21.5941 +MetaTest/__unnamed_task__/MinReturn -68.7195 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 32.633 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 491800 +------------------------------------------------- ----------- +2025-04-03 09:25:13 | [pearl_trainer] epoch #298 | Training... +2025-04-03 09:26:43 | [pearl_trainer] epoch #298 | Evaluating... +2025-04-03 09:26:43 | [pearl_trainer] epoch #298 | Sampling for adapation and meta-testing... +2025-04-03 09:28:36 | [pearl_trainer] epoch #298 | Finished meta-testing... +2025-04-03 09:28:36 | [pearl_trainer] epoch #298 | Saving snapshot... +2025-04-03 09:28:38 | [pearl_trainer] epoch #298 | Saved +2025-04-03 09:28:38 | [pearl_trainer] epoch #298 | Time 71030.75 s +2025-04-03 09:28:38 | [pearl_trainer] epoch #298 | EpochTime 233.75 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -14.3537 +MetaTest/Average/AverageReturn -14.3537 +MetaTest/Average/Iteration 298 +MetaTest/Average/MaxReturn 22.8994 +MetaTest/Average/MinReturn -62.7451 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 34.9494 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -14.3537 +MetaTest/__unnamed_task__/AverageReturn -14.3537 +MetaTest/__unnamed_task__/Iteration 298 +MetaTest/__unnamed_task__/MaxReturn 22.8994 +MetaTest/__unnamed_task__/MinReturn -62.7451 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 34.9494 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 493400 +------------------------------------------------- ----------- +2025-04-03 09:29:15 | [pearl_trainer] epoch #299 | Training... +2025-04-03 09:30:38 | [pearl_trainer] epoch #299 | Evaluating... +2025-04-03 09:30:38 | [pearl_trainer] epoch #299 | Sampling for adapation and meta-testing... +2025-04-03 09:32:30 | [pearl_trainer] epoch #299 | Finished meta-testing... +2025-04-03 09:32:30 | [pearl_trainer] epoch #299 | Saving snapshot... +2025-04-03 09:32:32 | [pearl_trainer] epoch #299 | Saved +2025-04-03 09:32:32 | [pearl_trainer] epoch #299 | Time 71264.55 s +2025-04-03 09:32:32 | [pearl_trainer] epoch #299 | EpochTime 233.80 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -53.0064 +MetaTest/Average/AverageReturn -53.0064 +MetaTest/Average/Iteration 299 +MetaTest/Average/MaxReturn -43.9922 +MetaTest/Average/MinReturn -70.3356 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 9.361 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -53.0064 +MetaTest/__unnamed_task__/AverageReturn -53.0064 +MetaTest/__unnamed_task__/Iteration 299 +MetaTest/__unnamed_task__/MaxReturn -43.9922 +MetaTest/__unnamed_task__/MinReturn -70.3356 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 9.361 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 495000 +------------------------------------------------- ----------- +2025-04-03 09:33:02 | [pearl_trainer] epoch #300 | Training... +2025-04-03 09:34:22 | [pearl_trainer] epoch #300 | Evaluating... +2025-04-03 09:34:22 | [pearl_trainer] epoch #300 | Sampling for adapation and meta-testing... +2025-04-03 09:36:13 | [pearl_trainer] epoch #300 | Finished meta-testing... +2025-04-03 09:36:13 | [pearl_trainer] epoch #300 | Saving snapshot... +2025-04-03 09:36:15 | [pearl_trainer] epoch #300 | Saved +2025-04-03 09:36:15 | [pearl_trainer] epoch #300 | Time 71487.85 s +2025-04-03 09:36:15 | [pearl_trainer] epoch #300 | EpochTime 223.30 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -30.0364 +MetaTest/Average/AverageReturn -30.0364 +MetaTest/Average/Iteration 300 +MetaTest/Average/MaxReturn -8.6461 +MetaTest/Average/MinReturn -47.8814 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 15.4506 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -30.0364 +MetaTest/__unnamed_task__/AverageReturn -30.0364 +MetaTest/__unnamed_task__/Iteration 300 +MetaTest/__unnamed_task__/MaxReturn -8.6461 +MetaTest/__unnamed_task__/MinReturn -47.8814 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 15.4506 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 496600 +------------------------------------------------- ----------- +2025-04-03 09:36:45 | [pearl_trainer] epoch #301 | Training... +2025-04-03 09:38:13 | [pearl_trainer] epoch #301 | Evaluating... +2025-04-03 09:38:13 | [pearl_trainer] epoch #301 | Sampling for adapation and meta-testing... +2025-04-03 09:40:03 | [pearl_trainer] epoch #301 | Finished meta-testing... +2025-04-03 09:40:03 | [pearl_trainer] epoch #301 | Saving snapshot... +2025-04-03 09:40:04 | [pearl_trainer] epoch #301 | Saved +2025-04-03 09:40:04 | [pearl_trainer] epoch #301 | Time 71716.61 s +2025-04-03 09:40:04 | [pearl_trainer] epoch #301 | EpochTime 228.76 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -17.0962 +MetaTest/Average/AverageReturn -17.0962 +MetaTest/Average/Iteration 301 +MetaTest/Average/MaxReturn 59.5645 +MetaTest/Average/MinReturn -49.4812 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 41.762 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -17.0962 +MetaTest/__unnamed_task__/AverageReturn -17.0962 +MetaTest/__unnamed_task__/Iteration 301 +MetaTest/__unnamed_task__/MaxReturn 59.5645 +MetaTest/__unnamed_task__/MinReturn -49.4812 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 41.762 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 498200 +------------------------------------------------- ----------- +2025-04-03 09:40:34 | [pearl_trainer] epoch #302 | Training... +2025-04-03 09:41:59 | [pearl_trainer] epoch #302 | Evaluating... +2025-04-03 09:41:59 | [pearl_trainer] epoch #302 | Sampling for adapation and meta-testing... +2025-04-03 09:43:46 | [pearl_trainer] epoch #302 | Finished meta-testing... +2025-04-03 09:43:46 | [pearl_trainer] epoch #302 | Saving snapshot... +2025-04-03 09:43:47 | [pearl_trainer] epoch #302 | Saved +2025-04-03 09:43:47 | [pearl_trainer] epoch #302 | Time 71939.91 s +2025-04-03 09:43:47 | [pearl_trainer] epoch #302 | EpochTime 223.30 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -47.1254 +MetaTest/Average/AverageReturn -47.1254 +MetaTest/Average/Iteration 302 +MetaTest/Average/MaxReturn -13.1969 +MetaTest/Average/MinReturn -65.4251 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 19.0576 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -47.1254 +MetaTest/__unnamed_task__/AverageReturn -47.1254 +MetaTest/__unnamed_task__/Iteration 302 +MetaTest/__unnamed_task__/MaxReturn -13.1969 +MetaTest/__unnamed_task__/MinReturn -65.4251 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 19.0576 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 499800 +------------------------------------------------- ----------- +2025-04-03 09:44:16 | [pearl_trainer] epoch #303 | Training... +2025-04-03 09:45:47 | [pearl_trainer] epoch #303 | Evaluating... +2025-04-03 09:45:47 | [pearl_trainer] epoch #303 | Sampling for adapation and meta-testing... +2025-04-03 09:47:36 | [pearl_trainer] epoch #303 | Finished meta-testing... +2025-04-03 09:47:36 | [pearl_trainer] epoch #303 | Saving snapshot... +2025-04-03 09:47:38 | [pearl_trainer] epoch #303 | Saved +2025-04-03 09:47:38 | [pearl_trainer] epoch #303 | Time 72170.56 s +2025-04-03 09:47:38 | [pearl_trainer] epoch #303 | EpochTime 230.66 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -13.3273 +MetaTest/Average/AverageReturn -13.3273 +MetaTest/Average/Iteration 303 +MetaTest/Average/MaxReturn 53.772 +MetaTest/Average/MinReturn -62.5353 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 39.3448 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -13.3273 +MetaTest/__unnamed_task__/AverageReturn -13.3273 +MetaTest/__unnamed_task__/Iteration 303 +MetaTest/__unnamed_task__/MaxReturn 53.772 +MetaTest/__unnamed_task__/MinReturn -62.5353 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 39.3448 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 501400 +------------------------------------------------- ----------- +2025-04-03 09:48:09 | [pearl_trainer] epoch #304 | Training... +2025-04-03 09:49:33 | [pearl_trainer] epoch #304 | Evaluating... +2025-04-03 09:49:33 | [pearl_trainer] epoch #304 | Sampling for adapation and meta-testing... +2025-04-03 09:51:28 | [pearl_trainer] epoch #304 | Finished meta-testing... +2025-04-03 09:51:28 | [pearl_trainer] epoch #304 | Saving snapshot... +2025-04-03 09:51:29 | [pearl_trainer] epoch #304 | Saved +2025-04-03 09:51:29 | [pearl_trainer] epoch #304 | Time 72402.09 s +2025-04-03 09:51:29 | [pearl_trainer] epoch #304 | EpochTime 231.52 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 30.9356 +MetaTest/Average/AverageReturn 30.9356 +MetaTest/Average/Iteration 304 +MetaTest/Average/MaxReturn 65.1879 +MetaTest/Average/MinReturn -53.5402 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 43.5363 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 30.9356 +MetaTest/__unnamed_task__/AverageReturn 30.9356 +MetaTest/__unnamed_task__/Iteration 304 +MetaTest/__unnamed_task__/MaxReturn 65.1879 +MetaTest/__unnamed_task__/MinReturn -53.5402 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 43.5363 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 503000 +------------------------------------------------- ----------- +2025-04-03 09:52:00 | [pearl_trainer] epoch #305 | Training... +2025-04-03 09:53:37 | [pearl_trainer] epoch #305 | Evaluating... +2025-04-03 09:53:37 | [pearl_trainer] epoch #305 | Sampling for adapation and meta-testing... +2025-04-03 09:55:27 | [pearl_trainer] epoch #305 | Finished meta-testing... +2025-04-03 09:55:27 | [pearl_trainer] epoch #305 | Saving snapshot... +2025-04-03 09:55:28 | [pearl_trainer] epoch #305 | Saved +2025-04-03 09:55:28 | [pearl_trainer] epoch #305 | Time 72641.47 s +2025-04-03 09:55:28 | [pearl_trainer] epoch #305 | EpochTime 239.38 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -23.387 +MetaTest/Average/AverageReturn -23.387 +MetaTest/Average/Iteration 305 +MetaTest/Average/MaxReturn 12.1226 +MetaTest/Average/MinReturn -46.2793 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 20.8659 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -23.387 +MetaTest/__unnamed_task__/AverageReturn -23.387 +MetaTest/__unnamed_task__/Iteration 305 +MetaTest/__unnamed_task__/MaxReturn 12.1226 +MetaTest/__unnamed_task__/MinReturn -46.2793 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 20.8659 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 504600 +------------------------------------------------- ----------- +2025-04-03 09:56:01 | [pearl_trainer] epoch #306 | Training... +2025-04-03 09:57:41 | [pearl_trainer] epoch #306 | Evaluating... +2025-04-03 09:57:41 | [pearl_trainer] epoch #306 | Sampling for adapation and meta-testing... +2025-04-03 09:59:45 | [pearl_trainer] epoch #306 | Finished meta-testing... +2025-04-03 09:59:45 | [pearl_trainer] epoch #306 | Saving snapshot... +2025-04-03 09:59:46 | [pearl_trainer] epoch #306 | Saved +2025-04-03 09:59:46 | [pearl_trainer] epoch #306 | Time 72899.46 s +2025-04-03 09:59:46 | [pearl_trainer] epoch #306 | EpochTime 257.99 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 3.79906 +MetaTest/Average/AverageReturn 3.79906 +MetaTest/Average/Iteration 306 +MetaTest/Average/MaxReturn 65.1416 +MetaTest/Average/MinReturn -69.1494 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 53.8548 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 3.79906 +MetaTest/__unnamed_task__/AverageReturn 3.79906 +MetaTest/__unnamed_task__/Iteration 306 +MetaTest/__unnamed_task__/MaxReturn 65.1416 +MetaTest/__unnamed_task__/MinReturn -69.1494 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 53.8548 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 506200 +------------------------------------------------- ------------ +2025-04-03 10:00:18 | [pearl_trainer] epoch #307 | Training... +2025-04-03 10:01:46 | [pearl_trainer] epoch #307 | Evaluating... +2025-04-03 10:01:46 | [pearl_trainer] epoch #307 | Sampling for adapation and meta-testing... +2025-04-03 10:03:40 | [pearl_trainer] epoch #307 | Finished meta-testing... +2025-04-03 10:03:40 | [pearl_trainer] epoch #307 | Saving snapshot... +2025-04-03 10:03:41 | [pearl_trainer] epoch #307 | Saved +2025-04-03 10:03:41 | [pearl_trainer] epoch #307 | Time 73134.32 s +2025-04-03 10:03:41 | [pearl_trainer] epoch #307 | EpochTime 234.86 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 3.21352 +MetaTest/Average/AverageReturn 3.21352 +MetaTest/Average/Iteration 307 +MetaTest/Average/MaxReturn 64.9929 +MetaTest/Average/MinReturn -33.7234 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 34.2489 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 3.21352 +MetaTest/__unnamed_task__/AverageReturn 3.21352 +MetaTest/__unnamed_task__/Iteration 307 +MetaTest/__unnamed_task__/MaxReturn 64.9929 +MetaTest/__unnamed_task__/MinReturn -33.7234 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 34.2489 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 507800 +------------------------------------------------- ------------ +2025-04-03 10:04:14 | [pearl_trainer] epoch #308 | Training... +2025-04-03 10:05:44 | [pearl_trainer] epoch #308 | Evaluating... +2025-04-03 10:05:44 | [pearl_trainer] epoch #308 | Sampling for adapation and meta-testing... +2025-04-03 10:07:38 | [pearl_trainer] epoch #308 | Finished meta-testing... +2025-04-03 10:07:38 | [pearl_trainer] epoch #308 | Saving snapshot... +2025-04-03 10:07:39 | [pearl_trainer] epoch #308 | Saved +2025-04-03 10:07:39 | [pearl_trainer] epoch #308 | Time 73372.47 s +2025-04-03 10:07:39 | [pearl_trainer] epoch #308 | EpochTime 238.14 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 13.1959 +MetaTest/Average/AverageReturn 13.1959 +MetaTest/Average/Iteration 308 +MetaTest/Average/MaxReturn 100.397 +MetaTest/Average/MinReturn -67.4582 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 63.8577 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 13.1959 +MetaTest/__unnamed_task__/AverageReturn 13.1959 +MetaTest/__unnamed_task__/Iteration 308 +MetaTest/__unnamed_task__/MaxReturn 100.397 +MetaTest/__unnamed_task__/MinReturn -67.4582 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 63.8577 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 509400 +------------------------------------------------- ----------- +2025-04-03 10:08:12 | [pearl_trainer] epoch #309 | Training... +2025-04-03 10:09:39 | [pearl_trainer] epoch #309 | Evaluating... +2025-04-03 10:09:39 | [pearl_trainer] epoch #309 | Sampling for adapation and meta-testing... +2025-04-03 10:11:36 | [pearl_trainer] epoch #309 | Finished meta-testing... +2025-04-03 10:11:36 | [pearl_trainer] epoch #309 | Saving snapshot... +2025-04-03 10:11:37 | [pearl_trainer] epoch #309 | Saved +2025-04-03 10:11:37 | [pearl_trainer] epoch #309 | Time 73609.90 s +2025-04-03 10:11:37 | [pearl_trainer] epoch #309 | EpochTime 237.43 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 10.8352 +MetaTest/Average/AverageReturn 10.8352 +MetaTest/Average/Iteration 309 +MetaTest/Average/MaxReturn 57.8994 +MetaTest/Average/MinReturn -26.6563 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 30.371 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 10.8352 +MetaTest/__unnamed_task__/AverageReturn 10.8352 +MetaTest/__unnamed_task__/Iteration 309 +MetaTest/__unnamed_task__/MaxReturn 57.8994 +MetaTest/__unnamed_task__/MinReturn -26.6563 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 30.371 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 511000 +------------------------------------------------- ----------- +2025-04-03 10:12:09 | [pearl_trainer] epoch #310 | Training... +2025-04-03 10:13:45 | [pearl_trainer] epoch #310 | Evaluating... +2025-04-03 10:13:45 | [pearl_trainer] epoch #310 | Sampling for adapation and meta-testing... +2025-04-03 10:15:38 | [pearl_trainer] epoch #310 | Finished meta-testing... +2025-04-03 10:15:38 | [pearl_trainer] epoch #310 | Saving snapshot... +2025-04-03 10:15:39 | [pearl_trainer] epoch #310 | Saved +2025-04-03 10:15:39 | [pearl_trainer] epoch #310 | Time 73852.02 s +2025-04-03 10:15:39 | [pearl_trainer] epoch #310 | EpochTime 242.12 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -20.0962 +MetaTest/Average/AverageReturn -20.0962 +MetaTest/Average/Iteration 310 +MetaTest/Average/MaxReturn 10.282 +MetaTest/Average/MinReturn -60.5831 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 23.3962 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -20.0962 +MetaTest/__unnamed_task__/AverageReturn -20.0962 +MetaTest/__unnamed_task__/Iteration 310 +MetaTest/__unnamed_task__/MaxReturn 10.282 +MetaTest/__unnamed_task__/MinReturn -60.5831 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 23.3962 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 512600 +------------------------------------------------- ----------- +2025-04-03 10:16:11 | [pearl_trainer] epoch #311 | Training... +2025-04-03 10:17:41 | [pearl_trainer] epoch #311 | Evaluating... +2025-04-03 10:17:41 | [pearl_trainer] epoch #311 | Sampling for adapation and meta-testing... +2025-04-03 10:19:36 | [pearl_trainer] epoch #311 | Finished meta-testing... +2025-04-03 10:19:36 | [pearl_trainer] epoch #311 | Saving snapshot... +2025-04-03 10:19:37 | [pearl_trainer] epoch #311 | Saved +2025-04-03 10:19:37 | [pearl_trainer] epoch #311 | Time 74089.71 s +2025-04-03 10:19:37 | [pearl_trainer] epoch #311 | EpochTime 237.69 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 9.2274 +MetaTest/Average/AverageReturn 9.2274 +MetaTest/Average/Iteration 311 +MetaTest/Average/MaxReturn 60.2636 +MetaTest/Average/MinReturn -20.0716 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 31.6808 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 9.2274 +MetaTest/__unnamed_task__/AverageReturn 9.2274 +MetaTest/__unnamed_task__/Iteration 311 +MetaTest/__unnamed_task__/MaxReturn 60.2636 +MetaTest/__unnamed_task__/MinReturn -20.0716 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 31.6808 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 514200 +------------------------------------------------- ----------- +2025-04-03 10:20:10 | [pearl_trainer] epoch #312 | Training... +2025-04-03 10:21:41 | [pearl_trainer] epoch #312 | Evaluating... +2025-04-03 10:21:41 | [pearl_trainer] epoch #312 | Sampling for adapation and meta-testing... +2025-04-03 10:23:36 | [pearl_trainer] epoch #312 | Finished meta-testing... +2025-04-03 10:23:36 | [pearl_trainer] epoch #312 | Saving snapshot... +2025-04-03 10:23:37 | [pearl_trainer] epoch #312 | Saved +2025-04-03 10:23:38 | [pearl_trainer] epoch #312 | Time 74330.52 s +2025-04-03 10:23:38 | [pearl_trainer] epoch #312 | EpochTime 240.80 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 8.63633 +MetaTest/Average/AverageReturn 8.63633 +MetaTest/Average/Iteration 312 +MetaTest/Average/MaxReturn 121.301 +MetaTest/Average/MinReturn -48.9712 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 61.2078 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 8.63633 +MetaTest/__unnamed_task__/AverageReturn 8.63633 +MetaTest/__unnamed_task__/Iteration 312 +MetaTest/__unnamed_task__/MaxReturn 121.301 +MetaTest/__unnamed_task__/MinReturn -48.9712 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 61.2078 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 515800 +------------------------------------------------- ------------ +2025-04-03 10:24:13 | [pearl_trainer] epoch #313 | Training... +2025-04-03 10:25:37 | [pearl_trainer] epoch #313 | Evaluating... +2025-04-03 10:25:37 | [pearl_trainer] epoch #313 | Sampling for adapation and meta-testing... +2025-04-03 10:27:33 | [pearl_trainer] epoch #313 | Finished meta-testing... +2025-04-03 10:27:33 | [pearl_trainer] epoch #313 | Saving snapshot... +2025-04-03 10:27:35 | [pearl_trainer] epoch #313 | Saved +2025-04-03 10:27:35 | [pearl_trainer] epoch #313 | Time 74567.79 s +2025-04-03 10:27:35 | [pearl_trainer] epoch #313 | EpochTime 237.27 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 5.22535 +MetaTest/Average/AverageReturn 5.22535 +MetaTest/Average/Iteration 313 +MetaTest/Average/MaxReturn 42.7275 +MetaTest/Average/MinReturn -16.4109 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 25.6501 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 5.22535 +MetaTest/__unnamed_task__/AverageReturn 5.22535 +MetaTest/__unnamed_task__/Iteration 313 +MetaTest/__unnamed_task__/MaxReturn 42.7275 +MetaTest/__unnamed_task__/MinReturn -16.4109 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 25.6501 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 517400 +------------------------------------------------- ------------ +2025-04-03 10:28:08 | [pearl_trainer] epoch #314 | Training... +2025-04-03 10:29:54 | [pearl_trainer] epoch #314 | Evaluating... +2025-04-03 10:29:54 | [pearl_trainer] epoch #314 | Sampling for adapation and meta-testing... +2025-04-03 10:31:45 | [pearl_trainer] epoch #314 | Finished meta-testing... +2025-04-03 10:31:45 | [pearl_trainer] epoch #314 | Saving snapshot... +2025-04-03 10:31:46 | [pearl_trainer] epoch #314 | Saved +2025-04-03 10:31:46 | [pearl_trainer] epoch #314 | Time 74818.95 s +2025-04-03 10:31:46 | [pearl_trainer] epoch #314 | EpochTime 251.16 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 26.1389 +MetaTest/Average/AverageReturn 26.1389 +MetaTest/Average/Iteration 314 +MetaTest/Average/MaxReturn 110.037 +MetaTest/Average/MinReturn -23.5545 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 56.7417 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 26.1389 +MetaTest/__unnamed_task__/AverageReturn 26.1389 +MetaTest/__unnamed_task__/Iteration 314 +MetaTest/__unnamed_task__/MaxReturn 110.037 +MetaTest/__unnamed_task__/MinReturn -23.5545 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 56.7417 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 519000 +------------------------------------------------- ----------- +2025-04-03 10:32:19 | [pearl_trainer] epoch #315 | Training... +2025-04-03 10:33:49 | [pearl_trainer] epoch #315 | Evaluating... +2025-04-03 10:33:49 | [pearl_trainer] epoch #315 | Sampling for adapation and meta-testing... +2025-04-03 10:35:47 | [pearl_trainer] epoch #315 | Finished meta-testing... +2025-04-03 10:35:47 | [pearl_trainer] epoch #315 | Saving snapshot... +2025-04-03 10:35:48 | [pearl_trainer] epoch #315 | Saved +2025-04-03 10:35:48 | [pearl_trainer] epoch #315 | Time 75061.13 s +2025-04-03 10:35:48 | [pearl_trainer] epoch #315 | EpochTime 242.18 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 12.6899 +MetaTest/Average/AverageReturn 12.6899 +MetaTest/Average/Iteration 315 +MetaTest/Average/MaxReturn 73.6685 +MetaTest/Average/MinReturn -38.7753 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 37.3866 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 12.6899 +MetaTest/__unnamed_task__/AverageReturn 12.6899 +MetaTest/__unnamed_task__/Iteration 315 +MetaTest/__unnamed_task__/MaxReturn 73.6685 +MetaTest/__unnamed_task__/MinReturn -38.7753 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 37.3866 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 520600 +------------------------------------------------- ----------- +2025-04-03 10:36:20 | [pearl_trainer] epoch #316 | Training... +2025-04-03 10:37:50 | [pearl_trainer] epoch #316 | Evaluating... +2025-04-03 10:37:50 | [pearl_trainer] epoch #316 | Sampling for adapation and meta-testing... +2025-04-03 10:39:43 | [pearl_trainer] epoch #316 | Finished meta-testing... +2025-04-03 10:39:43 | [pearl_trainer] epoch #316 | Saving snapshot... +2025-04-03 10:39:44 | [pearl_trainer] epoch #316 | Saved +2025-04-03 10:39:44 | [pearl_trainer] epoch #316 | Time 75296.85 s +2025-04-03 10:39:44 | [pearl_trainer] epoch #316 | EpochTime 235.72 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -2.33228 +MetaTest/Average/AverageReturn -2.33228 +MetaTest/Average/Iteration 316 +MetaTest/Average/MaxReturn 43.4564 +MetaTest/Average/MinReturn -38.9505 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 33.7828 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -2.33228 +MetaTest/__unnamed_task__/AverageReturn -2.33228 +MetaTest/__unnamed_task__/Iteration 316 +MetaTest/__unnamed_task__/MaxReturn 43.4564 +MetaTest/__unnamed_task__/MinReturn -38.9505 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 33.7828 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 522200 +------------------------------------------------- ------------ +2025-04-03 10:40:16 | [pearl_trainer] epoch #317 | Training... +2025-04-03 10:41:52 | [pearl_trainer] epoch #317 | Evaluating... +2025-04-03 10:41:52 | [pearl_trainer] epoch #317 | Sampling for adapation and meta-testing... +2025-04-03 10:43:52 | [pearl_trainer] epoch #317 | Finished meta-testing... +2025-04-03 10:43:52 | [pearl_trainer] epoch #317 | Saving snapshot... +2025-04-03 10:43:53 | [pearl_trainer] epoch #317 | Saved +2025-04-03 10:43:53 | [pearl_trainer] epoch #317 | Time 75545.79 s +2025-04-03 10:43:53 | [pearl_trainer] epoch #317 | EpochTime 248.93 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -14.9018 +MetaTest/Average/AverageReturn -14.9018 +MetaTest/Average/Iteration 317 +MetaTest/Average/MaxReturn 30.8978 +MetaTest/Average/MinReturn -35.0545 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 23.4756 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -14.9018 +MetaTest/__unnamed_task__/AverageReturn -14.9018 +MetaTest/__unnamed_task__/Iteration 317 +MetaTest/__unnamed_task__/MaxReturn 30.8978 +MetaTest/__unnamed_task__/MinReturn -35.0545 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 23.4756 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 523800 +------------------------------------------------- ----------- +2025-04-03 10:44:24 | [pearl_trainer] epoch #318 | Training... +2025-04-03 10:45:56 | [pearl_trainer] epoch #318 | Evaluating... +2025-04-03 10:45:56 | [pearl_trainer] epoch #318 | Sampling for adapation and meta-testing... +2025-04-03 10:47:50 | [pearl_trainer] epoch #318 | Finished meta-testing... +2025-04-03 10:47:50 | [pearl_trainer] epoch #318 | Saving snapshot... +2025-04-03 10:47:52 | [pearl_trainer] epoch #318 | Saved +2025-04-03 10:47:52 | [pearl_trainer] epoch #318 | Time 75784.83 s +2025-04-03 10:47:52 | [pearl_trainer] epoch #318 | EpochTime 239.04 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -25.5554 +MetaTest/Average/AverageReturn -25.5554 +MetaTest/Average/Iteration 318 +MetaTest/Average/MaxReturn 60.63 +MetaTest/Average/MinReturn -84.0839 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 53.2143 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -25.5554 +MetaTest/__unnamed_task__/AverageReturn -25.5554 +MetaTest/__unnamed_task__/Iteration 318 +MetaTest/__unnamed_task__/MaxReturn 60.63 +MetaTest/__unnamed_task__/MinReturn -84.0839 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 53.2143 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 525400 +------------------------------------------------- ----------- +2025-04-03 10:48:24 | [pearl_trainer] epoch #319 | Training... +2025-04-03 10:49:52 | [pearl_trainer] epoch #319 | Evaluating... +2025-04-03 10:49:52 | [pearl_trainer] epoch #319 | Sampling for adapation and meta-testing... +2025-04-03 10:51:49 | [pearl_trainer] epoch #319 | Finished meta-testing... +2025-04-03 10:51:49 | [pearl_trainer] epoch #319 | Saving snapshot... +2025-04-03 10:51:50 | [pearl_trainer] epoch #319 | Saved +2025-04-03 10:51:50 | [pearl_trainer] epoch #319 | Time 76023.21 s +2025-04-03 10:51:50 | [pearl_trainer] epoch #319 | EpochTime 238.38 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 10.5304 +MetaTest/Average/AverageReturn 10.5304 +MetaTest/Average/Iteration 319 +MetaTest/Average/MaxReturn 93.1018 +MetaTest/Average/MinReturn -88.3701 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 63.7471 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 10.5304 +MetaTest/__unnamed_task__/AverageReturn 10.5304 +MetaTest/__unnamed_task__/Iteration 319 +MetaTest/__unnamed_task__/MaxReturn 93.1018 +MetaTest/__unnamed_task__/MinReturn -88.3701 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 63.7471 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 527000 +------------------------------------------------- ----------- +2025-04-03 10:52:23 | [pearl_trainer] epoch #320 | Training... +2025-04-03 10:54:00 | [pearl_trainer] epoch #320 | Evaluating... +2025-04-03 10:54:00 | [pearl_trainer] epoch #320 | Sampling for adapation and meta-testing... +2025-04-03 10:55:56 | [pearl_trainer] epoch #320 | Finished meta-testing... +2025-04-03 10:55:56 | [pearl_trainer] epoch #320 | Saving snapshot... +2025-04-03 10:55:57 | [pearl_trainer] epoch #320 | Saved +2025-04-03 10:55:57 | [pearl_trainer] epoch #320 | Time 76270.41 s +2025-04-03 10:55:57 | [pearl_trainer] epoch #320 | EpochTime 247.20 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 12.56 +MetaTest/Average/AverageReturn 12.56 +MetaTest/Average/Iteration 320 +MetaTest/Average/MaxReturn 80.0444 +MetaTest/Average/MinReturn -42.9403 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 54.4529 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 12.56 +MetaTest/__unnamed_task__/AverageReturn 12.56 +MetaTest/__unnamed_task__/Iteration 320 +MetaTest/__unnamed_task__/MaxReturn 80.0444 +MetaTest/__unnamed_task__/MinReturn -42.9403 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 54.4529 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 528600 +------------------------------------------------- ----------- +2025-04-03 10:56:31 | [pearl_trainer] epoch #321 | Training... +2025-04-03 10:57:57 | [pearl_trainer] epoch #321 | Evaluating... +2025-04-03 10:57:57 | [pearl_trainer] epoch #321 | Sampling for adapation and meta-testing... +2025-04-03 10:59:57 | [pearl_trainer] epoch #321 | Finished meta-testing... +2025-04-03 10:59:57 | [pearl_trainer] epoch #321 | Saving snapshot... +2025-04-03 10:59:58 | [pearl_trainer] epoch #321 | Saved +2025-04-03 10:59:58 | [pearl_trainer] epoch #321 | Time 76511.38 s +2025-04-03 10:59:58 | [pearl_trainer] epoch #321 | EpochTime 240.96 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -7.41829 +MetaTest/Average/AverageReturn -7.41829 +MetaTest/Average/Iteration 321 +MetaTest/Average/MaxReturn 28.5343 +MetaTest/Average/MinReturn -48.9044 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 30.9217 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -7.41829 +MetaTest/__unnamed_task__/AverageReturn -7.41829 +MetaTest/__unnamed_task__/Iteration 321 +MetaTest/__unnamed_task__/MaxReturn 28.5343 +MetaTest/__unnamed_task__/MinReturn -48.9044 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 30.9217 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 530200 +------------------------------------------------- ------------ +2025-04-03 11:00:31 | [pearl_trainer] epoch #322 | Training... +2025-04-03 11:02:07 | [pearl_trainer] epoch #322 | Evaluating... +2025-04-03 11:02:07 | [pearl_trainer] epoch #322 | Sampling for adapation and meta-testing... +2025-04-03 11:04:02 | [pearl_trainer] epoch #322 | Finished meta-testing... +2025-04-03 11:04:02 | [pearl_trainer] epoch #322 | Saving snapshot... +2025-04-03 11:04:03 | [pearl_trainer] epoch #322 | Saved +2025-04-03 11:04:03 | [pearl_trainer] epoch #322 | Time 76755.92 s +2025-04-03 11:04:03 | [pearl_trainer] epoch #322 | EpochTime 244.54 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 39.0827 +MetaTest/Average/AverageReturn 39.0827 +MetaTest/Average/Iteration 322 +MetaTest/Average/MaxReturn 70.9828 +MetaTest/Average/MinReturn -17.3589 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 34.2731 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 39.0827 +MetaTest/__unnamed_task__/AverageReturn 39.0827 +MetaTest/__unnamed_task__/Iteration 322 +MetaTest/__unnamed_task__/MaxReturn 70.9828 +MetaTest/__unnamed_task__/MinReturn -17.3589 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 34.2731 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 531800 +------------------------------------------------- ----------- +2025-04-03 11:04:36 | [pearl_trainer] epoch #323 | Training... +2025-04-03 11:06:04 | [pearl_trainer] epoch #323 | Evaluating... +2025-04-03 11:06:04 | [pearl_trainer] epoch #323 | Sampling for adapation and meta-testing... +2025-04-03 11:07:58 | [pearl_trainer] epoch #323 | Finished meta-testing... +2025-04-03 11:07:58 | [pearl_trainer] epoch #323 | Saving snapshot... +2025-04-03 11:08:00 | [pearl_trainer] epoch #323 | Saved +2025-04-03 11:08:00 | [pearl_trainer] epoch #323 | Time 76992.59 s +2025-04-03 11:08:00 | [pearl_trainer] epoch #323 | EpochTime 236.67 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 3.37282 +MetaTest/Average/AverageReturn 3.37282 +MetaTest/Average/Iteration 323 +MetaTest/Average/MaxReturn 35.3136 +MetaTest/Average/MinReturn -31.09 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 25.3644 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 3.37282 +MetaTest/__unnamed_task__/AverageReturn 3.37282 +MetaTest/__unnamed_task__/Iteration 323 +MetaTest/__unnamed_task__/MaxReturn 35.3136 +MetaTest/__unnamed_task__/MinReturn -31.09 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 25.3644 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 533400 +------------------------------------------------- ------------ +2025-04-03 11:08:30 | [pearl_trainer] epoch #324 | Training... +2025-04-03 11:10:07 | [pearl_trainer] epoch #324 | Evaluating... +2025-04-03 11:10:07 | [pearl_trainer] epoch #324 | Sampling for adapation and meta-testing... +2025-04-03 11:11:56 | [pearl_trainer] epoch #324 | Finished meta-testing... +2025-04-03 11:11:56 | [pearl_trainer] epoch #324 | Saving snapshot... +2025-04-03 11:11:57 | [pearl_trainer] epoch #324 | Saved +2025-04-03 11:11:57 | [pearl_trainer] epoch #324 | Time 77229.78 s +2025-04-03 11:11:57 | [pearl_trainer] epoch #324 | EpochTime 237.18 s +------------------------------------------------- ------------- +MetaTest/Average/AverageDiscountedReturn 0.483766 +MetaTest/Average/AverageReturn 0.483766 +MetaTest/Average/Iteration 324 +MetaTest/Average/MaxReturn 39.3264 +MetaTest/Average/MinReturn -18.3366 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 21.0276 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 0.483766 +MetaTest/__unnamed_task__/AverageReturn 0.483766 +MetaTest/__unnamed_task__/Iteration 324 +MetaTest/__unnamed_task__/MaxReturn 39.3264 +MetaTest/__unnamed_task__/MinReturn -18.3366 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 21.0276 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 535000 +------------------------------------------------- ------------- +2025-04-03 11:12:29 | [pearl_trainer] epoch #325 | Training... +2025-04-03 11:14:02 | [pearl_trainer] epoch #325 | Evaluating... +2025-04-03 11:14:02 | [pearl_trainer] epoch #325 | Sampling for adapation and meta-testing... +2025-04-03 11:15:57 | [pearl_trainer] epoch #325 | Finished meta-testing... +2025-04-03 11:15:57 | [pearl_trainer] epoch #325 | Saving snapshot... +2025-04-03 11:15:58 | [pearl_trainer] epoch #325 | Saved +2025-04-03 11:15:58 | [pearl_trainer] epoch #325 | Time 77471.26 s +2025-04-03 11:15:58 | [pearl_trainer] epoch #325 | EpochTime 241.49 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 17.2318 +MetaTest/Average/AverageReturn 17.2318 +MetaTest/Average/Iteration 325 +MetaTest/Average/MaxReturn 69.8738 +MetaTest/Average/MinReturn -48.3106 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 45.7089 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 17.2318 +MetaTest/__unnamed_task__/AverageReturn 17.2318 +MetaTest/__unnamed_task__/Iteration 325 +MetaTest/__unnamed_task__/MaxReturn 69.8738 +MetaTest/__unnamed_task__/MinReturn -48.3106 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 45.7089 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 536600 +------------------------------------------------- ----------- +2025-04-03 11:16:29 | [pearl_trainer] epoch #326 | Training... +2025-04-03 11:18:03 | [pearl_trainer] epoch #326 | Evaluating... +2025-04-03 11:18:03 | [pearl_trainer] epoch #326 | Sampling for adapation and meta-testing... +2025-04-03 11:19:58 | [pearl_trainer] epoch #326 | Finished meta-testing... +2025-04-03 11:19:58 | [pearl_trainer] epoch #326 | Saving snapshot... +2025-04-03 11:19:59 | [pearl_trainer] epoch #326 | Saved +2025-04-03 11:19:59 | [pearl_trainer] epoch #326 | Time 77712.21 s +2025-04-03 11:19:59 | [pearl_trainer] epoch #326 | EpochTime 240.95 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 46.7056 +MetaTest/Average/AverageReturn 46.7056 +MetaTest/Average/Iteration 326 +MetaTest/Average/MaxReturn 95.8895 +MetaTest/Average/MinReturn -17.5795 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 37.9455 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 46.7056 +MetaTest/__unnamed_task__/AverageReturn 46.7056 +MetaTest/__unnamed_task__/Iteration 326 +MetaTest/__unnamed_task__/MaxReturn 95.8895 +MetaTest/__unnamed_task__/MinReturn -17.5795 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 37.9455 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 538200 +------------------------------------------------- ----------- +2025-04-03 11:20:32 | [pearl_trainer] epoch #327 | Training... +2025-04-03 11:22:01 | [pearl_trainer] epoch #327 | Evaluating... +2025-04-03 11:22:01 | [pearl_trainer] epoch #327 | Sampling for adapation and meta-testing... +2025-04-03 11:23:55 | [pearl_trainer] epoch #327 | Finished meta-testing... +2025-04-03 11:23:55 | [pearl_trainer] epoch #327 | Saving snapshot... +2025-04-03 11:23:56 | [pearl_trainer] epoch #327 | Saved +2025-04-03 11:23:56 | [pearl_trainer] epoch #327 | Time 77949.02 s +2025-04-03 11:23:56 | [pearl_trainer] epoch #327 | EpochTime 236.81 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 61.6927 +MetaTest/Average/AverageReturn 61.6927 +MetaTest/Average/Iteration 327 +MetaTest/Average/MaxReturn 136.142 +MetaTest/Average/MinReturn -27.5571 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 55.5801 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 61.6927 +MetaTest/__unnamed_task__/AverageReturn 61.6927 +MetaTest/__unnamed_task__/Iteration 327 +MetaTest/__unnamed_task__/MaxReturn 136.142 +MetaTest/__unnamed_task__/MinReturn -27.5571 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 55.5801 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 539800 +------------------------------------------------- ----------- +2025-04-03 11:24:28 | [pearl_trainer] epoch #328 | Training... +2025-04-03 11:26:01 | [pearl_trainer] epoch #328 | Evaluating... +2025-04-03 11:26:01 | [pearl_trainer] epoch #328 | Sampling for adapation and meta-testing... +2025-04-03 11:27:54 | [pearl_trainer] epoch #328 | Finished meta-testing... +2025-04-03 11:27:54 | [pearl_trainer] epoch #328 | Saving snapshot... +2025-04-03 11:27:56 | [pearl_trainer] epoch #328 | Saved +2025-04-03 11:27:56 | [pearl_trainer] epoch #328 | Time 78188.95 s +2025-04-03 11:27:56 | [pearl_trainer] epoch #328 | EpochTime 239.93 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 3.20139 +MetaTest/Average/AverageReturn 3.20139 +MetaTest/Average/Iteration 328 +MetaTest/Average/MaxReturn 67.2456 +MetaTest/Average/MinReturn -61.2681 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 43.4907 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 3.20139 +MetaTest/__unnamed_task__/AverageReturn 3.20139 +MetaTest/__unnamed_task__/Iteration 328 +MetaTest/__unnamed_task__/MaxReturn 67.2456 +MetaTest/__unnamed_task__/MinReturn -61.2681 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 43.4907 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 541400 +------------------------------------------------- ------------ +2025-04-03 11:28:30 | [pearl_trainer] epoch #329 | Training... +2025-04-03 11:30:01 | [pearl_trainer] epoch #329 | Evaluating... +2025-04-03 11:30:01 | [pearl_trainer] epoch #329 | Sampling for adapation and meta-testing... +2025-04-03 11:32:03 | [pearl_trainer] epoch #329 | Finished meta-testing... +2025-04-03 11:32:03 | [pearl_trainer] epoch #329 | Saving snapshot... +2025-04-03 11:32:04 | [pearl_trainer] epoch #329 | Saved +2025-04-03 11:32:04 | [pearl_trainer] epoch #329 | Time 78436.96 s +2025-04-03 11:32:04 | [pearl_trainer] epoch #329 | EpochTime 248.01 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 6.04284 +MetaTest/Average/AverageReturn 6.04284 +MetaTest/Average/Iteration 329 +MetaTest/Average/MaxReturn 95.0168 +MetaTest/Average/MinReturn -67.9091 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 59.9087 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 6.04284 +MetaTest/__unnamed_task__/AverageReturn 6.04284 +MetaTest/__unnamed_task__/Iteration 329 +MetaTest/__unnamed_task__/MaxReturn 95.0168 +MetaTest/__unnamed_task__/MinReturn -67.9091 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 59.9087 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 543000 +------------------------------------------------- ------------ +2025-04-03 11:32:36 | [pearl_trainer] epoch #330 | Training... +2025-04-03 11:34:02 | [pearl_trainer] epoch #330 | Evaluating... +2025-04-03 11:34:02 | [pearl_trainer] epoch #330 | Sampling for adapation and meta-testing... +2025-04-03 11:35:55 | [pearl_trainer] epoch #330 | Finished meta-testing... +2025-04-03 11:35:55 | [pearl_trainer] epoch #330 | Saving snapshot... +2025-04-03 11:35:57 | [pearl_trainer] epoch #330 | Saved +2025-04-03 11:35:57 | [pearl_trainer] epoch #330 | Time 78669.82 s +2025-04-03 11:35:57 | [pearl_trainer] epoch #330 | EpochTime 232.86 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 13.2148 +MetaTest/Average/AverageReturn 13.2148 +MetaTest/Average/Iteration 330 +MetaTest/Average/MaxReturn 79.7733 +MetaTest/Average/MinReturn -61.858 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 60.465 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 13.2148 +MetaTest/__unnamed_task__/AverageReturn 13.2148 +MetaTest/__unnamed_task__/Iteration 330 +MetaTest/__unnamed_task__/MaxReturn 79.7733 +MetaTest/__unnamed_task__/MinReturn -61.858 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 60.465 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 544600 +------------------------------------------------- ----------- +2025-04-03 11:36:30 | [pearl_trainer] epoch #331 | Training... +2025-04-03 11:38:00 | [pearl_trainer] epoch #331 | Evaluating... +2025-04-03 11:38:00 | [pearl_trainer] epoch #331 | Sampling for adapation and meta-testing... +2025-04-03 11:39:59 | [pearl_trainer] epoch #331 | Finished meta-testing... +2025-04-03 11:39:59 | [pearl_trainer] epoch #331 | Saving snapshot... +2025-04-03 11:40:00 | [pearl_trainer] epoch #331 | Saved +2025-04-03 11:40:00 | [pearl_trainer] epoch #331 | Time 78913.51 s +2025-04-03 11:40:00 | [pearl_trainer] epoch #331 | EpochTime 243.68 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -69.5265 +MetaTest/Average/AverageReturn -69.5265 +MetaTest/Average/Iteration 331 +MetaTest/Average/MaxReturn -54.482 +MetaTest/Average/MinReturn -93.4711 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 13.0685 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -69.5265 +MetaTest/__unnamed_task__/AverageReturn -69.5265 +MetaTest/__unnamed_task__/Iteration 331 +MetaTest/__unnamed_task__/MaxReturn -54.482 +MetaTest/__unnamed_task__/MinReturn -93.4711 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 13.0685 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 546200 +------------------------------------------------- ----------- +2025-04-03 11:40:33 | [pearl_trainer] epoch #332 | Training... +2025-04-03 11:42:03 | [pearl_trainer] epoch #332 | Evaluating... +2025-04-03 11:42:03 | [pearl_trainer] epoch #332 | Sampling for adapation and meta-testing... +2025-04-03 11:44:01 | [pearl_trainer] epoch #332 | Finished meta-testing... +2025-04-03 11:44:01 | [pearl_trainer] epoch #332 | Saving snapshot... +2025-04-03 11:44:02 | [pearl_trainer] epoch #332 | Saved +2025-04-03 11:44:02 | [pearl_trainer] epoch #332 | Time 79154.74 s +2025-04-03 11:44:02 | [pearl_trainer] epoch #332 | EpochTime 241.23 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -62.5691 +MetaTest/Average/AverageReturn -62.5691 +MetaTest/Average/Iteration 332 +MetaTest/Average/MaxReturn -56.2635 +MetaTest/Average/MinReturn -76.6142 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 7.24489 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -62.5691 +MetaTest/__unnamed_task__/AverageReturn -62.5691 +MetaTest/__unnamed_task__/Iteration 332 +MetaTest/__unnamed_task__/MaxReturn -56.2635 +MetaTest/__unnamed_task__/MinReturn -76.6142 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 7.24489 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 547800 +------------------------------------------------- ------------ +2025-04-03 11:44:34 | [pearl_trainer] epoch #333 | Training... +2025-04-03 11:46:06 | [pearl_trainer] epoch #333 | Evaluating... +2025-04-03 11:46:06 | [pearl_trainer] epoch #333 | Sampling for adapation and meta-testing... +2025-04-03 11:48:00 | [pearl_trainer] epoch #333 | Finished meta-testing... +2025-04-03 11:48:00 | [pearl_trainer] epoch #333 | Saving snapshot... +2025-04-03 11:48:01 | [pearl_trainer] epoch #333 | Saved +2025-04-03 11:48:01 | [pearl_trainer] epoch #333 | Time 79394.31 s +2025-04-03 11:48:01 | [pearl_trainer] epoch #333 | EpochTime 239.57 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -44.5781 +MetaTest/Average/AverageReturn -44.5781 +MetaTest/Average/Iteration 333 +MetaTest/Average/MaxReturn -38.1441 +MetaTest/Average/MinReturn -52.637 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 5.40979 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -44.5781 +MetaTest/__unnamed_task__/AverageReturn -44.5781 +MetaTest/__unnamed_task__/Iteration 333 +MetaTest/__unnamed_task__/MaxReturn -38.1441 +MetaTest/__unnamed_task__/MinReturn -52.637 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 5.40979 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 549400 +------------------------------------------------- ------------ +2025-04-03 11:48:37 | [pearl_trainer] epoch #334 | Training... +2025-04-03 11:50:05 | [pearl_trainer] epoch #334 | Evaluating... +2025-04-03 11:50:05 | [pearl_trainer] epoch #334 | Sampling for adapation and meta-testing... +2025-04-03 11:51:58 | [pearl_trainer] epoch #334 | Finished meta-testing... +2025-04-03 11:51:58 | [pearl_trainer] epoch #334 | Saving snapshot... +2025-04-03 11:51:59 | [pearl_trainer] epoch #334 | Saved +2025-04-03 11:51:59 | [pearl_trainer] epoch #334 | Time 79632.21 s +2025-04-03 11:51:59 | [pearl_trainer] epoch #334 | EpochTime 237.90 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -33.3897 +MetaTest/Average/AverageReturn -33.3897 +MetaTest/Average/Iteration 334 +MetaTest/Average/MaxReturn -26.8218 +MetaTest/Average/MinReturn -36.3687 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 3.36034 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -33.3897 +MetaTest/__unnamed_task__/AverageReturn -33.3897 +MetaTest/__unnamed_task__/Iteration 334 +MetaTest/__unnamed_task__/MaxReturn -26.8218 +MetaTest/__unnamed_task__/MinReturn -36.3687 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 3.36034 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 551000 +------------------------------------------------- ------------ +2025-04-03 11:52:33 | [pearl_trainer] epoch #335 | Training... +2025-04-03 11:54:04 | [pearl_trainer] epoch #335 | Evaluating... +2025-04-03 11:54:04 | [pearl_trainer] epoch #335 | Sampling for adapation and meta-testing... +2025-04-03 11:55:55 | [pearl_trainer] epoch #335 | Finished meta-testing... +2025-04-03 11:55:55 | [pearl_trainer] epoch #335 | Saving snapshot... +2025-04-03 11:55:57 | [pearl_trainer] epoch #335 | Saved +2025-04-03 11:55:57 | [pearl_trainer] epoch #335 | Time 79869.76 s +2025-04-03 11:55:57 | [pearl_trainer] epoch #335 | EpochTime 237.55 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -36.9502 +MetaTest/Average/AverageReturn -36.9502 +MetaTest/Average/Iteration 335 +MetaTest/Average/MaxReturn -26.4625 +MetaTest/Average/MinReturn -63.3816 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 13.5374 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -36.9502 +MetaTest/__unnamed_task__/AverageReturn -36.9502 +MetaTest/__unnamed_task__/Iteration 335 +MetaTest/__unnamed_task__/MaxReturn -26.4625 +MetaTest/__unnamed_task__/MinReturn -63.3816 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 13.5374 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 552600 +------------------------------------------------- ----------- +2025-04-03 11:56:28 | [pearl_trainer] epoch #336 | Training... +2025-04-03 11:57:58 | [pearl_trainer] epoch #336 | Evaluating... +2025-04-03 11:57:58 | [pearl_trainer] epoch #336 | Sampling for adapation and meta-testing... +2025-04-03 11:59:59 | [pearl_trainer] epoch #336 | Finished meta-testing... +2025-04-03 11:59:59 | [pearl_trainer] epoch #336 | Saving snapshot... +2025-04-03 12:00:00 | [pearl_trainer] epoch #336 | Saved +2025-04-03 12:00:00 | [pearl_trainer] epoch #336 | Time 80112.69 s +2025-04-03 12:00:00 | [pearl_trainer] epoch #336 | EpochTime 242.92 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -30.5947 +MetaTest/Average/AverageReturn -30.5947 +MetaTest/Average/Iteration 336 +MetaTest/Average/MaxReturn 5.91545 +MetaTest/Average/MinReturn -75.7241 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 27.3534 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -30.5947 +MetaTest/__unnamed_task__/AverageReturn -30.5947 +MetaTest/__unnamed_task__/Iteration 336 +MetaTest/__unnamed_task__/MaxReturn 5.91545 +MetaTest/__unnamed_task__/MinReturn -75.7241 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 27.3534 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 554200 +------------------------------------------------- ------------ +2025-04-03 12:00:31 | [pearl_trainer] epoch #337 | Training... +2025-04-03 12:01:57 | [pearl_trainer] epoch #337 | Evaluating... +2025-04-03 12:01:57 | [pearl_trainer] epoch #337 | Sampling for adapation and meta-testing... +2025-04-03 12:03:50 | [pearl_trainer] epoch #337 | Finished meta-testing... +2025-04-03 12:03:50 | [pearl_trainer] epoch #337 | Saving snapshot... +2025-04-03 12:03:51 | [pearl_trainer] epoch #337 | Saved +2025-04-03 12:03:51 | [pearl_trainer] epoch #337 | Time 80344.51 s +2025-04-03 12:03:51 | [pearl_trainer] epoch #337 | EpochTime 231.82 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 3.37662 +MetaTest/Average/AverageReturn 3.37662 +MetaTest/Average/Iteration 337 +MetaTest/Average/MaxReturn 81.4238 +MetaTest/Average/MinReturn -38.2987 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 41.0115 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 3.37662 +MetaTest/__unnamed_task__/AverageReturn 3.37662 +MetaTest/__unnamed_task__/Iteration 337 +MetaTest/__unnamed_task__/MaxReturn 81.4238 +MetaTest/__unnamed_task__/MinReturn -38.2987 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 41.0115 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 555800 +------------------------------------------------- ------------ +2025-04-03 12:04:23 | [pearl_trainer] epoch #338 | Training... +2025-04-03 12:05:50 | [pearl_trainer] epoch #338 | Evaluating... +2025-04-03 12:05:50 | [pearl_trainer] epoch #338 | Sampling for adapation and meta-testing... +2025-04-03 12:07:39 | [pearl_trainer] epoch #338 | Finished meta-testing... +2025-04-03 12:07:39 | [pearl_trainer] epoch #338 | Saving snapshot... +2025-04-03 12:07:40 | [pearl_trainer] epoch #338 | Saved +2025-04-03 12:07:40 | [pearl_trainer] epoch #338 | Time 80572.97 s +2025-04-03 12:07:40 | [pearl_trainer] epoch #338 | EpochTime 228.46 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 32.2515 +MetaTest/Average/AverageReturn 32.2515 +MetaTest/Average/Iteration 338 +MetaTest/Average/MaxReturn 76.6406 +MetaTest/Average/MinReturn -39.843 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 43.8435 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 32.2515 +MetaTest/__unnamed_task__/AverageReturn 32.2515 +MetaTest/__unnamed_task__/Iteration 338 +MetaTest/__unnamed_task__/MaxReturn 76.6406 +MetaTest/__unnamed_task__/MinReturn -39.843 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 43.8435 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 557400 +------------------------------------------------- ----------- +2025-04-03 12:08:12 | [pearl_trainer] epoch #339 | Training... +2025-04-03 12:09:39 | [pearl_trainer] epoch #339 | Evaluating... +2025-04-03 12:09:39 | [pearl_trainer] epoch #339 | Sampling for adapation and meta-testing... +2025-04-03 12:11:30 | [pearl_trainer] epoch #339 | Finished meta-testing... +2025-04-03 12:11:30 | [pearl_trainer] epoch #339 | Saving snapshot... +2025-04-03 12:11:31 | [pearl_trainer] epoch #339 | Saved +2025-04-03 12:11:31 | [pearl_trainer] epoch #339 | Time 80804.38 s +2025-04-03 12:11:31 | [pearl_trainer] epoch #339 | EpochTime 231.41 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 36.7318 +MetaTest/Average/AverageReturn 36.7318 +MetaTest/Average/Iteration 339 +MetaTest/Average/MaxReturn 89.3049 +MetaTest/Average/MinReturn -17.2443 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 44.6939 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 36.7318 +MetaTest/__unnamed_task__/AverageReturn 36.7318 +MetaTest/__unnamed_task__/Iteration 339 +MetaTest/__unnamed_task__/MaxReturn 89.3049 +MetaTest/__unnamed_task__/MinReturn -17.2443 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 44.6939 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 559000 +------------------------------------------------- ----------- +2025-04-03 12:12:02 | [pearl_trainer] epoch #340 | Training... +2025-04-03 12:13:26 | [pearl_trainer] epoch #340 | Evaluating... +2025-04-03 12:13:26 | [pearl_trainer] epoch #340 | Sampling for adapation and meta-testing... +2025-04-03 12:15:18 | [pearl_trainer] epoch #340 | Finished meta-testing... +2025-04-03 12:15:18 | [pearl_trainer] epoch #340 | Saving snapshot... +2025-04-03 12:15:19 | [pearl_trainer] epoch #340 | Saved +2025-04-03 12:15:19 | [pearl_trainer] epoch #340 | Time 81032.05 s +2025-04-03 12:15:19 | [pearl_trainer] epoch #340 | EpochTime 227.67 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 14.4478 +MetaTest/Average/AverageReturn 14.4478 +MetaTest/Average/Iteration 340 +MetaTest/Average/MaxReturn 70.458 +MetaTest/Average/MinReturn -13.8067 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 34.3707 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 14.4478 +MetaTest/__unnamed_task__/AverageReturn 14.4478 +MetaTest/__unnamed_task__/Iteration 340 +MetaTest/__unnamed_task__/MaxReturn 70.458 +MetaTest/__unnamed_task__/MinReturn -13.8067 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 34.3707 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 560600 +------------------------------------------------- ----------- +2025-04-03 12:15:50 | [pearl_trainer] epoch #341 | Training... +2025-04-03 12:17:17 | [pearl_trainer] epoch #341 | Evaluating... +2025-04-03 12:17:17 | [pearl_trainer] epoch #341 | Sampling for adapation and meta-testing... +2025-04-03 12:19:09 | [pearl_trainer] epoch #341 | Finished meta-testing... +2025-04-03 12:19:09 | [pearl_trainer] epoch #341 | Saving snapshot... +2025-04-03 12:19:10 | [pearl_trainer] epoch #341 | Saved +2025-04-03 12:19:10 | [pearl_trainer] epoch #341 | Time 81262.88 s +2025-04-03 12:19:10 | [pearl_trainer] epoch #341 | EpochTime 230.83 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -10.665 +MetaTest/Average/AverageReturn -10.665 +MetaTest/Average/Iteration 341 +MetaTest/Average/MaxReturn 5.5117 +MetaTest/Average/MinReturn -19.4067 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 9.06182 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -10.665 +MetaTest/__unnamed_task__/AverageReturn -10.665 +MetaTest/__unnamed_task__/Iteration 341 +MetaTest/__unnamed_task__/MaxReturn 5.5117 +MetaTest/__unnamed_task__/MinReturn -19.4067 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 9.06182 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 562200 +------------------------------------------------- ------------ +2025-04-03 12:19:41 | [pearl_trainer] epoch #342 | Training... +2025-04-03 12:21:04 | [pearl_trainer] epoch #342 | Evaluating... +2025-04-03 12:21:04 | [pearl_trainer] epoch #342 | Sampling for adapation and meta-testing... +2025-04-03 12:22:56 | [pearl_trainer] epoch #342 | Finished meta-testing... +2025-04-03 12:22:56 | [pearl_trainer] epoch #342 | Saving snapshot... +2025-04-03 12:22:57 | [pearl_trainer] epoch #342 | Saved +2025-04-03 12:22:57 | [pearl_trainer] epoch #342 | Time 81490.30 s +2025-04-03 12:22:57 | [pearl_trainer] epoch #342 | EpochTime 227.42 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 31.1601 +MetaTest/Average/AverageReturn 31.1601 +MetaTest/Average/Iteration 342 +MetaTest/Average/MaxReturn 151.311 +MetaTest/Average/MinReturn -24.5915 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 63.7762 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 31.1601 +MetaTest/__unnamed_task__/AverageReturn 31.1601 +MetaTest/__unnamed_task__/Iteration 342 +MetaTest/__unnamed_task__/MaxReturn 151.311 +MetaTest/__unnamed_task__/MinReturn -24.5915 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 63.7762 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 563800 +------------------------------------------------- ----------- +2025-04-03 12:23:29 | [pearl_trainer] epoch #343 | Training... +2025-04-03 12:24:54 | [pearl_trainer] epoch #343 | Evaluating... +2025-04-03 12:24:54 | [pearl_trainer] epoch #343 | Sampling for adapation and meta-testing... +2025-04-03 12:26:46 | [pearl_trainer] epoch #343 | Finished meta-testing... +2025-04-03 12:26:46 | [pearl_trainer] epoch #343 | Saving snapshot... +2025-04-03 12:26:47 | [pearl_trainer] epoch #343 | Saved +2025-04-03 12:26:47 | [pearl_trainer] epoch #343 | Time 81719.96 s +2025-04-03 12:26:47 | [pearl_trainer] epoch #343 | EpochTime 229.65 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 47.1894 +MetaTest/Average/AverageReturn 47.1894 +MetaTest/Average/Iteration 343 +MetaTest/Average/MaxReturn 96.7582 +MetaTest/Average/MinReturn -4.84373 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 37.4397 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 47.1894 +MetaTest/__unnamed_task__/AverageReturn 47.1894 +MetaTest/__unnamed_task__/Iteration 343 +MetaTest/__unnamed_task__/MaxReturn 96.7582 +MetaTest/__unnamed_task__/MinReturn -4.84373 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 37.4397 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 565400 +------------------------------------------------- ------------ +2025-04-03 12:27:19 | [pearl_trainer] epoch #344 | Training... +2025-04-03 12:28:51 | [pearl_trainer] epoch #344 | Evaluating... +2025-04-03 12:28:51 | [pearl_trainer] epoch #344 | Sampling for adapation and meta-testing... +2025-04-03 12:30:45 | [pearl_trainer] epoch #344 | Finished meta-testing... +2025-04-03 12:30:45 | [pearl_trainer] epoch #344 | Saving snapshot... +2025-04-03 12:30:46 | [pearl_trainer] epoch #344 | Saved +2025-04-03 12:30:46 | [pearl_trainer] epoch #344 | Time 81959.08 s +2025-04-03 12:30:46 | [pearl_trainer] epoch #344 | EpochTime 239.13 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 13.8492 +MetaTest/Average/AverageReturn 13.8492 +MetaTest/Average/Iteration 344 +MetaTest/Average/MaxReturn 57.1646 +MetaTest/Average/MinReturn -32.1495 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 35.0616 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 13.8492 +MetaTest/__unnamed_task__/AverageReturn 13.8492 +MetaTest/__unnamed_task__/Iteration 344 +MetaTest/__unnamed_task__/MaxReturn 57.1646 +MetaTest/__unnamed_task__/MinReturn -32.1495 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 35.0616 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 567000 +------------------------------------------------- ----------- +2025-04-03 12:31:17 | [pearl_trainer] epoch #345 | Training... +2025-04-03 12:32:41 | [pearl_trainer] epoch #345 | Evaluating... +2025-04-03 12:32:41 | [pearl_trainer] epoch #345 | Sampling for adapation and meta-testing... +2025-04-03 12:34:31 | [pearl_trainer] epoch #345 | Finished meta-testing... +2025-04-03 12:34:31 | [pearl_trainer] epoch #345 | Saving snapshot... +2025-04-03 12:34:33 | [pearl_trainer] epoch #345 | Saved +2025-04-03 12:34:33 | [pearl_trainer] epoch #345 | Time 82185.81 s +2025-04-03 12:34:33 | [pearl_trainer] epoch #345 | EpochTime 226.72 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 45.6131 +MetaTest/Average/AverageReturn 45.6131 +MetaTest/Average/Iteration 345 +MetaTest/Average/MaxReturn 116.577 +MetaTest/Average/MinReturn -19.754 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 46.794 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 45.6131 +MetaTest/__unnamed_task__/AverageReturn 45.6131 +MetaTest/__unnamed_task__/Iteration 345 +MetaTest/__unnamed_task__/MaxReturn 116.577 +MetaTest/__unnamed_task__/MinReturn -19.754 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 46.794 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 568600 +------------------------------------------------- ----------- +2025-04-03 12:35:04 | [pearl_trainer] epoch #346 | Training... +2025-04-03 12:36:33 | [pearl_trainer] epoch #346 | Evaluating... +2025-04-03 12:36:33 | [pearl_trainer] epoch #346 | Sampling for adapation and meta-testing... +2025-04-03 12:38:21 | [pearl_trainer] epoch #346 | Finished meta-testing... +2025-04-03 12:38:21 | [pearl_trainer] epoch #346 | Saving snapshot... +2025-04-03 12:38:22 | [pearl_trainer] epoch #346 | Saved +2025-04-03 12:38:22 | [pearl_trainer] epoch #346 | Time 82415.22 s +2025-04-03 12:38:22 | [pearl_trainer] epoch #346 | EpochTime 229.41 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 34.898 +MetaTest/Average/AverageReturn 34.898 +MetaTest/Average/Iteration 346 +MetaTest/Average/MaxReturn 155.727 +MetaTest/Average/MinReturn -20.8674 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 63.7375 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 34.898 +MetaTest/__unnamed_task__/AverageReturn 34.898 +MetaTest/__unnamed_task__/Iteration 346 +MetaTest/__unnamed_task__/MaxReturn 155.727 +MetaTest/__unnamed_task__/MinReturn -20.8674 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 63.7375 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 570200 +------------------------------------------------- ----------- +2025-04-03 12:38:53 | [pearl_trainer] epoch #347 | Training... +2025-04-03 12:40:19 | [pearl_trainer] epoch #347 | Evaluating... +2025-04-03 12:40:19 | [pearl_trainer] epoch #347 | Sampling for adapation and meta-testing... +2025-04-03 12:42:09 | [pearl_trainer] epoch #347 | Finished meta-testing... +2025-04-03 12:42:09 | [pearl_trainer] epoch #347 | Saving snapshot... +2025-04-03 12:42:10 | [pearl_trainer] epoch #347 | Saved +2025-04-03 12:42:10 | [pearl_trainer] epoch #347 | Time 82642.57 s +2025-04-03 12:42:10 | [pearl_trainer] epoch #347 | EpochTime 227.34 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 26.1619 +MetaTest/Average/AverageReturn 26.1619 +MetaTest/Average/Iteration 347 +MetaTest/Average/MaxReturn 110.932 +MetaTest/Average/MinReturn -22.702 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 55.7951 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 26.1619 +MetaTest/__unnamed_task__/AverageReturn 26.1619 +MetaTest/__unnamed_task__/Iteration 347 +MetaTest/__unnamed_task__/MaxReturn 110.932 +MetaTest/__unnamed_task__/MinReturn -22.702 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 55.7951 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 571800 +------------------------------------------------- ----------- +2025-04-03 12:42:40 | [pearl_trainer] epoch #348 | Training... +2025-04-03 12:44:07 | [pearl_trainer] epoch #348 | Evaluating... +2025-04-03 12:44:07 | [pearl_trainer] epoch #348 | Sampling for adapation and meta-testing... +2025-04-03 12:45:58 | [pearl_trainer] epoch #348 | Finished meta-testing... +2025-04-03 12:45:58 | [pearl_trainer] epoch #348 | Saving snapshot... +2025-04-03 12:45:59 | [pearl_trainer] epoch #348 | Saved +2025-04-03 12:45:59 | [pearl_trainer] epoch #348 | Time 82872.37 s +2025-04-03 12:45:59 | [pearl_trainer] epoch #348 | EpochTime 229.80 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -4.18437 +MetaTest/Average/AverageReturn -4.18437 +MetaTest/Average/Iteration 348 +MetaTest/Average/MaxReturn 28.5472 +MetaTest/Average/MinReturn -26.7155 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 20.0141 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -4.18437 +MetaTest/__unnamed_task__/AverageReturn -4.18437 +MetaTest/__unnamed_task__/Iteration 348 +MetaTest/__unnamed_task__/MaxReturn 28.5472 +MetaTest/__unnamed_task__/MinReturn -26.7155 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 20.0141 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 573400 +------------------------------------------------- ------------ +2025-04-03 12:46:31 | [pearl_trainer] epoch #349 | Training... +2025-04-03 12:47:57 | [pearl_trainer] epoch #349 | Evaluating... +2025-04-03 12:47:57 | [pearl_trainer] epoch #349 | Sampling for adapation and meta-testing... +2025-04-03 12:49:47 | [pearl_trainer] epoch #349 | Finished meta-testing... +2025-04-03 12:49:47 | [pearl_trainer] epoch #349 | Saving snapshot... +2025-04-03 12:49:48 | [pearl_trainer] epoch #349 | Saved +2025-04-03 12:49:48 | [pearl_trainer] epoch #349 | Time 83100.82 s +2025-04-03 12:49:48 | [pearl_trainer] epoch #349 | EpochTime 228.45 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 9.74776 +MetaTest/Average/AverageReturn 9.74776 +MetaTest/Average/Iteration 349 +MetaTest/Average/MaxReturn 91.3356 +MetaTest/Average/MinReturn -14.3409 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 40.907 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 9.74776 +MetaTest/__unnamed_task__/AverageReturn 9.74776 +MetaTest/__unnamed_task__/Iteration 349 +MetaTest/__unnamed_task__/MaxReturn 91.3356 +MetaTest/__unnamed_task__/MinReturn -14.3409 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 40.907 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 575000 +------------------------------------------------- ------------ +2025-04-03 12:50:19 | [pearl_trainer] epoch #350 | Training... +2025-04-03 12:51:44 | [pearl_trainer] epoch #350 | Evaluating... +2025-04-03 12:51:44 | [pearl_trainer] epoch #350 | Sampling for adapation and meta-testing... +2025-04-03 12:53:35 | [pearl_trainer] epoch #350 | Finished meta-testing... +2025-04-03 12:53:35 | [pearl_trainer] epoch #350 | Saving snapshot... +2025-04-03 12:53:37 | [pearl_trainer] epoch #350 | Saved +2025-04-03 12:53:37 | [pearl_trainer] epoch #350 | Time 83329.79 s +2025-04-03 12:53:37 | [pearl_trainer] epoch #350 | EpochTime 228.97 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 10.2583 +MetaTest/Average/AverageReturn 10.2583 +MetaTest/Average/Iteration 350 +MetaTest/Average/MaxReturn 66.0698 +MetaTest/Average/MinReturn -19.8861 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 30.1243 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 10.2583 +MetaTest/__unnamed_task__/AverageReturn 10.2583 +MetaTest/__unnamed_task__/Iteration 350 +MetaTest/__unnamed_task__/MaxReturn 66.0698 +MetaTest/__unnamed_task__/MinReturn -19.8861 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 30.1243 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 576600 +------------------------------------------------- ----------- +2025-04-03 12:54:08 | [pearl_trainer] epoch #351 | Training... +2025-04-03 12:55:37 | [pearl_trainer] epoch #351 | Evaluating... +2025-04-03 12:55:37 | [pearl_trainer] epoch #351 | Sampling for adapation and meta-testing... +2025-04-03 12:57:32 | [pearl_trainer] epoch #351 | Finished meta-testing... +2025-04-03 12:57:32 | [pearl_trainer] epoch #351 | Saving snapshot... +2025-04-03 12:57:33 | [pearl_trainer] epoch #351 | Saved +2025-04-03 12:57:33 | [pearl_trainer] epoch #351 | Time 83565.60 s +2025-04-03 12:57:33 | [pearl_trainer] epoch #351 | EpochTime 235.81 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 7.17942 +MetaTest/Average/AverageReturn 7.17942 +MetaTest/Average/Iteration 351 +MetaTest/Average/MaxReturn 49.7013 +MetaTest/Average/MinReturn -22.6876 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 30.4086 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 7.17942 +MetaTest/__unnamed_task__/AverageReturn 7.17942 +MetaTest/__unnamed_task__/Iteration 351 +MetaTest/__unnamed_task__/MaxReturn 49.7013 +MetaTest/__unnamed_task__/MinReturn -22.6876 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 30.4086 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 578200 +------------------------------------------------- ------------ +2025-04-03 12:58:06 | [pearl_trainer] epoch #352 | Training... +2025-04-03 12:59:30 | [pearl_trainer] epoch #352 | Evaluating... +2025-04-03 12:59:30 | [pearl_trainer] epoch #352 | Sampling for adapation and meta-testing... +2025-04-03 13:01:23 | [pearl_trainer] epoch #352 | Finished meta-testing... +2025-04-03 13:01:23 | [pearl_trainer] epoch #352 | Saving snapshot... +2025-04-03 13:01:24 | [pearl_trainer] epoch #352 | Saved +2025-04-03 13:01:24 | [pearl_trainer] epoch #352 | Time 83797.41 s +2025-04-03 13:01:24 | [pearl_trainer] epoch #352 | EpochTime 231.80 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 25.961 +MetaTest/Average/AverageReturn 25.961 +MetaTest/Average/Iteration 352 +MetaTest/Average/MaxReturn 127.237 +MetaTest/Average/MinReturn -21.389 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 56.3514 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 25.961 +MetaTest/__unnamed_task__/AverageReturn 25.961 +MetaTest/__unnamed_task__/Iteration 352 +MetaTest/__unnamed_task__/MaxReturn 127.237 +MetaTest/__unnamed_task__/MinReturn -21.389 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 56.3514 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 579800 +------------------------------------------------- ----------- +2025-04-03 13:01:56 | [pearl_trainer] epoch #353 | Training... +2025-04-03 13:03:20 | [pearl_trainer] epoch #353 | Evaluating... +2025-04-03 13:03:20 | [pearl_trainer] epoch #353 | Sampling for adapation and meta-testing... +2025-04-03 13:05:12 | [pearl_trainer] epoch #353 | Finished meta-testing... +2025-04-03 13:05:12 | [pearl_trainer] epoch #353 | Saving snapshot... +2025-04-03 13:05:13 | [pearl_trainer] epoch #353 | Saved +2025-04-03 13:05:13 | [pearl_trainer] epoch #353 | Time 84026.38 s +2025-04-03 13:05:13 | [pearl_trainer] epoch #353 | EpochTime 228.97 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -6.51604 +MetaTest/Average/AverageReturn -6.51604 +MetaTest/Average/Iteration 353 +MetaTest/Average/MaxReturn 53.1794 +MetaTest/Average/MinReturn -38.0898 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 31.1627 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -6.51604 +MetaTest/__unnamed_task__/AverageReturn -6.51604 +MetaTest/__unnamed_task__/Iteration 353 +MetaTest/__unnamed_task__/MaxReturn 53.1794 +MetaTest/__unnamed_task__/MinReturn -38.0898 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 31.1627 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 581400 +------------------------------------------------- ------------ +2025-04-03 13:05:44 | [pearl_trainer] epoch #354 | Training... +2025-04-03 13:07:11 | [pearl_trainer] epoch #354 | Evaluating... +2025-04-03 13:07:11 | [pearl_trainer] epoch #354 | Sampling for adapation and meta-testing... +2025-04-03 13:09:02 | [pearl_trainer] epoch #354 | Finished meta-testing... +2025-04-03 13:09:02 | [pearl_trainer] epoch #354 | Saving snapshot... +2025-04-03 13:09:03 | [pearl_trainer] epoch #354 | Saved +2025-04-03 13:09:03 | [pearl_trainer] epoch #354 | Time 84255.75 s +2025-04-03 13:09:03 | [pearl_trainer] epoch #354 | EpochTime 229.37 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 18.7191 +MetaTest/Average/AverageReturn 18.7191 +MetaTest/Average/Iteration 354 +MetaTest/Average/MaxReturn 86.2152 +MetaTest/Average/MinReturn -18.2246 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 39.4477 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 18.7191 +MetaTest/__unnamed_task__/AverageReturn 18.7191 +MetaTest/__unnamed_task__/Iteration 354 +MetaTest/__unnamed_task__/MaxReturn 86.2152 +MetaTest/__unnamed_task__/MinReturn -18.2246 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 39.4477 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 583000 +------------------------------------------------- ----------- +2025-04-03 13:09:34 | [pearl_trainer] epoch #355 | Training... +2025-04-03 13:10:57 | [pearl_trainer] epoch #355 | Evaluating... +2025-04-03 13:10:57 | [pearl_trainer] epoch #355 | Sampling for adapation and meta-testing... +2025-04-03 13:12:48 | [pearl_trainer] epoch #355 | Finished meta-testing... +2025-04-03 13:12:48 | [pearl_trainer] epoch #355 | Saving snapshot... +2025-04-03 13:12:50 | [pearl_trainer] epoch #355 | Saved +2025-04-03 13:12:50 | [pearl_trainer] epoch #355 | Time 84482.75 s +2025-04-03 13:12:50 | [pearl_trainer] epoch #355 | EpochTime 227.00 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 4.52683 +MetaTest/Average/AverageReturn 4.52683 +MetaTest/Average/Iteration 355 +MetaTest/Average/MaxReturn 50.4704 +MetaTest/Average/MinReturn -25.5829 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 24.9597 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 4.52683 +MetaTest/__unnamed_task__/AverageReturn 4.52683 +MetaTest/__unnamed_task__/Iteration 355 +MetaTest/__unnamed_task__/MaxReturn 50.4704 +MetaTest/__unnamed_task__/MinReturn -25.5829 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 24.9597 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 584600 +------------------------------------------------- ------------ +2025-04-03 13:13:21 | [pearl_trainer] epoch #356 | Training... +2025-04-03 13:14:55 | [pearl_trainer] epoch #356 | Evaluating... +2025-04-03 13:14:55 | [pearl_trainer] epoch #356 | Sampling for adapation and meta-testing... +2025-04-03 13:16:44 | [pearl_trainer] epoch #356 | Finished meta-testing... +2025-04-03 13:16:44 | [pearl_trainer] epoch #356 | Saving snapshot... +2025-04-03 13:16:45 | [pearl_trainer] epoch #356 | Saved +2025-04-03 13:16:45 | [pearl_trainer] epoch #356 | Time 84717.65 s +2025-04-03 13:16:45 | [pearl_trainer] epoch #356 | EpochTime 234.90 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 67.92 +MetaTest/Average/AverageReturn 67.92 +MetaTest/Average/Iteration 356 +MetaTest/Average/MaxReturn 105.532 +MetaTest/Average/MinReturn -5.00181 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 40.893 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 67.92 +MetaTest/__unnamed_task__/AverageReturn 67.92 +MetaTest/__unnamed_task__/Iteration 356 +MetaTest/__unnamed_task__/MaxReturn 105.532 +MetaTest/__unnamed_task__/MinReturn -5.00181 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 40.893 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 586200 +------------------------------------------------- ------------ +2025-04-03 13:17:16 | [pearl_trainer] epoch #357 | Training... +2025-04-03 13:18:41 | [pearl_trainer] epoch #357 | Evaluating... +2025-04-03 13:18:41 | [pearl_trainer] epoch #357 | Sampling for adapation and meta-testing... +2025-04-03 13:20:31 | [pearl_trainer] epoch #357 | Finished meta-testing... +2025-04-03 13:20:31 | [pearl_trainer] epoch #357 | Saving snapshot... +2025-04-03 13:20:32 | [pearl_trainer] epoch #357 | Saved +2025-04-03 13:20:32 | [pearl_trainer] epoch #357 | Time 84944.75 s +2025-04-03 13:20:32 | [pearl_trainer] epoch #357 | EpochTime 227.10 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 35.8811 +MetaTest/Average/AverageReturn 35.8811 +MetaTest/Average/Iteration 357 +MetaTest/Average/MaxReturn 156.349 +MetaTest/Average/MinReturn -23.1408 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 67.7169 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 35.8811 +MetaTest/__unnamed_task__/AverageReturn 35.8811 +MetaTest/__unnamed_task__/Iteration 357 +MetaTest/__unnamed_task__/MaxReturn 156.349 +MetaTest/__unnamed_task__/MinReturn -23.1408 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 67.7169 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 587800 +------------------------------------------------- ----------- +2025-04-03 13:21:03 | [pearl_trainer] epoch #358 | Training... +2025-04-03 13:22:26 | [pearl_trainer] epoch #358 | Evaluating... +2025-04-03 13:22:26 | [pearl_trainer] epoch #358 | Sampling for adapation and meta-testing... +2025-04-03 13:24:16 | [pearl_trainer] epoch #358 | Finished meta-testing... +2025-04-03 13:24:16 | [pearl_trainer] epoch #358 | Saving snapshot... +2025-04-03 13:24:18 | [pearl_trainer] epoch #358 | Saved +2025-04-03 13:24:18 | [pearl_trainer] epoch #358 | Time 85170.80 s +2025-04-03 13:24:18 | [pearl_trainer] epoch #358 | EpochTime 226.05 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -5.38751 +MetaTest/Average/AverageReturn -5.38751 +MetaTest/Average/Iteration 358 +MetaTest/Average/MaxReturn 36.8922 +MetaTest/Average/MinReturn -22.4944 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 22.9746 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -5.38751 +MetaTest/__unnamed_task__/AverageReturn -5.38751 +MetaTest/__unnamed_task__/Iteration 358 +MetaTest/__unnamed_task__/MaxReturn 36.8922 +MetaTest/__unnamed_task__/MinReturn -22.4944 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 22.9746 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 589400 +------------------------------------------------- ------------ +2025-04-03 13:24:49 | [pearl_trainer] epoch #359 | Training... +2025-04-03 13:26:16 | [pearl_trainer] epoch #359 | Evaluating... +2025-04-03 13:26:16 | [pearl_trainer] epoch #359 | Sampling for adapation and meta-testing... +2025-04-03 13:28:14 | [pearl_trainer] epoch #359 | Finished meta-testing... +2025-04-03 13:28:14 | [pearl_trainer] epoch #359 | Saving snapshot... +2025-04-03 13:28:15 | [pearl_trainer] epoch #359 | Saved +2025-04-03 13:28:15 | [pearl_trainer] epoch #359 | Time 85408.39 s +2025-04-03 13:28:15 | [pearl_trainer] epoch #359 | EpochTime 237.59 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 33.5737 +MetaTest/Average/AverageReturn 33.5737 +MetaTest/Average/Iteration 359 +MetaTest/Average/MaxReturn 120.907 +MetaTest/Average/MinReturn -36.1212 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 56.528 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 33.5737 +MetaTest/__unnamed_task__/AverageReturn 33.5737 +MetaTest/__unnamed_task__/Iteration 359 +MetaTest/__unnamed_task__/MaxReturn 120.907 +MetaTest/__unnamed_task__/MinReturn -36.1212 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 56.528 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 591000 +------------------------------------------------- ----------- +2025-04-03 13:28:47 | [pearl_trainer] epoch #360 | Training... +2025-04-03 13:30:22 | [pearl_trainer] epoch #360 | Evaluating... +2025-04-03 13:30:22 | [pearl_trainer] epoch #360 | Sampling for adapation and meta-testing... +2025-04-03 13:32:11 | [pearl_trainer] epoch #360 | Finished meta-testing... +2025-04-03 13:32:11 | [pearl_trainer] epoch #360 | Saving snapshot... +2025-04-03 13:32:12 | [pearl_trainer] epoch #360 | Saved +2025-04-03 13:32:12 | [pearl_trainer] epoch #360 | Time 85645.47 s +2025-04-03 13:32:12 | [pearl_trainer] epoch #360 | EpochTime 237.08 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -8.25884 +MetaTest/Average/AverageReturn -8.25884 +MetaTest/Average/Iteration 360 +MetaTest/Average/MaxReturn 5.2807 +MetaTest/Average/MinReturn -17.3014 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 9.44761 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -8.25884 +MetaTest/__unnamed_task__/AverageReturn -8.25884 +MetaTest/__unnamed_task__/Iteration 360 +MetaTest/__unnamed_task__/MaxReturn 5.2807 +MetaTest/__unnamed_task__/MinReturn -17.3014 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 9.44761 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 592600 +------------------------------------------------- ------------ +2025-04-03 13:32:44 | [pearl_trainer] epoch #361 | Training... +2025-04-03 13:34:15 | [pearl_trainer] epoch #361 | Evaluating... +2025-04-03 13:34:15 | [pearl_trainer] epoch #361 | Sampling for adapation and meta-testing... +2025-04-03 13:36:06 | [pearl_trainer] epoch #361 | Finished meta-testing... +2025-04-03 13:36:06 | [pearl_trainer] epoch #361 | Saving snapshot... +2025-04-03 13:36:07 | [pearl_trainer] epoch #361 | Saved +2025-04-03 13:36:07 | [pearl_trainer] epoch #361 | Time 85879.76 s +2025-04-03 13:36:07 | [pearl_trainer] epoch #361 | EpochTime 234.29 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 23.4378 +MetaTest/Average/AverageReturn 23.4378 +MetaTest/Average/Iteration 361 +MetaTest/Average/MaxReturn 97.5489 +MetaTest/Average/MinReturn -28.6676 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 40.9544 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 23.4378 +MetaTest/__unnamed_task__/AverageReturn 23.4378 +MetaTest/__unnamed_task__/Iteration 361 +MetaTest/__unnamed_task__/MaxReturn 97.5489 +MetaTest/__unnamed_task__/MinReturn -28.6676 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 40.9544 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 594200 +------------------------------------------------- ----------- +2025-04-03 13:36:39 | [pearl_trainer] epoch #362 | Training... +2025-04-03 13:38:00 | [pearl_trainer] epoch #362 | Evaluating... +2025-04-03 13:38:00 | [pearl_trainer] epoch #362 | Sampling for adapation and meta-testing... +2025-04-03 13:39:50 | [pearl_trainer] epoch #362 | Finished meta-testing... +2025-04-03 13:39:50 | [pearl_trainer] epoch #362 | Saving snapshot... +2025-04-03 13:39:51 | [pearl_trainer] epoch #362 | Saved +2025-04-03 13:39:51 | [pearl_trainer] epoch #362 | Time 86104.32 s +2025-04-03 13:39:51 | [pearl_trainer] epoch #362 | EpochTime 224.55 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -38.1122 +MetaTest/Average/AverageReturn -38.1122 +MetaTest/Average/Iteration 362 +MetaTest/Average/MaxReturn 73.0809 +MetaTest/Average/MinReturn -159.871 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 89.1064 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -38.1122 +MetaTest/__unnamed_task__/AverageReturn -38.1122 +MetaTest/__unnamed_task__/Iteration 362 +MetaTest/__unnamed_task__/MaxReturn 73.0809 +MetaTest/__unnamed_task__/MinReturn -159.871 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 89.1064 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 595800 +------------------------------------------------- ----------- +2025-04-03 13:40:23 | [pearl_trainer] epoch #363 | Training... +2025-04-03 13:41:50 | [pearl_trainer] epoch #363 | Evaluating... +2025-04-03 13:41:50 | [pearl_trainer] epoch #363 | Sampling for adapation and meta-testing... +2025-04-03 13:43:42 | [pearl_trainer] epoch #363 | Finished meta-testing... +2025-04-03 13:43:42 | [pearl_trainer] epoch #363 | Saving snapshot... +2025-04-03 13:43:43 | [pearl_trainer] epoch #363 | Saved +2025-04-03 13:43:43 | [pearl_trainer] epoch #363 | Time 86336.30 s +2025-04-03 13:43:43 | [pearl_trainer] epoch #363 | EpochTime 231.98 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -43.3148 +MetaTest/Average/AverageReturn -43.3148 +MetaTest/Average/Iteration 363 +MetaTest/Average/MaxReturn 156.95 +MetaTest/Average/MinReturn -139.429 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 109.632 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -43.3148 +MetaTest/__unnamed_task__/AverageReturn -43.3148 +MetaTest/__unnamed_task__/Iteration 363 +MetaTest/__unnamed_task__/MaxReturn 156.95 +MetaTest/__unnamed_task__/MinReturn -139.429 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 109.632 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 597400 +------------------------------------------------- ----------- +2025-04-03 13:44:15 | [pearl_trainer] epoch #364 | Training... +2025-04-03 13:45:39 | [pearl_trainer] epoch #364 | Evaluating... +2025-04-03 13:45:39 | [pearl_trainer] epoch #364 | Sampling for adapation and meta-testing... +2025-04-03 13:47:26 | [pearl_trainer] epoch #364 | Finished meta-testing... +2025-04-03 13:47:26 | [pearl_trainer] epoch #364 | Saving snapshot... +2025-04-03 13:47:28 | [pearl_trainer] epoch #364 | Saved +2025-04-03 13:47:28 | [pearl_trainer] epoch #364 | Time 86560.56 s +2025-04-03 13:47:28 | [pearl_trainer] epoch #364 | EpochTime 224.26 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -18.5848 +MetaTest/Average/AverageReturn -18.5848 +MetaTest/Average/Iteration 364 +MetaTest/Average/MaxReturn 96.33 +MetaTest/Average/MinReturn -108.308 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 65.9458 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -18.5848 +MetaTest/__unnamed_task__/AverageReturn -18.5848 +MetaTest/__unnamed_task__/Iteration 364 +MetaTest/__unnamed_task__/MaxReturn 96.33 +MetaTest/__unnamed_task__/MinReturn -108.308 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 65.9458 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 599000 +------------------------------------------------- ----------- +2025-04-03 13:48:00 | [pearl_trainer] epoch #365 | Training... +2025-04-03 13:49:26 | [pearl_trainer] epoch #365 | Evaluating... +2025-04-03 13:49:26 | [pearl_trainer] epoch #365 | Sampling for adapation and meta-testing... +2025-04-03 13:51:17 | [pearl_trainer] epoch #365 | Finished meta-testing... +2025-04-03 13:51:17 | [pearl_trainer] epoch #365 | Saving snapshot... +2025-04-03 13:51:18 | [pearl_trainer] epoch #365 | Saved +2025-04-03 13:51:18 | [pearl_trainer] epoch #365 | Time 86791.02 s +2025-04-03 13:51:18 | [pearl_trainer] epoch #365 | EpochTime 230.46 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 4.71861 +MetaTest/Average/AverageReturn 4.71861 +MetaTest/Average/Iteration 365 +MetaTest/Average/MaxReturn 73.3455 +MetaTest/Average/MinReturn -25.4644 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 38.2551 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 4.71861 +MetaTest/__unnamed_task__/AverageReturn 4.71861 +MetaTest/__unnamed_task__/Iteration 365 +MetaTest/__unnamed_task__/MaxReturn 73.3455 +MetaTest/__unnamed_task__/MinReturn -25.4644 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 38.2551 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 600600 +------------------------------------------------- ------------ +2025-04-03 13:51:50 | [pearl_trainer] epoch #366 | Training... +2025-04-03 13:53:14 | [pearl_trainer] epoch #366 | Evaluating... +2025-04-03 13:53:14 | [pearl_trainer] epoch #366 | Sampling for adapation and meta-testing... +2025-04-03 13:55:05 | [pearl_trainer] epoch #366 | Finished meta-testing... +2025-04-03 13:55:05 | [pearl_trainer] epoch #366 | Saving snapshot... +2025-04-03 13:55:06 | [pearl_trainer] epoch #366 | Saved +2025-04-03 13:55:06 | [pearl_trainer] epoch #366 | Time 87019.40 s +2025-04-03 13:55:06 | [pearl_trainer] epoch #366 | EpochTime 228.37 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 16.5848 +MetaTest/Average/AverageReturn 16.5848 +MetaTest/Average/Iteration 366 +MetaTest/Average/MaxReturn 80.3898 +MetaTest/Average/MinReturn -33.1223 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 47.1976 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 16.5848 +MetaTest/__unnamed_task__/AverageReturn 16.5848 +MetaTest/__unnamed_task__/Iteration 366 +MetaTest/__unnamed_task__/MaxReturn 80.3898 +MetaTest/__unnamed_task__/MinReturn -33.1223 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 47.1976 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 602200 +------------------------------------------------- ----------- +2025-04-03 13:55:37 | [pearl_trainer] epoch #367 | Training... +2025-04-03 13:57:10 | [pearl_trainer] epoch #367 | Evaluating... +2025-04-03 13:57:10 | [pearl_trainer] epoch #367 | Sampling for adapation and meta-testing... +2025-04-03 13:59:02 | [pearl_trainer] epoch #367 | Finished meta-testing... +2025-04-03 13:59:02 | [pearl_trainer] epoch #367 | Saving snapshot... +2025-04-03 13:59:03 | [pearl_trainer] epoch #367 | Saved +2025-04-03 13:59:03 | [pearl_trainer] epoch #367 | Time 87255.67 s +2025-04-03 13:59:03 | [pearl_trainer] epoch #367 | EpochTime 236.27 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 3.54754 +MetaTest/Average/AverageReturn 3.54754 +MetaTest/Average/Iteration 367 +MetaTest/Average/MaxReturn 94.7647 +MetaTest/Average/MinReturn -66.6974 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 64.6886 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 3.54754 +MetaTest/__unnamed_task__/AverageReturn 3.54754 +MetaTest/__unnamed_task__/Iteration 367 +MetaTest/__unnamed_task__/MaxReturn 94.7647 +MetaTest/__unnamed_task__/MinReturn -66.6974 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 64.6886 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 603800 +------------------------------------------------- ------------ +2025-04-03 13:59:35 | [pearl_trainer] epoch #368 | Training... +2025-04-03 14:01:05 | [pearl_trainer] epoch #368 | Evaluating... +2025-04-03 14:01:05 | [pearl_trainer] epoch #368 | Sampling for adapation and meta-testing... +2025-04-03 14:02:56 | [pearl_trainer] epoch #368 | Finished meta-testing... +2025-04-03 14:02:56 | [pearl_trainer] epoch #368 | Saving snapshot... +2025-04-03 14:02:57 | [pearl_trainer] epoch #368 | Saved +2025-04-03 14:02:57 | [pearl_trainer] epoch #368 | Time 87490.02 s +2025-04-03 14:02:57 | [pearl_trainer] epoch #368 | EpochTime 234.34 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 4.39906 +MetaTest/Average/AverageReturn 4.39906 +MetaTest/Average/Iteration 368 +MetaTest/Average/MaxReturn 47.2264 +MetaTest/Average/MinReturn -26.2366 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 30.0113 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 4.39906 +MetaTest/__unnamed_task__/AverageReturn 4.39906 +MetaTest/__unnamed_task__/Iteration 368 +MetaTest/__unnamed_task__/MaxReturn 47.2264 +MetaTest/__unnamed_task__/MinReturn -26.2366 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 30.0113 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 605400 +------------------------------------------------- ------------ +2025-04-03 14:03:28 | [pearl_trainer] epoch #369 | Training... +2025-04-03 14:04:53 | [pearl_trainer] epoch #369 | Evaluating... +2025-04-03 14:04:53 | [pearl_trainer] epoch #369 | Sampling for adapation and meta-testing... +2025-04-03 14:06:43 | [pearl_trainer] epoch #369 | Finished meta-testing... +2025-04-03 14:06:43 | [pearl_trainer] epoch #369 | Saving snapshot... +2025-04-03 14:06:44 | [pearl_trainer] epoch #369 | Saved +2025-04-03 14:06:44 | [pearl_trainer] epoch #369 | Time 87717.35 s +2025-04-03 14:06:44 | [pearl_trainer] epoch #369 | EpochTime 227.33 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -20.2899 +MetaTest/Average/AverageReturn -20.2899 +MetaTest/Average/Iteration 369 +MetaTest/Average/MaxReturn -7.02503 +MetaTest/Average/MinReturn -32.2838 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 8.70215 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -20.2899 +MetaTest/__unnamed_task__/AverageReturn -20.2899 +MetaTest/__unnamed_task__/Iteration 369 +MetaTest/__unnamed_task__/MaxReturn -7.02503 +MetaTest/__unnamed_task__/MinReturn -32.2838 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 8.70215 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 607000 +------------------------------------------------- ------------ +2025-04-03 14:07:16 | [pearl_trainer] epoch #370 | Training... +2025-04-03 14:08:49 | [pearl_trainer] epoch #370 | Evaluating... +2025-04-03 14:08:49 | [pearl_trainer] epoch #370 | Sampling for adapation and meta-testing... +2025-04-03 14:10:39 | [pearl_trainer] epoch #370 | Finished meta-testing... +2025-04-03 14:10:39 | [pearl_trainer] epoch #370 | Saving snapshot... +2025-04-03 14:10:40 | [pearl_trainer] epoch #370 | Saved +2025-04-03 14:10:40 | [pearl_trainer] epoch #370 | Time 87953.31 s +2025-04-03 14:10:40 | [pearl_trainer] epoch #370 | EpochTime 235.96 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -10.8438 +MetaTest/Average/AverageReturn -10.8438 +MetaTest/Average/Iteration 370 +MetaTest/Average/MaxReturn 5.87911 +MetaTest/Average/MinReturn -35.7788 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 15.5446 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -10.8438 +MetaTest/__unnamed_task__/AverageReturn -10.8438 +MetaTest/__unnamed_task__/Iteration 370 +MetaTest/__unnamed_task__/MaxReturn 5.87911 +MetaTest/__unnamed_task__/MinReturn -35.7788 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 15.5446 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 608600 +------------------------------------------------- ------------ +2025-04-03 14:11:12 | [pearl_trainer] epoch #371 | Training... +2025-04-03 14:12:36 | [pearl_trainer] epoch #371 | Evaluating... +2025-04-03 14:12:36 | [pearl_trainer] epoch #371 | Sampling for adapation and meta-testing... +2025-04-03 14:14:25 | [pearl_trainer] epoch #371 | Finished meta-testing... +2025-04-03 14:14:25 | [pearl_trainer] epoch #371 | Saving snapshot... +2025-04-03 14:14:26 | [pearl_trainer] epoch #371 | Saved +2025-04-03 14:14:26 | [pearl_trainer] epoch #371 | Time 88179.40 s +2025-04-03 14:14:26 | [pearl_trainer] epoch #371 | EpochTime 226.09 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 10.4548 +MetaTest/Average/AverageReturn 10.4548 +MetaTest/Average/Iteration 371 +MetaTest/Average/MaxReturn 98.8222 +MetaTest/Average/MinReturn -23.463 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 44.9803 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 10.4548 +MetaTest/__unnamed_task__/AverageReturn 10.4548 +MetaTest/__unnamed_task__/Iteration 371 +MetaTest/__unnamed_task__/MaxReturn 98.8222 +MetaTest/__unnamed_task__/MinReturn -23.463 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 44.9803 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 610200 +------------------------------------------------- ----------- +2025-04-03 14:14:59 | [pearl_trainer] epoch #372 | Training... +2025-04-03 14:16:22 | [pearl_trainer] epoch #372 | Evaluating... +2025-04-03 14:16:22 | [pearl_trainer] epoch #372 | Sampling for adapation and meta-testing... +2025-04-03 14:18:12 | [pearl_trainer] epoch #372 | Finished meta-testing... +2025-04-03 14:18:12 | [pearl_trainer] epoch #372 | Saving snapshot... +2025-04-03 14:18:13 | [pearl_trainer] epoch #372 | Saved +2025-04-03 14:18:13 | [pearl_trainer] epoch #372 | Time 88405.80 s +2025-04-03 14:18:13 | [pearl_trainer] epoch #372 | EpochTime 226.40 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 21.3689 +MetaTest/Average/AverageReturn 21.3689 +MetaTest/Average/Iteration 372 +MetaTest/Average/MaxReturn 98.4612 +MetaTest/Average/MinReturn -22.5853 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 44.7392 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 21.3689 +MetaTest/__unnamed_task__/AverageReturn 21.3689 +MetaTest/__unnamed_task__/Iteration 372 +MetaTest/__unnamed_task__/MaxReturn 98.4612 +MetaTest/__unnamed_task__/MinReturn -22.5853 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 44.7392 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 611800 +------------------------------------------------- ----------- +2025-04-03 14:18:44 | [pearl_trainer] epoch #373 | Training... +2025-04-03 14:20:06 | [pearl_trainer] epoch #373 | Evaluating... +2025-04-03 14:20:06 | [pearl_trainer] epoch #373 | Sampling for adapation and meta-testing... +2025-04-03 14:21:58 | [pearl_trainer] epoch #373 | Finished meta-testing... +2025-04-03 14:21:58 | [pearl_trainer] epoch #373 | Saving snapshot... +2025-04-03 14:22:00 | [pearl_trainer] epoch #373 | Saved +2025-04-03 14:22:00 | [pearl_trainer] epoch #373 | Time 88632.65 s +2025-04-03 14:22:00 | [pearl_trainer] epoch #373 | EpochTime 226.84 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -5.44493 +MetaTest/Average/AverageReturn -5.44493 +MetaTest/Average/Iteration 373 +MetaTest/Average/MaxReturn 27.5568 +MetaTest/Average/MinReturn -20.8563 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 17.7138 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -5.44493 +MetaTest/__unnamed_task__/AverageReturn -5.44493 +MetaTest/__unnamed_task__/Iteration 373 +MetaTest/__unnamed_task__/MaxReturn 27.5568 +MetaTest/__unnamed_task__/MinReturn -20.8563 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 17.7138 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 613400 +------------------------------------------------- ------------ +2025-04-03 14:22:32 | [pearl_trainer] epoch #374 | Training... +2025-04-03 14:23:56 | [pearl_trainer] epoch #374 | Evaluating... +2025-04-03 14:23:56 | [pearl_trainer] epoch #374 | Sampling for adapation and meta-testing... +2025-04-03 14:25:45 | [pearl_trainer] epoch #374 | Finished meta-testing... +2025-04-03 14:25:45 | [pearl_trainer] epoch #374 | Saving snapshot... +2025-04-03 14:25:46 | [pearl_trainer] epoch #374 | Saved +2025-04-03 14:25:46 | [pearl_trainer] epoch #374 | Time 88859.07 s +2025-04-03 14:25:46 | [pearl_trainer] epoch #374 | EpochTime 226.42 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 13.2602 +MetaTest/Average/AverageReturn 13.2602 +MetaTest/Average/Iteration 374 +MetaTest/Average/MaxReturn 53.6991 +MetaTest/Average/MinReturn -26.3898 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 28.9452 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 13.2602 +MetaTest/__unnamed_task__/AverageReturn 13.2602 +MetaTest/__unnamed_task__/Iteration 374 +MetaTest/__unnamed_task__/MaxReturn 53.6991 +MetaTest/__unnamed_task__/MinReturn -26.3898 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 28.9452 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 615000 +------------------------------------------------- ----------- +2025-04-03 14:26:18 | [pearl_trainer] epoch #375 | Training... +2025-04-03 14:27:45 | [pearl_trainer] epoch #375 | Evaluating... +2025-04-03 14:27:45 | [pearl_trainer] epoch #375 | Sampling for adapation and meta-testing... +2025-04-03 14:29:41 | [pearl_trainer] epoch #375 | Finished meta-testing... +2025-04-03 14:29:41 | [pearl_trainer] epoch #375 | Saving snapshot... +2025-04-03 14:29:43 | [pearl_trainer] epoch #375 | Saved +2025-04-03 14:29:43 | [pearl_trainer] epoch #375 | Time 89095.90 s +2025-04-03 14:29:43 | [pearl_trainer] epoch #375 | EpochTime 236.83 s +------------------------------------------------- ------------- +MetaTest/Average/AverageDiscountedReturn 0.565775 +MetaTest/Average/AverageReturn 0.565775 +MetaTest/Average/Iteration 375 +MetaTest/Average/MaxReturn 92.3306 +MetaTest/Average/MinReturn -32.6637 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 46.3666 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 0.565775 +MetaTest/__unnamed_task__/AverageReturn 0.565775 +MetaTest/__unnamed_task__/Iteration 375 +MetaTest/__unnamed_task__/MaxReturn 92.3306 +MetaTest/__unnamed_task__/MinReturn -32.6637 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 46.3666 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 616600 +------------------------------------------------- ------------- +2025-04-03 14:30:15 | [pearl_trainer] epoch #376 | Training... +2025-04-03 14:31:41 | [pearl_trainer] epoch #376 | Evaluating... +2025-04-03 14:31:41 | [pearl_trainer] epoch #376 | Sampling for adapation and meta-testing... +2025-04-03 14:33:33 | [pearl_trainer] epoch #376 | Finished meta-testing... +2025-04-03 14:33:33 | [pearl_trainer] epoch #376 | Saving snapshot... +2025-04-03 14:33:34 | [pearl_trainer] epoch #376 | Saved +2025-04-03 14:33:34 | [pearl_trainer] epoch #376 | Time 89327.40 s +2025-04-03 14:33:34 | [pearl_trainer] epoch #376 | EpochTime 231.50 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 5.73295 +MetaTest/Average/AverageReturn 5.73295 +MetaTest/Average/Iteration 376 +MetaTest/Average/MaxReturn 91.0228 +MetaTest/Average/MinReturn -36.2285 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 46.9891 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 5.73295 +MetaTest/__unnamed_task__/AverageReturn 5.73295 +MetaTest/__unnamed_task__/Iteration 376 +MetaTest/__unnamed_task__/MaxReturn 91.0228 +MetaTest/__unnamed_task__/MinReturn -36.2285 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 46.9891 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 618200 +------------------------------------------------- ------------ +2025-04-03 14:34:06 | [pearl_trainer] epoch #377 | Training... +2025-04-03 14:35:35 | [pearl_trainer] epoch #377 | Evaluating... +2025-04-03 14:35:35 | [pearl_trainer] epoch #377 | Sampling for adapation and meta-testing... +2025-04-03 14:37:23 | [pearl_trainer] epoch #377 | Finished meta-testing... +2025-04-03 14:37:23 | [pearl_trainer] epoch #377 | Saving snapshot... +2025-04-03 14:37:24 | [pearl_trainer] epoch #377 | Saved +2025-04-03 14:37:24 | [pearl_trainer] epoch #377 | Time 89557.51 s +2025-04-03 14:37:24 | [pearl_trainer] epoch #377 | EpochTime 230.11 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 30.5647 +MetaTest/Average/AverageReturn 30.5647 +MetaTest/Average/Iteration 377 +MetaTest/Average/MaxReturn 92.3494 +MetaTest/Average/MinReturn -22.1798 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 43.0759 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 30.5647 +MetaTest/__unnamed_task__/AverageReturn 30.5647 +MetaTest/__unnamed_task__/Iteration 377 +MetaTest/__unnamed_task__/MaxReturn 92.3494 +MetaTest/__unnamed_task__/MinReturn -22.1798 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 43.0759 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 619800 +------------------------------------------------- ----------- +2025-04-03 14:37:56 | [pearl_trainer] epoch #378 | Training... +2025-04-03 14:39:27 | [pearl_trainer] epoch #378 | Evaluating... +2025-04-03 14:39:27 | [pearl_trainer] epoch #378 | Sampling for adapation and meta-testing... +2025-04-03 14:41:17 | [pearl_trainer] epoch #378 | Finished meta-testing... +2025-04-03 14:41:17 | [pearl_trainer] epoch #378 | Saving snapshot... +2025-04-03 14:41:19 | [pearl_trainer] epoch #378 | Saved +2025-04-03 14:41:19 | [pearl_trainer] epoch #378 | Time 89791.81 s +2025-04-03 14:41:19 | [pearl_trainer] epoch #378 | EpochTime 234.30 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 40.2081 +MetaTest/Average/AverageReturn 40.2081 +MetaTest/Average/Iteration 378 +MetaTest/Average/MaxReturn 120.217 +MetaTest/Average/MinReturn -17.6676 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 62.7613 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 40.2081 +MetaTest/__unnamed_task__/AverageReturn 40.2081 +MetaTest/__unnamed_task__/Iteration 378 +MetaTest/__unnamed_task__/MaxReturn 120.217 +MetaTest/__unnamed_task__/MinReturn -17.6676 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 62.7613 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 621400 +------------------------------------------------- ----------- +2025-04-03 14:41:50 | [pearl_trainer] epoch #379 | Training... +2025-04-03 14:43:17 | [pearl_trainer] epoch #379 | Evaluating... +2025-04-03 14:43:17 | [pearl_trainer] epoch #379 | Sampling for adapation and meta-testing... +2025-04-03 14:45:07 | [pearl_trainer] epoch #379 | Finished meta-testing... +2025-04-03 14:45:07 | [pearl_trainer] epoch #379 | Saving snapshot... +2025-04-03 14:45:09 | [pearl_trainer] epoch #379 | Saved +2025-04-03 14:45:09 | [pearl_trainer] epoch #379 | Time 90021.57 s +2025-04-03 14:45:09 | [pearl_trainer] epoch #379 | EpochTime 229.75 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -18.1176 +MetaTest/Average/AverageReturn -18.1176 +MetaTest/Average/Iteration 379 +MetaTest/Average/MaxReturn -6.9661 +MetaTest/Average/MinReturn -25.9156 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 6.34079 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -18.1176 +MetaTest/__unnamed_task__/AverageReturn -18.1176 +MetaTest/__unnamed_task__/Iteration 379 +MetaTest/__unnamed_task__/MaxReturn -6.9661 +MetaTest/__unnamed_task__/MinReturn -25.9156 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 6.34079 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 623000 +------------------------------------------------- ------------ +2025-04-03 14:45:41 | [pearl_trainer] epoch #380 | Training... +2025-04-03 14:47:04 | [pearl_trainer] epoch #380 | Evaluating... +2025-04-03 14:47:04 | [pearl_trainer] epoch #380 | Sampling for adapation and meta-testing... +2025-04-03 14:48:54 | [pearl_trainer] epoch #380 | Finished meta-testing... +2025-04-03 14:48:54 | [pearl_trainer] epoch #380 | Saving snapshot... +2025-04-03 14:48:55 | [pearl_trainer] epoch #380 | Saved +2025-04-03 14:48:55 | [pearl_trainer] epoch #380 | Time 90248.50 s +2025-04-03 14:48:55 | [pearl_trainer] epoch #380 | EpochTime 226.93 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -9.22491 +MetaTest/Average/AverageReturn -9.22491 +MetaTest/Average/Iteration 380 +MetaTest/Average/MaxReturn 64.9227 +MetaTest/Average/MinReturn -64.1332 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 44.1646 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -9.22491 +MetaTest/__unnamed_task__/AverageReturn -9.22491 +MetaTest/__unnamed_task__/Iteration 380 +MetaTest/__unnamed_task__/MaxReturn 64.9227 +MetaTest/__unnamed_task__/MinReturn -64.1332 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 44.1646 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 624600 +------------------------------------------------- ------------ +2025-04-03 14:49:27 | [pearl_trainer] epoch #381 | Training... +2025-04-03 14:50:51 | [pearl_trainer] epoch #381 | Evaluating... +2025-04-03 14:50:51 | [pearl_trainer] epoch #381 | Sampling for adapation and meta-testing... +2025-04-03 14:52:41 | [pearl_trainer] epoch #381 | Finished meta-testing... +2025-04-03 14:52:41 | [pearl_trainer] epoch #381 | Saving snapshot... +2025-04-03 14:52:42 | [pearl_trainer] epoch #381 | Saved +2025-04-03 14:52:42 | [pearl_trainer] epoch #381 | Time 90474.85 s +2025-04-03 14:52:42 | [pearl_trainer] epoch #381 | EpochTime 226.35 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 7.36026 +MetaTest/Average/AverageReturn 7.36026 +MetaTest/Average/Iteration 381 +MetaTest/Average/MaxReturn 109.854 +MetaTest/Average/MinReturn -42.0764 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 56.3698 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 7.36026 +MetaTest/__unnamed_task__/AverageReturn 7.36026 +MetaTest/__unnamed_task__/Iteration 381 +MetaTest/__unnamed_task__/MaxReturn 109.854 +MetaTest/__unnamed_task__/MinReturn -42.0764 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 56.3698 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 626200 +------------------------------------------------- ------------ +2025-04-03 14:53:14 | [pearl_trainer] epoch #382 | Training... +2025-04-03 14:54:38 | [pearl_trainer] epoch #382 | Evaluating... +2025-04-03 14:54:38 | [pearl_trainer] epoch #382 | Sampling for adapation and meta-testing... +2025-04-03 14:56:26 | [pearl_trainer] epoch #382 | Finished meta-testing... +2025-04-03 14:56:26 | [pearl_trainer] epoch #382 | Saving snapshot... +2025-04-03 14:56:27 | [pearl_trainer] epoch #382 | Saved +2025-04-03 14:56:27 | [pearl_trainer] epoch #382 | Time 90700.35 s +2025-04-03 14:56:27 | [pearl_trainer] epoch #382 | EpochTime 225.50 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 38.1325 +MetaTest/Average/AverageReturn 38.1325 +MetaTest/Average/Iteration 382 +MetaTest/Average/MaxReturn 151.998 +MetaTest/Average/MinReturn -15.0029 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 60.7672 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 38.1325 +MetaTest/__unnamed_task__/AverageReturn 38.1325 +MetaTest/__unnamed_task__/Iteration 382 +MetaTest/__unnamed_task__/MaxReturn 151.998 +MetaTest/__unnamed_task__/MinReturn -15.0029 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 60.7672 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 627800 +------------------------------------------------- ----------- +2025-04-03 14:57:00 | [pearl_trainer] epoch #383 | Training... +2025-04-03 14:58:28 | [pearl_trainer] epoch #383 | Evaluating... +2025-04-03 14:58:28 | [pearl_trainer] epoch #383 | Sampling for adapation and meta-testing... +2025-04-03 15:00:23 | [pearl_trainer] epoch #383 | Finished meta-testing... +2025-04-03 15:00:23 | [pearl_trainer] epoch #383 | Saving snapshot... +2025-04-03 15:00:24 | [pearl_trainer] epoch #383 | Saved +2025-04-03 15:00:24 | [pearl_trainer] epoch #383 | Time 90937.39 s +2025-04-03 15:00:24 | [pearl_trainer] epoch #383 | EpochTime 237.04 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 30.9898 +MetaTest/Average/AverageReturn 30.9898 +MetaTest/Average/Iteration 383 +MetaTest/Average/MaxReturn 123.982 +MetaTest/Average/MinReturn -53.842 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 72.3082 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 30.9898 +MetaTest/__unnamed_task__/AverageReturn 30.9898 +MetaTest/__unnamed_task__/Iteration 383 +MetaTest/__unnamed_task__/MaxReturn 123.982 +MetaTest/__unnamed_task__/MinReturn -53.842 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 72.3082 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 629400 +------------------------------------------------- ----------- +2025-04-03 15:00:56 | [pearl_trainer] epoch #384 | Training... +2025-04-03 15:02:22 | [pearl_trainer] epoch #384 | Evaluating... +2025-04-03 15:02:22 | [pearl_trainer] epoch #384 | Sampling for adapation and meta-testing... +2025-04-03 15:04:10 | [pearl_trainer] epoch #384 | Finished meta-testing... +2025-04-03 15:04:10 | [pearl_trainer] epoch #384 | Saving snapshot... +2025-04-03 15:04:11 | [pearl_trainer] epoch #384 | Saved +2025-04-03 15:04:11 | [pearl_trainer] epoch #384 | Time 91164.36 s +2025-04-03 15:04:11 | [pearl_trainer] epoch #384 | EpochTime 226.98 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 30.8475 +MetaTest/Average/AverageReturn 30.8475 +MetaTest/Average/Iteration 384 +MetaTest/Average/MaxReturn 118.854 +MetaTest/Average/MinReturn -16.9015 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 56.6337 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 30.8475 +MetaTest/__unnamed_task__/AverageReturn 30.8475 +MetaTest/__unnamed_task__/Iteration 384 +MetaTest/__unnamed_task__/MaxReturn 118.854 +MetaTest/__unnamed_task__/MinReturn -16.9015 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 56.6337 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 631000 +------------------------------------------------- ----------- +2025-04-03 15:04:42 | [pearl_trainer] epoch #385 | Training... +2025-04-03 15:06:10 | [pearl_trainer] epoch #385 | Evaluating... +2025-04-03 15:06:10 | [pearl_trainer] epoch #385 | Sampling for adapation and meta-testing... +2025-04-03 15:08:01 | [pearl_trainer] epoch #385 | Finished meta-testing... +2025-04-03 15:08:01 | [pearl_trainer] epoch #385 | Saving snapshot... +2025-04-03 15:08:02 | [pearl_trainer] epoch #385 | Saved +2025-04-03 15:08:02 | [pearl_trainer] epoch #385 | Time 91395.01 s +2025-04-03 15:08:02 | [pearl_trainer] epoch #385 | EpochTime 230.64 s +------------------------------------------------- -------------- +MetaTest/Average/AverageDiscountedReturn -0.0276818 +MetaTest/Average/AverageReturn -0.0276818 +MetaTest/Average/Iteration 385 +MetaTest/Average/MaxReturn 77.1164 +MetaTest/Average/MinReturn -50.9203 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 45.7485 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -0.0276818 +MetaTest/__unnamed_task__/AverageReturn -0.0276818 +MetaTest/__unnamed_task__/Iteration 385 +MetaTest/__unnamed_task__/MaxReturn 77.1164 +MetaTest/__unnamed_task__/MinReturn -50.9203 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 45.7485 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 632600 +------------------------------------------------- -------------- +2025-04-03 15:08:33 | [pearl_trainer] epoch #386 | Training... +2025-04-03 15:10:03 | [pearl_trainer] epoch #386 | Evaluating... +2025-04-03 15:10:03 | [pearl_trainer] epoch #386 | Sampling for adapation and meta-testing... +2025-04-03 15:11:52 | [pearl_trainer] epoch #386 | Finished meta-testing... +2025-04-03 15:11:52 | [pearl_trainer] epoch #386 | Saving snapshot... +2025-04-03 15:11:53 | [pearl_trainer] epoch #386 | Saved +2025-04-03 15:11:53 | [pearl_trainer] epoch #386 | Time 91626.16 s +2025-04-03 15:11:53 | [pearl_trainer] epoch #386 | EpochTime 231.15 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -9.77541 +MetaTest/Average/AverageReturn -9.77541 +MetaTest/Average/Iteration 386 +MetaTest/Average/MaxReturn 31.5874 +MetaTest/Average/MinReturn -42.1631 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 24.1848 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -9.77541 +MetaTest/__unnamed_task__/AverageReturn -9.77541 +MetaTest/__unnamed_task__/Iteration 386 +MetaTest/__unnamed_task__/MaxReturn 31.5874 +MetaTest/__unnamed_task__/MinReturn -42.1631 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 24.1848 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 634200 +------------------------------------------------- ------------ +2025-04-03 15:12:24 | [pearl_trainer] epoch #387 | Training... +2025-04-03 15:13:56 | [pearl_trainer] epoch #387 | Evaluating... +2025-04-03 15:13:56 | [pearl_trainer] epoch #387 | Sampling for adapation and meta-testing... +2025-04-03 15:15:44 | [pearl_trainer] epoch #387 | Finished meta-testing... +2025-04-03 15:15:44 | [pearl_trainer] epoch #387 | Saving snapshot... +2025-04-03 15:15:45 | [pearl_trainer] epoch #387 | Saved +2025-04-03 15:15:45 | [pearl_trainer] epoch #387 | Time 91857.75 s +2025-04-03 15:15:45 | [pearl_trainer] epoch #387 | EpochTime 231.59 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 44.7332 +MetaTest/Average/AverageReturn 44.7332 +MetaTest/Average/Iteration 387 +MetaTest/Average/MaxReturn 94.3605 +MetaTest/Average/MinReturn -20.5399 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 39.9449 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 44.7332 +MetaTest/__unnamed_task__/AverageReturn 44.7332 +MetaTest/__unnamed_task__/Iteration 387 +MetaTest/__unnamed_task__/MaxReturn 94.3605 +MetaTest/__unnamed_task__/MinReturn -20.5399 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 39.9449 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 635800 +------------------------------------------------- ----------- +2025-04-03 15:16:16 | [pearl_trainer] epoch #388 | Training... +2025-04-03 15:17:42 | [pearl_trainer] epoch #388 | Evaluating... +2025-04-03 15:17:42 | [pearl_trainer] epoch #388 | Sampling for adapation and meta-testing... +2025-04-03 15:19:34 | [pearl_trainer] epoch #388 | Finished meta-testing... +2025-04-03 15:19:34 | [pearl_trainer] epoch #388 | Saving snapshot... +2025-04-03 15:19:35 | [pearl_trainer] epoch #388 | Saved +2025-04-03 15:19:35 | [pearl_trainer] epoch #388 | Time 92088.28 s +2025-04-03 15:19:35 | [pearl_trainer] epoch #388 | EpochTime 230.53 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -25.535 +MetaTest/Average/AverageReturn -25.535 +MetaTest/Average/Iteration 388 +MetaTest/Average/MaxReturn 40.5486 +MetaTest/Average/MinReturn -60.5575 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 37.225 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -25.535 +MetaTest/__unnamed_task__/AverageReturn -25.535 +MetaTest/__unnamed_task__/Iteration 388 +MetaTest/__unnamed_task__/MaxReturn 40.5486 +MetaTest/__unnamed_task__/MinReturn -60.5575 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 37.225 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 637400 +------------------------------------------------- ----------- +2025-04-03 15:20:07 | [pearl_trainer] epoch #389 | Training... +2025-04-03 15:21:29 | [pearl_trainer] epoch #389 | Evaluating... +2025-04-03 15:21:29 | [pearl_trainer] epoch #389 | Sampling for adapation and meta-testing... +2025-04-03 15:23:21 | [pearl_trainer] epoch #389 | Finished meta-testing... +2025-04-03 15:23:21 | [pearl_trainer] epoch #389 | Saving snapshot... +2025-04-03 15:23:22 | [pearl_trainer] epoch #389 | Saved +2025-04-03 15:23:22 | [pearl_trainer] epoch #389 | Time 92314.92 s +2025-04-03 15:23:22 | [pearl_trainer] epoch #389 | EpochTime 226.63 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 13.5886 +MetaTest/Average/AverageReturn 13.5886 +MetaTest/Average/Iteration 389 +MetaTest/Average/MaxReturn 64.5526 +MetaTest/Average/MinReturn -21.4951 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 35.8927 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 13.5886 +MetaTest/__unnamed_task__/AverageReturn 13.5886 +MetaTest/__unnamed_task__/Iteration 389 +MetaTest/__unnamed_task__/MaxReturn 64.5526 +MetaTest/__unnamed_task__/MinReturn -21.4951 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 35.8927 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 639000 +------------------------------------------------- ----------- +2025-04-03 15:23:54 | [pearl_trainer] epoch #390 | Training... +2025-04-03 15:25:22 | [pearl_trainer] epoch #390 | Evaluating... +2025-04-03 15:25:22 | [pearl_trainer] epoch #390 | Sampling for adapation and meta-testing... +2025-04-03 15:27:13 | [pearl_trainer] epoch #390 | Finished meta-testing... +2025-04-03 15:27:13 | [pearl_trainer] epoch #390 | Saving snapshot... +2025-04-03 15:27:14 | [pearl_trainer] epoch #390 | Saved +2025-04-03 15:27:14 | [pearl_trainer] epoch #390 | Time 92547.27 s +2025-04-03 15:27:14 | [pearl_trainer] epoch #390 | EpochTime 232.35 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -1.38464 +MetaTest/Average/AverageReturn -1.38464 +MetaTest/Average/Iteration 390 +MetaTest/Average/MaxReturn 27.0493 +MetaTest/Average/MinReturn -63.5025 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 32.6406 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -1.38464 +MetaTest/__unnamed_task__/AverageReturn -1.38464 +MetaTest/__unnamed_task__/Iteration 390 +MetaTest/__unnamed_task__/MaxReturn 27.0493 +MetaTest/__unnamed_task__/MinReturn -63.5025 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 32.6406 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 640600 +------------------------------------------------- ------------ +2025-04-03 15:27:49 | [pearl_trainer] epoch #391 | Training... +2025-04-03 15:29:12 | [pearl_trainer] epoch #391 | Evaluating... +2025-04-03 15:29:12 | [pearl_trainer] epoch #391 | Sampling for adapation and meta-testing... +2025-04-03 15:31:02 | [pearl_trainer] epoch #391 | Finished meta-testing... +2025-04-03 15:31:02 | [pearl_trainer] epoch #391 | Saving snapshot... +2025-04-03 15:31:03 | [pearl_trainer] epoch #391 | Saved +2025-04-03 15:31:03 | [pearl_trainer] epoch #391 | Time 92776.42 s +2025-04-03 15:31:03 | [pearl_trainer] epoch #391 | EpochTime 229.15 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 33.6559 +MetaTest/Average/AverageReturn 33.6559 +MetaTest/Average/Iteration 391 +MetaTest/Average/MaxReturn 119.202 +MetaTest/Average/MinReturn -37.5894 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 58.4311 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 33.6559 +MetaTest/__unnamed_task__/AverageReturn 33.6559 +MetaTest/__unnamed_task__/Iteration 391 +MetaTest/__unnamed_task__/MaxReturn 119.202 +MetaTest/__unnamed_task__/MinReturn -37.5894 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 58.4311 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 642200 +------------------------------------------------- ----------- +2025-04-03 15:31:35 | [pearl_trainer] epoch #392 | Training... +2025-04-03 15:33:01 | [pearl_trainer] epoch #392 | Evaluating... +2025-04-03 15:33:01 | [pearl_trainer] epoch #392 | Sampling for adapation and meta-testing... +2025-04-03 15:34:52 | [pearl_trainer] epoch #392 | Finished meta-testing... +2025-04-03 15:34:52 | [pearl_trainer] epoch #392 | Saving snapshot... +2025-04-03 15:34:53 | [pearl_trainer] epoch #392 | Saved +2025-04-03 15:34:53 | [pearl_trainer] epoch #392 | Time 93005.77 s +2025-04-03 15:34:53 | [pearl_trainer] epoch #392 | EpochTime 229.35 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 20.1428 +MetaTest/Average/AverageReturn 20.1428 +MetaTest/Average/Iteration 392 +MetaTest/Average/MaxReturn 69.4497 +MetaTest/Average/MinReturn -60.3807 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 46.3405 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 20.1428 +MetaTest/__unnamed_task__/AverageReturn 20.1428 +MetaTest/__unnamed_task__/Iteration 392 +MetaTest/__unnamed_task__/MaxReturn 69.4497 +MetaTest/__unnamed_task__/MinReturn -60.3807 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 46.3405 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 643800 +------------------------------------------------- ----------- +2025-04-03 15:35:24 | [pearl_trainer] epoch #393 | Training... +2025-04-03 15:36:50 | [pearl_trainer] epoch #393 | Evaluating... +2025-04-03 15:36:50 | [pearl_trainer] epoch #393 | Sampling for adapation and meta-testing... +2025-04-03 15:38:41 | [pearl_trainer] epoch #393 | Finished meta-testing... +2025-04-03 15:38:41 | [pearl_trainer] epoch #393 | Saving snapshot... +2025-04-03 15:38:42 | [pearl_trainer] epoch #393 | Saved +2025-04-03 15:38:42 | [pearl_trainer] epoch #393 | Time 93235.50 s +2025-04-03 15:38:42 | [pearl_trainer] epoch #393 | EpochTime 229.72 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 27.4463 +MetaTest/Average/AverageReturn 27.4463 +MetaTest/Average/Iteration 393 +MetaTest/Average/MaxReturn 106.722 +MetaTest/Average/MinReturn -64.4259 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 55.5632 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 27.4463 +MetaTest/__unnamed_task__/AverageReturn 27.4463 +MetaTest/__unnamed_task__/Iteration 393 +MetaTest/__unnamed_task__/MaxReturn 106.722 +MetaTest/__unnamed_task__/MinReturn -64.4259 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 55.5632 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 645400 +------------------------------------------------- ----------- +2025-04-03 15:39:14 | [pearl_trainer] epoch #394 | Training... +2025-04-03 15:40:42 | [pearl_trainer] epoch #394 | Evaluating... +2025-04-03 15:40:42 | [pearl_trainer] epoch #394 | Sampling for adapation and meta-testing... +2025-04-03 15:42:29 | [pearl_trainer] epoch #394 | Finished meta-testing... +2025-04-03 15:42:29 | [pearl_trainer] epoch #394 | Saving snapshot... +2025-04-03 15:42:30 | [pearl_trainer] epoch #394 | Saved +2025-04-03 15:42:30 | [pearl_trainer] epoch #394 | Time 93463.01 s +2025-04-03 15:42:30 | [pearl_trainer] epoch #394 | EpochTime 227.52 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -43.7545 +MetaTest/Average/AverageReturn -43.7545 +MetaTest/Average/Iteration 394 +MetaTest/Average/MaxReturn -19.6553 +MetaTest/Average/MinReturn -60.8936 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 14.7738 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -43.7545 +MetaTest/__unnamed_task__/AverageReturn -43.7545 +MetaTest/__unnamed_task__/Iteration 394 +MetaTest/__unnamed_task__/MaxReturn -19.6553 +MetaTest/__unnamed_task__/MinReturn -60.8936 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 14.7738 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 647000 +------------------------------------------------- ----------- +2025-04-03 15:43:01 | [pearl_trainer] epoch #395 | Training... +2025-04-03 15:44:31 | [pearl_trainer] epoch #395 | Evaluating... +2025-04-03 15:44:31 | [pearl_trainer] epoch #395 | Sampling for adapation and meta-testing... +2025-04-03 15:46:18 | [pearl_trainer] epoch #395 | Finished meta-testing... +2025-04-03 15:46:18 | [pearl_trainer] epoch #395 | Saving snapshot... +2025-04-03 15:46:19 | [pearl_trainer] epoch #395 | Saved +2025-04-03 15:46:19 | [pearl_trainer] epoch #395 | Time 93692.30 s +2025-04-03 15:46:19 | [pearl_trainer] epoch #395 | EpochTime 229.28 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 11.3554 +MetaTest/Average/AverageReturn 11.3554 +MetaTest/Average/Iteration 395 +MetaTest/Average/MaxReturn 122.856 +MetaTest/Average/MinReturn -46.4708 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 58.5169 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 11.3554 +MetaTest/__unnamed_task__/AverageReturn 11.3554 +MetaTest/__unnamed_task__/Iteration 395 +MetaTest/__unnamed_task__/MaxReturn 122.856 +MetaTest/__unnamed_task__/MinReturn -46.4708 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 58.5169 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 648600 +------------------------------------------------- ----------- +2025-04-03 15:46:50 | [pearl_trainer] epoch #396 | Training... +2025-04-03 15:48:21 | [pearl_trainer] epoch #396 | Evaluating... +2025-04-03 15:48:21 | [pearl_trainer] epoch #396 | Sampling for adapation and meta-testing... +2025-04-03 15:50:10 | [pearl_trainer] epoch #396 | Finished meta-testing... +2025-04-03 15:50:10 | [pearl_trainer] epoch #396 | Saving snapshot... +2025-04-03 15:50:11 | [pearl_trainer] epoch #396 | Saved +2025-04-03 15:50:11 | [pearl_trainer] epoch #396 | Time 93923.76 s +2025-04-03 15:50:11 | [pearl_trainer] epoch #396 | EpochTime 231.45 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -19.4318 +MetaTest/Average/AverageReturn -19.4318 +MetaTest/Average/Iteration 396 +MetaTest/Average/MaxReturn 78.7992 +MetaTest/Average/MinReturn -59.5237 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 50.0915 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -19.4318 +MetaTest/__unnamed_task__/AverageReturn -19.4318 +MetaTest/__unnamed_task__/Iteration 396 +MetaTest/__unnamed_task__/MaxReturn 78.7992 +MetaTest/__unnamed_task__/MinReturn -59.5237 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 50.0915 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 650200 +------------------------------------------------- ----------- +2025-04-03 15:50:42 | [pearl_trainer] epoch #397 | Training... +2025-04-03 15:52:13 | [pearl_trainer] epoch #397 | Evaluating... +2025-04-03 15:52:13 | [pearl_trainer] epoch #397 | Sampling for adapation and meta-testing... +2025-04-03 15:54:06 | [pearl_trainer] epoch #397 | Finished meta-testing... +2025-04-03 15:54:06 | [pearl_trainer] epoch #397 | Saving snapshot... +2025-04-03 15:54:07 | [pearl_trainer] epoch #397 | Saved +2025-04-03 15:54:07 | [pearl_trainer] epoch #397 | Time 94160.38 s +2025-04-03 15:54:07 | [pearl_trainer] epoch #397 | EpochTime 236.62 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -8.28416 +MetaTest/Average/AverageReturn -8.28416 +MetaTest/Average/Iteration 397 +MetaTest/Average/MaxReturn 54.6349 +MetaTest/Average/MinReturn -59.4604 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 44.7574 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -8.28416 +MetaTest/__unnamed_task__/AverageReturn -8.28416 +MetaTest/__unnamed_task__/Iteration 397 +MetaTest/__unnamed_task__/MaxReturn 54.6349 +MetaTest/__unnamed_task__/MinReturn -59.4604 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 44.7574 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 651800 +------------------------------------------------- ------------ +2025-04-03 15:54:39 | [pearl_trainer] epoch #398 | Training... +2025-04-03 15:56:07 | [pearl_trainer] epoch #398 | Evaluating... +2025-04-03 15:56:07 | [pearl_trainer] epoch #398 | Sampling for adapation and meta-testing... +2025-04-03 15:57:53 | [pearl_trainer] epoch #398 | Finished meta-testing... +2025-04-03 15:57:53 | [pearl_trainer] epoch #398 | Saving snapshot... +2025-04-03 15:57:55 | [pearl_trainer] epoch #398 | Saved +2025-04-03 15:57:55 | [pearl_trainer] epoch #398 | Time 94387.57 s +2025-04-03 15:57:55 | [pearl_trainer] epoch #398 | EpochTime 227.18 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 14.0647 +MetaTest/Average/AverageReturn 14.0647 +MetaTest/Average/Iteration 398 +MetaTest/Average/MaxReturn 101.288 +MetaTest/Average/MinReturn -59.6988 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 56.3735 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 14.0647 +MetaTest/__unnamed_task__/AverageReturn 14.0647 +MetaTest/__unnamed_task__/Iteration 398 +MetaTest/__unnamed_task__/MaxReturn 101.288 +MetaTest/__unnamed_task__/MinReturn -59.6988 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 56.3735 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 653400 +------------------------------------------------- ----------- +2025-04-03 15:58:25 | [pearl_trainer] epoch #399 | Training... +2025-04-03 15:59:50 | [pearl_trainer] epoch #399 | Evaluating... +2025-04-03 15:59:50 | [pearl_trainer] epoch #399 | Sampling for adapation and meta-testing... +2025-04-03 16:01:41 | [pearl_trainer] epoch #399 | Finished meta-testing... +2025-04-03 16:01:41 | [pearl_trainer] epoch #399 | Saving snapshot... +2025-04-03 16:01:42 | [pearl_trainer] epoch #399 | Saved +2025-04-03 16:01:42 | [pearl_trainer] epoch #399 | Time 94615.20 s +2025-04-03 16:01:42 | [pearl_trainer] epoch #399 | EpochTime 227.63 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -19.7024 +MetaTest/Average/AverageReturn -19.7024 +MetaTest/Average/Iteration 399 +MetaTest/Average/MaxReturn 17.8351 +MetaTest/Average/MinReturn -52.6797 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 29.3072 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -19.7024 +MetaTest/__unnamed_task__/AverageReturn -19.7024 +MetaTest/__unnamed_task__/Iteration 399 +MetaTest/__unnamed_task__/MaxReturn 17.8351 +MetaTest/__unnamed_task__/MinReturn -52.6797 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 29.3072 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 655000 +------------------------------------------------- ----------- +2025-04-03 16:02:15 | [pearl_trainer] epoch #400 | Training... +2025-04-03 16:03:41 | [pearl_trainer] epoch #400 | Evaluating... +2025-04-03 16:03:41 | [pearl_trainer] epoch #400 | Sampling for adapation and meta-testing... +2025-04-03 16:05:32 | [pearl_trainer] epoch #400 | Finished meta-testing... +2025-04-03 16:05:32 | [pearl_trainer] epoch #400 | Saving snapshot... +2025-04-03 16:05:33 | [pearl_trainer] epoch #400 | Saved +2025-04-03 16:05:33 | [pearl_trainer] epoch #400 | Time 94846.09 s +2025-04-03 16:05:33 | [pearl_trainer] epoch #400 | EpochTime 230.88 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 6.2983 +MetaTest/Average/AverageReturn 6.2983 +MetaTest/Average/Iteration 400 +MetaTest/Average/MaxReturn 36.2262 +MetaTest/Average/MinReturn -19.6943 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 24.4381 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 6.2983 +MetaTest/__unnamed_task__/AverageReturn 6.2983 +MetaTest/__unnamed_task__/Iteration 400 +MetaTest/__unnamed_task__/MaxReturn 36.2262 +MetaTest/__unnamed_task__/MinReturn -19.6943 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 24.4381 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 656600 +------------------------------------------------- ----------- +2025-04-03 16:06:05 | [pearl_trainer] epoch #401 | Training... +2025-04-03 16:07:27 | [pearl_trainer] epoch #401 | Evaluating... +2025-04-03 16:07:27 | [pearl_trainer] epoch #401 | Sampling for adapation and meta-testing... +2025-04-03 16:09:18 | [pearl_trainer] epoch #401 | Finished meta-testing... +2025-04-03 16:09:18 | [pearl_trainer] epoch #401 | Saving snapshot... +2025-04-03 16:09:19 | [pearl_trainer] epoch #401 | Saved +2025-04-03 16:09:19 | [pearl_trainer] epoch #401 | Time 95072.05 s +2025-04-03 16:09:19 | [pearl_trainer] epoch #401 | EpochTime 225.96 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -4.90834 +MetaTest/Average/AverageReturn -4.90834 +MetaTest/Average/Iteration 401 +MetaTest/Average/MaxReturn 109.296 +MetaTest/Average/MinReturn -54.1656 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 58.8671 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -4.90834 +MetaTest/__unnamed_task__/AverageReturn -4.90834 +MetaTest/__unnamed_task__/Iteration 401 +MetaTest/__unnamed_task__/MaxReturn 109.296 +MetaTest/__unnamed_task__/MinReturn -54.1656 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 58.8671 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 658200 +------------------------------------------------- ------------ +2025-04-03 16:09:50 | [pearl_trainer] epoch #402 | Training... +2025-04-03 16:11:17 | [pearl_trainer] epoch #402 | Evaluating... +2025-04-03 16:11:17 | [pearl_trainer] epoch #402 | Sampling for adapation and meta-testing... +2025-04-03 16:13:05 | [pearl_trainer] epoch #402 | Finished meta-testing... +2025-04-03 16:13:05 | [pearl_trainer] epoch #402 | Saving snapshot... +2025-04-03 16:13:06 | [pearl_trainer] epoch #402 | Saved +2025-04-03 16:13:06 | [pearl_trainer] epoch #402 | Time 95298.67 s +2025-04-03 16:13:06 | [pearl_trainer] epoch #402 | EpochTime 226.62 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -19.5623 +MetaTest/Average/AverageReturn -19.5623 +MetaTest/Average/Iteration 402 +MetaTest/Average/MaxReturn 2.9259 +MetaTest/Average/MinReturn -75.9223 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 28.6953 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -19.5623 +MetaTest/__unnamed_task__/AverageReturn -19.5623 +MetaTest/__unnamed_task__/Iteration 402 +MetaTest/__unnamed_task__/MaxReturn 2.9259 +MetaTest/__unnamed_task__/MinReturn -75.9223 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 28.6953 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 659800 +------------------------------------------------- ----------- +2025-04-03 16:13:39 | [pearl_trainer] epoch #403 | Training... +2025-04-03 16:15:03 | [pearl_trainer] epoch #403 | Evaluating... +2025-04-03 16:15:03 | [pearl_trainer] epoch #403 | Sampling for adapation and meta-testing... +2025-04-03 16:16:55 | [pearl_trainer] epoch #403 | Finished meta-testing... +2025-04-03 16:16:55 | [pearl_trainer] epoch #403 | Saving snapshot... +2025-04-03 16:16:56 | [pearl_trainer] epoch #403 | Saved +2025-04-03 16:16:56 | [pearl_trainer] epoch #403 | Time 95529.34 s +2025-04-03 16:16:56 | [pearl_trainer] epoch #403 | EpochTime 230.66 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 8.77227 +MetaTest/Average/AverageReturn 8.77227 +MetaTest/Average/Iteration 403 +MetaTest/Average/MaxReturn 151.565 +MetaTest/Average/MinReturn -56.8052 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 73.103 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 8.77227 +MetaTest/__unnamed_task__/AverageReturn 8.77227 +MetaTest/__unnamed_task__/Iteration 403 +MetaTest/__unnamed_task__/MaxReturn 151.565 +MetaTest/__unnamed_task__/MinReturn -56.8052 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 73.103 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 661400 +------------------------------------------------- ------------ +2025-04-03 16:17:29 | [pearl_trainer] epoch #404 | Training... +2025-04-03 16:18:53 | [pearl_trainer] epoch #404 | Evaluating... +2025-04-03 16:18:53 | [pearl_trainer] epoch #404 | Sampling for adapation and meta-testing... +2025-04-03 16:20:44 | [pearl_trainer] epoch #404 | Finished meta-testing... +2025-04-03 16:20:44 | [pearl_trainer] epoch #404 | Saving snapshot... +2025-04-03 16:20:45 | [pearl_trainer] epoch #404 | Saved +2025-04-03 16:20:45 | [pearl_trainer] epoch #404 | Time 95758.49 s +2025-04-03 16:20:45 | [pearl_trainer] epoch #404 | EpochTime 229.15 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 35.2192 +MetaTest/Average/AverageReturn 35.2192 +MetaTest/Average/Iteration 404 +MetaTest/Average/MaxReturn 136.087 +MetaTest/Average/MinReturn -52.6545 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 64.9377 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 35.2192 +MetaTest/__unnamed_task__/AverageReturn 35.2192 +MetaTest/__unnamed_task__/Iteration 404 +MetaTest/__unnamed_task__/MaxReturn 136.087 +MetaTest/__unnamed_task__/MinReturn -52.6545 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 64.9377 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 663000 +------------------------------------------------- ----------- +2025-04-03 16:21:17 | [pearl_trainer] epoch #405 | Training... +2025-04-03 16:22:52 | [pearl_trainer] epoch #405 | Evaluating... +2025-04-03 16:22:52 | [pearl_trainer] epoch #405 | Sampling for adapation and meta-testing... +2025-04-03 16:24:43 | [pearl_trainer] epoch #405 | Finished meta-testing... +2025-04-03 16:24:43 | [pearl_trainer] epoch #405 | Saving snapshot... +2025-04-03 16:24:44 | [pearl_trainer] epoch #405 | Saved +2025-04-03 16:24:44 | [pearl_trainer] epoch #405 | Time 95997.34 s +2025-04-03 16:24:44 | [pearl_trainer] epoch #405 | EpochTime 238.85 s +------------------------------------------------- ------------- +MetaTest/Average/AverageDiscountedReturn 0.924035 +MetaTest/Average/AverageReturn 0.924035 +MetaTest/Average/Iteration 405 +MetaTest/Average/MaxReturn 65.473 +MetaTest/Average/MinReturn -38.2148 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 40.9346 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 0.924035 +MetaTest/__unnamed_task__/AverageReturn 0.924035 +MetaTest/__unnamed_task__/Iteration 405 +MetaTest/__unnamed_task__/MaxReturn 65.473 +MetaTest/__unnamed_task__/MinReturn -38.2148 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 40.9346 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 664600 +------------------------------------------------- ------------- +2025-04-03 16:25:16 | [pearl_trainer] epoch #406 | Training... +2025-04-03 16:26:45 | [pearl_trainer] epoch #406 | Evaluating... +2025-04-03 16:26:45 | [pearl_trainer] epoch #406 | Sampling for adapation and meta-testing... +2025-04-03 16:28:35 | [pearl_trainer] epoch #406 | Finished meta-testing... +2025-04-03 16:28:35 | [pearl_trainer] epoch #406 | Saving snapshot... +2025-04-03 16:28:36 | [pearl_trainer] epoch #406 | Saved +2025-04-03 16:28:36 | [pearl_trainer] epoch #406 | Time 96228.63 s +2025-04-03 16:28:36 | [pearl_trainer] epoch #406 | EpochTime 231.29 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 3.84333 +MetaTest/Average/AverageReturn 3.84333 +MetaTest/Average/Iteration 406 +MetaTest/Average/MaxReturn 91.2768 +MetaTest/Average/MinReturn -58.8732 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 53.3643 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 3.84333 +MetaTest/__unnamed_task__/AverageReturn 3.84333 +MetaTest/__unnamed_task__/Iteration 406 +MetaTest/__unnamed_task__/MaxReturn 91.2768 +MetaTest/__unnamed_task__/MinReturn -58.8732 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 53.3643 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 666200 +------------------------------------------------- ------------ +2025-04-03 16:29:10 | [pearl_trainer] epoch #407 | Training... +2025-04-03 16:30:36 | [pearl_trainer] epoch #407 | Evaluating... +2025-04-03 16:30:36 | [pearl_trainer] epoch #407 | Sampling for adapation and meta-testing... +2025-04-03 16:32:29 | [pearl_trainer] epoch #407 | Finished meta-testing... +2025-04-03 16:32:29 | [pearl_trainer] epoch #407 | Saving snapshot... +2025-04-03 16:32:30 | [pearl_trainer] epoch #407 | Saved +2025-04-03 16:32:30 | [pearl_trainer] epoch #407 | Time 96462.97 s +2025-04-03 16:32:30 | [pearl_trainer] epoch #407 | EpochTime 234.34 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 12.8365 +MetaTest/Average/AverageReturn 12.8365 +MetaTest/Average/Iteration 407 +MetaTest/Average/MaxReturn 126.453 +MetaTest/Average/MinReturn -51.1779 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 61.7941 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 12.8365 +MetaTest/__unnamed_task__/AverageReturn 12.8365 +MetaTest/__unnamed_task__/Iteration 407 +MetaTest/__unnamed_task__/MaxReturn 126.453 +MetaTest/__unnamed_task__/MinReturn -51.1779 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 61.7941 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 667800 +------------------------------------------------- ----------- +2025-04-03 16:33:02 | [pearl_trainer] epoch #408 | Training... +2025-04-03 16:34:34 | [pearl_trainer] epoch #408 | Evaluating... +2025-04-03 16:34:34 | [pearl_trainer] epoch #408 | Sampling for adapation and meta-testing... +2025-04-03 16:36:23 | [pearl_trainer] epoch #408 | Finished meta-testing... +2025-04-03 16:36:23 | [pearl_trainer] epoch #408 | Saving snapshot... +2025-04-03 16:36:24 | [pearl_trainer] epoch #408 | Saved +2025-04-03 16:36:24 | [pearl_trainer] epoch #408 | Time 96697.30 s +2025-04-03 16:36:24 | [pearl_trainer] epoch #408 | EpochTime 234.33 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -38.1569 +MetaTest/Average/AverageReturn -38.1569 +MetaTest/Average/Iteration 408 +MetaTest/Average/MaxReturn -22.5651 +MetaTest/Average/MinReturn -53.6749 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 11.1846 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -38.1569 +MetaTest/__unnamed_task__/AverageReturn -38.1569 +MetaTest/__unnamed_task__/Iteration 408 +MetaTest/__unnamed_task__/MaxReturn -22.5651 +MetaTest/__unnamed_task__/MinReturn -53.6749 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 11.1846 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 669400 +------------------------------------------------- ----------- +2025-04-03 16:36:56 | [pearl_trainer] epoch #409 | Training... +2025-04-03 16:38:27 | [pearl_trainer] epoch #409 | Evaluating... +2025-04-03 16:38:27 | [pearl_trainer] epoch #409 | Sampling for adapation and meta-testing... +2025-04-03 16:40:17 | [pearl_trainer] epoch #409 | Finished meta-testing... +2025-04-03 16:40:17 | [pearl_trainer] epoch #409 | Saving snapshot... +2025-04-03 16:40:18 | [pearl_trainer] epoch #409 | Saved +2025-04-03 16:40:18 | [pearl_trainer] epoch #409 | Time 96930.61 s +2025-04-03 16:40:18 | [pearl_trainer] epoch #409 | EpochTime 233.30 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 9.67759 +MetaTest/Average/AverageReturn 9.67759 +MetaTest/Average/Iteration 409 +MetaTest/Average/MaxReturn 133.816 +MetaTest/Average/MinReturn -45.319 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 69.6247 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 9.67759 +MetaTest/__unnamed_task__/AverageReturn 9.67759 +MetaTest/__unnamed_task__/Iteration 409 +MetaTest/__unnamed_task__/MaxReturn 133.816 +MetaTest/__unnamed_task__/MinReturn -45.319 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 69.6247 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 671000 +------------------------------------------------- ------------ +2025-04-03 16:40:49 | [pearl_trainer] epoch #410 | Training... +2025-04-03 16:42:13 | [pearl_trainer] epoch #410 | Evaluating... +2025-04-03 16:42:13 | [pearl_trainer] epoch #410 | Sampling for adapation and meta-testing... +2025-04-03 16:44:02 | [pearl_trainer] epoch #410 | Finished meta-testing... +2025-04-03 16:44:02 | [pearl_trainer] epoch #410 | Saving snapshot... +2025-04-03 16:44:03 | [pearl_trainer] epoch #410 | Saved +2025-04-03 16:44:03 | [pearl_trainer] epoch #410 | Time 97156.33 s +2025-04-03 16:44:03 | [pearl_trainer] epoch #410 | EpochTime 225.72 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 23.351 +MetaTest/Average/AverageReturn 23.351 +MetaTest/Average/Iteration 410 +MetaTest/Average/MaxReturn 63.2932 +MetaTest/Average/MinReturn -40.1227 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 43.6718 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 23.351 +MetaTest/__unnamed_task__/AverageReturn 23.351 +MetaTest/__unnamed_task__/Iteration 410 +MetaTest/__unnamed_task__/MaxReturn 63.2932 +MetaTest/__unnamed_task__/MinReturn -40.1227 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 43.6718 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 672600 +------------------------------------------------- ----------- +2025-04-03 16:44:35 | [pearl_trainer] epoch #411 | Training... +2025-04-03 16:46:00 | [pearl_trainer] epoch #411 | Evaluating... +2025-04-03 16:46:00 | [pearl_trainer] epoch #411 | Sampling for adapation and meta-testing... +2025-04-03 16:47:49 | [pearl_trainer] epoch #411 | Finished meta-testing... +2025-04-03 16:47:49 | [pearl_trainer] epoch #411 | Saving snapshot... +2025-04-03 16:47:50 | [pearl_trainer] epoch #411 | Saved +2025-04-03 16:47:50 | [pearl_trainer] epoch #411 | Time 97382.68 s +2025-04-03 16:47:50 | [pearl_trainer] epoch #411 | EpochTime 226.34 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -17.1627 +MetaTest/Average/AverageReturn -17.1627 +MetaTest/Average/Iteration 411 +MetaTest/Average/MaxReturn 52.1466 +MetaTest/Average/MinReturn -75.7605 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 49.6419 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -17.1627 +MetaTest/__unnamed_task__/AverageReturn -17.1627 +MetaTest/__unnamed_task__/Iteration 411 +MetaTest/__unnamed_task__/MaxReturn 52.1466 +MetaTest/__unnamed_task__/MinReturn -75.7605 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 49.6419 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 674200 +------------------------------------------------- ----------- +2025-04-03 16:48:21 | [pearl_trainer] epoch #412 | Training... +2025-04-03 16:49:49 | [pearl_trainer] epoch #412 | Evaluating... +2025-04-03 16:49:49 | [pearl_trainer] epoch #412 | Sampling for adapation and meta-testing... +2025-04-03 16:51:38 | [pearl_trainer] epoch #412 | Finished meta-testing... +2025-04-03 16:51:38 | [pearl_trainer] epoch #412 | Saving snapshot... +2025-04-03 16:51:39 | [pearl_trainer] epoch #412 | Saved +2025-04-03 16:51:39 | [pearl_trainer] epoch #412 | Time 97612.28 s +2025-04-03 16:51:39 | [pearl_trainer] epoch #412 | EpochTime 229.61 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 9.5189 +MetaTest/Average/AverageReturn 9.5189 +MetaTest/Average/Iteration 412 +MetaTest/Average/MaxReturn 108.931 +MetaTest/Average/MinReturn -42.6259 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 56.6108 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 9.5189 +MetaTest/__unnamed_task__/AverageReturn 9.5189 +MetaTest/__unnamed_task__/Iteration 412 +MetaTest/__unnamed_task__/MaxReturn 108.931 +MetaTest/__unnamed_task__/MinReturn -42.6259 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 56.6108 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 675800 +------------------------------------------------- ----------- +2025-04-03 16:52:11 | [pearl_trainer] epoch #413 | Training... +2025-04-03 16:53:34 | [pearl_trainer] epoch #413 | Evaluating... +2025-04-03 16:53:34 | [pearl_trainer] epoch #413 | Sampling for adapation and meta-testing... +2025-04-03 16:55:29 | [pearl_trainer] epoch #413 | Finished meta-testing... +2025-04-03 16:55:29 | [pearl_trainer] epoch #413 | Saving snapshot... +2025-04-03 16:55:30 | [pearl_trainer] epoch #413 | Saved +2025-04-03 16:55:30 | [pearl_trainer] epoch #413 | Time 97843.32 s +2025-04-03 16:55:30 | [pearl_trainer] epoch #413 | EpochTime 231.03 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 1.96549 +MetaTest/Average/AverageReturn 1.96549 +MetaTest/Average/Iteration 413 +MetaTest/Average/MaxReturn 46.4565 +MetaTest/Average/MinReturn -27.9972 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 28.4451 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 1.96549 +MetaTest/__unnamed_task__/AverageReturn 1.96549 +MetaTest/__unnamed_task__/Iteration 413 +MetaTest/__unnamed_task__/MaxReturn 46.4565 +MetaTest/__unnamed_task__/MinReturn -27.9972 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 28.4451 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 677400 +------------------------------------------------- ------------ +2025-04-03 16:56:02 | [pearl_trainer] epoch #414 | Training... +2025-04-03 16:57:28 | [pearl_trainer] epoch #414 | Evaluating... +2025-04-03 16:57:28 | [pearl_trainer] epoch #414 | Sampling for adapation and meta-testing... +2025-04-03 16:59:19 | [pearl_trainer] epoch #414 | Finished meta-testing... +2025-04-03 16:59:19 | [pearl_trainer] epoch #414 | Saving snapshot... +2025-04-03 16:59:20 | [pearl_trainer] epoch #414 | Saved +2025-04-03 16:59:20 | [pearl_trainer] epoch #414 | Time 98072.98 s +2025-04-03 16:59:20 | [pearl_trainer] epoch #414 | EpochTime 229.66 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 24.0606 +MetaTest/Average/AverageReturn 24.0606 +MetaTest/Average/Iteration 414 +MetaTest/Average/MaxReturn 107.521 +MetaTest/Average/MinReturn -22.3719 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 49.0327 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 24.0606 +MetaTest/__unnamed_task__/AverageReturn 24.0606 +MetaTest/__unnamed_task__/Iteration 414 +MetaTest/__unnamed_task__/MaxReturn 107.521 +MetaTest/__unnamed_task__/MinReturn -22.3719 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 49.0327 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 679000 +------------------------------------------------- ----------- +2025-04-03 16:59:52 | [pearl_trainer] epoch #415 | Training... +2025-04-03 17:01:17 | [pearl_trainer] epoch #415 | Evaluating... +2025-04-03 17:01:17 | [pearl_trainer] epoch #415 | Sampling for adapation and meta-testing... +2025-04-03 17:03:07 | [pearl_trainer] epoch #415 | Finished meta-testing... +2025-04-03 17:03:07 | [pearl_trainer] epoch #415 | Saving snapshot... +2025-04-03 17:03:08 | [pearl_trainer] epoch #415 | Saved +2025-04-03 17:03:08 | [pearl_trainer] epoch #415 | Time 98301.27 s +2025-04-03 17:03:08 | [pearl_trainer] epoch #415 | EpochTime 228.29 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -7.32245 +MetaTest/Average/AverageReturn -7.32245 +MetaTest/Average/Iteration 415 +MetaTest/Average/MaxReturn 106.974 +MetaTest/Average/MinReturn -47.2075 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 57.8872 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -7.32245 +MetaTest/__unnamed_task__/AverageReturn -7.32245 +MetaTest/__unnamed_task__/Iteration 415 +MetaTest/__unnamed_task__/MaxReturn 106.974 +MetaTest/__unnamed_task__/MinReturn -47.2075 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 57.8872 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 680600 +------------------------------------------------- ------------ +2025-04-03 17:03:40 | [pearl_trainer] epoch #416 | Training... +2025-04-03 17:05:10 | [pearl_trainer] epoch #416 | Evaluating... +2025-04-03 17:05:10 | [pearl_trainer] epoch #416 | Sampling for adapation and meta-testing... +2025-04-03 17:07:01 | [pearl_trainer] epoch #416 | Finished meta-testing... +2025-04-03 17:07:01 | [pearl_trainer] epoch #416 | Saving snapshot... +2025-04-03 17:07:02 | [pearl_trainer] epoch #416 | Saved +2025-04-03 17:07:02 | [pearl_trainer] epoch #416 | Time 98534.81 s +2025-04-03 17:07:02 | [pearl_trainer] epoch #416 | EpochTime 233.54 s +------------------------------------------------- -------------- +MetaTest/Average/AverageDiscountedReturn -0.0626967 +MetaTest/Average/AverageReturn -0.0626967 +MetaTest/Average/Iteration 416 +MetaTest/Average/MaxReturn 21.8556 +MetaTest/Average/MinReturn -26.2582 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 19.0848 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -0.0626967 +MetaTest/__unnamed_task__/AverageReturn -0.0626967 +MetaTest/__unnamed_task__/Iteration 416 +MetaTest/__unnamed_task__/MaxReturn 21.8556 +MetaTest/__unnamed_task__/MinReturn -26.2582 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 19.0848 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 682200 +------------------------------------------------- -------------- +2025-04-03 17:07:34 | [pearl_trainer] epoch #417 | Training... +2025-04-03 17:08:59 | [pearl_trainer] epoch #417 | Evaluating... +2025-04-03 17:08:59 | [pearl_trainer] epoch #417 | Sampling for adapation and meta-testing... +2025-04-03 17:10:49 | [pearl_trainer] epoch #417 | Finished meta-testing... +2025-04-03 17:10:49 | [pearl_trainer] epoch #417 | Saving snapshot... +2025-04-03 17:10:50 | [pearl_trainer] epoch #417 | Saved +2025-04-03 17:10:50 | [pearl_trainer] epoch #417 | Time 98762.80 s +2025-04-03 17:10:50 | [pearl_trainer] epoch #417 | EpochTime 227.99 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -5.26511 +MetaTest/Average/AverageReturn -5.26511 +MetaTest/Average/Iteration 417 +MetaTest/Average/MaxReturn 40.3673 +MetaTest/Average/MinReturn -29.8355 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 25.8117 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -5.26511 +MetaTest/__unnamed_task__/AverageReturn -5.26511 +MetaTest/__unnamed_task__/Iteration 417 +MetaTest/__unnamed_task__/MaxReturn 40.3673 +MetaTest/__unnamed_task__/MinReturn -29.8355 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 25.8117 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 683800 +------------------------------------------------- ------------ +2025-04-03 17:11:22 | [pearl_trainer] epoch #418 | Training... +2025-04-03 17:12:52 | [pearl_trainer] epoch #418 | Evaluating... +2025-04-03 17:12:52 | [pearl_trainer] epoch #418 | Sampling for adapation and meta-testing... +2025-04-03 17:14:40 | [pearl_trainer] epoch #418 | Finished meta-testing... +2025-04-03 17:14:40 | [pearl_trainer] epoch #418 | Saving snapshot... +2025-04-03 17:14:42 | [pearl_trainer] epoch #418 | Saved +2025-04-03 17:14:42 | [pearl_trainer] epoch #418 | Time 98994.79 s +2025-04-03 17:14:42 | [pearl_trainer] epoch #418 | EpochTime 231.98 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 4.22653 +MetaTest/Average/AverageReturn 4.22653 +MetaTest/Average/Iteration 418 +MetaTest/Average/MaxReturn 84.8351 +MetaTest/Average/MinReturn -62.8133 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 64.5809 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 4.22653 +MetaTest/__unnamed_task__/AverageReturn 4.22653 +MetaTest/__unnamed_task__/Iteration 418 +MetaTest/__unnamed_task__/MaxReturn 84.8351 +MetaTest/__unnamed_task__/MinReturn -62.8133 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 64.5809 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 685400 +------------------------------------------------- ------------ +2025-04-03 17:15:13 | [pearl_trainer] epoch #419 | Training... +2025-04-03 17:16:39 | [pearl_trainer] epoch #419 | Evaluating... +2025-04-03 17:16:39 | [pearl_trainer] epoch #419 | Sampling for adapation and meta-testing... +2025-04-03 17:18:28 | [pearl_trainer] epoch #419 | Finished meta-testing... +2025-04-03 17:18:28 | [pearl_trainer] epoch #419 | Saving snapshot... +2025-04-03 17:18:29 | [pearl_trainer] epoch #419 | Saved +2025-04-03 17:18:29 | [pearl_trainer] epoch #419 | Time 99221.64 s +2025-04-03 17:18:29 | [pearl_trainer] epoch #419 | EpochTime 226.85 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 12.1852 +MetaTest/Average/AverageReturn 12.1852 +MetaTest/Average/Iteration 419 +MetaTest/Average/MaxReturn 106.512 +MetaTest/Average/MinReturn -53.6617 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 61.1193 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 12.1852 +MetaTest/__unnamed_task__/AverageReturn 12.1852 +MetaTest/__unnamed_task__/Iteration 419 +MetaTest/__unnamed_task__/MaxReturn 106.512 +MetaTest/__unnamed_task__/MinReturn -53.6617 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 61.1193 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 687000 +------------------------------------------------- ----------- +2025-04-03 17:19:00 | [pearl_trainer] epoch #420 | Training... +2025-04-03 17:20:23 | [pearl_trainer] epoch #420 | Evaluating... +2025-04-03 17:20:23 | [pearl_trainer] epoch #420 | Sampling for adapation and meta-testing... +2025-04-03 17:22:12 | [pearl_trainer] epoch #420 | Finished meta-testing... +2025-04-03 17:22:12 | [pearl_trainer] epoch #420 | Saving snapshot... +2025-04-03 17:22:14 | [pearl_trainer] epoch #420 | Saved +2025-04-03 17:22:14 | [pearl_trainer] epoch #420 | Time 99446.64 s +2025-04-03 17:22:14 | [pearl_trainer] epoch #420 | EpochTime 225.00 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 26.8934 +MetaTest/Average/AverageReturn 26.8934 +MetaTest/Average/Iteration 420 +MetaTest/Average/MaxReturn 133.334 +MetaTest/Average/MinReturn -32.7339 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 67.3392 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 26.8934 +MetaTest/__unnamed_task__/AverageReturn 26.8934 +MetaTest/__unnamed_task__/Iteration 420 +MetaTest/__unnamed_task__/MaxReturn 133.334 +MetaTest/__unnamed_task__/MinReturn -32.7339 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 67.3392 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 688600 +------------------------------------------------- ----------- +2025-04-03 17:22:47 | [pearl_trainer] epoch #421 | Training... +2025-04-03 17:24:16 | [pearl_trainer] epoch #421 | Evaluating... +2025-04-03 17:24:16 | [pearl_trainer] epoch #421 | Sampling for adapation and meta-testing... +2025-04-03 17:26:05 | [pearl_trainer] epoch #421 | Finished meta-testing... +2025-04-03 17:26:05 | [pearl_trainer] epoch #421 | Saving snapshot... +2025-04-03 17:26:06 | [pearl_trainer] epoch #421 | Saved +2025-04-03 17:26:06 | [pearl_trainer] epoch #421 | Time 99679.34 s +2025-04-03 17:26:06 | [pearl_trainer] epoch #421 | EpochTime 232.69 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 38.9706 +MetaTest/Average/AverageReturn 38.9706 +MetaTest/Average/Iteration 421 +MetaTest/Average/MaxReturn 121.753 +MetaTest/Average/MinReturn -27.6133 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 58.8008 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 38.9706 +MetaTest/__unnamed_task__/AverageReturn 38.9706 +MetaTest/__unnamed_task__/Iteration 421 +MetaTest/__unnamed_task__/MaxReturn 121.753 +MetaTest/__unnamed_task__/MinReturn -27.6133 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 58.8008 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 690200 +------------------------------------------------- ----------- +2025-04-03 17:26:38 | [pearl_trainer] epoch #422 | Training... +2025-04-03 17:28:05 | [pearl_trainer] epoch #422 | Evaluating... +2025-04-03 17:28:05 | [pearl_trainer] epoch #422 | Sampling for adapation and meta-testing... +2025-04-03 17:29:53 | [pearl_trainer] epoch #422 | Finished meta-testing... +2025-04-03 17:29:53 | [pearl_trainer] epoch #422 | Saving snapshot... +2025-04-03 17:29:54 | [pearl_trainer] epoch #422 | Saved +2025-04-03 17:29:54 | [pearl_trainer] epoch #422 | Time 99907.45 s +2025-04-03 17:29:54 | [pearl_trainer] epoch #422 | EpochTime 228.11 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 51.912 +MetaTest/Average/AverageReturn 51.912 +MetaTest/Average/Iteration 422 +MetaTest/Average/MaxReturn 99.4377 +MetaTest/Average/MinReturn -12.0865 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 47.9696 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 51.912 +MetaTest/__unnamed_task__/AverageReturn 51.912 +MetaTest/__unnamed_task__/Iteration 422 +MetaTest/__unnamed_task__/MaxReturn 99.4377 +MetaTest/__unnamed_task__/MinReturn -12.0865 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 47.9696 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 691800 +------------------------------------------------- ----------- +2025-04-03 17:30:26 | [pearl_trainer] epoch #423 | Training... +2025-04-03 17:31:57 | [pearl_trainer] epoch #423 | Evaluating... +2025-04-03 17:31:57 | [pearl_trainer] epoch #423 | Sampling for adapation and meta-testing... +2025-04-03 17:33:48 | [pearl_trainer] epoch #423 | Finished meta-testing... +2025-04-03 17:33:48 | [pearl_trainer] epoch #423 | Saving snapshot... +2025-04-03 17:33:49 | [pearl_trainer] epoch #423 | Saved +2025-04-03 17:33:49 | [pearl_trainer] epoch #423 | Time 100142.18 s +2025-04-03 17:33:49 | [pearl_trainer] epoch #423 | EpochTime 234.72 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -25.5766 +MetaTest/Average/AverageReturn -25.5766 +MetaTest/Average/Iteration 423 +MetaTest/Average/MaxReturn -10.8325 +MetaTest/Average/MinReturn -48.2838 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 12.5134 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -25.5766 +MetaTest/__unnamed_task__/AverageReturn -25.5766 +MetaTest/__unnamed_task__/Iteration 423 +MetaTest/__unnamed_task__/MaxReturn -10.8325 +MetaTest/__unnamed_task__/MinReturn -48.2838 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 12.5134 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 693400 +------------------------------------------------- ----------- +2025-04-03 17:34:21 | [pearl_trainer] epoch #424 | Training... +2025-04-03 17:35:47 | [pearl_trainer] epoch #424 | Evaluating... +2025-04-03 17:35:47 | [pearl_trainer] epoch #424 | Sampling for adapation and meta-testing... +2025-04-03 17:37:37 | [pearl_trainer] epoch #424 | Finished meta-testing... +2025-04-03 17:37:37 | [pearl_trainer] epoch #424 | Saving snapshot... +2025-04-03 17:37:38 | [pearl_trainer] epoch #424 | Saved +2025-04-03 17:37:38 | [pearl_trainer] epoch #424 | Time 100370.60 s +2025-04-03 17:37:38 | [pearl_trainer] epoch #424 | EpochTime 228.42 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -7.31726 +MetaTest/Average/AverageReturn -7.31726 +MetaTest/Average/Iteration 424 +MetaTest/Average/MaxReturn 43.7709 +MetaTest/Average/MinReturn -29.9512 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 26.2195 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -7.31726 +MetaTest/__unnamed_task__/AverageReturn -7.31726 +MetaTest/__unnamed_task__/Iteration 424 +MetaTest/__unnamed_task__/MaxReturn 43.7709 +MetaTest/__unnamed_task__/MinReturn -29.9512 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 26.2195 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 695000 +------------------------------------------------- ------------ +2025-04-03 17:38:09 | [pearl_trainer] epoch #425 | Training... +2025-04-03 17:39:33 | [pearl_trainer] epoch #425 | Evaluating... +2025-04-03 17:39:33 | [pearl_trainer] epoch #425 | Sampling for adapation and meta-testing... +2025-04-03 17:41:22 | [pearl_trainer] epoch #425 | Finished meta-testing... +2025-04-03 17:41:22 | [pearl_trainer] epoch #425 | Saving snapshot... +2025-04-03 17:41:23 | [pearl_trainer] epoch #425 | Saved +2025-04-03 17:41:23 | [pearl_trainer] epoch #425 | Time 100596.35 s +2025-04-03 17:41:23 | [pearl_trainer] epoch #425 | EpochTime 225.75 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -10.0542 +MetaTest/Average/AverageReturn -10.0542 +MetaTest/Average/Iteration 425 +MetaTest/Average/MaxReturn 16.7501 +MetaTest/Average/MinReturn -28.2766 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 15.3711 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -10.0542 +MetaTest/__unnamed_task__/AverageReturn -10.0542 +MetaTest/__unnamed_task__/Iteration 425 +MetaTest/__unnamed_task__/MaxReturn 16.7501 +MetaTest/__unnamed_task__/MinReturn -28.2766 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 15.3711 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 696600 +------------------------------------------------- ----------- +2025-04-03 17:41:55 | [pearl_trainer] epoch #426 | Training... +2025-04-03 17:43:20 | [pearl_trainer] epoch #426 | Evaluating... +2025-04-03 17:43:20 | [pearl_trainer] epoch #426 | Sampling for adapation and meta-testing... +2025-04-03 17:45:07 | [pearl_trainer] epoch #426 | Finished meta-testing... +2025-04-03 17:45:07 | [pearl_trainer] epoch #426 | Saving snapshot... +2025-04-03 17:45:08 | [pearl_trainer] epoch #426 | Saved +2025-04-03 17:45:08 | [pearl_trainer] epoch #426 | Time 100821.49 s +2025-04-03 17:45:08 | [pearl_trainer] epoch #426 | EpochTime 225.13 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 20.5901 +MetaTest/Average/AverageReturn 20.5901 +MetaTest/Average/Iteration 426 +MetaTest/Average/MaxReturn 67.6486 +MetaTest/Average/MinReturn -20.4607 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 34.9636 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 20.5901 +MetaTest/__unnamed_task__/AverageReturn 20.5901 +MetaTest/__unnamed_task__/Iteration 426 +MetaTest/__unnamed_task__/MaxReturn 67.6486 +MetaTest/__unnamed_task__/MinReturn -20.4607 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 34.9636 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 698200 +------------------------------------------------- ----------- +2025-04-03 17:45:40 | [pearl_trainer] epoch #427 | Training... +2025-04-03 17:47:08 | [pearl_trainer] epoch #427 | Evaluating... +2025-04-03 17:47:08 | [pearl_trainer] epoch #427 | Sampling for adapation and meta-testing... +2025-04-03 17:48:54 | [pearl_trainer] epoch #427 | Finished meta-testing... +2025-04-03 17:48:54 | [pearl_trainer] epoch #427 | Saving snapshot... +2025-04-03 17:48:55 | [pearl_trainer] epoch #427 | Saved +2025-04-03 17:48:55 | [pearl_trainer] epoch #427 | Time 101048.14 s +2025-04-03 17:48:55 | [pearl_trainer] epoch #427 | EpochTime 226.65 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 16.2387 +MetaTest/Average/AverageReturn 16.2387 +MetaTest/Average/Iteration 427 +MetaTest/Average/MaxReturn 80.1615 +MetaTest/Average/MinReturn -52.1116 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 53.7284 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 16.2387 +MetaTest/__unnamed_task__/AverageReturn 16.2387 +MetaTest/__unnamed_task__/Iteration 427 +MetaTest/__unnamed_task__/MaxReturn 80.1615 +MetaTest/__unnamed_task__/MinReturn -52.1116 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 53.7284 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 699800 +------------------------------------------------- ----------- +2025-04-03 17:49:27 | [pearl_trainer] epoch #428 | Training... +2025-04-03 17:50:50 | [pearl_trainer] epoch #428 | Evaluating... +2025-04-03 17:50:50 | [pearl_trainer] epoch #428 | Sampling for adapation and meta-testing... +2025-04-03 17:52:44 | [pearl_trainer] epoch #428 | Finished meta-testing... +2025-04-03 17:52:44 | [pearl_trainer] epoch #428 | Saving snapshot... +2025-04-03 17:52:46 | [pearl_trainer] epoch #428 | Saved +2025-04-03 17:52:46 | [pearl_trainer] epoch #428 | Time 101278.89 s +2025-04-03 17:52:46 | [pearl_trainer] epoch #428 | EpochTime 230.74 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 19.0562 +MetaTest/Average/AverageReturn 19.0562 +MetaTest/Average/Iteration 428 +MetaTest/Average/MaxReturn 93.8806 +MetaTest/Average/MinReturn -21.6725 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 48.2906 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 19.0562 +MetaTest/__unnamed_task__/AverageReturn 19.0562 +MetaTest/__unnamed_task__/Iteration 428 +MetaTest/__unnamed_task__/MaxReturn 93.8806 +MetaTest/__unnamed_task__/MinReturn -21.6725 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 48.2906 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 701400 +------------------------------------------------- ----------- +2025-04-03 17:53:18 | [pearl_trainer] epoch #429 | Training... +2025-04-03 17:54:43 | [pearl_trainer] epoch #429 | Evaluating... +2025-04-03 17:54:43 | [pearl_trainer] epoch #429 | Sampling for adapation and meta-testing... +2025-04-03 17:56:31 | [pearl_trainer] epoch #429 | Finished meta-testing... +2025-04-03 17:56:31 | [pearl_trainer] epoch #429 | Saving snapshot... +2025-04-03 17:56:32 | [pearl_trainer] epoch #429 | Saved +2025-04-03 17:56:32 | [pearl_trainer] epoch #429 | Time 101504.68 s +2025-04-03 17:56:32 | [pearl_trainer] epoch #429 | EpochTime 225.79 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -16.1895 +MetaTest/Average/AverageReturn -16.1895 +MetaTest/Average/Iteration 429 +MetaTest/Average/MaxReturn 30.2516 +MetaTest/Average/MinReturn -44.3204 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 25.633 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -16.1895 +MetaTest/__unnamed_task__/AverageReturn -16.1895 +MetaTest/__unnamed_task__/Iteration 429 +MetaTest/__unnamed_task__/MaxReturn 30.2516 +MetaTest/__unnamed_task__/MinReturn -44.3204 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 25.633 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 703000 +------------------------------------------------- ----------- +2025-04-03 17:57:03 | [pearl_trainer] epoch #430 | Training... +2025-04-03 17:58:27 | [pearl_trainer] epoch #430 | Evaluating... +2025-04-03 17:58:27 | [pearl_trainer] epoch #430 | Sampling for adapation and meta-testing... +2025-04-03 18:00:18 | [pearl_trainer] epoch #430 | Finished meta-testing... +2025-04-03 18:00:18 | [pearl_trainer] epoch #430 | Saving snapshot... +2025-04-03 18:00:19 | [pearl_trainer] epoch #430 | Saved +2025-04-03 18:00:19 | [pearl_trainer] epoch #430 | Time 101732.36 s +2025-04-03 18:00:19 | [pearl_trainer] epoch #430 | EpochTime 227.68 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 13.4377 +MetaTest/Average/AverageReturn 13.4377 +MetaTest/Average/Iteration 430 +MetaTest/Average/MaxReturn 95.1138 +MetaTest/Average/MinReturn -28.742 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 43.5508 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 13.4377 +MetaTest/__unnamed_task__/AverageReturn 13.4377 +MetaTest/__unnamed_task__/Iteration 430 +MetaTest/__unnamed_task__/MaxReturn 95.1138 +MetaTest/__unnamed_task__/MinReturn -28.742 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 43.5508 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 704600 +------------------------------------------------- ----------- +2025-04-03 18:00:52 | [pearl_trainer] epoch #431 | Training... +2025-04-03 18:02:18 | [pearl_trainer] epoch #431 | Evaluating... +2025-04-03 18:02:18 | [pearl_trainer] epoch #431 | Sampling for adapation and meta-testing... +2025-04-03 18:04:08 | [pearl_trainer] epoch #431 | Finished meta-testing... +2025-04-03 18:04:08 | [pearl_trainer] epoch #431 | Saving snapshot... +2025-04-03 18:04:09 | [pearl_trainer] epoch #431 | Saved +2025-04-03 18:04:09 | [pearl_trainer] epoch #431 | Time 101961.69 s +2025-04-03 18:04:09 | [pearl_trainer] epoch #431 | EpochTime 229.33 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 19.5951 +MetaTest/Average/AverageReturn 19.5951 +MetaTest/Average/Iteration 431 +MetaTest/Average/MaxReturn 67.8398 +MetaTest/Average/MinReturn -38.0156 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 40.9953 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 19.5951 +MetaTest/__unnamed_task__/AverageReturn 19.5951 +MetaTest/__unnamed_task__/Iteration 431 +MetaTest/__unnamed_task__/MaxReturn 67.8398 +MetaTest/__unnamed_task__/MinReturn -38.0156 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 40.9953 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 706200 +------------------------------------------------- ----------- +2025-04-03 18:04:40 | [pearl_trainer] epoch #432 | Training... +2025-04-03 18:06:06 | [pearl_trainer] epoch #432 | Evaluating... +2025-04-03 18:06:06 | [pearl_trainer] epoch #432 | Sampling for adapation and meta-testing... +2025-04-03 18:07:55 | [pearl_trainer] epoch #432 | Finished meta-testing... +2025-04-03 18:07:55 | [pearl_trainer] epoch #432 | Saving snapshot... +2025-04-03 18:07:56 | [pearl_trainer] epoch #432 | Saved +2025-04-03 18:07:56 | [pearl_trainer] epoch #432 | Time 102189.00 s +2025-04-03 18:07:56 | [pearl_trainer] epoch #432 | EpochTime 227.30 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 7.80155 +MetaTest/Average/AverageReturn 7.80155 +MetaTest/Average/Iteration 432 +MetaTest/Average/MaxReturn 69.9329 +MetaTest/Average/MinReturn -34.4648 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 35.479 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 7.80155 +MetaTest/__unnamed_task__/AverageReturn 7.80155 +MetaTest/__unnamed_task__/Iteration 432 +MetaTest/__unnamed_task__/MaxReturn 69.9329 +MetaTest/__unnamed_task__/MinReturn -34.4648 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 35.479 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 707800 +------------------------------------------------- ------------ +2025-04-03 18:08:27 | [pearl_trainer] epoch #433 | Training... +2025-04-03 18:09:50 | [pearl_trainer] epoch #433 | Evaluating... +2025-04-03 18:09:50 | [pearl_trainer] epoch #433 | Sampling for adapation and meta-testing... +2025-04-03 18:11:39 | [pearl_trainer] epoch #433 | Finished meta-testing... +2025-04-03 18:11:39 | [pearl_trainer] epoch #433 | Saving snapshot... +2025-04-03 18:11:40 | [pearl_trainer] epoch #433 | Saved +2025-04-03 18:11:40 | [pearl_trainer] epoch #433 | Time 102413.18 s +2025-04-03 18:11:40 | [pearl_trainer] epoch #433 | EpochTime 224.18 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 5.43056 +MetaTest/Average/AverageReturn 5.43056 +MetaTest/Average/Iteration 433 +MetaTest/Average/MaxReturn 48.3915 +MetaTest/Average/MinReturn -16.9975 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 22.9568 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 5.43056 +MetaTest/__unnamed_task__/AverageReturn 5.43056 +MetaTest/__unnamed_task__/Iteration 433 +MetaTest/__unnamed_task__/MaxReturn 48.3915 +MetaTest/__unnamed_task__/MinReturn -16.9975 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 22.9568 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 709400 +------------------------------------------------- ------------ +2025-04-03 18:12:12 | [pearl_trainer] epoch #434 | Training... +2025-04-03 18:13:36 | [pearl_trainer] epoch #434 | Evaluating... +2025-04-03 18:13:36 | [pearl_trainer] epoch #434 | Sampling for adapation and meta-testing... +2025-04-03 18:15:23 | [pearl_trainer] epoch #434 | Finished meta-testing... +2025-04-03 18:15:23 | [pearl_trainer] epoch #434 | Saving snapshot... +2025-04-03 18:15:24 | [pearl_trainer] epoch #434 | Saved +2025-04-03 18:15:24 | [pearl_trainer] epoch #434 | Time 102637.48 s +2025-04-03 18:15:24 | [pearl_trainer] epoch #434 | EpochTime 224.30 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 33.714 +MetaTest/Average/AverageReturn 33.714 +MetaTest/Average/Iteration 434 +MetaTest/Average/MaxReturn 108.075 +MetaTest/Average/MinReturn -20.0828 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 48.8527 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 33.714 +MetaTest/__unnamed_task__/AverageReturn 33.714 +MetaTest/__unnamed_task__/Iteration 434 +MetaTest/__unnamed_task__/MaxReturn 108.075 +MetaTest/__unnamed_task__/MinReturn -20.0828 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 48.8527 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 711000 +------------------------------------------------- ----------- +2025-04-03 18:15:56 | [pearl_trainer] epoch #435 | Training... +2025-04-03 18:17:22 | [pearl_trainer] epoch #435 | Evaluating... +2025-04-03 18:17:22 | [pearl_trainer] epoch #435 | Sampling for adapation and meta-testing... +2025-04-03 18:19:14 | [pearl_trainer] epoch #435 | Finished meta-testing... +2025-04-03 18:19:14 | [pearl_trainer] epoch #435 | Saving snapshot... +2025-04-03 18:19:15 | [pearl_trainer] epoch #435 | Saved +2025-04-03 18:19:15 | [pearl_trainer] epoch #435 | Time 102868.10 s +2025-04-03 18:19:15 | [pearl_trainer] epoch #435 | EpochTime 230.62 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 32.3647 +MetaTest/Average/AverageReturn 32.3647 +MetaTest/Average/Iteration 435 +MetaTest/Average/MaxReturn 101.307 +MetaTest/Average/MinReturn -35.1101 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 59.0282 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 32.3647 +MetaTest/__unnamed_task__/AverageReturn 32.3647 +MetaTest/__unnamed_task__/Iteration 435 +MetaTest/__unnamed_task__/MaxReturn 101.307 +MetaTest/__unnamed_task__/MinReturn -35.1101 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 59.0282 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 712600 +------------------------------------------------- ----------- +2025-04-03 18:19:46 | [pearl_trainer] epoch #436 | Training... +2025-04-03 18:21:13 | [pearl_trainer] epoch #436 | Evaluating... +2025-04-03 18:21:13 | [pearl_trainer] epoch #436 | Sampling for adapation and meta-testing... +2025-04-03 18:23:07 | [pearl_trainer] epoch #436 | Finished meta-testing... +2025-04-03 18:23:07 | [pearl_trainer] epoch #436 | Saving snapshot... +2025-04-03 18:23:08 | [pearl_trainer] epoch #436 | Saved +2025-04-03 18:23:08 | [pearl_trainer] epoch #436 | Time 103100.94 s +2025-04-03 18:23:08 | [pearl_trainer] epoch #436 | EpochTime 232.84 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 40.533 +MetaTest/Average/AverageReturn 40.533 +MetaTest/Average/Iteration 436 +MetaTest/Average/MaxReturn 114.189 +MetaTest/Average/MinReturn -28.7487 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 54.5034 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 40.533 +MetaTest/__unnamed_task__/AverageReturn 40.533 +MetaTest/__unnamed_task__/Iteration 436 +MetaTest/__unnamed_task__/MaxReturn 114.189 +MetaTest/__unnamed_task__/MinReturn -28.7487 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 54.5034 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 714200 +------------------------------------------------- ----------- +2025-04-03 18:23:40 | [pearl_trainer] epoch #437 | Training... +2025-04-03 18:25:06 | [pearl_trainer] epoch #437 | Evaluating... +2025-04-03 18:25:06 | [pearl_trainer] epoch #437 | Sampling for adapation and meta-testing... +2025-04-03 18:26:55 | [pearl_trainer] epoch #437 | Finished meta-testing... +2025-04-03 18:26:55 | [pearl_trainer] epoch #437 | Saving snapshot... +2025-04-03 18:26:57 | [pearl_trainer] epoch #437 | Saved +2025-04-03 18:26:57 | [pearl_trainer] epoch #437 | Time 103329.52 s +2025-04-03 18:26:57 | [pearl_trainer] epoch #437 | EpochTime 228.58 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 1.07882 +MetaTest/Average/AverageReturn 1.07882 +MetaTest/Average/Iteration 437 +MetaTest/Average/MaxReturn 86.4552 +MetaTest/Average/MinReturn -35.0412 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 43.4833 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 1.07882 +MetaTest/__unnamed_task__/AverageReturn 1.07882 +MetaTest/__unnamed_task__/Iteration 437 +MetaTest/__unnamed_task__/MaxReturn 86.4552 +MetaTest/__unnamed_task__/MinReturn -35.0412 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 43.4833 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 715800 +------------------------------------------------- ------------ +2025-04-03 18:27:29 | [pearl_trainer] epoch #438 | Training... +2025-04-03 18:28:51 | [pearl_trainer] epoch #438 | Evaluating... +2025-04-03 18:28:51 | [pearl_trainer] epoch #438 | Sampling for adapation and meta-testing... +2025-04-03 18:30:42 | [pearl_trainer] epoch #438 | Finished meta-testing... +2025-04-03 18:30:42 | [pearl_trainer] epoch #438 | Saving snapshot... +2025-04-03 18:30:43 | [pearl_trainer] epoch #438 | Saved +2025-04-03 18:30:43 | [pearl_trainer] epoch #438 | Time 103556.30 s +2025-04-03 18:30:43 | [pearl_trainer] epoch #438 | EpochTime 226.77 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 4.84468 +MetaTest/Average/AverageReturn 4.84468 +MetaTest/Average/Iteration 438 +MetaTest/Average/MaxReturn 44.4917 +MetaTest/Average/MinReturn -32.6912 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 27.7667 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 4.84468 +MetaTest/__unnamed_task__/AverageReturn 4.84468 +MetaTest/__unnamed_task__/Iteration 438 +MetaTest/__unnamed_task__/MaxReturn 44.4917 +MetaTest/__unnamed_task__/MinReturn -32.6912 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 27.7667 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 717400 +------------------------------------------------- ------------ +2025-04-03 18:31:16 | [pearl_trainer] epoch #439 | Training... +2025-04-03 18:32:40 | [pearl_trainer] epoch #439 | Evaluating... +2025-04-03 18:32:40 | [pearl_trainer] epoch #439 | Sampling for adapation and meta-testing... +2025-04-03 18:34:29 | [pearl_trainer] epoch #439 | Finished meta-testing... +2025-04-03 18:34:29 | [pearl_trainer] epoch #439 | Saving snapshot... +2025-04-03 18:34:31 | [pearl_trainer] epoch #439 | Saved +2025-04-03 18:34:31 | [pearl_trainer] epoch #439 | Time 103783.56 s +2025-04-03 18:34:31 | [pearl_trainer] epoch #439 | EpochTime 227.26 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 31.3788 +MetaTest/Average/AverageReturn 31.3788 +MetaTest/Average/Iteration 439 +MetaTest/Average/MaxReturn 93.4957 +MetaTest/Average/MinReturn -40.7696 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 53.3385 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 31.3788 +MetaTest/__unnamed_task__/AverageReturn 31.3788 +MetaTest/__unnamed_task__/Iteration 439 +MetaTest/__unnamed_task__/MaxReturn 93.4957 +MetaTest/__unnamed_task__/MinReturn -40.7696 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 53.3385 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 719000 +------------------------------------------------- ----------- +2025-04-03 18:35:02 | [pearl_trainer] epoch #440 | Training... +2025-04-03 18:36:27 | [pearl_trainer] epoch #440 | Evaluating... +2025-04-03 18:36:27 | [pearl_trainer] epoch #440 | Sampling for adapation and meta-testing... +2025-04-03 18:38:17 | [pearl_trainer] epoch #440 | Finished meta-testing... +2025-04-03 18:38:17 | [pearl_trainer] epoch #440 | Saving snapshot... +2025-04-03 18:38:19 | [pearl_trainer] epoch #440 | Saved +2025-04-03 18:38:19 | [pearl_trainer] epoch #440 | Time 104011.59 s +2025-04-03 18:38:19 | [pearl_trainer] epoch #440 | EpochTime 228.04 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 24.6641 +MetaTest/Average/AverageReturn 24.6641 +MetaTest/Average/Iteration 440 +MetaTest/Average/MaxReturn 104.288 +MetaTest/Average/MinReturn -37.1585 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 47.499 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 24.6641 +MetaTest/__unnamed_task__/AverageReturn 24.6641 +MetaTest/__unnamed_task__/Iteration 440 +MetaTest/__unnamed_task__/MaxReturn 104.288 +MetaTest/__unnamed_task__/MinReturn -37.1585 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 47.499 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 720600 +------------------------------------------------- ----------- +2025-04-03 18:38:50 | [pearl_trainer] epoch #441 | Training... +2025-04-03 18:40:14 | [pearl_trainer] epoch #441 | Evaluating... +2025-04-03 18:40:14 | [pearl_trainer] epoch #441 | Sampling for adapation and meta-testing... +2025-04-03 18:42:02 | [pearl_trainer] epoch #441 | Finished meta-testing... +2025-04-03 18:42:02 | [pearl_trainer] epoch #441 | Saving snapshot... +2025-04-03 18:42:03 | [pearl_trainer] epoch #441 | Saved +2025-04-03 18:42:03 | [pearl_trainer] epoch #441 | Time 104236.06 s +2025-04-03 18:42:03 | [pearl_trainer] epoch #441 | EpochTime 224.47 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 25.8248 +MetaTest/Average/AverageReturn 25.8248 +MetaTest/Average/Iteration 441 +MetaTest/Average/MaxReturn 89.1028 +MetaTest/Average/MinReturn -22.9805 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 41.2875 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 25.8248 +MetaTest/__unnamed_task__/AverageReturn 25.8248 +MetaTest/__unnamed_task__/Iteration 441 +MetaTest/__unnamed_task__/MaxReturn 89.1028 +MetaTest/__unnamed_task__/MinReturn -22.9805 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 41.2875 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 722200 +------------------------------------------------- ----------- +2025-04-03 18:42:34 | [pearl_trainer] epoch #442 | Training... +2025-04-03 18:44:01 | [pearl_trainer] epoch #442 | Evaluating... +2025-04-03 18:44:01 | [pearl_trainer] epoch #442 | Sampling for adapation and meta-testing... +2025-04-03 18:45:49 | [pearl_trainer] epoch #442 | Finished meta-testing... +2025-04-03 18:45:49 | [pearl_trainer] epoch #442 | Saving snapshot... +2025-04-03 18:45:50 | [pearl_trainer] epoch #442 | Saved +2025-04-03 18:45:50 | [pearl_trainer] epoch #442 | Time 104463.30 s +2025-04-03 18:45:50 | [pearl_trainer] epoch #442 | EpochTime 227.24 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 12.073 +MetaTest/Average/AverageReturn 12.073 +MetaTest/Average/Iteration 442 +MetaTest/Average/MaxReturn 61.7463 +MetaTest/Average/MinReturn -39.0713 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 38.3729 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 12.073 +MetaTest/__unnamed_task__/AverageReturn 12.073 +MetaTest/__unnamed_task__/Iteration 442 +MetaTest/__unnamed_task__/MaxReturn 61.7463 +MetaTest/__unnamed_task__/MinReturn -39.0713 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 38.3729 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 723800 +------------------------------------------------- ----------- +2025-04-03 18:46:22 | [pearl_trainer] epoch #443 | Training... +2025-04-03 18:47:49 | [pearl_trainer] epoch #443 | Evaluating... +2025-04-03 18:47:49 | [pearl_trainer] epoch #443 | Sampling for adapation and meta-testing... +2025-04-03 18:49:38 | [pearl_trainer] epoch #443 | Finished meta-testing... +2025-04-03 18:49:38 | [pearl_trainer] epoch #443 | Saving snapshot... +2025-04-03 18:49:39 | [pearl_trainer] epoch #443 | Saved +2025-04-03 18:49:39 | [pearl_trainer] epoch #443 | Time 104692.29 s +2025-04-03 18:49:39 | [pearl_trainer] epoch #443 | EpochTime 228.99 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 11.9713 +MetaTest/Average/AverageReturn 11.9713 +MetaTest/Average/Iteration 443 +MetaTest/Average/MaxReturn 88.7956 +MetaTest/Average/MinReturn -44.6453 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 54.5818 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 11.9713 +MetaTest/__unnamed_task__/AverageReturn 11.9713 +MetaTest/__unnamed_task__/Iteration 443 +MetaTest/__unnamed_task__/MaxReturn 88.7956 +MetaTest/__unnamed_task__/MinReturn -44.6453 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 54.5818 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 725400 +------------------------------------------------- ----------- +2025-04-03 18:50:11 | [pearl_trainer] epoch #444 | Training... +2025-04-03 18:51:40 | [pearl_trainer] epoch #444 | Evaluating... +2025-04-03 18:51:40 | [pearl_trainer] epoch #444 | Sampling for adapation and meta-testing... +2025-04-03 18:53:35 | [pearl_trainer] epoch #444 | Finished meta-testing... +2025-04-03 18:53:35 | [pearl_trainer] epoch #444 | Saving snapshot... +2025-04-03 18:53:36 | [pearl_trainer] epoch #444 | Saved +2025-04-03 18:53:36 | [pearl_trainer] epoch #444 | Time 104929.06 s +2025-04-03 18:53:36 | [pearl_trainer] epoch #444 | EpochTime 236.77 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 26.6623 +MetaTest/Average/AverageReturn 26.6623 +MetaTest/Average/Iteration 444 +MetaTest/Average/MaxReturn 94.8284 +MetaTest/Average/MinReturn -49.0581 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 56.0561 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 26.6623 +MetaTest/__unnamed_task__/AverageReturn 26.6623 +MetaTest/__unnamed_task__/Iteration 444 +MetaTest/__unnamed_task__/MaxReturn 94.8284 +MetaTest/__unnamed_task__/MinReturn -49.0581 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 56.0561 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 727000 +------------------------------------------------- ----------- +2025-04-03 18:54:07 | [pearl_trainer] epoch #445 | Training... +2025-04-03 18:55:32 | [pearl_trainer] epoch #445 | Evaluating... +2025-04-03 18:55:32 | [pearl_trainer] epoch #445 | Sampling for adapation and meta-testing... +2025-04-03 18:57:22 | [pearl_trainer] epoch #445 | Finished meta-testing... +2025-04-03 18:57:22 | [pearl_trainer] epoch #445 | Saving snapshot... +2025-04-03 18:57:24 | [pearl_trainer] epoch #445 | Saved +2025-04-03 18:57:24 | [pearl_trainer] epoch #445 | Time 105156.59 s +2025-04-03 18:57:24 | [pearl_trainer] epoch #445 | EpochTime 227.52 s +------------------------------------------------- ------------- +MetaTest/Average/AverageDiscountedReturn 0.338301 +MetaTest/Average/AverageReturn 0.338301 +MetaTest/Average/Iteration 445 +MetaTest/Average/MaxReturn 120.307 +MetaTest/Average/MinReturn -43.8575 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 61.3527 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 0.338301 +MetaTest/__unnamed_task__/AverageReturn 0.338301 +MetaTest/__unnamed_task__/Iteration 445 +MetaTest/__unnamed_task__/MaxReturn 120.307 +MetaTest/__unnamed_task__/MinReturn -43.8575 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 61.3527 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 728600 +------------------------------------------------- ------------- +2025-04-03 18:57:56 | [pearl_trainer] epoch #446 | Training... +2025-04-03 18:59:22 | [pearl_trainer] epoch #446 | Evaluating... +2025-04-03 18:59:22 | [pearl_trainer] epoch #446 | Sampling for adapation and meta-testing... +2025-04-03 19:01:10 | [pearl_trainer] epoch #446 | Finished meta-testing... +2025-04-03 19:01:10 | [pearl_trainer] epoch #446 | Saving snapshot... +2025-04-03 19:01:11 | [pearl_trainer] epoch #446 | Saved +2025-04-03 19:01:11 | [pearl_trainer] epoch #446 | Time 105384.50 s +2025-04-03 19:01:11 | [pearl_trainer] epoch #446 | EpochTime 227.91 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 2.00264 +MetaTest/Average/AverageReturn 2.00264 +MetaTest/Average/Iteration 446 +MetaTest/Average/MaxReturn 105.717 +MetaTest/Average/MinReturn -65.2911 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 60.9871 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 2.00264 +MetaTest/__unnamed_task__/AverageReturn 2.00264 +MetaTest/__unnamed_task__/Iteration 446 +MetaTest/__unnamed_task__/MaxReturn 105.717 +MetaTest/__unnamed_task__/MinReturn -65.2911 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 60.9871 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 730200 +------------------------------------------------- ------------ +2025-04-03 19:01:43 | [pearl_trainer] epoch #447 | Training... +2025-04-03 19:03:07 | [pearl_trainer] epoch #447 | Evaluating... +2025-04-03 19:03:07 | [pearl_trainer] epoch #447 | Sampling for adapation and meta-testing... +2025-04-03 19:04:57 | [pearl_trainer] epoch #447 | Finished meta-testing... +2025-04-03 19:04:57 | [pearl_trainer] epoch #447 | Saving snapshot... +2025-04-03 19:04:58 | [pearl_trainer] epoch #447 | Saved +2025-04-03 19:04:58 | [pearl_trainer] epoch #447 | Time 105610.70 s +2025-04-03 19:04:58 | [pearl_trainer] epoch #447 | EpochTime 226.20 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 27.1423 +MetaTest/Average/AverageReturn 27.1423 +MetaTest/Average/Iteration 447 +MetaTest/Average/MaxReturn 89.7261 +MetaTest/Average/MinReturn -27.7602 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 42.2783 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 27.1423 +MetaTest/__unnamed_task__/AverageReturn 27.1423 +MetaTest/__unnamed_task__/Iteration 447 +MetaTest/__unnamed_task__/MaxReturn 89.7261 +MetaTest/__unnamed_task__/MinReturn -27.7602 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 42.2783 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 731800 +------------------------------------------------- ----------- +2025-04-03 19:05:29 | [pearl_trainer] epoch #448 | Training... +2025-04-03 19:06:54 | [pearl_trainer] epoch #448 | Evaluating... +2025-04-03 19:06:54 | [pearl_trainer] epoch #448 | Sampling for adapation and meta-testing... +2025-04-03 19:08:38 | [pearl_trainer] epoch #448 | Finished meta-testing... +2025-04-03 19:08:38 | [pearl_trainer] epoch #448 | Saving snapshot... +2025-04-03 19:08:39 | [pearl_trainer] epoch #448 | Saved +2025-04-03 19:08:39 | [pearl_trainer] epoch #448 | Time 105832.22 s +2025-04-03 19:08:39 | [pearl_trainer] epoch #448 | EpochTime 221.52 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 20.622 +MetaTest/Average/AverageReturn 20.622 +MetaTest/Average/Iteration 448 +MetaTest/Average/MaxReturn 79.5468 +MetaTest/Average/MinReturn -16.9131 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 34.7635 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 20.622 +MetaTest/__unnamed_task__/AverageReturn 20.622 +MetaTest/__unnamed_task__/Iteration 448 +MetaTest/__unnamed_task__/MaxReturn 79.5468 +MetaTest/__unnamed_task__/MinReturn -16.9131 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 34.7635 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 733400 +------------------------------------------------- ----------- +2025-04-03 19:09:10 | [pearl_trainer] epoch #449 | Training... +2025-04-03 19:10:34 | [pearl_trainer] epoch #449 | Evaluating... +2025-04-03 19:10:34 | [pearl_trainer] epoch #449 | Sampling for adapation and meta-testing... +2025-04-03 19:12:16 | [pearl_trainer] epoch #449 | Finished meta-testing... +2025-04-03 19:12:16 | [pearl_trainer] epoch #449 | Saving snapshot... +2025-04-03 19:12:17 | [pearl_trainer] epoch #449 | Saved +2025-04-03 19:12:17 | [pearl_trainer] epoch #449 | Time 106050.45 s +2025-04-03 19:12:17 | [pearl_trainer] epoch #449 | EpochTime 218.23 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 30.1017 +MetaTest/Average/AverageReturn 30.1017 +MetaTest/Average/Iteration 449 +MetaTest/Average/MaxReturn 96.1067 +MetaTest/Average/MinReturn -20.2809 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 46.2677 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 30.1017 +MetaTest/__unnamed_task__/AverageReturn 30.1017 +MetaTest/__unnamed_task__/Iteration 449 +MetaTest/__unnamed_task__/MaxReturn 96.1067 +MetaTest/__unnamed_task__/MinReturn -20.2809 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 46.2677 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 735000 +------------------------------------------------- ----------- +2025-04-03 19:12:47 | [pearl_trainer] epoch #450 | Training... +2025-04-03 19:14:12 | [pearl_trainer] epoch #450 | Evaluating... +2025-04-03 19:14:12 | [pearl_trainer] epoch #450 | Sampling for adapation and meta-testing... +2025-04-03 19:15:54 | [pearl_trainer] epoch #450 | Finished meta-testing... +2025-04-03 19:15:54 | [pearl_trainer] epoch #450 | Saving snapshot... +2025-04-03 19:15:56 | [pearl_trainer] epoch #450 | Saved +2025-04-03 19:15:56 | [pearl_trainer] epoch #450 | Time 106268.79 s +2025-04-03 19:15:56 | [pearl_trainer] epoch #450 | EpochTime 218.34 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 19.1586 +MetaTest/Average/AverageReturn 19.1586 +MetaTest/Average/Iteration 450 +MetaTest/Average/MaxReturn 92.9216 +MetaTest/Average/MinReturn -51.3612 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 49.7785 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 19.1586 +MetaTest/__unnamed_task__/AverageReturn 19.1586 +MetaTest/__unnamed_task__/Iteration 450 +MetaTest/__unnamed_task__/MaxReturn 92.9216 +MetaTest/__unnamed_task__/MinReturn -51.3612 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 49.7785 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 736600 +------------------------------------------------- ----------- +2025-04-03 19:16:26 | [pearl_trainer] epoch #451 | Training... +2025-04-03 19:17:51 | [pearl_trainer] epoch #451 | Evaluating... +2025-04-03 19:17:51 | [pearl_trainer] epoch #451 | Sampling for adapation and meta-testing... +2025-04-03 19:19:37 | [pearl_trainer] epoch #451 | Finished meta-testing... +2025-04-03 19:19:37 | [pearl_trainer] epoch #451 | Saving snapshot... +2025-04-03 19:19:38 | [pearl_trainer] epoch #451 | Saved +2025-04-03 19:19:38 | [pearl_trainer] epoch #451 | Time 106491.33 s +2025-04-03 19:19:38 | [pearl_trainer] epoch #451 | EpochTime 222.54 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 19.0949 +MetaTest/Average/AverageReturn 19.0949 +MetaTest/Average/Iteration 451 +MetaTest/Average/MaxReturn 88.1721 +MetaTest/Average/MinReturn -34.0693 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 47.7108 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 19.0949 +MetaTest/__unnamed_task__/AverageReturn 19.0949 +MetaTest/__unnamed_task__/Iteration 451 +MetaTest/__unnamed_task__/MaxReturn 88.1721 +MetaTest/__unnamed_task__/MinReturn -34.0693 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 47.7108 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 738200 +------------------------------------------------- ----------- +2025-04-03 19:20:10 | [pearl_trainer] epoch #452 | Training... +2025-04-03 19:21:33 | [pearl_trainer] epoch #452 | Evaluating... +2025-04-03 19:21:33 | [pearl_trainer] epoch #452 | Sampling for adapation and meta-testing... +2025-04-03 19:23:16 | [pearl_trainer] epoch #452 | Finished meta-testing... +2025-04-03 19:23:16 | [pearl_trainer] epoch #452 | Saving snapshot... +2025-04-03 19:23:17 | [pearl_trainer] epoch #452 | Saved +2025-04-03 19:23:17 | [pearl_trainer] epoch #452 | Time 106710.47 s +2025-04-03 19:23:17 | [pearl_trainer] epoch #452 | EpochTime 219.14 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -3.22716 +MetaTest/Average/AverageReturn -3.22716 +MetaTest/Average/Iteration 452 +MetaTest/Average/MaxReturn 19.6831 +MetaTest/Average/MinReturn -27.8822 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 19.6892 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -3.22716 +MetaTest/__unnamed_task__/AverageReturn -3.22716 +MetaTest/__unnamed_task__/Iteration 452 +MetaTest/__unnamed_task__/MaxReturn 19.6831 +MetaTest/__unnamed_task__/MinReturn -27.8822 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 19.6892 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 739800 +------------------------------------------------- ------------ +2025-04-03 19:23:48 | [pearl_trainer] epoch #453 | Training... +2025-04-03 19:25:08 | [pearl_trainer] epoch #453 | Evaluating... +2025-04-03 19:25:08 | [pearl_trainer] epoch #453 | Sampling for adapation and meta-testing... +2025-04-03 19:26:53 | [pearl_trainer] epoch #453 | Finished meta-testing... +2025-04-03 19:26:53 | [pearl_trainer] epoch #453 | Saving snapshot... +2025-04-03 19:26:55 | [pearl_trainer] epoch #453 | Saved +2025-04-03 19:26:55 | [pearl_trainer] epoch #453 | Time 106927.57 s +2025-04-03 19:26:55 | [pearl_trainer] epoch #453 | EpochTime 217.09 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 14.1014 +MetaTest/Average/AverageReturn 14.1014 +MetaTest/Average/Iteration 453 +MetaTest/Average/MaxReturn 79.1227 +MetaTest/Average/MinReturn -29.5059 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 40.2249 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 14.1014 +MetaTest/__unnamed_task__/AverageReturn 14.1014 +MetaTest/__unnamed_task__/Iteration 453 +MetaTest/__unnamed_task__/MaxReturn 79.1227 +MetaTest/__unnamed_task__/MinReturn -29.5059 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 40.2249 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 741400 +------------------------------------------------- ----------- +2025-04-03 19:27:25 | [pearl_trainer] epoch #454 | Training... +2025-04-03 19:28:47 | [pearl_trainer] epoch #454 | Evaluating... +2025-04-03 19:28:47 | [pearl_trainer] epoch #454 | Sampling for adapation and meta-testing... +2025-04-03 19:30:31 | [pearl_trainer] epoch #454 | Finished meta-testing... +2025-04-03 19:30:31 | [pearl_trainer] epoch #454 | Saving snapshot... +2025-04-03 19:30:32 | [pearl_trainer] epoch #454 | Saved +2025-04-03 19:30:32 | [pearl_trainer] epoch #454 | Time 107145.32 s +2025-04-03 19:30:32 | [pearl_trainer] epoch #454 | EpochTime 217.75 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 19.2545 +MetaTest/Average/AverageReturn 19.2545 +MetaTest/Average/Iteration 454 +MetaTest/Average/MaxReturn 66.0941 +MetaTest/Average/MinReturn -31.2117 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 40.5138 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 19.2545 +MetaTest/__unnamed_task__/AverageReturn 19.2545 +MetaTest/__unnamed_task__/Iteration 454 +MetaTest/__unnamed_task__/MaxReturn 66.0941 +MetaTest/__unnamed_task__/MinReturn -31.2117 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 40.5138 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 743000 +------------------------------------------------- ----------- +2025-04-03 19:31:02 | [pearl_trainer] epoch #455 | Training... +2025-04-03 19:32:28 | [pearl_trainer] epoch #455 | Evaluating... +2025-04-03 19:32:28 | [pearl_trainer] epoch #455 | Sampling for adapation and meta-testing... +2025-04-03 19:34:11 | [pearl_trainer] epoch #455 | Finished meta-testing... +2025-04-03 19:34:11 | [pearl_trainer] epoch #455 | Saving snapshot... +2025-04-03 19:34:12 | [pearl_trainer] epoch #455 | Saved +2025-04-03 19:34:12 | [pearl_trainer] epoch #455 | Time 107365.46 s +2025-04-03 19:34:12 | [pearl_trainer] epoch #455 | EpochTime 220.14 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 18.7912 +MetaTest/Average/AverageReturn 18.7912 +MetaTest/Average/Iteration 455 +MetaTest/Average/MaxReturn 100.746 +MetaTest/Average/MinReturn -20.4204 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 42.6866 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 18.7912 +MetaTest/__unnamed_task__/AverageReturn 18.7912 +MetaTest/__unnamed_task__/Iteration 455 +MetaTest/__unnamed_task__/MaxReturn 100.746 +MetaTest/__unnamed_task__/MinReturn -20.4204 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 42.6866 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 744600 +------------------------------------------------- ----------- +2025-04-03 19:34:42 | [pearl_trainer] epoch #456 | Training... +2025-04-03 19:36:09 | [pearl_trainer] epoch #456 | Evaluating... +2025-04-03 19:36:09 | [pearl_trainer] epoch #456 | Sampling for adapation and meta-testing... +2025-04-03 19:37:53 | [pearl_trainer] epoch #456 | Finished meta-testing... +2025-04-03 19:37:53 | [pearl_trainer] epoch #456 | Saving snapshot... +2025-04-03 19:37:54 | [pearl_trainer] epoch #456 | Saved +2025-04-03 19:37:54 | [pearl_trainer] epoch #456 | Time 107587.09 s +2025-04-03 19:37:54 | [pearl_trainer] epoch #456 | EpochTime 221.62 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 33.006 +MetaTest/Average/AverageReturn 33.006 +MetaTest/Average/Iteration 456 +MetaTest/Average/MaxReturn 94.2495 +MetaTest/Average/MinReturn -22.0511 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 42.8857 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 33.006 +MetaTest/__unnamed_task__/AverageReturn 33.006 +MetaTest/__unnamed_task__/Iteration 456 +MetaTest/__unnamed_task__/MaxReturn 94.2495 +MetaTest/__unnamed_task__/MinReturn -22.0511 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 42.8857 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 746200 +------------------------------------------------- ----------- +2025-04-03 19:38:24 | [pearl_trainer] epoch #457 | Training... +2025-04-03 19:39:51 | [pearl_trainer] epoch #457 | Evaluating... +2025-04-03 19:39:51 | [pearl_trainer] epoch #457 | Sampling for adapation and meta-testing... +2025-04-03 19:41:35 | [pearl_trainer] epoch #457 | Finished meta-testing... +2025-04-03 19:41:35 | [pearl_trainer] epoch #457 | Saving snapshot... +2025-04-03 19:41:36 | [pearl_trainer] epoch #457 | Saved +2025-04-03 19:41:36 | [pearl_trainer] epoch #457 | Time 107809.49 s +2025-04-03 19:41:36 | [pearl_trainer] epoch #457 | EpochTime 222.40 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 24.7678 +MetaTest/Average/AverageReturn 24.7678 +MetaTest/Average/Iteration 457 +MetaTest/Average/MaxReturn 122.072 +MetaTest/Average/MinReturn -27.9194 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 53.4196 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 24.7678 +MetaTest/__unnamed_task__/AverageReturn 24.7678 +MetaTest/__unnamed_task__/Iteration 457 +MetaTest/__unnamed_task__/MaxReturn 122.072 +MetaTest/__unnamed_task__/MinReturn -27.9194 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 53.4196 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 747800 +------------------------------------------------- ----------- +2025-04-03 19:42:08 | [pearl_trainer] epoch #458 | Training... +2025-04-03 19:43:31 | [pearl_trainer] epoch #458 | Evaluating... +2025-04-03 19:43:31 | [pearl_trainer] epoch #458 | Sampling for adapation and meta-testing... +2025-04-03 19:45:19 | [pearl_trainer] epoch #458 | Finished meta-testing... +2025-04-03 19:45:19 | [pearl_trainer] epoch #458 | Saving snapshot... +2025-04-03 19:45:21 | [pearl_trainer] epoch #458 | Saved +2025-04-03 19:45:21 | [pearl_trainer] epoch #458 | Time 108033.72 s +2025-04-03 19:45:21 | [pearl_trainer] epoch #458 | EpochTime 224.23 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 33.02 +MetaTest/Average/AverageReturn 33.02 +MetaTest/Average/Iteration 458 +MetaTest/Average/MaxReturn 140.014 +MetaTest/Average/MinReturn -21.7305 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 61.5339 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 33.02 +MetaTest/__unnamed_task__/AverageReturn 33.02 +MetaTest/__unnamed_task__/Iteration 458 +MetaTest/__unnamed_task__/MaxReturn 140.014 +MetaTest/__unnamed_task__/MinReturn -21.7305 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 61.5339 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 749400 +------------------------------------------------- ----------- +2025-04-03 19:45:52 | [pearl_trainer] epoch #459 | Training... +2025-04-03 19:47:20 | [pearl_trainer] epoch #459 | Evaluating... +2025-04-03 19:47:20 | [pearl_trainer] epoch #459 | Sampling for adapation and meta-testing... +2025-04-03 19:49:20 | [pearl_trainer] epoch #459 | Finished meta-testing... +2025-04-03 19:49:20 | [pearl_trainer] epoch #459 | Saving snapshot... +2025-04-03 19:49:21 | [pearl_trainer] epoch #459 | Saved +2025-04-03 19:49:21 | [pearl_trainer] epoch #459 | Time 108274.05 s +2025-04-03 19:49:21 | [pearl_trainer] epoch #459 | EpochTime 240.33 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 3.8522 +MetaTest/Average/AverageReturn 3.8522 +MetaTest/Average/Iteration 459 +MetaTest/Average/MaxReturn 39.4369 +MetaTest/Average/MinReturn -15.7982 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 21.9051 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 3.8522 +MetaTest/__unnamed_task__/AverageReturn 3.8522 +MetaTest/__unnamed_task__/Iteration 459 +MetaTest/__unnamed_task__/MaxReturn 39.4369 +MetaTest/__unnamed_task__/MinReturn -15.7982 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 21.9051 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 751000 +------------------------------------------------- ----------- +2025-04-03 19:49:53 | [pearl_trainer] epoch #460 | Training... +2025-04-03 19:51:18 | [pearl_trainer] epoch #460 | Evaluating... +2025-04-03 19:51:18 | [pearl_trainer] epoch #460 | Sampling for adapation and meta-testing... +2025-04-03 19:53:11 | [pearl_trainer] epoch #460 | Finished meta-testing... +2025-04-03 19:53:11 | [pearl_trainer] epoch #460 | Saving snapshot... +2025-04-03 19:53:12 | [pearl_trainer] epoch #460 | Saved +2025-04-03 19:53:12 | [pearl_trainer] epoch #460 | Time 108505.47 s +2025-04-03 19:53:12 | [pearl_trainer] epoch #460 | EpochTime 231.42 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 20.468 +MetaTest/Average/AverageReturn 20.468 +MetaTest/Average/Iteration 460 +MetaTest/Average/MaxReturn 68.1826 +MetaTest/Average/MinReturn -19.2543 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 34.1633 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 20.468 +MetaTest/__unnamed_task__/AverageReturn 20.468 +MetaTest/__unnamed_task__/Iteration 460 +MetaTest/__unnamed_task__/MaxReturn 68.1826 +MetaTest/__unnamed_task__/MinReturn -19.2543 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 34.1633 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 752600 +------------------------------------------------- ----------- +2025-04-03 19:53:44 | [pearl_trainer] epoch #461 | Training... +2025-04-03 19:55:17 | [pearl_trainer] epoch #461 | Evaluating... +2025-04-03 19:55:17 | [pearl_trainer] epoch #461 | Sampling for adapation and meta-testing... +2025-04-03 19:57:14 | [pearl_trainer] epoch #461 | Finished meta-testing... +2025-04-03 19:57:14 | [pearl_trainer] epoch #461 | Saving snapshot... +2025-04-03 19:57:15 | [pearl_trainer] epoch #461 | Saved +2025-04-03 19:57:15 | [pearl_trainer] epoch #461 | Time 108748.28 s +2025-04-03 19:57:15 | [pearl_trainer] epoch #461 | EpochTime 242.80 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 58.2631 +MetaTest/Average/AverageReturn 58.2631 +MetaTest/Average/Iteration 461 +MetaTest/Average/MaxReturn 110.677 +MetaTest/Average/MinReturn -19.9204 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 46.5894 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 58.2631 +MetaTest/__unnamed_task__/AverageReturn 58.2631 +MetaTest/__unnamed_task__/Iteration 461 +MetaTest/__unnamed_task__/MaxReturn 110.677 +MetaTest/__unnamed_task__/MinReturn -19.9204 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 46.5894 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 754200 +------------------------------------------------- ----------- +2025-04-03 19:57:50 | [pearl_trainer] epoch #462 | Training... +2025-04-03 19:59:19 | [pearl_trainer] epoch #462 | Evaluating... +2025-04-03 19:59:19 | [pearl_trainer] epoch #462 | Sampling for adapation and meta-testing... +2025-04-03 20:01:08 | [pearl_trainer] epoch #462 | Finished meta-testing... +2025-04-03 20:01:08 | [pearl_trainer] epoch #462 | Saving snapshot... +2025-04-03 20:01:10 | [pearl_trainer] epoch #462 | Saved +2025-04-03 20:01:10 | [pearl_trainer] epoch #462 | Time 108982.59 s +2025-04-03 20:01:10 | [pearl_trainer] epoch #462 | EpochTime 234.31 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 22.047 +MetaTest/Average/AverageReturn 22.047 +MetaTest/Average/Iteration 462 +MetaTest/Average/MaxReturn 110.539 +MetaTest/Average/MinReturn -33.5382 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 49.8464 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 22.047 +MetaTest/__unnamed_task__/AverageReturn 22.047 +MetaTest/__unnamed_task__/Iteration 462 +MetaTest/__unnamed_task__/MaxReturn 110.539 +MetaTest/__unnamed_task__/MinReturn -33.5382 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 49.8464 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 755800 +------------------------------------------------- ----------- +2025-04-03 20:01:42 | [pearl_trainer] epoch #463 | Training... +2025-04-03 20:03:09 | [pearl_trainer] epoch #463 | Evaluating... +2025-04-03 20:03:09 | [pearl_trainer] epoch #463 | Sampling for adapation and meta-testing... +2025-04-03 20:05:00 | [pearl_trainer] epoch #463 | Finished meta-testing... +2025-04-03 20:05:00 | [pearl_trainer] epoch #463 | Saving snapshot... +2025-04-03 20:05:02 | [pearl_trainer] epoch #463 | Saved +2025-04-03 20:05:02 | [pearl_trainer] epoch #463 | Time 109214.76 s +2025-04-03 20:05:02 | [pearl_trainer] epoch #463 | EpochTime 232.16 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -2.66169 +MetaTest/Average/AverageReturn -2.66169 +MetaTest/Average/Iteration 463 +MetaTest/Average/MaxReturn 63.4855 +MetaTest/Average/MinReturn -34.7402 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 34.7376 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -2.66169 +MetaTest/__unnamed_task__/AverageReturn -2.66169 +MetaTest/__unnamed_task__/Iteration 463 +MetaTest/__unnamed_task__/MaxReturn 63.4855 +MetaTest/__unnamed_task__/MinReturn -34.7402 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 34.7376 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 757400 +------------------------------------------------- ------------ +2025-04-03 20:05:34 | [pearl_trainer] epoch #464 | Training... +2025-04-03 20:07:01 | [pearl_trainer] epoch #464 | Evaluating... +2025-04-03 20:07:01 | [pearl_trainer] epoch #464 | Sampling for adapation and meta-testing... +2025-04-03 20:08:52 | [pearl_trainer] epoch #464 | Finished meta-testing... +2025-04-03 20:08:52 | [pearl_trainer] epoch #464 | Saving snapshot... +2025-04-03 20:08:53 | [pearl_trainer] epoch #464 | Saved +2025-04-03 20:08:53 | [pearl_trainer] epoch #464 | Time 109445.62 s +2025-04-03 20:08:53 | [pearl_trainer] epoch #464 | EpochTime 230.86 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 30.0579 +MetaTest/Average/AverageReturn 30.0579 +MetaTest/Average/Iteration 464 +MetaTest/Average/MaxReturn 85.2535 +MetaTest/Average/MinReturn -35.9025 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 45.1216 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 30.0579 +MetaTest/__unnamed_task__/AverageReturn 30.0579 +MetaTest/__unnamed_task__/Iteration 464 +MetaTest/__unnamed_task__/MaxReturn 85.2535 +MetaTest/__unnamed_task__/MinReturn -35.9025 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 45.1216 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 759000 +------------------------------------------------- ----------- +2025-04-03 20:09:25 | [pearl_trainer] epoch #465 | Training... +2025-04-03 20:10:56 | [pearl_trainer] epoch #465 | Evaluating... +2025-04-03 20:10:56 | [pearl_trainer] epoch #465 | Sampling for adapation and meta-testing... +2025-04-03 20:12:46 | [pearl_trainer] epoch #465 | Finished meta-testing... +2025-04-03 20:12:46 | [pearl_trainer] epoch #465 | Saving snapshot... +2025-04-03 20:12:47 | [pearl_trainer] epoch #465 | Saved +2025-04-03 20:12:47 | [pearl_trainer] epoch #465 | Time 109680.46 s +2025-04-03 20:12:47 | [pearl_trainer] epoch #465 | EpochTime 234.84 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 32.1899 +MetaTest/Average/AverageReturn 32.1899 +MetaTest/Average/Iteration 465 +MetaTest/Average/MaxReturn 126.554 +MetaTest/Average/MinReturn -30.9801 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 56.4318 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 32.1899 +MetaTest/__unnamed_task__/AverageReturn 32.1899 +MetaTest/__unnamed_task__/Iteration 465 +MetaTest/__unnamed_task__/MaxReturn 126.554 +MetaTest/__unnamed_task__/MinReturn -30.9801 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 56.4318 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 760600 +------------------------------------------------- ----------- +2025-04-03 20:13:19 | [pearl_trainer] epoch #466 | Training... +2025-04-03 20:14:45 | [pearl_trainer] epoch #466 | Evaluating... +2025-04-03 20:14:45 | [pearl_trainer] epoch #466 | Sampling for adapation and meta-testing... +2025-04-03 20:16:29 | [pearl_trainer] epoch #466 | Finished meta-testing... +2025-04-03 20:16:29 | [pearl_trainer] epoch #466 | Saving snapshot... +2025-04-03 20:16:30 | [pearl_trainer] epoch #466 | Saved +2025-04-03 20:16:30 | [pearl_trainer] epoch #466 | Time 109902.62 s +2025-04-03 20:16:30 | [pearl_trainer] epoch #466 | EpochTime 222.16 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 42.5572 +MetaTest/Average/AverageReturn 42.5572 +MetaTest/Average/Iteration 466 +MetaTest/Average/MaxReturn 113.903 +MetaTest/Average/MinReturn -63.6468 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 61.8881 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 42.5572 +MetaTest/__unnamed_task__/AverageReturn 42.5572 +MetaTest/__unnamed_task__/Iteration 466 +MetaTest/__unnamed_task__/MaxReturn 113.903 +MetaTest/__unnamed_task__/MinReturn -63.6468 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 61.8881 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 762200 +------------------------------------------------- ----------- +2025-04-03 20:17:00 | [pearl_trainer] epoch #467 | Training... +2025-04-03 20:18:39 | [pearl_trainer] epoch #467 | Evaluating... +2025-04-03 20:18:39 | [pearl_trainer] epoch #467 | Sampling for adapation and meta-testing... +2025-04-03 20:20:45 | [pearl_trainer] epoch #467 | Finished meta-testing... +2025-04-03 20:20:45 | [pearl_trainer] epoch #467 | Saving snapshot... +2025-04-03 20:20:46 | [pearl_trainer] epoch #467 | Saved +2025-04-03 20:20:46 | [pearl_trainer] epoch #467 | Time 110159.48 s +2025-04-03 20:20:46 | [pearl_trainer] epoch #467 | EpochTime 256.85 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -0.68955 +MetaTest/Average/AverageReturn -0.68955 +MetaTest/Average/Iteration 467 +MetaTest/Average/MaxReturn 70.0864 +MetaTest/Average/MinReturn -67.7803 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 48.3408 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -0.68955 +MetaTest/__unnamed_task__/AverageReturn -0.68955 +MetaTest/__unnamed_task__/Iteration 467 +MetaTest/__unnamed_task__/MaxReturn 70.0864 +MetaTest/__unnamed_task__/MinReturn -67.7803 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 48.3408 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 763800 +------------------------------------------------- ------------ +2025-04-03 20:21:18 | [pearl_trainer] epoch #468 | Training... +2025-04-03 20:22:44 | [pearl_trainer] epoch #468 | Evaluating... +2025-04-03 20:22:44 | [pearl_trainer] epoch #468 | Sampling for adapation and meta-testing... +2025-04-03 20:24:29 | [pearl_trainer] epoch #468 | Finished meta-testing... +2025-04-03 20:24:29 | [pearl_trainer] epoch #468 | Saving snapshot... +2025-04-03 20:24:31 | [pearl_trainer] epoch #468 | Saved +2025-04-03 20:24:31 | [pearl_trainer] epoch #468 | Time 110383.92 s +2025-04-03 20:24:31 | [pearl_trainer] epoch #468 | EpochTime 224.44 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -10.6929 +MetaTest/Average/AverageReturn -10.6929 +MetaTest/Average/Iteration 468 +MetaTest/Average/MaxReturn 69.3964 +MetaTest/Average/MinReturn -67.8981 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 45.5155 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -10.6929 +MetaTest/__unnamed_task__/AverageReturn -10.6929 +MetaTest/__unnamed_task__/Iteration 468 +MetaTest/__unnamed_task__/MaxReturn 69.3964 +MetaTest/__unnamed_task__/MinReturn -67.8981 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 45.5155 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 765400 +------------------------------------------------- ----------- +2025-04-03 20:25:01 | [pearl_trainer] epoch #469 | Training... +2025-04-03 20:26:33 | [pearl_trainer] epoch #469 | Evaluating... +2025-04-03 20:26:33 | [pearl_trainer] epoch #469 | Sampling for adapation and meta-testing... +2025-04-03 20:28:18 | [pearl_trainer] epoch #469 | Finished meta-testing... +2025-04-03 20:28:18 | [pearl_trainer] epoch #469 | Saving snapshot... +2025-04-03 20:28:19 | [pearl_trainer] epoch #469 | Saved +2025-04-03 20:28:19 | [pearl_trainer] epoch #469 | Time 110611.64 s +2025-04-03 20:28:19 | [pearl_trainer] epoch #469 | EpochTime 227.71 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -5.39989 +MetaTest/Average/AverageReturn -5.39989 +MetaTest/Average/Iteration 469 +MetaTest/Average/MaxReturn 42.0339 +MetaTest/Average/MinReturn -92.6644 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 48.0261 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -5.39989 +MetaTest/__unnamed_task__/AverageReturn -5.39989 +MetaTest/__unnamed_task__/Iteration 469 +MetaTest/__unnamed_task__/MaxReturn 42.0339 +MetaTest/__unnamed_task__/MinReturn -92.6644 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 48.0261 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 767000 +------------------------------------------------- ------------ +2025-04-03 20:28:50 | [pearl_trainer] epoch #470 | Training... +2025-04-03 20:30:15 | [pearl_trainer] epoch #470 | Evaluating... +2025-04-03 20:30:15 | [pearl_trainer] epoch #470 | Sampling for adapation and meta-testing... +2025-04-03 20:31:57 | [pearl_trainer] epoch #470 | Finished meta-testing... +2025-04-03 20:31:57 | [pearl_trainer] epoch #470 | Saving snapshot... +2025-04-03 20:31:59 | [pearl_trainer] epoch #470 | Saved +2025-04-03 20:31:59 | [pearl_trainer] epoch #470 | Time 110831.54 s +2025-04-03 20:31:59 | [pearl_trainer] epoch #470 | EpochTime 219.90 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 31.5111 +MetaTest/Average/AverageReturn 31.5111 +MetaTest/Average/Iteration 470 +MetaTest/Average/MaxReturn 89.5832 +MetaTest/Average/MinReturn -15.0511 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 36.4966 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 31.5111 +MetaTest/__unnamed_task__/AverageReturn 31.5111 +MetaTest/__unnamed_task__/Iteration 470 +MetaTest/__unnamed_task__/MaxReturn 89.5832 +MetaTest/__unnamed_task__/MinReturn -15.0511 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 36.4966 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 768600 +------------------------------------------------- ----------- +2025-04-03 20:32:29 | [pearl_trainer] epoch #471 | Training... +2025-04-03 20:33:54 | [pearl_trainer] epoch #471 | Evaluating... +2025-04-03 20:33:54 | [pearl_trainer] epoch #471 | Sampling for adapation and meta-testing... +2025-04-03 20:35:42 | [pearl_trainer] epoch #471 | Finished meta-testing... +2025-04-03 20:35:42 | [pearl_trainer] epoch #471 | Saving snapshot... +2025-04-03 20:35:43 | [pearl_trainer] epoch #471 | Saved +2025-04-03 20:35:43 | [pearl_trainer] epoch #471 | Time 111056.19 s +2025-04-03 20:35:43 | [pearl_trainer] epoch #471 | EpochTime 224.65 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 39.6663 +MetaTest/Average/AverageReturn 39.6663 +MetaTest/Average/Iteration 471 +MetaTest/Average/MaxReturn 85.3554 +MetaTest/Average/MinReturn -17.4468 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 46.5693 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 39.6663 +MetaTest/__unnamed_task__/AverageReturn 39.6663 +MetaTest/__unnamed_task__/Iteration 471 +MetaTest/__unnamed_task__/MaxReturn 85.3554 +MetaTest/__unnamed_task__/MinReturn -17.4468 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 46.5693 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 770200 +------------------------------------------------- ----------- +2025-04-03 20:36:14 | [pearl_trainer] epoch #472 | Training... +2025-04-03 20:37:45 | [pearl_trainer] epoch #472 | Evaluating... +2025-04-03 20:37:45 | [pearl_trainer] epoch #472 | Sampling for adapation and meta-testing... +2025-04-03 20:39:34 | [pearl_trainer] epoch #472 | Finished meta-testing... +2025-04-03 20:39:34 | [pearl_trainer] epoch #472 | Saving snapshot... +2025-04-03 20:39:35 | [pearl_trainer] epoch #472 | Saved +2025-04-03 20:39:35 | [pearl_trainer] epoch #472 | Time 111287.56 s +2025-04-03 20:39:35 | [pearl_trainer] epoch #472 | EpochTime 231.36 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 31.4833 +MetaTest/Average/AverageReturn 31.4833 +MetaTest/Average/Iteration 472 +MetaTest/Average/MaxReturn 110.618 +MetaTest/Average/MinReturn -38.5854 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 50.382 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 31.4833 +MetaTest/__unnamed_task__/AverageReturn 31.4833 +MetaTest/__unnamed_task__/Iteration 472 +MetaTest/__unnamed_task__/MaxReturn 110.618 +MetaTest/__unnamed_task__/MinReturn -38.5854 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 50.382 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 771800 +------------------------------------------------- ----------- +2025-04-03 20:40:05 | [pearl_trainer] epoch #473 | Training... +2025-04-03 20:41:28 | [pearl_trainer] epoch #473 | Evaluating... +2025-04-03 20:41:28 | [pearl_trainer] epoch #473 | Sampling for adapation and meta-testing... +2025-04-03 20:43:18 | [pearl_trainer] epoch #473 | Finished meta-testing... +2025-04-03 20:43:18 | [pearl_trainer] epoch #473 | Saving snapshot... +2025-04-03 20:43:20 | [pearl_trainer] epoch #473 | Saved +2025-04-03 20:43:20 | [pearl_trainer] epoch #473 | Time 111512.71 s +2025-04-03 20:43:20 | [pearl_trainer] epoch #473 | EpochTime 225.15 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 13.8058 +MetaTest/Average/AverageReturn 13.8058 +MetaTest/Average/Iteration 473 +MetaTest/Average/MaxReturn 109.599 +MetaTest/Average/MinReturn -51.3822 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 57.8471 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 13.8058 +MetaTest/__unnamed_task__/AverageReturn 13.8058 +MetaTest/__unnamed_task__/Iteration 473 +MetaTest/__unnamed_task__/MaxReturn 109.599 +MetaTest/__unnamed_task__/MinReturn -51.3822 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 57.8471 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 773400 +------------------------------------------------- ----------- +2025-04-03 20:43:52 | [pearl_trainer] epoch #474 | Training... +2025-04-03 20:45:22 | [pearl_trainer] epoch #474 | Evaluating... +2025-04-03 20:45:22 | [pearl_trainer] epoch #474 | Sampling for adapation and meta-testing... +2025-04-03 20:47:06 | [pearl_trainer] epoch #474 | Finished meta-testing... +2025-04-03 20:47:06 | [pearl_trainer] epoch #474 | Saving snapshot... +2025-04-03 20:47:07 | [pearl_trainer] epoch #474 | Saved +2025-04-03 20:47:07 | [pearl_trainer] epoch #474 | Time 111740.47 s +2025-04-03 20:47:07 | [pearl_trainer] epoch #474 | EpochTime 227.76 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 3.07103 +MetaTest/Average/AverageReturn 3.07103 +MetaTest/Average/Iteration 474 +MetaTest/Average/MaxReturn 73.4848 +MetaTest/Average/MinReturn -30.8134 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 37.4017 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 3.07103 +MetaTest/__unnamed_task__/AverageReturn 3.07103 +MetaTest/__unnamed_task__/Iteration 474 +MetaTest/__unnamed_task__/MaxReturn 73.4848 +MetaTest/__unnamed_task__/MinReturn -30.8134 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 37.4017 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 775000 +------------------------------------------------- ------------ +2025-04-03 20:47:38 | [pearl_trainer] epoch #475 | Training... +2025-04-03 20:49:08 | [pearl_trainer] epoch #475 | Evaluating... +2025-04-03 20:49:08 | [pearl_trainer] epoch #475 | Sampling for adapation and meta-testing... +2025-04-03 20:51:01 | [pearl_trainer] epoch #475 | Finished meta-testing... +2025-04-03 20:51:01 | [pearl_trainer] epoch #475 | Saving snapshot... +2025-04-03 20:51:02 | [pearl_trainer] epoch #475 | Saved +2025-04-03 20:51:02 | [pearl_trainer] epoch #475 | Time 111975.44 s +2025-04-03 20:51:02 | [pearl_trainer] epoch #475 | EpochTime 234.97 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 42.3586 +MetaTest/Average/AverageReturn 42.3586 +MetaTest/Average/Iteration 475 +MetaTest/Average/MaxReturn 70.1143 +MetaTest/Average/MinReturn -27.1737 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 35.5455 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 42.3586 +MetaTest/__unnamed_task__/AverageReturn 42.3586 +MetaTest/__unnamed_task__/Iteration 475 +MetaTest/__unnamed_task__/MaxReturn 70.1143 +MetaTest/__unnamed_task__/MinReturn -27.1737 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 35.5455 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 776600 +------------------------------------------------- ----------- +2025-04-03 20:51:34 | [pearl_trainer] epoch #476 | Training... +2025-04-03 20:53:03 | [pearl_trainer] epoch #476 | Evaluating... +2025-04-03 20:53:03 | [pearl_trainer] epoch #476 | Sampling for adapation and meta-testing... +2025-04-03 20:54:51 | [pearl_trainer] epoch #476 | Finished meta-testing... +2025-04-03 20:54:51 | [pearl_trainer] epoch #476 | Saving snapshot... +2025-04-03 20:54:52 | [pearl_trainer] epoch #476 | Saved +2025-04-03 20:54:52 | [pearl_trainer] epoch #476 | Time 112205.40 s +2025-04-03 20:54:52 | [pearl_trainer] epoch #476 | EpochTime 229.95 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 42.8571 +MetaTest/Average/AverageReturn 42.8571 +MetaTest/Average/Iteration 476 +MetaTest/Average/MaxReturn 111.086 +MetaTest/Average/MinReturn -27.0331 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 48.9669 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 42.8571 +MetaTest/__unnamed_task__/AverageReturn 42.8571 +MetaTest/__unnamed_task__/Iteration 476 +MetaTest/__unnamed_task__/MaxReturn 111.086 +MetaTest/__unnamed_task__/MinReturn -27.0331 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 48.9669 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 778200 +------------------------------------------------- ----------- +2025-04-03 20:55:23 | [pearl_trainer] epoch #477 | Training... +2025-04-03 20:56:46 | [pearl_trainer] epoch #477 | Evaluating... +2025-04-03 20:56:46 | [pearl_trainer] epoch #477 | Sampling for adapation and meta-testing... +2025-04-03 20:58:32 | [pearl_trainer] epoch #477 | Finished meta-testing... +2025-04-03 20:58:32 | [pearl_trainer] epoch #477 | Saving snapshot... +2025-04-03 20:58:33 | [pearl_trainer] epoch #477 | Saved +2025-04-03 20:58:33 | [pearl_trainer] epoch #477 | Time 112425.63 s +2025-04-03 20:58:33 | [pearl_trainer] epoch #477 | EpochTime 220.23 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 3.2952 +MetaTest/Average/AverageReturn 3.2952 +MetaTest/Average/Iteration 477 +MetaTest/Average/MaxReturn 74.2127 +MetaTest/Average/MinReturn -22.2787 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 37.0951 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 3.2952 +MetaTest/__unnamed_task__/AverageReturn 3.2952 +MetaTest/__unnamed_task__/Iteration 477 +MetaTest/__unnamed_task__/MaxReturn 74.2127 +MetaTest/__unnamed_task__/MinReturn -22.2787 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 37.0951 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 779800 +------------------------------------------------- ----------- +2025-04-03 20:59:03 | [pearl_trainer] epoch #478 | Training... +2025-04-03 21:00:27 | [pearl_trainer] epoch #478 | Evaluating... +2025-04-03 21:00:27 | [pearl_trainer] epoch #478 | Sampling for adapation and meta-testing... +2025-04-03 21:02:12 | [pearl_trainer] epoch #478 | Finished meta-testing... +2025-04-03 21:02:12 | [pearl_trainer] epoch #478 | Saving snapshot... +2025-04-03 21:02:14 | [pearl_trainer] epoch #478 | Saved +2025-04-03 21:02:14 | [pearl_trainer] epoch #478 | Time 112646.82 s +2025-04-03 21:02:14 | [pearl_trainer] epoch #478 | EpochTime 221.19 s +------------------------------------------------- ------------- +MetaTest/Average/AverageDiscountedReturn -0.845878 +MetaTest/Average/AverageReturn -0.845878 +MetaTest/Average/Iteration 478 +MetaTest/Average/MaxReturn 36.0964 +MetaTest/Average/MinReturn -36.1795 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 22.9865 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -0.845878 +MetaTest/__unnamed_task__/AverageReturn -0.845878 +MetaTest/__unnamed_task__/Iteration 478 +MetaTest/__unnamed_task__/MaxReturn 36.0964 +MetaTest/__unnamed_task__/MinReturn -36.1795 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 22.9865 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 781400 +------------------------------------------------- ------------- +2025-04-03 21:02:44 | [pearl_trainer] epoch #479 | Training... +2025-04-03 21:04:11 | [pearl_trainer] epoch #479 | Evaluating... +2025-04-03 21:04:11 | [pearl_trainer] epoch #479 | Sampling for adapation and meta-testing... +2025-04-03 21:05:57 | [pearl_trainer] epoch #479 | Finished meta-testing... +2025-04-03 21:05:57 | [pearl_trainer] epoch #479 | Saving snapshot... +2025-04-03 21:05:58 | [pearl_trainer] epoch #479 | Saved +2025-04-03 21:05:58 | [pearl_trainer] epoch #479 | Time 112870.99 s +2025-04-03 21:05:58 | [pearl_trainer] epoch #479 | EpochTime 224.17 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 18.5428 +MetaTest/Average/AverageReturn 18.5428 +MetaTest/Average/Iteration 479 +MetaTest/Average/MaxReturn 136.759 +MetaTest/Average/MinReturn -38.4719 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 67.5787 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 18.5428 +MetaTest/__unnamed_task__/AverageReturn 18.5428 +MetaTest/__unnamed_task__/Iteration 479 +MetaTest/__unnamed_task__/MaxReturn 136.759 +MetaTest/__unnamed_task__/MinReturn -38.4719 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 67.5787 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 783000 +------------------------------------------------- ----------- +2025-04-03 21:06:29 | [pearl_trainer] epoch #480 | Training... +2025-04-03 21:07:58 | [pearl_trainer] epoch #480 | Evaluating... +2025-04-03 21:07:58 | [pearl_trainer] epoch #480 | Sampling for adapation and meta-testing... +2025-04-03 21:09:48 | [pearl_trainer] epoch #480 | Finished meta-testing... +2025-04-03 21:09:48 | [pearl_trainer] epoch #480 | Saving snapshot... +2025-04-03 21:09:49 | [pearl_trainer] epoch #480 | Saved +2025-04-03 21:09:49 | [pearl_trainer] epoch #480 | Time 113102.10 s +2025-04-03 21:09:49 | [pearl_trainer] epoch #480 | EpochTime 231.11 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 7.14873 +MetaTest/Average/AverageReturn 7.14873 +MetaTest/Average/Iteration 480 +MetaTest/Average/MaxReturn 33.7987 +MetaTest/Average/MinReturn -2.74226 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 13.5516 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 7.14873 +MetaTest/__unnamed_task__/AverageReturn 7.14873 +MetaTest/__unnamed_task__/Iteration 480 +MetaTest/__unnamed_task__/MaxReturn 33.7987 +MetaTest/__unnamed_task__/MinReturn -2.74226 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 13.5516 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 784600 +------------------------------------------------- ------------ +2025-04-03 21:10:20 | [pearl_trainer] epoch #481 | Training... +2025-04-03 21:11:43 | [pearl_trainer] epoch #481 | Evaluating... +2025-04-03 21:11:43 | [pearl_trainer] epoch #481 | Sampling for adapation and meta-testing... +2025-04-03 21:13:27 | [pearl_trainer] epoch #481 | Finished meta-testing... +2025-04-03 21:13:27 | [pearl_trainer] epoch #481 | Saving snapshot... +2025-04-03 21:13:28 | [pearl_trainer] epoch #481 | Saved +2025-04-03 21:13:28 | [pearl_trainer] epoch #481 | Time 113321.02 s +2025-04-03 21:13:28 | [pearl_trainer] epoch #481 | EpochTime 218.92 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 10.635 +MetaTest/Average/AverageReturn 10.635 +MetaTest/Average/Iteration 481 +MetaTest/Average/MaxReturn 82.9227 +MetaTest/Average/MinReturn -43.0597 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 43.3881 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 10.635 +MetaTest/__unnamed_task__/AverageReturn 10.635 +MetaTest/__unnamed_task__/Iteration 481 +MetaTest/__unnamed_task__/MaxReturn 82.9227 +MetaTest/__unnamed_task__/MinReturn -43.0597 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 43.3881 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 786200 +------------------------------------------------- ----------- +2025-04-03 21:13:58 | [pearl_trainer] epoch #482 | Training... +2025-04-03 21:15:19 | [pearl_trainer] epoch #482 | Evaluating... +2025-04-03 21:15:19 | [pearl_trainer] epoch #482 | Sampling for adapation and meta-testing... +2025-04-03 21:17:02 | [pearl_trainer] epoch #482 | Finished meta-testing... +2025-04-03 21:17:02 | [pearl_trainer] epoch #482 | Saving snapshot... +2025-04-03 21:17:03 | [pearl_trainer] epoch #482 | Saved +2025-04-03 21:17:03 | [pearl_trainer] epoch #482 | Time 113535.71 s +2025-04-03 21:17:03 | [pearl_trainer] epoch #482 | EpochTime 214.68 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -22.1609 +MetaTest/Average/AverageReturn -22.1609 +MetaTest/Average/Iteration 482 +MetaTest/Average/MaxReturn 3.92754 +MetaTest/Average/MinReturn -48.0199 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 16.6818 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -22.1609 +MetaTest/__unnamed_task__/AverageReturn -22.1609 +MetaTest/__unnamed_task__/Iteration 482 +MetaTest/__unnamed_task__/MaxReturn 3.92754 +MetaTest/__unnamed_task__/MinReturn -48.0199 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 16.6818 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 787800 +------------------------------------------------- ------------ +2025-04-03 21:17:32 | [pearl_trainer] epoch #483 | Training... +2025-04-03 21:19:04 | [pearl_trainer] epoch #483 | Evaluating... +2025-04-03 21:19:04 | [pearl_trainer] epoch #483 | Sampling for adapation and meta-testing... +2025-04-03 21:21:12 | [pearl_trainer] epoch #483 | Finished meta-testing... +2025-04-03 21:21:12 | [pearl_trainer] epoch #483 | Saving snapshot... +2025-04-03 21:21:13 | [pearl_trainer] epoch #483 | Saved +2025-04-03 21:21:13 | [pearl_trainer] epoch #483 | Time 113786.47 s +2025-04-03 21:21:13 | [pearl_trainer] epoch #483 | EpochTime 250.76 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 3.83799 +MetaTest/Average/AverageReturn 3.83799 +MetaTest/Average/Iteration 483 +MetaTest/Average/MaxReturn 27.571 +MetaTest/Average/MinReturn -21.5429 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 16.1595 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 3.83799 +MetaTest/__unnamed_task__/AverageReturn 3.83799 +MetaTest/__unnamed_task__/Iteration 483 +MetaTest/__unnamed_task__/MaxReturn 27.571 +MetaTest/__unnamed_task__/MinReturn -21.5429 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 16.1595 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 789400 +------------------------------------------------- ------------ +2025-04-03 21:21:45 | [pearl_trainer] epoch #484 | Training... +2025-04-03 21:23:41 | [pearl_trainer] epoch #484 | Evaluating... +2025-04-03 21:23:41 | [pearl_trainer] epoch #484 | Sampling for adapation and meta-testing... +2025-04-03 21:25:31 | [pearl_trainer] epoch #484 | Finished meta-testing... +2025-04-03 21:25:31 | [pearl_trainer] epoch #484 | Saving snapshot... +2025-04-03 21:25:32 | [pearl_trainer] epoch #484 | Saved +2025-04-03 21:25:32 | [pearl_trainer] epoch #484 | Time 114045.42 s +2025-04-03 21:25:32 | [pearl_trainer] epoch #484 | EpochTime 258.95 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 10.6699 +MetaTest/Average/AverageReturn 10.6699 +MetaTest/Average/Iteration 484 +MetaTest/Average/MaxReturn 58.0889 +MetaTest/Average/MinReturn -28.9702 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 27.8654 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 10.6699 +MetaTest/__unnamed_task__/AverageReturn 10.6699 +MetaTest/__unnamed_task__/Iteration 484 +MetaTest/__unnamed_task__/MaxReturn 58.0889 +MetaTest/__unnamed_task__/MinReturn -28.9702 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 27.8654 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 791000 +------------------------------------------------- ----------- +2025-04-03 21:26:02 | [pearl_trainer] epoch #485 | Training... +2025-04-03 21:27:29 | [pearl_trainer] epoch #485 | Evaluating... +2025-04-03 21:27:29 | [pearl_trainer] epoch #485 | Sampling for adapation and meta-testing... +2025-04-03 21:29:15 | [pearl_trainer] epoch #485 | Finished meta-testing... +2025-04-03 21:29:15 | [pearl_trainer] epoch #485 | Saving snapshot... +2025-04-03 21:29:16 | [pearl_trainer] epoch #485 | Saved +2025-04-03 21:29:16 | [pearl_trainer] epoch #485 | Time 114268.76 s +2025-04-03 21:29:16 | [pearl_trainer] epoch #485 | EpochTime 223.34 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 50.5523 +MetaTest/Average/AverageReturn 50.5523 +MetaTest/Average/Iteration 485 +MetaTest/Average/MaxReturn 107.46 +MetaTest/Average/MinReturn -57.5906 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 67.5577 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 50.5523 +MetaTest/__unnamed_task__/AverageReturn 50.5523 +MetaTest/__unnamed_task__/Iteration 485 +MetaTest/__unnamed_task__/MaxReturn 107.46 +MetaTest/__unnamed_task__/MinReturn -57.5906 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 67.5577 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 792600 +------------------------------------------------- ----------- +2025-04-03 21:29:46 | [pearl_trainer] epoch #486 | Training... +2025-04-03 21:31:11 | [pearl_trainer] epoch #486 | Evaluating... +2025-04-03 21:31:11 | [pearl_trainer] epoch #486 | Sampling for adapation and meta-testing... +2025-04-03 21:32:55 | [pearl_trainer] epoch #486 | Finished meta-testing... +2025-04-03 21:32:55 | [pearl_trainer] epoch #486 | Saving snapshot... +2025-04-03 21:32:57 | [pearl_trainer] epoch #486 | Saved +2025-04-03 21:32:57 | [pearl_trainer] epoch #486 | Time 114489.52 s +2025-04-03 21:32:57 | [pearl_trainer] epoch #486 | EpochTime 220.76 s +------------------------------------------------- ------------- +MetaTest/Average/AverageDiscountedReturn 0.410183 +MetaTest/Average/AverageReturn 0.410183 +MetaTest/Average/Iteration 486 +MetaTest/Average/MaxReturn 105.28 +MetaTest/Average/MinReturn -42.7479 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 53.6983 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 0.410183 +MetaTest/__unnamed_task__/AverageReturn 0.410183 +MetaTest/__unnamed_task__/Iteration 486 +MetaTest/__unnamed_task__/MaxReturn 105.28 +MetaTest/__unnamed_task__/MinReturn -42.7479 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 53.6983 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 794200 +------------------------------------------------- ------------- +2025-04-03 21:33:27 | [pearl_trainer] epoch #487 | Training... +2025-04-03 21:34:51 | [pearl_trainer] epoch #487 | Evaluating... +2025-04-03 21:34:51 | [pearl_trainer] epoch #487 | Sampling for adapation and meta-testing... +2025-04-03 21:36:35 | [pearl_trainer] epoch #487 | Finished meta-testing... +2025-04-03 21:36:35 | [pearl_trainer] epoch #487 | Saving snapshot... +2025-04-03 21:36:36 | [pearl_trainer] epoch #487 | Saved +2025-04-03 21:36:36 | [pearl_trainer] epoch #487 | Time 114709.13 s +2025-04-03 21:36:36 | [pearl_trainer] epoch #487 | EpochTime 219.61 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 13.1286 +MetaTest/Average/AverageReturn 13.1286 +MetaTest/Average/Iteration 487 +MetaTest/Average/MaxReturn 105.639 +MetaTest/Average/MinReturn -47.0197 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 62.3143 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 13.1286 +MetaTest/__unnamed_task__/AverageReturn 13.1286 +MetaTest/__unnamed_task__/Iteration 487 +MetaTest/__unnamed_task__/MaxReturn 105.639 +MetaTest/__unnamed_task__/MinReturn -47.0197 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 62.3143 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 795800 +------------------------------------------------- ----------- +2025-04-03 21:37:06 | [pearl_trainer] epoch #488 | Training... +2025-04-03 21:38:30 | [pearl_trainer] epoch #488 | Evaluating... +2025-04-03 21:38:30 | [pearl_trainer] epoch #488 | Sampling for adapation and meta-testing... +2025-04-03 21:40:13 | [pearl_trainer] epoch #488 | Finished meta-testing... +2025-04-03 21:40:13 | [pearl_trainer] epoch #488 | Saving snapshot... +2025-04-03 21:40:14 | [pearl_trainer] epoch #488 | Saved +2025-04-03 21:40:14 | [pearl_trainer] epoch #488 | Time 114927.28 s +2025-04-03 21:40:14 | [pearl_trainer] epoch #488 | EpochTime 218.15 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn 15.0037 +MetaTest/Average/AverageReturn 15.0037 +MetaTest/Average/Iteration 488 +MetaTest/Average/MaxReturn 58.0948 +MetaTest/Average/MinReturn -5.94397 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 23.2412 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 15.0037 +MetaTest/__unnamed_task__/AverageReturn 15.0037 +MetaTest/__unnamed_task__/Iteration 488 +MetaTest/__unnamed_task__/MaxReturn 58.0948 +MetaTest/__unnamed_task__/MinReturn -5.94397 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 23.2412 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 797400 +------------------------------------------------- ------------ +2025-04-03 21:40:44 | [pearl_trainer] epoch #489 | Training... +2025-04-03 21:42:06 | [pearl_trainer] epoch #489 | Evaluating... +2025-04-03 21:42:06 | [pearl_trainer] epoch #489 | Sampling for adapation and meta-testing... +2025-04-03 21:43:49 | [pearl_trainer] epoch #489 | Finished meta-testing... +2025-04-03 21:43:49 | [pearl_trainer] epoch #489 | Saving snapshot... +2025-04-03 21:43:50 | [pearl_trainer] epoch #489 | Saved +2025-04-03 21:43:50 | [pearl_trainer] epoch #489 | Time 115143.43 s +2025-04-03 21:43:50 | [pearl_trainer] epoch #489 | EpochTime 216.15 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 19.8231 +MetaTest/Average/AverageReturn 19.8231 +MetaTest/Average/Iteration 489 +MetaTest/Average/MaxReturn 94.5544 +MetaTest/Average/MinReturn -50.9513 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 61.1562 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 19.8231 +MetaTest/__unnamed_task__/AverageReturn 19.8231 +MetaTest/__unnamed_task__/Iteration 489 +MetaTest/__unnamed_task__/MaxReturn 94.5544 +MetaTest/__unnamed_task__/MinReturn -50.9513 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 61.1562 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 799000 +------------------------------------------------- ----------- +2025-04-03 21:44:21 | [pearl_trainer] epoch #490 | Training... +2025-04-03 21:45:45 | [pearl_trainer] epoch #490 | Evaluating... +2025-04-03 21:45:45 | [pearl_trainer] epoch #490 | Sampling for adapation and meta-testing... +2025-04-03 21:47:29 | [pearl_trainer] epoch #490 | Finished meta-testing... +2025-04-03 21:47:29 | [pearl_trainer] epoch #490 | Saving snapshot... +2025-04-03 21:47:30 | [pearl_trainer] epoch #490 | Saved +2025-04-03 21:47:30 | [pearl_trainer] epoch #490 | Time 115363.09 s +2025-04-03 21:47:30 | [pearl_trainer] epoch #490 | EpochTime 219.66 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 10.0747 +MetaTest/Average/AverageReturn 10.0747 +MetaTest/Average/Iteration 490 +MetaTest/Average/MaxReturn 109.499 +MetaTest/Average/MinReturn -25.8448 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 50.6889 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 10.0747 +MetaTest/__unnamed_task__/AverageReturn 10.0747 +MetaTest/__unnamed_task__/Iteration 490 +MetaTest/__unnamed_task__/MaxReturn 109.499 +MetaTest/__unnamed_task__/MinReturn -25.8448 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 50.6889 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 800600 +------------------------------------------------- ----------- +2025-04-03 21:48:01 | [pearl_trainer] epoch #491 | Training... +2025-04-03 21:49:28 | [pearl_trainer] epoch #491 | Evaluating... +2025-04-03 21:49:28 | [pearl_trainer] epoch #491 | Sampling for adapation and meta-testing... +2025-04-03 21:51:10 | [pearl_trainer] epoch #491 | Finished meta-testing... +2025-04-03 21:51:10 | [pearl_trainer] epoch #491 | Saving snapshot... +2025-04-03 21:51:11 | [pearl_trainer] epoch #491 | Saved +2025-04-03 21:51:11 | [pearl_trainer] epoch #491 | Time 115584.49 s +2025-04-03 21:51:11 | [pearl_trainer] epoch #491 | EpochTime 221.39 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 24.5031 +MetaTest/Average/AverageReturn 24.5031 +MetaTest/Average/Iteration 491 +MetaTest/Average/MaxReturn 109.485 +MetaTest/Average/MinReturn -30.5692 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 59.5138 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 24.5031 +MetaTest/__unnamed_task__/AverageReturn 24.5031 +MetaTest/__unnamed_task__/Iteration 491 +MetaTest/__unnamed_task__/MaxReturn 109.485 +MetaTest/__unnamed_task__/MinReturn -30.5692 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 59.5138 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 802200 +------------------------------------------------- ----------- +2025-04-03 21:51:41 | [pearl_trainer] epoch #492 | Training... +2025-04-03 21:53:01 | [pearl_trainer] epoch #492 | Evaluating... +2025-04-03 21:53:01 | [pearl_trainer] epoch #492 | Sampling for adapation and meta-testing... +2025-04-03 21:54:44 | [pearl_trainer] epoch #492 | Finished meta-testing... +2025-04-03 21:54:44 | [pearl_trainer] epoch #492 | Saving snapshot... +2025-04-03 21:54:45 | [pearl_trainer] epoch #492 | Saved +2025-04-03 21:54:45 | [pearl_trainer] epoch #492 | Time 115797.54 s +2025-04-03 21:54:45 | [pearl_trainer] epoch #492 | EpochTime 213.04 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -9.26119 +MetaTest/Average/AverageReturn -9.26119 +MetaTest/Average/Iteration 492 +MetaTest/Average/MaxReturn 19.3871 +MetaTest/Average/MinReturn -54.4423 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 26.4983 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -9.26119 +MetaTest/__unnamed_task__/AverageReturn -9.26119 +MetaTest/__unnamed_task__/Iteration 492 +MetaTest/__unnamed_task__/MaxReturn 19.3871 +MetaTest/__unnamed_task__/MinReturn -54.4423 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 26.4983 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 803800 +------------------------------------------------- ------------ +2025-04-03 21:55:14 | [pearl_trainer] epoch #493 | Training... +2025-04-03 21:56:45 | [pearl_trainer] epoch #493 | Evaluating... +2025-04-03 21:56:45 | [pearl_trainer] epoch #493 | Sampling for adapation and meta-testing... +2025-04-03 21:58:26 | [pearl_trainer] epoch #493 | Finished meta-testing... +2025-04-03 21:58:26 | [pearl_trainer] epoch #493 | Saving snapshot... +2025-04-03 21:58:28 | [pearl_trainer] epoch #493 | Saved +2025-04-03 21:58:28 | [pearl_trainer] epoch #493 | Time 116020.58 s +2025-04-03 21:58:28 | [pearl_trainer] epoch #493 | EpochTime 223.04 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 15.828 +MetaTest/Average/AverageReturn 15.828 +MetaTest/Average/Iteration 493 +MetaTest/Average/MaxReturn 62.5891 +MetaTest/Average/MinReturn -27.3296 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 37.3972 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 15.828 +MetaTest/__unnamed_task__/AverageReturn 15.828 +MetaTest/__unnamed_task__/Iteration 493 +MetaTest/__unnamed_task__/MaxReturn 62.5891 +MetaTest/__unnamed_task__/MinReturn -27.3296 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 37.3972 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 805400 +------------------------------------------------- ----------- +2025-04-03 21:58:57 | [pearl_trainer] epoch #494 | Training... +2025-04-03 22:00:24 | [pearl_trainer] epoch #494 | Evaluating... +2025-04-03 22:00:24 | [pearl_trainer] epoch #494 | Sampling for adapation and meta-testing... +2025-04-03 22:02:08 | [pearl_trainer] epoch #494 | Finished meta-testing... +2025-04-03 22:02:08 | [pearl_trainer] epoch #494 | Saving snapshot... +2025-04-03 22:02:09 | [pearl_trainer] epoch #494 | Saved +2025-04-03 22:02:09 | [pearl_trainer] epoch #494 | Time 116242.02 s +2025-04-03 22:02:09 | [pearl_trainer] epoch #494 | EpochTime 221.44 s +------------------------------------------------- ------------- +MetaTest/Average/AverageDiscountedReturn -33.4417 +MetaTest/Average/AverageReturn -33.4417 +MetaTest/Average/Iteration 494 +MetaTest/Average/MaxReturn -0.628851 +MetaTest/Average/MinReturn -57.3461 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 25.0415 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -33.4417 +MetaTest/__unnamed_task__/AverageReturn -33.4417 +MetaTest/__unnamed_task__/Iteration 494 +MetaTest/__unnamed_task__/MaxReturn -0.628851 +MetaTest/__unnamed_task__/MinReturn -57.3461 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 25.0415 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 807000 +------------------------------------------------- ------------- +2025-04-03 22:02:39 | [pearl_trainer] epoch #495 | Training... +2025-04-03 22:04:04 | [pearl_trainer] epoch #495 | Evaluating... +2025-04-03 22:04:04 | [pearl_trainer] epoch #495 | Sampling for adapation and meta-testing... +2025-04-03 22:05:48 | [pearl_trainer] epoch #495 | Finished meta-testing... +2025-04-03 22:05:48 | [pearl_trainer] epoch #495 | Saving snapshot... +2025-04-03 22:05:49 | [pearl_trainer] epoch #495 | Saved +2025-04-03 22:05:49 | [pearl_trainer] epoch #495 | Time 116462.24 s +2025-04-03 22:05:49 | [pearl_trainer] epoch #495 | EpochTime 220.21 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 11.3229 +MetaTest/Average/AverageReturn 11.3229 +MetaTest/Average/Iteration 495 +MetaTest/Average/MaxReturn 99.4591 +MetaTest/Average/MinReturn -66.8605 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 71.2398 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 11.3229 +MetaTest/__unnamed_task__/AverageReturn 11.3229 +MetaTest/__unnamed_task__/Iteration 495 +MetaTest/__unnamed_task__/MaxReturn 99.4591 +MetaTest/__unnamed_task__/MinReturn -66.8605 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 71.2398 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 808600 +------------------------------------------------- ----------- +2025-04-03 22:06:19 | [pearl_trainer] epoch #496 | Training... +2025-04-03 22:07:43 | [pearl_trainer] epoch #496 | Evaluating... +2025-04-03 22:07:43 | [pearl_trainer] epoch #496 | Sampling for adapation and meta-testing... +2025-04-03 22:09:27 | [pearl_trainer] epoch #496 | Finished meta-testing... +2025-04-03 22:09:27 | [pearl_trainer] epoch #496 | Saving snapshot... +2025-04-03 22:09:28 | [pearl_trainer] epoch #496 | Saved +2025-04-03 22:09:28 | [pearl_trainer] epoch #496 | Time 116681.33 s +2025-04-03 22:09:28 | [pearl_trainer] epoch #496 | EpochTime 219.09 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 10.5874 +MetaTest/Average/AverageReturn 10.5874 +MetaTest/Average/Iteration 496 +MetaTest/Average/MaxReturn 69.6008 +MetaTest/Average/MinReturn -40.5837 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 46.7872 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 10.5874 +MetaTest/__unnamed_task__/AverageReturn 10.5874 +MetaTest/__unnamed_task__/Iteration 496 +MetaTest/__unnamed_task__/MaxReturn 69.6008 +MetaTest/__unnamed_task__/MinReturn -40.5837 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 46.7872 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 810200 +------------------------------------------------- ----------- +2025-04-03 22:09:58 | [pearl_trainer] epoch #497 | Training... +2025-04-03 22:11:23 | [pearl_trainer] epoch #497 | Evaluating... +2025-04-03 22:11:23 | [pearl_trainer] epoch #497 | Sampling for adapation and meta-testing... +2025-04-03 22:13:11 | [pearl_trainer] epoch #497 | Finished meta-testing... +2025-04-03 22:13:11 | [pearl_trainer] epoch #497 | Saving snapshot... +2025-04-03 22:13:12 | [pearl_trainer] epoch #497 | Saved +2025-04-03 22:13:12 | [pearl_trainer] epoch #497 | Time 116904.66 s +2025-04-03 22:13:12 | [pearl_trainer] epoch #497 | EpochTime 223.33 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn 22.4041 +MetaTest/Average/AverageReturn 22.4041 +MetaTest/Average/Iteration 497 +MetaTest/Average/MaxReturn 52.2851 +MetaTest/Average/MinReturn -13.4534 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 27.0072 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn 22.4041 +MetaTest/__unnamed_task__/AverageReturn 22.4041 +MetaTest/__unnamed_task__/Iteration 497 +MetaTest/__unnamed_task__/MaxReturn 52.2851 +MetaTest/__unnamed_task__/MinReturn -13.4534 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 27.0072 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 811800 +------------------------------------------------- ----------- +2025-04-03 22:13:42 | [pearl_trainer] epoch #498 | Training... +2025-04-03 22:15:12 | [pearl_trainer] epoch #498 | Evaluating... +2025-04-03 22:15:12 | [pearl_trainer] epoch #498 | Sampling for adapation and meta-testing... +2025-04-03 22:17:03 | [pearl_trainer] epoch #498 | Finished meta-testing... +2025-04-03 22:17:03 | [pearl_trainer] epoch #498 | Saving snapshot... +2025-04-03 22:17:04 | [pearl_trainer] epoch #498 | Saved +2025-04-03 22:17:04 | [pearl_trainer] epoch #498 | Time 117137.26 s +2025-04-03 22:17:04 | [pearl_trainer] epoch #498 | EpochTime 232.60 s +------------------------------------------------- ------------ +MetaTest/Average/AverageDiscountedReturn -1.89038 +MetaTest/Average/AverageReturn -1.89038 +MetaTest/Average/Iteration 498 +MetaTest/Average/MaxReturn 43.8129 +MetaTest/Average/MinReturn -50.9332 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 35.4336 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -1.89038 +MetaTest/__unnamed_task__/AverageReturn -1.89038 +MetaTest/__unnamed_task__/Iteration 498 +MetaTest/__unnamed_task__/MaxReturn 43.8129 +MetaTest/__unnamed_task__/MinReturn -50.9332 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 35.4336 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 813400 +------------------------------------------------- ------------ +2025-04-03 22:17:36 | [pearl_trainer] epoch #499 | Training... +2025-04-03 22:19:00 | [pearl_trainer] epoch #499 | Evaluating... +2025-04-03 22:19:00 | [pearl_trainer] epoch #499 | Sampling for adapation and meta-testing... +2025-04-03 22:21:05 | [pearl_trainer] epoch #499 | Finished meta-testing... +2025-04-03 22:21:05 | [pearl_trainer] epoch #499 | Saving snapshot... +2025-04-03 22:21:06 | [pearl_trainer] epoch #499 | Saved +2025-04-03 22:21:06 | [pearl_trainer] epoch #499 | Time 117378.66 s +2025-04-03 22:21:06 | [pearl_trainer] epoch #499 | EpochTime 241.40 s +------------------------------------------------- ----------- +MetaTest/Average/AverageDiscountedReturn -17.5821 +MetaTest/Average/AverageReturn -17.5821 +MetaTest/Average/Iteration 499 +MetaTest/Average/MaxReturn 115.778 +MetaTest/Average/MinReturn -62.8263 +MetaTest/Average/NumEpisodes 5 +MetaTest/Average/StdReturn 67.3179 +MetaTest/Average/TerminationRate 0 +MetaTest/__unnamed_task__/AverageDiscountedReturn -17.5821 +MetaTest/__unnamed_task__/AverageReturn -17.5821 +MetaTest/__unnamed_task__/Iteration 499 +MetaTest/__unnamed_task__/MaxReturn 115.778 +MetaTest/__unnamed_task__/MinReturn -62.8263 +MetaTest/__unnamed_task__/NumEpisodes 5 +MetaTest/__unnamed_task__/StdReturn 67.3179 +MetaTest/__unnamed_task__/TerminationRate 0 +TotalEnvSteps 815000 +------------------------------------------------- -----------