| {"hidden_sizes": [16], "n_train_episodes": 1000, "n_eval_episodes": 10, "max_steps": 1000, "learning_rate": 0.01, "gamma": 1.0} |
| {"hidden_sizes": [16], "n_train_episodes": 1000, "n_eval_episodes": 10, "max_steps": 1000, "learning_rate": 0.01, "gamma": 1.0} |