Commit fef09a8 · Parent(s): 8f5b016 · Environment Solved

README.md CHANGED
@@ -53,4 +53,11 @@ python -m sf_examples.atari.train_atari --algo=APPO --env=atari_atlantis --train
 ```
 
 Note: you may have to adjust `--train_for_env_steps` to a suitably high number, as the experiment will resume at the number of steps it concluded at.
-
+
+## SOTA Performance
+
+This model, as with all the others, was trained for 10 million steps to create a baseline. Interestingly, in this environment it reaches SOTA performance even at this level, suggesting that the Atlantis game is fairly easy to beat.
+
+For more information on this environment see: https://www.endtoend.ai/envs/gym/atari/atlantis/. Because rewards are plentiful and the Gorgons have to pass 4 times to reach attack range, the environment is relatively easy to reach SOTA on.
+
+I have now compared this with the performance of the TQC, SAC, and DQN models, all of which underperformed PPO. I now consider this Atari environment solved.
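The resume workflow described in the README's note can be sketched as follows. This is a hedged example, not the author's exact command: the `--experiment` name and the 20M step budget are assumptions, while `--algo`, `--env`, and `--train_for_env_steps` are the flags named in the diff above.

```shell
# Resume the Atlantis APPO run with a higher step budget.
# The experiment name "atlantis_appo" is hypothetical; Sample Factory
# resumes from the checkpoint directory matching --experiment, so
# --train_for_env_steps must exceed the step count already reached.
python -m sf_examples.atari.train_atari \
    --algo=APPO \
    --env=atari_atlantis \
    --experiment=atlantis_appo \
    --train_for_env_steps=20000000
```

If the new `--train_for_env_steps` value is not higher than the step count at which the previous run stopped, the resumed run exits immediately, which is the behavior the note warns about.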