DQN playing Space Invaders

Trained with Deep Q-Learning (DQN) using stable-baselines3.

Training

  • Algorithm: DQN
  • Environment: SpaceInvadersNoFrameskip-v4 (Atari)
  • Timesteps: 10,000,000
  • Environments: 8 parallel
  • Batch size: 256
  • GPU: Quadro RTX 5000 (16GB)
  • Training time: ~5.5 hours

Results

  • Final evaluation reward: 1175 +/ 216
  • Certification score: 959 (threshold: 200)

Replay

Replay

Downloads last month
1
Video Preview
loading

Evaluation results

  • mean_reward on Space Invaders (Atari)
    self-reported
    1175.00 +/ 215.89