battleship-ppo / README.md
Hanks1234's picture
Update model card
35a8b9a verified
metadata
library_name: stable-baselines3
tags:
  - reinforcement-learning
  - battleship
  - ppo
  - maskable-ppo
  - sb3-contrib
  - custom-environment

Battleship PPO Agent — Hanks1234/battleship-ppo

A MaskablePPO agent trained on a 10x20 Battleship board with custom T-shaped and Z-shaped ships using sb3-contrib.

Environment

  • Board: 10 columns x 20 rows
  • Ships: 10 ships including T-shaped Battleships and Z-shaped Carriers
  • Observation: 5-channel binary image (5, 20, 10)
  • Action: Discrete(200) with action masking (no repeat shots)

Usage

from training.hub import load_model_from_hub

model = load_model_from_hub("Hanks1234/battleship-ppo")