---
language: en
license: mit
tags:
- chess
- reinforcement-learning
- alphazero
- pytorch
library_name: pytorch
---

# Chess Bot Model - TinyPCN

A chess playing neural network trained on expert games from the Lichess Elite Database.

## Model Description

This is a policy-value network inspired by AlphaZero, designed to evaluate chess positions and suggest moves.

### Architecture

- **Input**: 18-plane board representation (12 pieces + 6 metadata planes)
- **Convolutional backbone**: 32 filters, 1 residual block, ~9,611,202 parameters
- **Policy head**: 4,672-dimensional output (one per legal move encoding)
- **Value head**: Single tanh output (-1 to +1 for position evaluation)

### Training Data

- **Dataset**: Lichess Elite Database (games from 2200+ ELO players)
- **Positions trained**: 16,800,000
- **Epochs**: 10

### Performance

- **Final Policy Loss**: 2.8000
- **Final Value Loss**: 0.8500

### Usage

```python
import torch
import chess
from model import TinyPCN, encode_board

# Load model
model = TinyPCN(board_channels=18, policy_size=4672)
model.load_state_dict(torch.load("chess_model.pth"))
model.eval()

# Evaluate a position
board = chess.Board()  # or chess.Board("fen string")
board_tensor = encode_board(board).unsqueeze(0)

with torch.no_grad():
    policy_logits, value = model(board_tensor)

# Value interpretation:
# +1.0 = winning for current player
#  0.0 = drawn/equal position
# -1.0 = losing for current player

print(f"Position evaluation: {value.item():.4f}")
```

### Model Files

- `chess_model.pth` - PyTorch model weights
- `model.py` - Model architecture and board encoding
- `mcts.py` - Monte Carlo Tree Search implementation
- `requirements.txt` - Python dependencies

### Limitations

- Trained on expert games only (no self-play yet)
- Lightweight architecture for educational purposes
- May not handle unusual openings or endgames well

### Training Details

- **Framework**: PyTorch
- **Optimizer**: Adam
- **Learning Rate**: 0.001
- **Batch Size**: 256
- **Loss Functions**: CrossEntropyLoss (policy) + MSELoss (value)

### Authors

Created as part of an AlphaZero-style chess engine project.

### License

MIT License