---
language: en
license: mit
tags:
- chess
- reinforcement-learning
- alphazero
- pytorch
library_name: pytorch
---

# Chess Bot Model - TinyPCN

A chess-playing neural network trained on expert games from the Lichess Elite Database.

## Model Description

This is a policy-value network inspired by AlphaZero, designed to evaluate chess positions and suggest moves.

### Architecture

- **Input**: 18-plane board representation (12 piece planes + 6 metadata planes)
- **Convolutional backbone**: 32 filters, 1 residual block; ~9.6M parameters total (9,611,202)
- **Policy head**: 4,672-dimensional output (one logit per move in the encoding)
- **Value head**: single tanh output in [-1, +1] for position evaluation

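The card does not include a layer-by-layer definition, so the following is only a minimal sketch of a policy-value network consistent with the numbers above (18 input planes, 32 filters, one residual block, a 4,672-way policy head, a tanh value head). `TinyPCNSketch` and its internal layer sizes are assumptions for illustration, not the published `model.py`:

```python
import torch
import torch.nn as nn


class TinyPCNSketch(nn.Module):
    """Illustrative policy-value net; layer sizes are assumptions,
    not the actual TinyPCN definition from model.py."""

    def __init__(self, board_channels=18, filters=32, policy_size=4672):
        super().__init__()
        # Stem: project the 18 input planes to the filter width
        self.stem = nn.Sequential(
            nn.Conv2d(board_channels, filters, 3, padding=1),
            nn.BatchNorm2d(filters),
            nn.ReLU(),
        )
        # One residual block (two 3x3 convs with a skip connection)
        self.res = nn.Sequential(
            nn.Conv2d(filters, filters, 3, padding=1),
            nn.BatchNorm2d(filters),
            nn.ReLU(),
            nn.Conv2d(filters, filters, 3, padding=1),
            nn.BatchNorm2d(filters),
        )
        flat = filters * 8 * 8  # 8x8 board, no spatial downsampling
        # Policy head: one logit per move index
        self.policy_head = nn.Linear(flat, policy_size)
        # Value head: scalar in [-1, +1] via tanh
        self.value_head = nn.Sequential(
            nn.Linear(flat, 64),
            nn.ReLU(),
            nn.Linear(64, 1),
            nn.Tanh(),
        )

    def forward(self, x):
        h = self.stem(x)
        h = torch.relu(h + self.res(h))  # residual connection
        h = h.flatten(1)
        return self.policy_head(h), self.value_head(h)
```

Almost all of the parameter budget sits in the policy head (2,048 features x 4,672 moves), which is how a net this shallow still lands near 9.6M parameters.
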
### Training Data

- **Dataset**: Lichess Elite Database (games by players rated 2200+ Elo)
- **Positions trained on**: 16,800,000
- **Epochs**: 10

### Performance

- **Final Policy Loss**: 2.8000
- **Final Value Loss**: 0.8500

### Usage

```python
import torch
import chess

from model import TinyPCN, encode_board

# Load model
model = TinyPCN(board_channels=18, policy_size=4672)
model.load_state_dict(torch.load("chess_model.pth", map_location="cpu"))
model.eval()

# Evaluate a position
board = chess.Board()  # or chess.Board("fen string")
board_tensor = encode_board(board).unsqueeze(0)

with torch.no_grad():
    policy_logits, value = model(board_tensor)

# Value interpretation (from the current player's perspective):
# +1.0 = winning for current player
#  0.0 = drawn/equal position
# -1.0 = losing for current player
print(f"Position evaluation: {value.item():.4f}")
```

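The policy head scores every move index, legal or not, so in practice you renormalize the logits over the legal moves only. How a `chess.Move` maps into the 4,672-dimensional vector is defined by the move encoding in `model.py` and is assumed here; the masking pattern itself is generic:

```python
import torch


def masked_move_probs(policy_logits, legal_indices):
    """Softmax over legal move indices only; illegal moves get probability 0.

    legal_indices holds the positions of the legal moves inside the
    4,672-dim policy vector -- that mapping lives in model.py and is
    assumed here.
    """
    masked = torch.full_like(policy_logits, float("-inf"))
    masked[legal_indices] = policy_logits[legal_indices]
    return torch.softmax(masked, dim=-1)


# Toy demonstration: pretend indices 0, 10, and 20 are the legal moves.
logits = torch.zeros(4672)
logits[10] = 2.0
probs = masked_move_probs(logits, torch.tensor([0, 10, 20]))
best_index = int(probs.argmax())  # index 10 wins after masking
```

Because illegal entries are set to `-inf` before the softmax, they come out with exactly zero probability and the legal moves sum to 1.
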
### Model Files

- `chess_model.pth` - PyTorch model weights
- `model.py` - Model architecture and board encoding
- `mcts.py` - Monte Carlo Tree Search implementation
- `requirements.txt` - Python dependencies

### Limitations

- Trained on expert games only (no self-play yet)
- Lightweight architecture for educational purposes
- May not handle unusual openings or endgames well

### Training Details

- **Framework**: PyTorch
- **Optimizer**: Adam
- **Learning Rate**: 0.001
- **Batch Size**: 256
- **Loss Functions**: CrossEntropyLoss (policy) + MSELoss (value)

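Putting those hyperparameters together, a single optimization step presumably looks like the sketch below. `train_step` is a hypothetical helper, not code from this repo, and the two losses are simply summed, which is the common AlphaZero-style formulation (the actual weighting used for TinyPCN is not stated):

```python
import torch
import torch.nn as nn

# optimizer = torch.optim.Adam(model.parameters(), lr=0.001)  # per the card


def train_step(model, optimizer, boards, move_targets, value_targets):
    """One combined policy+value update (hypothetical sketch).

    boards:        (B, 18, 8, 8) encoded positions
    move_targets:  (B,) move indices in [0, 4672)
    value_targets: (B,) game outcomes in [-1, +1]
    """
    policy_loss_fn = nn.CrossEntropyLoss()  # policy head vs. played move
    value_loss_fn = nn.MSELoss()            # value head vs. game outcome

    policy_logits, values = model(boards)
    loss = (policy_loss_fn(policy_logits, move_targets)
            + value_loss_fn(values.squeeze(-1), value_targets))

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

With a batch size of 256 and 16,800,000 positions, one epoch is about 65,625 such steps.
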
### Authors

Created as part of an AlphaZero-style chess engine project.

### License

MIT License