Kona Sudoku EBM
Energy-based Sudoku solver trained with contrastive learning on puzzle/solution pairs.
Model
- Architecture: CNN + Transformer encoder energy head (
KonaEnergyModel) - Weights:
kona_ebm_best.pt - Best epoch: 9
- Hidden dim: 64, Layers: 2, Heads: 4
Usage
Clone the solver code from GitHub and download this checkpoint:
pip install torch typer tqdm numpy huggingface_hub
hf download Sph3inxz/ebm-sudoku-solver kona_ebm_best.pt --local-dir checkpoints
Solve a puzzle:
python -m kona_sudoku.main solve data/my_puzzle.txt `
--checkpoint checkpoints/kona_ebm_best.pt `
--device cuda `
--max-steps 3000 `
--langevin
Inference settings
Recommended solver flags:
--max-steps 3000--langevin--lambda-learned 0.1--lambda-constraints 1.0
Training
Trained on train_large.txt with AdamW, contrastive EBM loss (margin=2.0), and MCMC negatives.
- Downloads last month
- 19
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support