TicTacChess EfficientZero Checkpoint

This repository stores retained CRPT EfficientZero checkpoints for tic_tac_chess.

Latest Upload

Checkpoint: checkpoints/envstep_100000.pth.tar
Source path: models/simplified5_constant_lr_fixed_20260519/round-01/tic_tac_chess/attempt-01_260519_093506/ckpt/envstep_100000.pth.tar
Uploaded at: 2026-05-21T09:56:43Z
Selection: latest retained env-step checkpoint from the previous simplified5 training run per user instruction.
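
The selection rule above ("latest retained env-step checkpoint") can be illustrated with a small sketch. It assumes retained checkpoints follow the envstep_<N>.pth.tar naming seen in the path above; the function name and any file names other than envstep_100000.pth.tar are hypothetical, and this is not necessarily how the upload was actually selected.

```python
import re
from pathlib import Path

# Assumed naming scheme, inferred from "envstep_100000.pth.tar".
ENVSTEP_PATTERN = re.compile(r"^envstep_(\d+)\.pth\.tar$")


def latest_envstep_checkpoint(filenames):
    """Return the entry with the highest env step, or None if none match.

    Hypothetical helper: entries that do not follow the assumed
    envstep_<N>.pth.tar scheme (for example ckpt_last.pth.tar) are ignored.
    """
    best_name, best_step = None, -1
    for name in filenames:
        match = ENVSTEP_PATTERN.match(Path(name).name)
        if match is None:
            continue
        step = int(match.group(1))
        if step > best_step:
            best_name, best_step = name, step
    return best_name
```

Comparing the parsed integer rather than the raw string matters here, because a plain string sort would rank envstep_50000 above envstep_100000.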

Metadata and resolved runtime config snapshots are under metadata/.

Latest Best Checkpoint

The latest best checkpoint for this repository is currently:

checkpoints/main5_bot_only_stabilize_overnight_20260524/ckpt_last.pth.tar

Metadata:

Game: tic_tac_chess
Source run: main5_bot_only_stabilize_overnight_20260524
Checkpoint role: final bot-only stabilization ckpt_last.pth.tar
Local source at upload: /mnt/nvme/home/molfetta/molfetta-reasoning/models/main5_bot_only_stabilize_overnight_20260524/tic_tac_chess/attempt-01_260524_201957/ckpt/ckpt_last.pth.tar
Checkpoint SHA256 at upload: 53046fdb3c54669f8926e426f4e7005d380ffb5dbb6d195a1bb0fd63158476f2
Final env steps: 200050
Validation note: the final watchdog gate on 2026-05-24/2026-05-25 reported reward_mean=1.0 for the last three non-initial fixed-bot evals. The fixed-bot gate is one-sided by experiment design: seat_swap=false and seat1_reward_mean=1.0.
Upload manifest: metadata/main5_bot_only_stabilize_overnight_20260524_latest_best_manifest.json
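
A consumer may want to confirm that a downloaded copy matches the SHA256 recorded above. The following is a minimal sketch using only the Python standard library; the function names and the local path in the usage note are assumptions, not part of this repository.

```python
import hashlib

# Value copied from "Checkpoint SHA256 at upload" above.
EXPECTED_SHA256 = "53046fdb3c54669f8926e426f4e7005d380ffb5dbb6d195a1bb0fd63158476f2"


def sha256_of_file(path, chunk_size=1 << 20):
    """Hash a file in chunks so a large checkpoint is never fully in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 equals the expected hex digest."""
    return sha256_of_file(path) == expected.lower()
```

For example, matches_expected("ckpt_last.pth.tar") should return True for an intact copy, assuming the file was saved under that local name.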

Older checkpoint files in this repository are preserved; this section is the canonical pointer to use when a consumer needs the latest best checkpoint.
