TicTacChess EfficientZero Checkpoint

This repository stores retained CRPT EfficientZero checkpoints for tic_tac_chess.

Latest Upload

  • Checkpoint: checkpoints/envstep_100000.pth.tar
  • Source path: models/simplified5_constant_lr_fixed_20260519/round-01/tic_tac_chess/attempt-01_260519_093506/ckpt/envstep_100000.pth.tar
  • Uploaded at: 2026-05-21T09:56:43Z
  • Selection: latest retained env-step checkpoint from the previous simplified5 training run per user instruction.

Metadata and resolved runtime config snapshots are under metadata/.

Latest Best Checkpoint

The current latest best checkpoint pointer for this repository is:

checkpoints/main5_bot_only_stabilize_overnight_20260524/ckpt_last.pth.tar

Metadata:

  • Game: tic_tac_chess
  • Source run: main5_bot_only_stabilize_overnight_20260524
  • Checkpoint role: final bot-only stabilization ckpt_last.pth.tar
  • Local source at upload: /mnt/nvme/home/molfetta/molfetta-reasoning/models/main5_bot_only_stabilize_overnight_20260524/tic_tac_chess/attempt-01_260524_201957/ckpt/ckpt_last.pth.tar
  • Checkpoint SHA256 at upload: 53046fdb3c54669f8926e426f4e7005d380ffb5dbb6d195a1bb0fd63158476f2
  • Final env steps: 200050
  • Validation note: final watchdog gate on 2026-05-24/2026-05-25 reported the last three non-initial fixed-bot evals at reward_mean=1.0; one-sided fixed-bot gate by experiment design; seat_swap=false and seat1_reward_mean=1.0.
  • Upload manifest: metadata/main5_bot_only_stabilize_overnight_20260524_latest_best_manifest.json

Older checkpoint files in this repository are preserved; this section is the canonical pointer to use when a consumer needs the latest best checkpoint.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support