Add UCI engine generation, Parquet data pipeline, and Lc0/Stockfish containers 4c4517f thomas-schweich commited on about 10 hours ago
Add Lichess dataset extraction pipeline and update pod.sh 830a330 thomas-schweich commited on about 13 hours ago
Add RoSA adapter with gradient-informed sparse masks (#3) 95f9aba unverified thomas-schweich commited on about 14 hours ago
Fix terminal position diagnostics: report pad_prob for checkmate/stalemate, not legal_rate 42346d4 thomas-schweich commited on about 18 hours ago
Add sweep migration script (rsync between pods via local staging) f5fd4da thomas-schweich commited on 1 day ago
Fix sweep: don't pass --epochs to train.py, handle all-pruned results e177cce thomas-schweich commited on 1 day ago
Fix: create SQLite parent directory before Optuna study creation e9ecc7f thomas-schweich commited on 1 day ago
Architecture sweep: GPU affinity, arch search space, train.py overrides 0fe8a5f thomas-schweich commited on 1 day ago
Fix outcome array dtype in ceiling computation, np.asarray in script 75cf8a6 thomas-schweich commited on 1 day ago
Fix bugs, performance issues, and doc errors from code review ae46efa thomas-schweich commited on 1 day ago
Add theoretical accuracy ceiling computation (E[1/N_legal] and outcome-conditioned) 44311e2 thomas-schweich commited on 1 day ago
Log patience counter, best val loss/step in val records a050f72 thomas-schweich commited on 1 day ago
Per-model early stopping: freeze converged variants individually 190085d thomas-schweich commited on 1 day ago
Monitor script: show step time, games/sec, ETA from synced metrics 5a4ed63 thomas-schweich commited on 1 day ago
Fix monitor script to show per-variant metrics from pod 8e86dac thomas-schweich commited on 1 day ago
Fix SSH: generate host keys, use 'ip' field from runpodctl a47b56d thomas-schweich commited on 1 day ago
Push metrics to HF at eval intervals, add dashboard HF sync 86ec60c thomas-schweich commited on 1 day ago
Remove hardcoded IP from monitor script, resolve SSH via runpodctl 660f2d0 thomas-schweich commited on 1 day ago
Remove .item() CUDA sync from hot path, batch size 512, run slugs fc9d7f7 thomas-schweich commited on 1 day ago
Add post-training evals, /dev/shm checkpoints, async HF push, and _orig_mod fix 87b2fa6 thomas-schweich commited on 1 day ago
Safetensors migration, checkpoint integrity, and multi-model training. (#1) 230508d unverified thomas-schweich commited on 1 day ago
Add attributions and inline links for open source projects and academic publications e27101c thomas-schweich Claude Opus 4.6 (1M context) commited on 2 days ago
Add git version baking in Dockerfile, progress monitoring, and probe eval script 5fbb1fb thomas-schweich Claude Opus 4.6 (1M context) commited on 2 days ago
Add Docker BYOC deployment, discard-ply-limit ablation, dashboard, and deploy tooling a188746 thomas-schweich Claude Opus 4.6 (1M context) commited on 4 days ago