0x960 / docs

Commit History

docs: strengthen Chess960 thesis — why it's the right self-improvement benchmark
7b15ef1

qtzx06 commited on

docs: expand architecture doc with full search stack and training pipeline details
7109aa9

qtzx06 commited on

docs: rewrite demo script with concrete before/after metrics and full results
5a8e942

qtzx06 commited on

feat: add thesis section + Codex agent swarm narrative + 9B scaling probe + rewrite process log
4ed9a84

qtzx06 commited on

docs: sharpen demo script with concrete Elo gains and before/after metrics
219232e

qtzx06 commited on

docs: add GRPO deep-dive — environment-grounded RL over bounded tool use
55b59f4

qtzx06 commited on

feat: finalize swarm tooling and submission artifacts
eac9d9f

qtzx06 commited on

docs: log QLoRA debugging and successful training start
12532fa

qtzx06 commited on

feat: rewrite training to use TRL rollout_func + OpenEnv multi-turn pattern
93f58fd

qtzx06 commited on

docs: log Qwen 3.5 9B inference test on H100 (reward=0.25)
8da9024

qtzx06 commited on

feat: fix openenv 0.2.1 API, add deployment files and GRPO training
ea3bbb3

qtzx06 commited on

docs: add process log and agent instruction
67b703b

qtzx06 commited on

docs: add chess960 background and demo notes
4199b3c

qtzx06 commited on

docs: scope hackathon mvp
f8ef003

qtzx06 commited on