Spaces: qtzx06/0x960

0x960 / docs

Commit History

docs: strengthen Chess960 thesis — why it's the right self-improvement benchmark

7b15ef1

qtzx06 committed on Mar 8

docs: expand architecture doc with full search stack and training pipeline details

7109aa9

qtzx06 committed on Mar 8

docs: rewrite demo script with concrete before/after metrics and full results

5a8e942

qtzx06 committed on Mar 8

feat: add thesis section + Codex agent swarm narrative + 9B scaling probe + rewrite process log

4ed9a84

qtzx06 committed on Mar 8

docs: sharpen demo script with concrete Elo gains and before/after metrics

219232e

qtzx06 committed on Mar 8

docs: add GRPO deep-dive — environment-grounded RL over bounded tool use

55b59f4

qtzx06 committed on Mar 8

feat: finalize swarm tooling and submission artifacts

eac9d9f

qtzx06 committed on Mar 8

docs: log QLoRA debugging and successful training start

12532fa

qtzx06 committed on Mar 8

feat: rewrite training to use TRL rollout_func + OpenEnv multi-turn pattern

93f58fd

qtzx06 committed on Mar 8

docs: log Qwen 3.5 9B inference test on H100 (reward=0.25)

8da9024

qtzx06 committed on Mar 8

feat: fix openenv 0.2.1 API, add deployment files and GRPO training

ea3bbb3

qtzx06 committed on Mar 8

docs: add process log and agent instruction

67b703b

qtzx06 committed on Mar 8

docs: add chess960 background and demo notes

4199b3c

qtzx06 committed on Mar 8

docs: scope hackathon mvp

f8ef003

qtzx06 committed on Mar 8