QuantHive / training

Commit History

Semantic observation prompts — rich text replaces raw floats (judge feedback #1)
213c699

ARKAISW commited on

Update latest changes
aec0295

ARKAISW commited on

Update training notebook and verifiers
a3c00eb

ARKAISW commited on

fix(notebook): correct clone step order, extract prompt utils, fix github url
30a586b

ARKAISW commited on

Hackathon Final Submission: PettingZoo multi-agent arch, GRPO training, docs
9cb3002

ARKAISW commited on