QuantHive / training /train_grpo_multiagent.py

Commit History

Update training notebook and verifiers
a3c00eb

ARKAISW commited on

fix(notebook): correct clone step order, extract prompt utils, fix github url
30a586b

ARKAISW commited on

Hackathon Final Submission: PettingZoo multi-agent arch, GRPO training, docs
9cb3002

ARKAISW commited on