QuantHive / mate_training.ipynb

Commit History

Delete compiled cache + re-inject Unsloth attrs for Kaggle/Colab compat
48b2f2f

ARKAISW commited on

Bypass Unsloth GRPO compilation - fix SymFloat crash on Colab/Kaggle
96fac64

ARKAISW commited on

Inject unsloth_logit_chunk_multiplier into GRPOConfig
6fe98d9

ARKAISW commited on

Inject all expected Unsloth GRPO args
707a332

ARKAISW commited on

Inject unsloth_num_chunks into GRPOConfig
da2eb01

ARKAISW commited on

Remove unsupported kwargs from GRPOConfig
2f2fb3e

ARKAISW commited on

Disable PatchFastRL to fix Colab OSError
ba90699

ARKAISW commited on

Fix openenv dependency name in notebook
2ed1d89

ARKAISW commited on

Fix notebook dependencies and CUDA attribute
b22b8d5

ARKAISW commited on

Add Colab GRPO training notebook
117a7c7

ARKAISW commited on

Add pyarrow to notebook dependencies to fix colab mismatch
a90f241

ARKAISW commited on

Fix openenv missing from notebook
7331420

ARKAISW commited on

Update latest changes
aec0295

ARKAISW commited on

Update training notebook and verifiers
a3c00eb

ARKAISW commited on

Hackathon Final Submission: PettingZoo multi-agent arch, GRPO training, docs
9cb3002

ARKAISW commited on