Spaces:

ARKAISW
/

QuantHive

Running

App Files Files Community

QuantHive / mate_training.ipynb

Commit History

Delete compiled cache + re-inject Unsloth attrs for Kaggle/Colab compat

48b2f2f

ARKAISW commited on Apr 25

Bypass Unsloth GRPO compilation - fix SymFloat crash on Colab/Kaggle

96fac64

ARKAISW commited on Apr 25

Inject unsloth_logit_chunk_multiplier into GRPOConfig

6fe98d9

ARKAISW commited on Apr 25

Inject all expected Unsloth GRPO args

707a332

ARKAISW commited on Apr 25

Inject unsloth_num_chunks into GRPOConfig

da2eb01

ARKAISW commited on Apr 25

Remove unsupported kwargs from GRPOConfig

2f2fb3e

ARKAISW commited on Apr 25

Disable PatchFastRL to fix Colab OSError

ba90699

ARKAISW commited on Apr 25

Fix openenv dependency name in notebook

2ed1d89

ARKAISW commited on Apr 25

Fix notebook dependencies and CUDA attribute

b22b8d5

ARKAISW commited on Apr 25

Add Colab GRPO training notebook

117a7c7

ARKAISW commited on Apr 25

Add pyarrow to notebook dependencies to fix colab mismatch

a90f241

ARKAISW commited on Apr 25

Fix openenv missing from notebook

7331420

ARKAISW commited on Apr 25

Update latest changes

aec0295

ARKAISW commited on Apr 25

Update training notebook and verifiers

a3c00eb

ARKAISW commited on Apr 25

Hackathon Final Submission: PettingZoo multi-agent arch, GRPO training, docs

9cb3002

ARKAISW commited on Apr 25

Commit History

Delete compiled cache + re-inject Unsloth attrs for Kaggle/Colab compat 48b2f2f

Bypass Unsloth GRPO compilation - fix SymFloat crash on Colab/Kaggle 96fac64

Inject unsloth_logit_chunk_multiplier into GRPOConfig 6fe98d9

Inject all expected Unsloth GRPO args 707a332

Inject unsloth_num_chunks into GRPOConfig da2eb01

Remove unsupported kwargs from GRPOConfig 2f2fb3e

Disable PatchFastRL to fix Colab OSError ba90699

Fix openenv dependency name in notebook 2ed1d89

Fix notebook dependencies and CUDA attribute b22b8d5

Add Colab GRPO training notebook 117a7c7

Add pyarrow to notebook dependencies to fix colab mismatch a90f241

Fix openenv missing from notebook 7331420

Update latest changes aec0295

Update training notebook and verifiers a3c00eb

Hackathon Final Submission: PettingZoo multi-agent arch, GRPO training, docs 9cb3002

Delete compiled cache + re-inject Unsloth attrs for Kaggle/Colab compat

48b2f2f

Bypass Unsloth GRPO compilation - fix SymFloat crash on Colab/Kaggle

96fac64

Inject unsloth_logit_chunk_multiplier into GRPOConfig

6fe98d9

Inject all expected Unsloth GRPO args

707a332

Inject unsloth_num_chunks into GRPOConfig

da2eb01

Remove unsupported kwargs from GRPOConfig

2f2fb3e

Disable PatchFastRL to fix Colab OSError

ba90699

Fix openenv dependency name in notebook

2ed1d89

Fix notebook dependencies and CUDA attribute

b22b8d5

Add Colab GRPO training notebook

117a7c7

Add pyarrow to notebook dependencies to fix colab mismatch

a90f241

Fix openenv missing from notebook

7331420

Update latest changes

aec0295

Update training notebook and verifiers

a3c00eb

Hackathon Final Submission: PettingZoo multi-agent arch, GRPO training, docs

9cb3002