Commit History

GRPO trained LoRA model based on unsloth/Qwen3-4B (Trained with Unsloth)
b8d51bd
verified

thejaminator commited on

GRPO trained LoRA model based on unsloth/Qwen3-4B (Trained with Unsloth)
99d21f0
verified

thejaminator commited on

Upload README.md with huggingface_hub
a8ac82f
verified

thejaminator commited on

initial commit
f3afb85
verified

thejaminator commited on