Commit History

GRPO trained LoRA model based on unsloth/Qwen3-4B (Trained with Unsloth)
302f519
verified

thejaminator commited on

GRPO trained LoRA model based on unsloth/Qwen3-4B (Trained with Unsloth)
f65b02a
verified

thejaminator commited on

Upload README.md with huggingface_hub
08d77fe
verified

thejaminator commited on

initial commit
63891c9
verified

thejaminator commited on