Commit History

GRPO trained LoRA model based on unsloth/Qwen3-4B (Trained with Unsloth)
6f6ae9b
verified

thejaminator commited on

GRPO trained LoRA model based on unsloth/Qwen3-4B (Trained with Unsloth)
ce3dd2d
verified

thejaminator commited on

Upload README.md with huggingface_hub
1ff38da
verified

thejaminator commited on

initial commit
2694321
verified

thejaminator commited on