Commit History

GRPO trained LoRA model based on unsloth/Qwen3-4B (Trained with Unsloth)
2f3dd40
verified

thejaminator commited on

GRPO trained LoRA model based on unsloth/Qwen3-4B (Trained with Unsloth)
8c51183
verified

thejaminator commited on

Upload README.md with huggingface_hub
587bcfd
verified

thejaminator commited on

initial commit
21d488c
verified

thejaminator commited on