GRPO RL model
SunJack
AI & ML interests
None yet
Models (14)
SunJack/Qwen2.5-3B-R1-GGUF
3B • Updated • 11
SunJack/Qwen2.5-3B-R1
Updated • 6
SunJack/Phi-4-R1
Updated
SunJack/Phi-4-R1-GGUF
Updated
SunJack/Qwen2.5-7b-sft
Updated • 2
SunJack/phi4-o1
15B • Updated • 36
SunJack/Qwen2.5-3B-GRPO_lora
Updated
SunJack/qwen2.5-7b-o1
8B • Updated • 29 • 1
SunJack/qwen2.5-7b-cve
8B • Updated • 31 • 1
SunJack/qwen2-7b-ruozhiba-finetuning
8B • Updated • 29 • 2