AI & ML interests
None yet
Organizations
None yet
models
11
Blancy/Qwen3-1.7B-Open-R1-Code-GRPO
Text Generation
•
2B
•
Updated
•
1
Blancy/Qwen3-0.6B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
5
Blancy/Qwen3-0.6B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
Blancy/Qwen3-1.7B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
•
6
•
•
2
Blancy/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Text Generation
•
2B
•
Updated
•
3
Blancy/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
•
2
•
1
Blancy/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
3
Blancy/Qwen-2.5-7B-Simple-RL
Text Generation
•
8B
•
Updated
•
4
•
1
Blancy/Qwen2.5-1.5B-Open-R1-Code-GRPO
Text Generation
•
2B
•
Updated
Blancy/DeepSeek-R1-Distill-Qwen-0.5B-GRPO
Text Generation
•
0.6B
•
Updated
•
61
datasets
43
Blancy/1ktestfrom10kwithdifficultyclasses_selfguided
Viewer
•
Updated
•
1k
•
3
Blancy/verifiable-coding-problems-SFT
Viewer
•
Updated
•
1.09k
•
7
Blancy/verifiable-coding-problems-CoT
Viewer
•
Updated
•
1.09k
•
9
Blancy/verifiable-coding-problems-python-filtered
Viewer
•
Updated
•
2k
•
6
Blancy/OpenThoughts-114k-Code_fit_code_reward
Viewer
•
Updated
•
1k
•
17
Blancy/OpenThoughts-114k-Code_oj_format
Viewer
•
Updated
•
1k
•
4
Blancy/OpenThoughts-114k-Code_decontaminated_final_verinfo
Viewer
•
Updated
•
1k
•
4
Blancy/OpenThoughts-114k-Code_decontaminated_final
Viewer
•
Updated
•
1k
•
3
Blancy/OpenThoughts-114k-Code_decontaminated_3000to5000_problem_leq400
Viewer
•
Updated
•
1.82k
•
6
Blancy/OpenThoughts-114k-Code_decontaminated_3000to5000
Viewer
•
Updated
•
2.92k
•
3