AI & ML interests
None yet
Organizations
models
33
jyc0325/Qwen2.5-1.5B-Open-R1-Code-GRPO
Text Generation
•
2B
•
Updated
•
1
jyc0325/Qwen2.5-1.5B-Instruct-gccpSFT-GRPO
Text Generation
•
2B
•
Updated
•
2
jyc0325/Qwen2.5-1.5B-Instruct-gccpSFT
Text Generation
•
2B
•
Updated
•
3
jyc0325/Qwen2.5-7B-Instruct-SFT
Text Generation
•
8B
•
Updated
•
9
jyc0325/Qwen2.5-1.5B-Open-R1-Code-GRPOv2
Text Generation
•
2B
•
Updated
•
2
jyc0325/Qwen2.5-1.5B-SFT-ORPO
Text Generation
•
2B
•
Updated
•
3
jyc0325/Qwen2.5-1.5B-DPO-SFT-code
Text Generation
•
2B
•
Updated
•
2
jyc0325/Qwen2.5-1.5B-SFT-v1
Text Generation
•
2B
•
Updated
•
3
jyc0325/Qwen2.5-1.5B-ORPO-code-hard
Text Generation
•
2B
•
Updated
•
3
jyc0325/Qwen2.5-1.5B-DPO-code-hard
Text Generation
•
2B
•
Updated
•
2
datasets
10
Viewer
•
Updated
•
35.7k
•
3
Viewer
•
Updated
•
35.7k
•
4
jyc0325/vcpp-pref-hard-pairs
Viewer
•
Updated
•
26.9k
•
3
jyc0325/vcpp-pref-code-only
Viewer
•
Updated
•
32.9k
•
3
jyc0325/vezora-pref-code-only
Viewer
•
Updated
•
52.9k
•
3
jyc0325/vezora-pref-clean
Viewer
•
Updated
•
54k
•
1
jyc0325/verifiable-coding-problems-python-pref
Viewer
•
Updated
•
32.9k
•
4
jyc0325/Code-Preference-Pairs
Viewer
•
Updated
•
54k
•
2
jyc0325/Mixture-of-Thoughts-code-8k
Viewer
•
Updated
•
25.2k
•
7
jyc0325/python_decontaminated_OpenR1-Math-220k
Preview
•
Updated
•
1