jyc0325/Qwen2.5-1.5B-Open-R1-Code-GRPO • Text Generation • 2B • Updated • 1
jyc0325/Qwen2.5-1.5B-Instruct-gccpSFT-GRPO • Text Generation • 2B • Updated • 2
jyc0325/Qwen2.5-1.5B-Instruct-gccpSFT • Text Generation • 2B • Updated • 3
jyc0325/Qwen2.5-7B-Instruct-SFT • Text Generation • 8B • Updated • 9
jyc0325/Qwen2.5-1.5B-Open-R1-Code-GRPOv2 • Text Generation • 2B • Updated • 2
jyc0325/Qwen2.5-1.5B-SFT-ORPO • Text Generation • 2B • Updated • 3
jyc0325/Qwen2.5-1.5B-DPO-SFT-code • Text Generation • 2B • Updated • 2
jyc0325/Qwen2.5-1.5B-SFT-v1 • Text Generation • 2B • Updated • 3
jyc0325/Qwen2.5-1.5B-ORPO-code-hard • Text Generation • 2B • Updated • 3
jyc0325/Qwen2.5-1.5B-DPO-code-hard • Text Generation • 2B • Updated • 2
jyc0325/Qwen2.5-1.5B-DPO-vcpp • Text Generation • 2B • Updated • 2
jyc0325/Qwen2.5-1.5B-ORPO-vcpp • Text Generation • 2B • Updated • 2
jyc0325/Qwen2.5-1.5B-DPO-vezora • Text Generation • 2B • Updated • 1
jyc0325/Qwen2.5-1.5B-ORPO-vezora • Text Generation • 2B • Updated • 1
jyc0325/Qwen2.5-1.5B-DPO-code-fix • Text Generation • 2B • Updated • 1
jyc0325/Qwen2.5-1.5B-ORPO-code-fix • Text Generation • 2B • Updated • 2
jyc0325/Qwen2.5-1.5B-DPO-code • Text Generation • 2B • Updated • 1
jyc0325/Qwen2.5-1.5B-ORPO-code • Text Generation • 2B • Updated • 3
jyc0325/Qwen2.5-7B-DPO-code
Updated
jyc0325/Qwen2.5-7B-DPO-Merged
jyc0325/Qwen2.5-7B-ORPO-code-Merged
jyc0325/Qwen2.5-7B-ORPO-Merged
Text Generation
•
8B
•
Updated
•
1
Text Generation
•
Updated
•
1
jyc0325/Qwen2.5-7B-ORPO-code
Text Generation
•
Updated
Text Generation
•
Updated
•
1
jyc0325/Qwen2.5-1.5B-Instruct-SFT • Text Generation • 2B • Updated • 1
jyc0325/Qwen2.5-1.5B-Instruct-SFT-code • Text Generation • 2B • Updated • 2 • 1
jyc0325/llama2-7b-sft-ultrachat-hhrlhf-dpo • Text Generation • 7B • Updated • 1
jyc0325/Mistral-v0.1-7b-sft-ultrachat-hhrlhf-dpo • Text Generation • 7B • Updated • 1
jyc0325/Qwen2.5-7B-sft-ultrachat-hhrlhf-dpo • Text Generation • 8B • Updated • 1