Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
David Cho
jyc0325
Follow
AmberYifan's profile picture
1 follower
·
1 following
AI & ML interests
None yet
Organizations
jyc0325
's models
33
Sort: Recently updated
jyc0325/Qwen2.5-1.5B-Open-R1-Code-GRPO
Text Generation
•
2B
•
Updated
Jul 13
•
13
jyc0325/Qwen2.5-1.5B-Instruct-gccpSFT-GRPO
Text Generation
•
2B
•
Updated
Jul 11
•
10
jyc0325/Qwen2.5-1.5B-Instruct-gccpSFT
Text Generation
•
2B
•
Updated
Jul 11
•
7
jyc0325/Qwen2.5-7B-Instruct-SFT
Text Generation
•
8B
•
Updated
Jul 11
•
7
jyc0325/Qwen2.5-1.5B-Open-R1-Code-GRPOv2
Text Generation
•
2B
•
Updated
Jul 10
•
8
jyc0325/Qwen2.5-1.5B-SFT-ORPO
Text Generation
•
2B
•
Updated
Jul 9
•
8
jyc0325/Qwen2.5-1.5B-DPO-SFT-code
Text Generation
•
2B
•
Updated
Jul 8
•
6
jyc0325/Qwen2.5-1.5B-SFT-v1
Text Generation
•
2B
•
Updated
Jul 8
•
11
jyc0325/Qwen2.5-1.5B-ORPO-code-hard
Text Generation
•
2B
•
Updated
Jul 8
•
8
jyc0325/Qwen2.5-1.5B-DPO-code-hard
Text Generation
•
2B
•
Updated
Jul 8
•
7
jyc0325/Qwen2.5-1.5B-DPO-vcpp
Text Generation
•
2B
•
Updated
Jul 7
•
8
jyc0325/Qwen2.5-1.5B-ORPO-vcpp
Text Generation
•
2B
•
Updated
Jul 7
•
7
jyc0325/Qwen2.5-1.5B-DPO-vezora
Text Generation
•
2B
•
Updated
Jul 7
•
8
jyc0325/Qwen2.5-1.5B-ORPO-vezora
Text Generation
•
2B
•
Updated
Jul 7
•
4
jyc0325/Qwen2.5-1.5B-DPO-code-fix
Text Generation
•
2B
•
Updated
Jul 3
•
7
jyc0325/Qwen2.5-1.5B-ORPO-code-fix
Text Generation
•
2B
•
Updated
Jul 3
•
7
jyc0325/Qwen2.5-1.5B-DPO-code
Text Generation
•
2B
•
Updated
Jul 2
•
7
jyc0325/Qwen2.5-1.5B-ORPO-code
Text Generation
•
2B
•
Updated
Jul 2
•
9
jyc0325/Qwen2.5-7B-DPO-code
Updated
Jul 1
jyc0325/Qwen2.5-7B-DPO-Merged
8B
•
Updated
Jul 1
•
2
jyc0325/Qwen2.5-7B-ORPO-code-Merged
8B
•
Updated
Jul 1
•
5
jyc0325/Qwen2.5-7B-ORPO-Merged
Text Generation
•
8B
•
Updated
Jul 1
•
5
jyc0325/Qwen2.5-7B-DPO
Text Generation
•
Updated
Jul 1
•
4
jyc0325/Qwen2.5-7B-ORPO-code
Text Generation
•
Updated
Jul 1
•
8
jyc0325/Qwen2.5-7B-ORPO
Text Generation
•
Updated
Jul 1
•
7
jyc0325/Qwen2.5-1.5B-Instruct-SFT
Text Generation
•
2B
•
Updated
Jun 27
•
7
jyc0325/Qwen2.5-1.5B-Instruct-SFT-code
Text Generation
•
2B
•
Updated
Apr 19
•
6
•
1
jyc0325/llama2-7b-sft-ultrachat-hhrlhf-dpo
Text Generation
•
7B
•
Updated
Feb 19
•
9
jyc0325/Mistral-v0.1-7b-sft-ultrachat-hhrlhf-dpo
Text Generation
•
7B
•
Updated
Feb 19
•
7
jyc0325/Qwen2.5-7B-sft-ultrachat-hhrlhf-dpo
Text Generation
•
8B
•
Updated
Feb 19
•
7
Previous
1
2
Next