cheongmyeong17/Qwen2.5-MATH-7B-base_model-CVAPO-EP5-LR1e06 • Text Generation • 8B • Updated Nov 20, 2025 • 9
cheongmyeong17/DeepSeek-Distill-Qwen-7B-CVAPO-EP5-LR1e06-G4 • Text Generation • 8B • Updated Nov 19, 2025 • 2
cheongmyeong17/Qwen2.5-MATH-7B-base_model-DRGRPO-EP3-G8 • Text Generation • 8B • Updated Nov 19, 2025 • 3
cheongmyeong17/DeepSeek-R1-Distill-Qwen7B-RLOO-G16-Best • Text Generation • 8B • Updated Jul 30, 2025 • 1
cheongmyeong17/DeepSeek-R1-Distill-Qwen7B-GRPO-G16-Best • Text Generation • 8B • Updated Jul 30, 2025 • 3
cheongmyeong17/DeepSeek-R1-Distill-Qwen-7B-MATH345-RLOO-G16 • Text Generation • 8B • Updated Jul 30, 2025 • 2
cheongmyeong17/DeepSeek-R1-Distill-Qwen-7B-MATH345-GRPO-G16 • Text Generation • 8B • Updated Jul 30, 2025 • 2