arxiv:2506.13502
Zhikun Xu
JerrrrryKun
AI & ML interests
None yet
Organizations
None yet
models 25
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-1epoch500steps
Text Generation • 8B • Updated
• 5
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-300steps
Text Generation • 8B • Updated
• 2
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-200steps
Text Generation • 8B • Updated
• 5
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-100steps
Text Generation • 8B • Updated
• 1
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-1-1430steps
Text Generation • 8B • Updated
• 1
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-ispass-400steps
Text Generation • 8B • Updated
• 2
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-200steps
Text Generation • 8B • Updated
• 2
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-200steps
Text Generation • 8B • Updated
• 2
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-100steps
Text Generation • 8B • Updated
• 2
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-100steps
Text Generation • 8B • Updated
• 2
datasets 0
None public yet