arxiv:2502.20545
Liu
Shiweiliuiiiiiii
AI & ML interests
LLM, pre-training, post-training, RL, efficient AI
Recent Activity
upvoted a paper about 15 hours ago
Learning from the Self-future: On-policy Self-distillation for dLLMs submitted a paper about 15 hours ago
Learning from the Self-future: On-policy Self-distillation for dLLMs upvoted a paper 7 months ago
The Path Not Taken: RLVR Provably Learns Off the PrincipalsOrganizations
None yet