Zhang Tiantian
Sanyichangnuanyang
ยท
AI & ML interests
Reinforcement Learning, Continual Learning
Recent Activity
upvoted a paper about 17 hours ago
Learning from the Self-future: On-policy Self-distillation for dLLMs upvoted a paper about 1 year ago
Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal
Large Language ModelsOrganizations
None yet