Repo for the paper "Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability".
Qihan Ren
jasonrqh
AI & ML interests
explainable AI, LLM
Recent Activity
upvoted a paper about 4 hours ago
OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories
upvoted a paper about 4 hours ago
ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration
upvoted a paper 21 days ago
Seedance 2.0: Advancing Video Generation for World Complexity