ymh233
ymh233
AI & ML interests
None yet
Recent Activity
upvoted a paper 30 days ago
Improving Data and Reward Design for Scientific Reasoning in Large Language Models upvoted a paper 30 days ago
TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents upvoted a paper about 1 month ago
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration