arxiv:2504.00502
zuijiang
AI & ML interests
None yet
Recent Activity
upvoted a paper about 15 hours ago
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
upvoted a paper 29 days ago
The Reasoning-Creativity Trade-off: Toward Creativity-Driven Problem Solving