arxiv:2606.14476
zhongyuan wang
3dk
AI & ML interests
None yet
Recent Activity
authored a paper 3 days ago
When the Tool Decides: LLM Agents Defer Blindly to Graph Neural Network Tools, and Stronger Backbones Defer More
upvoted a paper 10 months ago
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
liked a model over 2 years ago
sentence-transformers/all-MiniLM-L6-v2
Organizations
None yet