When the Tool Decides: LLM Agents Defer Blindly to Graph Neural Network Tools, and Stronger Backbones Defer More • Paper • 2606.14476 • Published 7 days ago
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification • Paper • 2508.05629 • Published Aug 7, 2025 • 190
sentence-transformers/all-MiniLM-L6-v2 • Sentence Similarity • 22.7M • Updated 17 days ago • 216M • 4.97k