ToolSense: A Diagnostic Framework for Auditing Parametric Tool Knowledge in LLMs Paper • 2606.12451 • Published 23 days ago • 2
ToolSense: A Diagnostic Framework for Auditing Parametric Tool Knowledge in LLMs Paper • 2606.12451 • Published 23 days ago • 2
CoHyDE: Iterative Co-Training of LLM Rewriter & Dense Encoder for Tool Retrieval Paper • 2605.29271 • Published about 1 month ago • 9
CoHyDE: Iterative Co-Training of LLM Rewriter & Dense Encoder for Tool Retrieval Paper • 2605.29271 • Published about 1 month ago • 9
MirrorBench: An Extensible Framework to Evaluate User-Proxy Agents for Human-Likeness Paper • 2601.08118 • Published Jan 13 • 2
MirrorBench: An Extensible Framework to Evaluate User-Proxy Agents for Human-Likeness Paper • 2601.08118 • Published Jan 13 • 2
Disambiguation-Centric Finetuning Makes Enterprise Tool-Calling LLMs More Realistic and Less Risky Paper • 2507.03336 • Published Jul 4, 2025 • 7