Enhancing Language Multi-Agent Learning with Multi-Agent Credit Re-Assignment for Interactive Environment Generalization Paper • 2502.14496 • Published Feb 20, 2025
MedEBench: Revisiting Text-instructed Image Editing on Medical Domain Paper • 2506.01921 • Published Jun 2, 2025
CultureCLIP: Empowering CLIP with Cultural Awareness through Synthetic Images and Contextualized Captions Paper • 2507.06210 • Published Jul 8, 2025 • 1
MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence Calibration Paper • 2505.23224 • Published May 29, 2025 • 1
MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness Paper • 2504.21773 • Published Apr 30, 2025 • 1
Mathematical Proof as a Litmus Test: Revealing Failure Modes of Advanced Large Reasoning Models Paper • 2506.17114 • Published Jun 20, 2025
MARS-SQL: A multi-agent reinforcement learning framework for Text-to-SQL Paper • 2511.01008 • Published Nov 2, 2025
Dancing in Chains: Strategic Persuasion in Academic Rebuttal via Theory of Mind Paper • 2601.15715 • Published 6 days ago • 13
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models Paper • 2406.10890 • Published Jun 16, 2024 • 1