In-Context Reinforcement Learning for Tool Use in Large Language Models Paper • 2603.08068 • Published 4 days ago • 17
In-Context Reinforcement Learning for Tool Use in Large Language Models Paper • 2603.08068 • Published 4 days ago • 17
MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier Paper • 2603.03756 • Published 9 days ago • 86
From Perception to Action: An Interactive Benchmark for Vision Reasoning Paper • 2602.21015 • Published 16 days ago • 23
From Perception to Action: An Interactive Benchmark for Vision Reasoning Paper • 2602.21015 • Published 16 days ago • 23
AQE: Argument Quadruplet Extraction via a Quad-Tagging Augmented Generative Approach Paper • 2305.19902 • Published May 31, 2023
GlobalWoZ: Globalizing MultiWoZ to Develop Multilingual Task-Oriented Dialogue Systems Paper • 2110.07679 • Published Oct 14, 2021
UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning Paper • 2510.13515 • Published Oct 15, 2025 • 12
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published Nov 14, 2025 • 190
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe Paper • 2511.16334 • Published Nov 20, 2025 • 93
LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling Paper • 2511.20785 • Published Nov 25, 2025 • 187
Large Language Models Do NOT Really Know What They Don't Know Paper • 2510.09033 • Published Oct 10, 2025 • 17
First Try Matters: Revisiting the Role of Reflection in Reasoning Models Paper • 2510.08308 • Published Oct 9, 2025 • 24
Scaling Language-Centric Omnimodal Representation Learning Paper • 2510.11693 • Published Oct 13, 2025 • 104
Scaling Language-Centric Omnimodal Representation Learning Paper • 2510.11693 • Published Oct 13, 2025 • 104
Scaling Language-Centric Omnimodal Representation Learning Paper • 2510.11693 • Published Oct 13, 2025 • 104
MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use Paper • 2509.24002 • Published Sep 28, 2025 • 176