Understanding the Behaviors of Environment-aware Information Retrieval Paper • 2606.16817 • Published 11 days ago • 9
FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving Paper • 2502.20238 • Published Feb 27, 2025 • 23
From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models Paper • 2403.12027 • Published Mar 18, 2024 • 1
GeoPQA: Bridging the Visual Perception Gap in MLLMs for Geometric Reasoning Paper • 2509.17437 • Published Sep 22, 2025 • 17
Scaling Environments for LLM Agents in the Era of Learning from Interaction: A Survey Paper • 2511.09586 • Published Nov 12, 2025 • 2
SeaLLMs-Audio: Large Audio-Language Models for Southeast Asia Paper • 2511.01670 • Published Nov 3, 2025
Debate-to-Write: A Persona-Driven Multi-Agent Framework for Diverse Argument Generation Paper • 2406.19643 • Published Jan 3, 2025
Understanding the Behaviors of Environment-aware Information Retrieval Paper • 2606.16817 • Published 11 days ago • 9
Understanding the Behaviors of Environment-aware Information Retrieval Paper • 2606.16817 • Published 11 days ago • 9
ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up? Paper • 2311.16989 • Published Nov 28, 2023
Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework Paper • 2305.03268 • Published May 5, 2023 • 3
Retrieving Multimodal Information for Augmented Generation: A Survey Paper • 2303.10868 • Published Mar 20, 2023
How Much are LLMs Contaminated? A Comprehensive Survey and the LLMSanitize Library Paper • 2404.00699 • Published Mar 31, 2024
Can We Further Elicit Reasoning in LLMs? Critic-Guided Planning with Retrieval-Augmentation for Solving Challenging Tasks Paper • 2410.01428 • Published Oct 2, 2024 • 1
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs Paper • 2504.00993 • Published Apr 1, 2025 • 3
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe Paper • 2511.16334 • Published Nov 20, 2025 • 96
LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling Paper • 2511.20785 • Published Nov 25, 2025 • 188
MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification Paper • 2603.15726 • Published Mar 16 • 187
In-Context Reinforcement Learning for Tool Use in Large Language Models Paper • 2603.08068 • Published Mar 9 • 43