FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving Paper • 2502.20238 • Published Feb 27, 2025 • 23
From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models Paper • 2403.12027 • Published Mar 18, 2024 • 1
GeoPQA: Bridging the Visual Perception Gap in MLLMs for Geometric Reasoning Paper • 2509.17437 • Published Sep 22, 2025 • 17
Scaling Environments for LLM Agents in the Era of Learning from Interaction: A Survey Paper • 2511.09586 • Published Nov 12, 2025 • 2
SeaLLMs-Audio: Large Audio-Language Models for Southeast Asia Paper • 2511.01670 • Published Nov 3, 2025
Debate-to-Write: A Persona-Driven Multi-Agent Framework for Diverse Argument Generation Paper • 2406.19643 • Published Jan 3, 2025
Understanding the Behaviors of Environment-aware Information Retrieval Paper • 2606.16817 • Published 12 days ago • 9
Understanding the Behaviors of Environment-aware Information Retrieval Paper • 2606.16817 • Published 12 days ago • 9
Understanding the Behaviors of Environment-aware Information Retrieval Paper • 2606.16817 • Published 12 days ago • 9
Agentic Fusion of Large Atomic and Language Models to Accelerate Superconductors Discovery Paper • 2604.23758 • Published Apr 29 • 7
MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification Paper • 2603.15726 • Published Mar 16 • 187
In-Context Reinforcement Learning for Tool Use in Large Language Models Paper • 2603.08068 • Published Mar 9 • 43
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published Jan 14 • 128
LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling Paper • 2511.20785 • Published Nov 25, 2025 • 188
RynnVLA-002: A Unified Vision-Language-Action and World Model Paper • 2511.17502 • Published Nov 21, 2025 • 28