DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published 16 days ago • 125
MemoBrain: Executive Memory as an Agentic Brain for Reasoning Paper • 2601.08079 • Published 17 days ago • 37
Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text Paper • 2601.10355 • Published 15 days ago • 39
ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking Paper • 2601.06487 • Published 20 days ago • 52
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking Paper • 2601.04720 • Published 22 days ago • 51
Nested Learning: The Illusion of Deep Learning Architectures Paper • 2512.24695 • Published about 1 month ago • 42
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning Paper • 2601.06002 • Published 21 days ago • 51
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization Paper • 2512.24615 • Published about 1 month ago • 119
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 22 days ago • 211
EAGLE3 Collection The collection of eagle3 series models for Qwen3 and Hunyuan. • 15 items • Updated 17 days ago • 3
view post Post 2847 AgentCPM-Explore🔥 on device agent foundation model released by OpenBMB openbmb/AgentCPM-Explore✨ 4B - Apache2.0✨ Supports 100+ multi-turn environment interactions with search + verification✨ Full training/inference stack is openly shared as well See translation 🚀 8 8 ❤️ 2 2 + Reply