Collections
Collections including paper arxiv:2601.16206
- A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5
  Paper • 2601.10527 • Published • 23
- PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution
  Paper • 2601.10657 • Published • 19
- TranslateGemma Technical Report
  Paper • 2601.09012 • Published • 19
- Recursive Language Models
  Paper • 2512.24601 • Published • 73

- Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models
  Paper • 2601.08955 • Published • 13
- EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines
  Paper • 2601.09465 • Published • 40
- MAXS: Meta-Adaptive Exploration with LLM Agents
  Paper • 2601.09259 • Published • 92
- Toward Efficient Agents: Memory, Tool learning, and Planning
  Paper • 2601.14192 • Published • 44

- Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
  Paper • 2411.03562 • Published • 69
- Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
  Paper • 2502.06060 • Published • 38
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents
  Paper • 2502.14499 • Published • 194
- SurveyX: Academic Survey Automation via Large Language Models
  Paper • 2502.14776 • Published • 100