Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2601.16206

about 6 hours ago

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 69
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published Feb 9, 2025 • 38
MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20, 2025 • 194
SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20, 2025 • 100

about 4 hours ago

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

Paper • 2601.14724 • Published 2 days ago • 52
LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published about 22 hours ago • 43

about 4 hours ago

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

Paper • 2601.10527 • Published 8 days ago • 23
PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution

Paper • 2601.10657 • Published 8 days ago • 19
TranslateGemma Technical Report

Paper • 2601.09012 • Published 10 days ago • 19
Recursive Language Models

Paper • 2512.24601 • Published 24 days ago • 73

about 4 hours ago

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published about 22 hours ago • 43

about 4 hours ago

Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models

Paper • 2601.08955 • Published 10 days ago • 13
EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines

Paper • 2601.09465 • Published 9 days ago • 40
MAXS: Meta-Adaptive Exploration with LLM Agents

Paper • 2601.09259 • Published 9 days ago • 92
Toward Efficient Agents: Memory, Tool learning, and Planning

Paper • 2601.14192 • Published 3 days ago • 44

about 6 hours ago

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 69
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published Feb 9, 2025 • 38
MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20, 2025 • 194
SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20, 2025 • 100

about 4 hours ago

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published about 22 hours ago • 43

about 4 hours ago

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

Paper • 2601.14724 • Published 2 days ago • 52
LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published about 22 hours ago • 43

about 4 hours ago

Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models

Paper • 2601.08955 • Published 10 days ago • 13
EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines

Paper • 2601.09465 • Published 9 days ago • 40
MAXS: Meta-Adaptive Exploration with LLM Agents

Paper • 2601.09259 • Published 9 days ago • 92
Toward Efficient Agents: Memory, Tool learning, and Planning

Paper • 2601.14192 • Published 3 days ago • 44

about 4 hours ago

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

Paper • 2601.10527 • Published 8 days ago • 23
PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution

Paper • 2601.10657 • Published 8 days ago • 19
TranslateGemma Technical Report

Paper • 2601.09012 • Published 10 days ago • 19
Recursive Language Models

Paper • 2512.24601 • Published 24 days ago • 73

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs