ResearchMath-14K: Scaling Research-Level Mathematics via Agents Paper • 2605.28003 • Published 1 day ago • 36
DCAgent3/medagentbench_nemotron_100000_opt100k__Qwen3_8B_20260521_085621 Viewer • Updated 7 days ago • 1.05k • 30 • 1
Solve the Loop: Attractor Models for Language and Reasoning Paper • 2605.12466 • Published 17 days ago • 6
Rethinking State Tracking in Recurrent Models Through Error Control Dynamics Paper • 2605.07755 • Published 21 days ago • 23
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • 22.7M • Updated Mar 6, 2025 • 263M • 4.85k
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 630
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published Mar 30 • 342
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 352