DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning Paper • 2511.22570 • Published Nov 27, 2025 • 90
lixiaochuan2020/octothinker_reproduce_llama3.2_1b_stable_stage_10B_decay_stage_short_1B Text Generation • 1B • Updated Nov 7, 2025
R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth? Paper • 2510.08189 • Published Oct 9, 2025 • 27
OpenCUA: Open Foundations for Computer-Use Agents Paper • 2508.09123 • Published Aug 12, 2025 • 31
OpenCUA: Open Foundations for Computer-Use Agents Collection These are the official versions of the OpenCUA models and AgentNet datasets. Website: https://opencua.xlang.ai/ • 8 items • Updated Dec 1, 2025 • 23
DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research Paper • 2505.19253 • Published May 25, 2025 • 32
Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning Paper • 2410.14208 • Published Oct 18, 2024 • 3
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis Paper • 2505.13227 • Published May 19, 2025 • 45