OceanGym: A Benchmark Environment for Underwater Embodied Agents Paper • 2509.26536 • Published Sep 30, 2025 • 36
Executable Knowledge Graphs for Replicating AI Research Paper • 2510.17795 • Published Oct 20, 2025 • 15
InnoGym: Benchmarking the Innovation Potential of AI Agents Paper • 2512.01822 • Published Dec 1, 2025 • 36
LightThinker++: From Reasoning Compression to Memory Management Paper • 2604.03679 • Published Apr 4 • 38
Rewarding the Scientific Process: Process-Level Reward Modeling for Agentic Data Analysis Paper • 2604.24198 • Published 9 days ago • 20
Rewarding the Scientific Process: Process-Level Reward Modeling for Agentic Data Analysis Paper • 2604.24198 • Published 9 days ago • 20
SkillX: Automatically Constructing Skill Knowledge Bases for Agents Paper • 2604.04804 • Published about 1 month ago • 33
LightThinker++: From Reasoning Compression to Memory Management Paper • 2604.03679 • Published Apr 4 • 38
LightThinker++: From Reasoning Compression to Memory Management Paper • 2604.03679 • Published Apr 4 • 38