InnoGym: Benchmarking the Innovation Potential of AI Agents Paper • 2512.01822 • Published 11 days ago • 33
Skywork/Skywork-Reward-Llama-3.1-8B-v0.2 Text Classification • 8B • Updated Oct 25, 2024 • 19.8k • 38
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper • 2510.18866 • Published Oct 21 • 110
Executable Knowledge Graphs for Replicating AI Research Paper • 2510.17795 • Published Oct 20 • 14
OceanGym: A Benchmark Environment for Underwater Embodied Agents Paper • 2509.26536 • Published Sep 30 • 34
Towards Personalized Deep Research: Benchmarks and Evaluations Paper • 2509.25106 • Published Sep 29 • 29