zhangchenxu/BV-Qwen2.5-Math-7B-deepmath_pw_lv-TinyV-1.5B-addon-2_step468 8B • Updated Jul 29, 2025 • 10
view article Article Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty? 16 days ago • 12
view article Article Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty? 16 days ago • 12
ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents Paper • 2601.12294 • Published Jan 18 • 19
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 Text Generation • 32B • Updated 20 days ago • 956k • 667
Efficient Long-context Language Model Training by Core Attention Disaggregation Paper • 2510.18121 • Published Oct 20, 2025 • 123
Building a Foundational Guardrail for General Agentic Systems via Synthetic Data Paper • 2510.09781 • Published Oct 10, 2025 • 27