Reasoning over mathematical objects: on-policy reward modeling and test time aggregation Paper • 2603.18886 • Published 6 days ago • 4
LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels Paper • 2603.19312 • Published 12 days ago • 4
AutoHarness: improving LLM agents by automatically synthesizing a code harness Paper • 2603.03329 • Published Feb 10 • 2
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence Paper • 2603.13398 • Published 14 days ago • 145
SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing Paper • 2603.19228 • Published 6 days ago • 64
AndroTMem: From Interaction Trajectories to Anchored Memory in Long-Horizon GUI Agents Paper • 2603.18429 • Published 7 days ago • 24
Alignment Makes Language Models Normative, Not Descriptive Paper • 2603.17218 • Published 8 days ago • 46
Mamba-3: Improved Sequence Modeling using State Space Principles Paper • 2603.15569 • Published 9 days ago • 6
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Paper • 2603.19220 • Published 6 days ago • 58
Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training Paper • 2603.12255 • Published 13 days ago • 90
PostTrainBench: Can LLM Agents Automate LLM Post-Training? Paper • 2603.08640 • Published 16 days ago • 1
Covenant-72B: Pre-Training a 72B LLM with Trustless Peers Over-the-Internet Paper • 2603.08163 • Published 16 days ago • 5
CHMv2: Improvements in Global Canopy Height Mapping using DINOv3 Paper • 2603.06382 • Published 19 days ago • 1