Benchmarks Saturate When The Model Gets Smarter Than The Judge Paper • 2601.19532 • Published 2 days ago • 2
HyperAlign: Hypernetwork for Efficient Test-Time Alignment of Diffusion Models Paper • 2601.15968 • Published 7 days ago • 5
Towards Pixel-Level VLM Perception via Simple Points Prediction Paper • 2601.19228 • Published 2 days ago • 12
Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models Paper • 2601.19834 • Published 2 days ago • 23
Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision Paper • 2601.19798 • Published 2 days ago • 29
AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning Paper • 2601.18631 • Published 3 days ago • 45
Yunjue Agent Tech Report: A Fully Reproducible, Zero-Start In-Situ Self-Evolving Agent System for Open-Ended Tasks Paper • 2601.18226 • Published 3 days ago • 5
Diffusion In Diffusion: Reclaiming Global Coherence in Semi-Autoregressive Diffusion Paper • 2601.13599 • Published 10 days ago • 6
A Mechanistic View on Video Generation as World Models: State and Dynamics Paper • 2601.17067 • Published 7 days ago • 8
AR-Omni: A Unified Autoregressive Model for Any-to-Any Generation Paper • 2601.17761 • Published 4 days ago • 11
DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints Paper • 2601.18137 • Published 4 days ago • 19
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability Paper • 2601.18778 • Published 3 days ago • 31
Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers Paper • 2601.17367 • Published 5 days ago • 31
Scientific Image Synthesis: Benchmarking, Methodologies, and Downstream Utility Paper • 2601.17027 • Published 12 days ago • 37