Unified Generation and Self-Verification for Vision-Language Models via Advantage Decoupled Preference Optimization Paper • 2601.01483 • Published 9 days ago • 1
FlexSelect: Flexible Token Selection for Efficient Long Video Understanding Paper • 2506.00993 • Published Jun 1, 2025
Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks Paper • 2505.16901 • Published May 22, 2025 • 48
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing Paper • 2502.17258 • Published Feb 24, 2025 • 79
FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention Paper • 2407.19918 • Published Jul 29, 2024 • 51