From 2D Grids to 1D Tokens: Reforming Shared Representations for Multimodal Image Fusion Paper • 2606.12303 • Published 4 days ago • 12
VIA-SD: Verification via Intra-Model Routing for Speculative Decoding Paper • 2606.12243 • Published 4 days ago • 14
RefineAnything: Multimodal Region-Specific Refinement for Perfect Local Details Paper • 2604.06870 • Published Apr 8 • 43
SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization Paper • 2505.12346 • Published May 18, 2025 • 19
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing Paper • 2502.17258 • Published Feb 24, 2025 • 79