TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders Paper • 2604.07340 • Published 2 days ago • 9
How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing Paper • 2602.01851 • Published Feb 2 • 16