Small Vision-Language Models are Smart Compressors for Long Video Understanding Paper โข 2604.08120 โข Published 7 days ago โข 20