ππ‘π New Research Alert - ICCV 2025 (Oral)! ππͺπ π Title: LoftUp: Learning a Coordinate-based Feature Upsampler for Vision Foundation Models π
π Description: LoftUp is a coordinate-based transformer that upscales the low-resolution features of VFMs (e.g. DINOv2 and CLIP) using cross-attention and self-distilled pseudo-ground truth (pseudo-GT) from SAM.
π₯ Authors: Haiwen Huang, Anpei Chen, Volodymyr Havrylov, Andreas Geiger, and Dan Zhang
π Description: The HeLlO framework is a new corpus distillation method that removes the need for large soft labels. It uses a lightweight, online image-to-label projector based on CLIP. This projector has been adapted using LoRA-style, parameter-efficient tuning. It has also been initialized with text embeddings.
ππ€π New Research Alert - ICCV 2025 (Oral)! ππ€π π Title: Variance-based Pruning for Accelerating and Compressing Trained Networks π
π Description: The one-shot pruning method efficiently compresses networks, reducing computation and memory usage while retaining almost full performance and requiring minimal fine-tuning.
π₯ Authors: Uranik Berisha, Jens Mehnert, and Alexandru Paul Condurache