[CVPRF'26] AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting
by Yuyuan Liu, Yuanhong Chen, Chong Wang, Junlin Han, Junde Wu, Can Peng, Jingkun Chen, Yu Tian and Gustavo Carneiro
Please install the dependencies and prepare the dataset by following this installation document.
Please follow this instruction document to reproduce our results.
Please consider citing our work in your publications if it helps your research.
@article{liu2025auralsam2,
  title={AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting},
  author={Liu, Yuyuan and Chen, Yuanhong and Wang, Chong and Han, Junlin and Wu, Junde and Peng, Can and Chen, Jingkun and Tian, Yu and Carneiro, Gustavo},
  journal={arXiv preprint arXiv:2506.01015},
  year={2025}
}