AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting
Paper • 2506.01015 • Published • 1
None defined yet.
AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting
PoseDreamer: Scalable and Photorealistic Human Data Generation Pipeline with Diffusion Models