SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published 2 days ago • 94
laion/CLIP-ViT-L-14-laion2B-s32B-b82K Zero-Shot Image Classification • 0.4B • Updated Jan 16, 2024 • 492k • 62