audeering/wav2vec2-large-robust-6-ft-age-gender • Audio Classification • 90.8M • Updated Nov 27, 2023 • 5.03k • 6
FaceLLM • Collection • A multimodal large language model trained specifically for facial image understanding. Project page: https://www.idiap.ch/paper/facellm • 3 items • Updated Jul 23, 2025 • 4
DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control • Paper • 2410.13830 • Published Oct 17, 2024 • 26