aud facebook/wav2vec2-lv-60-espeak-cv-ft Automatic Speech Recognition • Updated Oct 31, 2023 • 158k • 60 sesame/csm-1b Text-to-Speech • Updated Dec 1, 2025 • 23.8k • 2.31k
facebook/wav2vec2-lv-60-espeak-cv-ft Automatic Speech Recognition • Updated Oct 31, 2023 • 158k • 60
papers TimeMarker: A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability Paper • 2411.18211 • Published Nov 27, 2024
TimeMarker: A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability Paper • 2411.18211 • Published Nov 27, 2024
aud facebook/wav2vec2-lv-60-espeak-cv-ft Automatic Speech Recognition • Updated Oct 31, 2023 • 158k • 60 sesame/csm-1b Text-to-Speech • Updated Dec 1, 2025 • 23.8k • 2.31k
facebook/wav2vec2-lv-60-espeak-cv-ft Automatic Speech Recognition • Updated Oct 31, 2023 • 158k • 60
papers TimeMarker: A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability Paper • 2411.18211 • Published Nov 27, 2024
TimeMarker: A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability Paper • 2411.18211 • Published Nov 27, 2024