meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 165k • 1.59k
Voxtral TTS Demo ⚡ Generate realistic speech from text with custom or preset voices • Running • Agents • Featured • 207
distil-whisper/distil-large-v3.5 Automatic Speech Recognition • 0.8B • Updated 24 days ago • 57.3k • 89
distil-whisper/distil-large-v3 Automatic Speech Recognition • 0.8B • Updated 16 days ago • 1.31M • 376
Frame2KG Collection • A Benchmark and Evaluation Toolkit for Interpretable Frame-to-Graph Generation • 6 items • Updated Feb 19 • 1