Audio Dataset
MLCommons/peoples_speech_v1.0 • Updated Aug 25, 2024 • 99 • 8
amphion/Emilia-Dataset • Viewer • Updated Feb 28, 2025 • 54.8M • 69.3k • 460
simon3000/genshin-voice • Viewer • Updated Apr 22, 2025 • 424k • 7.62k • 236
facebook/multilingual_librispeech • Viewer • Updated Aug 12, 2024 • 1.49M • 20.6k • 182
Omni model
Collection of omni-modal models
inclusionAI/Ming-flash-omni-2.0 • Any-to-Any • 104B • Updated Feb 12 • 2.56k • 268
Qwen/Qwen3-Omni-30B-A3B-Instruct • Any-to-Any • 35B • Updated Sep 22, 2025 • 2.05M • 943
naver-hyperclovax/HyperCLOVAX-SEED-Omni-8B • Text Generation • 11B • Updated Jan 6 • 569 • 191
meituan-longcat/LongCat-Flash-Omni • Any-to-Any • 561B • Updated Nov 11, 2025 • 47 • 113