OpenMOSS-Team/MOSS-TTS-Local-Transformer-v1.5 Text-to-Speech • 5B • Updated 7 days ago • 9.65k • 54
OpenMOSS-Team/MOSS-Audio-Tokenizer-v2 Image Feature Extraction • 2B • Updated 14 days ago • 5.99k • 16
Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation Paper • 2605.19833 • Published May 19 • 137
World Action Models: The Next Frontier in Embodied AI Paper • 2605.12090 • Published May 12 • 68
MOSS-Audio Collection An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 9 items • Updated 13 days ago • 66