A Computational Analysis of Real-World DJ Mixes using Mix-To-Track Subsequence Alignment Paper • 2008.10267 • Published Aug 24, 2020 • 1
Moshi: a speech-text foundation model for real-time dialogue Paper • 2410.00037 • Published Sep 17, 2024 • 16
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 16 items • Updated Dec 24, 2025 • 244
NeuTTS Nano Multilingual Collection Collection NeuTTS Nano is a TTS model, 3x smaller than NeuTTS Air, that runs on CPU in real-time - now in English, Spanish, French, and German versions! • 13 items • Updated 3 days ago • 18
🎵 The MusicBox Collection A collection full of musical tasks demos, for musicians & music enthusiasts • 39 items • Updated 23 days ago • 33
Runtime error Agents 82 Vocal Separation SOTA 🎤 82 Separate vocals and music from any audio file or YouTube video