view article Article Arabic TTS Arena: Ranking Voice Models the Way Chess Ranks Grandmasters 15 days ago • 14
Arabic Speech Datasets Collection Best Datasets for Arabic Speech Tasks • 16 items • Updated Jan 1 • 16
MoshiVis v0.1 Collection MoshiVis is a Vision Speech Model built as a perceptually-augmented version of Moshi v0.1 for conversing about image inputs • 6 items • Updated 25 days ago • 23
view article Article How to Choose the Best Open Source LLM for Your Project in 2025 Sep 9, 2025 • 75
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! +10 Aug 5, 2025 • 511
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency Paper • 2409.02634 • Published Sep 4, 2024 • 97
view article Article PaliGemma – Google's Cutting-Edge Open Vision Language Model +1 May 14, 2024 • 285