view article Article Arabic TTS Arena: Ranking Voice Models the Way Chess Ranks Grandmasters Navid-AI • Mar 12 • 17
Arabic Speech Datasets Collection Best Datasets for Arabic Speech Tasks • 20 items • Updated 6 days ago • 18
MoshiVis v0.1 Collection MoshiVis is a Vision Speech Model built as a perceptually-augmented version of Moshi v0.1 for conversing about image inputs • 6 items • Updated Mar 2 • 23
view article Article Supercharge your OCR Pipelines with Open Models +5 merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq • Oct 21, 2025 • 312
view article Article How to Choose the Best Open Source LLM for Your Project in 2025 dvilasuero • Sep 9, 2025 • 77
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! +10 reach-vb, pcuenq, lewtun, clem, Rocketknight1, clefourrier, celinah, Wauplin, marcsun13, pagezyhf, ahadnagy, joaogante • Aug 5, 2025 • 513
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency Paper • 2409.02634 • Published Sep 4, 2024 • 97
view article Article PaliGemma – Google's Cutting-Edge Open Vision Language Model +1 merve, andsteing, pcuenq • May 14, 2024 • 287