view article Article How to make NeuTTS-air generate over 200 seconds of audio in a single second. Nov 21, 2025 • 22
GLiNER-X Collection The Multilingual Named Entity Recognition (NER) model which is capable of identifying any entity type. • 6 items • Updated Jun 24, 2025 • 21
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model Paper • 2408.17175 • Published Aug 30, 2024 • 6
view article Article From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages Feb 11, 2025 • 33
Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis Paper • 2502.04128 • Published Feb 6, 2025 • 27
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 27 days ago • 553
Lost in the Middle: How Language Models Use Long Contexts Paper • 2307.03172 • Published Jul 6, 2023 • 43
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence Paper • 2401.14196 • Published Jan 25, 2024 • 69