SpotEdit: Selective Region Editing in Diffusion Transformers Paper • 2512.22323 • Published 5 days ago • 27
Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming Paper • 2408.16725 • Published Aug 29, 2024 • 53
Audio-Reasoner: Improving Reasoning Capability in Large Audio Language Models Paper • 2503.02318 • Published Mar 4 • 2