Advancing Speech Understanding in Speech-Aware Language Models with GRPO Paper • 2509.16990 • Published Sep 21, 2025 • 21
Alterbute: Editing Intrinsic Attributes of Objects in Images Paper • 2601.10714 • Published 12 days ago • 30
Beyond Transcription: Mechanistic Interpretability in ASR Paper • 2508.15882 • Published Aug 21, 2025 • 86
Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation Paper • 2506.08570 • Published Jun 10, 2025 • 33
CHIMERA: A Knowledge Base of Idea Recombination in Scientific Literature Paper • 2505.20779 • Published May 27, 2025 • 15
Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning Paper • 2505.17813 • Published May 23, 2025 • 58
WHISTRESS: Enriching Transcriptions with Sentence Stress Detection Paper • 2505.19103 • Published May 25, 2025 • 13
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published Feb 19, 2025 • 69