Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and
Dialogue Abilities
Paper
• 2402.01831
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding
and Expert Reasoning Abilities
Paper
• 2503.03983
Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large
Audio Language Models
Paper
• 2507.08128
Jamendo-QA: A Large-Scale Music Question Answering Dataset
Paper
• 2509.15662
SightSound-R1: Cross-Modal Reasoning Distillation from Vision to Audio
Language Models
Paper
• 2509.15661
UALM: Unified Audio Language Model for Understanding, Generation and Reasoning
Paper
• 2510.12000
MusiCRS: Benchmarking Audio-Centric Conversational Recommendation
Paper
• 2509.19469
AudioMarathon: A Comprehensive Benchmark for Long-Context Audio
Understanding and Efficiency in Audio LLMs
Paper
• 2510.07293
SongPrep: A Preprocessing Framework and End-to-end Model for Full-song
Structure Parsing and Lyrics Transcription
Paper
• 2509.17404