AIBA: Attention-based Instrument Band Alignment for Text-to-Audio Diffusion Paper • 2509.20891 • Published Sep 25, 2025
Jamendo-QA: A Large-Scale Music Question Answering Dataset Paper • 2509.15662 • Published Sep 19, 2025 • 1
Illustrious: an Open Advanced Illustration Model Paper • 2409.19946 • Published Sep 30, 2024 • 14
CAT: Contrastive Adapter Training for Personalized Image Generation Paper • 2404.07554 • Published Apr 11, 2024 • 2
Improving Text Generation on Images with Synthetic Captions Paper • 2406.00505 • Published Jun 1, 2024 • 1