DIFFA / README.md

zhoujiaming777

Update README.md

2748cf0 verified 6 months ago

preview code

raw

history blame contribute delete

639 Bytes

metadata

license: cc-by-nc-sa-4.0

DIFFA: Large Language Diffusion Models Can Listen and Understand

DIFFA is the first diffusion-based large audio-language model for spoken language understanding.
It combines a frozen diffusion LLM with dual adapters (semantic + acoustic) to enhance audio perception and reasoning.