IEMF / Audio Visual Question Answering
533 MB
xianghe's picture
Well-trained model weights
b6e3ea0