Duplicated from microsoft/Phi-4-multimodal-instruct

mjtechguy
/

phi-4-multimodal-instruct

Automatic Speech Recognition

text-generation

speech-summarization

speech-translation

visual-question-answering

phi-4-multimodal

Model card Files Files and versions

phi-4-multimodal-instruct / examples

Ctrl+K

Ctrl+K

2 contributors

History: 1 commit

mjtechguy's picture

Duplicate from microsoft/Phi-4-multimodal-instruct

df2cb0d verified over 1 year ago

what_is_shown_in_this_image.wav

113 kB
Duplicate from microsoft/Phi-4-multimodal-instruct over 1 year ago
what_is_the_traffic_sign_in_the_image.wav

741 kB
Duplicate from microsoft/Phi-4-multimodal-instruct over 1 year ago