Ruchao Fan

fanruchao

Diamondfan

AI & ML interests

Speech, NLP

Organizations

None yet

New activity in microsoft/Phi-4-multimodal-instruct about 1 year ago

Experience with Phi-4-Multimodal vs. Whisper-1 for Speech-to-Text

#39 opened over 1 year ago by

Inference with an external LM?

#69 opened about 1 year ago by

New activity in microsoft/Phi-4-multimodal-instruct over 1 year ago

Decoding strategy of the Phi4 Multimodal

#50 opened over 1 year ago by

Does the model support beam search for ASR?

#31 opened over 1 year ago by

Audio transcription is not finishing the full dialogue

#21 opened over 1 year ago by
