Ruchao Fan
fanruchao
AI & ML interests
Speech, NLP
Organizations
None yet
Experience with Phi-4-Multimodal vs. Whisper-1 for Speech-to-Text
7
#39 opened about 1 year ago
by
hdevio
Inference with an external LM?
1
#69 opened 12 months ago
by
agonzalezd
Decoding strategy of the Phi4 Multimodal
1
#50 opened about 1 year ago
by
Zhengyang
Does the model support beam search for ASR?
👍 1
6
#31 opened about 1 year ago
by
h9LtLSb
Audio transcription is not finishing the full dialogue
3
#21 opened about 1 year ago
by
Farhang87