Results tend to return summaries by default instead of raw transcription

#26

by shaunck96 - opened Mar 5, 2025

When using for transcription, the model tends to return summaries instead of raw text. Any recommendations to better align this behavior.

Microsoft org Mar 5, 2025

@shaunck96 Thank you for your interest in Phi-4-multimodal.
Can you share your prompt and audio files?

You need to use some special commands to get a transcription.Check online. I don't remember exactly what code to type but I know that I read it.

Microsoft org Mar 9, 2025

Thanks a lot guys, will check this out.

nguyenbh changed discussion status to closed Mar 9, 2025

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment