How to use microsoft/Phi-4-multimodal-instruct with Transformers:
    # Use a pipeline as a high-level helper
    from transformers import pipeline

    pipe = pipeline("automatic-speech-recognition", model="microsoft/Phi-4-multimodal-instruct", trust_remote_code=True)

    # Load model directly
    from transformers import AutoModelForCausalLM

    model = AutoModelForCausalLM.from_pretrained("microsoft/Phi-4-multimodal-instruct", trust_remote_code=True, dtype="auto")
Multilingual powerhouse — testing for mobile deployment
#95, opened by 3morixd
This model covers Thai, Vietnamese, Polish, German, and Swedish, which is exactly the kind of multilingual capability we need for global mobile AI.
At Dispatch AI (FZE, UAE), we're building mobile AI that works for everyone. Models like this are the foundation.
We benchmark multilingual models on our 40-phone farm (Snapdragon 865) to see which ones maintain quality across languages when quantized to 4-bit. Results vary widely: some models lose 30% of their quality on non-English languages after quantization.
We would love to see multilingual eval results at different quantization levels.
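To make the request concrete, here is a minimal sketch of how a per-language quality drop could be reported across quantization levels. Everything in it is an assumption for illustration: the function names, the level labels (`fp16`, `int8`, `int4`), and the scores, which are placeholders and not measurements from this model or from our phone farm. It only assumes a score where higher is better.

```python
from typing import Dict


def relative_drop(baseline: float, quantized: float) -> float:
    """Fraction of baseline quality lost after quantization (0.30 means 30%)."""
    if baseline <= 0:
        raise ValueError("baseline score must be positive")
    return (baseline - quantized) / baseline


def degradation_report(
    scores: Dict[str, Dict[str, float]],
    baseline_key: str = "fp16",
) -> Dict[str, Dict[str, float]]:
    """For each language, compute the relative drop at every non-baseline level."""
    report: Dict[str, Dict[str, float]] = {}
    for lang, by_level in scores.items():
        base = by_level[baseline_key]
        report[lang] = {
            level: relative_drop(base, score)
            for level, score in by_level.items()
            if level != baseline_key
        }
    return report


# Placeholder scores (higher is better). These are NOT real results.
example_scores = {
    "en": {"fp16": 0.80, "int8": 0.79, "int4": 0.76},
    "th": {"fp16": 0.60, "int8": 0.57, "int4": 0.42},
}

report = degradation_report(example_scores)
for lang, drops in report.items():
    formatted = ", ".join(f"{level}: {drop:.1%}" for level, drop in drops.items())
    print(f"{lang}: {formatted}")
```

A table in this shape, one row per language and one column per quantization level, would make it easy to spot languages that degrade much more than English.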
- Dispatch AI (FZE), Sharjah UAE