Automatic Speech Recognition
Transformers
Safetensors
phi4mm
text-generation
nlp
code
audio
speech-summarization
speech-translation
visual-question-answering
phi-4-multimodal
phi
phi-4-mini
custom_code
Eval Results
Instructions to use microsoft/Phi-4-multimodal-instruct with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use microsoft/Phi-4-multimodal-instruct with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="microsoft/Phi-4-multimodal-instruct", trust_remote_code=True)# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("microsoft/Phi-4-multimodal-instruct", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Update readme
Browse files
README.md
CHANGED
|
@@ -268,7 +268,7 @@ BLINK is an aggregated benchmark with 14 visual tasks that humans can solve very
|
|
| 268 |
### Requirements
|
| 269 |
|
| 270 |
Phi-4 family has been integrated in the `4.48.2` version of `transformers`. The current `transformers` version can be verified with: `pip list | grep transformers`.
|
| 271 |
-
|
| 272 |
Examples of required packages:
|
| 273 |
```
|
| 274 |
flash_attn==2.7.4.post1
|
|
|
|
| 268 |
### Requirements
|
| 269 |
|
| 270 |
Phi-4 family has been integrated in the `4.48.2` version of `transformers`. The current `transformers` version can be verified with: `pip list | grep transformers`.
|
| 271 |
+
We suggest to run with Python 3.10.
|
| 272 |
Examples of required packages:
|
| 273 |
```
|
| 274 |
flash_attn==2.7.4.post1
|