How to use Lexius/Phi-4-multimodal-instruct with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="Lexius/Phi-4-multimodal-instruct", trust_remote_code=True)
# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("Lexius/Phi-4-multimodal-instruct", trust_remote_code=True, dtype="auto")