microsoft/Phi-4-reasoning-vision-15B Image-Text-to-Text • 15B • Updated 8 days ago • 18.2k • 144
Running on Zero • MCP • Featured • 81 • GLM OCR Demo • Multimodal OCR model for complex document understanding.
mistralai/Voxtral-Mini-4B-Realtime-2602 Automatic Speech Recognition • 4B • Updated 1 day ago • 600k • 701