microsoft/Phi-4-reasoning-vision-15B
Image-Text-to-Text • 15B • Updated • 5.02k • 172
Ask questions and get detailed answers
OpenAI's Deep Research, but open
Need to analyze data? Let a Llama-3.1 agent do it for you!
Embedded MinerU document extraction demo
Chat with an AI assistant that can search the web
MidJour | A RealVisXL_Turbo | IRL HI-Res Images Gen
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Generate music from lyrics and genre tags