Running on Zero Featured 229 Spark TTS ๐ 229 A text-to-speech model powered by SparkAudio and Mobvoi.
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition โข 6B โข Updated Dec 10, 2025 โข 193k โข 1.56k
Runtime error 72 VLM R1 Referral Expression ๐ฌ 72 Mark regions in images based on text descriptions
Running on Zero Featured 2.02k Chat With Janus-Pro-7B ๐ 2.02k A unified multimodal understanding and generation model.
Running 43 YOLOv10 Document Layout Analysis ๐ 43 Analyze scanned documents to detect and label content