Running on Zero Agents Featured 827 OmniVoice High-quality voice cloning TTS for 600+ languages
Running on CPU Upgrade 283 Inference Playground Try Hugging Face models in an interactive web playground
Paused Agents Featured 229 Spark TTS A text-to-speech model powered by SparkAudio and Mobvoi.
Runtime error Agents 13 Whisper Realtime Transcription Transcribe audio in realtime with Whisper
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 395k • 1.6k
Build error Agents 32 SmolVLM2 XSPFGenerator (VLC prototype) Generate video highlights and playlist
Runtime error Agents 72 VLM R1 Referral Expression Mark regions in images based on text descriptions
Running on Zero Agents 310 BiRefNet Demo Remove background from photos with accurate segmentation
Runtime error Agents Featured 2.02k Chat With Janus-Pro-7B A unified multimodal understanding and generation model.
Running Agents 43 YOLOv10 Document Layout Analysis Analyze scanned documents to detect and label content