HuggingFaceTB/SmolVLM-500M-Instruct
Image-Text-to-Text • 0.5B • Updated • 25.6k • 184
Transcribe audio files or YouTube videos into text
Generate videos from text prompts and optional images
Track your online presence with reverse face search
Generate a 3D mesh model from an image
Generate code for applications
flux.1-dev / flux.1-krea-dev
Import a portrait, click to move the head!
Chat with Mini-Omni 2 - powered by Gradio and WebRTC ⚡️
Add vectors to Hub datasets and do in-memory vector search.
An end-to-end (e2e) Voice Language Model by Fish Audio.