fancyfeast/llama-joycaption-beta-one-hf-llava
Image-Text-to-Text β’ 8B β’ Updated β’ 121k β’ 359
fast video generation from images & text
Generate custom captions, tags, or prompts for any image
Generate captions for images using text prompts
Generate synchronized audio for videos from text prompts
Generate depth map from your photo
Generate creative Stable Diffusion prompts
Generate detailed anime tags from images
A unified multimodal understanding and generation model.
Launch an interactive web interface for the tool
Chat with Gemini 2.5 to get detailed responses