openbmb/MiniCPM-V-2
Visual Question Answering • 3B • Updated • 58.3k • 486
Generate text responses in a chat interface
Transform images based on text instructions
Generate responses using a GPT-4 language model
Create quantized models from Hugging Face repos
Create an animated video from audio and a reference image
Create your own AI comic with a single prompt
Generate or edit images using text prompts
Convert screenshots to HTML
Video Understanding with Interleaved Visual-Textual Tokens
MagicTime: Time-lapse Video Generation Models as Metamorphic