gpt-omni/mini-omni2
Any-to-Any • Updated • 63 • 283
MoonDream 2 Vision Model on the Browser: Candle/Rust/WASM
A tiny vision language model.
A unified multimodal understanding and generation model.
Interact with an AI by sending text, images, or audio
Cosmos-R1 / docscopeOCR / Captioner-7B / visionOCR-3B