DeepSeek-OCR-2-Demo
💻
14
DeepSeek-OCR 2: Visual Causal Flow
Interact with an AI agent to perform web tasks
Universal Image Editing is worth a single LoRA
An interactive digit classification demo
Gemini 2.0 native image generation co-doodling
interactive demo for cube 3d model
A text-to-speech model powered by SparkAudio and Mobvoi.
A unified multimodal understanding and generation model.
Demo for MiniCPM-o 2.6 to answer questions about images
Unified Framework for Generalized Video Face Restoration