DeepSeek-OCR
π
16
Extract text from images and convert to markdown
A unified multimodal understanding and generation model.
Edit photos with scribbles and AI-driven color changes
Easily expand image boundaries
Engage in multimedia chat with LLMs and ML models
Generate audio from text descriptions