Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
CIMAI 's Collections
Document Understanding
VL Embedding Models
VL Embedding (multi-vec) Models
VL Instruct Models
VL Reasoning Models
VL Reranker Models
Text Embedding Models
Text Instruct Edge Models
Text Instruct Models
Text Reasoning Models
Text Reranking Models
Speech-to-Text Models
Coding Models

Document Understanding

updated Feb 20

https://www.2077ai.com/dataset/dataset-omnidocbench

Upvote
1

  • zai-org/GLM-OCR

    Image-to-Text • Updated Apr 14 • 7.26M • • 1.74k

  • deepseek-ai/DeepSeek-OCR-2

    Image-Text-to-Text • 3B • Updated Feb 3 • 1.65M • 954

  • PaddlePaddle/PaddleOCR-VL-1.5

    Image-Text-to-Text • 1.0B • Updated 16 days ago • 38k • 622

  • lightonai/LightOnOCR-2-1B-base

    Image-Text-to-Text • 1B • Updated Jan 21 • 7.59k • 12

  • opendatalab/MinerU2.5-2509-1.2B

    Image-Text-to-Text • 1B • Updated Apr 9 • 80.1k • 357

    Note agpl-3.0 license: "If you use AGPL-3.0 licensed software in a network-accessible application, you must make the entire source code of your application available to users of that application." :(


  • rednote-hilab/dots.ocr

    Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 207k • 1.3k
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs