Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
CIMAI 's Collections
Document Understanding
VL Embedding Models
VL Instruct Models
VL Reasoning Models
Text Embedding Models
Text Instruct Edge Models
Text Instruct Models
Text Reasoning Models
Text Reranking Models
Speech-to-Text Models
Coding Models

Document Understanding

updated 23 days ago

https://www.2077ai.com/dataset/dataset-omnidocbench

Upvote
-

  • PaddlePaddle/PaddleOCR-VL

    Image-Text-to-Text • 1.0B • Updated 15 days ago • 17.9k • 1.43k

    Note Multi page support? Maybe with: "concatenate_markdown_pages"


  • opendatalab/MinerU2.5-2509-1.2B

    Image-Text-to-Text • 1B • Updated Sep 29 • 1.06M • 302

    Note agpl-3.0 license: "If you use AGPL-3.0 licensed software in a network-accessible application, you must make the entire source code of your application available to users of that application." :(


  • rednote-hilab/dots.ocr

    Image-Text-to-Text • 3B • Updated Oct 31 • 923k • 1.17k

  • deepseek-ai/DeepSeek-OCR

    Image-Text-to-Text • 3B • Updated Nov 4 • 4.37M • 3k
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs