etiennebcp committed on
Commit d45c9f4 · verified · 1 Parent(s): 2cf26ee

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -29,7 +29,7 @@ pipeline_tag: text-generation
 
  # Reasoning comes to OCR 🧠✨📄🤘
 
- **NuMarkdown-8B-Thinking** is the first reasoning OCR VLM. It is specifically trained to convert documents into clean GitHub-flavoured Markdown. It generates thoughts tokens to figure out the layout of the document before generating the Markdown file.
+ **NuMarkdown-8B-Thinking** is the first reasoning OCR VLM. It is specifically trained to convert documents into clean Markdown files, well suited for RAG applications. It generates thoughts tokens to figure out the layout of the document before generating the Markdown file.
  It is particularly good at understanding documents with weird layouts and complex tables. The number of thinking tokens can vary from 20% to 500% of the final answer, depending on the task difficulty.
 
  **NuMarkdown-8B-Thinking** is a fine-tune of **Qwen 2.5-VL-7B** on synthetic Doc → Reasoning → Markdown examples, followed by an RL phase (GRPO) with a layout-centric reward.