richarddavison commited on
Commit
895f92a
·
verified ·
1 Parent(s): 436ab1d

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +59 -0
README.md ADDED
@@ -0,0 +1,59 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: openrail
3
+ base_model: datalab-to/chandra
4
+ tags:
5
+ - ocr
6
+ - vlm
7
+ - qwen3_vl
8
+ - fp8
9
+ - quantized
10
+ pipeline_tag: image-to-text
11
+ ---
12
+
13
+ # Chandra FP8
14
+
15
+ FP8 quantized version of [datalab-to/chandra](https://huggingface.co/datalab-to/chandra) for efficient inference with vLLM.
16
+
17
+ ## Quantization
18
+
19
+ - **Method**: FP8 Dynamic (W8A8)
20
+ - **Tool**: llmcompressor
21
+ - **Scheme**: Static per-channel weights, dynamic per-token activations
22
+ - **Ignored layers**: `lm_head`, `visual.*`
23
+
24
+ ## Usage with vLLM
25
+
26
+ ```python
27
+ from vllm import LLM
28
+
29
+ llm = LLM("richarddavison/chandra-fp8")
30
+ ```
31
+
32
+ ## Original Model
33
+
34
+ Chandra is an OCR model that outputs markdown, HTML, and JSON. It is highly accurate at extracting text from images and PDFs, while preserving layout information.
35
+
36
+ ### Features
37
+
38
+ - Convert documents to markdown, html, or json with detailed layout information
39
+ - Good handwriting support
40
+ - Reconstructs forms accurately, including checkboxes
41
+ - Good support for tables, math, and complex layouts
42
+ - Extracts images and diagrams, with captions and structured data
43
+ - Support for 40+ languages
44
+
45
+ ### Benchmarks
46
+
47
+ | Model | Overall |
48
+ |-------|---------|
49
+ | Datalab Chandra v0.1.0 | **83.1** |
50
+ | olmOCR v0.3.0 | 78.5 |
51
+ | dots.ocr | 79.1 |
52
+ | Mistral OCR API | 72.0 |
53
+ | GPT-4o (Anchored) | 69.9 |
54
+
55
+ See the [original model card](https://huggingface.co/datalab-to/chandra) for full details.
56
+
57
+ ## Credits
58
+
59
+ Original model by [Datalab](https://huggingface.co/datalab-to).