File size: 1,433 Bytes
895f92a | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 | ---
license: openrail
base_model: datalab-to/chandra
tags:
- ocr
- vlm
- qwen3_vl
- fp8
- quantized
pipeline_tag: image-to-text
---
# Chandra FP8
FP8 quantized version of [datalab-to/chandra](https://huggingface.co/datalab-to/chandra) for efficient inference with vLLM.
## Quantization
- **Method**: FP8 Dynamic (W8A8)
- **Tool**: llmcompressor
- **Scheme**: Static per-channel weights, dynamic per-token activations
- **Ignored layers**: `lm_head`, `visual.*`
## Usage with vLLM
```python
from vllm import LLM
llm = LLM("richarddavison/chandra-fp8")
```
## Original Model
Chandra is an OCR model that outputs markdown, HTML, and JSON. It is highly accurate at extracting text from images and PDFs, while preserving layout information.
### Features
- Convert documents to markdown, html, or json with detailed layout information
- Good handwriting support
- Reconstructs forms accurately, including checkboxes
- Good support for tables, math, and complex layouts
- Extracts images and diagrams, with captions and structured data
- Support for 40+ languages
### Benchmarks
| Model | Overall |
|-------|---------|
| Datalab Chandra v0.1.0 | **83.1** |
| olmOCR v0.3.0 | 78.5 |
| dots.ocr | 79.1 |
| Mistral OCR API | 72.0 |
| GPT-4o (Anchored) | 69.9 |
See the [original model card](https://huggingface.co/datalab-to/chandra) for full details.
## Credits
Original model by [Datalab](https://huggingface.co/datalab-to).
|