richarddavison
/

chandra-fp8

+---
+license: openrail
+base_model: datalab-to/chandra
+tags:
+- ocr
+- vlm
+- qwen3_vl
+- fp8
+- quantized
+pipeline_tag: image-to-text
+---
+# Chandra FP8
+FP8 quantized version of [datalab-to/chandra](https://huggingface.co/datalab-to/chandra) for efficient inference with vLLM.
+## Quantization
+- **Method**: FP8 Dynamic (W8A8)
+- **Tool**: llmcompressor
+- **Scheme**: Static per-channel weights, dynamic per-token activations
+- **Ignored layers**: `lm_head`, `visual.*`
+## Usage with vLLM
+```python
+from vllm import LLM
+llm = LLM("richarddavison/chandra-fp8")
+```
+## Original Model
+Chandra is an OCR model that outputs markdown, HTML, and JSON. It is highly accurate at extracting text from images and PDFs, while preserving layout information.
+### Features
+- Convert documents to markdown, html, or json with detailed layout information
+- Good handwriting support
+- Reconstructs forms accurately, including checkboxes
+- Good support for tables, math, and complex layouts
+- Extracts images and diagrams, with captions and structured data
+- Support for 40+ languages
+### Benchmarks
+| Model | Overall |
+|-------|---------|
+| Datalab Chandra v0.1.0 | **83.1** |
+| olmOCR v0.3.0 | 78.5 |
+| dots.ocr | 79.1 |
+| Mistral OCR API | 72.0 |
+| GPT-4o (Anchored) | 69.9 |
+See the [original model card](https://huggingface.co/datalab-to/chandra) for full details.
+## Credits
+Original model by [Datalab](https://huggingface.co/datalab-to).