Spaces:

lsextractor
/

deepseek-ocr2-api

Running

File size: 2,382 Bytes

---
title: DeepSeek OCR-2 API
emoji: 🔍
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.31.0
python_version: 3.11
app_file: app.py
pinned: false
license: apache-2.0
---

# DeepSeek-OCR-2 Table Structure Recognition API

High-accuracy OCR and table structure recognition using DeepSeek-OCR-2 (3B parameters).

## Features

- 📊 **Table Detection & Recognition**: Extract complex table structures
- 📦 **Cell-Level Bounding Boxes**: Precise coordinates for all cells
- 📋 **Header Detection**: Automatic header identification
- 🔗 **Merged Cells**: Rowspan/colspan support
- 🎯 **High Accuracy**: State-of-the-art performance

## API Usage

### Python Client

```python
import requests
import base64

# Load and encode image
with open("document.png", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode()

# Call API
response = requests.post(
    "https://your-username-space-name.hf.space/api/predict",
    json={"data": [image_b64]},
    headers={"Authorization": f"Bearer {YOUR_HF_TOKEN}"}
)

result = response.json()
print(result)
```

### cURL

```bash
curl -X POST https://your-username-space-name.hf.space/api/predict \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_HF_TOKEN" \
  -d '{"data": ["base64_encoded_image"]}'
```

## Response Format

```json
{
  "status": "success",
  "tables": [
    {
      "bbox": [x1, y1, x2, y2],
      "cells": [
        {
          "row": 0,
          "col": 0,
          "rowSpan": 1,
          "colSpan": 1,
          "bbox": [x1, y1, x2, y2],
          "text": "Cell content"
        }
      ],
      "headers": [...],
      "rows": [...]
    }
  ],
  "blocks": [...],
  "text": "Extracted text...",
  "metadata": {
    "model": "deepseek-ai/DeepSeek-OCR-2",
    "device": "cuda",
    "image_size": [width, height]
  }
}
```

## Model Info

- **Model:** deepseek-ai/DeepSeek-OCR-2
- **Parameters:** 3B
- **Precision:** FP16
- **GPU:** T4 (16GB VRAM)
- **License:** Apache-2.0

## Links

- [Model on HuggingFace](https://huggingface.co/deepseek-ai/DeepSeek-OCR-2)
- [Project Repository](https://git.epam.com/epm-gpt/badgerdoc/ls-extractor)
- [Documentation](https://git.epam.com/epm-gpt/badgerdoc/ls-extractor/-/tree/main/docs)

## Citation

```bibtex
@article{deepseek-ocr-2,
  title={DeepSeek-OCR-2: Advanced Document Understanding},
  author={DeepSeek AI},
  year={2026}
}
```