Spaces:
Running
Running
| title: DeepSeek OCR-2 API | |
| emoji: π | |
| colorFrom: blue | |
| colorTo: purple | |
| sdk: gradio | |
| sdk_version: 4.31.0 | |
| python_version: 3.11 | |
| app_file: app.py | |
| pinned: false | |
| license: apache-2.0 | |
| # DeepSeek-OCR-2 Table Structure Recognition API | |
| High-accuracy OCR and table structure recognition using DeepSeek-OCR-2 (3B parameters). | |
| ## Features | |
| - π **Table Detection & Recognition**: Extract complex table structures | |
| - π¦ **Cell-Level Bounding Boxes**: Precise coordinates for all cells | |
| - π **Header Detection**: Automatic header identification | |
| - π **Merged Cells**: Rowspan/colspan support | |
| - π― **High Accuracy**: State-of-the-art performance | |
| ## API Usage | |
| ### Python Client | |
| ```python | |
| import requests | |
| import base64 | |
| # Load and encode image | |
| with open("document.png", "rb") as f: | |
| image_b64 = base64.b64encode(f.read()).decode() | |
| # Call API | |
| response = requests.post( | |
| "https://your-username-space-name.hf.space/api/predict", | |
| json={"data": [image_b64]}, | |
| headers={"Authorization": f"Bearer {YOUR_HF_TOKEN}"} | |
| ) | |
| result = response.json() | |
| print(result) | |
| ``` | |
| ### cURL | |
| ```bash | |
| curl -X POST https://your-username-space-name.hf.space/api/predict \ | |
| -H "Content-Type: application/json" \ | |
| -H "Authorization: Bearer YOUR_HF_TOKEN" \ | |
| -d '{"data": ["base64_encoded_image"]}' | |
| ``` | |
| ## Response Format | |
| ```json | |
| { | |
| "status": "success", | |
| "tables": [ | |
| { | |
| "bbox": [x1, y1, x2, y2], | |
| "cells": [ | |
| { | |
| "row": 0, | |
| "col": 0, | |
| "rowSpan": 1, | |
| "colSpan": 1, | |
| "bbox": [x1, y1, x2, y2], | |
| "text": "Cell content" | |
| } | |
| ], | |
| "headers": [...], | |
| "rows": [...] | |
| } | |
| ], | |
| "blocks": [...], | |
| "text": "Extracted text...", | |
| "metadata": { | |
| "model": "deepseek-ai/DeepSeek-OCR-2", | |
| "device": "cuda", | |
| "image_size": [width, height] | |
| } | |
| } | |
| ``` | |
| ## Model Info | |
| - **Model:** deepseek-ai/DeepSeek-OCR-2 | |
| - **Parameters:** 3B | |
| - **Precision:** FP16 | |
| - **GPU:** T4 (16GB VRAM) | |
| - **License:** Apache-2.0 | |
| ## Links | |
| - [Model on HuggingFace](https://huggingface.co/deepseek-ai/DeepSeek-OCR-2) | |
| - [Project Repository](https://git.epam.com/epm-gpt/badgerdoc/ls-extractor) | |
| - [Documentation](https://git.epam.com/epm-gpt/badgerdoc/ls-extractor/-/tree/main/docs) | |
| ## Citation | |
| ```bibtex | |
| @article{deepseek-ocr-2, | |
| title={DeepSeek-OCR-2: Advanced Document Understanding}, | |
| author={DeepSeek AI}, | |
| year={2026} | |
| } | |
| ``` | |