ricklon and Claude Sonnet 4.6 committed
Commit b9d5e1c · 1 Parent(s): 25ba1bf

Fix flash_attention_2 startup crash on ZeroGPU and LaTeX delimiter rendering

- Use sdpa attention impl when CUDA is unavailable at load time (ZeroGPU
has no GPU until inside @spaces.GPU); fall back to flash_attention_2
locally where CUDA is present
- Pre-convert model's \[...\] and \(...\) delimiters to $$...$$ and $...$
in to_math_html() before passing to markdown; markdown strips backslashes
before arithmatex can protect them, causing equations to render as bare
brackets instead of math
- Document delimiter pre-conversion in TECHNICAL.md

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Files changed (2)
  1. TECHNICAL.md +13 -0
  2. app.py +11 -1
TECHNICAL.md CHANGED
@@ -304,6 +304,19 @@ After arithmatex + markdown:
 
 The `_` inside the math is never touched by the markdown processor.
 
+### Delimiter pre-conversion
+
+The model outputs `\[...\]` for display math and `\(...\)` for inline math. But `pymdownx.arithmatex` only recognises `$...$` and `$$...$$` by default. Worse, if `\[...\]` is passed directly to the markdown processor, the backslashes are stripped first — before arithmatex can intercept them — leaving bare `[...]` brackets in the output.
+
+`to_math_html()` therefore pre-converts the model's native delimiters before calling `markdown()`:
+
+```python
+text = re.sub(r'\\\[(.+?)\\\]', r'$$\1$$', text, flags=re.DOTALL)
+text = re.sub(r'\\\((.+?)\\\)', r'$\1$', text)
+```
+
+After this step, arithmatex sees `$$...$$` and `$...$`, protects the content from markdown, and wraps it in `\[...\]` and `\(...\)` for MathJax to render.
+
 ### MathJax configuration
 
 MathJax is loaded once in the page `<head>` and configured to process `\(...\)` for inline math and `\[...\]` for display math — matching the output format of arithmatex:
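The two substitutions documented above can be exercised in isolation. A minimal sketch — the `convert_delimiters` helper name is hypothetical; the regexes themselves are the ones from the diff:

```python
import re

def convert_delimiters(text: str) -> str:
    # Display math: \[...\] becomes $$...$$; DOTALL lets equations span lines.
    text = re.sub(r'\\\[(.+?)\\\]', r'$$\1$$', text, flags=re.DOTALL)
    # Inline math: \(...\) becomes $...$ (single-line, matching the diff).
    text = re.sub(r'\\\((.+?)\\\)', r'$\1$', text)
    return text

print(convert_delimiters(r"Mass-energy: \(E = mc^2\)"))
# → Mass-energy: $E = mc^2$
```

Text without any math delimiters passes through unchanged, so the pre-conversion is safe to apply to all model output.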
app.py CHANGED
@@ -28,7 +28,12 @@ MODEL_NAME = 'deepseek-ai/DeepSeek-OCR-2'
 # MODEL_NAME = 'mzbac/DeepSeek-OCR-2-8bit'
 
 tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME, trust_remote_code=True)
-model = AutoModel.from_pretrained(MODEL_NAME, _attn_implementation='flash_attention_2', torch_dtype=torch.bfloat16, trust_remote_code=True, use_safetensors=True).eval()
+# flash_attention_2 requires a CUDA device at init time — not available on ZeroGPU at
+# module load. Use sdpa (PyTorch scaled dot product attention) as the fallback; it works
+# on CPU at load time and on GPU at inference time. Locally with CUDA present, use
+# flash_attention_2 for maximum throughput.
+_attn_impl = 'flash_attention_2' if torch.cuda.is_available() else 'sdpa'
+model = AutoModel.from_pretrained(MODEL_NAME, _attn_implementation=_attn_impl, torch_dtype=torch.bfloat16, trust_remote_code=True, use_safetensors=True).eval()
 # .cuda() is NOT called here — on ZeroGPU, GPU is only available inside @spaces.GPU
 # functions. Locally, model.cuda() is called inside process_image on first run.
 
@@ -161,6 +166,11 @@ window.MathJax = {
 def to_math_html(text):
     if not text:
         return ""
+    # Pre-convert \[...\] and \(...\) to $$...$$ and $...$
+    # Markdown strips backslashes before arithmatex can protect them,
+    # so convert to $-delimiters first (arithmatex recognises those).
+    text = re.sub(r'\\\[(.+?)\\\]', r'$$\1$$', text, flags=re.DOTALL)
+    text = re.sub(r'\\\((.+?)\\\)', r'$\1$', text)
     html = md_lib.markdown(text, extensions=[
         'pymdownx.arithmatex',
         'tables',
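The load-time fallback from the first hunk can be sketched as a pure helper; `pick_attn_impl` is a hypothetical name for illustration — the real code inlines the conditional and passes `torch.cuda.is_available()` directly:

```python
def pick_attn_impl(cuda_available: bool) -> str:
    # flash_attention_2 must initialise on a CUDA device; on ZeroGPU no GPU
    # exists at module load time, so fall back to sdpa, which loads on CPU
    # and still dispatches to the GPU later inside an @spaces.GPU function.
    return 'flash_attention_2' if cuda_available else 'sdpa'

print(pick_attn_impl(False))  # ZeroGPU at module load → sdpa
print(pick_attn_impl(True))   # local machine with CUDA → flash_attention_2
```

Keeping the decision a function of CUDA availability (rather than hard-coding either implementation) is what lets the same app.py run unmodified both locally and on ZeroGPU.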