What is the testing setup?
#46
by wamreyaz - opened
The README.md says this about the speed benchmark:
For speed, we compared different OCR methods under identical hardware and testing conditions (single replica, single concurrency), evaluating their performance in parsing and exporting Markdown files from both image and PDF inputs. Results show GLM-OCR achieves a throughput of 1.86 pages/second for PDF documents and 0.67 images/second for images, significantly outperforming comparable models.
How is this measured? The supplied Docker image for vLLM does not have PP-DocLayout enabled, so what batch size do you use, and how many concurrent requests are sent?
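To make the question concrete, here is a minimal sketch of what I would assume "single replica, single concurrency" means: inputs are sent strictly one at a time and throughput is items divided by wall-clock time. This is only my guess at the setup, not the method actually used for the README numbers. `measure_throughput` and `fake_parse` are hypothetical names, and `fake_parse` is a stand-in for whatever call sends one page or image to the OCR server.

```python
import time
from typing import Callable, Iterable


def measure_throughput(items: Iterable[bytes], parse: Callable[[bytes], str]) -> float:
    """Return items per second when inputs are parsed strictly one at a time."""
    count = 0
    start = time.perf_counter()
    for item in items:
        parse(item)  # only one request in flight at any moment (single concurrency)
        count += 1
    elapsed = time.perf_counter() - start
    return count / elapsed if elapsed > 0 else float("inf")


def fake_parse(page: bytes) -> str:
    # Hypothetical stand-in for one request to the OCR server (assumption).
    time.sleep(0.01)
    return "# markdown"


if __name__ == "__main__":
    pages = [b"page"] * 20
    print(f"{measure_throughput(pages, fake_parse):.2f} pages/second")
```

If the real benchmark batches pages inside a single request, or if layout detection runs as a separate step, the number would mean something different from what this loop measures, which is why I am asking.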
Could you please explain?