Update README.md
README.md
CHANGED
@@ -38,8 +38,8 @@ Structured eval data: [`.eval_results/parsebench.yaml`](./.eval_results/parseben
 
 - **Benchmark**: [ParseBench-Full](https://huggingface.co/datasets/llamaindex/ParseBench) — 2,037 single-page PDFs from real enterprise documents (insurance, finance, government, scientific, etc.)
 - **Evaluator**: official [`parse-bench`](https://github.com/run-llama/ParseBench) CLI
-- **Scoring mode**: rule-only (`LLAMACLOUD_BENCH_LLM_NORMALIZATION=off`) — stricter than the leaderboard's default judge mode
-
+- **Scoring mode**: rule-only (`LLAMACLOUD_BENCH_LLM_NORMALIZATION=off`) — stricter than the leaderboard's default judge mode.
+-
 ## Public leaderboard
 
 Full benchmark comparison across all 47 entries: [parsebench.ai](https://www.parsebench.ai/)