Vik Paruchuri
committed on
Commit 80316f3 · 1 Parent(s): af53da5
Update README
README.md CHANGED
@@ -3,11 +3,11 @@
 Marker converts PDFs and images to markdown, JSON, and HTML quickly and accurately.

 - Supports a range of documents in all languages
-- Formats tables, forms, equations, links, references, and code blocks
-- Extracts and saves images
+- Formats tables, forms, equations, inline math, links, references, and code blocks
+- Extracts and saves images
 - Removes headers/footers/other artifacts
--
-- Optionally boost accuracy with
+- Extensible with your own formatting and logic
+- Optionally boost accuracy with LLMs
 - Works on GPU, CPU, or MPS

 ## Performance
@@ -485,8 +485,8 @@ It only uses models where necessary, which improves speed and accuracy.

 PDF is a tricky format, so marker will not always work perfectly. Here are some known limitations that are on the roadmap to address:

-- Marker will only convert block equations
 - Very complex layouts, with nested tables and forms, may not work
+- Forms may not be rendered well

 Note: Passing the `--use_llm` flag will mostly solve these issues.