Add pipeline tag, library name, paper link, and basic description

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +23 -3
README.md CHANGED
@@ -1,3 +1,23 @@
1
- ---
2
- license: cc-by-nc-sa-4.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-sa-4.0
3
+ pipeline_tag: image-text-to-text
4
+ library_name: transformers
5
+ ---
6
+
7
+ # DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding
8
+
9
+ This model is presented in the paper [DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding](https://huggingface.co/papers/2408.15045). DocLayLLM is designed for text-rich document understanding, integrating visual patch tokens and 2D positional tokens into LLMs to enhance their document comprehension and OCR information perception.
10
+
11
+ ## How to Use
12
+
13
+ A more complete usage example will be added when available. For now, a basic example:
14
+
15
+ ```python
16
+ from transformers import pipeline
17
+
18
+ pipe = pipeline("text-generation", model="your_model_id") # replace your_model_id
19
+ result = pipe("Your input text here.")
20
+ print(result)
21
+ ```
22
+
23
+ Replace `"your_model_id"` with the actual Hugging Face model ID.