Update README.md
README.md
CHANGED
|
@@ -13,11 +13,19 @@ The key idea is to combine the bounding box modality with text, achieving a pixe
|
|
| 13 |
|
| 14 |

|
| 15 |
|
| 16 |
-
The name "Lougat" is a combination of LLama and Nougat.
|
| 17 |
|
| 18 |
- Florence2 + LLama → Flougat
|
| 19 |
- Sam2 + LLama → Slougat
|
| 20 |
- Nougat + Relative Position Embedding LLama → Rlougat
|
| 21 |
|
| 22 |
-
|
| 23 |
|
|
|
|
| 13 |
|
| 14 |

|
| 15 |
|
| 16 |
+
The name "Lougat" is a combination of LLama and Nougat. The key idea is a natural continuation of the paper [LOCR: Location-Guided Transformer for Optical Character Recognition](https://arxiv.org/abs/2403.02127) (arXiv:2403.02127).
|
| 17 |
|
| 18 |
+
Current Branch: The **LOCR** model
|
| 19 |
+
|
| 20 |
+
Other Branches:
|
| 21 |
- Florence2 + LLama → Flougat
|
| 22 |
- Sam2 + LLama → Slougat
|
| 23 |
- Nougat + Relative Position Embedding LLama → Rlougat
|
| 24 |
|
| 25 |
+
|
| 26 |
+
# Inference and Training
|
| 27 |
+
|
| 28 |
+
For inference and training instructions, please see `https://github.com/veya2ztn/Lougat`.
|
| 29 |
+
|
| 30 |
+
|
| 31 |
|