prithivMLmods commited on
Commit
72b7cde
ยท
verified ยท
1 Parent(s): 639f41e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +28 -1
README.md CHANGED
@@ -37,6 +37,8 @@ This model is particularly effective in **retrieving mathematical notations and
37
 
38
  ![latexqwen.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/-h5z3giEudPrdM9qRMMTe.png)
39
 
 
 
40
  # Use it with Transformers
41
 
42
 
@@ -102,7 +104,7 @@ output_text = processor.batch_decode(
102
  )
103
  print(output_text)
104
  ```
105
- ### Buf
106
  ```python
107
  buffer = ""
108
  for new_text in streamer:
@@ -111,5 +113,30 @@ print(output_text)
111
  buffer = buffer.replace("<|im_end|>", "")
112
  yield buffer
113
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
114
 
 
115
 
 
 
 
 
 
 
 
 
37
 
38
  ![latexqwen.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/-h5z3giEudPrdM9qRMMTe.png)
39
 
40
+ Demo: https://huggingface.co/prithivMLmods/LatexMind-2B-Codec/blob/main/latexmind/latexmind-codec.ipynb
41
+
42
  # Use it with Transformers
43
 
44
 
 
104
  )
105
  print(output_text)
106
  ```
107
+ # Buf
108
  ```python
109
  buffer = ""
110
  for new_text in streamer:
 
113
  buffer = buffer.replace("<|im_end|>", "")
114
  yield buffer
115
  ```
116
+ Hereโ€™s the **Intended Use & Limitations** section for **LatexMind-2B-Codec**:
117
+
118
+ ---
119
+
120
+ # Intended Use
121
+
122
+ **LatexMind-2B-Codec** is designed for tasks that require **image-based text recognition**, **math equation extraction**, and **multi-modal understanding**. It is particularly useful in the following scenarios:
123
+
124
+ ๐Ÿ”น **Optical Character Recognition (OCR)** โ€“ Extracting printed and handwritten text from images, documents, and scanned pages.
125
+ ๐Ÿ”น **Math Expression Recognition** โ€“ Converting mathematical notations into structured **LaTeX format** for further computation and documentation.
126
+ ๐Ÿ”น **Image-to-Text Conversion** โ€“ Generating accurate descriptions for text-rich and math-heavy images.
127
+ ๐Ÿ”น **Document and Academic Processing** โ€“ Assisting researchers, students, and professionals in digitizing handwritten notes and extracting structured content from books, PDFs, and whiteboards.
128
+ ๐Ÿ”น **Automated Educational Support** โ€“ Enabling AI-powered tutors, content summarization, and interactive learning for subjects involving complex equations.
129
+ ๐Ÿ”น **Multi-Language OCR** โ€“ Recognizing text inside images across multiple languages, including English, Chinese, Japanese, Korean, Arabic, and various European languages.
130
+ ๐Ÿ”น **Video-Based Question Answering** โ€“ Understanding long-duration videos for content summarization, question answering, and structured data extraction.
131
+
132
+ # Limitations
133
 
134
+ Despite its capabilities, **LatexMind-2B-Codec** has some inherent limitations:
135
 
136
+ โš  **Handwritten Text Accuracy** โ€“ While it can recognize handwritten equations, performance may degrade with highly unstructured or messy handwriting.
137
+ โš  **Complex LaTeX Formatting** โ€“ The model may struggle with deeply nested or ambiguous LaTeX expressions, requiring manual corrections for precise formatting.
138
+ โš  **Low-Resolution Images** โ€“ Extracting accurate text from blurry or low-resolution images can lead to misinterpretations or OCR errors.
139
+ โš  **Contextual Understanding in Multi-Step Equations** โ€“ While it recognizes math expressions, solving multi-step problems autonomously may be limited.
140
+ โš  **Limited Support for Rare Mathematical Notations** โ€“ Some specialized or domain-specific symbols may not be recognized with high accuracy.
141
+ โš  **Processing Speed for Large Documents** โ€“ Performance may slow down when handling extremely large documents or dense mathematical content in real-time applications.
142
+ โš  **Language-Specific OCR Variability** โ€“ While it supports multiple languages, OCR accuracy may vary depending on the script complexity and font style.