VLM For OCR
Text Generation
• Updated • 43k
• 277
Image-to-Text
• 1B • Updated • 1.31k
• 34
Text Generation
• 18B • Updated • 314
• 69
openbmb/MiniCPM-Llama3-V-2_5
Image-Text-to-Text
• 9B • Updated • 60.4k
• 1.41k
google/paligemma-3b-pt-896
Image-Text-to-Text
• 3B • Updated • 593
• 124
UCSC-VLAA/Recap-DataComp-1B
Viewer
• Updated • 1.88B • 16.9k
• 198
WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences
Paper
• 2406.11069
• Published • 14
pbevan11/synthetic-ocr-correction-gpt4o
Viewer
• Updated • 10k • 30
• 6
yifeihu/ACL-23-Paper-OCR-Markdown
Viewer
• Updated • 2.15k • 32
• 19
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs
Paper
• 2406.15319
• Published • 64