Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
AHS-uni 's Collections
DocRAG Datasets
DocRAG Sample Datasets
DocRAG Baseline Models

DocRAG Baseline Models

updated May 30

Baseline VLMs used in DocRAG for either retrieval or generation.

Upvote
-

  • Qwen/Qwen2.5-VL-7B-Instruct

    Image-Text-to-Text • 8B • Updated Apr 6 • 2.55M • • 1.41k

  • Qwen/Qwen2.5-VL-7B-Instruct-AWQ

    Image-Text-to-Text • 8B • Updated Apr 6 • 95.9k • 96

  • Metric-AI/ColQwen2.5-7b-multilingual-v1.0

    Visual Document Retrieval • Updated Jul 28 • 1.34k • 15

  • Metric-AI/ColQwen2.5-3b-multilingual-v1.0

    Visual Document Retrieval • Updated Jul 28 • 718 • 9

  • google/paligemma2-3b-pt-448

    Image-Text-to-Text • 3B • Updated Dec 5, 2024 • 5.85k • 46

  • OpenGVLab/InternVL3-2B

    Image-Text-to-Text • 2B • Updated Sep 11 • 24k • 43

  • OpenGVLab/InternVL3-8B

    Image-Text-to-Text • 8B • Updated Sep 11 • 131k • 103

  • google/siglip-so400m-patch14-384

    Zero-Shot Image Classification • 0.9B • Updated Sep 26, 2024 • 6.4M • 634

  • google/siglip2-base-patch16-512

    Zero-Shot Image Classification • 0.4B • Updated Feb 21 • 150k • 32

  • nomic-ai/colnomic-embed-multimodal-3b

    Visual Document Retrieval • Updated Apr 15 • 2.63k • 33

  • vidore/colqwen2.5-v0.2

    Visual Document Retrieval • Updated Jun 16 • 54.8k • 91
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs