AHS-uni's Collections

DocRAG Baseline Models

Updated May 30, 2025

Baseline VLMs used in DocRAG for either retrieval or generation.

  • Qwen/Qwen2.5-VL-7B-Instruct

    Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 4.36M • 1.49k

  • Qwen/Qwen2.5-VL-7B-Instruct-AWQ

    Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 304k • 99

  • Metric-AI/ColQwen2.5-7b-multilingual-v1.0

    Visual Document Retrieval • Updated Jul 28, 2025 • 21k • 16

  • Metric-AI/ColQwen2.5-3b-multilingual-v1.0

    Visual Document Retrieval • Updated Jul 28, 2025 • 1.43k • 9

  • google/paligemma2-3b-pt-448

    Image-Text-to-Text • 3B • Updated Dec 5, 2024 • 45.9k • 47

  • OpenGVLab/InternVL3-2B

    Image-Text-to-Text • Updated Sep 11, 2025 • 39.8k • 45

  • OpenGVLab/InternVL3-8B

    Image-Text-to-Text • Updated Sep 11, 2025 • 124k • 103

  • google/siglip-so400m-patch14-384

    Zero-Shot Image Classification • 0.9B • Updated Sep 26, 2024 • 2.01M • 667

  • google/siglip2-base-patch16-512

    Zero-Shot Image Classification • 0.4B • Updated Feb 21, 2025 • 147k • 37

  • nomic-ai/colnomic-embed-multimodal-3b

    Visual Document Retrieval • Updated Apr 15, 2025 • 1.77k • 37

  • vidore/colqwen2.5-v0.2

    Visual Document Retrieval • Updated Jun 16, 2025 • 74.8k • 98