Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
AHS-uni 's Collections
DocRAG Datasets
DocRAG Sample Datasets
DocRAG Baseline Models

DocRAG Baseline Models

updated May 30, 2025

Baseline VLMs used in DocRAG for either retrieval or generation.

Upvote
-

  • Qwen/Qwen2.5-VL-7B-Instruct

    Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 9.71M • 1.6k

  • Qwen/Qwen2.5-VL-7B-Instruct-AWQ

    Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 1.42M • 106

  • Metric-AI/ColQwen2.5-7b-multilingual-v1.0

    Visual Document Retrieval • Updated Jul 28, 2025 • 11 • 16

  • Metric-AI/ColQwen2.5-3b-multilingual-v1.0

    Visual Document Retrieval • Updated Jul 28, 2025 • 671 • 9

  • google/paligemma2-3b-pt-448

    Image-Text-to-Text • 3B • Updated Dec 5, 2024 • 57.7k • 50

  • OpenGVLab/InternVL3-2B

    Image-Text-to-Text • 2B • Updated Sep 11, 2025 • 51.7k • 46

  • OpenGVLab/InternVL3-8B

    Image-Text-to-Text • 8B • Updated Sep 11, 2025 • 70.7k • 106

  • google/siglip-so400m-patch14-384

    Zero-Shot Image Classification • 0.9B • Updated Sep 26, 2024 • 1.71M • 678

  • google/siglip2-base-patch16-512

    Zero-Shot Image Classification • 0.4B • Updated Feb 21, 2025 • 124k • 47

  • nomic-ai/colnomic-embed-multimodal-3b

    Visual Document Retrieval • Updated Apr 15, 2025 • 11.8k • 39

  • vidore/colqwen2.5-v0.2

    Visual Document Retrieval • Updated Jun 16, 2025 • 78.7k • 99
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs