vidore
/

colpali-v1.2

Visual Document Retrieval

vidore-experimental

Model card Files Files and versions

Update example with infinity

#9

by michaelfeil - opened Nov 15, 2024

base: refs/heads/main

←

from: refs/pr/9

Discussion Files changed

Files changed (1) hide show

README.md +11 -0

README.md CHANGED Viewed

@@ -103,6 +103,17 @@ with torch.no_grad():
 scores = processor.score_multi_vector(querry_embeddings, image_embeddings)
 ```
 ## Limitations
  - **Focus**: The model primarily focuses on PDF-type documents and high-ressources languages, potentially limiting its generalization to other document types or less represented languages.

 scores = processor.score_multi_vector(querry_embeddings, image_embeddings)
 ```
+## Infinity
+Usage with docker and [Infinity](https://github.com/michaelfeil/infinity).
+Infinity only works with the `-merged` weight variants of ColPali and ColQwen.
+```bash
+docker run --gpus all -v $PWD/data:/app/.cache -p "7997":"7997" \
+michaelf34/infinity:0.0.69 \
+v2 --model-id vidore/colpali-v1.2-merged --revision "cd80ee4200c591b788a9c4e21bb5d549d4a04637" --dtype bfloat16 --batch-size 8 --device cuda --engine torch --port 7997
+```
 ## Limitations
  - **Focus**: The model primarily focuses on PDF-type documents and high-ressources languages, potentially limiting its generalization to other document types or less represented languages.