Update example with infinity
#9
by
michaelfeil
- opened
README.md
CHANGED
|
@@ -103,6 +103,17 @@ with torch.no_grad():
|
|
| 103 |
scores = processor.score_multi_vector(querry_embeddings, image_embeddings)
|
| 104 |
```
|
| 105 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 106 |
## Limitations
|
| 107 |
|
| 108 |
- **Focus**: The model primarily focuses on PDF-type documents and high-ressources languages, potentially limiting its generalization to other document types or less represented languages.
|
|
|
|
| 103 |
scores = processor.score_multi_vector(querry_embeddings, image_embeddings)
|
| 104 |
```
|
| 105 |
|
| 106 |
+
## Infinity
|
| 107 |
+
|
| 108 |
+
Usage with docker and [Infinity](https://github.com/michaelfeil/infinity).
|
| 109 |
+
Infinity only works with the `-merged` weight variants of ColPali and ColQwen.
|
| 110 |
+
|
| 111 |
+
```bash
|
| 112 |
+
docker run --gpus all -v $PWD/data:/app/.cache -p "7997":"7997" \
|
| 113 |
+
michaelf34/infinity:0.0.69 \
|
| 114 |
+
v2 --model-id vidore/colpali-v1.2-merged --revision "cd80ee4200c591b788a9c4e21bb5d549d4a04637" --dtype bfloat16 --batch-size 8 --device cuda --engine torch --port 7997
|
| 115 |
+
```
|
| 116 |
+
|
| 117 |
## Limitations
|
| 118 |
|
| 119 |
- **Focus**: The model primarily focuses on PDF-type documents and high-ressources languages, potentially limiting its generalization to other document types or less represented languages.
|