Video-Text-to-Text
Transformers
Safetensors
English
qwen3_vl
image-text-to-text
video
retrieval
reranking
qwen3-vl
Instructions to use hltcoe/RankVideo with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use hltcoe/RankVideo with Transformers:
# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("hltcoe/RankVideo") model = AutoModelForImageTextToText.from_pretrained("hltcoe/RankVideo") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -18,6 +18,11 @@ Video-native reasoning reranker for text-to-video retrieval. Fine-tuned from Qwe
|
|
| 18 |
## Reference
|
| 19 |
- [arXiv:2602.02444](https://arxiv.org/abs/2602.02444)
|
| 20 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 21 |
|
| 22 |
## Usage
|
| 23 |
|
|
|
|
| 18 |
## Reference
|
| 19 |
- [arXiv:2602.02444](https://arxiv.org/abs/2602.02444)
|
| 20 |
|
| 21 |
+
## Training Data
|
| 22 |
+
|
| 23 |
+
This model was trained using the [MultiVENT 2.0 dataset](https://huggingface.co/datasets/hltcoe/MultiVENT2.0 ).
|
| 24 |
+
|
| 25 |
+
|
| 26 |
|
| 27 |
## Usage
|
| 28 |
|