Video-Text-to-Text
Transformers
Safetensors
English
qwen3_vl
image-text-to-text
video
retrieval
reranking
qwen3-vl
Instructions to use hltcoe/RankVideo with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use hltcoe/RankVideo with Transformers:
# Load model directly from transformers import AutoProcessor, AutoModelForMultimodalLM processor = AutoProcessor.from_pretrained("hltcoe/RankVideo") model = AutoModelForMultimodalLM.from_pretrained("hltcoe/RankVideo") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -11,6 +11,8 @@ tags:
|
|
| 11 |
- retrieval
|
| 12 |
- reranking
|
| 13 |
- qwen3-vl
|
|
|
|
|
|
|
| 14 |
---
|
| 15 |
|
| 16 |
# RankVideo
|
|
@@ -24,7 +26,7 @@ The model explicitly reasons over query-video pairs using video content to asses
|
|
| 24 |
|
| 25 |
## Training Data
|
| 26 |
|
| 27 |
-
This model was trained using the [MultiVENT 2.0 dataset](https://huggingface.co/datasets/hltcoe/MultiVENT2.0).
|
| 28 |
|
| 29 |
## Usage
|
| 30 |
|
|
|
|
| 11 |
- retrieval
|
| 12 |
- reranking
|
| 13 |
- qwen3-vl
|
| 14 |
+
datasets:
|
| 15 |
+
- hltcoe/RankVideo-Dataset
|
| 16 |
---
|
| 17 |
|
| 18 |
# RankVideo
|
|
|
|
| 26 |
|
| 27 |
## Training Data
|
| 28 |
|
| 29 |
+
This model was trained using the [MultiVENT 2.0 dataset](https://huggingface.co/datasets/hltcoe/MultiVENT2.0) and [RankVideo-Dataset](https://huggingface.co/datasets/hltcoe/RankVideo-Dataset).
|
| 30 |
|
| 31 |
## Usage
|
| 32 |
|