hltcoe
/

RankVideo

@@ -1,46 +1,51 @@
 ---
-license: mit
 language:
 - en
 tags:
 - video
 - retrieval
 - reranking
 - qwen3-vl
-base_model: Qwen/Qwen3-VL-8B-Instruct
-pipeline_tag: video-text-to-text
 ---
 # RankVideo
-Video-native reasoning reranker for text-to-video retrieval. Fine-tuned from Qwen3-VL-8B-Instruct.
-## Reference
-- [arXiv:2602.02444](https://arxiv.org/abs/2602.02444)
-## Training Data
-This model was trained using the [MultiVENT 2.0 dataset](https://huggingface.co/datasets/hltcoe/MultiVENT2.0 ).
 ## Usage
-```
 from rankvideo import VLMReranker
 reranker = VLMReranker(model_path="hltcoe/RankVideo")
 scores = reranker.score_batch(
     queries=["person playing guitar"],
     video_paths=["/path/to/video.mp4"],
 )
 print(f"Relevance score: {scores[0]['logit_delta_yes_minus_no']:.3f}")
 ```
 ## BibTeX
 ```bibtex
 @misc{skow2026rankvideoreasoningrerankingtexttovideo,
       title={RANKVIDEO: Reasoning Reranking for Text-to-Video Retrieval},
@@ -50,4 +55,5 @@ print(f"Relevance score: {scores[0]['logit_delta_yes_minus_no']:.3f}")
       archivePrefix={arXiv},
       primaryClass={cs.IR},
       url={https://arxiv.org/abs/2602.02444},
-}

 ---
+base_model: Qwen/Qwen3-VL-8B-Instruct
 language:
 - en
+license: mit
+pipeline_tag: video-text-to-text
+library_name: transformers
+arxiv: 2602.02444
 tags:
 - video
 - retrieval
 - reranking
 - qwen3-vl
 ---
 # RankVideo
+RankVideo is a video-native reasoning reranker for text-to-video retrieval, fine-tuned from [Qwen3-VL-8B-Instruct](https://huggingface.co/Qwen/Qwen3-VL-8B-Instruct).
+The model explicitly reasons over query-video pairs using video content to assess relevance. It was introduced in the paper [RANKVIDEO: Reasoning Reranking for Text-to-Video Retrieval](https://huggingface.co/papers/2602.02444).
+- **Repository:** [https://github.com/tskow99/RANKVIDEO-Reasoning-Reranker](https://github.com/tskow99/RANKVIDEO-Reasoning-Reranker)
+- **Paper:** [RANKVIDEO: Reasoning Reranking for Text-to-Video Retrieval](https://arxiv.org/abs/2602.02444)
+## Training Data
+This model was trained using the [MultiVENT 2.0 dataset](https://huggingface.co/datasets/hltcoe/MultiVENT2.0).
 ## Usage
+You can use the model for scoring query-video pairs via the `rankvideo` library as follows:
+```python
 from rankvideo import VLMReranker
 reranker = VLMReranker(model_path="hltcoe/RankVideo")
+# Score query-video pairs for relevance
 scores = reranker.score_batch(
     queries=["person playing guitar"],
     video_paths=["/path/to/video.mp4"],
 )
 print(f"Relevance score: {scores[0]['logit_delta_yes_minus_no']:.3f}")
 ```
 ## BibTeX
 ```bibtex
 @misc{skow2026rankvideoreasoningrerankingtexttovideo,
       title={RANKVIDEO: Reasoning Reranking for Text-to-Video Retrieval},
       archivePrefix={arXiv},
       primaryClass={cs.IR},
       url={https://arxiv.org/abs/2602.02444},
+}
+```