Text Ranking
sentence-transformers
Safetensors
PyTorch
English
qwen3_vl
reranking
retrieval
rag
cross-encoder
qwen3-vl
Instructions to use Surpem/Supertron2-Reranker-8B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use Surpem/Supertron2-Reranker-8B with sentence-transformers:
from sentence_transformers import CrossEncoder model = CrossEncoder("Surpem/Supertron2-Reranker-8B") query = "Which planet is known as the Red Planet?" passages = [ "Venus is often called Earth's twin because of its similar size and proximity.", "Mars, known for its reddish appearance, is often referred to as the Red Planet.", "Jupiter, the largest planet in our solar system, has a prominent red spot.", "Saturn, famous for its rings, is sometimes mistaken for the Red Planet." ] scores = model.predict([(query, passage) for passage in passages]) print(scores) - Notebooks
- Google Colab
- Kaggle
File size: 3,495 Bytes
e946ffc c15ca2d e946ffc c15ca2d e946ffc c15ca2d e946ffc c15ca2d e946ffc c15ca2d e946ffc c15ca2d e946ffc c15ca2d e946ffc c15ca2d e946ffc c15ca2d e946ffc c15ca2d e946ffc c15ca2d e946ffc c15ca2d e946ffc c15ca2d | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 | ---
license: apache-2.0
language:
- en
base_model:
- Qwen/Qwen3-VL-Reranker-8B
pipeline_tag: text-ranking
library_name: sentence-transformers
tags:
- reranking
- retrieval
- rag
- cross-encoder
- qwen3-vl
- pytorch
---
# **Supertron2-Reranker-8B: A Compact Cross-Encoder Reranking Model**
## **Model Description**
**Supertron2-Reranker-8B** is a reranking model built on top of [Qwen/Qwen3-VL-Reranker-8B](https://huggingface.co/Qwen/Qwen3-VL-Reranker-8B). It is designed to score query-document pairs for retrieval pipelines, search systems, and RAG applications where a stronger second-stage ranker is useful.
* **Developed by:** Surpem
* **Model type:** Cross-Encoder Reranker
* **Architecture:** Qwen3-VL reranker, 8B parameters
* **License:** Apache 2.0
---
## **Capabilities**
### **Search Reranking**
Supertron2-Reranker-8B can compare a user query against candidate passages and assign relevance scores. It is intended as a second-stage reranker after a faster retriever has already selected candidate documents.
### **RAG Pipelines**
The model can help improve retrieval-augmented generation by pushing more relevant documents toward the top of the context window before answer generation.
### **Question-Document Matching**
Supertron2-Reranker-8B is useful for matching questions to passages, snippets, help-center articles, documentation chunks, and other text candidates.
### **Instruction-Aware Retrieval**
The model is prompted for relevance scoring, making it suitable for natural language search tasks where query intent matters.
---
## **Get Started**
```python
from sentence_transformers import CrossEncoder
model_id = "Surpem/Supertron2-Reranker-8B"
model = CrossEncoder(model_id)
pairs = [
("What is the capital of France?", "Paris is the capital and largest city of France."),
("What is the capital of France?", "Mars is often called the red planet."),
]
scores = model.predict(pairs)
print(scores)
```
Example reranking:
```python
query = "How do I reset my password?"
documents = [
"Use the account recovery page to reset your password.",
"Our refund policy allows returns within 30 days.",
"Two-factor authentication adds extra login security.",
]
results = model.rank(query, documents)
print(results)
```
---
## **Hardware Requirements**
| Precision | Min VRAM | Recommended |
|---|---|---|
| bfloat16 | 18 GB | 24 GB+ |
| 4-bit quantized | 6 GB | 10 GB+ |
For larger batches or long documents, use more VRAM or reduce the batch size/max sequence length.
---
## **Intended Use**
Supertron2-Reranker-8B is intended for:
* Search reranking
* RAG document reranking
* Query-passage relevance scoring
* Documentation and knowledge-base retrieval
* Evaluation of candidate retrieval results
It is not intended to be used as a standalone chat model.
---
## **Limitations**
* The model scores relevance; it does not generate answers.
* It should be evaluated on your own retrieval domain before production use.
* Long documents may need chunking before reranking.
* Relevance scores are relative and may not be calibrated across unrelated queries.
* The model may still rank incorrect, outdated, or unsafe content highly if it appears textually relevant.
---
## **Citation**
```bibtex
@misc{surpem2026supertron2-reranker-8b,
title={Supertron2-Reranker-8B -- Compact Cross-Encoder Reranking Model},
author={Surpem},
year={2026},
url={https://huggingface.co/Surpem/Supertron2-Reranker-8B},
}
```
|