Text Ranking
sentence-transformers
Safetensors
Vietnamese
xlm-roberta
cross-encoder
Generated from Trainer
dataset_size:7806
loss:MultipleNegativesRankingLoss
text-embeddings-inference
Instructions to use KhoaUIT/Halong-CrossEncoder-UIT-R2GQA with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use KhoaUIT/Halong-CrossEncoder-UIT-R2GQA with sentence-transformers:
from sentence_transformers import CrossEncoder model = CrossEncoder("KhoaUIT/Halong-CrossEncoder-UIT-R2GQA") query = "Which planet is known as the Red Planet?" passages = [ "Venus is often called Earth's twin because of its similar size and proximity.", "Mars, known for its reddish appearance, is often referred to as the Red Planet.", "Jupiter, the largest planet in our solar system, has a prominent red spot.", "Saturn, famous for its rings, is sometimes mistaken for the Red Planet." ] scores = model.predict([(query, passage) for passage in passages]) print(scores) - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -118,6 +118,8 @@ You can finetune this model on your own dataset.
|
|
| 118 |
|
| 119 |
#### Unnamed Dataset
|
| 120 |
|
|
|
|
|
|
|
| 121 |
* Size: 7,806 training samples
|
| 122 |
* Columns: <code>question</code>, <code>context</code>, and <code>negative</code>
|
| 123 |
* Approximate statistics based on the first 1000 samples:
|
|
@@ -143,8 +145,6 @@ You can finetune this model on your own dataset.
|
|
| 143 |
### Evaluation Dataset
|
| 144 |
|
| 145 |
- model_id: KhoaUIT/KhoaUIT-CrossEncoder-UIT-R2GQA
|
| 146 |
-
- dataset:
|
| 147 |
-
name: Custom UIT-R2GQA using function [`mine_hard_negatives`](https://sbert.net/docs/package_reference/util.html#sentence_transformers.util.mine_hard_negatives)
|
| 148 |
- evaluator: [`CrossEncoderRerankingEvaluator`](https://sbert.net/docs/package_reference/cross_encoder/evaluation.html#crossencoderrerankingevaluator)
|
| 149 |
- valid set:
|
| 150 |
| map | mrr@10 | ndcg@10 |
|
|
@@ -158,6 +158,8 @@ You can finetune this model on your own dataset.
|
|
| 158 |
|
| 159 |
#### Unnamed Dataset
|
| 160 |
|
|
|
|
|
|
|
| 161 |
* Size: 974 evaluation samples
|
| 162 |
* Columns: <code>question</code>, <code>context</code>, and <code>negative</code>
|
| 163 |
* Approximate statistics based on the first 974 samples:
|
|
|
|
| 118 |
|
| 119 |
#### Unnamed Dataset
|
| 120 |
|
| 121 |
+
Customize original dataset UIT-R2GQA using function [`mine_hard_negatives`](https://sbert.net/docs/package_reference/util.html#sentence_transformers.util.mine_hard_negatives)
|
| 122 |
+
|
| 123 |
* Size: 7,806 training samples
|
| 124 |
* Columns: <code>question</code>, <code>context</code>, and <code>negative</code>
|
| 125 |
* Approximate statistics based on the first 1000 samples:
|
|
|
|
| 145 |
### Evaluation Dataset
|
| 146 |
|
| 147 |
- model_id: KhoaUIT/KhoaUIT-CrossEncoder-UIT-R2GQA
|
|
|
|
|
|
|
| 148 |
- evaluator: [`CrossEncoderRerankingEvaluator`](https://sbert.net/docs/package_reference/cross_encoder/evaluation.html#crossencoderrerankingevaluator)
|
| 149 |
- valid set:
|
| 150 |
| map | mrr@10 | ndcg@10 |
|
|
|
|
| 158 |
|
| 159 |
#### Unnamed Dataset
|
| 160 |
|
| 161 |
+
Customize original dataset UIT-R2GQA using function [`mine_hard_negatives`](https://sbert.net/docs/package_reference/util.html#sentence_transformers.util.mine_hard_negatives)
|
| 162 |
+
|
| 163 |
* Size: 974 evaluation samples
|
| 164 |
* Columns: <code>question</code>, <code>context</code>, and <code>negative</code>
|
| 165 |
* Approximate statistics based on the first 974 samples:
|