namdp-ptit
/

ViRanker

Text Classification

text-embeddings-inference

Model card Files Files and versions

namdp-ptit commited on Aug 19, 2024

Commit

7fae5e8

·

verified ·

1 Parent(s): bc8d82d

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -116,6 +116,8 @@ Train data should be a json file, where each line is a dict like this:
 `query` is the query, and `pos` is a list of positive texts, `neg` is a list of negative texts. If you have no negative
 texts for a query, you can random sample some from the entire corpus as the negatives.
 ## Performance
 Below is a comparision table of the results we achieved compared to some other pre-trained Cross-Encoders on

 `query` is the query, and `pos` is a list of positive texts, `neg` is a list of negative texts. If you have no negative
 texts for a query, you can random sample some from the entire corpus as the negatives.
+Besides, for each query in the train data, we used LLMs to generate hard negative for them by asking LLMs to create a document that is the opposite one of the documents in 'pos'.
 ## Performance
 Below is a comparision table of the results we achieved compared to some other pre-trained Cross-Encoders on