quora-competitions/quora
Updated โข 1.48k โข 23
How to use navteca/quora-roberta-base with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-classification", model="navteca/quora-roberta-base") # Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("navteca/quora-roberta-base")
model = AutoModelForSequenceClassification.from_pretrained("navteca/quora-roberta-base")This model was trained using SentenceTransformers Cross-Encoder class.
This model uses roberta-base.
This model was trained on the Quora Duplicate Questions dataset.
The model will predict a score between 0 and 1: How likely the two given questions are duplicates.
Note: The model is not suitable to estimate the similarity of questions, e.g. the two questions "How to learn Java" and "How to learn Python" will result in a rahter low score, as these are not duplicates.
The trained model can be used like this:
from sentence_transformers import CrossEncoder
model = CrossEncoder('model_name')
scores = model.predict([('Question 1', 'Question 2'), ('Question 3', 'Question 4')])
print(scores)