DeBERTa-v3-large Fine-tuned on WiC

Cross-encoder model for the Word-in-Context (WiC) binary sense disambiguation task. Both sentences are fed together so the model can attend across them simultaneously.

Target Word Marking

The target word is wrapped with <tgt>word</tgt> in both sentences before encoding.

Performance

Split Accuracy
Validation 0.7524
Test 0.7264

Usage

from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

tokenizer = AutoTokenizer.from_pretrained("Deehan1866/finetuned-deberta-wic")
model = AutoModelForSequenceClassification.from_pretrained("Deehan1866/finetuned-deberta-wic")

word = "bank"
s1 = f"The <tgt>{word}</tgt> raised its interest rates."
s2 = f"She visited her local <tgt>{word}</tgt> to deposit a cheque."

enc = tokenizer(s1, s2, return_tensors="pt", truncation=True, max_length=256)
with torch.no_grad():
    logits = model(**enc).logits
pred = torch.argmax(logits).item()
print("Same sense" if pred == 1 else "Different sense")
Downloads last month
37
Safetensors
Model size
0.4B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train Deehan1866/finetuned-deberta-wic