nyu-mll/glue
Viewer • Updated • 1.49M • 480k • 498
How to use cnut1648/biolinkbert-large-mnli-resampled with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-classification", model="cnut1648/biolinkbert-large-mnli-resampled") # Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("cnut1648/biolinkbert-large-mnli-resampled")
model = AutoModelForSequenceClassification.from_pretrained("cnut1648/biolinkbert-large-mnli-resampled")This model is a fine-tuned version of BioLinkBERT-large on the
resampled version of GLUE MNLI, i.e. cnut1648/mnli_resampled_as_mednli.
The performance is reported below:
| Model | Dataset | Acc |
|---|---|---|
| Roberta-large-mnli | MNLI dev mm | 90.12 |
| MNLI dev m | 90.59 | |
| SNLI test | 88.25 | |
| BioLinkBERT-large | MNLI dev mm | 33.56 |
| MNLI dev m | 33.18 | |
| SNLI test | 32.66 | |
| BioLinkBERT-large-mnli-snli | MNLI dev mm | 85.75 |
| MNLI dev m | 85.30 | |
| SNLI test | 89.82 | |
| BioLinkBERT-large-mnli-resampled | MNLI dev mm | 80.22 |
| MNLI dev m | 78.07 | |
| SNLI test | 71.33 |
Compared to cnut1648/biolinkbert-large-mnli-snli, this checkpoint is never trained on SNLI.
The labels are "0": "entailment", "1": "neutral", "2": "contradiction"
We follow the same training procedure as cnut1648/biolinkbert-large-mnli-snli.
The following hyperparameters were used during training: