How to use SoFairOA/sofair-modernBERT-base-filter with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-classification", model="SoFairOA/sofair-modernBERT-base-filter")

# Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("SoFairOA/sofair-modernBERT-base-filter")
model = AutoModelForSequenceClassification.from_pretrained("SoFairOA/sofair-modernBERT-base-filter")

Fine-tuned ModernBERT to identify candidate documents for software mention extraction.
It was trained on SoFairOA/sofair_softcite_somesci (sofair_softcite_somesci_documents) to classify whether the given document contains at least one annotation.
We created https://github.com/SoFairOA/filter, a simple command-line tool to use this model for processing a collection of documents.
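For readers who prefer to stay in Python, the following is a minimal sketch of filtering a collection with the pipeline shown above. It is not the code of the filter tool. The helper names (`load_classifier`, `filter_documents`), the 0.5 threshold, and the positive label name `"LABEL_1"` are assumptions made for illustration; check `pipe.model.config.id2label` for the label names the model actually uses.

```python
from typing import Callable

# A classifier maps a batch of texts to one {"label", "score"} dict per text,
# which is the shape a Transformers text-classification pipeline returns.
Classifier = Callable[[list[str]], list[dict]]


def load_classifier() -> Classifier:
    """Load the model lazily so filter_documents can be tried without it."""
    from transformers import pipeline

    pipe = pipeline(
        "text-classification",
        model="SoFairOA/sofair-modernBERT-base-filter",
    )
    # truncation=True cuts inputs longer than the model's maximum length.
    return lambda texts: pipe(texts, truncation=True)


def filter_documents(
    docs: dict[str, str],
    classify: Classifier,
    positive_label: str = "LABEL_1",  # assumption: verify against id2label
    threshold: float = 0.5,           # assumption: illustrative cut-off
) -> list[str]:
    """Return the ids of documents predicted to be extraction candidates."""
    ids = list(docs)
    predictions = classify([docs[doc_id] for doc_id in ids])
    return [
        doc_id
        for doc_id, pred in zip(ids, predictions)
        if pred["label"] == positive_label and pred["score"] >= threshold
    ]
```

A call such as `filter_documents(my_docs, load_classifier())` would then return the ids of the documents worth passing on to software mention extraction, where `my_docs` is a hypothetical mapping from document id to full text.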
We evaluated this model on the test set of the SoFairOA/sofair_softcite_somesci (sofair_softcite_somesci_documents) dataset:
| metric | value |
|---|---|
| precision | 0.8625730994152047 |
| recall | 0.9104938271604939 |
| f1 | 0.8858858858858859 |
| accuracy | 0.9268527430221367 |
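As a quick consistency check on these figures, F1 is the harmonic mean of precision and recall, so it can be recomputed from the two values reported above. The variable names in this sketch are illustrative only.

```python
# Values copied from the evaluation results above.
precision = 0.8625730994152047
recall = 0.9104938271604939
reported_f1 = 0.8858858858858859

# F1 is the harmonic mean of precision and recall.
f1 = 2 * precision * recall / (precision + recall)

print(f"recomputed f1 = {f1:.6f}, reported f1 = {reported_f1:.6f}")
```

The recomputed value agrees with the reported F1 to within floating-point precision.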
Base model: answerdotai/ModernBERT-base