Quantifying the Carbon Emissions of Machine Learning
Paper
•
1910.09700
•
Published
•
29
This model is a fine-tuned version of answerdotai/ModernBERT-base on the CoNLL-2003 Named Entity Recognition dataset.
This model can be used for Named Entity Recognition in English text, identifying the following entity types:
from transformers import AutoTokenizer, AutoModelForTokenClassification, pipeline
# Load model and tokenizer
tokenizer = AutoTokenizer.from_pretrained("joe-xhedi/ModernBERT-NER")
model = AutoModelForTokenClassification.from_pretrained("joe-xhedi/ModernBERT-NER")
# Create NER pipeline
ner_pipeline = pipeline("ner",
model=model,
tokenizer=tokenizer,
aggregation_strategy="simple")
# Example usage
text = "John Doe works at OpenAI in San Francisco."
results = ner_pipeline(text)
print(results)
The model was fine-tuned on the CoNLL-2003 Named Entity Recognition dataset, which contains:
The model was evaluated on the CoNLL-2003 test dataset.
The model was evaluated using seqeval metrics:
| Metric | Value |
|---|---|
| Accuracy | 0.9892527549550251 |
| F1 Score | 0.9363408521303258 |
| Precision | N/A |
| Recall | N/A |
The model recognizes the following entity types:
Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).
If you use this model, please cite:
@misc{ModernBERT-NER,
title={ModernBERT Fine-tuned for Named Entity Recognition},
author={Jayesh Thakare},
year={2025},
url={https://huggingface.co/joe-xhedi/ModernBERT-NER}
}
Base model
answerdotai/ModernBERT-base