Model Card for Model ID

This project involves fine-tuning a BERT-based model (dslim/bert-large-NER) to perform Named Entity Recognition (NER) on mountain names in text.

The model has been trained to identify mentions of mountain names and differentiate them from other geographic entities or non-entities.

Features:

Fine-tuned on a custom dataset that includes sentences both with and without mountain names.

Uses focal loss to handle class imbalance, which ensures the model focuses on correctly classifying rare mountain names.

Token-level classification for identifying the B-MOUNTAIN, I-MOUNTAIN, and O (non-entity) labels.

Balances training between sentences with mountains (80%) and without mountains (20%).

Model Description

This is the model card of a 🤗 transformers model that has been pushed on the Hub.

Developed by: Oleksandr Kharytonov
Model type: BERT
Language(s) (NLP): Python
License: MIT
Finetuned from model [optional]: https://huggingface.co/dslim/bert-large-NER

Model Sources

Repository: https://github.com/Shah1st/mountain-ner

Direct Use

from transformers import AutoTokenizer, AutoModelForTokenClassification

tokenizer = AutoTokenizer.from_pretrained('./saved_model')
model = AutoModelForTokenClassification.from_pretrained('./saved_model')

Recommendations

Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model.

How to Get Started with the Model

Use the github below to get started with the model.

https://github.com/Shah1st/mountain-ner

Training Details

Training Data

"DFKI-SLT/few-nerd", "supervised"

Filter for sentences with 'fine_ner_tags' == 24 (mountains)

Evaluation

'eval_loss': 0.009154710918664932, 'eval_macro_f1': 0.8952192988290304, 'eval_accuracy': 0.9746226793108054

Testing Data, Factors & Metrics

Metrics

macro F1: 0.895 Accuracy: 0.974

Summary

This project involves fine-tuning a BERT-based model (dslim/bert-large-NER) to perform Named Entity Recognition (NER) on mountain names in text. The model has been trained to identify mentions of mountain names and differentiate them from other geographic entities or non-entities.

Downloads last month: 2

Safetensors

Model size

0.3B params

Tensor type

F32