Lizrek
/

bert-base-mountain-NER

Token Classification

named-entity-recognition

Model card Files Files and versions

bert-base-mountain-NER / README.md

Lizrek's picture

Update README.md

5717663 verified over 1 year ago

|

history blame contribute delete

2.44 kB

	---
	base_model:
	- dslim/bert-base-NER
	pipeline_tag: token-classification
	tags:
	- token-classification
	- pytorch
	- transformers
	- named-entity-recognition
	metrics:
	- seqeval
	---

	# bert-base-mountain-NER

	This model is a specialized adaptation of [dslim/bert-base-NER](https://huggingface.co/dslim/bert-base-NER), tailored for recognizing mountain names with a focus on geographical texts. Unlike the original, this model retains all 12 hidden layers and has been specifically fine-tuned to achieve high precision in identifying mountain-related entities across diverse texts.

	It is ideal for applications that involve extracting geographic information from travel literature, research documents, or any content related to natural landscapes.

	## Dataset

	The model was trained using approximately 115 samples generated specifically for mountain name recognition. These samples were created with the assistance of ChatGPT, focusing on realistic use cases for mountain-related content in the NER format.

	## How to Use

	You can easily integrate this model with the Transformers library's NER pipeline:

	```python
	import torch
	from transformers import AutoTokenizer, AutoModelForTokenClassification
	from transformers import pipeline

	device = "cuda" if torch.cuda.is_available() else "cpu"

	# Load model and tokenizer
	model_name = "Lizrek/bert-base-mountain-NER"
	tokenizer = AutoTokenizer.from_pretrained(model_name)
	model = AutoModelForTokenClassification.from_pretrained(model_name)

	# Create a pipeline for NER
	nlp = pipeline("ner", model=model, tokenizer=tokenizer)

	# Example usage
	example = "Mount Fuji in Japan are example of volcanic mountain.."
	ner_results = nlp(example)
	print(ner_results)
	```

	## Example Output

	For the above input, the model provides the following output:

	```python
	[{'entity': 'B-MOUNTAIN_NAME', 'score': np.float32(0.9827131), 'index': 1, 'word': 'Mount', 'start': 0, 'end': 5}, {'entity': 'I-MOUNTAIN_NAME', 'score': np.float32(0.98952174), 'index': 2, 'word': 'Fuji', 'start': 6, 'end': 10}]
	```

	This output highlights recognized mountain names, providing metadata such as entity type, confidence score, and word position.

	## Limitations

	- The model is specialized for mountain names and may not be effective in recognizing other types of geographical entities such as rivers or lakes.
	- If the input text is significantly different from the training data in style or terminology, accuracy may be affected.