Update README.md
README.md
CHANGED
|
@@ -10,20 +10,42 @@ tags:
|
|
| 10 |
- BERT
|
| 11 |
---
|
| 12 |
# Computing Skill NER
|
| 13 |
-
|
| 14 |
|
| 15 |
|
| 16 |
-
## How to Use
|
| 17 |
-
You can employ this model using the Transformers library's *pipeline* for NER, or incorporate it as a conventional Transformer in the HuggingFace ecosystem.
|
| 18 |
|
| 19 |
```python
|
| 20 |
-
#
|
| 21 |
-
from transformers import
|
| 22 |
|
| 23 |
-
# Load pre-trained BERT model and tokenizer
|
| 24 |
-
tokenizer = BertTokenizer.from_pretrained('Nucha/Nucha_SkillNER_BERT')
|
| 25 |
-
model = BertForTokenClassification.from_pretrained('Nucha/Nucha_SkillNER_BERT', num_labels=5)
|
| 26 |
```
|
| 27 |
|
| 28 |
## Demo
|
| 29 |
https://huggingface.co/spaces/Nucha/NuchaSkillNER
|
| 10 |
- BERT
|
| 11 |
---
|
| 12 |
# Computing Skill NER
|
| 13 |
+
Nucha_SkillNER_BERT is a Named Entity Recognition (NER) model fine-tuned to recognize skill-related entities in text, covering both hard and soft skills. It is built on a BERT-based architecture, which lets it use surrounding context when extracting skill mentions. It is particularly useful for analyzing job descriptions, resumes, or any other text where skills are explicitly mentioned.
|
| 14 |
|
| 15 |
+
The model recognizes multiple skill categories, including technical skills (e.g., programming languages, software tools) and soft skills (e.g., communication, leadership). It is suited to applications in recruitment, talent management, and skill-based data analysis.
|
| 16 |
|
| 17 |
|
| 18 |
+
## How to Use
|
| 19 |
+
You can use the Nucha/Nucha_SkillNER_BERT model for Named Entity Recognition (NER) by loading it from the Hugging Face Hub with the transformers library. The example below uses the pipeline API for entity extraction.
|
| 20 |
+
### Step-by-Step Example:
|
| 21 |
```python
|
| 22 |
+
# Library imports
|
| 23 |
+
from transformers import AutoTokenizer, AutoModelForTokenClassification, pipeline
|
| 24 |
+
|
| 25 |
+
# Load the pre-trained model and tokenizer
|
| 26 |
+
model_name = "Nucha/Nucha_SkillNER_BERT"
|
| 27 |
+
tokenizer = AutoTokenizer.from_pretrained(model_name)
|
| 28 |
+
model = AutoModelForTokenClassification.from_pretrained(model_name)
|
| 29 |
+
|
| 30 |
+
# Create a NER pipeline
|
| 31 |
+
ner_pipeline = pipeline("ner", model=model, tokenizer=tokenizer, aggregation_strategy="simple")
|
| 32 |
+
|
| 33 |
+
# Sample text
|
| 34 |
+
text = "I have experience in Python, JavaScript, and cloud technologies like AWS and Azure."
|
| 35 |
+
|
| 36 |
+
# Run the pipeline on the text
|
| 37 |
+
ner_results = ner_pipeline(text)
|
| 38 |
+
|
| 39 |
+
# Display the results
|
| 40 |
+
for entity in ner_results:
|
| 41 |
+
print(f"Entity: {entity['word']}, Label: {entity['entity_group']}, Score: {entity['score']:.4f}")
|
| 42 |
+
|
| 43 |
|
| 44 |
```
|
| 45 |
+
### Output Explanation:
|
| 46 |
+
Entity: The word or phrase in the text that the model assigned to one of its recognized categories.
|
| 47 |
+
Label: The classification label assigned to the entity, such as SKILL or TECHNOLOGY.
|
| 48 |
+
Score: The model's confidence in the identified entity, a floating-point number between 0 and 1.
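
The fields described above can be post-processed into a per-label list of skills. The following is a minimal sketch, not part of the model card's own code: `group_entities`, `make_entity`, the `min_score` threshold of 0.5, and the sample labels and scores are all illustrative assumptions. It relies only on the `entity_group`, `score`, `start`, and `end` keys that the pipeline returns when `aggregation_strategy="simple"` is set.

```python
from collections import defaultdict


def group_entities(text, ner_results, min_score=0.5):
    """Group entity spans by label, dropping low-confidence predictions."""
    grouped = defaultdict(list)
    for entity in ner_results:
        if entity["score"] < min_score:
            continue
        # Slice the original text by character offsets so the original
        # casing is kept, instead of relying on the 'word' field.
        span = text[entity["start"]:entity["end"]]
        if span not in grouped[entity["entity_group"]]:
            grouped[entity["entity_group"]].append(span)
    return dict(grouped)


text = "I have experience in Python, JavaScript, and cloud technologies like AWS and Azure."


def make_entity(span, label, score):
    """Build a hand-written stand-in for one pipeline result (hypothetical)."""
    start = text.index(span)
    return {
        "entity_group": label,
        "score": score,
        "word": span.lower(),
        "start": start,
        "end": start + len(span),
    }


# Illustrative values only: these are NOT real predictions from
# Nucha/Nucha_SkillNER_BERT. In practice, pass ner_pipeline(text) instead.
sample_results = [
    make_entity("Python", "SKILL", 0.98),
    make_entity("JavaScript", "SKILL", 0.97),
    make_entity("cloud", "TECHNOLOGY", 0.31),
    make_entity("AWS", "TECHNOLOGY", 0.95),
    make_entity("Azure", "TECHNOLOGY", 0.94),
]

skills_by_label = group_entities(text, sample_results, min_score=0.5)
print(skills_by_label)
```

With real predictions, replace `sample_results` with `ner_results` from the example above. The threshold is a judgment call and may need tuning for your data.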
|
| 49 |
|
| 50 |
## Demo
|
| 51 |
https://huggingface.co/spaces/Nucha/NuchaSkillNER
|