Job Title NER

Fine-tuned from dslim/bert-base-NER to extract a single job title span from a job description.

Label	Meaning
`B-JOB_TITLE`	First token of the job title
`I-JOB_TITLE`	Continuation token of the job title
`O`	Not part of a job title

The model returns an empty sequence when no job title is detectable in the text.

Usage

from transformers import pipeline

pipe = pipeline(
    "ner",
    model="AP678/job-title-ner",
    aggregation_strategy="simple",
)

text = """
We are seeking a Senior Data Engineer to join our platform team.
You will design and maintain our data pipelines at scale.
"""

results = pipe(text)
job_titles = [r for r in results if r["entity_group"] == "JOB_TITLE"]
print(job_titles)
# [{ 'entity_group': 'JOB_TITLE', 'score': 0.99, 'word': 'Senior Data Engineer', ... }]

Training

Base model: dslim/bert-base-NER
Dataset: ~137K job descriptions scraped from job boards
Positives: up to 30 000 examples with a labeled title span
Negatives: up to 10 000 examples with all-O labels (no title found)
Label scheme: BIO — O / B-JOB_TITLE / I-JOB_TITLE
Max sequence length: 384 tokens

Downloads last month: 2

Safetensors

Model size

0.1B params

Tensor type

F32

Model tree for AP678/job-title-ner

Base model

dslim/bert-base-NER

Finetuned

(38)

this model