modernbert-job-role-matcher

This is a sentence-transformers model finetuned from nomic-ai/modernbert-embed-base. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

Model Type: Sentence Transformer
Base model: nomic-ai/modernbert-embed-base
Maximum Sequence Length: 8192 tokens
Output Dimensionality: 768 dimensions
Similarity Function: Cosine Similarity
Language: en
License: apache-2.0

Model Sources

Documentation: Sentence Transformers Documentation
Repository: Sentence Transformers on GitHub
Hugging Face: Sentence Transformers on Hugging Face

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 8192, 'do_lower_case': False}) with Transformer model: ModernBertModel 
  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
  (2): Normalize()
)
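The last two modules above (mean-token pooling followed by `Normalize()`) can be illustrated with a small pure-Python sketch. This is not the library's implementation: the function names `mean_pool` and `l2_normalize` and the 4-dimensional toy vectors are assumptions made for illustration, and the real model operates on 768-dimensional token embeddings.

```python
import math

def mean_pool(token_embeddings, attention_mask):
    """Average the token vectors, skipping positions whose mask is 0 (padding)."""
    dim = len(token_embeddings[0])
    total = [0.0] * dim
    count = 0
    for vec, keep in zip(token_embeddings, attention_mask):
        if keep:
            count += 1
            for i, value in enumerate(vec):
                total[i] += value
    count = max(count, 1)  # guard against an all-padding input
    return [value / count for value in total]

def l2_normalize(vec):
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else list(vec)

# Toy input: 3 token positions, 4 dimensions; the last position is padding.
tokens = [[1.0, 2.0, 0.0, 0.0], [3.0, 0.0, 0.0, 4.0], [9.0, 9.0, 9.0, 9.0]]
mask = [1, 1, 0]

pooled = mean_pool(tokens, mask)     # average of the two unmasked vectors
embedding = l2_normalize(pooled)     # unit-length sentence embedding
print(pooled, embedding)
```

Because the output is unit length, the dot product of two such embeddings equals their cosine similarity.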

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("dpravinv/modernbert-job-role-matcher")
# Run inference
sentences = [
    'Senior Aerospace Quality Control Engineer',
    "The Senior NDT Level 3 Engineer (Aircraft Maintenance) manages non-destructive testing (NDT) operations for assessing the quality of aircraft structures. He/She establishes new NDT techniques and qualifies new procedures. He drives compliance of all NDT inspections with the requirements of customers, original equipment manufacturer (OEM) and EN 4179, NAS 410, NADCAP as appropriate. He drives collaboration with workshops and engineering teams for failure investigations and recommends engineering solutions for structural flaws and defects. He develops special process control plans and manages equipment maintenance and operator certification programmes. He also monitors results of NDT for trends and corrective actions, and leads technical audits to ensure compliance with relevant standards and NDT requirements. He reviews compliance with airworthiness and legislative requirements, while proposing enhancements to the organisation's standard operating procedures (SOPs), and safety, health and quality systems. He proactively contributes to the development of lean and sustainability practices, and conducts research and digital innovation in NDT for continuous process improvements. As a team leader, he appraises staff performance and conducts coaching and training for level 1 and level 2 NDT personnel. He is able to work cross-functionally, employing critical reasoning, analytical thinking and problem-solving skills to identify deviations and mitigate potential quality risks in aircraft maintenance processes.",
    'The Automation Coordinator/Robot Coordinator oversees automated equipment and robots used in manufacturing processes. He/She is the primary responder, responsible for troubleshooting automated production systems and performing preventive and predictive maintenance on equipment. He also contributes to process optimisation by managing data from automated manufacturing systems to facilitate real-time insight gathering and decision-making.\n\nHe may be required to work on rotating shifts in a factory setting, and under strict compliance to workplace safety and health requirements, organisational quality control and other parameters.\n\nHe is able to work independently, and as part of a team, to achieve production and quality targets, and interact effectively with others to ensure that all issues are resolved appropriately and efficiently.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# (3, 768)

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# torch.Size([3, 3])
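For job-role matching, the first row of the similarity matrix above holds the scores of the job title against each description, and the descriptions can be ranked by that score. The sketch below shows the ranking step in pure Python. The helper names `cosine` and `rank_candidates` and the 3-dimensional vectors are hypothetical stand-ins for the model's 768-dimensional embeddings, not real model output.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def rank_candidates(query, candidates):
    """Return (index, score) pairs sorted from most to least similar."""
    scores = [(i, cosine(query, c)) for i, c in enumerate(candidates)]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)

# Hypothetical stand-ins for the title and description embeddings.
title_vec = [1.0, 0.0, 0.0]
description_vecs = [
    [0.0, 1.0, 0.0],  # stand-in for an unrelated role description
    [0.9, 0.1, 0.0],  # stand-in for a closely related role description
]

ranking = rank_candidates(title_vec, description_vecs)
best_index, best_score = ranking[0]
print(best_index, round(best_score, 4))
```

With real embeddings, `title_vec` would be `embeddings[0]` and `description_vecs` would be `embeddings[1:]` from the snippet above.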

Evaluation

Metrics

Triplet

Evaluated with TripletEvaluator

Metric	Value
cosine_accuracy	0.98
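As a rough illustration of what this metric measures, the sketch below counts a triplet as correct when the anchor is more similar to the positive than to the negative under cosine similarity, then reports the fraction of correct triplets. It is a simplified sketch with hypothetical 2-dimensional vectors, not the `TripletEvaluator` source. On the 200 evaluation triplets, an accuracy of 0.98 would correspond to 196 correctly ordered triplets.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))

def triplet_cosine_accuracy(triplets):
    """Fraction of (anchor, positive, negative) triplets ranked correctly."""
    correct = sum(1 for a, p, n in triplets if cosine(a, p) > cosine(a, n))
    return correct / len(triplets)

# Hypothetical toy triplets; the last one is deliberately ranked wrongly.
triplets = [
    ([1.0, 0.0], [1.0, 0.1], [0.0, 1.0]),
    ([0.0, 1.0], [0.2, 1.0], [1.0, 0.0]),
    ([1.0, 1.0], [1.0, 0.9], [-1.0, -1.0]),
    ([1.0, 0.0], [0.0, 1.0], [1.0, 0.2]),
]

accuracy = triplet_cosine_accuracy(triplets)
print(accuracy)
```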

Training Details

Training Dataset

Unnamed Dataset

Size: 800 training samples
Columns: anchor, positive, and negative

Approximate statistics based on the first 800 samples:

	anchor	positive	negative
type	string	string	string
details	min: 4 tokens mean: 6.82 tokens max: 14 tokens	min: 73 tokens mean: 180.48 tokens max: 380 tokens	min: 73 tokens mean: 181.7 tokens max: 380 tokens

Samples:

anchor	positive	negative
`Port Vessel Navigator`	The Helmsman manoeuvres and handles boats or crafts operating within the Port Limit of Singapore Territorial Waters. He/She is able to use the craft's navigational, fire-fighting and safety equipment and appreciate weather conditions, tides and tidal currents. He also performs basic chartwork, monitors and anticipates potential problems that may arise during daily operations and alerts the relevant authorities to them. He must pass a colour vision test and fulfil the requirements of the Port Limit Helmsman Licence issued by the Maritime and Port Authority of Singapore (MPA).	The Associate Counsellor assists in providing counselling services and support to individuals and families experiencing socioemotional and mental health challenges. This includes case management for cases of low complexity and risk and provision of clinical services through various modes of counselling such as face-to-face or online counselling and group work.He/She may also support department research through data collection and coordinate internal and external training/programmes. A patient and compassionate professional, the Associate Counsellor works in diverse settings across social services, including care homes, educational institutes, family service centres and healthcare facilities. He works under supervision as part of a collaborative team.
`Room Reservations Manager`	The Reservations Executive/Reservations Supervisor is responsible for supervising the operations of the department in selling rooms and managing room inventory to maximise sales. He/She ensures that all guest requests, concerns and feedback relating to rooms reservations are addressed in a timely and professional manner and collaborates with relevant departments on booking requirements and special guest requests to provide a seamless guest experience. He performs checks to ensure the accuracy of reservation bookings and records, keeps track of room availability and inventory, monitors room sales and occupancy levels and analyses reservations forecast to maximise the property's occupancy potential. He assists to meet monthly revenue targets by identifying new contacts and proposing promotional packages to increase room sales and revenue. He is also responsible for monitoring the team's compliance with the property's policies and procedures for reservations operations. He guides and coa...	The Membership Director/Assistant Director assumes overall responsibility in driving member attraction, recruitment and retention. He/she develops membership development and engagement strategies with the support of member research. He works with multiple stakeholders to advocate for member needs and interests to the relevant government agencies. He also oversees the execution and delivery of membership activities and events. The Membership Director/Assistant Director is highly driven, detail-oriented and strategic in handling all aspects of member relations. He is articulate and has excellent communication and people management skills to develop and maintain strong relationships among various stakeholders. He is able to multi-task and rally his team to deliver excellent membership experiences.
`Quality Assurance Engineer`	The Quality Engineer identifies user requirements and expectations to inform quality standards for end-products, and analyses product development processes to identify relevant quality standards. He/She incorporates relevant and suitable international standards into product development processes, quality standards and testing processes. He identifies quality-testing types and variations based on business needs and requirements and develops testing processes. He identifies suitable measures of quality for testing and contributes to the development of test scenarios and plans. He conducts various quality tests, and analyses data to identify operating and usage conditions in which performance of quality measures starts to decline. He also automates quality testing for applicable and suitable tests. He works in a team setting and is proficient in programming languages required by the organisation. He is familiar with international quality standards, and uses test automation frameworks and...	The Industry Development Director/Assistant Director plays a key role in collaborating with key government agencies and other organisations to drive industry development and transformation. He/she collaborates with multiple stakeholders to represent the industry needs and interests at relevant platforms. He drives industry innovation and adoption of technology, and oversees the execution and delivery of local industry projects and initiatives. He also builds effective relationships with strategic partners and stakeholders to identify growth opportunities for the industry. The Industry Development Director/Assistant Director is an effective communicator and presenter, able to develop strong working relationships with relevant stakeholders and strategic partners. He has good listening skills and is analytical and professional in addressing the concerns of the industry. He is forward-looking, able to set out a clear strategic direction and inspire the team towards achieving desired outcome...

Loss: TripletLoss with these parameters:

{
    "distance_metric": "TripletDistanceMetric.EUCLIDEAN",
    "triplet_margin": 5
}
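With these parameters, the per-triplet loss is commonly written as max(d(anchor, positive) - d(anchor, negative) + margin, 0), where d is the Euclidean distance. The sketch below evaluates that expression in pure Python. The function names and toy vectors are assumptions for illustration and are not taken from the training code.

```python
import math

def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def triplet_loss(anchor, positive, negative, margin=5.0):
    """Hinge on the gap between positive and negative distances."""
    return max(euclidean(anchor, positive) - euclidean(anchor, negative) + margin, 0.0)

# Unit-length toy vectors in the most favourable arrangement:
# the positive coincides with the anchor, the negative points the opposite way.
anchor = [1.0, 0.0]
positive = [1.0, 0.0]    # distance 0 from the anchor
negative = [-1.0, 0.0]   # distance 2 from the anchor

best_case = triplet_loss(anchor, positive, negative)
print(best_case)
```

One consequence worth noting, assuming the `Normalize()` module is active during training: distances between unit-length embeddings lie between 0 and 2, so with a margin of 5 the per-triplet loss cannot fall below 3. That would be consistent with the logged losses of roughly 4.2 to 4.4.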

Evaluation Dataset

Unnamed Dataset

Size: 200 evaluation samples
Columns: anchor, positive, and negative

Approximate statistics based on the first 200 samples:

	anchor	positive	negative
type	string	string	string
details	min: 4 tokens mean: 6.95 tokens max: 12 tokens	min: 73 tokens mean: 178.67 tokens max: 313 tokens	min: 73 tokens mean: 173.81 tokens max: 380 tokens

Samples:

anchor	positive	negative
`Room Sales Coordinator`	The Reservations Executive/Reservations Supervisor is responsible for supervising the operations of the department in selling rooms and managing room inventory to maximise sales. He/She ensures that all guest requests, concerns and feedback relating to rooms reservations are addressed in a timely and professional manner and collaborates with relevant departments on booking requirements and special guest requests to provide a seamless guest experience. He performs checks to ensure the accuracy of reservation bookings and records, keeps track of room availability and inventory, monitors room sales and occupancy levels and analyses reservations forecast to maximise the property's occupancy potential. He assists to meet monthly revenue targets by identifying new contacts and proposing promotional packages to increase room sales and revenue. He is also responsible for monitoring the team's compliance with the property's policies and procedures for reservations operations. He guides and coa...	The Technician supports the team to perform routine bus servicing and preventive corrective maintenance activities. His/Her duties include preparation of work activities, perform assigned servicing and maintenance tasks of different bus sub-systems, perform general housekeeping of workshop tools and equipment as well as adheres to Workplace Safety and Health (WSH) procedures. He may be deployed to support on-the-road bus breakdown assistance and recovery. He is required to work in the bus workshop and/or depot environment based on rotating shifts. He is able to interact effectively with others when carrying out his duties and has the opportunity to gain experience, knowledge as well as deepen his technical and maintenance skills on various bus sub-systems.
`Data Compliance Officer`	The Data Protection Officer executes data governance policies and procedures. He/She ensures the Data Protection Act is implemented and enforced in the organisation, and amongst the respective teams and users. He collaborates with business and project teams in projects and ensures alignment and compliance with the organisation's data protection guidelines and policies, and with industry standards and guidelines. He also directs a team of professionals and third-party vendors or service providers to achieve organisational goals in accordance with the data governance and data protection policies. He manages risks and data breach incidents. The Data Protection Officer is knowledgeable in areas of data governance, compliance and data protection policies and frameworks, and works within and across teams to mitigate data breaches. He is expected to be proficient in the requirements under the Personal Data Protection Act 2012. The Data Protection Officer adopts a broad and global perspective ...	The Senior Process Safety Engineer provides technical advice and guidance on process safety-related activities. He/She leads the implementation of the Process Safety Management (PSM) framework in the organisation, and reviews plant safeguarding system requirements to ensure compliance with process safety standards. In addition, he provides technical input for the development and maintenance of the organisations Major Hazard Installation (MHI) Safety Case. The Senior Process Safety Engineer administers the Workplace Safety and Health (WSH) and Environmental Management Systems (EMS) by advising on the development and improvement of Safe System of Work (SSoW) frameworks, and by ensuring proper closure of process safety incident investigations and their notification to relevant authorities. He provides support and advice for asset integrity assurance and compliance, and leads process safety reviews during new projects. In addition, he contributes to staff capability development by coachin...
`Maritime Craft Handler`	The Helmsman manoeuvres and handles boats or crafts operating within the Port Limit of Singapore Territorial Waters. He/She is able to use the craft's navigational, fire-fighting and safety equipment and appreciate weather conditions, tides and tidal currents. He also performs basic chartwork, monitors and anticipates potential problems that may arise during daily operations and alerts the relevant authorities to them. He must pass a colour vision test and fulfil the requirements of the Port Limit Helmsman Licence issued by the Maritime and Port Authority of Singapore (MPA).	The Head of Design strategises the design and development of the product line lifecycle, including the end-to-end iterative design process. He/She establishes design policy principles to drive product development in the conceptualisation and design phase, including endorsement of design strategies, and achieving design solutions based on insights researched by the team He provides insightful directives based on the evaluation of design concepts and drawings by the team to determine the best product and ensure that it is aligned to the latest market trends. He has a strong understanding on how product technologies and frameworks can formulate impactful design concepts, is well-versed in product development lifecycles and stays abreast of the latest emerging industry trends in terms of product design. The Head of Design adopts a global mindset while distilling market trends to incorporate them into novel product design strategies, with a clear view of how this sits within the product ...

Loss: TripletLoss with these parameters:

{
    "distance_metric": "TripletDistanceMetric.EUCLIDEAN",
    "triplet_margin": 5
}

Training Hyperparameters

Non-Default Hyperparameters

eval_strategy: epoch
per_device_train_batch_size: 4
per_device_eval_batch_size: 4
gradient_accumulation_steps: 4
learning_rate: 2e-05
lr_scheduler_type: cosine
warmup_ratio: 0.1
load_best_model_at_end: True
batch_sampler: no_duplicates

All Hyperparameters

overwrite_output_dir: False
do_predict: False
eval_strategy: epoch
prediction_loss_only: True
per_device_train_batch_size: 4
per_device_eval_batch_size: 4
per_gpu_train_batch_size: None
per_gpu_eval_batch_size: None
gradient_accumulation_steps: 4
eval_accumulation_steps: None
torch_empty_cache_steps: None
learning_rate: 2e-05
weight_decay: 0.0
adam_beta1: 0.9
adam_beta2: 0.999
adam_epsilon: 1e-08
max_grad_norm: 1.0
num_train_epochs: 3
max_steps: -1
lr_scheduler_type: cosine
lr_scheduler_kwargs: {}
warmup_ratio: 0.1
warmup_steps: 0
log_level: passive
log_level_replica: warning
log_on_each_node: True
logging_nan_inf_filter: True
save_safetensors: True
save_on_each_node: False
save_only_model: False
restore_callback_states_from_checkpoint: False
no_cuda: False
use_cpu: False
use_mps_device: False
seed: 42
data_seed: None
jit_mode_eval: False
use_ipex: False
bf16: False
fp16: False
fp16_opt_level: O1
half_precision_backend: auto
bf16_full_eval: False
fp16_full_eval: False
tf32: None
local_rank: 0
ddp_backend: None
tpu_num_cores: None
tpu_metrics_debug: False
debug: []
dataloader_drop_last: False
dataloader_num_workers: 0
dataloader_prefetch_factor: None
past_index: -1
disable_tqdm: False
remove_unused_columns: True
label_names: None
load_best_model_at_end: True
ignore_data_skip: False
fsdp: []
fsdp_min_num_params: 0
fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
fsdp_transformer_layer_cls_to_wrap: None
accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
deepspeed: None
label_smoothing_factor: 0.0
optim: adamw_torch
optim_args: None
adafactor: False
group_by_length: False
length_column_name: length
ddp_find_unused_parameters: None
ddp_bucket_cap_mb: None
ddp_broadcast_buffers: False
dataloader_pin_memory: True
dataloader_persistent_workers: False
skip_memory_metrics: True
use_legacy_prediction_loop: False
push_to_hub: False
resume_from_checkpoint: None
hub_model_id: None
hub_strategy: every_save
hub_private_repo: None
hub_always_push: False
gradient_checkpointing: False
gradient_checkpointing_kwargs: None
include_inputs_for_metrics: False
include_for_metrics: []
eval_do_concat_batches: True
fp16_backend: auto
push_to_hub_model_id: None
push_to_hub_organization: None
mp_parameters:
auto_find_batch_size: False
full_determinism: False
torchdynamo: None
ray_scope: last
ddp_timeout: 1800
torch_compile: False
torch_compile_backend: None
torch_compile_mode: None
dispatch_batches: None
split_batches: None
include_tokens_per_second: False
include_num_input_tokens_seen: False
neftune_noise_alpha: None
optim_target_modules: None
batch_eval_metrics: False
eval_on_start: False
use_liger_kernel: False
eval_use_gather_object: False
average_tokens_across_devices: False
prompts: None
batch_sampler: no_duplicates
multi_dataset_batch_sampler: proportional

Training Logs

Epoch	Step	Training Loss	Validation Loss	cosine_accuracy
1.0	50	-	4.3018	0.9800
2.0	100	4.3737	4.1720	0.9850
3.0	150	-	4.1523	0.9800

The bold row denotes the saved checkpoint.
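The step counts in the table follow from the hyperparameters listed earlier. The short calculation below reproduces them, assuming training on a single device (the device count is not stated in this card) and assuming the batch sampler did not drop any samples.

```python
import math

# Values taken from the training details in this card.
train_samples = 800
per_device_train_batch_size = 4
gradient_accumulation_steps = 4
num_train_epochs = 3

num_devices = 1  # assumption: not stated in the card

# Samples consumed per optimizer step.
effective_batch_size = (
    per_device_train_batch_size * gradient_accumulation_steps * num_devices
)
steps_per_epoch = math.ceil(train_samples / effective_batch_size)
total_steps = steps_per_epoch * num_train_epochs

print(effective_batch_size, steps_per_epoch, total_steps)
```

Under these assumptions each epoch ends at a multiple of 50 optimizer steps, which matches the Step column (50, 100, 150).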

Framework Versions

Python: 3.12.6
Sentence Transformers: 4.1.0
Transformers: 4.48.0.dev0
PyTorch: 2.6.0+cu124
Accelerate: 1.2.1
Datasets: 3.1.0
Tokenizers: 0.21.0

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

TripletLoss

@misc{hermans2017defense,
    title={In Defense of the Triplet Loss for Person Re-Identification},
    author={Alexander Hermans and Lucas Beyer and Bastian Leibe},
    year={2017},
    eprint={1703.07737},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}