File size: 1,107 Bytes
24fe663 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 | # Job Tag Embedding Model (Dev)
Fine-tuned embedding model for job category recommendation based on [BAAI/bge-large-zh-v1.5](https://huggingface.co/BAAI/bge-large-zh-v1.5).
## Model Details
- **Base Model:** 1111DataScience/job_tag_embedding
- **Training Data:** Job titles and category pairs
- **Training Steps:** 1,920 (3 epochs)
- **Final Loss:** 2.126
## Usage
```python
from FlagEmbedding import FlagModel
# Load model
model = FlagModel('1111DataScience/job_tag_embedding_dev', use_fp16=True)
# Encode query (job title)
query_embedding = model.encode_queries(["內外場儲備幹部"])
# Encode candidates (job categories)
candidate_embeddings = model.encode([
"儲備幹部",
"餐廚助手",
"餐飲服務人員"
])
# Calculate similarity
similarities = query_embedding @ candidate_embeddings.T
```
## Training Command
```bash
torchrun --nproc_per_node 1 \
-m FlagEmbedding.finetune.embedder.encoder_only.base \
--model_name_or_path 1111DataScience/job_tag_embedding \
--cache_dir ./cache/model \
--train_data training_data.jsonl \
--output_dir ./output
```
|