youchengChung committed on
Commit 24fe663 · verified · 1 Parent(s): 79c10a7

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +43 -0
README.md ADDED
@@ -0,0 +1,43 @@
# Job Tag Embedding Model (Dev)

Fine-tuned embedding model for job category recommendation based on [BAAI/bge-large-zh-v1.5](https://huggingface.co/BAAI/bge-large-zh-v1.5).

## Model Details

- **Base Model:** 1111DataScience/job_tag_embedding
- **Training Data:** Job title and category pairs
- **Training Steps:** 1,920 (3 epochs)
- **Final Loss:** 2.126

## Usage
```python
from FlagEmbedding import FlagModel

# Load the fine-tuned model in half precision
model = FlagModel('1111DataScience/job_tag_embedding_dev', use_fp16=True)

# Encode the query (job title); this one means
# "front- and back-of-house management trainee"
query_embedding = model.encode_queries(["內外場儲備幹部"])

# Encode the candidates (job categories)
candidate_embeddings = model.encode([
    "儲備幹部",      # management trainee
    "餐廚助手",      # kitchen assistant
    "餐飲服務人員",  # food service staff
])

# Similarity matrix: one row per query, one column per candidate
similarities = query_embedding @ candidate_embeddings.T
```
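The usage snippet stops at the raw similarity matrix. For recommendation, the scores are usually turned into a ranked list of categories. The following is a minimal sketch of that step, not part of the original README: `cosine` and `rank_candidates` are hypothetical helper names, and the three-dimensional vectors are made-up stand-ins for real embeddings so that the example runs without downloading the model.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(float(a) * float(b) for a, b in zip(u, v))
    norm_u = math.sqrt(sum(float(a) ** 2 for a in u))
    norm_v = math.sqrt(sum(float(b) ** 2 for b in v))
    return dot / (norm_u * norm_v)

def rank_candidates(query_vec, candidate_vecs, labels, top_k=3):
    """Return up to top_k (label, score) pairs, highest similarity first."""
    scored = [
        (label, cosine(query_vec, vec))
        for label, vec in zip(labels, candidate_vecs)
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]

# Stand-in vectors (assumption): real embeddings come from the model
# and have far more dimensions.
labels = ["儲備幹部", "餐廚助手", "餐飲服務人員"]
query_vec = [0.9, 0.1, 0.3]
candidate_vecs = [
    [1.0, 0.0, 0.2],
    [0.0, 1.0, 0.1],
    [0.5, 0.2, 0.8],
]

ranking = rank_candidates(query_vec, candidate_vecs, labels)
for label, score in ranking:
    print(f"{label}\t{score:.3f}")
```

With real embeddings, `rank_candidates(query_embedding[0], candidate_embeddings, labels)` should work the same way, since the helpers only iterate over the values. If the embeddings are already L2-normalized, the dot product from the usage snippet gives the same ordering and the explicit normalization here is redundant.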

## Training Command

```bash
torchrun --nproc_per_node 1 \
    -m FlagEmbedding.finetune.embedder.encoder_only.base \
    --model_name_or_path 1111DataScience/job_tag_embedding \
    --cache_dir ./cache/model \
    --train_data training_data.jsonl \
    --output_dir ./output
```
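The command reads `training_data.jsonl`, but the README does not show what that file looks like. As far as I know, the FlagEmbedding fine-tuning scripts expect one JSON object per line with a `query` string, a `pos` list, and a `neg` list. The sketch below builds such a file from job title and category pairs under that assumption. `build_records` and `write_jsonl` are hypothetical helpers, the single pair is illustrative data, and sampling negatives at random from the other categories is an assumption rather than the procedure used to train this model.

```python
import json
import random

def build_records(pairs, all_categories, num_negatives=2, seed=0):
    """Turn (title, category) pairs into query/pos/neg training records."""
    rng = random.Random(seed)
    records = []
    for title, category in pairs:
        # Every category other than the correct one is a possible negative.
        pool = [c for c in all_categories if c != category]
        negatives = rng.sample(pool, k=min(num_negatives, len(pool)))
        records.append({"query": title, "pos": [category], "neg": negatives})
    return records

def write_jsonl(records, path):
    """Write one JSON object per line, keeping Chinese text readable."""
    with open(path, "w", encoding="utf-8") as f:
        for record in records:
            f.write(json.dumps(record, ensure_ascii=False) + "\n")

# Illustrative data only (assumption), reusing the strings from the usage example.
categories = ["儲備幹部", "餐廚助手", "餐飲服務人員"]
pairs = [("內外場儲備幹部", "儲備幹部")]

records = build_records(pairs, categories)
write_jsonl(records, "training_data.jsonl")
```

Random negatives are the simplest choice. Harder negatives, such as categories that look similar to the correct one, typically give a stronger training signal but require an extra mining step.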