Sentence Similarity
sentence-transformers
Safetensors
Transformers
qwen3
text-generation
feature-extraction
qwen
recruitment
LoRA
text-embeddings-inference
Instructions to use JayThinkDiff/CRE-1.1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use JayThinkDiff/CRE-1.1 with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("JayThinkDiff/CRE-1.1") sentences = [ "That is a happy person", "That is a happy dog", "That is a very happy person", "Today is a sunny day" ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4] - Transformers
How to use JayThinkDiff/CRE-1.1 with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("JayThinkDiff/CRE-1.1") model = AutoModelForCausalLM.from_pretrained("JayThinkDiff/CRE-1.1") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -59,6 +59,24 @@ print("查询结果:", util.cos_sim(query_embedding, passage_embedding))
|
|
| 59 |
| **CRE-1.1** | 0.5816 | **0.6093** |
|
| 60 |
| **Qwen3-Embedding-8B** | **0.7731** | 0.7638 |
|
| 61 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 62 |
### 🛠️ 技术规格 (Technical Specifications)
|
| 63 |
|
| 64 |
* **Pooling Strategy**: 推荐使用模型默认的表征方式(last token pooling)。
|
|
|
|
| 59 |
| **CRE-1.1** | 0.5816 | **0.6093** |
|
| 60 |
| **Qwen3-Embedding-8B** | **0.7731** | 0.7638 |
|
| 61 |
|
| 62 |
+
### 🌐 跨领域招聘场景评测 (Cross-Domain Evaluation)
|
| 63 |
+
|
| 64 |
+
为了验证模型在不同垂直行业的泛化能力,我们在**技术岗(Technical)**与**职能岗(Functional)**两个极具代表性的招聘领域进行了对比测试。结果显示,**CRE-1.1** 在指令微调的加持下,不仅全面超越了传统 Embedding 模型,相比原始基座模型也有质的突破。
|
| 65 |
+
|
| 66 |
+
| 模型 (Model) | 技术岗 (Technical Domain) | 职能岗 (Functional Domain) |
|
| 67 |
+
| :--- | :---: | :---: |
|
| 68 |
+
| BGE | 34.05 | 58.18 |
|
| 69 |
+
| CRE-0.4 | 42.88 | 63.70 |
|
| 70 |
+
| Conan-embedding-v1 | 43.37 | 54.69 |
|
| 71 |
+
| CRE-0.5 | 45.44 | 64.14 |
|
| 72 |
+
| Qwen3-Embedding-8B | 58.96 | 66.25 |
|
| 73 |
+
| **CRE-1.1** | **64.44** | **69.29** |
|
| 74 |
+
|
| 75 |
+
> **核心结论 (Key Insights)**:
|
| 76 |
+
> 1. **指令微调的威力**:相比于基座模型 `Qwen3-Embedding-8B`,**CRE-1.1** 通过特定领域的指令增强,在算法领域得分提升了 **5.48**,金融领域提升了 **3.04**。
|
| 77 |
+
> 2. **压制级优势**:相比行业标杆 `BGE`,**CRE-1.1** 在算法领域的表现近乎**翻倍**(34.05 -> 64.44),充分证明了其在处理复杂专业术语对齐时的卓越性能。
|
| 78 |
+
> 3. **领域泛化**:即便是泛化在职能岗领域,CRE-1.1 依然达到了 **69.29** 的高分,展现了极强的跨行业迁移能力。
|
| 79 |
+
|
| 80 |
### 🛠️ 技术规格 (Technical Specifications)
|
| 81 |
|
| 82 |
* **Pooling Strategy**: 推荐使用模型默认的表征方式(last token pooling)。
|