---
frameworks: PyTorch
license: apache-2.0
tags:
- 向量检索
- 中医
- 医疗
tasks:
- sentence-embedding
base_model:
- BAAI/bge-m3
base_model_relation: finetune
---
# DeepPulse-Embedding Dense Retrieval Model Series
**DeepPulse (深度把脉)** is a member of 心语心言's open-source TCM series models. This project includes two dense retrieval (Embedding) models fine-tuned on different base models:
* **DeepPulse-Embedding-m3**: Fine-tuned based on `BGE-m3`.
* **DeepPulse-Embedding-0.6b**: Fine-tuned based on `Qwen3-0.6B`.
Both models are fine-tuned using a self-built medical dataset (especially TCM data) and are optimized for document chunk retrieval scenarios in the medical field.
# Self-Built Dataset Evaluation Metrics
The evaluation results on the self-built medical dataset are as follows. It can be seen that the DeepPulse series models outperform the original base models on all metrics.
| Model Name | MRR | NDCG@10 | Recall@1 | Recall@5 | Recall@10 |
| :--- | :--- | :--- | :--- | :--- | :--- |
| Qwen3-0.6B | 0.9458 | 0.9566 | 0.9157 | 0.9822 | 0.99 |
| bge-m3 | 0.9418 | 0.9519 | 0.9109 | 0.98 | 0.9831 |
| DeepPulse-Embedding-0.6b | 0.9693 | 0.9751 | 0.9513 | 0.9891 | 0.9935 |
| DeepPulse-Embedding-m3 | 0.9729 | 0.9781 | 0.957 | 0.9896 | 0.9948 |
# Acknowledgement
DeepPulse-Embedding was trained by the algorithm team from 心语心言.
* [ChenCh2002](https://www.modelscope.cn/profile/ChenCh2002) completed the code implementation, model training, and evaluation work.
* [ChenCh2002](https://www.modelscope.cn/profile/ChenCh2002) and [Night-Quiet](https://huggingface.co/Night-Quiet) completed the data collection and organization work together.
* The project was led by [Night-Quiet](https://huggingface.co/Night-Quiet).