KaiYinTAMU commited on
Commit
eab47fb
·
verified ·
1 Parent(s): 83f705c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +30 -0
README.md CHANGED
@@ -13,6 +13,36 @@ This model is trained through the approach described in [DMRetriever: A Family o
13
  The associated GitHub repository is available [here](https://github.com/KaiYin97/DMRETRIEVER).
14
  This model has 335M parameters.
15
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
16
  ## Usage
17
 
18
  Using HuggingFace Transformers:
 
13
  The associated GitHub repository is available [here](https://github.com/KaiYin97/DMRETRIEVER).
14
  This model has 335M parameters.
15
 
16
+ ## Model Overview
17
+
18
+ **DMRetriever-335M** has the following features:
19
+
20
+ - Model Type: Text Embedding
21
+ - Supported Languages: English
22
+ - Number of Paramaters: 335
23
+ - Context Length: 512
24
+ - Embedding Dimension: 1024
25
+
26
+ For more details, including model training, benchmark evaluation, and inference performance, please refer to our [paper](https://www.arxiv.org/abs/2510.15087), [GitHub](https://github.com/KaiYin97/DMRETRIEVER).
27
+
28
+ ## DMRetriever series model list
29
+
30
+ | **Model** | **Description** | **Backbone** | **Backbone Type** | **Hidden Size** | **#Layers** |
31
+ |:--|:--|:--|:--|:--:|:--:|
32
+ | [DMRetriever-33M](https://huggingface.co/DMIR01/DMRetriever-33M) | Base 33M variant | MiniLM | Encoder-only | 384 | 12 |
33
+ | [DMRetriever-33M-PT](https://huggingface.co/DMIR01/DMRetriever-33M-PT) | Pre-trained version of 33M | MiniLM | Encoder-only | 384 | 12 |
34
+ | [DMRetriever-109M](https://huggingface.co/DMIR01/DMRetriever-109M) | Base 109M variant | BERT-base-uncased | Encoder-only | 768 | 12 |
35
+ | [DMRetriever-109M-PT](https://huggingface.co/DMIR01/DMRetriever-109M-PT) | Pre-trained version of 109M | BERT-base-uncased | Encoder-only | 768 | 12 |
36
+ | [DMRetriever-335M](https://huggingface.co/DMIR01/DMRetriever-335M) | Base 335M variant | BERT-large-uncased-WWM | Encoder-only | 1024 | 24 |
37
+ | [DMRetriever-335M-PT](https://huggingface.co/DMIR01/DMRetriever-335M-PT) | Pre-trained version of 335M | BERT-large-uncased-WWM | Encoder-only | 1024 | 24 |
38
+ | [DMRetriever-596M](https://huggingface.co/DMIR01/DMRetriever-596M) | Base 596M variant | Qwen3-0.6B | Decoder-only | 1024 | 28 |
39
+ | [DMRetriever-596M-PT](https://huggingface.co/DMIR01/DMRetriever-596M-PT) | Pre-trained version of 596M | Qwen3-0.6B | Decoder-only | 1024 | 28 |
40
+ | [DMRetriever-4B](https://huggingface.co/DMIR01/DMRetriever-4B) | Base 4B variant | Qwen3-4B | Decoder-only | 2560 | 36 |
41
+ | [DMRetriever-4B-PT](https://huggingface.co/DMIR01/DMRetriever-4B-PT) | Pre-trained version of 4B | Qwen3-4B | Decoder-only | 2560 | 36 |
42
+ | [DMRetriever-7.6B](https://huggingface.co/DMIR01/DMRetriever-7.6B) | Base 7.6B variant | Qwen3-8B | Decoder-only | 4096 | 36 |
43
+ | [DMRetriever-7.6B-PT](https://huggingface.co/DMIR01/DMRetriever-7.6B-PT) | Pre-trained version of 7.6B | Qwen3-8B | Decoder-only | 4096 | 36 |
44
+
45
+
46
  ## Usage
47
 
48
  Using HuggingFace Transformers: