DMIR01
/

DMRetriever-335M

feature-extraction

information-retrieval

disaster-management

Model card Files Files and versions

KaiYinTAMU commited on Oct 23, 2025

Commit

eab47fb

·

verified ·

1 Parent(s): 83f705c

Update README.md

Files changed (1) hide show

README.md +30 -0

README.md CHANGED Viewed

@@ -13,6 +13,36 @@ This model is trained through the approach described in [DMRetriever: A Family o
 The associated GitHub repository is available [here](https://github.com/KaiYin97/DMRETRIEVER).
 This model has 335M parameters.
 ## Usage
 Using HuggingFace Transformers:

 The associated GitHub repository is available [here](https://github.com/KaiYin97/DMRETRIEVER).
 This model has 335M parameters.
+## Model Overview
+**DMRetriever-335M** has the following features:
+- Model Type: Text Embedding
+- Supported Languages: English
+- Number of Paramaters: 335
+- Context Length: 512
+- Embedding Dimension: 1024
+For more details, including model training, benchmark evaluation, and inference performance, please refer to our [paper](https://www.arxiv.org/abs/2510.15087), [GitHub](https://github.com/KaiYin97/DMRETRIEVER).
+## DMRetriever series model list
+| **Model** | **Description** | **Backbone** | **Backbone Type** | **Hidden Size** | **#Layers** |
+|:--|:--|:--|:--|:--:|:--:|
+| [DMRetriever-33M](https://huggingface.co/DMIR01/DMRetriever-33M) | Base 33M variant | MiniLM | Encoder-only | 384 | 12 |
+| [DMRetriever-33M-PT](https://huggingface.co/DMIR01/DMRetriever-33M-PT) | Pre-trained version of 33M | MiniLM | Encoder-only | 384 | 12 |
+| [DMRetriever-109M](https://huggingface.co/DMIR01/DMRetriever-109M) | Base 109M variant | BERT-base-uncased | Encoder-only | 768 | 12 |
+| [DMRetriever-109M-PT](https://huggingface.co/DMIR01/DMRetriever-109M-PT) | Pre-trained version of 109M | BERT-base-uncased | Encoder-only | 768 | 12 |
+| [DMRetriever-335M](https://huggingface.co/DMIR01/DMRetriever-335M) | Base 335M variant | BERT-large-uncased-WWM | Encoder-only | 1024 | 24 |
+| [DMRetriever-335M-PT](https://huggingface.co/DMIR01/DMRetriever-335M-PT) | Pre-trained version of 335M | BERT-large-uncased-WWM | Encoder-only | 1024 | 24 |
+| [DMRetriever-596M](https://huggingface.co/DMIR01/DMRetriever-596M) | Base 596M variant | Qwen3-0.6B | Decoder-only | 1024 | 28 |
+| [DMRetriever-596M-PT](https://huggingface.co/DMIR01/DMRetriever-596M-PT) | Pre-trained version of 596M | Qwen3-0.6B | Decoder-only | 1024 | 28 |
+| [DMRetriever-4B](https://huggingface.co/DMIR01/DMRetriever-4B) | Base 4B variant | Qwen3-4B | Decoder-only | 2560 | 36 |
+| [DMRetriever-4B-PT](https://huggingface.co/DMIR01/DMRetriever-4B-PT) | Pre-trained version of 4B | Qwen3-4B | Decoder-only | 2560 | 36 |
+| [DMRetriever-7.6B](https://huggingface.co/DMIR01/DMRetriever-7.6B) | Base 7.6B variant | Qwen3-8B | Decoder-only | 4096 | 36 |
+| [DMRetriever-7.6B-PT](https://huggingface.co/DMIR01/DMRetriever-7.6B-PT) | Pre-trained version of 7.6B | Qwen3-8B | Decoder-only | 4096 | 36 |
 ## Usage
 Using HuggingFace Transformers: