Upload README.md
Browse files
README.md
CHANGED
|
@@ -22,13 +22,14 @@ language:
|
|
| 22 |
|
| 23 |
## Introduction
|
| 24 |
|
| 25 |
-
`mdbr-leaf-ir` is a compact high-performance text embedding model specifically designed for **information retrieval (IR)** tasks, e.g., the
|
| 26 |
|
| 27 |
Enabling even greater efficiency, `mdbr-leaf-ir` supports [flexible asymmetric architectures](#asymmetric-retrieval-setup) and is robust to [vector quantization](#vector-quantization) and [MRL truncation](#mrl).
|
| 28 |
|
| 29 |
If you are looking to perform other tasks such as classification, clustering, semantic sentence similarity, summarization, please check out our [`mdbr-leaf-mt`](https://huggingface.co/MongoDB/mdbr-leaf-mt) model.
|
| 30 |
|
| 31 |
-
Note
|
|
|
|
| 32 |
|
| 33 |
## Technical Report
|
| 34 |
|
|
@@ -89,7 +90,7 @@ See [here](https://huggingface.co/MongoDB/mdbr-leaf-ir/blob/main/transformers_ex
|
|
| 89 |
|
| 90 |
### Asymmetric Retrieval Setup
|
| 91 |
|
| 92 |
-
`mdbr-leaf-ir` is *aligned* to [`snowflake-arctic-embed-m-v1.5`](https://huggingface.co/Snowflake/snowflake-arctic-embed-m-v1.5), the model it has been distilled from. This enables flexible
|
| 93 |
```python
|
| 94 |
# Use mdbr-leaf-ir for query encoding (real-time, low latency)
|
| 95 |
query_model = SentenceTransformer("MongoDB/mdbr-leaf-ir")
|
|
|
|
| 22 |
|
| 23 |
## Introduction
|
| 24 |
|
| 25 |
+
`mdbr-leaf-ir` is a compact high-performance text embedding model specifically designed for **information retrieval (IR)** tasks, e.g., the retrieval part of RAGs.
|
| 26 |
|
| 27 |
Enabling even greater efficiency, `mdbr-leaf-ir` supports [flexible asymmetric architectures](#asymmetric-retrieval-setup) and is robust to [vector quantization](#vector-quantization) and [MRL truncation](#mrl).
|
| 28 |
|
| 29 |
If you are looking to perform other tasks such as classification, clustering, semantic sentence similarity, summarization, please check out our [`mdbr-leaf-mt`](https://huggingface.co/MongoDB/mdbr-leaf-mt) model.
|
| 30 |
|
| 31 |
+
> [!Note]
|
| 32 |
+
> this model is the result of MongoDB Research's ML team. At the time of writing it is not used in any of MongoDB's commercial product or service offerings.
|
| 33 |
|
| 34 |
## Technical Report
|
| 35 |
|
|
|
|
| 90 |
|
| 91 |
### Asymmetric Retrieval Setup
|
| 92 |
|
| 93 |
+
`mdbr-leaf-ir` is *aligned* to [`snowflake-arctic-embed-m-v1.5`](https://huggingface.co/Snowflake/snowflake-arctic-embed-m-v1.5), the model it has been distilled from. This enables flexible architectures in which, for example, documents are encoded using the larger model, while queries can be encoded faster and more efficiently with the compact `leaf` model:
|
| 94 |
```python
|
| 95 |
# Use mdbr-leaf-ir for query encoding (real-time, low latency)
|
| 96 |
query_model = SentenceTransformer("MongoDB/mdbr-leaf-ir")
|