MongoDB/mdbr-leaf-ir

rvo committed on Aug 12, 2025

Commit b6beb1a · verified · 1 Parent(s): ed21118

Upload README.md

Files changed (1)

README.md +4 -3

@@ -22,13 +22,14 @@ language:
 ## Introduction
-`mdbr-leaf-ir` is a compact high-performance text embedding model specifically designed for **information retrieval (IR)** tasks, e.g., the retrieveal part of RAGs.
+`mdbr-leaf-ir` is a compact high-performance text embedding model specifically designed for **information retrieval (IR)** tasks, e.g., the retrieval part of RAGs.
 Enabling even greater efficiency, `mdbr-leaf-ir` supports [flexible asymmetric architectures](#asymmetric-retrieval-setup) and is robust to [vector quantization](#vector-quantization) and [MRL truncation](#mrl).
 If you are looking to perform other tasks such as classification, clustering, semantic sentence similarity, summarization, please check out our [`mdbr-leaf-mt`](https://huggingface.co/MongoDB/mdbr-leaf-mt) model.
-Note: this model is the result of MongoDB Research's ML team. At the time of writing it is not used in any of MongoDB's commercial product or service offerings.
+> [!Note]
+> this model is the result of MongoDB Research's ML team. At the time of writing it is not used in any of MongoDB's commercial product or service offerings.
 ## Technical Report
@@ -89,7 +90,7 @@ See [here](https://huggingface.co/MongoDB/mdbr-leaf-ir/blob/main/transformers_ex
 ### Asymmetric Retrieval Setup
-`mdbr-leaf-ir` is *aligned* to [`snowflake-arctic-embed-m-v1.5`](https://huggingface.co/Snowflake/snowflake-arctic-embed-m-v1.5), the model it has been distilled from. This enables flexible archiectures in which, for example, documents are encoded using the larger model, while queries can be encoded faster and more efficiently with the compact `leaf` model:
+`mdbr-leaf-ir` is *aligned* to [`snowflake-arctic-embed-m-v1.5`](https://huggingface.co/Snowflake/snowflake-arctic-embed-m-v1.5), the model it has been distilled from. This enables flexible architectures in which, for example, documents are encoded using the larger model, while queries can be encoded faster and more efficiently with the compact `leaf` model:
 ```python
 # Use mdbr-leaf-ir for query encoding (real-time, low latency)
 query_model = SentenceTransformer("MongoDB/mdbr-leaf-ir")