Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
|
@@ -8,11 +8,11 @@ pinned: true
|
|
| 8 |
---
|
| 9 |
# ConTEB: Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings
|
| 10 |
|
| 11 |
-
[ benchmark.
|
| 17 |
|
| 18 |
### Abstract
|
|
@@ -32,7 +32,7 @@ We open-source all artifacts here and at https://github.com/illuin-tech/contextu
|
|
| 32 |
- [*(Code) Contextual Document Engine*](https://github.com/illuin-tech/contextual-embeddings): The code used to train and run inference with our architecture.
|
| 33 |
- [*(Code) ConTEB Benchmarkk*](https://github.com/illuin-tech/conteb): A Python package/CLI tool to evaluate document retrieval systems on the ConTEB benchmark.
|
| 34 |
- [*Blog*](https://huggingface.co/XXX): TODO
|
| 35 |
-
- [*Preprint*](https://
|
| 36 |
|
| 37 |
## Contact
|
| 38 |
|
|
@@ -44,7 +44,14 @@ We open-source all artifacts here and at https://github.com/illuin-tech/contextu
|
|
| 44 |
If you use any datasets or models from this organization in your research, please cite the original dataset as follows:
|
| 45 |
|
| 46 |
```latex
|
| 47 |
-
@misc{
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 48 |
}
|
| 49 |
```
|
| 50 |
|
|
|
|
| 8 |
---
|
| 9 |
# ConTEB: Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings
|
| 10 |
|
| 11 |
+
[](https://arxiv.org/abs/2505.24782)
|
| 12 |
|
| 13 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/60f2e021adf471cbdf8bb660/jq_zYRy23bOZ9qey3VY4v.png" width="800">
|
| 14 |
|
| 15 |
+
This organization contains all artifacts released with our preprint [*Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings*](https://arxiv.org/abs/2505.24782),
|
| 16 |
including the [ConTEB](https://huggingface.co/collections/illuin-conteb/conteb-datasets-6839fffd25f1d3685f3ad604) benchmark.
|
| 17 |
|
| 18 |
### Abstract
|
|
|
|
| 32 |
- [*(Code) Contextual Document Engine*](https://github.com/illuin-tech/contextual-embeddings): The code used to train and run inference with our architecture.
|
| 33 |
- [*(Code) ConTEB Benchmarkk*](https://github.com/illuin-tech/conteb): A Python package/CLI tool to evaluate document retrieval systems on the ConTEB benchmark.
|
| 34 |
- [*Blog*](https://huggingface.co/XXX): TODO
|
| 35 |
+
- [*Preprint*](https://arxiv.org/abs/2505.24782): The paper with all details !
|
| 36 |
|
| 37 |
## Contact
|
| 38 |
|
|
|
|
| 44 |
If you use any datasets or models from this organization in your research, please cite the original dataset as follows:
|
| 45 |
|
| 46 |
```latex
|
| 47 |
+
@misc{conti2025contextgoldgoldpassage,
|
| 48 |
+
title={Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings},
|
| 49 |
+
author={Max Conti and Manuel Faysse and Gautier Viaud and Antoine Bosselut and Céline Hudelot and Pierre Colombo},
|
| 50 |
+
year={2025},
|
| 51 |
+
eprint={2505.24782},
|
| 52 |
+
archivePrefix={arXiv},
|
| 53 |
+
primaryClass={cs.IR},
|
| 54 |
+
url={https://arxiv.org/abs/2505.24782},
|
| 55 |
}
|
| 56 |
```
|
| 57 |
|