manu commited on
Commit
feea58b
·
verified ·
1 Parent(s): 9f159e3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -4
README.md CHANGED
@@ -8,11 +8,11 @@ pinned: true
8
  ---
9
  # ConTEB: Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings
10
 
11
- [![arXiv](https://img.shields.io/badge/arXiv-2407.01449-b31b1b.svg?style=for-the-badge)](https://arxiv.org/abs/XXX)
12
 
13
  <img src="https://cdn-uploads.huggingface.co/production/uploads/60f2e021adf471cbdf8bb660/jq_zYRy23bOZ9qey3VY4v.png" width="800">
14
 
15
- This organization contains all artifacts released with our preprint [*Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings*](https://arxiv.org/abs/XXX),
16
  including the [ConTEB](https://huggingface.co/collections/illuin-conteb/conteb-datasets-6839fffd25f1d3685f3ad604) benchmark.
17
 
18
  ### Abstract
@@ -32,7 +32,7 @@ We open-source all artifacts here and at https://github.com/illuin-tech/contextu
32
  - [*(Code) Contextual Document Engine*](https://github.com/illuin-tech/contextual-embeddings): The code used to train and run inference with our architecture.
33
  - [*(Code) ConTEB Benchmarkk*](https://github.com/illuin-tech/conteb): A Python package/CLI tool to evaluate document retrieval systems on the ConTEB benchmark.
34
  - [*Blog*](https://huggingface.co/XXX): TODO
35
- - [*Preprint*](https://huggingface.co/XXX): The paper with all details !
36
 
37
  ## Contact
38
 
@@ -44,7 +44,14 @@ We open-source all artifacts here and at https://github.com/illuin-tech/contextu
44
  If you use any datasets or models from this organization in your research, please cite the original dataset as follows:
45
 
46
  ```latex
47
- @misc{
 
 
 
 
 
 
 
48
  }
49
  ```
50
 
 
8
  ---
9
  # ConTEB: Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings
10
 
11
+ [![arXiv](https://img.shields.io/badge/arXiv-2505.24782-b31b1b.svg?style=for-the-badge)](https://arxiv.org/abs/2505.24782)
12
 
13
  <img src="https://cdn-uploads.huggingface.co/production/uploads/60f2e021adf471cbdf8bb660/jq_zYRy23bOZ9qey3VY4v.png" width="800">
14
 
15
+ This organization contains all artifacts released with our preprint [*Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings*](https://arxiv.org/abs/2505.24782),
16
  including the [ConTEB](https://huggingface.co/collections/illuin-conteb/conteb-datasets-6839fffd25f1d3685f3ad604) benchmark.
17
 
18
  ### Abstract
 
32
  - [*(Code) Contextual Document Engine*](https://github.com/illuin-tech/contextual-embeddings): The code used to train and run inference with our architecture.
33
  - [*(Code) ConTEB Benchmarkk*](https://github.com/illuin-tech/conteb): A Python package/CLI tool to evaluate document retrieval systems on the ConTEB benchmark.
34
  - [*Blog*](https://huggingface.co/XXX): TODO
35
+ - [*Preprint*](https://arxiv.org/abs/2505.24782): The paper with all details !
36
 
37
  ## Contact
38
 
 
44
  If you use any datasets or models from this organization in your research, please cite the original dataset as follows:
45
 
46
  ```latex
47
+ @misc{conti2025contextgoldgoldpassage,
48
+ title={Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings},
49
+ author={Max Conti and Manuel Faysse and Gautier Viaud and Antoine Bosselut and Céline Hudelot and Pierre Colombo},
50
+ year={2025},
51
+ eprint={2505.24782},
52
+ archivePrefix={arXiv},
53
+ primaryClass={cs.IR},
54
+ url={https://arxiv.org/abs/2505.24782},
55
  }
56
  ```
57