Update README.md
README.md
CHANGED
|
@@ -12,7 +12,7 @@ pinned: true
|
|
| 12 |
|
| 13 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/60f2e021adf471cbdf8bb660/jq_zYRy23bOZ9qey3VY4v.png" width="800">
|
| 14 |
|
| 15 |
-
This organization contains all artifacts released with our preprint [*Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings
|
| 16 |
including the [ConTEB](https://huggingface.co/collections/illuin-conteb/conteb-datasets-6839fffd25f1d3685f3ad604) benchmark.
|
| 17 |
|
| 18 |
### Abstract
|
|
@@ -25,29 +25,13 @@ We open-source all artifacts here and at https://github.com/illuin-tech/contextu
|
|
| 25 |
|
| 26 |
## Models
|
| 27 |
|
| 28 |
-
- TODO
|
| 29 |
-
|
| 30 |
-
## Benchmark
|
| 31 |
-
|
| 32 |
- [*Leaderboard*](TODO)
|
| 33 |
-
-
|
| 34 |
-
|
| 35 |
-
|
| 36 |
-
|
| 37 |
-
|
| 38 |
-
- [*ConTEB Benchmark*](TODO)
|
| 39 |
-
-
|
| 40 |
-
## Code
|
| 41 |
-
|
| 42 |
-
|
| 43 |
-
CHANGE
|
| 44 |
-
|
| 45 |
-
- [*Contextual Document Engine*](https://github.com/illuin-tech/contextual-document-embeddings): The code used to train and run inference with our architecture.
|
| 46 |
-
- [*ConTEB Benchmark*](https://github.com/illuin-tech/conteb-benchmark): A Python package/CLI tool to evaluate document retrieval systems on the ViDoRe benchmark.
|
| 47 |
-
|
| 48 |
-
## Extra
|
| 49 |
-
|
| 50 |
-
- [*Blog*](https://huggingface.co/XXX: TODO
|
| 51 |
- [*Preprint*](https://huggingface.co/XXX): The paper with all details !
|
| 52 |
|
| 53 |
## Contact
|
|
|
|
| 12 |
|
| 13 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/60f2e021adf471cbdf8bb660/jq_zYRy23bOZ9qey3VY4v.png" width="800">
|
| 14 |
|
| 15 |
+
This organization contains all artifacts released with our preprint [*Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings*](https://arxiv.org/abs/XXX),
|
| 16 |
including the [ConTEB](https://huggingface.co/collections/illuin-conteb/conteb-datasets-6839fffd25f1d3685f3ad604) benchmark.
|
| 17 |
|
| 18 |
### Abstract
|
|
|
|
| 25 |
|
| 26 |
## Models
|
| 27 |
|
| 28 |
+
- [*(Model) ModernBERT*](TODO) The Contextualized ModernBERT bi-encoder trained with InSENT loss and Late Chunking
|
| 29 |
+
- [*(Model) ModernColBERT*](TODO) The Contextualized ModernColBERT trained with InSENT loss and Late Chunking
|
| 30 |
- [*Leaderboard*](TODO)
|
| 31 |
+
- [*(Data) ConTEB Benchmark Datasets*](TODO)
|
| 32 |
+
- [*(Code) Contextual Document Engine*](https://github.com/illuin-tech/contextual-embeddings): The code used to train and run inference with our architecture.
|
| 33 |
+
- [*(Code) ConTEB Benchmark*](https://github.com/illuin-tech/conteb): A Python package/CLI tool to evaluate document retrieval systems on the ConTEB benchmark.
|
| 34 |
+
- [*Blog*](https://huggingface.co/XXX): TODO
|
| 35 |
- [*Preprint*](https://huggingface.co/XXX): The paper with all details !
|
| 36 |
|
| 37 |
## Contact
|