manu committed on
Commit 78154a6 · verified · 1 Parent(s): 39e9e8b

Update README.md

Files changed (1)
  1. README.md +7 -23
README.md CHANGED
@@ -12,7 +12,7 @@ pinned: true
 
  <img src="https://cdn-uploads.huggingface.co/production/uploads/60f2e021adf471cbdf8bb660/jq_zYRy23bOZ9qey3VY4v.png" width="800">
 
- This organization contains all artifacts released with our preprint [*Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings *](https://arxiv.org/abs/XXX),
+ This organization contains all artifacts released with our preprint [*Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings*](https://arxiv.org/abs/XXX),
  including the [ConTEB](https://huggingface.co/collections/illuin-conteb/conteb-datasets-6839fffd25f1d3685f3ad604) benchmark.
 
  ### Abstract
@@ -25,29 +25,13 @@ We open-source all artifacts here and at https://github.com/illuin-tech/contextu
 
  ## Models
 
- - TODO
-
- ## Benchmark
-
+ - [*(Model) ModernBERT*](TODO) The Contextualized ModernBERT bi-encoder trained with InSENT loss and Late Chunking
+ - [*(Model) ModernColBERT*](TODO) The Contextualized ModernColBERT trained with InSENT loss and Late Chunking
  - [*Leaderboard*](TODO)
- -
- ## Datasets
-
- We organized datasets into collections to constitute our benchmark ViDoRe and its derivates (OCR and Captioning). Below is a brief description of each of them.
-
- - [*ConTEB Benchmark*](TODO)
- -
- ## Code
-
-
- CHANGE
-
- - [*Contextual Document Engine*](https://github.com/illuin-tech/contextual-document-embeddings): The code used to train and run inference with our architecture.
- - [*ConTEB Benchmarkk*](https://github.com/illuin-tech/conteb-benchmark): A Python package/CLI tool to evaluate document retrieval systems on the ViDoRe benchmark.
-
- ## Extra
-
- - [*Blog*](https://huggingface.co/XXX: TODO
+ - [*(Data) ConTEB Benchmark Datasets*](TODO)
+ - [*(Code) Contextual Document Engine*](https://github.com/illuin-tech/contextual-embeddings): The code used to train and run inference with our architecture.
+ - [*(Code) ConTEB Benchmarkk*](https://github.com/illuin-tech/conteb): A Python package/CLI tool to evaluate document retrieval systems on the ConTEB benchmark.
+ - [*Blog*](https://huggingface.co/XXX): TODO
  - [*Preprint*](https://huggingface.co/XXX): The paper with all details !
 
  ## Contact