Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -88,6 +88,20 @@ Comparing two-stage pipeline (bi-encoder + reranker) vs bi-encoder alone:
|
|
| 88 |
|
| 89 |
The reranker provides significant gains on complex, multi-part queries typical of board exam questions.
|
| 90 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 91 |
## Quick Start
|
| 92 |
|
| 93 |
### Installation
|
|
@@ -359,13 +373,13 @@ reranker = CrossEncoder(
|
|
| 359 |
If you use RadLITE in your work, please cite:
|
| 360 |
|
| 361 |
```bibtex
|
| 362 |
-
@
|
| 363 |
-
title = {
|
| 364 |
-
author = {
|
| 365 |
year = {2026},
|
| 366 |
-
|
| 367 |
-
|
| 368 |
-
|
| 369 |
}
|
| 370 |
```
|
| 371 |
|
|
|
|
| 88 |
|
| 89 |
The reranker provides significant gains on complex, multi-part queries typical of board exam questions.
|
| 90 |
|
| 91 |
+
### Published Benchmark Results
|
| 92 |
+
|
| 93 |
+
From [Matulich & Mason, 2026](https://huggingface.co/matulichpt/radlit-biencoder):
|
| 94 |
+
|
| 95 |
+
| Benchmark | RadLIT Result | Key Finding |
|
| 96 |
+
|-----------|---------------|-------------|
|
| 97 |
+
| NFCorpus nDCG@10 | 0.268 | **17.9x improvement** over RadBERT bi-encoder (0.015) |
|
| 98 |
+
| VQA-RAD MRR | 0.972 | Near-perfect retrieval on radiology Q&A |
|
| 99 |
+
| RadLIT-9 Thoracic | 0.736 nDCG@10 | **Best-in-class** (beat BGE-large, ColBERTv2) |
|
| 100 |
+
| RadLIT-9 Pediatric | 0.625 nDCG@10 | **Best-in-class** (beat BGE-large, ColBERTv2) |
|
| 101 |
+
| Zebra Test | 92% found rate | 2.1x improvement on rare conditions vs ColBERTv2 |
|
| 102 |
+
|
| 103 |
+
**Vocabulary Alignment Hypothesis**: Domain training provides measurable advantage when queries use radiology-specific terminology that aligns with the training domain.
|
| 104 |
+
|
| 105 |
## Quick Start
|
| 106 |
|
| 107 |
### Installation
|
|
|
|
| 373 |
If you use RadLITE in your work, please cite:
|
| 374 |
|
| 375 |
```bibtex
|
| 376 |
+
@article{matulich2026radlit,
|
| 377 |
+
title = {Late Interaction Retrieval Unlocks Domain Knowledge in Radiology Language Models},
|
| 378 |
+
author = {Matulich, Patrick and Mason, Dan},
|
| 379 |
year = {2026},
|
| 380 |
+
journal = {Radiology: Artificial Intelligence},
|
| 381 |
+
note = {17.9x improvement over RadBERT; best-in-class on Thoracic/Pediatric subspecialties},
|
| 382 |
+
url = {https://huggingface.co/matulichpt/radlit-biencoder}
|
| 383 |
}
|
| 384 |
```
|
| 385 |
|