matulichpt committed on
Commit 57a94a6 · verified · 1 Parent(s): 0290339

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +20 -6
README.md CHANGED
@@ -88,6 +88,20 @@ Comparing two-stage pipeline (bi-encoder + reranker) vs bi-encoder alone:
88
 
89
  The reranker provides significant gains on complex, multi-part queries typical of board exam questions.
90
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
91
  ## Quick Start
92
 
93
  ### Installation
@@ -359,13 +373,13 @@ reranker = CrossEncoder(
359
  If you use RadLITE in your work, please cite:
360
 
361
  ```bibtex
362
- @software{radlite_2026,
363
- title = {RadLITE: Calibrated Multi-Stage Retrieval for Radiology Education},
364
- author = {Grai Team},
365
  year = {2026},
366
- month = {January},
367
- url = {https://huggingface.co/matulichpt/radlit-crossencoder},
368
- note = {MRR 0.829 on RadLIT-9 benchmark}
369
  }
370
  ```
371
 
 
88
 
89
  The reranker provides significant gains on complex, multi-part queries typical of board exam questions.
90
 
91
+ ### Published Benchmark Results
92
+
93
+ From [Matulich & Mason, 2026](https://huggingface.co/matulichpt/radlit-biencoder):
94
+
95
+ | Benchmark | RadLIT Result | Key Finding |
96
+ |-----------|---------------|-------------|
97
+ | NFCorpus nDCG@10 | 0.268 | **17.9x improvement** over RadBERT bi-encoder (0.015) |
98
+ | VQA-RAD MRR | 0.972 | Near-perfect retrieval on radiology Q&A |
99
+ | RadLIT-9 Thoracic | 0.736 nDCG@10 | **Best-in-class** (beat BGE-large, ColBERTv2) |
100
+ | RadLIT-9 Pediatric | 0.625 nDCG@10 | **Best-in-class** (beat BGE-large, ColBERTv2) |
101
+ | Zebra Test | 92% found rate | 2.1x improvement on rare conditions vs ColBERTv2 |
102
+
103
+ **Vocabulary Alignment Hypothesis**: Domain training provides measurable advantage when queries use radiology-specific terminology that aligns with the training domain.
104
+
105
  ## Quick Start
106
 
107
  ### Installation
 
373
  If you use RadLITE in your work, please cite:
374
 
375
  ```bibtex
376
+ @article{matulich2026radlit,
377
+ title = {Late Interaction Retrieval Unlocks Domain Knowledge in Radiology Language Models},
378
+ author = {Matulich, Patrick and Mason, Dan},
379
  year = {2026},
380
+ journal = {Radiology: Artificial Intelligence},
381
+ note = {17.9x improvement over RadBERT; best-in-class on Thoracic/Pediatric subspecialties},
382
+ url = {https://huggingface.co/matulichpt/radlit-biencoder}
383
  }
384
  ```
385