matulichpt committed on
Commit
2046a6b
·
verified ·
1 Parent(s): d746609

Upload folder using huggingface_hub

Files changed (8)
  1. LICENSE +190 -0
  2. README.md +377 -0
  3. config.json +32 -0
  4. model.safetensors +3 -0
  5. special_tokens_map.json +37 -0
  6. tokenizer.json +0 -0
  7. tokenizer_config.json +58 -0
  8. vocab.txt +0 -0
LICENSE ADDED
@@ -0,0 +1,190 @@
+                                  Apache License
+                            Version 2.0, January 2004
+                         http://www.apache.org/licenses/
+
+    TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
+
+    1. Definitions.
+
+       "License" shall mean the terms and conditions for use, reproduction,
+       and distribution as defined by Sections 1 through 9 of this document.
+
+       "Licensor" shall mean the copyright owner or entity authorized by
+       the copyright owner that is granting the License.
+
+       "Legal Entity" shall mean the union of the acting entity and all
+       other entities that control, are controlled by, or are under common
+       control with that entity. For the purposes of this definition,
+       "control" means (i) the power, direct or indirect, to cause the
+       direction or management of such entity, whether by contract or
+       otherwise, or (ii) ownership of fifty percent (50%) or more of the
+       outstanding shares, or (iii) beneficial ownership of such entity.
+
+       "You" (or "Your") shall mean an individual or Legal Entity
+       exercising permissions granted by this License.
+
+       "Source" form shall mean the preferred form for making modifications,
+       including but not limited to software source code, documentation
+       source, and configuration files.
+
+       "Object" form shall mean any form resulting from mechanical
+       transformation or translation of a Source form, including but
+       not limited to compiled object code, generated documentation,
+       and conversions to other media types.
+
+       "Work" shall mean the work of authorship, whether in Source or
+       Object form, made available under the License, as indicated by a
+       copyright notice that is included in or attached to the work
+       (an example is provided in the Appendix below).
+
+       "Derivative Works" shall mean any work, whether in Source or Object
+       form, that is based on (or derived from) the Work and for which the
+       editorial revisions, annotations, elaborations, or other modifications
+       represent, as a whole, an original work of authorship. For the purposes
+       of this License, Derivative Works shall not include works that remain
+       separable from, or merely link (or bind by name) to the interfaces of,
+       the Work and Derivative Works thereof.
+
+       "Contribution" shall mean any work of authorship, including
+       the original version of the Work and any modifications or additions
+       to that Work or Derivative Works thereof, that is intentionally
+       submitted to the Licensor for inclusion in the Work by the copyright owner
+       or by an individual or Legal Entity authorized to submit on behalf of
+       the copyright owner. For the purposes of this definition, "submitted"
+       means any form of electronic, verbal, or written communication sent
+       to the Licensor or its representatives, including but not limited to
+       communication on electronic mailing lists, source code control systems,
+       and issue tracking systems that are managed by, or on behalf of, the
+       Licensor for the purpose of discussing and improving the Work, but
+       excluding communication that is conspicuously marked or otherwise
+       designated in writing by the copyright owner as "Not a Contribution."
+
+       "Contributor" shall mean Licensor and any individual or Legal Entity
+       on behalf of whom a Contribution has been received by Licensor and
+       subsequently incorporated within the Work.
+
+    2. Grant of Copyright License. Subject to the terms and conditions of
+       this License, each Contributor hereby grants to You a perpetual,
+       worldwide, non-exclusive, no-charge, royalty-free, irrevocable
+       copyright license to reproduce, prepare Derivative Works of,
+       publicly display, publicly perform, sublicense, and distribute the
+       Work and such Derivative Works in Source or Object form.
+
+    3. Grant of Patent License. Subject to the terms and conditions of
+       this License, each Contributor hereby grants to You a perpetual,
+       worldwide, non-exclusive, no-charge, royalty-free, irrevocable
+       (except as stated in this section) patent license to make, have made,
+       use, offer to sell, sell, import, and otherwise transfer the Work,
+       where such license applies only to those patent claims licensable
+       by such Contributor that are necessarily infringed by their
+       Contribution(s) alone or by combination of their Contribution(s)
+       with the Work to which such Contribution(s) was submitted. If You
+       institute patent litigation against any entity (including a
+       cross-claim or counterclaim in a lawsuit) alleging that the Work
+       or a Contribution incorporated within the Work constitutes direct
+       or contributory patent infringement, then any patent licenses
+       granted to You under this License for that Work shall terminate
+       as of the date such litigation is filed.
+
+    4. Redistribution. You may reproduce and distribute copies of the
+       Work or Derivative Works thereof in any medium, with or without
+       modifications, and in Source or Object form, provided that You
+       meet the following conditions:
+
+       (a) You must give any other recipients of the Work or
+           Derivative Works a copy of this License; and
+
+       (b) You must cause any modified files to carry prominent notices
+           stating that You changed the files; and
+
+       (c) You must retain, in the Source form of any Derivative Works
+           that You distribute, all copyright, patent, trademark, and
+           attribution notices from the Source form of the Work,
+           excluding those notices that do not pertain to any part of
+           the Derivative Works; and
+
+       (d) If the Work includes a "NOTICE" text file as part of its
+           distribution, then any Derivative Works that You distribute must
+           include a readable copy of the attribution notices contained
+           within such NOTICE file, excluding those notices that do not
+           pertain to any part of the Derivative Works, in at least one
+           of the following places: within a NOTICE text file distributed
+           as part of the Derivative Works; within the Source form or
+           documentation, if provided along with the Derivative Works; or,
+           within a display generated by the Derivative Works, if and
+           wherever such third-party notices normally appear. The contents
+           of the NOTICE file are for informational purposes only and
+           do not modify the License. You may add Your own attribution
+           notices within Derivative Works that You distribute, alongside
+           or as an addendum to the NOTICE text from the Work, provided
+           that such additional attribution notices cannot be construed
+           as modifying the License.
+
+       You may add Your own copyright statement to Your modifications and
+       may provide additional or different license terms and conditions
+       for use, reproduction, or distribution of Your modifications, or
+       for any such Derivative Works as a whole, provided Your use,
+       reproduction, and distribution of the Work otherwise complies with
+       the conditions stated in this License.
+
+    5. Submission of Contributions. Unless You explicitly state otherwise,
+       any Contribution intentionally submitted for inclusion in the Work
+       by You to the Licensor shall be under the terms and conditions of
+       this License, without any additional terms or conditions.
+       Notwithstanding the above, nothing herein shall supersede or modify
+       the terms of any separate license agreement you may have executed
+       with Licensor regarding such Contributions.
+
+    6. Trademarks. This License does not grant permission to use the trade
+       names, trademarks, service marks, or product names of the Licensor,
+       except as required for reasonable and customary use in describing the
+       origin of the Work and reproducing the content of the NOTICE file.
+
+    7. Disclaimer of Warranty. Unless required by applicable law or
+       agreed to in writing, Licensor provides the Work (and each
+       Contributor provides its Contributions) on an "AS IS" BASIS,
+       WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
+       implied, including, without limitation, any warranties or conditions
+       of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
+       PARTICULAR PURPOSE. You are solely responsible for determining the
+       appropriateness of using or redistributing the Work and assume any
+       risks associated with Your exercise of permissions under this License.
+
+    8. Limitation of Liability. In no event and under no legal theory,
+       whether in tort (including negligence), contract, or otherwise,
+       unless required by applicable law (such as deliberate and grossly
+       negligent acts) or agreed to in writing, shall any Contributor be
+       liable to You for damages, including any direct, indirect, special,
+       incidental, or consequential damages of any character arising as a
+       result of this License or out of the use or inability to use the
+       Work (including but not limited to damages for loss of goodwill,
+       work stoppage, computer failure or malfunction, or any and all
+       other commercial damages or losses), even if such Contributor
+       has been advised of the possibility of such damages.
+
+    9. Accepting Warranty or Additional Liability. While redistributing
+       the Work or Derivative Works thereof, You may choose to offer,
+       and charge a fee for, acceptance of support, warranty, indemnity,
+       or other liability obligations and/or rights consistent with this
+       License. However, in accepting such obligations, You may act only
+       on Your own behalf and on Your sole responsibility, not on behalf
+       of any other Contributor, and only if You agree to indemnify,
+       defend, and hold each Contributor harmless for any liability
+       incurred by, or claims asserted against, such Contributor by reason
+       of your accepting any such warranty or additional liability.
+
+    END OF TERMS AND CONDITIONS
+
+    Copyright 2026 Grai Team
+
+    Licensed under the Apache License, Version 2.0 (the "License");
+    you may not use this file except in compliance with the License.
+    You may obtain a copy of the License at
+
+        http://www.apache.org/licenses/LICENSE-2.0
+
+    Unless required by applicable law or agreed to in writing, software
+    distributed under the License is distributed on an "AS IS" BASIS,
+    WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+    See the License for the specific language governing permissions and
+    limitations under the License.
README.md ADDED
@@ -0,0 +1,377 @@
+ ---
+ license: apache-2.0
+ language:
+ - en
+ tags:
+ - cross-encoder
+ - reranker
+ - radiology
+ - medical
+ - retrieval
+ - sentence-similarity
+ - healthcare
+ - clinical
+ base_model: cross-encoder/ms-marco-MiniLM-L-12-v2
+ pipeline_tag: text-classification
+ library_name: sentence-transformers
+ datasets:
+ - radiology-education-corpus
+ metrics:
+ - mrr
+ - ndcg
+ model-index:
+ - name: RadLITE-Reranker
+   results:
+   - task:
+       type: reranking
+       name: Document Reranking
+     dataset:
+       name: RadLIT-9 (Radiology Retrieval Benchmark)
+       type: radiology-retrieval
+     metrics:
+     - type: mrr
+       value: 0.829
+       name: MRR (with bi-encoder)
+     - type: mrr_improvement
+       value: 0.303
+       name: MRR Improvement on ACR Core Exam (+30.3%)
+ ---
+
+ # RadLITE-Reranker
+
+ **Radiology Late Interaction Transformer Enhanced - Cross-Encoder Reranker**
+
+ A domain-specialized cross-encoder for reranking radiology search results. This model takes a query-document pair and predicts a relevance score, providing more accurate ranking than bi-encoder similarity alone.
+
+ > **Recommended:** Use this reranker together with [RadLITE-Encoder](https://huggingface.co/matulichpt/RadLITE-Encoder) in a two-stage pipeline for optimal performance. The bi-encoder handles fast retrieval over large corpora, then this cross-encoder reranks the top candidates for precision. This combination achieves **MRR 0.829** on radiology benchmarks (+30% on board exam questions).
+
+ ## Model Description
+
+ | Property | Value |
+ |----------|-------|
+ | **Model Type** | Cross-Encoder (Reranker) |
+ | **Base Model** | [ms-marco-MiniLM-L-12-v2](https://huggingface.co/cross-encoder/ms-marco-MiniLM-L-12-v2) |
+ | **Domain** | Radiology / Medical Imaging |
+ | **Hidden Size** | 384 |
+ | **Max Sequence Length** | 512 tokens |
+ | **Output** | Single relevance score |
+ | **License** | Apache 2.0 |
+
+ ### Why Use a Reranker?
+
+ Bi-encoders (like RadLITE-Encoder) are fast but encode query and document independently. Cross-encoders process them together, capturing fine-grained interactions:
+
+ | Approach | Speed | Accuracy | Use Case |
+ |----------|-------|----------|----------|
+ | Bi-Encoder | Fast (1000s docs/sec) | Good | First-stage retrieval |
+ | Cross-Encoder | Slow (10s docs/sec) | Excellent | Reranking top candidates |
+
+ **Two-stage pipeline**: Use the bi-encoder to get the top 50-100 candidates, then rerank with the cross-encoder for best results.
+
+ ## Performance
+
+ ### Impact on RadLIT-9 Benchmark
+
+ | Configuration | MRR | Improvement |
+ |---------------|-----|-------------|
+ | Bi-Encoder only | 0.78 | baseline |
+ | **Bi-Encoder + Reranker** | **0.829** | **+6.3%** |
+
+ ### ACR Core Exam (Board-Style Questions)
+
+ | Dataset | With Reranker | Without | Improvement |
+ |---------|---------------|---------|-------------|
+ | Core Exam Chest | 0.533 | 0.409 | **+30.3%** |
+ | Core Exam Combined | 0.466 | 0.381 | **+22.5%** |
+
+ The reranker is especially valuable for the complex, multi-part queries typical of board exam questions.
+
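MRR (mean reciprocal rank), the metric reported in the tables above, averages the reciprocal of the rank at which the first relevant document appears per query. A generic sketch of the computation (my illustration, not code shipped with this model):

```python
def mean_reciprocal_rank(ranked_relevance):
    """ranked_relevance: one list per query, each holding booleans
    (True = relevant) in the order the system ranked the results."""
    total = 0.0
    for results in ranked_relevance:
        for rank, is_relevant in enumerate(results, start=1):
            if is_relevant:
                total += 1.0 / rank
                break  # only the first relevant hit counts
    return total / len(ranked_relevance)

# Two queries: first relevant hit at rank 1 and rank 2 -> (1 + 0.5) / 2
print(mean_reciprocal_rank([[True, False], [False, True, False]]))  # 0.75
```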
+ ## Quick Start
+
+ ### Installation
+
+ ```bash
+ pip install "sentence-transformers>=2.2.0"
+ ```
+
+ ### Basic Usage
+
+ ```python
+ from sentence_transformers import CrossEncoder
+
+ # Load the reranker
+ reranker = CrossEncoder("matulichpt/RadLITE-Reranker", max_length=512)
+
+ # Query and candidate documents
+ query = "What are the imaging features of hepatocellular carcinoma?"
+ documents = [
+     "HCC typically shows arterial enhancement with portal venous washout on CT.",
+     "Fatty liver disease presents as decreased attenuation on non-contrast CT.",
+     "Hepatic hemangiomas show peripheral nodular enhancement.",
+ ]
+
+ # Create query-document pairs
+ pairs = [[query, doc] for doc in documents]
+
+ # Get relevance scores
+ scores = reranker.predict(pairs)
+
+ # Apply temperature calibration (RECOMMENDED)
+ calibrated_scores = scores / 1.5
+
+ print("Scores:", calibrated_scores)
+ # The document about HCC will have the highest score
+ ```
+
+ ### Temperature Calibration
+
+ **Important**: This model outputs scores with high variance. Apply temperature scaling for better fusion with other signals:
+
+ ```python
+ # Raw scores might be: [4.2, -1.5, 0.8]
+ # After calibration:   [2.8, -1.0, 0.53]
+
+ TEMPERATURE = 1.5  # Recommended value
+
+ def calibrated_predict(reranker, pairs):
+     raw_scores = reranker.predict(pairs)
+     return raw_scores / TEMPERATURE
+ ```
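Why dividing by a temperature helps: when calibrated logits are later squashed through a sigmoid for fusion, scaling pulls them out of the saturated region near 0 and 1, so they blend more usefully with other signals. A minimal NumPy illustration using the example values above (my sketch, not model output):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

raw = np.array([4.2, -1.5, 0.8])  # raw cross-encoder logits
calibrated = raw / 1.5            # temperature scaling

print(sigmoid(raw))         # near-saturated probabilities
print(sigmoid(calibrated))  # softer values, better suited to weighted fusion
```

Note that dividing by a positive constant never changes the ranking itself, only the spread of the scores.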
+
+ ### Full Two-Stage Search Pipeline
+
+ ```python
+ from sentence_transformers import SentenceTransformer, CrossEncoder
+ import numpy as np
+
+ class RadLITESearch:
+     def __init__(self, device="cuda"):
+         # Stage 1: Fast bi-encoder
+         self.encoder = SentenceTransformer(
+             "matulichpt/RadLITE-Encoder",
+             device=device
+         )
+         # Stage 2: Precise reranker
+         self.reranker = CrossEncoder(
+             "matulichpt/RadLITE-Reranker",
+             max_length=512,
+             device=device
+         )
+         self.temperature = 1.5
+         self.corpus_embeddings = None
+         self.corpus = None
+
+     def index_corpus(self, documents: list):
+         """Pre-compute embeddings for your corpus."""
+         self.corpus = documents
+         self.corpus_embeddings = self.encoder.encode(
+             documents,
+             normalize_embeddings=True,
+             show_progress_bar=True,
+             batch_size=32
+         )
+
+     def search(self, query: str, top_k: int = 10, candidates: int = 50):
+         """Two-stage search: retrieve then rerank."""
+         # Stage 1: Bi-encoder retrieval
+         query_emb = self.encoder.encode(query, normalize_embeddings=True)
+         scores = query_emb @ self.corpus_embeddings.T
+         top_indices = np.argsort(scores)[-candidates:][::-1]
+
+         # Stage 2: Cross-encoder reranking
+         candidate_docs = [self.corpus[i] for i in top_indices]
+         pairs = [[query, doc] for doc in candidate_docs]
+         rerank_scores = self.reranker.predict(pairs) / self.temperature
+
+         # Sort by reranked scores
+         sorted_indices = np.argsort(rerank_scores)[::-1]
+
+         results = []
+         for idx in sorted_indices[:top_k]:
+             results.append({
+                 "document": candidate_docs[idx],
+                 "corpus_index": int(top_indices[idx]),
+                 "score": float(rerank_scores[idx]),
+                 "biencoder_score": float(scores[top_indices[idx]])
+             })
+         return results
+
+
+ # Usage
+ searcher = RadLITESearch()
+ searcher.index_corpus(your_radiology_documents)
+ results = searcher.search("pneumothorax CT findings")
+ ```
+
+ ## Integration with Any Corpus
+
+ ### Radiopaedia / Educational Content
+
+ ```python
+ import json
+
+ # Load your content (e.g., Radiopaedia articles)
+ with open("radiopaedia_articles.json") as f:
+     articles = json.load(f)
+
+ corpus = [article["content"] for article in articles]
+
+ # Initialize search
+ searcher = RadLITESearch()
+ searcher.index_corpus(corpus)
+
+ # Search
+ results = searcher.search("classic findings of pulmonary embolism on CTPA")
+
+ for r in results[:5]:
+     print(f"Score: {r['score']:.3f}")
+     print(f"Content: {r['document'][:200]}...")
+     print()
+ ```
+
+ ### Integration with Elasticsearch/OpenSearch
+
+ ```python
+ from sentence_transformers import CrossEncoder
+
+ reranker = CrossEncoder("matulichpt/RadLITE-Reranker", max_length=512)
+
+ def rerank_elasticsearch_results(query: str, es_results: list, top_k: int = 10):
+     """Rerank Elasticsearch BM25 results."""
+     documents = [hit["_source"]["content"] for hit in es_results]
+     pairs = [[query, doc] for doc in documents]
+
+     scores = reranker.predict(pairs) / 1.5  # Temperature calibration
+
+     # Combine with ES scores (optional)
+     for i, hit in enumerate(es_results):
+         hit["rerank_score"] = float(scores[i])
+         hit["combined_score"] = 0.3 * hit["_score"] + 0.7 * scores[i]
+
+     # Sort by combined score
+     reranked = sorted(es_results, key=lambda x: x["combined_score"], reverse=True)
+     return reranked[:top_k]
+ ```
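The weighted sum above mixes raw BM25 scores (unbounded) with calibrated logits, which can be fragile when the scales drift. A scale-free alternative worth considering is reciprocal rank fusion (RRF), which combines signals using only each document's rank in each list. A generic sketch (my addition, not part of this model's tooling):

```python
def rrf(rankings, k=60):
    """rankings: one ranked list of doc ids per signal (e.g. BM25 order,
    reranker order). Returns doc ids sorted by RRF score; k=60 is the
    commonly used smoothing constant."""
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

bm25_order = ["d3", "d1", "d2"]
reranker_order = ["d1", "d2", "d3"]
print(rrf([bm25_order, reranker_order]))  # ['d1', 'd3', 'd2']
```

Because RRF ignores score magnitudes entirely, it needs no temperature calibration or weight tuning, at the cost of discarding the confidence information in the scores.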
+
+ ## Optimal Fusion Weights
+
+ When combining multiple signals (bi-encoder, cross-encoder, BM25), use these weights:
+
+ ```python
+ # Optimal weights from grid search on RadLIT-9
+ FUSION_WEIGHTS = {
+     "biencoder": 0.5,     # RadLITE-Encoder similarity
+     "crossencoder": 0.2,  # RadLITE-Reranker (after temp calibration)
+     "bm25": 0.3           # Lexical matching (if available)
+ }
+
+ def fused_score(bienc_score, ce_score, bm25_score=0):
+     return (
+         FUSION_WEIGHTS["biencoder"] * bienc_score +
+         FUSION_WEIGHTS["crossencoder"] * ce_score +
+         FUSION_WEIGHTS["bm25"] * bm25_score
+     )
+ ```
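These weights implicitly assume the three signals are on comparable scales; cosine similarity lives in [-1, 1] while BM25 is unbounded. A common safeguard is to min-max normalize each signal across the candidate set before fusing. A generic sketch under that assumption (my addition, not the published pipeline):

```python
import numpy as np

def minmax(x):
    """Rescale a score vector to [0, 1]; constant vectors map to zeros."""
    x = np.asarray(x, dtype=float)
    span = x.max() - x.min()
    return np.zeros_like(x) if span == 0 else (x - x.min()) / span

def fuse(bienc, ce, bm25, w=(0.5, 0.2, 0.3)):
    """Weighted fusion of per-candidate score vectors after normalization."""
    return w[0] * minmax(bienc) + w[1] * minmax(ce) + w[2] * minmax(bm25)

# Three candidates scored by each signal; the fused vector ranks them
scores = fuse([0.8, 0.6, 0.7], [2.1, -0.4, 1.0], [12.0, 3.5, 8.0])
print(scores)
```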
+
+ ## Architecture
+
+ ```
+ [Query] + [SEP] + [Document]
+              |
+              v
+      [BERT Tokenizer]
+              |
+              v
+  [MiniLM Encoder] (12 layers, 384 hidden)
+              |
+              v
+    [Classification Head]
+              |
+              v
+   Relevance Score (float)
+ ```
+
+ ## Training Details
+
+ - **Base Model**: ms-marco-MiniLM-L-12-v2 (trained on MS MARCO passage ranking)
+ - **Fine-tuning**: Radiology query-document relevance pairs
+ - **Training Steps**: 5,626
+ - **Best Validation Loss**: 0.691
+ - **Learning Rate**: 2e-5
+ - **Batch Size**: 32
+ - **Category Weighting**: Yes (balanced across radiology subspecialties)
+
+ ## Best Practices
+
+ ### 1. Always Use Temperature Calibration
+
+ Raw cross-encoder scores can be extreme. Temperature scaling (1.5) produces better fusion:
+
+ ```python
+ calibrated = raw_score / 1.5
+ ```
+
+ ### 2. Limit Candidates for Reranking
+
+ Cross-encoders are slow. Only rerank the top 50-100 candidates from the bi-encoder:
+
+ ```python
+ # Good: Rerank top 50
+ rerank_candidates = 50
+
+ # Bad: Rerank entire corpus
+ rerank_candidates = len(corpus)  # Too slow!
+ ```
+
+ ### 3. Batch Predictions
+
+ ```python
+ # Efficient: Single batch call
+ pairs = [[query, doc] for doc in candidates]
+ scores = reranker.predict(pairs, batch_size=32)
+
+ # Inefficient: Individual calls
+ scores = [reranker.predict([[query, doc]])[0] for doc in candidates]
+ ```
+
+ ### 4. GPU Acceleration
+
+ ```python
+ reranker = CrossEncoder(
+     "matulichpt/RadLITE-Reranker",
+     max_length=512,
+     device="cuda"  # Use GPU
+ )
+ ```
+
+ ## Limitations
+
+ - **English only**: Trained on English radiology text
+ - **Speed**: ~10-50 pairs/second (use for reranking, not the full corpus)
+ - **512 token limit**: Long documents are truncated
+ - **Domain-specific**: Optimized for radiology; may underperform on general medical content
+
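One way to work around the 512-token truncation limit is to split long documents into overlapping chunks, score each chunk against the query, and assign the document its best chunk's score. The word-based splitter below is a rough approximation of the token budget (my sketch, not shipped with the model):

```python
def chunk_words(text, chunk_size=350, overlap=50):
    """Split text into overlapping word windows that should stay
    under the 512-token limit once the query is prepended."""
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, max(len(words) - overlap, 1), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
    return chunks

# Hypothetical usage with the reranker (max-pooling over chunk scores):
# pairs = [[query, chunk] for chunk in chunk_words(long_doc)]
# doc_score = max(reranker.predict(pairs) / 1.5)
```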
+ ## Citation
+
+ If you use RadLITE in your work, please cite:
+
+ ```bibtex
+ @software{radlite_2026,
+   title  = {RadLITE: Calibrated Multi-Stage Retrieval for Radiology Education},
+   author = {Grai Team},
+   year   = {2026},
+   month  = {January},
+   url    = {https://huggingface.co/matulichpt/RadLITE-Reranker},
+   note   = {+30% MRR improvement on ACR Core Exam questions}
+ }
+ ```
+
+ ## Related Models
+
+ - [RadLITE-Encoder](https://huggingface.co/matulichpt/RadLITE-Encoder) - Bi-encoder for first-stage retrieval
+ - [RadBERT-RoBERTa-4m](https://huggingface.co/zzxslp/RadBERT-RoBERTa-4m) - Base radiology language model
+
+ ## License
+
+ Apache 2.0 - Free for commercial and research use.
config.json ADDED
@@ -0,0 +1,32 @@
+ {
+   "architectures": [
+     "BertForSequenceClassification"
+   ],
+   "attention_probs_dropout_prob": 0.1,
+   "classifier_dropout": null,
+   "dtype": "float32",
+   "gradient_checkpointing": false,
+   "hidden_act": "gelu",
+   "hidden_dropout_prob": 0.1,
+   "hidden_size": 384,
+   "id2label": {
+     "0": "relevance"
+   },
+   "initializer_range": 0.02,
+   "intermediate_size": 1536,
+   "label2id": {
+     "relevance": 0
+   },
+   "layer_norm_eps": 1e-12,
+   "max_position_embeddings": 512,
+   "model_type": "bert",
+   "num_attention_heads": 12,
+   "num_hidden_layers": 12,
+   "pad_token_id": 0,
+   "position_embedding_type": "absolute",
+   "sbert_ce_default_activation_function": "torch.nn.modules.linear.Identity",
+   "transformers_version": "4.56.0",
+   "type_vocab_size": 2,
+   "use_cache": true,
+   "vocab_size": 30522
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:3dfc8832e0d99ed4c39d357bd5be9ea2552eab7107daa09b30db39a43f741a73
+ size 133464836
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
+ {
+   "cls_token": {
+     "content": "[CLS]",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "mask_token": {
+     "content": "[MASK]",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "pad_token": {
+     "content": "[PAD]",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "sep_token": {
+     "content": "[SEP]",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "unk_token": {
+     "content": "[UNK]",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   }
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,58 @@
+ {
+   "added_tokens_decoder": {
+     "0": {
+       "content": "[PAD]",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "100": {
+       "content": "[UNK]",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "101": {
+       "content": "[CLS]",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "102": {
+       "content": "[SEP]",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "103": {
+       "content": "[MASK]",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     }
+   },
+   "clean_up_tokenization_spaces": true,
+   "cls_token": "[CLS]",
+   "do_basic_tokenize": true,
+   "do_lower_case": true,
+   "extra_special_tokens": {},
+   "mask_token": "[MASK]",
+   "model_max_length": 512,
+   "never_split": null,
+   "pad_token": "[PAD]",
+   "sep_token": "[SEP]",
+   "strip_accents": null,
+   "tokenize_chinese_chars": true,
+   "tokenizer_class": "BertTokenizer",
+   "unk_token": "[UNK]"
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff