The UniPath-7B model and its associated materials are released under the CC-BY-NC-ND 4.0 license. Access is restricted to non-commercial, academic research purposes only, with proper citation required. Any commercial usage, redistribution, or derivative work (including training models based on this model or generating datasets from its outputs) is strictly prohibited without prior written approval.
Users must register with an official institutional email address (generic domains such as @gmail, @qq, @hotmail, etc. will not be accepted). By requesting access, you confirm that your information is accurate and current, and that you agree to comply with all terms listed herein. If other members of your organization wish to use the model, they must register independently and agree to the same terms.

[CVPR 2026] Beyond Pixel Simulation: Pathology Image Generation via Diagnostic Semantic Tokens and Prototype Control

🏠 Project Page | 📖 Paper | 🤗 UniPath-1M | 🤗 UniPath-68K | 🧠 Model Weight

Abstract: In computational pathology, understanding and generation have evolved along disparate paths: advanced understanding models already exhibit diagnostic-level competence, whereas generative models largely simulate pixels. Progress remains hindered by three coupled factors: the scarcity of large, high-quality image–text corpora; the lack of precise, fine-grained semantic control, which forces reliance on non-semantic cues; and terminological heterogeneity, where diverse phrasings for the same diagnostic concept impede reliable text conditioning. We introduce UniPath, a semantics-driven pathology image generation framework that leverages mature diagnostic understanding to enable controllable generation. UniPath implements Multi-Stream Control: a Raw-Text stream; a High-Level Semantics stream that uses learnable queries to a frozen pathology MLLM to distill paraphrase-robust Diagnostic Semantic Tokens and to expand prompts into diagnosis-aware attribute bundles; and a Prototype stream that affords component-level morphological control via a prototype bank. On the data front, we curate a 2.65M image–text corpus and a finely annotated, high-quality 68K subset to alleviate data scarcity. For a comprehensive assessment, we establish a four-tier evaluation hierarchy tailored to pathology. Extensive experiments demonstrate UniPath's SOTA performance, including a Patho‑FID of 80.9 (51% better than the second-best) and fine-grained semantic control achieving 98.7% of the real-image level.


Highlights

  • Semantic-first pathology generation: shifts from pixel-level imitation to diagnosis-aware semantic control.
  • Multi-Stream Control: combines raw text, distilled diagnostic semantic tokens, and prototype-level morphology guidance.
  • Data curation at scale: uses a 2.65M corpus plus a high-quality 68K subset for robust training.
  • Pathology-specific evaluation: introduces a four-tier benchmark protocol for generation quality and controllability.

Installation

Recommended environment:

  • Python 3.11
  • GPU with at least 24 GB VRAM

Install dependencies:

pip install -r requirements.txt
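Before running inference, it may help to confirm that the environment matches the recommendation above. The following is a minimal sketch and not part of the repository: the helper names `python_ok`, `vram_ok`, and `detect_vram_bytes` are hypothetical, and the GPU check assumes PyTorch is among the installed dependencies, which the source does not state.

```python
import sys

MIN_VRAM_GB = 24  # recommended minimum from the list above


def python_ok(version=sys.version_info, recommended=(3, 11)):
    """Return True if the major.minor version equals the recommended one."""
    return tuple(version[:2]) == recommended


def vram_ok(total_bytes, min_gb=MIN_VRAM_GB):
    """Return True if total_bytes is at least min_gb decimal gigabytes.

    Decimal GB is used as a lenient threshold (an assumption of this sketch).
    """
    return total_bytes >= min_gb * 10**9


def detect_vram_bytes():
    """Return total memory of GPU 0 in bytes, or None if it cannot be read.

    Assumption: PyTorch is installed; the source does not confirm this.
    """
    try:
        import torch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    return torch.cuda.get_device_properties(0).total_memory


if __name__ == "__main__":
    print("Python 3.11:", python_ok())
    vram = detect_vram_bytes()
    if vram is None:
        print("GPU memory: could not be determined")
    else:
        print(f"GPU memory >= {MIN_VRAM_GB} GB:", vram_ok(vram))
```

A different Python version is not necessarily unusable; the check only reports whether the interpreter matches the recommended one.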

Quickstart

Run from the repository root:

python src/inference.py \
  --model_path /path/to/checkpoints \
  --rag_root_dir /path/to/RAG_8K \
  --output_dir ./generated_images \
  --num_seeds 5
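The same invocation can be driven from Python, for example when looping over several checkpoints or output folders. This is a sketch under stated assumptions: it uses only the four flags shown above, the function names `build_inference_command` and `run_inference` are hypothetical, and nothing here is claimed about what `src/inference.py` does internally.

```python
import subprocess
import sys


def build_inference_command(model_path, rag_root_dir,
                            output_dir="./generated_images", num_seeds=5):
    """Assemble the argument list for src/inference.py with the flags shown above."""
    return [
        sys.executable, "src/inference.py",
        "--model_path", str(model_path),
        "--rag_root_dir", str(rag_root_dir),
        "--output_dir", str(output_dir),
        "--num_seeds", str(num_seeds),
    ]


def run_inference(**kwargs):
    """Run the command from the current directory (expected: the repository root).

    Raises subprocess.CalledProcessError if the script exits with a non-zero code.
    """
    return subprocess.run(build_inference_command(**kwargs), check=True)


if __name__ == "__main__":
    # Placeholder paths, copied from the command above; replace before running.
    cmd = build_inference_command("/path/to/checkpoints", "/path/to/RAG_8K")
    print(" ".join(cmd))
```

Passing the arguments as a list rather than a single shell string avoids quoting problems when paths contain spaces.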

RAG Data Requirements

src/inference.py expects --rag_root_dir to contain:

<RAG_ROOT>/
  llm_filtered_vocab_gemini_pro.txt
  keyword_inverted_index.json
  selected_8k.h5
  images/
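A quick way to catch a misconfigured `--rag_root_dir` before launching inference is to check for the entries listed above. The sketch below is illustrative only: `missing_rag_entries` is a hypothetical helper, and it checks only that the files and the `images/` directory exist, not that their contents are valid.

```python
import sys
from pathlib import Path

# Entries taken from the layout listed above.
REQUIRED_FILES = (
    "llm_filtered_vocab_gemini_pro.txt",
    "keyword_inverted_index.json",
    "selected_8k.h5",
)
REQUIRED_DIRS = ("images",)


def missing_rag_entries(rag_root):
    """Return the names of required files and directories absent from rag_root."""
    root = Path(rag_root)
    missing = [name for name in REQUIRED_FILES if not (root / name).is_file()]
    missing += [name + "/" for name in REQUIRED_DIRS if not (root / name).is_dir()]
    return missing


if __name__ == "__main__":
    rag_root = sys.argv[1] if len(sys.argv) > 1 else "."
    missing = missing_rag_entries(rag_root)
    if missing:
        print(f"Missing under {rag_root}: {', '.join(missing)}")
    else:
        print(f"{rag_root} contains all expected entries")
```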

Acknowledgements

This repository substantially reuses and adapts components from other open-source projects. We thank the original authors for open-sourcing their code and model weights.

Citation

If you find UniPath useful, please cite:

@article{han2025beyond,
  title={Beyond Pixel Simulation: Pathology Image Generation via Diagnostic Semantic Tokens and Prototype Control},
  author={Han, Minghao and Liu, YiChen and Liu, Yizhou and Chen, Zizhi and Tang, Jingqun and Wu, Xuecheng and Yang, Dingkang and Zhang, Lihua},
  journal={arXiv preprint arXiv:2512.21058},
  year={2025}
}

Model size: 9B params (Safetensors, tensor type BF16)