Improve model card: add paper link, GitHub link, and summary
#1
by nielsr HF Staff - opened
README.md
CHANGED
|
@@ -1,7 +1,8 @@
|
|
| 1 |
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
language:
|
| 4 |
- en
|
|
|
|
|
|
|
| 5 |
tags:
|
| 6 |
- cytology
|
| 7 |
- hematology
|
|
@@ -10,14 +11,25 @@ tags:
|
|
| 10 |
- pytorch
|
| 11 |
- self-supervised
|
| 12 |
- vit
|
| 13 |
-
pipeline_tag: image-feature-extraction
|
| 14 |
---
|
| 15 |
|
| 16 |
-
|
| 17 |
# GenBloom
|
| 18 |
|
| 19 |
-
|
| 20 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 21 |
|
| 22 |
```text
|
| 23 |
checkpoints/
|
|
@@ -31,6 +43,32 @@ checkpoints/
|
|
| 31 |
genbloom_g_fold4.pth
|
| 32 |
```
|
| 33 |
|
| 34 |
-
The `genbloom_v` checkpoint corresponds to image-only pretraining.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 35 |
|
| 36 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
|
|
|
| 2 |
language:
|
| 3 |
- en
|
| 4 |
+
license: apache-2.0
|
| 5 |
+
pipeline_tag: image-feature-extraction
|
| 6 |
tags:
|
| 7 |
- cytology
|
| 8 |
- hematology
|
|
|
|
| 11 |
- pytorch
|
| 12 |
- self-supervised
|
| 13 |
- vit
|
|
|
|
| 14 |
---
|
| 15 |
|
|
|
|
| 16 |
# GenBloom
|
| 17 |
|
| 18 |
+
[GenBloom](https://huggingface.co/papers/2605.29980) is a genetically-aligned foundation model for peripheral blood smears. It aligns single white blood cell images with chromosomal aberrations (karyotype) and somatic mutations from targeted gene panels.
|
| 19 |
+
|
| 20 |
+
For the source code, setup, and evaluation scripts, see the [GenBloom GitHub repository](https://github.com/marrlab/GenBloom).
|
| 21 |
+
|
| 22 |
+
## Model Description
|
| 23 |
+
|
| 24 |
+
GenBloom is a patient-level encoder trained using a two-stage approach:
|
| 25 |
+
1. **GenBloom-V (Self-supervised Pretraining)**: Vision-only pretraining of a transformer aggregator using an iBOT head on a cohort of over 1,500 patients.
|
| 26 |
+
2. **GenBloom-G (Genetic Alignment)**: Further alignment of visual features with chromosomal aberrations and somatic mutations via supervised contrastive loss on acute myeloid leukemia patients.
|
| 27 |
+
|
| 28 |
+
The model provides improved representations for hematological diagnostic tasks and provides off-the-shelf retrieval capabilities for diseases and genetic alterations.
|
| 29 |
+
|
| 30 |
+
## Checkpoints
|
| 31 |
+
|
| 32 |
+
This repository contains the model weights used for the public visual downstream reproduction:
|
| 33 |
|
| 34 |
```text
|
| 35 |
checkpoints/
|
|
|
|
| 43 |
genbloom_g_fold4.pth
|
| 44 |
```
|
| 45 |
|
| 46 |
+
- The `genbloom_v` checkpoint corresponds to image-only pretraining.
|
| 47 |
+
- The `genbloom_g` checkpoints were further genetically aligned.
|
| 48 |
+
|
| 49 |
+
## Usage
|
| 50 |
+
|
| 51 |
+
### Download Checkpoints
|
| 52 |
+
You can download the checkpoints using the `huggingface_hub` library:
|
| 53 |
+
|
| 54 |
+
```python
|
| 55 |
+
from huggingface_hub import snapshot_download
|
| 56 |
+
|
| 57 |
+
snapshot_download("MarrLab/GenBloom", local_dir="checkpoints")
|
| 58 |
+
```
|
| 59 |
+
|
| 60 |
+
### Inference
|
| 61 |
+
A minimal end-to-end inference example is available in the [`inference_genbloom.ipynb`](https://github.com/marrlab/GenBloom/blob/main/inference_genbloom.ipynb) notebook in the official repository.
|
| 62 |
+
|
| 63 |
+
## Citation
|
| 64 |
+
|
| 65 |
+
If you use GenBloom in your research, please cite:
|
| 66 |
|
| 67 |
+
```bibtex
|
| 68 |
+
@article{genbloom2024,
|
| 69 |
+
title={Genetically Aligned Patient Representations Improve Hematological Diagnosis},
|
| 70 |
+
author={Adelpantidis, Georgios and others},
|
| 71 |
+
journal={arXiv preprint arXiv:2605.29980},
|
| 72 |
+
year={2024}
|
| 73 |
+
}
|
| 74 |
+
```
|