Improve model card: add paper link, GitHub link, and summary

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +45 -7
README.md CHANGED
@@ -1,7 +1,8 @@
1
  ---
2
- license: apache-2.0
3
  language:
4
  - en
 
 
5
  tags:
6
  - cytology
7
  - hematology
@@ -10,14 +11,25 @@ tags:
10
  - pytorch
11
  - self-supervised
12
  - vit
13
- pipeline_tag: image-feature-extraction
14
  ---
15
 
16
-
17
  # GenBloom
18
 
19
- This repository contains the model weights used for the public visual
20
- downstream reproduction.
 
 
 
 
 
 
 
 
 
 
 
 
 
21
 
22
  ```text
23
  checkpoints/
@@ -31,6 +43,32 @@ checkpoints/
31
  genbloom_g_fold4.pth
32
  ```
33
 
34
- The `genbloom_v` checkpoint corresponds to image-only pretraining. The `genbloom_g` checkpoints were further genetically aligned.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
35
 
36
- For the source code, see the [GenBloom GitHub repository](https://github.com/marrlab/GenBloom)
 
 
 
 
 
 
 
 
1
  ---
 
2
  language:
3
  - en
4
+ license: apache-2.0
5
+ pipeline_tag: image-feature-extraction
6
  tags:
7
  - cytology
8
  - hematology
 
11
  - pytorch
12
  - self-supervised
13
  - vit
 
14
  ---
15
 
 
16
  # GenBloom
17
 
18
+ [GenBloom](https://huggingface.co/papers/2605.29980) is a genetically-aligned foundation model for peripheral blood smears. It aligns single white blood cell images with chromosomal aberrations (karyotype) and somatic mutations from targeted gene panels.
19
+
20
+ For the source code, setup, and evaluation scripts, see the [GenBloom GitHub repository](https://github.com/marrlab/GenBloom).
21
+
22
+ ## Model Description
23
+
24
+ GenBloom is a patient-level encoder trained using a two-stage approach:
25
+ 1. **GenBloom-V (Self-supervised Pretraining)**: Vision-only pretraining of a transformer aggregator using an iBOT head on a cohort of over 1,500 patients.
26
+ 2. **GenBloom-G (Genetic Alignment)**: Further alignment of visual features with chromosomal aberrations and somatic mutations via supervised contrastive loss on acute myeloid leukemia patients.
27
+
28
+ The model provides improved representations for hematological diagnostic tasks and provides off-the-shelf retrieval capabilities for diseases and genetic alterations.
29
+
30
+ ## Checkpoints
31
+
32
+ This repository contains the model weights used for the public visual downstream reproduction:
33
 
34
  ```text
35
  checkpoints/
 
43
  genbloom_g_fold4.pth
44
  ```
45
 
46
+ - The `genbloom_v` checkpoint corresponds to image-only pretraining.
47
+ - The `genbloom_g` checkpoints were further genetically aligned.
48
+
49
+ ## Usage
50
+
51
+ ### Download Checkpoints
52
+ You can download the checkpoints using the `huggingface_hub` library:
53
+
54
+ ```python
55
+ from huggingface_hub import snapshot_download
56
+
57
+ snapshot_download("MarrLab/GenBloom", local_dir="checkpoints")
58
+ ```
59
+
60
+ ### Inference
61
+ A minimal end-to-end inference example is available in the [`inference_genbloom.ipynb`](https://github.com/marrlab/GenBloom/blob/main/inference_genbloom.ipynb) notebook in the official repository.
62
+
63
+ ## Citation
64
+
65
+ If you use GenBloom in your research, please cite:
66
 
67
+ ```bibtex
68
+ @article{genbloom2024,
69
+ title={Genetically Aligned Patient Representations Improve Hematological Diagnosis},
70
+ author={Adelpantidis, Georgios and others},
71
+ journal={arXiv preprint arXiv:2605.29980},
72
+ year={2024}
73
+ }
74
+ ```