canergen commited on
Commit
8119aac
·
verified ·
1 Parent(s): 0abff4f

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +116 -0
README.md ADDED
@@ -0,0 +1,116 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: popV
3
+ license: cc-by-4.0
4
+ tags:
5
+ - biology
6
+ - genomics
7
+ - single-cell
8
+ - AnnData:0.12.2
9
+ - scikit_learn:1.7.2
10
+ - organism:Homo sapiens
11
+ - Python:3.12.8
12
+ - popV:0.6.0
13
+ - 'tissue: Lung'
14
+ ---
15
+
16
+ Popular Vote (popV) model for automated cell type annotation of single-cell RNA-seq data. We provide here pretrained models
17
+ for plug-in use in your own analysis.
18
+ Follow our [tutorial](https://github.com/YosefLab/popV/blob/main/tabula_sapiens_tutorial.ipynb) to learn how to use the model for cell type annotation.
19
+
20
+ # Model description
21
+
22
+ Tabula Sapiens is a benchmark, first-draft human cell atlas of over 1.1M cells from 28 organs of 24 normal human subjects. This work is the product of the Tabula Sapiens Consortium. Taking the organs from the same individual controls for genetic background, age, environment, and epigenetic effects, and allows detailed analysis and comparison of cell types that are shared between tissues.
23
+
24
+ **Link to CELLxGENE**:
25
+ Link to the [data](https://cellxgene.cziscience.com/e/0d2ee4ac-05ee-40b2-afb6-ebb584caa867.cxg/) in the CELLxGENE browser for interactive exploration of the data and download of the source data.
26
+
27
+ **Training Code URL**:
28
+ Not provided by uploader.
29
+
30
+ # Metrics
31
+
32
+ We provide here accuracies for each of the experts and the ensemble model. The validation set accuracies are
33
+ computed on a 10% random subset of the data that was not used for training.
34
+
35
+ | Cell Type | N cells | celltypist | knn bbknn | knn harmony | knn on scvi | onclass | scanvi | svm | xgboost | Consensus Prediction |
36
+ | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
37
+ | macrophage | 1640 | 0.97 | 0.97 | 0.97 | 0.97 | 0.00 | 0.93 | 0.96 | 0.96 | 0.97 |
38
+ | pulmonary alveolar type 2 cell | 1146 | 0.97 | 0.98 | 0.98 | 0.97 | 0.00 | 0.97 | 0.98 | 0.97 | 0.98 |
39
+ | capillary endothelial cell | 695 | 0.96 | 0.96 | 0.97 | 0.96 | 0.00 | 0.95 | 0.96 | 0.96 | 0.97 |
40
+ | basal cell | 408 | 0.91 | 0.92 | 0.92 | 0.94 | 0.00 | 0.92 | 0.94 | 0.93 | 0.95 |
41
+ | pulmonary alveolar type 1 cell | 303 | 0.97 | 0.98 | 0.96 | 0.97 | 0.00 | 0.98 | 0.98 | 0.98 | 0.99 |
42
+ | intermediate monocyte | 287 | 0.66 | 0.64 | 0.71 | 0.68 | 0.00 | 0.64 | 0.75 | 0.86 | 0.77 |
43
+ | CD4-positive, alpha-beta T cell | 212 | 0.87 | 0.90 | 0.88 | 0.88 | 0.00 | 0.88 | 0.85 | 0.87 | 0.89 |
44
+ | CD8-positive, alpha-beta T cell | 196 | 0.84 | 0.87 | 0.85 | 0.83 | 0.00 | 0.84 | 0.80 | 0.84 | 0.86 |
45
+ | endothelial cell of artery | 161 | 0.79 | 0.81 | 0.83 | 0.81 | 0.00 | 0.77 | 0.74 | 0.79 | 0.84 |
46
+ | club cell | 172 | 0.78 | 0.85 | 0.87 | 0.83 | 0.00 | 0.82 | 0.85 | 0.83 | 0.88 |
47
+ | classical monocyte | 153 | 0.55 | 0.56 | 0.62 | 0.60 | 0.00 | 0.64 | 0.72 | 0.90 | 0.75 |
48
+ | vein endothelial cell | 140 | 0.83 | 0.81 | 0.83 | 0.84 | 0.00 | 0.84 | 0.80 | 0.86 | 0.88 |
49
+ | basophil | 153 | 0.99 | 0.99 | 0.99 | 0.99 | 0.00 | 0.99 | 0.97 | 0.98 | 0.99 |
50
+ | lung ciliated cell | 128 | 0.98 | 0.98 | 0.98 | 0.98 | 0.00 | 0.97 | 0.99 | 0.98 | 0.98 |
51
+ | alveolar adventitial fibroblast | 115 | 0.83 | 0.86 | 0.86 | 0.90 | 0.00 | 0.90 | 0.91 | 0.92 | 0.92 |
52
+ | respiratory goblet cell | 108 | 0.74 | 0.82 | 0.82 | 0.81 | 0.00 | 0.79 | 0.85 | 0.82 | 0.83 |
53
+ | natural killer cell | 102 | 0.85 | 0.89 | 0.88 | 0.89 | 0.00 | 0.90 | 0.88 | 0.89 | 0.89 |
54
+ | pericyte | 78 | 0.83 | 0.89 | 0.89 | 0.92 | 0.00 | 0.87 | 0.92 | 0.90 | 0.92 |
55
+ | B cell | 64 | 0.98 | 0.99 | 0.99 | 0.97 | 0.00 | 0.95 | 0.99 | 0.98 | 0.99 |
56
+ | adventitial cell | 61 | 0.83 | 0.84 | 0.78 | 0.84 | 0.00 | 0.83 | 0.85 | 0.89 | 0.89 |
57
+ | non-classical monocyte | 54 | 0.36 | 0.12 | 0.26 | 0.26 | 0.00 | 0.38 | 0.54 | 0.80 | 0.53 |
58
+ | monocyte | 50 | 0.34 | 0.24 | 0.38 | 0.39 | 0.00 | 0.42 | 0.42 | 0.73 | 0.52 |
59
+ | neutrophil | 37 | 0.93 | 0.96 | 0.93 | 0.97 | 0.00 | 0.95 | 0.96 | 0.91 | 0.97 |
60
+ | endothelial cell of lymphatic vessel | 30 | 0.95 | 0.94 | 0.94 | 0.94 | 0.00 | 0.95 | 0.98 | 0.97 | 0.97 |
61
+ | bronchial smooth muscle cell | 25 | 0.52 | 0.58 | 0.49 | 0.43 | 0.00 | 0.60 | 0.75 | 0.68 | 0.69 |
62
+ | mature NK T cell | 27 | 0.00 | 0.26 | 0.19 | 0.19 | 0.00 | 0.68 | 0.46 | 0.60 | 0.43 |
63
+ | plasma cell | 13 | 0.83 | 0.74 | 0.92 | 0.92 | 0.00 | 0.85 | 0.96 | 0.89 | 0.92 |
64
+ | vascular associated smooth muscle cell | 16 | 0.00 | 0.67 | 0.85 | 0.67 | 0.00 | 0.76 | 0.91 | 0.73 | 0.90 |
65
+ | myeloid dendritic cell | 5 | 0.00 | 0.00 | 0.80 | 0.57 | 0.00 | 0.46 | 0.50 | 1.00 | 0.75 |
66
+ | pulmonary ionocyte | 1 | 0.00 | 1.00 | 1.00 | 0.00 | 0.00 | 0.00 | 1.00 | 1.00 | 1.00 |
67
+ | plasmacytoid dendritic cell | 1 | 1.00 | 0.00 | 1.00 | 0.67 | 0.00 | 0.50 | 1.00 | 0.00 | 1.00 |
68
+ | mesothelial cell | 2 | 0.00 | 0.67 | 0.67 | 0.67 | 0.00 | 0.67 | 0.50 | 0.67 | 0.67 |
69
+ | serous cell of epithelium of bronchus | 2 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.19 | 0.80 | 1.00 | 0.00 |
70
+ | mast cell | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
71
+
72
+ The train accuracies are computed on the training data.
73
+
74
+ | Cell Type | N cells | celltypist | knn bbknn | knn harmony | knn on scvi | onclass | scanvi | svm | xgboost | Consensus Prediction |
75
+ | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
76
+ | macrophage | 14846 | 0.96 | 0.98 | 0.97 | 0.98 | 0.00 | 0.94 | 0.96 | 0.96 | 0.98 |
77
+ | pulmonary alveolar type 2 cell | 10448 | 0.97 | 0.97 | 0.98 | 0.98 | 0.00 | 0.96 | 0.97 | 0.97 | 0.98 |
78
+ | capillary endothelial cell | 6548 | 0.96 | 0.96 | 0.97 | 0.97 | 0.00 | 0.95 | 0.96 | 0.96 | 0.98 |
79
+ | basal cell | 3607 | 0.92 | 0.94 | 0.95 | 0.95 | 0.00 | 0.94 | 0.95 | 0.95 | 0.97 |
80
+ | pulmonary alveolar type 1 cell | 2813 | 0.95 | 0.98 | 0.97 | 0.97 | 0.00 | 0.97 | 0.98 | 0.97 | 0.98 |
81
+ | intermediate monocyte | 2498 | 0.67 | 0.69 | 0.75 | 0.74 | 0.00 | 0.69 | 0.81 | 0.86 | 0.83 |
82
+ | CD4-positive, alpha-beta T cell | 1912 | 0.87 | 0.89 | 0.89 | 0.89 | 0.00 | 0.90 | 0.90 | 0.91 | 0.92 |
83
+ | CD8-positive, alpha-beta T cell | 1702 | 0.85 | 0.85 | 0.86 | 0.86 | 0.00 | 0.87 | 0.87 | 0.88 | 0.90 |
84
+ | endothelial cell of artery | 1593 | 0.78 | 0.81 | 0.83 | 0.83 | 0.00 | 0.79 | 0.80 | 0.83 | 0.85 |
85
+ | club cell | 1575 | 0.80 | 0.83 | 0.89 | 0.86 | 0.00 | 0.84 | 0.88 | 0.87 | 0.91 |
86
+ | classical monocyte | 1416 | 0.59 | 0.61 | 0.65 | 0.67 | 0.00 | 0.72 | 0.82 | 0.93 | 0.82 |
87
+ | vein endothelial cell | 1196 | 0.79 | 0.83 | 0.85 | 0.84 | 0.00 | 0.85 | 0.86 | 0.89 | 0.88 |
88
+ | basophil | 1169 | 0.97 | 0.98 | 0.99 | 0.98 | 0.00 | 0.97 | 0.98 | 0.98 | 0.98 |
89
+ | lung ciliated cell | 1081 | 0.97 | 0.97 | 0.98 | 0.98 | 0.00 | 0.96 | 0.98 | 0.98 | 0.98 |
90
+ | alveolar adventitial fibroblast | 998 | 0.87 | 0.87 | 0.87 | 0.92 | 0.00 | 0.93 | 0.97 | 0.96 | 0.95 |
91
+ | respiratory goblet cell | 932 | 0.77 | 0.87 | 0.88 | 0.87 | 0.00 | 0.85 | 0.87 | 0.89 | 0.91 |
92
+ | natural killer cell | 917 | 0.91 | 0.92 | 0.91 | 0.92 | 0.00 | 0.93 | 0.96 | 0.96 | 0.96 |
93
+ | pericyte | 661 | 0.79 | 0.89 | 0.92 | 0.92 | 0.00 | 0.94 | 0.98 | 0.98 | 0.97 |
94
+ | B cell | 599 | 0.97 | 0.97 | 0.97 | 0.98 | 0.00 | 0.99 | 0.99 | 0.98 | 0.99 |
95
+ | adventitial cell | 520 | 0.82 | 0.82 | 0.78 | 0.87 | 0.00 | 0.87 | 0.95 | 0.94 | 0.93 |
96
+ | non-classical monocyte | 473 | 0.27 | 0.11 | 0.22 | 0.19 | 0.00 | 0.52 | 0.71 | 0.80 | 0.62 |
97
+ | monocyte | 456 | 0.42 | 0.33 | 0.50 | 0.47 | 0.00 | 0.65 | 0.76 | 0.84 | 0.73 |
98
+ | neutrophil | 334 | 0.94 | 0.97 | 0.97 | 0.96 | 0.00 | 0.95 | 0.98 | 0.96 | 0.98 |
99
+ | endothelial cell of lymphatic vessel | 285 | 0.96 | 0.98 | 0.97 | 0.96 | 0.00 | 0.96 | 0.99 | 0.99 | 0.98 |
100
+ | bronchial smooth muscle cell | 195 | 0.51 | 0.60 | 0.68 | 0.56 | 0.00 | 0.78 | 0.91 | 0.93 | 0.90 |
101
+ | mature NK T cell | 136 | 0.00 | 0.30 | 0.30 | 0.32 | 0.00 | 0.68 | 0.83 | 0.85 | 0.85 |
102
+ | plasma cell | 135 | 0.87 | 0.85 | 0.91 | 0.95 | 0.00 | 0.96 | 0.96 | 0.97 | 0.98 |
103
+ | vascular associated smooth muscle cell | 115 | 0.00 | 0.56 | 0.72 | 0.56 | 0.00 | 0.84 | 0.95 | 0.96 | 0.93 |
104
+ | myeloid dendritic cell | 28 | 0.00 | 0.19 | 0.83 | 0.71 | 0.00 | 0.37 | 0.89 | 0.98 | 0.97 |
105
+ | pulmonary ionocyte | 24 | 0.00 | 0.92 | 0.86 | 0.83 | 0.00 | 0.79 | 0.92 | 0.94 | 1.00 |
106
+ | plasmacytoid dendritic cell | 18 | 0.90 | 0.00 | 0.95 | 0.84 | 0.00 | 0.65 | 1.00 | 1.00 | 1.00 |
107
+ | mesothelial cell | 16 | 0.00 | 0.78 | 0.75 | 0.84 | 0.00 | 0.55 | 0.86 | 0.91 | 0.86 |
108
+ | serous cell of epithelium of bronchus | 13 | 0.00 | 0.47 | 0.00 | 0.47 | 0.00 | 0.17 | 0.84 | 0.93 | 0.63 |
109
+ | mast cell | 3 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.10 | 1.00 | 0.86 | 0.80 |
110
+
111
+ </details>
112
+
113
+
114
+ # References
115
+
116
+ Tabula Sapiens reveals transcription factor expression, senescence effects, and sex-specific features in cell types from 28 human organs and tissues, The Tabula Sapiens Consortium; bioRxiv, doi: https://doi.org/10.1101/2024.12.03.626516