Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,77 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: mit
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
pretty_name: >-
|
| 4 |
+
hu.MAP3.0: Atlas of human protein complexes by integration of > 25,000
|
| 5 |
+
proteomic experiments.
|
| 6 |
+
repo: https://github.com/KDrewLab/huMAP3.0_analysis
|
| 7 |
+
---
|
| 8 |
+
# hu.MAP3.0: Atlas of human protein complexes by integration of > 25,000 proteomic experiments.
|
| 9 |
+
Proteins interact with each other and organize themselves into macromolecular machines (ie. complexes)
|
| 10 |
+
to carry out essential functions of the cell. We have a good understanding of a few complexes such as
|
| 11 |
+
the proteasome and the ribosome but currently we have an incomplete view of all protein complexes as
|
| 12 |
+
well as their functions. The hu.MAP attempts to address this lack of understanding by integrating several
|
| 13 |
+
large scale protein interaction datasets to obtain the most comprehensive view of protein complexes.
|
| 14 |
+
In hu.MAP 3.0 we integrated large scale affinity purification mass spectrometry (AP/MS) datasets from Bioplex,
|
| 15 |
+
Bioplex2.0, Bioplex3.0, Boldt et al. and Hein et al., large scale biochemical fractionation data (Wan et al.),
|
| 16 |
+
proximity labeling data (Gupta et al., Youn et al.), and RNA hairpin pulldown data (Treiber et al.) to produce
|
| 17 |
+
a complex map with over 15k complexes.
|
| 18 |
+
|
| 19 |
+
|
| 20 |
+
## Funding
|
| 21 |
+
|
| 22 |
+
NIH R00, NSF/BBSRC
|
| 23 |
+
|
| 24 |
+
## Citation
|
| 25 |
+
|
| 26 |
+
Samantha N. Fischer, Erin R Claussen, Savvas Kourtis, Sara Sdelci, Sandra Orchard, Henning Hermjakob, Georg Kustatscher, Kevin Drew hu.MAP3.0: Atlas of human protein complexes by integration of > 25,000 proteomic experiments BioRxiv https://doi.org/10.1101/2024.10.11.617930
|
| 27 |
+
|
| 28 |
+
## References
|
| 29 |
+
|
| 30 |
+
Kevin Drew, John B. Wallingford, Edward M. Marcotte hu.MAP 2.0: integration of over 15,000 proteomic experiments builds a global compendium of human multiprotein assemblies Mol Syst Biol (2021)17:e10016https://doi.org/10.15252/msb.202010016
|
| 31 |
+
Kevin Drew, Chanjae Lee, Ryan L Huizar, Fan Tu, Blake Borgeson, Claire D McWhite, Yun Ma, John B Wallingford, Edward M Marcotte Integration of over 9,000 mass spectrometry experiments builds a global map of human protein complexes. Molecular Systems Biology (2017) 13, 932. DOI 10.15252/msb.20167490
|
| 32 |
+
Huttlin et al. Dual proteome-scale networks reveal cell-specific remodeling of the human interactome Cell. 2021 May 27;184(11):3022-3040.e28. doi: 10.1016/j.cell.2021.04.011.
|
| 33 |
+
Huttlin et al. Architecture of the human interactome defines protein communities and disease networks. Nature. 2017 May 25;545(7655):505-509. DOI: 10.1038/nature22366.
|
| 34 |
+
Treiber et al. A Compendium of RNA-Binding Proteins that Regulate MicroRNA Biogenesis.. Mol Cell. 2017 Apr 20;66(2):270-284.e13. doi: 10.1016/j.molcel.2017.03.014.
|
| 35 |
+
Boldt et al. An organelle-specific protein landscape identifies novel diseases and molecular mechanisms. Nat Commun. 2016 May 13;7:11491. doi: 10.1038/ncomms11491.
|
| 36 |
+
Youn et al. High-Density Proximity Mapping Reveals the Subcellular Organization of mRNA-Associated Granules and Bodies. Mol Cell. 2018 Feb 1;69(3):517-532.e11. doi: 10.1016/j.molcel.2017.12.020.
|
| 37 |
+
Gupta et al. A Dynamic Protein Interaction Landscape of the Human Centrosome-Cilium Interface. Cell. 2015 Dec 3;163(6):1484-99. doi: 10.1016/j.cell.2015.10.065.
|
| 38 |
+
Wan, Borgeson et al. Panorama of ancient metazoan macromolecular complexes. Nature. 2015 Sep 17;525(7569):339-44. doi: 10.1038/nature14877. Epub 2015 Sep 7.
|
| 39 |
+
Hein et al. A human interactome in three quantitative dimensions organized by stoichiometries and abundances. Cell. 2015 Oct 22;163(3):712-23. doi: 10.1016/j.cell.2015.09.053. Epub 2015 Oct 22.
|
| 40 |
+
Huttlin et al. The BioPlex Network: A Systematic Exploration of the Human Interactome. Cell. 2015 Jul 16;162(2):425-40. doi: 10.1016/j.cell.2015.06.043.
|
| 41 |
+
Reimand et al. g:Profiler-a web server for functional interpretation of gene lists (2016 update). Nucleic Acids Res. 2016 Jul 8;44(W1):W83-9. doi: 10.1093/nar/gkw199.
|
| 42 |
+
|
| 43 |
+
## Associated code
|
| 44 |
+
|
| 45 |
+
Code examples using the hu.MAP 3.0 model and downstream analysis can be found on our
|
| 46 |
+
[GitHub](https://github.com/KDrewLab/huMAP3.0_analysis)
|
| 47 |
+
All feature matrices and associated files can be found in the [sfisch/hu.MAP3.0](https://huggingface.co/datasets/sfisch/hu.MAP3.0) datasets
|
| 48 |
+
repo
|
| 49 |
+
|
| 50 |
+
# Usage
|
| 51 |
+
|
| 52 |
+
## Accessing the model
|
| 53 |
+
hu.MAP 3.0 was built using the auto-ML tool [AutoGluon](https://auto.gluon.ai/stable/index.html) and the [TabularPredictor](https://auto.gluon.ai/stable/api/autogluon.tabular.TabularPredictor.html)
|
| 54 |
+
module is used train, test, and make predictions with the model.
|
| 55 |
+
|
| 56 |
+
This can be downloaded using the following:
|
| 57 |
+
|
| 58 |
+
$ pip install autogluon==0.4.0
|
| 59 |
+
|
| 60 |
+
Then it can be imported as:
|
| 61 |
+
|
| 62 |
+
>>> from autogluon.tabular import TabularPredictor
|
| 63 |
+
Note that to perform operations with our model the **0.4.0 version** must be used
|
| 64 |
+
|
| 65 |
+
Our trained model can be downloaded through Huggingface using [huggingface_hub](https://huggingface.co/docs/hub/index)
|
| 66 |
+
|
| 67 |
+
>>> from huggingface_hub import snapshot_download
|
| 68 |
+
|
| 69 |
+
>>> model_dir = snapshot_download(repo_id="sfisch/hu.MAP3.0_AutoGluon")
|
| 70 |
+
|
| 71 |
+
>>> predictor = TabularPredictor.load(f"{model_dir}/huMAP3_20230503_complexportal_subset10kNEG_notScaled_accuracy")
|
| 72 |
+
|
| 73 |
+
To use the model and make predictions, we show two full code examples using the [full feature matrix](https://github.com/KDrewLab/huMAP3.0_analysis/blob/main/huMAP3.0_model_devel/generating_predictions_w_hu.MAP3.0.ipynb)
|
| 74 |
+
and the [test feature matrix](https://github.com/KDrewLab/huMAP3.0_analysis/blob/main/huMAP3.0_model_devel/humap3_test_20230503.pairsWprob) in jupyter notebooks.
|
| 75 |
+
|
| 76 |
+
All feature matrices can be pulled using the 'datasets' module from HuggingFace and examples of that are seen on our [GitHub](https://github.com/KDrewLab/huMAP3.0_analysis/tree/main/huMAP3.0_model_devel)
|
| 77 |
+
and on our HuggingFace dataset repo [sfisch/hu.MAP3.0](https://huggingface.co/datasets/sfisch/hu.MAP3.0)
|