scfoundation-cell / README.md
krkawzq's picture
Upload folder using huggingface_hub
7b06458 verified
---
library_name: perturblab
tags:
- biology
- genomics
- scfoundation
- foundation-model
license: apache-2.0
base_model: biomap-research/scFoundation
---
# scfoundation-cell
## Model Description
This is the **cell embedding** model from scFoundation. It generates cell-level embeddings from single-cell RNA-seq data.
Model weights were originally from the [biomap-research/scFoundation](https://github.com/biomap-research/scFoundation) repository and have been re-uploaded here for ease of use with the `perturblab` library.
## Model Details
- **Model Type**: Cell embedding model
- **Architecture**: xTrimoGene with MAE (Masked Autoencoder), Performer/Transformer modules
- **Parameters**: 100M parameters
- **Training Data**: 50M+ human single-cell transcriptomics data
- **Input**: Single-cell or bulk RNA-seq expression data (19,264 fixed genes)
- **Output**: Cell-level embeddings
## Source
- **Original Repository**: [biomap-research/scFoundation](https://github.com/biomap-research/scFoundation)
- **Paper**: [Large Scale Foundation Model on Single-cell Transcriptomics](https://www.nature.com/articles/s41592-024-02305-7) (_Nature Methods_, 2024)
## Usage
```python
from perturblab.model.scfoundation import scFoundationModel
# Load model
model = scFoundationModel.from_pretrained('scfoundation-cell', device='cuda')
# Generate cell embeddings
cell_embeddings = model.predict_embedding(
adata,
output_type='cell',
pool_type='all'
)
```
## Note
Intended for internal use with the PerturbLab framework.