|
|
--- |
|
|
license: mit |
|
|
tags: |
|
|
- DNA-methylation |
|
|
- epigenetics |
|
|
- foundation-model |
|
|
- aging |
|
|
- biology |
|
|
--- |
|
|
|
|
|
# CpGPT Model Checkpoints |
|
|
|
|
|
Model weights, configurations, and vocabularies for [CpGPT: A Foundation Model for DNA Methylation](https://github.com/lcamillo/CpGPT). |
|
|
|
|
|
## Contents |
|
|
|
|
|
``` |
|
|
weights/ # PyTorch Lightning checkpoint files (.ckpt) |
|
|
config/ # Hydra YAML configuration files |
|
|
vocab/ # CpG vocabulary files (.json) |
|
|
``` |
|
|
|
|
|
## Pre-trained Models |
|
|
|
|
|
| Model | Size | Parameters | Model Name | |
|
|
|-------|------|------------|------------| |
|
|
| CpGPT-2M | 30MB | ~2.5M | `small` | |
|
|
| CpGPT-100M | 1.1GB | ~101M | `large` | |
|
|
|
|
|
## Download |
|
|
|
|
|
```bash |
|
|
# Install huggingface_hub |
|
|
pip install huggingface_hub |
|
|
|
|
|
# Download all model files |
|
|
huggingface-cli download lucascamillomd/cpgpt-models --local-dir dependencies/model |
|
|
|
|
|
# Or download a specific model |
|
|
huggingface-cli download lucascamillomd/cpgpt-models weights/small.ckpt config/small.yaml vocab/small.json --local-dir dependencies/model |
|
|
``` |
|
|
|
|
|
## Dependencies |
|
|
|
|
|
You will also need the DNA embeddings for your species of interest: |
|
|
- **Human**: [lucascamillomd/cpgpt-human-dependencies](https://huggingface.co/lucascamillomd/cpgpt-human-dependencies) |
|
|
- **Mammalian (multi-species)**: [lucascamillomd/cpgpt-mammalian-dependencies](https://huggingface.co/lucascamillomd/cpgpt-mammalian-dependencies) |
|
|
|
|
|
## Usage |
|
|
|
|
|
After downloading the model files and species dependencies, follow the tutorials at the [CpGPT GitHub repository](https://github.com/lcamillo/CpGPT) to get started. |
|
|
|
|
|
## Citation |
|
|
|
|
|
```bibtex |
|
|
@article{camillo2024cpgpt, |
|
|
title={CpGPT: A Foundation Model for DNA Methylation}, |
|
|
author={de Lima Camillo, Lucas Paulo et al.}, |
|
|
journal={bioRxiv}, |
|
|
year={2024}, |
|
|
doi={10.1101/2024.10.24.619766} |
|
|
} |
|
|
``` |
|
|
|
|
|
## License |
|
|
|
|
|
MIT License — see the [GitHub repository](https://github.com/lcamillo/CpGPT) for details. |
|
|
|