CpGPT Model Checkpoints

Model weights, configurations, and vocabularies for CpGPT: A Foundation Model for DNA Methylation.

Contents

weights/     # PyTorch Lightning checkpoint files (.ckpt)
config/      # Hydra YAML configuration files
vocab/       # CpG vocabulary files (.json)

Pre-trained Models

Model Size Parameters Model Name
CpGPT-2M 30MB ~2.5M small
CpGPT-100M 1.1GB ~101M large

Download

# Install huggingface_hub
pip install huggingface_hub

# Download all model files
huggingface-cli download lucascamillomd/cpgpt-models --local-dir dependencies/model

# Or download a specific model
huggingface-cli download lucascamillomd/cpgpt-models weights/small.ckpt config/small.yaml vocab/small.json --local-dir dependencies/model

Dependencies

You will also need the DNA embeddings for your species of interest:

Usage

After downloading the model files and species dependencies, follow the tutorials at the CpGPT GitHub repository to get started.

Citation

@article{camillo2024cpgpt,
  title={CpGPT: A Foundation Model for DNA Methylation},
  author={de Lima Camillo, Lucas Paulo et al.},
  journal={bioRxiv},
  year={2024},
  doi={10.1101/2024.10.24.619766}
}

License

MIT License — see the GitHub repository for details.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support