---
license: apache-2.0
tags:
- chemistry
- drug-discovery
- molecular-modeling
- mumo
pipeline_tag: graph-ml
library_name: transformers
---
# mumo-pretrain
This model was trained with the MuMo (Multi-Modal Molecular) framework, presented in the paper [Structure-Aware Fusion with Progressive Injection for Multimodal Molecular Representation Learning](https://huggingface.co/papers/2510.23640).
The official code repository is available at: https://github.com/selmiss/MuMo
## Model Description
- **Model Type**: MuMo Pretrained Model
- **Training Data**: Molecular structures and properties
- **Framework**: PyTorch + Transformers + Mamba-ssm
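A minimal environment sketch based on the frameworks listed above; the package names here are assumptions, and exact pinned versions plus any additional dependencies are in the official repository (note that `mamba-ssm` generally needs a CUDA toolchain to build):
```shell
# Environment sketch only; consult the MuMo repository for exact versions.
pip install torch transformers mamba-ssm
```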
## Usage
### Loading the Model
MuMo uses a custom loading function that ships with the official code repository, so clone it first and run the Python below from the repository root:
```shell
git clone https://github.com/selmiss/MuMo.git
cd MuMo
```
```python
from dataclasses import dataclass
from typing import Optional

from transformers import AutoConfig, AutoTokenizer

from model.load_model import load_model  # provided by the cloned MuMo repository

# Load configuration and tokenizer from the Hub
repo = "zihaojing/MuMo-Pretrained"
config = AutoConfig.from_pretrained(repo, trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained(repo)

# Set up model arguments
@dataclass
class ModelArgs:
    model_name_or_path: str = repo
    model_class: str = "MuMoFinetune"  # or "MuMoPretrain" for pretraining
    cache_dir: Optional[str] = None
    model_revision: str = "main"
    use_auth_token: bool = False
    task_type: Optional[str] = None  # e.g., "classification" or "regression" for finetuning

model_args = ModelArgs()

# Load the model
model = load_model(config, tokenizer=tokenizer, model_args=model_args)
```
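Once loaded, inference follows the usual `transformers` pattern. The sketch below is illustrative only: it assumes the tokenizer accepts SMILES strings directly and that the model exposes a standard forward call. MuMo is multimodal, so the real input pipeline may also need graph features built by utilities in the official repository.
```python
import torch

# Illustrative sketch under the assumptions above; check the official
# repository for the exact preprocessing and forward interface.
smiles = "CCO"  # ethanol
inputs = tokenizer(smiles, return_tensors="pt")

model.eval()
with torch.no_grad():
    outputs = model(**inputs)
```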
**Notes:**
- Use `model_class="MuMoPretrain"` for pretraining or inference
- Use `model_class="MuMoFinetune"` for finetuning tasks
- Set `task_type` to `"classification"` or `"regression"` when using `MuMoFinetune`
- The model supports loading from both Hugging Face Hub (e.g., `"zihaojing/MuMo-Pretrained"`) and local paths (e.g., `"/path/to/model"`)
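For example, to prepare the checkpoint for a downstream classification task, reuse the `ModelArgs` dataclass defined above (a sketch; which head gets attached is determined by the repository's `load_model`):
```python
# Load the checkpoint configured for finetuning on a classification task
finetune_args = ModelArgs(
    model_class="MuMoFinetune",
    task_type="classification",  # or "regression"
)
finetune_model = load_model(config, tokenizer=tokenizer, model_args=finetune_args)
```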
## Training Details
- Training script: See the [official GitHub repository](https://github.com/selmiss/MuMo) for details.
- Framework: Transformers + DeepSpeed
## Citation
If you use this model or the MuMo framework, please cite our paper:
```bibtex
@inproceedings{jing2025mumo,
  title     = {MuMo: Multimodal Molecular Representation Learning via Structural Fusion and Progressive Injection},
  author    = {Jing, Zihao and Sun, Yan and Li, Yan Yi and Janarthanan, Sugitha and Deng, Alana and Hu, Pingzhao},
  booktitle = {Advances in Neural Information Processing Systems (NeurIPS)},
  year      = {2025}
}
```