Enhance model card for JAMUN

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +68 -3
README.md CHANGED
@@ -1,3 +1,68 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ pipeline_tag: other
4
+ tags:
5
+ - molecular-dynamics
6
+ - protein-folding
7
+ - drug-discovery
8
+ - bioinformatics
9
+ - generative-models
10
+ - diffusion-models
11
+ ---
12
+
13
+ # JAMUN: Bridging Smoothed Molecular Dynamics and Score-Based Learning for Conformational Ensembles
14
+
15
+ JAMUN is a novel approach for generating conformational ensembles of protein structures, presented in the paper [JAMUN: Bridging Smoothed Molecular Dynamics and Score-Based Learning for Conformational Ensembles](https://huggingface.co/papers/2410.14621).
16
+
17
+ Conformational ensembles of protein structures are immensely important both for understanding protein function and drug discovery in novel modalities such as cryptic pockets. JAMUN performs molecular dynamics (MD) in a smoothed, noised space of all-atom 3D conformations of molecules by utilizing the framework of walk-jump sampling. It enables ensemble generation for small peptides at rates of an order of magnitude faster than traditional molecular dynamics. The physical priors in JAMUN enable transferability to systems outside of its training data, even to peptides that are longer than those originally trained on.
18
+
19
+ <p align="center">
20
+ <img src="https://github.com/prescient-design/jamun/blob/main/figures/walk-jump-overview.png?raw=true" alt="Overview of walk-jump sampling in JAMUN" width="400"/>
21
+ </p>
22
+
23
+ For the official code, setup instructions, and further details, please refer to the [GitHub repository](https://github.com/prescient-design/jamun).
24
+
25
+ ## Usage
26
+
27
+ To use the JAMUN model for sampling conformations, you first need to set up the environment and download the pre-trained checkpoints as described in the [official GitHub repository](https://github.com/prescient-design/jamun#setup) and the [model collection on Hugging Face](https://huggingface.co/ameya98/JAMUN).
28
+
29
+ Once set up and checkpoints are downloaded, you can run sampling for a peptide sequence (e.g., "AGPF") as follows:
30
+
31
+ 1. **Load Trained Models:**
32
+ Ensure you have `git-lfs` installed, then clone the model collection:
33
+ ```bash
34
+ git lfs install
35
+ git clone https://huggingface.co/ameya98/JAMUN
36
+ ```
37
+
38
+ 2. **Generate a `.pdb` file for your peptide (if you don't have one):**
39
+ ```bash
40
+ python scripts/prepare_pdb.py AGPF --mode uncapped --outputdir .
41
+ ```
42
+ This command will create a PDB file (e.g., `uncapped-AGPF.pdb`) in the specified output directory.
43
+
44
+ 3. **Run the sampling command:**
45
+ Ensure your `jamun` conda environment is active and necessary environment variables are set. Replace `<INIT_PDB_PATH>` with the path to your generated `.pdb` file and `<CHECKPOINT_DIR>` with the path to your downloaded model checkpoint (e.g., `./JAMUN/2AA_uncapped_large_model`).
46
+ ```bash
47
+ jamun_sample --config-dir=configs experiment=sample_custom \
48
+ ++init_pdb=<INIT_PDB_PATH> \
49
+ ++checkpoint_dir=<CHECKPOINT_DIR>
50
+ ```
51
+
52
+ For more detailed information on training, inference, and analysis, please visit the [official GitHub repository](https://github.com/prescient-design/jamun#usage).
53
+
54
+ ## Citation
55
+
56
+ If you find this work useful, please cite the paper:
57
+
58
+ ```bibtex
59
+ @misc{daigavane2024jamuntransferablemolecularconformational,
60
+ title={JAMUN: Bridging Smoothed Molecular Dynamics and Score-Based Learning for Conformational Ensembles},
61
+ author={Ameya Daigavane and Bodhi P. Vani and Darcy Davidson and Saeed Saremi and Joshua Rackers and Joseph Kleinhenz},
62
+ year={2024},
63
+ eprint={2410.14621},
64
+ archivePrefix={arXiv},
65
+ primaryClass={physics.bio-ph},
66
+ url={https://arxiv.org/abs/2410.14621},
67
+ }
68
+ ```