VicenteAlex commited on
Commit
30c2703
·
verified ·
1 Parent(s): 1e12b3b

Init model card

Browse files
Files changed (1) hide show
  1. README.md +63 -3
README.md CHANGED
@@ -1,3 +1,63 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: en
3
+ library_name: transformers
4
+ pipeline_tag: text2text-generation
5
+ tags:
6
+ - t5
7
+ - molecule-to-protein
8
+ - smiles
9
+ - protein-generation
10
+ - binder
11
+ - ligand
12
+ license: apache-2.0
13
+ datasets:
14
+ - AI4PD/Mol2Pro-Binder-Dataset
15
+ ---
16
+
17
+ # Mol2Pro-base
18
+
19
+ ## Model description
20
+
21
+ - **Architecture:** T5-efficient-base https://huggingface.co/google/t5-efficient-base
22
+ - **Tokenization:** https://huggingface.co/AI4PD/Mol2Pro-tokenizer
23
+
24
+
25
+ - **Code:** https://github.com/AI4PDLab/Mol2Pro
26
+ - **Training data** https://huggingface.co/datasets/AI4PD/Mol2Pro-Binder-Dataset
27
+ - **Paper:** https://doi.org/10.64898/2026.02.06.704305
28
+
29
+
30
+
31
+ ## How to use
32
+
33
+ ```python
34
+ from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
35
+ import torch
36
+
37
+ model_id = "AI4PD/Mol2Pro-base"
38
+ tokenizer_id = "AI4PD/Mol2Pro-tokenizer"
39
+
40
+ # Load tokenizers
41
+ tokenizer_mol = AutoTokenizer.from_pretrained(tokenizer_id, subfolder="smiles")
42
+ tokenizer_aa = AutoTokenizer.from_pretrained(tokenizer_id, subfolder="aa")
43
+
44
+ # Load model
45
+ model = AutoModelForSeq2SeqLM.from_pretrained(model_id)
46
+ ```
47
+
48
+ ## Intended use
49
+ Research use only. The model generates candidate sequences conditioned on small-molecule inputs; it does not guarantee binding or function and must be validated experimentally.
50
+
51
+ ## Citation
52
+
53
+ If you find this work useful, please cite:
54
+
55
+ ```bibtex
56
+ @article{VicenteSola2026Generalise,
57
+ title = {Generalise or Memorise? Benchmarking Ligand-Conditioned Protein Generation from Sequence-Only Data},
58
+ author = {Vicente-Sola, Alex and Dornfeld, Lars and Coines, Joan and Ferruz, Noelia},
59
+ journal = {bioRxiv},
60
+ year = {2026},
61
+ doi = {10.64898/2026.02.06.704305},
62
+ }
63
+