File size: 1,751 Bytes
4438d29
01baba3
 
 
 
 
4438d29
 
ec68252
 
9efb8e7
ec68252
01baba3
 
a2a99c2
01baba3
 
 
f0a90d8
 
 
01baba3
 
 
72833ea
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
---
language:
- en
tags:
- biology
- protein
---

## Model Card for Pinal: Toward De Novo Protein Design from Natural Language

<a href="https://www.biorxiv.org/content/10.1101/2024.08.01.606258"><img src="https://img.shields.io/badge/Paper-bioRxiv-green" style="max-width: 100%;"></a> <a href="http://www.denovo-pinal.com/"><img src="https://img.shields.io/badge/Pinal-red?label=Server" style="max-width: 100%;"></a> <a href="https://github.com/westlake-repl/Denovo-Pinal"><img src="https://img.shields.io/badge/Github-Code-blue?logo=github" style="max-width: 100%;"></a>

Pinal is an advanced protein design framework that translates human design intent into novel protein sequences. It utilizes a two-stage process: first generating protein structures from language instructions, then designing sequences based on those structures. With a substantial parameter count of 16 billion and trained on a diverse dataset comprising 1.7 billion protein-text pairs, Pinal demonstrates marked improvements in both performance and generalization capabilities for novel protein structures.

![image/png](https://cdn-uploads.huggingface.co/production/uploads/665feb99ea2cc04a641db7f6/JZFBfNeWeQhXyBySKMHiT.png)

For more information, please refer to our [preprint](https://www.biorxiv.org/content/10.1101/2024.08.01.606258).

### SaProt-T(ext) and SaProt-O(mni)

SaProt-T was specifically developed for **protein reverse folding** within the Pinal framework, whereas SaProt-O is trained with configurable inputs (optional textual prompts and structural features) to accommodate real-world research scenarios.

## Usage

Please refer to the README.md in the  [Github repository](https://github.com/westlake-repl/Denovo-Pinal) for details on how to use the model.