Update README.md
Browse files
README.md
CHANGED
|
@@ -3,4 +3,67 @@ license: apache-2.0
|
|
| 3 |
pipeline_tag: tabular-regression
|
| 4 |
tags:
|
| 5 |
- biology
|
| 6 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3 |
pipeline_tag: tabular-regression
|
| 4 |
tags:
|
| 5 |
- biology
|
| 6 |
+
---
|
| 7 |
+
|
| 8 |
+
# NexaBio: Advanced Protein Structure Prediction Models
|
| 9 |
+
|
| 10 |
+
**NexaBio** is a sophisticated two-stage model suite designed for high-accuracy protein structure prediction from amino acid sequences. It comprises two complementary models:
|
| 11 |
+
|
| 12 |
+
- **NexaBio_1**: A Convolutional Neural Network (CNN) and Bidirectional LSTM (BiLSTM) model for secondary structure prediction.
|
| 13 |
+
- **NexaBio_2**: A Variational Autoencoder (VAE) and Diffusion-based model for tertiary (3D) structure prediction.
|
| 14 |
+
|
| 15 |
+
NexaBio is a core component of the [Nexa Scientific Model Suite](https://huggingface.co/spaces/Allanatrix/NexaHub), a collection of machine learning models advancing scientific discovery.
|
| 16 |
+
|
| 17 |
+
## Model Overview
|
| 18 |
+
|
| 19 |
+
### NexaBio_1: Secondary Structure Prediction
|
| 20 |
+
- **Architecture**: CNN combined with BiLSTM for robust sequence modeling.
|
| 21 |
+
- **Input**: Amino acid sequence (one-hot encoded or embedded).
|
| 22 |
+
- **Output**: Secondary structure classifications (e.g., Helix, Sheet, Coil).
|
| 23 |
+
- **Use Case**: Identification of local structural motifs and protein folding patterns.
|
| 24 |
+
|
| 25 |
+
### NexaBio_2: Tertiary Structure Prediction
|
| 26 |
+
- **Architecture**: VAE integrated with a Diffusion Model for generative 3D modeling.
|
| 27 |
+
- **Input**: Amino acid sequence (optionally augmented with secondary structure predictions).
|
| 28 |
+
- **Output**: 3D coordinates of protein backbone atoms.
|
| 29 |
+
- **Use Case**: Full tertiary structure prediction for structural analysis and design.
|
| 30 |
+
|
| 31 |
+
## Applications
|
| 32 |
+
- **Structural Bioinformatics**: Enabling precise protein structure analysis for research.
|
| 33 |
+
- **Drug Discovery**: Supporting protein-ligand interaction studies and therapeutic design.
|
| 34 |
+
- **Protein Engineering**: Facilitating the design of novel proteins for industrial and medical applications.
|
| 35 |
+
- **Synthetic Biology**: Generating protein structures for biotechnological innovation.
|
| 36 |
+
- **Academic Research**: Serving as a tool for educational and exploratory studies.
|
| 37 |
+
|
| 38 |
+
## Getting Started
|
| 39 |
+
|
| 40 |
+
### Example Usage
|
| 41 |
+
```python
|
| 42 |
+
from transformers import AutoModel
|
| 43 |
+
|
| 44 |
+
# Initialize the secondary structure prediction model
|
| 45 |
+
model_sec = AutoModel.from_pretrained("Allanatrix/NexaBio_1")
|
| 46 |
+
|
| 47 |
+
# Initialize the tertiary structure prediction model
|
| 48 |
+
model_ter = AutoModel.from_pretrained("Allanatrix/NexaBio_2")
|
| 49 |
+
|
| 50 |
+
# Process an amino acid sequence (refer to model documentation for input formatting)
|
| 51 |
+
```
|
| 52 |
+
|
| 53 |
+
For comprehensive instructions, including inference APIs and preprocessing details, consult the individual model cards on Hugging Face.
|
| 54 |
+
|
| 55 |
+
## Citation and License
|
| 56 |
+
If you utilize NexaBio in your research or applications, please cite this repository and include a link to the [Nexa R&D Space](https://huggingface.co/spaces/Allanatrix/NexaR&D).
|
| 57 |
+
The models and associated code are licensed under the **Boost Software License 1.1 (BSL-1.1)**.
|
| 58 |
+
|
| 59 |
+
## Part of the Nexa Scientific Ecosystem
|
| 60 |
+
Discover other components of the Nexa Scientific Stack:
|
| 61 |
+
- [Nexa Data Studio](https://huggingface.co/spaces/Allanatrix/NexaDataStudio): Data processing and visualization tools.
|
| 62 |
+
- [Nexa R&D](https://huggingface.co/spaces/Allanatrix/NexaR&D): Research-focused model development environment.
|
| 63 |
+
- [Nexa Infrastructure](https://huggingface.co/spaces/Allanatrix/NexaInfrastructure): Scalable ML deployment solutions.
|
| 64 |
+
- [Nexa Hub](https://huggingface.co/spaces/Allanatrix/NexaHub): Central portal for Nexa resources.
|
| 65 |
+
|
| 66 |
+
---
|
| 67 |
+
|
| 68 |
+
*Developed and maintained by [Allan](https://huggingface.co/Allanatrix), an independent machine learning researcher specializing in scientific AI and infrastructure.*
|
| 69 |
+
|