Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -36,8 +36,9 @@ Relevant Huggingface hosted models and datasets
|
|
| 36 |
- [GleghornLab/DSM_150_ppi_lora](https://huggingface.co/GleghornLab/DSM_150_ppi_lora) - 150M parameter LoRA DSM-ppi model
|
| 37 |
- [GleghornLab/DSM_650_ppi_lora](https://huggingface.co/GleghornLab/DSM_650_ppi_Lora) - 650M parameter LoRA DSM-ppi model
|
| 38 |
- [GleghornLab/DSM_150_ppi_control](https://huggingface.co/GleghornLab/DSM_150_ppi_control) - Control version of LoRA DSM-ppi
|
| 39 |
-
|
| 40 |
-
|
|
|
|
| 41 |
|
| 42 |
- **Datasets**:
|
| 43 |
- [Synthyra/omg_prot50](https://huggingface.co/Synthyra/omg_prot50) - Open MetaGenomic dataset clustered at 50% identity (207M sequences)
|
|
@@ -338,6 +339,7 @@ The repository includes a comprehensive suite for evaluating model performance,
|
|
| 338 |
3. **Representation Quality (Model Probing):**
|
| 339 |
* Evaluate learned embeddings by training linear probes (or simple transformer blocks) on various downstream tasks (e.g., secondary structure prediction, localization prediction, etc.).
|
| 340 |
* Performance is compared against random vectors, randomized transformers, and other established pLMs.
|
|
|
|
| 341 |
|
| 342 |
4. **Conditional Generation (Binder Design for DSM-ppi):**
|
| 343 |
* Evaluate DSM-ppi on benchmarks like BenchBB.
|
|
|
|
| 36 |
- [GleghornLab/DSM_150_ppi_lora](https://huggingface.co/GleghornLab/DSM_150_ppi_lora) - 150M parameter LoRA DSM-ppi model
|
| 37 |
- [GleghornLab/DSM_650_ppi_lora](https://huggingface.co/GleghornLab/DSM_650_ppi_Lora) - 650M parameter LoRA DSM-ppi model
|
| 38 |
- [GleghornLab/DSM_150_ppi_control](https://huggingface.co/GleghornLab/DSM_150_ppi_control) - Control version of LoRA DSM-ppi
|
| 39 |
+
|
| 40 |
+
(Fully finetuned - recommended for real use)
|
| 41 |
+
- [Synthyra/DSM_ppi_full](https://huggingface.co/Synthyra/DSM_ppi_full) - 650M parameter DSM-ppi model
|
| 42 |
|
| 43 |
- **Datasets**:
|
| 44 |
- [Synthyra/omg_prot50](https://huggingface.co/Synthyra/omg_prot50) - Open MetaGenomic dataset clustered at 50% identity (207M sequences)
|
|
|
|
| 339 |
3. **Representation Quality (Model Probing):**
|
| 340 |
* Evaluate learned embeddings by training linear probes (or simple transformer blocks) on various downstream tasks (e.g., secondary structure prediction, localization prediction, etc.).
|
| 341 |
* Performance is compared against random vectors, randomized transformers, and other established pLMs.
|
| 342 |
+
* The assessment was done with [Protify](https://github.com/Synthyra/Protify), an open-source framework that can be used for pLM training and evaluation.
|
| 343 |
|
| 344 |
4. **Conditional Generation (Binder Design for DSM-ppi):**
|
| 345 |
* Evaluate DSM-ppi on benchmarks like BenchBB.
|