Update README.md
README.md
CHANGED
@@ -1,3 +1,21 @@
----
-license: llama2
-
+---
+license: llama2
+datasets:
+- mims-harvard/ProCyon-Instruct
+language:
+- en
+base_model:
+- meta-llama/Llama-2-7b-hf
+tags:
+- biology
+- protein
+---
+# ProCyon-Split
+
+ProCyon-Split is a multimodal foundation model for protein phenotypes, which combines a large language model with protein encoders to support inputs of interleaved free text and proteins.
+In contrast to ProCyon-Full, this model is instruction-tuned using the training split of the [ProCyon-Instruct](https://huggingface.co/datasets/mims-harvard/ProCyon-Instruct) dataset to
+enable rigorous model evaluation on held-out protein-phenotype pairs.
+
+For more information on the model design, training, and validation, please see the [overview page](https://zitniklab.hms.harvard.edu/ProCyon/).
+
+Additional versions of the model are available as [ProCyon-Full](https://huggingface.co/mims-harvard/ProCyon-Full) and [ProCyon-Bind](https://huggingface.co/mims-harvard/ProCyon-Bind).
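
The metadata header added in this change can be sanity-checked before merging. The following is a minimal sketch, assuming the header only uses flat `key: value` pairs and `key:` followed by `- item` lists, as it does here. The function name `parse_front_matter` is hypothetical, and this is not the parser the Hugging Face Hub itself uses; a full YAML library would be the safer choice for anything more complex.

```python
# Minimal, stdlib-only sketch of reading the model-card metadata header.
# Assumption: the header is a '---' delimited block containing only flat
# 'key: value' pairs and 'key:' lines followed by '- item' list entries.

README_HEAD = """\
---
license: llama2
datasets:
- mims-harvard/ProCyon-Instruct
language:
- en
base_model:
- meta-llama/Llama-2-7b-hf
tags:
- biology
- protein
---
# ProCyon-Split
"""


def parse_front_matter(text: str) -> dict:
    """Return the metadata header of `text` as a dict (hypothetical helper)."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no metadata header present

    meta = {}
    current_key = None  # key whose list is currently being filled
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":
            return meta  # closing delimiter reached
        if not stripped:
            continue
        if stripped.startswith("- "):
            if current_key is None:
                raise ValueError(f"list item without a key: {line!r}")
            meta[current_key].append(stripped[2:].strip())
            continue
        key, sep, value = stripped.partition(":")
        if not sep:
            raise ValueError(f"expected 'key: value', got: {line!r}")
        key, value = key.strip(), value.strip()
        if value:
            meta[key] = value  # scalar entry
            current_key = None
        else:
            meta[key] = []  # list entry; items follow on later lines
            current_key = key
    raise ValueError("metadata header is not closed by '---'")


metadata = parse_front_matter(README_HEAD)
print(metadata)
```

A check like this catches an unclosed `---` block or a stray list item, both of which would otherwise only show up once the model card is rendered.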