Translation
Transformers
PyTorch
t5
text2text-generation
chemistry
biology
text-generation-inference
Instructions to use AI4PD/REXzyme with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use AI4PD/REXzyme with Transformers:
# Use a pipeline as a high-level helper # Warning: Pipeline type "translation" is no longer supported in transformers v5. # You must load the model directly (see below) or downgrade to v4.x with: # 'pip install "transformers<5.0.0' from transformers import pipeline pipe = pipeline("translation", model="AI4PD/REXzyme")# Load model directly from transformers import AutoTokenizer, AutoModelForSeq2SeqLM tokenizer = AutoTokenizer.from_pretrained("AI4PD/REXzyme") model = AutoModelForSeq2SeqLM.from_pretrained("AI4PD/REXzyme") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -73,7 +73,7 @@ See below for information about the models' performance in different in-silico t
|
|
| 73 |
## **Model description**
|
| 74 |
|
| 75 |
REXzyme is based on the [Efficient T5 Large Transformer](https://huggingface.co/google/t5-efficient-large) architecture (which in turn is very similar to the current version of Google Translator)
|
| 76 |
-
and contains 48 (24
|
| 77 |
|
| 78 |
REXzyme is a translation machine trained on portion the RHEA database containing 31970152 reaction-enzyme pairs.
|
| 79 |
The pre-training was done on pairs of smiles and amino acid sequences, tokenized with a char-level
|
|
|
|
| 73 |
## **Model description**
|
| 74 |
|
| 75 |
REXzyme is based on the [Efficient T5 Large Transformer](https://huggingface.co/google/t5-efficient-large) architecture (which in turn is very similar to the current version of Google Translator)
|
| 76 |
+
and contains 48 (24 encoder/ 24 decoder) layers with a model dimensionality of 1024, totaling 737.72 million parameters.
|
| 77 |
|
| 78 |
REXzyme is a translation machine trained on portion the RHEA database containing 31970152 reaction-enzyme pairs.
|
| 79 |
The pre-training was done on pairs of smiles and amino acid sequences, tokenized with a char-level
|