Update README.md
Browse files
README.md
CHANGED
|
@@ -37,6 +37,6 @@ and 24 layers; and SARITA-XL, with 1.2 billion parameters, featuring an embeddin
|
|
| 37 |
its unique role in protein structure and function. The tokenizer also includes special tokens like <PAD> for padding shorter sequences and <EOS> for marking sequence ends, ensuring consistency across datasets. This process reduces variability and enhances the model's ability to learn meaningful
|
| 38 |
patterns from protein sequences. At the end each token is transformed into a numerical representation using a look-up table
|
| 39 |
|
| 40 |
-

|
| 41 |
|
| 42 |
|