Instructions to use Synthyra/FastESM2_650 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Synthyra/FastESM2_650 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("fill-mask", model="Synthyra/FastESM2_650", trust_remote_code=True)# Load model directly from transformers import AutoModelForMaskedLM model = AutoModelForMaskedLM.from_pretrained("Synthyra/FastESM2_650", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -26,4 +26,8 @@ with torch.no_grad():
|
|
| 26 |
print(embeddings.shape) # (1, 11, 1280)
|
| 27 |
```
|
| 28 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 29 |
|
|
|
|
| 26 |
print(embeddings.shape) # (1, 11, 1280)
|
| 27 |
```
|
| 28 |
|
| 29 |
+
Because we trained in mixed-precision float16, float16 has closer outputs to the float32 weights then bfloat16.
|
| 30 |
+
When summing the MSE of 1000 sequences vs. the float32 weights:
|
| 31 |
+
Average MSE for FP16: 0.00000140
|
| 32 |
+
Average MSE for BF16: 0.00004125
|
| 33 |
|