Instructions to use Synthyra/ESMplusplus_small with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Synthyra/ESMplusplus_small with Transformers:
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("fill-mask", model="Synthyra/ESMplusplus_small", trust_remote_code=True)
```

```python
# Load model directly
from transformers import AutoModelForMaskedLM

model = AutoModelForMaskedLM.from_pretrained("Synthyra/ESMplusplus_small", trust_remote_code=True, dtype="auto")
```
- Notebooks
- Google Colab
- Kaggle
Upload README.md with huggingface_hub
README.md
CHANGED
```diff
@@ -17,7 +17,7 @@ The small version corresponds to the 300 million parameter version of ESMC.
 | Backend | Key | Notes |
 | :--- | :--- | :--- |
 | PyTorch SDPA | `"sdpa"` | Default. Exact numerics, stable on all hardware. |
-| Flash Attention | `"kernels_flash"` | Fastest on Ampere/Hopper GPUs. Requires `pip install kernels` (pre-built, no hours-long compilation). Outputs
+| Flash Attention | `"kernels_flash"` | Fastest on Ampere/Hopper GPUs. Requires `pip install kernels` (pre-built, no hours-long compilation). Outputs are not bitwise identical to SDPA due to online softmax reordering; differences are often small but not guaranteed to be inconsequential; use `"sdpa"` if exact numerics matter. |
 | Flex Attention | `"flex"` | Skips padding tokens via block mask; faster on variable-length batches. Near-exact numerics. First use compiles a Triton kernel (30–120 s). Best combined with `torch.compile`. |
 | Auto | `"auto"` | Picks the best available: `kernels_flash` → `flex` → `sdpa`. |
 
```
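The Flash Attention note attributes the numerical differences to online softmax reordering. The idea can be illustrated with a simplified pure-Python sketch. This is not the actual kernel: the function names `softmax_two_pass` and `softmax_online` and the `block_size` parameter are hypothetical, and a real kernel also rescales the attention output accumulator, which is omitted here. The sketch only shows that a block-wise, streaming normalizer performs its floating-point operations in a different order than a two-pass softmax, so the results agree to within rounding error but need not match bit for bit.

```python
import math


def softmax_two_pass(scores):
    """Reference softmax: find the global max first, then normalize."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_online(scores, block_size=2):
    """Streaming softmax: visit scores block by block, rescaling the
    running normalizer whenever a larger maximum is encountered."""
    running_max = float("-inf")
    running_sum = 0.0
    for start in range(0, len(scores), block_size):
        block = scores[start:start + block_size]
        new_max = max(running_max, max(block))
        # Re-express the sum accumulated so far relative to the new maximum.
        running_sum *= math.exp(running_max - new_max)
        running_sum += sum(math.exp(s - new_max) for s in block)
        running_max = new_max
    return [math.exp(s - running_max) / running_sum for s in scores]


# Illustrative attention scores (made-up values).
scores = [0.1, 2.5, -1.3, 4.0, 3.2, 0.7]
reference = softmax_two_pass(scores)
streamed = softmax_online(scores, block_size=2)

# Largest element-wise gap between the two orderings.
max_abs_diff = max(abs(a - b) for a, b in zip(reference, streamed))
print(f"max abs difference: {max_abs_diff:.3e}")
```

Any gap printed here comes only from rounding. In a deep model such gaps can compound across layers, which is consistent with the table's advice to use `"sdpa"` when exact numerics matter.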