Instructions to use Codemaster67/OLMO_Smiles_aware_tokenizer with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Codemaster67/OLMO_Smiles_aware_tokenizer with Transformers:
```python
# Load model directly
from transformers import AutoModel

model = AutoModel.from_pretrained("Codemaster67/OLMO_Smiles_aware_tokenizer", dtype="auto")
```
- Notebooks
- Google Colab
- Kaggle
Add SPE tokenizer (1125 SMILES subword tokens) + <|start_of_smiles|>/<|end_of_smiles|> special tokens. Trained on 2M ZINC20 + 2M ChEMBL canonical SMILES. SPE vocab_size=1000, min_freq=4000.
f6eb1d8 verified
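The commit above adds `<|start_of_smiles|>` and `<|end_of_smiles|>` as special tokens. A minimal sketch of how such delimiters might be used in plain Python follows, assuming they are meant to bracket SMILES strings embedded in ordinary text (the commit message does not state this). The helper names `wrap_smiles` and `extract_smiles` are hypothetical and not part of the repository.

```python
import re

# Special tokens named in the commit message.
START = "<|start_of_smiles|>"
END = "<|end_of_smiles|>"

# Non-greedy match of everything between one START/END pair.
_SPAN = re.compile(re.escape(START) + r"(.*?)" + re.escape(END), re.DOTALL)


def wrap_smiles(smiles: str) -> str:
    """Hypothetical helper: surround a SMILES string with the delimiters."""
    return f"{START}{smiles}{END}"


def extract_smiles(text: str) -> list[str]:
    """Hypothetical helper: return every delimited SMILES span in text."""
    return _SPAN.findall(text)


# Assumed usage: mix natural language with a delimited SMILES string.
prompt = f"Aspirin is {wrap_smiles('CC(=O)OC1=CC=CC=C1C(=O)O')}."
print(extract_smiles(prompt))
```

Whether the SPE vocabulary is applied only inside these delimiters depends on how the tokenizer in the repository is configured, which this sketch does not cover.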