SMQ Kazakh Tokenizer
A SentencePiece Unigram tokenizer trained on Kazakh-language data.
Usage
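from transformers import AutoTokenizer

# Download the tokenizer files from the Hugging Face Hub (cached locally after the first call)
tokenizer = AutoTokenizer.from_pretrained("salyamq/smq-tokenizer-kz")
# Split a Kazakh greeting into subword pieces
print(tokenizer.tokenize("Сәлеметсіз бе!"))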
Details
- Algorithm: Unigram
- Language: Kazakh (kk)
- Vocab size: 32000
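For readers unfamiliar with the Unigram algorithm listed above, the following is a minimal sketch of how a unigram model picks a segmentation: each vocabulary piece has a log-probability, and the tokenizer chooses the split whose pieces have the highest total score (found here with a Viterbi-style dynamic program). The vocabulary `TOY_LOG_PROBS`, its scores, and the function name `unigram_segment` are hypothetical and exist only for illustration. They are not taken from this tokenizer, and the real SentencePiece implementation also handles details such as whitespace markers and unknown characters that are left out here.

```python
import math

# Hypothetical toy vocabulary with made-up log-probabilities.
# These values are NOT taken from salyamq/smq-tokenizer-kz.
TOY_LOG_PROBS = {
    "с": -5.0, "ә": -5.0, "л": -5.0, "е": -5.0, "м": -5.0,
    "сә": -3.5, "лем": -3.0, "сәлем": -2.0,
}


def unigram_segment(text, log_probs):
    """Return the highest-scoring split of `text` into vocabulary pieces."""
    n = len(text)
    best = [-math.inf] * (n + 1)  # best[i]: best total log-prob of text[:i]
    back = [0] * (n + 1)          # back[i]: start index of the last piece ending at i
    best[0] = 0.0
    max_len = max(len(piece) for piece in log_probs)

    for end in range(1, n + 1):
        for start in range(max(0, end - max_len), end):
            piece = text[start:end]
            if piece not in log_probs:
                continue
            score = best[start] + log_probs[piece]
            if score > best[end]:
                best[end] = score
                back[end] = start

    if best[n] == -math.inf:
        raise ValueError("text cannot be segmented with this vocabulary")

    # Walk the back-pointers from the end of the text to recover the pieces.
    pieces = []
    i = n
    while i > 0:
        pieces.append(text[back[i]:i])
        i = back[i]
    return pieces[::-1]


print(unigram_segment("сәлем", TOY_LOG_PROBS))
print(unigram_segment("лемсә", TOY_LOG_PROBS))
```

In this toy setup a whole-word piece wins over its character-level split because its single log-probability is higher than the sum of the smaller pieces' scores. A trained model learns those scores from data.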
Author