Instructions to use mjbommar/ogbert-tokenizer-32768 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use mjbommar/ogbert-tokenizer-32768 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("mjbommar/ogbert-tokenizer-32768", dtype="auto") - Notebooks
- Google Colab
- Kaggle
Upload OGBERT tokenizer (vocab_size=32768)
Browse files- tokenizer.json +3 -1
tokenizer.json
CHANGED
|
@@ -76,7 +76,9 @@
|
|
| 76 |
"special": false
|
| 77 |
}
|
| 78 |
],
|
| 79 |
-
"normalizer":
|
|
|
|
|
|
|
| 80 |
"pre_tokenizer": {
|
| 81 |
"type": "Split",
|
| 82 |
"pattern": {
|
|
|
|
| 76 |
"special": false
|
| 77 |
}
|
| 78 |
],
|
| 79 |
+
"normalizer": {
|
| 80 |
+
"type": "Lowercase"
|
| 81 |
+
},
|
| 82 |
"pre_tokenizer": {
|
| 83 |
"type": "Split",
|
| 84 |
"pattern": {
|