---
library_name: transformers
tags: []
---
This is a tokenizer with the same settings as [bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased), except that it sets `strip_accents=False`.
It has a vocabulary size of 30K and was trained on various corpora, including Nepali Wikipedia, news articles, etc.
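
To illustrate why `strip_accents=False` matters for Nepali, here is a minimal sketch of the kind of accent stripping BERT-style tokenizers apply when the option is enabled: NFD normalization followed by dropping nonspacing marks (Unicode category `Mn`). The helper name `strip_accents` and the sample word are assumptions for illustration only, not code from this tokenizer.

```python
import unicodedata


def strip_accents(text: str) -> str:
    """Hypothetical helper: drop nonspacing marks (category Mn) after NFD normalization."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")


# "Nepali" in Devanagari, spelled with escapes so the code points are explicit:
# NA, VOWEL SIGN E, PA, VOWEL SIGN AA, LA, VOWEL SIGN II
word = "\u0928\u0947\u092a\u093e\u0932\u0940"
stripped = strip_accents(word)

# VOWEL SIGN E (U+0947) is a nonspacing mark, so accent stripping removes it
# and the word is no longer spelled correctly.
print(unicodedata.category("\u0947"))
print(word, "->", stripped)
```

Because several Devanagari vowel signs are nonspacing marks, stripping them changes the spelling of Nepali words, which is presumably why this tokenizer keeps them.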