File size: 296 Bytes

7c9b6e3
 
 
 
2b8e7b7
7c9b6e3
8880a0d

---
library_name: transformers
tags: []
---
This is a tokenizer with same settings as [bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased) but sets `strip_accents=False`.

Has a vocab size of 30K and was trained on various corpus including Nepali wikipedia, new articles etc.