---
library_name: transformers
tags: []
---
This is a tokenizer with the same settings as [bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased), except that it sets `strip_accents=False`.
It has a vocabulary size of 30K and was trained on various corpora, including Nepali Wikipedia and news articles, among others.
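
The reason `strip_accents=False` matters for Nepali can be illustrated with a small standard-library sketch. It assumes that BERT-style accent stripping works by applying NFD normalization and dropping characters in the Unicode category `Mn` (nonspacing marks). The helper name `strip_accents_like_bert` is hypothetical and is not part of this repository or of `transformers`. Several Devanagari vowel signs fall in the `Mn` category, so stripping them would change the word itself:

```python
import unicodedata


def strip_accents_like_bert(text: str) -> str:
    """Hypothetical helper: NFD-normalize, then drop nonspacing marks (Mn).

    This is an approximation of BERT-style accent stripping, written here
    only for illustration.
    """
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")


# For Latin text, stripping only removes the diacritic.
latin = "café"
print(latin, "->", strip_accents_like_bert(latin))

# For Devanagari, the vowel sign U+0947 is a nonspacing mark, so it is lost.
nepali = "नेपाली"
stripped = strip_accents_like_bert(nepali)
print(nepali, "->", stripped)

# List the code points that were removed from the Nepali word.
removed = [f"U+{ord(ch):04X}" for ch in nepali if unicodedata.category(ch) == "Mn"]
print("removed marks:", removed)
```

With accent stripping disabled, these marks are kept, so Nepali words reach the WordPiece vocabulary unchanged.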