kannada-tokenizer-50k / metadata.json
shwethd's picture
Upload 5 files
ae75114 verified
{
"vocab_size": 50000,
"corpus_file": "kannada_corpus.txt",
"min_frequency": 1,
"language": "Kannada (kn)",
"pre_tokenizer": "Whitespace"
}