File size: 116 Bytes
252a85f
cb301d1
1
2
shuf kazakh_latin_corpus.jsonl -o kazakh_latin_corpus.jsonl
grep '\S' kazakh_latin_corpus.jsonl > clean_corpus.jsonl