C0_LID_DEV / vocab.json
ntoldalagi's picture
add tokenizer
b4009cb
raw
history blame contribute delete
48 Bytes
{"|": 0, "M": 1, "E": 2, "[UNK]": 3, "[PAD]": 4}