docs: prominent input-format warning (this is a gene-ID model, not nucleotide) d8fc632 verified deskull commited on 8 days ago
fix: add [UNK] (and [PAD]) to WordLevel vocab to enable text-mode encoding 47498fa verified deskull commited on 8 days ago
Add working list-of-tokens inference example (the WordLevel tokenizer has no pre_tokenizer; text-mode encode fails) c4a7411 verified deskull commited on 12 days ago