docs: prominent input-format warning (this is a gene-ID model, not nucleotide) a489418 verified deskull commited on 21 days ago
Add working list-of-tokens inference example (the WordLevel tokenizer has no pre_tokenizer; text-mode encode fails) 0bde49f verified deskull commited on 26 days ago
Remove redundant Training section (merged into Source Code) a5486c2 verified deskull commited on Apr 24
Remove private MolCrawl-HFuploader URL from dataset references 072a1a8 verified deskull commited on Apr 24
Update model card: restore eos_token_id=None workaround, fix BERT mask_token, correct RNA prompt 73bf8b5 verified deskull commited on Apr 14
Fix tokenizer: add [UNK] to WordLevel vocab and Whitespace pre_tokenizer 906a422 verified deskull commited on Apr 6