SmilesTokenizer_PubChem_1M / tokenizer.json

Commit History

fix: correct vocab size in config.json and update token IDs in tokenizer.json
de5c2cf

kohbanye commited on

update vocab size in config.json and adjust token IDs in tokenizer.json
48c2531

kohbanye commited on

add additional special tokens for stereochemistry representation in tokenizer.json
b1641e4

kohbanye commited on

add pre-tokenizer configuration to tokenizer.json for stereochemistry
4448d6e

kohbanye commited on

add stereochemistry support
f3d21fa

kohbanye commited on

duplicate smiles-tokenizer 1m model
dbf8ea8

Seyone Chithrananda commited on