Carbon-3B / tokenizer.py

Commit History

tokenizer: expose .vocab property for fast-tokenizer-style callers
4c899c4
verified

kashif HF Staff commited on

tokenizer: fix decode() to handle torch tensor input via .tolist()
ac0ca80
verified

kashif HF Staff commited on

Update tokenizer.py
cac27f2
verified

GenerTeam commited on

Fix tokenizer: EOS bug + decode skip_special_tokens=True empty string (#1)
f47e012

kashif HF Staff commited on

Initial release 路 sourced from hf-carbon/carbon-3B-longctx-32k-from-mix2decay-429k @ step-453000
12de471
verified

loubnabnl HF Staff commited on