Carbon-500M / tokenizer.py

Commit History

tokenizer: fix decode() to handle torch tensor input via .tolist()
6411d45
verified

kashif HF Staff commited on

Update tokenizer.py
e1cc2f4
verified

GenerTeam commited on

tokenizer: fix EOS append bug, decode skip_special_tokens=True, add auto_dna_tags
a5f56cd
verified

kashif HF Staff commited on

Promote hybrid step-286000 to main (300B CE + 300B FNS, total 600B tokens)
401b752
verified

loubnabnl HF Staff commited on