Commit History

Replace OpenAI baselines with size-matched code tokenizers (StarCoder2, CodeGen, DeepSeek-Coder, GPT-2)
2507447
verified

TerminatorPower commited on

Add model card with vocab layout + benchmarks
6db98e5
verified

TerminatorPower commited on

Upload ezellm-lite tokenizer (24,600 vocab, FIM + repo specials)
36f1b8c
verified

TerminatorPower commited on