bigscience/bloom-560m

@@ -191,7 +191,7 @@ The BLOOM tokenizer ([link](https://huggingface.co/bigscience/tokenizer)) is a l
 - A simple pre-tokenization rule, no normalization
-- A vocabulary size of 250,680
+- A vocabulary size of 250,880
 It was trained on a subset of a preliminary version of the corpus using alpha-weighting per language.
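
The context line mentions alpha-weighting per language without defining it. As a rough illustration only, the sketch below assumes the common formulation in which each language's share of the corpus is raised to a power `alpha` and the results are renormalized, so that `alpha < 1` upsamples low-resource languages. The function name `alpha_weighted_probs`, the token counts, and the value `alpha=0.5` are all hypothetical; the model card does not state the exact procedure or the value of alpha that was used.

```python
def alpha_weighted_probs(sizes, alpha):
    """Return per-language sampling probabilities proportional to share ** alpha.

    Assumed formulation, not confirmed by the model card:
    alpha = 1 keeps the raw corpus proportions, alpha = 0 samples
    every language uniformly, and values in between flatten the distribution.
    """
    total = sum(sizes.values())
    # Raise each language's share of the corpus to the power alpha.
    weights = {lang: (n / total) ** alpha for lang, n in sizes.items()}
    # Renormalize so the probabilities sum to 1.
    norm = sum(weights.values())
    return {lang: w / norm for lang, w in weights.items()}


# Hypothetical token counts per language, for illustration only.
sizes = {"en": 900, "fr": 90, "sw": 10}
probs = alpha_weighted_probs(sizes, alpha=0.5)
```

With these made-up counts, the smallest language is sampled more often than its raw 1% share and the largest less often than its raw 90% share, which is the intended effect of the weighting.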