---
license: apache-2.0
datasets:
- EleutherAI/the_pile
language:
- en
---

This is a tokenizer for the Parva models, based on the GPT-NeoX tokenizers.
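
A minimal sketch of how a tokenizer like this one might be loaded and sanity-checked. It assumes the files in this repository are compatible with `AutoTokenizer` from the `transformers` library, which the card does not state explicitly. The helper names `load_tokenizer` and `roundtrip` are hypothetical and exist only for illustration, and the repository ID is left to the caller because it is not given here.

```python
def load_tokenizer(repo_id):
    """Load a tokenizer from the Hugging Face Hub or a local directory.

    `repo_id` is supplied by the caller; this card does not specify it.
    Assumes the repository is loadable through `AutoTokenizer`.
    """
    # Imported lazily so the module can be imported without `transformers`.
    from transformers import AutoTokenizer

    return AutoTokenizer.from_pretrained(repo_id)


def roundtrip(tokenizer, text):
    """Encode `text` to token IDs, then decode the IDs back to a string.

    Works with any object that exposes `encode(str)` and `decode(list)`.
    """
    ids = tokenizer.encode(text)
    return ids, tokenizer.decode(ids)
```

For example, `roundtrip(load_tokenizer("<repo-id>"), "Hello, world!")` would return the token IDs together with the decoded text, where `<repo-id>` stands in for this repository's actual Hub ID or a local path to the tokenizer files.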