Impact of Tokenization on LLaMa Russian Adaptation
Paper
•
2312.02598
•
Published
•
7
This model is a fine-tuned (embeddings, lm head) version of TheBloke/Llama-2-7B-fp16 on the Russian dataset (33GB). It achieves the following results on the evaluation set:
Instruct version: https://huggingface.co/rccmsu/ruadapt_saiga2_7b_v0.1
Russian adaptation of LLaMa-2-7B by replacing the tokenizer. Paper: Tikhomirov M., Chernyshev D. Impact of Tokenization on LLaMa Russian Adaptation //arXiv preprint arXiv:2312.02598. – 2023.
LLAMA 2 COMMUNITY LICENSE AGREEMENT
The following hyperparameters were used during training: