llama-3.1-8b-grpo-v1.3 / tokenizer_config.json

Commit History

GRPO-trained model from checkpoint-2450
38f1232
verified

CodCodingCode commited on