Fix bugs: use token param, apply Llama 3.1 chat template, decode only new tokens 1a77428 Jn-Huang commited on Dec 1, 2025