Embed chat_template in tokenizer_config.json

by piero-atelico - opened Apr 3

base: refs/heads/main ← from: refs/pr/8



Embed chat_template in tokenizer_config.json 954b9fac

piero-atelico

Apr 3

What

Adds the chat template directly to tokenizer_config.json so that tokenizer.apply_chat_template() works out of the box without needing to separately download and load chat_template.jinja.

Why

Right now the chat template ships only as a separate file, chat_template.jinja. The transformers library auto-loads it when it is present in the model directory, but many third-party tools and deployment pipelines copy only the standard tokenizer files (tokenizer.json and tokenizer_config.json). When the .jinja file is missing, tokenizer.chat_template is None and apply_chat_template() fails with:

ValueError: Cannot use chat template functions because tokenizer.chat_template is not set
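To illustrate the failure mode, here is a minimal sketch of the lookup a tool would effectively need in order to find the template. The helper name `find_chat_template` is hypothetical and this is not how transformers implements its loading; it only assumes the two file names mentioned above and uses the standard library.

```python
import json
from pathlib import Path
from typing import Optional


def find_chat_template(model_dir: str) -> Optional[str]:
    """Return the chat template found in model_dir, or None if there is none.

    Hypothetical helper for illustration only. It checks the embedded
    field first, then falls back to the standalone .jinja file.
    """
    root = Path(model_dir)

    # 1. Embedded template: the "chat_template" field of tokenizer_config.json.
    config_path = root / "tokenizer_config.json"
    if config_path.is_file():
        config = json.loads(config_path.read_text(encoding="utf-8"))
        template = config.get("chat_template")
        # Only the plain-string form is handled in this sketch.
        if isinstance(template, str) and template:
            return template

    # 2. Standalone file, which pipelines that copy only the standard
    #    tokenizer files will have left behind.
    jinja_path = root / "chat_template.jinja"
    if jinja_path.is_file():
        return jinja_path.read_text(encoding="utf-8")

    # Neither source exists: this is the case that triggers the error above.
    return None
```

With only tokenizer.json and tokenizer_config.json copied, and no embedded field, this returns None, which corresponds to the situation where apply_chat_template() raises.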

Other Gemma models (Gemma 2, Gemma 3) embed the template in tokenizer_config.json, so this seems like an oversight in the Gemma 4 release.

What changed

Embedded the contents of chat_template.jinja into the chat_template field of tokenizer_config.json. The template itself is unchanged; this only makes it available through the standard API when the .jinja file is absent.
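For anyone who wants to apply the same change to a local copy, the edit can be sketched roughly as follows. The function name `embed_chat_template` is hypothetical, and the script assumes both files sit in the same directory and that tokenizer_config.json is plain JSON; it is not necessarily the exact procedure used for this PR.

```python
import json
from pathlib import Path


def embed_chat_template(model_dir: str) -> None:
    """Copy chat_template.jinja into the chat_template field of tokenizer_config.json.

    Hypothetical helper for illustration; it rewrites the config file in place.
    """
    root = Path(model_dir)

    # Read the template verbatim so the embedded copy matches the .jinja file.
    template = (root / "chat_template.jinja").read_text(encoding="utf-8")

    config_path = root / "tokenizer_config.json"
    config = json.loads(config_path.read_text(encoding="utf-8"))

    # Set (or overwrite) the field; all other keys are left untouched.
    config["chat_template"] = template

    # ensure_ascii=False keeps any non-ASCII special tokens readable.
    config_path.write_text(
        json.dumps(config, indent=2, ensure_ascii=False) + "\n",
        encoding="utf-8",
    )
```

Calling `embed_chat_template("path/to/model")` on a local directory (the path here is a placeholder) leaves chat_template.jinja in place, so tools that read either source see the same template.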

Ready to merge

This branch is ready to get merged automatically.
