hmahadik's picture
Update Modelfile: FROM v7 GGUF, drop <end> stop (was truncating multi-tool)
a8332b8 verified
# FunctionGemma 270M Physical AI — v7, function-token format
# Function tokens (<tool_N>) + <end> terminator. ~8-15 output tokens per call.
# Optimized for CPU decode on small Cortex-A55 / similar edge targets.
FROM ./functiongemma-physical-ai-v7-Q5_K_M.gguf
PARAMETER temperature 0
PARAMETER top_p 1
PARAMETER num_ctx 1024
PARAMETER num_predict 80
# Stop on the turn-level markers ONLY, not on <end>. Multi-tool sequences
# emit <tool_A>(args)<end><tool_B>(args)<end>, and stopping at the first
# <end> truncates the second call. <end_of_turn> + <eos> are the right
# stops for both single- and multi-tool output.
PARAMETER stop "<end_of_turn>"
PARAMETER stop "<eos>"
# Use base model's chat template — training data is in messages+tools form,
# the tokenizer's chat_template.jinja already handles it.