Nahush-27's picture
Replace grpo_gk with base_fmt: Qwen3-1.7B base + format-forcing chat template (default system prompt + per-question boxed reminder). Zero training. eval-v2 34.7% overall / 50.2% MMLU-Pro (vs grpo_gk 27.0%). Template pre-baked; not re-patched.
bbaf4f0 verified