general_knowledge_model / generation_config.json
Nahush-27
Replace grpo_gk with base_fmt: Qwen3-1.7B base + format-forcing chat template (default system prompt + per-question boxed reminder). Zero training. eval-v2 34.7% overall / 50.2% MMLU-Pro (vs grpo_gk 27.0%). Template pre-baked; not re-patched.
bbaf4f0 verified
239 Bytes
{
  "bos_token_id": 151643,
  "do_sample": true,
  "eos_token_id": [
    151645,
    151643
  ],
  "pad_token_id": 151643,
  "temperature": 0.6,
  "top_k": 20,
  "top_p": 0.95,
  "transformers_version": "4.51.0"
}
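
The sampling fields above (`do_sample`, `temperature`, `top_k`, `top_p`) control how the next token is drawn at generation time. As a rough illustration only, the following standard-library sketch parses a copy of this config and applies the three filters to a toy logit vector. The helper name `filtered_distribution` and the toy logits are hypothetical and not part of this repository; the filter order (temperature, then top-k, then top-p) is assumed to mirror what the `transformers` library does, and tie-breaking or boundary behaviour in the real implementation may differ.

```python
import json
import math

# Copy of the config shown above; values are taken verbatim from the file.
CONFIG_TEXT = """
{
  "bos_token_id": 151643,
  "do_sample": true,
  "eos_token_id": [151645, 151643],
  "pad_token_id": 151643,
  "temperature": 0.6,
  "top_k": 20,
  "top_p": 0.95,
  "transformers_version": "4.51.0"
}
"""


def filtered_distribution(logits, temperature, top_k, top_p):
    """Hypothetical helper: map token index to probability after
    temperature scaling, top-k filtering, and top-p (nucleus) filtering."""
    # 1. Temperature: divide logits before softmax (values below 1 sharpen).
    scaled = [x / temperature for x in logits]

    # 2. Top-k: keep only the k highest-scoring token indices.
    order = sorted(range(len(scaled)), key=lambda i: scaled[i], reverse=True)
    kept = order[:top_k]

    # Softmax over the kept tokens (subtract the max for numerical stability).
    peak = scaled[kept[0]]
    weights = [math.exp(scaled[i] - peak) for i in kept]
    total = sum(weights)
    probs = [w / total for w in weights]

    # 3. Top-p: keep the shortest prefix whose cumulative mass reaches top_p.
    nucleus = []
    cumulative = 0.0
    for idx, p in zip(kept, probs):
        nucleus.append((idx, p))
        cumulative += p
        if cumulative >= top_p:
            break

    # Renormalise so the surviving probabilities sum to 1.
    mass = sum(p for _, p in nucleus)
    return {idx: p / mass for idx, p in nucleus}


config = json.loads(CONFIG_TEXT)

# Toy logits for a four-token vocabulary (illustrative, not model output).
toy_logits = [2.0, 1.0, 0.0, -1.0]
dist = filtered_distribution(
    toy_logits, config["temperature"], config["top_k"], config["top_p"]
)
print(dist)
```

With `do_sample` set to `true`, a token would then be drawn at random from the surviving distribution rather than taken greedily. Generation stops when either id listed in `eos_token_id` is produced.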