Commit History

Replace grpo_gk with base_fmt: Qwen3-1.7B base + format-forcing chat template (default system prompt + per-question boxed reminder). Zero training. eval-v2 34.7% overall / 50.2% MMLU-Pro (vs grpo_gk 27.0%). Template pre-baked; not re-patched.
bbaf4f0
verified

Nahush-27 commited on

Push SFT GK model (MMLU 3k + NaturalReasoning 3k, LoRA r=64)
e3c9ac3
verified

Nahush-27 commited on

Push SFT GK model (MMLU 3k + NaturalReasoning 3k, LoRA r=64)
65b0a27
verified

Nahush-27 commited on

Push SFT GK model (MMLU 3k + NaturalReasoning 3k, LoRA r=64)
c7eae7a
verified

Nahush-27 commited on

Add patched chat template: thinking ON + system prompt baked in
47ed28a
verified

Nahush-27 commited on

initial commit
31bd977
verified

Nahush-27 commited on