Replace grpo_gk with base_fmt: Qwen3-1.7B base + format-forcing chat template (default system prompt + per-question boxed reminder). Zero training. eval-v2 34.7% overall / 50.2% MMLU-Pro (vs grpo_gk 27.0%). Template pre-baked; not re-patched. bbaf4f0 verified Nahush-27 commited on 2 days ago
Push SFT GK model (MMLU 3k + NaturalReasoning 3k, LoRA r=64) e3c9ac3 verified Nahush-27 commited on 3 days ago
Push SFT GK model (MMLU 3k + NaturalReasoning 3k, LoRA r=64) 65b0a27 verified Nahush-27 commited on 5 days ago
Automated MNLP evaluation report (2026-05-20) (#1) 8c78441 Nahush-27 zechen-nlp commited on 15 days ago
Push SFT GK model (MMLU 3k + NaturalReasoning 3k, LoRA r=64) c7eae7a verified Nahush-27 commited on 17 days ago
Add patched chat template: thinking ON + system prompt baked in 47ed28a verified Nahush-27 commited on 17 days ago