Proper grad norm and alpha. Fixed template
Fine-tuned on 1k dataset distilled from Gemini 3.
Chat template
Files info
Base model