nynorsk_second_test_GRPO / chat_template.jinja

Commit History

GRPO model (assistant split heuristic reward)
78decbe
verified

pere committed on