koutch/paper_qwen_qwen3-instruct-4b_train_sft_all_train_code Text Generation • 4B • Updated Feb 1 • 4
koutch/paper_qwen_qwen3-instruct-4b_train_grpo_v1_train_code Text Generation • 4B • Updated Jan 28 • 1
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_thought Text Generation • 4B • Updated Jan 26 • 1