koutch/paper_qwen_qwen3-instruct-4b_train_sft_all_train_dual Text Generation • 4B • Updated Jan 24 • 6
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_no_think Text Generation • 4B • Updated Jan 17 • 1
koutch/paper_qwen_qwen3-instruct-4b_train_sft_all_train_think Text Generation • 4B • Updated Jan 17 • 4
koutch/short_paper_llama_2.json_train_dpo_v1_train_no_think Text Generation • 8B • Updated Jan 14 • 1
koutch/short_paper_llama_llama3.1-8b_train_sft_all_train_no_think Text Generation • 8B • Updated Jan 14 • 19
koutch/short_paper_llama_llama3.1-8b_train_sft_train_no_think Text Generation • 8B • Updated Jan 14 • 19
koutch/short_paper_qwen_qwen3-instruct-4b_train_sft_all_train_no_think Text Generation • 4B • Updated Jan 14 • 8
koutch/short_paper_qwen_qwen3-instruct-4b_train_sft_train_para Text Generation • 4B • Updated Jan 14 • 3
koutch/short_paper_qwen_qwen3-instruct-4b_train_sft_train_no_think Text Generation • 4B • Updated Jan 14 • 5
koutch/short_paper_smol_smol3-3B_train_sft_all_train_no_think Text Generation • 3B • Updated Jan 14 • 8