is-sft-271828/synlogic_grpo_qwen3-4b-base-qwen3-8b-3e-5-seq-seqlen_8192_chunksize_8 4B • Updated Jan 27
is-sft-271828/synlogic_grpo_qwen3-4b-base-qwen3-8b-3e-5-seq-seqlen_8192_chunksize_4 4B • Updated Jan 27
is-sft-271828/math_grpo_qwen3-4b-base-qwen3-8b-3e-5-seq-seqlen_8192_chunksize_8 4B • Updated Jan 27 • 1
is-sft-271828/grpo-is-qwen3-0.6b-base-beh-qwen3-8b-lr-3e-5-logspace-dm-0.3-ncoeff-1.0-nmode-flip 0.6B • Updated Jan 20