Training Reasoning Models on Saturated Problems via Failure-Prefix Conditioning Paper • 2601.20829 • Published Jan 28 • 7
guactastesgood/DeepSeek-R1-Distill-Qwen-1.5B-failure-prefix-conditioning-iteration1 2B • Updated Feb 4 • 3
guactastesgood/DeepSeek-R1-Distill-Qwen-1.5B-failure-prefix-conditioning-iteration2 Text Generation • 2B • Updated Feb 4 • 2