ReasoningDiversity/openthoughts3-math-300k-embedding-k4-similar Viewer • Updated 28 days ago • 75k • 16
ReasoningDiversity/openthoughts3-math-300k-embedding-k4-diverse Viewer • Updated 28 days ago • 75k • 15
ReasoningDiversity/openthoughts3-math-300k-embedding-k4-similar Viewer • Updated 28 days ago • 75k • 16
ReasoningDiversity/openthoughts3-math-300k-embedding-k4-diverse Viewer • Updated 28 days ago • 75k • 15
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning Paper • 2602.01058 • Published Feb 1 • 42