meituan-longcat/LongCat-Flash-Thinking-2601 • Text Generation • 562B • Updated 24 days ago • 5.65k • 100
Skywork-Reward-Data-Collection • Collection • Open-source preference datasets used to train the Skywork reward model series • 17 items • Updated Oct 12, 2024 • 21