reinforcement-learning
THU-KEG/LongWriter-Zero-32B • Text Generation • 33B • Updated Jul 3, 2025 • 15 • 111
Datasets for general finetuning
argilla/distilabel-capybara-dpo-7k-binarized • Updated Jul 16, 2024 • 7.56k • 562 • 182
Locutusque/function-calling-chatml • Updated Jul 16, 2024 • 113k • 230 • 174
pints-ai/Expository-Prose-V1 • Updated Aug 12, 2024 • 6.67M • 6 • 19