reinforcement-learning THU-KEG/LongWriter-Zero-32B Text Generation • 33B • Updated Jul 3, 2025 • 84 • • 113
Datasetets for general finetuning argilla/distilabel-capybara-dpo-7k-binarized Viewer • Updated Jul 16, 2024 • 7.56k • 5.99k • 184 Locutusque/function-calling-chatml Viewer • Updated Jul 16, 2024 • 113k • 1.37k • 177 pints-ai/Expository-Prose-V1 Viewer • Updated Aug 12, 2024 • 6.67M • 30 • 20
reinforcement-learning THU-KEG/LongWriter-Zero-32B Text Generation • 33B • Updated Jul 3, 2025 • 84 • • 113
Datasetets for general finetuning argilla/distilabel-capybara-dpo-7k-binarized Viewer • Updated Jul 16, 2024 • 7.56k • 5.99k • 184 Locutusque/function-calling-chatml Viewer • Updated Jul 16, 2024 • 113k • 1.37k • 177 pints-ai/Expository-Prose-V1 Viewer • Updated Aug 12, 2024 • 6.67M • 30 • 20