reinforcement-learning THU-KEG/LongWriter-Zero-32B Text Generation • 33B • Updated Jul 3, 2025 • 124 • • 113
Datasetets for general finetuning argilla/distilabel-capybara-dpo-7k-binarized Viewer • Updated Jul 16, 2024 • 7.56k • 2.78k • 182 Locutusque/function-calling-chatml Viewer • Updated Jul 16, 2024 • 113k • 1.93k • 175 pints-ai/Expository-Prose-V1 Viewer • Updated Aug 12, 2024 • 6.67M • 17 • 20
reinforcement-learning THU-KEG/LongWriter-Zero-32B Text Generation • 33B • Updated Jul 3, 2025 • 124 • • 113
Datasetets for general finetuning argilla/distilabel-capybara-dpo-7k-binarized Viewer • Updated Jul 16, 2024 • 7.56k • 2.78k • 182 Locutusque/function-calling-chatml Viewer • Updated Jul 16, 2024 • 113k • 1.93k • 175 pints-ai/Expository-Prose-V1 Viewer • Updated Aug 12, 2024 • 6.67M • 17 • 20