dataset-lang HuggingFaceFW/fineweb Viewer • Updated Jul 11, 2025 • 52.5B • 208k • 2.63k google/smol Viewer • Updated Oct 31, 2025 • 798k • 1.26k • 84
Language Models deepseek-ai/DeepSeek-R1 Text Generation • 685B • Updated Mar 27, 2025 • 410k • • 13k facebook/opt-125m Text Generation • Updated Sep 15, 2023 • 3.55M • 232 meta-llama/Llama-3.3-70B-Instruct Text Generation • 71B • Updated Dec 21, 2024 • 727k • • 2.64k meta-llama/Llama-3.1-8B-Instruct Text Generation • 8B • Updated Sep 25, 2024 • 10M • • 5.34k
dataset-math-reasoning bethgelab/CuratedThoughts Viewer • Updated Feb 26, 2025 • 222k • 239 • 44 open-r1/OpenR1-Math-220k Viewer • Updated Feb 18, 2025 • 450k • 13.5k • 699 facebook/natural_reasoning Viewer • Updated Feb 21, 2025 • 1.15M • 1.44k • 549 open-thoughts/OpenThoughts-114k Viewer • Updated Aug 31, 2025 • 228k • 69.6k • 794
old-language-models openai-community/gpt2 Text Generation • 0.1B • Updated Feb 19, 2024 • 6.81M • 3.1k
Code Language Models Models generating code or performing code completion refactai/Refact-1_6B-fim Text Generation • 2B • Updated Nov 9, 2023 • 166k • 141 bigcode/starcoder2-3b Text Generation • 3B • Updated Mar 4, 2024 • 128k • 215 Kwaipilot/KwaiCoder-DS-V2-Lite-Base Text Generation • 16B • Updated Jan 6, 2025 • 6 • 6
dataset-lang HuggingFaceFW/fineweb Viewer • Updated Jul 11, 2025 • 52.5B • 208k • 2.63k google/smol Viewer • Updated Oct 31, 2025 • 798k • 1.26k • 84
dataset-math-reasoning bethgelab/CuratedThoughts Viewer • Updated Feb 26, 2025 • 222k • 239 • 44 open-r1/OpenR1-Math-220k Viewer • Updated Feb 18, 2025 • 450k • 13.5k • 699 facebook/natural_reasoning Viewer • Updated Feb 21, 2025 • 1.15M • 1.44k • 549 open-thoughts/OpenThoughts-114k Viewer • Updated Aug 31, 2025 • 228k • 69.6k • 794
old-language-models openai-community/gpt2 Text Generation • 0.1B • Updated Feb 19, 2024 • 6.81M • 3.1k
Language Models deepseek-ai/DeepSeek-R1 Text Generation • 685B • Updated Mar 27, 2025 • 410k • • 13k facebook/opt-125m Text Generation • Updated Sep 15, 2023 • 3.55M • 232 meta-llama/Llama-3.3-70B-Instruct Text Generation • 71B • Updated Dec 21, 2024 • 727k • • 2.64k meta-llama/Llama-3.1-8B-Instruct Text Generation • 8B • Updated Sep 25, 2024 • 10M • • 5.34k
Code Language Models Models generating code or performing code completion refactai/Refact-1_6B-fim Text Generation • 2B • Updated Nov 9, 2023 • 166k • 141 bigcode/starcoder2-3b Text Generation • 3B • Updated Mar 4, 2024 • 128k • 215 Kwaipilot/KwaiCoder-DS-V2-Lite-Base Text Generation • 16B • Updated Jan 6, 2025 • 6 • 6