MQA
  malmaud/onestop_qa • Viewer • Updated Aug 8, 2024 • 1.46k • 203 • 13
  tasksource/ScienceQA_text_only • Viewer • Updated Jul 13, 2023 • 10.9k • 1.07k • 32
  EleutherAI/logiqa • Updated Nov 2, 2023 • 3.25k • 4
  tasksource/reclor • Viewer • Updated May 31, 2023 • 5.14k • 1.13k • 16
Small-ish SoTA (<5B), (quasi-)base
  nvidia/Minitron-4B-Base • Text Generation • Updated Feb 14, 2025 • 4.21k • 138
  h2oai/h2o-danube3-4b-base • Text Generation • 4B • Updated Jul 15, 2024 • 2.24k • 23
  stabilityai/stablelm-3b-4e1t • Text Generation • 3B • Updated Mar 7, 2024 • 95.2k • 312
  Qwen/Qwen2-1.5B • Text Generation • 2B • Updated Jun 6, 2024 • 186k • 102
SuperMC
Various multiple-choice datasets, for preference learning, focused on reasoning
  longface/logicLM • Viewer • Updated Aug 25, 2023 • 1.2k • 64 • 11
  allenai/cosmos_qa • Updated Jan 18, 2024 • 2.74k • 33
  EleutherAI/logiqa • Updated Nov 2, 2023 • 3.25k • 4
  tasksource/spartqa-mchoice • Viewer • Updated Jun 9, 2023 • 29.9k • 116 • 6
Interesting smol pretraining experiments
  UUFO-Aigis/Pico-OpenLAiNN-250M • 0.3B • Updated Feb 24, 2025 • 28 • 3
  crumb/distilpythia • Text Generation • 95.6M • Updated Jul 20, 2023 • 400 • 6
  crumb/GLORT2 • Text Generation • 0.2B • Updated Aug 26, 2024 • 7
  pszemraj/jamba-900M-v0.13-KIx2 • Text Generation • 0.9B • Updated Dec 29, 2025 • 8 • 4