torch
  tcapelle/train_ds_triton • Viewer • Updated May 21, 2025 • 887 • 10 • 1
  predibase/Predibase-T2T-32B-RFT • 33B • Updated Mar 19, 2025 • 4 • 20
  GPUMODE/KernelBook • Viewer • Updated Feb 5 • 18.2k • 522 • 51
safety
  LLM Safety Leaderboard 🥇 • Running on CPU Upgrade • Agents • 93 • Explore and submit LLM benchmarks
SFT
  meta-math/MetaMathQA • Viewer • Updated Dec 21, 2023 • 395k • 60.2k • 457
  HuggingFaceH4/ultrachat_200k • Viewer • Updated Oct 16, 2024 • 515k • 62.6k • 700
  HuggingFaceH4/ultrafeedback_binarized • Viewer • Updated Oct 16, 2024 • 187k • 13.2k • 334
  openai/gsm8k • Benchmark • Updated Mar 23 • 17.6k • 918k • 1.3k
Math
  meta-math/MetaMathQA • Viewer • Updated Dec 21, 2023 • 395k • 60.2k • 457
  open-r1/OpenR1-Math-220k • Viewer • Updated Feb 18, 2025 • 450k • 20.1k • 741