torch tcapelle/train_ds_triton Viewer • Updated May 21, 2025 • 887 • 17 • 1 predibase/Predibase-T2T-32B-RFT 33B • Updated Mar 19, 2025 • 7 • 20 GPUMODE/KernelBook Viewer • Updated 19 days ago • 18.2k • 575 • 56
safety Running on CPU Upgrade Agents 93 LLM Safety Leaderboard 🥇 93 Search, filter and submit LLM benchmark evaluations
Running on CPU Upgrade Agents 93 LLM Safety Leaderboard 🥇 93 Search, filter and submit LLM benchmark evaluations
SFT meta-math/MetaMathQA Viewer • Updated Dec 21, 2023 • 395k • 46.4k • 463 HuggingFaceH4/ultrachat_200k Viewer • Updated Oct 16, 2024 • 515k • 57.3k • 736 HuggingFaceH4/ultrafeedback_binarized Viewer • Updated Oct 16, 2024 • 187k • 11.8k • 341 openai/gsm8k Benchmark • Updated Mar 23 • 17.6k • 900k • 1.41k
eval Running on CPU Upgrade 14k Open LLM Leaderboard 🏆 14k Track, rank and evaluate open LLMs and chatbots
Running on CPU Upgrade 14k Open LLM Leaderboard 🏆 14k Track, rank and evaluate open LLMs and chatbots
Math meta-math/MetaMathQA Viewer • Updated Dec 21, 2023 • 395k • 46.4k • 463 open-r1/OpenR1-Math-220k Viewer • Updated Feb 18, 2025 • 450k • 39.9k • 762
torch tcapelle/train_ds_triton Viewer • Updated May 21, 2025 • 887 • 17 • 1 predibase/Predibase-T2T-32B-RFT 33B • Updated Mar 19, 2025 • 7 • 20 GPUMODE/KernelBook Viewer • Updated 19 days ago • 18.2k • 575 • 56
eval Running on CPU Upgrade 14k Open LLM Leaderboard 🏆 14k Track, rank and evaluate open LLMs and chatbots
Running on CPU Upgrade 14k Open LLM Leaderboard 🏆 14k Track, rank and evaluate open LLMs and chatbots
safety Running on CPU Upgrade Agents 93 LLM Safety Leaderboard 🥇 93 Search, filter and submit LLM benchmark evaluations
Running on CPU Upgrade Agents 93 LLM Safety Leaderboard 🥇 93 Search, filter and submit LLM benchmark evaluations
Math meta-math/MetaMathQA Viewer • Updated Dec 21, 2023 • 395k • 46.4k • 463 open-r1/OpenR1-Math-220k Viewer • Updated Feb 18, 2025 • 450k • 39.9k • 762
SFT meta-math/MetaMathQA Viewer • Updated Dec 21, 2023 • 395k • 46.4k • 463 HuggingFaceH4/ultrachat_200k Viewer • Updated Oct 16, 2024 • 515k • 57.3k • 736 HuggingFaceH4/ultrafeedback_binarized Viewer • Updated Oct 16, 2024 • 187k • 11.8k • 341 openai/gsm8k Benchmark • Updated Mar 23 • 17.6k • 900k • 1.41k