Fine Tuning Datasets ethanolivertroy/nist-cybersecurity-training Viewer • Updated Oct 22, 2025 • 531k • 936 • 52 darkknight25/Vulnerable_Programming_Dataset Updated May 24, 2025 • 95 • 1 WNT3D/Ultimate-Offensive-Red-Team Viewer • Updated Aug 23, 2025 • 25.6k • 571 • 146 yevzh1/Ultimate-Offensive-Red-Team Viewer • Updated Jan 4 • 25.6k • 76
Bench Datasets Idavidrein/gpqa Benchmark • Updated Mar 5 • 1.25k • 100k • 469 openai/gsm8k Benchmark • Updated Mar 23 • 17.6k • 891k • 1.4k princeton-nlp/SWE-bench_Verified Viewer • Updated Feb 18, 2025 • 500 • 654k • 359 ScaleAI/SWE-bench_Pro Benchmark • Updated Feb 23 • 731 • 67.5k • 138
Gym ServiceNow-AI/EnterpriseOps-Gym Viewer • Updated Apr 30 • 2.56k • 8.63k • 89 allenai/MolmoWeb-HumanSkills Viewer • Updated Apr 13 • 116k • 3.04k • 15 allenai/MolmoWeb-SyntheticSkills Viewer • Updated Apr 13 • 5.55k • 995 • 7 allenai/MolmoWeb-SyntheticTrajs Viewer • Updated Apr 10 • 108k • 1.09k • 11
Fine Tuning Datasets ethanolivertroy/nist-cybersecurity-training Viewer • Updated Oct 22, 2025 • 531k • 936 • 52 darkknight25/Vulnerable_Programming_Dataset Updated May 24, 2025 • 95 • 1 WNT3D/Ultimate-Offensive-Red-Team Viewer • Updated Aug 23, 2025 • 25.6k • 571 • 146 yevzh1/Ultimate-Offensive-Red-Team Viewer • Updated Jan 4 • 25.6k • 76
Gym ServiceNow-AI/EnterpriseOps-Gym Viewer • Updated Apr 30 • 2.56k • 8.63k • 89 allenai/MolmoWeb-HumanSkills Viewer • Updated Apr 13 • 116k • 3.04k • 15 allenai/MolmoWeb-SyntheticSkills Viewer • Updated Apr 13 • 5.55k • 995 • 7 allenai/MolmoWeb-SyntheticTrajs Viewer • Updated Apr 10 • 108k • 1.09k • 11
Bench Datasets Idavidrein/gpqa Benchmark • Updated Mar 5 • 1.25k • 100k • 469 openai/gsm8k Benchmark • Updated Mar 23 • 17.6k • 891k • 1.4k princeton-nlp/SWE-bench_Verified Viewer • Updated Feb 18, 2025 • 500 • 654k • 359 ScaleAI/SWE-bench_Pro Benchmark • Updated Feb 23 • 731 • 67.5k • 138