Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2, 2025 • 61
The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters • Running • 3.74k
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness Paper • 2205.14135 • Published May 27, 2022 • 15
open-llm-leaderboard/Qwen__Qwen2.5-Math-7B-Instruct-details Viewer • Updated Feb 13, 2025 • 43.2k • 9 • 1
Article Preference Tuning LLMs with Direct Preference Optimization Methods • Jan 18, 2024 • 79
Salesforce/blip-image-captioning-large Image-to-Text • 0.5B • Updated Feb 3, 2025 • 1.09M • 1.46k