Utilities Running 147 Find a leaderboard ๐ 147 Explore and discover all leaderboards from the HF community
TrainingMethodology Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients Paper โข 2407.08296 โข Published Jul 11, 2024 โข 33 Running 3.9k The Ultra-Scale Playbook ๐ 3.9k The ultimate guide to training LLM on large GPU Clusters
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients Paper โข 2407.08296 โข Published Jul 11, 2024 โข 33
Running 3.9k The Ultra-Scale Playbook ๐ 3.9k The ultimate guide to training LLM on large GPU Clusters
SyntheticDataPrep Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets Paper โข 2405.18952 โข Published May 29, 2024 โข 10
Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets Paper โข 2405.18952 โข Published May 29, 2024 โข 10
Utilities Running 147 Find a leaderboard ๐ 147 Explore and discover all leaderboards from the HF community
SyntheticDataPrep Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets Paper โข 2405.18952 โข Published May 29, 2024 โข 10
Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets Paper โข 2405.18952 โข Published May 29, 2024 โข 10
TrainingMethodology Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients Paper โข 2407.08296 โข Published Jul 11, 2024 โข 33 Running 3.9k The Ultra-Scale Playbook ๐ 3.9k The ultimate guide to training LLM on large GPU Clusters
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients Paper โข 2407.08296 โข Published Jul 11, 2024 โข 33
Running 3.9k The Ultra-Scale Playbook ๐ 3.9k The ultimate guide to training LLM on large GPU Clusters