view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 22 days ago • 484
Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery Paper • 2601.20088 • Published Jan 27 • 3