Everything you need
Quantize Hugging Face models to GGUF and publish the repo
Calculate VRAM needed to run LLMs on your GPU