Gemma 4 MTP assistant drafters as GGUF (F16/Q8_0/Q5_K_M/Q4_K_M/Q4_K_S). Speculative-decoding heads for the atomic-llama-cpp-turboquant fork.
AI & ML interests
Free Local AI Chat. Building an app for launching LLM locally and boosting it with Google TurboQuant