"Not all quantized models perform well." Serving frameworks: Ollama uses an NVIDIA GPU; llama.cpp uses the CPU with AVX & AMX.
v1k
xbruce22
AI & ML interests
None yet
Recent Activity
updated a model about 10 hours ago: xbruce22/gemma-4-e2b-reasoning-lora
published a model about 10 hours ago: xbruce22/gemma-4-e2b-reasoning-lora
liked a dataset about 13 hours ago: Jackrong/GLM-5.1-Reasoning-1M-Cleaned