Pussy Hut

PussyHut

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

nvidia/LocateAnything-3B

new activity about 1 month ago

unsloth/GLM-4.7-Flash-GGUF:NEW: LLama.cpp: Using `ngram-mod` to Get 2x Speed Boost on Long-Chats/Agent!

new activity about 1 month ago

tencent/Hy-MT2-30B-A3B:Need GGUF Quantization

Organizations

None yet

upvoted a collection 11 months ago

MMLU Pro benchmark for GGUFs (1 shot)

"Not all quantized model perform good"; serving framework Ollama uses an NVIDIA GPU, llama.cpp uses the CPU with AVX & AMX • 13 items • Updated Aug 15, 2025 • 9