John
cmp-nct
AI & ML interests
None yet
Recent Activity
new activity
7 days ago
zai-org/GLM-4.7-Flash:llama.cpp inference - 20 times (!) slower than OSS 20 on a RTX 5090
new activity
10 days ago
Alibaba-Apsara/DASD-30B-A3B-Thinking-Preview:Interesting model - how to control reasoning length?
new activity
about 1 month ago
unsloth/Nemotron-3-Nano-30B-A3B-GGUF:Should UD-Q6_K_XL identical to Q6_K.gguf?