Anjey Sapkovski
anjeysapkovski
AI & ML interests
None yet
Recent Activity
liked a dataset 27 days ago
eaddario/imatrix-calibration liked a model about 1 month ago
DavidAU/Qwen3.6-27B-Heretic-Uncensored-FINETUNE-NEO-CODE-Di-IMatrix-MAX-GGUF new activity about 1 month ago
Jackrong/Qwopus3.6-35B-A3B-v1-MTP-GGUF:Request for UD quants of the modelOrganizations
None yet
Request for UD quants of the model
🚀 1
#2 opened about 1 month ago
by
anjeysapkovski
Same speed on 5060 Ti as llamacpp MTP model
➕ 1
#5 opened about 1 month ago
by
anjeysapkovski
Starts with 50% speedup, but speed very fast decreases
2
#21 opened about 1 month ago
by
seleznyov
draft with llama.cpp?
👀 2
3
#2 opened 2 months ago
by
Schnabulator
Thank you!!
🤗 3
2
#4 opened about 2 months ago
by
zrfior
Inference broken with Jan
👀🚀 4
2
#22 opened 4 months ago
by
redaihf
The generation falls into constant repetition without any good result
🔥➕ 3
16
#2 opened 5 months ago
by
ddd2r2
cool model !!
👍 1
3
#3 opened 5 months ago
by
gopi87
Check in here for tok/s and benchmarks for local gguf models
👍 1
6
#1 opened 5 months ago
by
ykarout
Why does the KV cache occupy so much GPU memory?
13
#21 opened 5 months ago
by
yyg201708
1.5b?
🔥 4
7
#3 opened 7 months ago
by
cchance27
please make a 2.1 autoround model (NT)
❤️ 2
3
#1 opened 6 months ago
by
Khatvathiren
Intel AutoRound for best low-bit quantization
1
#5 opened 6 months ago
by
anjeysapkovski
tool calling not working as expected?
👍 2
15
#80 opened 11 months ago
by
Spider-Jerusalem
2507 Thinking model release
11
#4 opened 9 months ago
by
anjeysapkovski
Not working runtime error
👍 10
3
#25 opened 11 months ago
by
GkyEla
Request for Qwen3-30B-A3B-Thinking-2507 q2ks autoround gguf
1
#1 opened 9 months ago
by
anjeysapkovski
Qwen3-30B-A3B Coder Instruct-2507-gguf-q2ks-mixed-AutoRound
#3 opened 11 months ago
by
anjeysapkovski
Low quality inference
2
#1 opened 11 months ago
by
anjeysapkovski
Outputs senseless texts as opposed to non-QAT 27b abliterated Q6_K_M
9
#1 opened about 1 year ago
by
anjeysapkovski