Anjey Sapkovski

anjeysapkovski

21 42

AI & ML interests

None yet

Recent Activity

liked a dataset 27 days ago

eaddario/imatrix-calibration

liked a model about 1 month ago

DavidAU/Qwen3.6-27B-Heretic-Uncensored-FINETUNE-NEO-CODE-Di-IMatrix-MAX-GGUF

new activity about 1 month ago

Jackrong/Qwopus3.6-35B-A3B-v1-MTP-GGUF:Request for UD quants of the model

View all activity

Organizations

None yet

New activity in Jackrong/Qwopus3.6-35B-A3B-v1-MTP-GGUF about 1 month ago

Request for UD quants of the model

🚀 1

#2 opened about 1 month ago by

anjeysapkovski

New activity in byteshape/Qwen3.6-35B-A3B-MTP-GGUF about 1 month ago

Same speed on 5060 Ti as llamacpp MTP model

➕ 1

#5 opened about 1 month ago by

anjeysapkovski

New activity in unsloth/Qwen3.6-27B-MTP-GGUF about 1 month ago

Starts with 50% speedup, but speed very fast decreases

#21 opened about 1 month ago by

seleznyov

New activity in z-lab/Qwen3.6-35B-A3B-DFlash about 2 months ago

draft with llama.cpp?

👀 2

#2 opened 2 months ago by

Schnabulator

New activity in Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash-GGUF about 2 months ago

Thank you!!

🤗 3

#4 opened about 2 months ago by

zrfior

New activity in Nanbeige/Nanbeige4.1-3B 4 months ago

Inference broken with Jan

👀🚀 4

#22 opened 4 months ago by

redaihf

New activity in unsloth/GLM-4.7-Flash-REAP-23B-A3B-GGUF 4 months ago

The generation falls into constant repetition without any good result

🔥➕ 3

#2 opened 5 months ago by

ddd2r2

New activity in stepfun-ai/Step-3.5-Flash-GGUF-Q4_K_S 5 months ago

cool model !!

👍 1

#3 opened 5 months ago by

gopi87

New activity in unsloth/Qwen3-Coder-Next-GGUF 5 months ago

Check in here for tok/s and benchmarks for local gguf models

👍 1

#1 opened 5 months ago by

ykarout

New activity in zai-org/GLM-4.7-Flash 5 months ago

Why does the KV cache occupy so much GPU memory?

#21 opened 5 months ago by

yyg201708

New activity in FunAudioLLM/Fun-CosyVoice3-0.5B-2512 6 months ago

1.5b?

🔥 4

#3 opened 7 months ago by

cchance27

New activity in Intel/MiniMax-M2-gguf-q2ks-mixed-AutoRound 6 months ago

please make a 2.1 autoround model (NT)

❤️ 2

#1 opened 6 months ago by

Khatvathiren

New activity in Ex0bit/MiniMax-M2.1-PRISM 6 months ago

Intel AutoRound for best low-bit quantization

#5 opened 6 months ago by

anjeysapkovski

New activity in openai/gpt-oss-20b 6 months ago

tool calling not working as expected?

👍 2

#80 opened 11 months ago by

Spider-Jerusalem

New activity in Intel/Qwen3-30B-A3B-Instruct-2507-gguf-q2ks-mixed-AutoRound 6 months ago

2507 Thinking model release

#4 opened 9 months ago by

anjeysapkovski

New activity in openlifescienceai/open_medical_llm_leaderboard 8 months ago

Not working runtime error

👍 10

#25 opened 11 months ago by

GkyEla

New activity in Intel/Qwen3-30B-A3B-Thinking-2507-int4-AutoRound 9 months ago

Request for Qwen3-30B-A3B-Thinking-2507 q2ks autoround gguf

#1 opened 9 months ago by

anjeysapkovski

New activity in Intel/Qwen3-30B-A3B-Instruct-2507-gguf-q2ks-mixed-AutoRound 11 months ago

Qwen3-30B-A3B Coder Instruct-2507-gguf-q2ks-mixed-AutoRound

#3 opened 11 months ago by

anjeysapkovski

New activity in lmstudio-community/OpenReasoning-Nemotron-7B-GGUF 11 months ago

Low quality inference

#1 opened 11 months ago by

anjeysapkovski

New activity in mlabonne/gemma-3-27b-it-qat-abliterated about 1 year ago

Outputs senseless texts as opposed to non-QAT 27b abliterated Q6_K_M

#1 opened about 1 year ago by

anjeysapkovski

Anjey Sapkovski

AI & ML interests

Recent Activity

Organizations

anjeysapkovski's activity

Request for UD quants of the model

Same speed on 5060 Ti as llamacpp MTP model

Starts with 50% speedup, but speed very fast decreases

draft with llama.cpp?

Thank you!!

Inference broken with Jan

The generation falls into constant repetition without any good result

cool model !!

Check in here for tok/s and benchmarks for local gguf models

Why does the KV cache occupy so much GPU memory?

1.5b?

please make a 2.1 autoround model (NT)

Intel AutoRound for best low-bit quantization

tool calling not working as expected?

2507 Thinking model release

Not working runtime error

Request for Qwen3-30B-A3B-Thinking-2507 q2ks autoround gguf

Qwen3-30B-A3B Coder Instruct-2507-gguf-q2ks-mixed-AutoRound

Low quality inference

Outputs senseless texts as opposed to non-QAT 27b abliterated Q6_K_M