Gleb Kurchanov

nephepritou

17 13

AI & ML interests

None yet

Recent Activity

liked a model 13 days ago

lovesenko/DeepSeek-V4-Flash-DSpark-Abliterated

new activity 2 months ago

Lasimeri/MiniMax-M2.7-int4-AutoRound: OutOfMemory during weights loading (vLLM)

liked a model 3 months ago

olka-fi/Qwen3.5-122B-A10B-MXFP4

Organizations

None yet

liked a model 13 days ago

lovesenko/DeepSeek-V4-Flash-DSpark-Abliterated

165B • Updated 13 days ago • 1.77k • 7

New activity in Lasimeri/MiniMax-M2.7-int4-AutoRound 2 months ago

OutOfMemory during weights loading (vLLM)

#1 opened 2 months ago by nephepritou

liked a model 3 months ago

olka-fi/Qwen3.5-122B-A10B-MXFP4

Text Generation • 71B • Updated Feb 25 • 167 • 11

New activity in cyankiwi/Qwen3.5-122B-A10B-AWQ-4bit 4 months ago

Updated weights

#5 opened 4 months ago by nephepritou

liked a model 4 months ago

fishaudio/s2-pro

Text-to-Speech • 5B • Updated Mar 11 • 268k • 1.1k

New activity in Intel/Qwen3.5-122B-A10B-int4-AutoRound 4 months ago

Does the A100 work?

#1 opened 5 months ago by xz123321

New activity in Sehyo/Qwen3.5-122B-A10B-NVFP4 5 months ago

Quantization instruction

#6 opened 5 months ago by nephepritou

liked a model 5 months ago

Qwen/Qwen3.5-122B-A10B-FP8

Image-Text-to-Text • 125B • Updated Apr 24 • 1.02M • 109

New activity in unsloth/Qwen3-Coder-Next-FP8-Dynamic 5 months ago

Inconsistent output (resolved)

#2 opened 5 months ago by nephepritou

liked 2 models 5 months ago

Qwen/Qwen3-Coder-Next

Text Generation • 80B • Updated Feb 3 • 1.08M • 1.53k

Qwen/Qwen3-Coder-Next-FP8

Text Generation • 80B • Updated Feb 3 • 2.95M • 161

New activity in Qwen/Qwen3-Coder-Next 5 months ago

Very specific json formatting issue in tool calls

➕ 1

#14 opened 5 months ago by deleted

liked a model 6 months ago

meituan-longcat/LongCat-Flash-Lite

Text Generation • 69B • Updated Feb 6 • 5.71k • 198

New activity in zai-org/GLM-4.7-Flash 6 months ago

Thank you Z.AI, I love this model! ❤

👀❤️ 8

#43 opened 6 months ago by MrDevolver

Model breaks apart when used with different languages

#38 opened 6 months ago by nephepritou

Enormous KV-cache size?

👍➕ 6

#3 opened 6 months ago by nephepritou

Why does the KV cache occupy so much GPU memory?

#21 opened 6 months ago by yyg201708

New activity in cyankiwi/GLM-4.5-Air-AWQ-4bit 7 months ago

Running on 4 GPUs with TP=4

#11 opened 8 months ago by nephepritou

New activity in nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 7 months ago

Tool calling with reasoning parsing broken

#3 opened 7 months ago by nephepritou

New activity in cyankiwi/GLM-4.6V-AWQ-4bit 7 months ago

Question about group size

#1 opened 7 months ago by nephepritou
