Jeremy Lam

whoisjeremylam

AI & ML interests

None yet

Recent Activity

new activity 7 days ago

AesSedai/MiMo-V2.5-Pro-GGUF:MiMo V2.5 Pro might be the most stable IQ2_S quant

new activity about 1 month ago

ubergarm/GLM-5.1-GGUF:Draft llama.cpp PR for DSA (DeepSeek Sparse Attention)

new activity about 2 months ago

ubergarm/GLM-5.1-GGUF:render_message_to_json: Neither string content nor typed content is supported by the template. This is unexpected and may lead to issues.

Organizations

None yet

New activity in AesSedai/MiMo-V2.5-Pro-GGUF 7 days ago

MiMo V2.5 Pro might be the most stable IQ2_S quant

❤️🚀 1

#4 opened 7 days ago by

whoisjeremylam

New activity in ubergarm/GLM-5.1-GGUF about 1 month ago

Draft llama.cpp PR for DSA (DeepSeek Sparse Attention)

👍 1

#8 opened about 1 month ago by

whoisjeremylam

New activity in ubergarm/GLM-5.1-GGUF about 2 months ago

render_message_to_json: Neither string content nor typed content is supported by the template. This is unexpected and may lead to issues.

#7 opened about 2 months ago by

whoisjeremylam

New activity in HauhauCS/Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive 2 months ago

Thanks. This is by far the best denial stripping I've ever seen.

👍 4

#30 opened 2 months ago by

phil111

New activity in unsloth/Qwen3.5-397B-A17B-GGUF 3 months ago

Do the 397B quants need to be downloaded again?

➕ 1

#15 opened 3 months ago by

whoisjeremylam

New activity in concavity-ai/superlinear-exp-v0.1 4 months ago

Q8 and smaller quants

#1 opened 4 months ago by

whoisjeremylam

New activity in ubergarm/Kimi-K2-Thinking-GGUF 6 months ago

smol-IQ2_KS also passes the official K2 Vendor Verifier test!

🔥 2

#15 opened 6 months ago by

whoisjeremylam

Output not respecting line breaks?

#10 opened 7 months ago by

justj0sh

New activity in ubergarm/Kimi-K2-Instruct-GGUF 6 months ago

IQ2_KS passes the Moonshot K2 Vendor Verifier test

🔥 3

#8 opened 6 months ago by

whoisjeremylam

New activity in ubergarm/Kimi-K2-Instruct-0905-GGUF 7 months ago

IQ2_KS

#1 opened 9 months ago by

gghfez

New activity in cyankiwi/Qwen3-Omni-30B-A3B-Instruct-AWQ-4bit 7 months ago

Strange warning on first completion w/ vLLM 0.11.0

#6 opened 7 months ago by

whoisjeremylam

New activity in zai-org/GLM-4.6 7 months ago

How to stop reasoning?

#16 opened 8 months ago by

yuchenxie

New activity in cyankiwi/GLM-4.5V-AWQ-4bit 7 months ago

Best local VLM so far that I've found that fits in 96 GB of VRAM

#2 opened 7 months ago by

whoisjeremylam

New activity in INC4AI/GLM-4.6-gguf-q2ks-mixed-AutoRound 8 months ago

Model quality relative to other quantization techniques?

#1 opened 8 months ago by

spanspek

New activity in Intel/Ling-flash-2.0-gguf-q2ks-mixed-AutoRound 8 months ago

Inference with llama.cpp + Open WebUI gives repeating `?`

#1 opened 8 months ago by

whoisjeremylam

New activity in cerebras/Qwen3-Coder-REAP-25B-A3B 8 months ago

Aider Polyglot Benchmark

👍 1

#1 opened 8 months ago by

whoisjeremylam

New activity in Intel/Qwen3-Next-80B-A3B-Thinking-int4-AutoRound 9 months ago

Difference between int4-mixed and int4

#1 opened 9 months ago by

whoisjeremylam

New activity in ubergarm/DeepSeek-V3.1-GGUF 9 months ago

quant req: 256 GB RAM + 96 GB VRAM

👍 1

#1 opened 10 months ago by

whoisjeremylam

New activity in ubergarm/Kimi-K2-Instruct-GGUF 9 months ago

Negligible loss of PPL when using only 6 of 8 experts

🔥 2

#7 opened 9 months ago by

whoisjeremylam

New activity in unsloth/DeepSeek-V3-GGUF about 1 year ago

Dynamic quants

#13 opened over 1 year ago by

XelotX
