Jeremy Lam
whoisjeremylam
AI & ML interests
None yet
Recent Activity
new activity about 10 hours ago
AesSedai/MiMo-V2.5-Pro-GGUF: MiMo V2.5 Pro might be the most stable IQ2_S quant
new activity 26 days ago
ubergarm/GLM-5.1-GGUF: Draft llama.cpp PR for DSA (Deepseek Sparse Attention)
Organizations
None yet
MiMo V2.5 Pro might be the most stable IQ2_S quant
#4 opened about 10 hours ago
by
whoisjeremylam
Draft llama.cpp PR for DSA (Deepseek Sparse Attention)
👍 1
1
#8 opened 26 days ago
by
whoisjeremylam
Thanks. This is by far the best denial stripping I've ever seen.
👍 4
2
#30 opened 2 months ago
by
phil111
Do the 397B quants need to be downloaded again?
➕ 1
2
#15 opened 3 months ago
by
whoisjeremylam
Q8 and smaller quants
3
#1 opened 4 months ago
by
whoisjeremylam
smol-IQ2_KS also passes the official K2 Vendor Verifier test!
🔥 2
1
#15 opened 6 months ago
by
whoisjeremylam
Output not respecting line breaks?
12
#10 opened 7 months ago
by
justj0sh
IQ2_KS passes the Moonshot K2 Vendor Verifier test
🔥 3
4
#8 opened 6 months ago
by
whoisjeremylam
Strange warning on first completion w/ vLLM 0.11.0
#6 opened 7 months ago
by
whoisjeremylam
How to stop reasoning?
3
#16 opened 8 months ago
by
yuchenxie
Best local VLM so far that I've found that fits in 96 GB of VRAM
#2 opened 7 months ago
by
whoisjeremylam
Model quality relative to other quantization techniques?
1
#1 opened 7 months ago
by
spanspek
Inference with llama.cpp + Open WebUI gives repeating `?`
4
#1 opened 7 months ago
by
whoisjeremylam
Aider Polyglot Benchmark
👍 1
1
#1 opened 7 months ago
by
whoisjeremylam
Difference between int4-mixed and int4
1
#1 opened 9 months ago
by
whoisjeremylam
quant req: 256 GB RAM + 96 GB VRAM
👍 1
28
#1 opened 9 months ago
by
whoisjeremylam
Negligible loss of PPL when using only 6 of 8 experts
🔥 2
3
#7 opened 9 months ago
by
whoisjeremylam
Dynamic quants
3
#13 opened over 1 year ago
by
XelotX