Josh Warner
JDWarner
AI & ML interests
None yet
Recent Activity
liked a model 4 days ago
DJLougen/Qwable-5-27B-Coder
liked a model 9 days ago
InternScience/Agents-K1
new activity 18 days ago
reinforce20001/gemma4-26b-a4b-it-qat-w4a16-ct: Methodology?
Organizations
None yet
Methodology?
1
#1 opened 18 days ago
by
JDWarner
Is MTP possible?
3
#2 opened about 2 months ago
by
JDWarner
Scaling with concurrency?
2
#1 opened 2 months ago
by
JDWarner
dflash with quantize model
1
#5 opened 2 months ago
by
Shimon324
FP8 work for base model or is 16-bit of 27B required?
17
#2 opened 3 months ago
by
unoid
pruned version
🔥👀 1
2
#16 opened 3 months ago
by
pirola
There's got to be a better way.
23
#6 opened 3 months ago
by
phil111
Recall from embed documents not as good as the original
5
#4 opened 3 months ago
by
o0Linny0o
A wild idea / suggestion...
🔥 3
2
#4 opened 3 months ago
by
MrDevolver
Consider releasing full BF16 weights
2
#1 opened 3 months ago
by
JDWarner
good model
5
#1 opened 3 months ago
by
Roman1111111
Work great on 3090 except for weird (...) generation
❤️ 1
6
#1 opened 3 months ago
by
ortegaalfredo
Qwopus with visual capabilities?
2
#19 opened 3 months ago
by
AQLabs
Security/Compliance Audit: EU AI Act & NIST Exposure
🔥 1
3
#8 opened 4 months ago
by
tradeapollo
FP8 models
3
#1 opened 4 months ago
by
ecopoiesis
IQ5_K 136.891 GiB
🔥 2
30
#9 opened 5 months ago
by
Hunterx
Request: GGUF / quantized weights for Intern-S1-Pro
1
#7 opened 5 months ago
by
gileneo
INT8 quantization for KVCache on DGX Spark/GB10
4
#6 opened 5 months ago
by
JDWarner
This just trades general performance for domain specific gains.
🔥👍 16
11
#3 opened 10 months ago
by
phil111
Disable thinking mode in Jan-v1-4B model
2
#9 opened 10 months ago
by
vuhaix95