Steve Li
CHNtentes
AI & ML interests
None yet
Recent Activity
new activity 2 days ago
unsloth/Qwen-AgentWorld-35B-A3B-GGUF:mmproj file so small! liked a model 7 days ago
zai-org/GLM-5.2-FP8 liked a model 11 days ago
zai-org/GLM-5.2Organizations
None yet
mmproj file so small!
7
#1 opened 2 days ago
by
CHHORVORN
You open-sourced my ass - 你“开源”我的屁吧!
🚀👍 3
3
#1 opened about 1 month ago
by
JLouisBiz
Noticeable Performance Decrease
👍 3
4
#23 opened about 1 month ago
by
WebWeaverWraith
Can I run this model on 2x H20 141GB?
3
#1 opened about 2 months ago
by
CHNtentes
Is it possible to only download the mtp gguf (<1GB one) to use with existing ggufs?
4
#3 opened about 2 months ago
by
CHNtentes
Will there be small models like 12b?
👍👀 5
15
#164 opened about 2 months ago
by
Crownelius
Too big to run locally.
🤯👍 12
20
#12 opened 2 months ago
by
Dampfinchen
所以我猜是混合精度加模型太大导致暂时还没有量化的模型出来
6
#96 opened 2 months ago
by
lzm1066258
May I ask if there is a deployment document?
2
#10 opened 2 months ago
by
jerryliujiawei
Parameter model.layers.15.mlp.gate_gate_up_proj.weight_scale_inv not found in params_dict
5
#3 opened 2 months ago
by
CHNtentes
太吃显存啦
6
#21 opened 2 months ago
by
yukojiangjiang
These are NOT actual AWQ-quantized models.
4
#2 opened 2 months ago
by
cai-cai
larger file size for same quant
5
#4 opened 3 months ago
by
CHNtentes
will we ever get 32b and 235b versions?
#4 opened 3 months ago
by
CHNtentes
GLM5.1角色问题-重要
9
#17 opened 3 months ago
by
liuyt6515
can we get minimax-m2.7
🤗 13
5
#49 opened 3 months ago
by
CHNtentes
35b variant?
👍 4
9
#2 opened 4 months ago
by
dagbs
FP8 Version for running on vLLM with hardware optimizations from Ada+ generation GPUs
4
#14 opened 4 months ago
by
AQLabs
Could someone make Qwen/Qwen3.5-0.4B?
3
#4 opened 4 months ago
by
MihaiPopa-1