Steve Li's picture

Steve Li

CHNtentes

·

CHNtentes

AI & ML interests

None yet

Recent Activity

new activity 3 days ago

unsloth/Qwen-AgentWorld-35B-A3B-GGUF:mmproj file so small!

liked a model 8 days ago

zai-org/GLM-5.2-FP8

liked a model 11 days ago

zai-org/GLM-5.2

View all activity

Organizations

None yet

New activity in unsloth/Qwen-AgentWorld-35B-A3B-GGUF 3 days ago

mmproj file so small!

#1 opened 3 days ago by

New activity in Comfy-Org/Ideogram-4 17 days ago

Flash attention optimization for significant speedup. - old title: Optimization tips to maximize generation speed?

#6 opened 23 days ago by

New activity in tencent/Hy-MT2-1.8B about 1 month ago

You open-sourced my ass - 你“开源”我的屁吧！

#1 opened about 1 month ago by

New activity in unsloth/Qwen3.6-27B-MTP-GGUF about 1 month ago

Noticeable Performance Decrease

#23 opened about 1 month ago by

WebWeaverWraith

New activity in canada-quant/DeepSeek-V4-Flash-W4A16-FP8 about 2 months ago

Can I run this model on 2x H20 141GB?

#1 opened about 2 months ago by

New activity in havenoammo/Qwen3.6-35B-A3B-MTP-GGUF about 2 months ago

Is it possible to only download the mtp gguf (<1GB one) to use with existing ggufs?

#3 opened about 2 months ago by

New activity in deepseek-ai/DeepSeek-V4-Pro about 2 months ago

Will there be small models like 12b?

#164 opened about 2 months ago by

New activity in deepseek-ai/DeepSeek-V4-Flash about 2 months ago

Too big to run locally.

#12 opened 2 months ago by

New activity in deepseek-ai/DeepSeek-V4-Pro 2 months ago

所以我猜是混合精度加模型太大导致暂时还没有量化的模型出来

#96 opened 2 months ago by

New activity in deepseek-ai/DeepSeek-V4-Flash 2 months ago

May I ask if there is a deployment document?

#10 opened 2 months ago by

New activity in Qwen/Qwen3.6-27B-FP8 2 months ago

Parameter model.layers.15.mlp.gate_gate_up_proj.weight_scale_inv not found in params_dict

#3 opened 2 months ago by

New activity in Qwen/Qwen3.6-35B-A3B 2 months ago

太吃显存啦

#21 opened 2 months ago by

New activity in cyankiwi/MiniMax-M2.7-AWQ-4bit 2 months ago

These are NOT actual AWQ-quantized models.

#2 opened 2 months ago by

New activity in unsloth/MiniMax-M2.7-GGUF 3 months ago

larger file size for same quant

#4 opened 3 months ago by

New activity in Tongyi-MAI/MAI-UI-8B 3 months ago

will we ever get 32b and 235b versions?

#4 opened 3 months ago by

New activity in zai-org/GLM-5.1 3 months ago

GLM5.1角色问题-重要

#17 opened 3 months ago by

New activity in MiniMaxAI/MiniMax-M2.5 3 months ago

can we get minimax-m2.7

#49 opened 3 months ago by

New activity in Tesslate/OmniCoder-9B 3 months ago

35b variant?

#2 opened 4 months ago by

New activity in Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled 4 months ago

FP8 Version for running on vLLM with hardware optimizations from Ada+ generation GPUs

#14 opened 4 months ago by

New activity in Qwen/Qwen3.5-0.8B 4 months ago

Could someone make Qwen/Qwen3.5-0.4B?

#4 opened 4 months ago by