17 6

zhangchenchen

zhnagchenchne

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

openai/circuit-sparsity

new activity 3 months ago

Qwen/Qwen3-Omni-30B-A3B-Thinking:vLLM serve for Qwen3-Omni currently only supports the thinker model.

new activity 4 months ago

ByteDance-Seed/Seed-OSS-36B-Instruct:Hiding Thinking Process

View all activity

Organizations

liked a model about 2 months ago

openai/circuit-sparsity

Text Generation • 0.4B • Updated Dec 12, 2025 • 242 • 202

New activity in Qwen/Qwen3-Omni-30B-A3B-Thinking 3 months ago

vLLM serve for Qwen3-Omni currently only supports the thinker model.

#8 opened 3 months ago by

zhnagchenchne

New activity in ByteDance-Seed/Seed-OSS-36B-Instruct 4 months ago

Hiding Thinking Process

#25 opened 5 months ago by

Xzendor7

New activity in ByteDance-Seed/Seed-OSS-36B-Instruct 5 months ago

Where is the inference/vllm_tool_call.py inference/vllm_chat.py generate.py ？

#27 opened 5 months ago by

zhnagchenchne

New activity in moonshotai/Kimi-K2-Instruct 7 months ago

What actually is the EOS token for this model?

#31 opened 7 months ago by

jukofyork

New activity in RedHatAI/Kimi-K2-Instruct-quantized.w4a16 7 months ago

Keep outputting empty content without stopping

#3 opened 7 months ago by

zhnagchenchne

New activity in adamo1139/DeepSeek-R1-0528-AWQ 7 months ago

About AWQ

#3 opened 7 months ago by

zhnagchenchne

New activity in moonshotai/Kimi-K2-Instruct 7 months ago

Run 1T-param on A100/H100(80G)x8 using FP4

🚀 🔥 5

#9 opened 7 months ago by

ghostplant

New activity in Qwen/Qwen3-Embedding-0.6B 8 months ago

onnx-community/Qwen3-Embedding-0.6B-ONNX

#25 opened 8 months ago by

zhnagchenchne

New activity in QuixiAI/DeepSeek-R1-0528-AWQ 8 months ago

The result is problematic.

#3 opened 8 months ago by

zhnagchenchne

tokenizer_config.json

#2 opened 8 months ago by

zhnagchenchne

liked a model 8 months ago

QuixiAI/DeepSeek-R1-0528-AWQ

Text Generation • 671B • Updated Jun 1, 2025 • 28 • 19

liked a model 10 months ago

deepseek-ai/DeepSeek-Prover-V2-671B

Text Generation • 685B • Updated Apr 30, 2025 • 294 • • 817

New activity in deepseek-ai/DeepSeek-Prover-V2-671B 10 months ago

DeepSeek-Prover-V1 的升级版？

#13 opened 10 months ago by

zhnagchenchne

liked 2 models 11 months ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • 685B • Updated Mar 27, 2025 • 242k • • 3.09k

QuixiAI/DeepSeek-V3-0324-AWQ

Text Generation • 671B • Updated Mar 29, 2025 • 115 • 23

updated a model 11 months ago

YiXin-AILab/YiXin-Distill-Qwen-72B-AWQ

73B • Updated Mar 25, 2025 • 6 • 1

published a model 11 months ago

YiXin-AILab/YiXin-Distill-Qwen-72B-AWQ

73B • Updated Mar 25, 2025 • 6 • 1

New activity in YiXin-AILab/YiXin-Distill-Qwen-72B 11 months ago

Calibration dataset

#4 opened 11 months ago by

AlphaGaO

Please increase code reasoning skills

#5 opened 11 months ago by

xldistance

zhangchenchen

AI & ML interests

Recent Activity

Organizations

zhnagchenchne's activity

vLLM serve for Qwen3-Omni currently only supports the thinker model.

Hiding Thinking Process

Where is the inference/vllm_tool_call.py inference/vllm_chat.py generate.py ？

What actually is the EOS token for this model?

Keep outputting empty content without stopping

About AWQ

Run 1T-param on A100/H100(80G)x8 using FP4

onnx-community/Qwen3-Embedding-0.6B-ONNX

The result is problematic.

tokenizer_config.json

DeepSeek-Prover-V1 的升级版？

Calibration dataset

Please increase code reasoning skills