zhangchenchen
zhnagchenchne
AI & ML interests
None yet
Organizations
vLLM serve for Qwen3-Omni currently only supports the thinker model.
#8 opened 5 months ago
by
zhnagchenchne
Hiding Thinking Process
2
#25 opened 7 months ago
by
Xzendor7
Where are inference/vllm_tool_call.py, inference/vllm_chat.py, and generate.py?
2
#27 opened 6 months ago
by
zhnagchenchne
What actually is the EOS token for this model?
4
#31 opened 8 months ago
by
jukofyork
Keep outputting empty content without stopping
2
#3 opened 8 months ago
by
zhnagchenchne
About AWQ
#3 opened 8 months ago
by
zhnagchenchne
Run 1T-param on A100/H100(80G)x8 using FP4
7
#9 opened 9 months ago
by
ghostplant
onnx-community/Qwen3-Embedding-0.6B-ONNX
#25 opened 10 months ago
by
zhnagchenchne
The result is problematic.
1
#3 opened 10 months ago
by
zhnagchenchne
tokenizer_config.json
#2 opened 10 months ago
by
zhnagchenchne
An upgraded version of DeepSeek-Prover-V1?
1
#13 opened 11 months ago
by
zhnagchenchne
Calibration dataset
1
#4 opened about 1 year ago
by
AlphaGaO
Please increase code reasoning skills
2
#5 opened about 1 year ago
by
xldistance
Convert to ONNX.
6
#12 opened over 2 years ago
by
angellee
vllm version
2
#4 opened about 1 year ago
by
zhnagchenchne