宋小猫
SongXiaoMao
AI & ML interests
None yet
Recent Activity
liked a model 3 days ago
Qwen/Qwen3.6-27B-FP8 liked a model 3 days ago
Qwen/Qwen3.6-27B new activity 5 days ago
nerkyor/Qwen3.6-27B-DSV4Pro-Thinking-Distill:VLLM启动报错Organizations
None yet
VLLM启动报错
9
#2 opened 6 days ago
by
SongXiaoMao
MTP efficiency without official FP8 ha
👍 1
#1 opened about 1 month ago
by
SongXiaoMao
MTP cannot be accelerated
#1 opened about 1 month ago
by
SongXiaoMao
The official VLLM example starts normal inference error
#3 opened 2 months ago
by
SongXiaoMao
This model cannot use MTP
4
#2 opened 2 months ago
by
SongXiaoMao
Modify the configuration file
🔥 1
1
#1 opened 2 months ago
by
SongXiaoMao
FP8 work for base model or is 16-bit of 27B required?
17
#2 opened 3 months ago
by
unoid
Is there anyone who can tell me how to run this model with vllm correctly?
😔 3
7
#8 opened 3 months ago
by
beginor
Can the big guy quantify this model into MXFP4? Thank you!!
#3 opened 3 months ago
by
SongXiaoMao
How does the VLLM start this model?
2
#4 opened 3 months ago
by
SongXiaoMao
This quantization model is amzing
❤️👍 2
5
#1 opened 4 months ago
by
hyunw55
Why is the file size of 4bit similar to FP8?
3
#2 opened 3 months ago
by
SongXiaoMao
Sensitive information is not a question
2
#3 opened 3 months ago
by
SongXiaoMao
VLLM 0.18.0 runs with an error
#2 opened 3 months ago
by
SongXiaoMao
I get an error using vllm0.18.0
1
#1 opened 3 months ago
by
SongXiaoMao
使用VLLM启动会报错
#3 opened 3 months ago
by
SongXiaoMao
Tokenizer class TokenizersBackend does not exist in vllm v0.17.1
12
#26 opened 4 months ago
by
putcn
Can you make a quantitative model? Qwen3.5-122B-A10B-GPTQ-Int4
#2 opened 4 months ago
by
SongXiaoMao
When will you fix the model replies missing</think>\n start tags
18
#19 opened over 1 year ago
by
xldistance