Hebangwen
hebangwen
AI & ML interests
None yet
Recent Activity
new activity 10 days ago
Qwen/Qwen3.5-35B-A3B: RIP
liked a model 10 days ago
stepfun-ai/NextStep-1.1
new activity 11 days ago
Qwen/Qwen3.5-9B: DGX SPARK VLLM RESULTS
Organizations
RIP
Reactions: 10
2
#44 opened 11 days ago
by
kabachuha
DGX SPARK VLLM RESULTS
Reactions: 2
2
#4 opened 12 days ago
by
RGMC98
llama.cpp support please.
❤️ 17
3
#1 opened about 2 months ago
by
rosspanda0
Some question about Inference Benchmark
1
#8 opened about 2 months ago
by
hebangwen
How can we access the acoustic encoder and semantics encoder?
1
#20 opened 3 months ago
by
hebangwen
Humble request for a stable vLLM/SGLang deployment setup for DeepSeek-V3.2
Reactions: 13 (including ❤️)
11
#15 opened 3 months ago
by
burrowswang
make deepseekocr compatible with mps and latest transformers
#96 opened 4 months ago
by
hebangwen
Wrong link to the `ARCHITECTURE_BLOG`
1
#16 opened 8 months ago
by
hebangwen
Question about the figure showing GQA.
Reactions: 2
1
#8 opened 8 months ago
by
hebangwen
AssertionError: Both operands must be same dtype. Got fp16 and bf16
20
#8 opened 11 months ago
by
treehugg3
Will R2 be released before the National Day or Mid-Autumn Festival?
4
#28 opened 10 months ago
by
dashi6174
just a question about params
6
#17 opened 12 months ago
by
xubin-bruce
How to get speech synthesized and speech recognized?
4
#3 opened about 1 year ago
by
supercharge19
Thank you, Elon Musk! You are truly leading by example! Respect to you sir!
Reactions: 22 (including ❤️)
7
#15 opened almost 2 years ago
by
jobenb