Performance Discussion
#1
by
IndenScale
- opened
能代替 Qwen3 30B 系列吗?
这个尺寸的模型非常适合上生产。或者作为 pre-commit/push hooks 用来构建多层次的 CI Guardrail。
后续会更新在 MLX 上的性能表现。
I wanted to know token per second speed.
I wanted to know token per second speed.
I'll try using it with vllm on my AI max 395+