Paddi
Butzermoggel
AI & ML interests
None yet
Organizations
None yet
Max model len is 32768 when serving with vllm and not 40960
2
#19 opened 10 months ago
by
f14
Multimodal ToolMessage
#77 opened 10 months ago
by
Butzermoggel
vLLM example for 'Offline' should include an input image.
❤️ 1
2
#47 opened 12 months ago
by
stev236
Multi-GPU inference: RuntimeError: Expected all tensors to be on the same device
🔥 1
3
#4 opened over 1 year ago
by
Butzermoggel