serving (like vllm, etc)

#7
by prudant - opened

is there any way to serve the model in a high concurrent scenario?

regards

Sign up or log in to comment