hi i write a openai compatible api server for wedlm 8b
#7
by
CHONGYOEYAT
- opened
π Just got Tencent's WeDLM running with an OpenAI-compatible API server! π
Super smooth setup using FastAPI + UV package manager. Now I can easily integrate WeDLM-8B-Instruct into my projects with standard OpenAI client libraries.
π¦ Quick setup:
1οΈβ£ Install dependencies
2οΈβ£ Download the model
3οΈβ£ Run the server
4οΈβ£ Start making API calls!
Already seeing great performance with flash-attention optimization. If you're working with LLMs, definitely worth checking out.
Check it out here: https://gist.github.com/cyysky/bedad2eeb7e440f5838e7cb8d2f11ca6