tencent/WeDLM-8B-Instruct

exlaw committed 4 days ago

Commit c09b3b5 · verified · 1 Parent(s): 711665c

Upload folder using huggingface_hub

Files changed (1)

README.md +1 -1


@@ -19,7 +19,7 @@ tags:
 - 📈 Outperforms base Qwen3-8B-Instruct on most benchmarks
 - ✅ Native KV cache compatible (FlashAttention, PagedAttention, CUDA Graphs)
-For the base (pretrained) version, see [WeDLM-8B](https://huggingface.co/tencent/WeDLM-8B-Base).
+For the base (pretrained) version, see [WeDLM-8B](https://huggingface.co/tencent/WeDLM-8B-Base), which is based on Qwen3-8B-Base.
 📄 Paper (Coming Soon) | 🌐 [Project Page](https://wedlm.github.io) | 💻 [GitHub](https://github.com/tencent/WeDLM)