exlaw commited on
Commit
711665c
Β·
verified Β·
1 Parent(s): ba45e22

Upload folder using huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -19,7 +19,7 @@ tags:
19
  - πŸ“ˆ Outperforms base Qwen3-8B-Instruct on most benchmarks
20
  - βœ… Native KV cache compatible (FlashAttention, PagedAttention, CUDA Graphs)
21
 
22
- For the base (pretrained) version, see [WeDLM-8B](https://huggingface.co/tencent/WeDLM-8B).
23
 
24
  πŸ“„ Paper (Coming Soon) | 🌐 [Project Page](https://wedlm.github.io) | πŸ’» [GitHub](https://github.com/tencent/WeDLM)
25
 
@@ -27,7 +27,7 @@ For the base (pretrained) version, see [WeDLM-8B](https://huggingface.co/tencent
27
 
28
  | Attribute | Value |
29
  |:----------|:------|
30
- | Base Model | [WeDLM-8B](https://huggingface.co/tencent/WeDLM-8B) |
31
  | Parameters | 8B |
32
  | Context Length | 32,768 |
33
 
 
19
  - πŸ“ˆ Outperforms base Qwen3-8B-Instruct on most benchmarks
20
  - βœ… Native KV cache compatible (FlashAttention, PagedAttention, CUDA Graphs)
21
 
22
+ For the base (pretrained) version, see [WeDLM-8B](https://huggingface.co/tencent/WeDLM-8B-Base).
23
 
24
  πŸ“„ Paper (Coming Soon) | 🌐 [Project Page](https://wedlm.github.io) | πŸ’» [GitHub](https://github.com/tencent/WeDLM)
25
 
 
27
 
28
  | Attribute | Value |
29
  |:----------|:------|
30
+ | Base Model | [WeDLM-8B](https://huggingface.co/tencent/WeDLM-8B-Base) |
31
  | Parameters | 8B |
32
  | Context Length | 32,768 |
33