Update README.md (#2)

- Update README.md (f9b16284eebe09659076369c0075f5e5937e8c63)

Co-authored-by: Bo Zheng <bzheng@users.noreply.huggingface.co>

Files changed (1) hide show

README.md CHANGED Viewed

@@ -59,7 +59,7 @@ For more details, please refer to our blog post [Qwen3.5](https://qwen.ai/blog?i
         - Rotary Position Embedding Dimension: 64
     - Feed Forward Network:
         - Intermediate Dimension: 9216
-    - LM Output: 248320 (Padded)
     - MTP: trained with multi-steps
 - Context Length: 262,144 natively and extensible up to 1,010,000 tokens.

         - Rotary Position Embedding Dimension: 64
     - Feed Forward Network:
         - Intermediate Dimension: 9216
+    - LM Output: 248320 (Tied to token embedding)
     - MTP: trained with multi-steps
 - Context Length: 262,144 natively and extensible up to 1,010,000 tokens.