Minh2508/Decode

Minh2508 committed on Mar 30

Commit e32e654 · verified · 1 Parent(s): 172db2d

Update README.md

Files changed (1)

README.md +1 -1

@@ -24,7 +24,7 @@ model-index:
 # 🚀 Decode-12B-MoE: High-Performance Mixture of Experts Model
 **Decode-12B-MoE** is a Large Language Model (LLM) utilizing a **Sparse Mixture of Experts (MoE)** architecture with a total of **12.5 billion parameters**. This model is engineered to bridge the gap between massive parameter counts and computational efficiency, activating only a fraction of its weights (~2.5B) during inference.
+** Untrained model! **
 ## 📌 Technical Specifications
 | Attribute | Value |