lccurious committed (verified)
Commit e43dd0e · 1 parent: b75e044

Update README.md

Files changed (1): README.md (+2 −2)

README.md CHANGED
@@ -8,7 +8,7 @@ tags:
 - text_generation
 ---
 DA2.0-flash-preview
-**LLaDA2-flash-preview** is a diffusion language model featuring a 100BA6B Mixture-of-Experts (MoE) architecture. As an enhanced, instruction-tuned iteration of the LLaDA series, it is optimized for practical applications.
+**LLaDA2-flash-preview** is a diffusion language model featuring a 100BA6B Mixture-of-Experts (MoE) architecture. As an enhanced, instruction-tuned iteration of the LLaDA2.0 series, it is optimized for practical applications.
 
 <div align="center">
 <img src="https://mdn.alipayobjects.com/huamei_qa8qxu/afts/img/A*kLORSaRfSK8AAAAAgIAAAAgAemJ7AQ/original" width="800" />
@@ -47,7 +47,7 @@ DA2.0-flash-preview
 + **Leading MoE Architecture**:
 The open-source **Mixture-of-Experts (MoE) diffusion large language model**, pre-trained from scratch on approximately **20 trillion tokens**.
 + **Efficient Inference**:
-With **100 billion total parameters**, only **6.1 billion** are activated during inference. LLaDA-flash-preview significantly reduces computational costs while outperforming open-source dense models of similar scale.
+With **100 billion total parameters**, only **6.1 billion** are activated during inference. LLaDA2.0-flash-preview significantly reduces computational costs while outperforming open-source dense models of similar scale.
 + **Impressive Performance on Code & Complex Reasoning**:
 Excels in tasks such as **code generation** and **advanced mathematical reasoning**, demonstrating strong reasoning capabilities.
 + **Tool Use**:
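The "Efficient Inference" bullet in the diff hinges on sparse MoE routing: every token scores all experts, but only a top-k subset runs, so only a fraction of the total parameters is active per token. The toy sketch below is an assumption-laden illustration of that routing idea, not the LLaDA2-flash-preview implementation; the expert/router weights, sizes, and `moe_forward` helper are all hypothetical.

```python
import math
import random

random.seed(0)

# Toy sketch (NOT the actual LLaDA2-flash-preview code): a Mixture-of-Experts
# layer routes each token to only the top-k of its experts, so most parameters
# stay idle per token. Here 16 tiny experts with top-2 routing stand in for the
# real model's much larger expert pool.
N_EXPERTS, TOP_K, DIM = 16, 2, 4

# Hypothetical per-expert weight matrices (DIM x DIM) and router weights (DIM x N_EXPERTS).
experts = [[[random.gauss(0, 1) for _ in range(DIM)] for _ in range(DIM)]
           for _ in range(N_EXPERTS)]
router = [[random.gauss(0, 1) for _ in range(N_EXPERTS)] for _ in range(DIM)]

def matvec(m, v):
    # m is a list of rows; returns v @ m (length = number of columns of m).
    return [sum(m[j][i] * v[j] for j in range(len(v))) for i in range(len(m[0]))]

def moe_forward(x):
    # The router scores every expert, but only the top-k experts execute.
    scores = matvec(router, x)
    chosen = sorted(range(N_EXPERTS), key=lambda i: scores[i])[-TOP_K:]
    z = [math.exp(scores[i]) for i in chosen]
    gates = [zi / sum(z) for zi in z]  # softmax over the chosen experts only
    out = [0.0] * DIM
    for g, i in zip(gates, chosen):
        for d, val in enumerate(matvec(experts[i], x)):
            out[d] += g * val          # gated sum of the active experts' outputs
    return out, chosen

y, used = moe_forward([random.gauss(0, 1) for _ in range(DIM)])
print(f"active experts: {len(used)}/{N_EXPERTS} ({len(used)/N_EXPERTS:.1%} of expert params)")
```

Under this routing scheme the per-token compute scales with `TOP_K`, not `N_EXPERTS`, which is the mechanism behind activating roughly 6.1B of 100B parameters.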