DMindAI
/

DMind-2-4B

Model card Files Files and versions

yuzhe commited on Aug 20, 2025

Commit

d842129

·

verified ·

1 Parent(s): cd609ef

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -58,6 +58,9 @@ Where:
 - $\alpha_i = \exp(-\delta \cdot i/T)$ implements exponential decay for later reasoning steps
 - $\mathcal{L}_{\text{QS}}$ is the quality scoring loss ensuring reasoning coherence
 #### Dynamic Weight Adjustment Mechanism
 The complexity-aware weight adjustment is formulated as:

 - $\alpha_i = \exp(-\delta \cdot i/T)$ implements exponential decay for later reasoning steps
 - $\mathcal{L}_{\text{QS}}$ is the quality scoring loss ensuring reasoning coherence
+$a_o$
 #### Dynamic Weight Adjustment Mechanism
 The complexity-aware weight adjustment is formulated as: