Update README.md
Browse files
README.md
CHANGED
|
@@ -58,7 +58,9 @@ Where:
|
|
| 58 |
- $\alpha_i = \exp(-\delta \cdot i/T)$ implements exponential decay for later reasoning steps
|
| 59 |
- $\mathcal{L}_{\text{QS}}$ is the quality scoring loss ensuring reasoning coherence
|
| 60 |
|
| 61 |
-
|
|
|
|
|
|
|
| 62 |
|
| 63 |
|
| 64 |
#### Dynamic Weight Adjustment Mechanism
|
|
|
|
| 58 |
- $\alpha_i = \exp(-\delta \cdot i/T)$ implements exponential decay for later reasoning steps
|
| 59 |
- $\mathcal{L}_{\text{QS}}$ is the quality scoring loss ensuring reasoning coherence
|
| 60 |
|
| 61 |
+
$$
|
| 62 |
+
a_o
|
| 63 |
+
$$
|
| 64 |
|
| 65 |
|
| 66 |
#### Dynamic Weight Adjustment Mechanism
|