yuzhe commited on
Commit
7b28e2e
·
verified ·
1 Parent(s): d842129

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -58,7 +58,9 @@ Where:
58
  - $\alpha_i = \exp(-\delta \cdot i/T)$ implements exponential decay for later reasoning steps
59
  - $\mathcal{L}_{\text{QS}}$ is the quality scoring loss ensuring reasoning coherence
60
 
61
- $a_o$
 
 
62
 
63
 
64
  #### Dynamic Weight Adjustment Mechanism
 
58
  - $\alpha_i = \exp(-\delta \cdot i/T)$ implements exponential decay for later reasoning steps
59
  - $\mathcal{L}_{\text{QS}}$ is the quality scoring loss ensuring reasoning coherence
60
 
61
+ $$
62
+ a_o
63
+ $$
64
 
65
 
66
  #### Dynamic Weight Adjustment Mechanism