Update README.md
Browse files
README.md
CHANGED
|
@@ -41,7 +41,7 @@ licence: license
|
|
| 41 |
frameworks encourages the policy model to improve the structural quality of reasoning. Consequently, this leads to
|
| 42 |
consistent performance improvements over existing sparse reward frameworks.
|
| 43 |
|
| 44 |
-
# Illustration of
|
| 45 |
|
| 46 |
<div align="center">
|
| 47 |
<img src="https://arxiv.org/html/2510.25065v1/x1.png" width="600"/>
|
|
|
|
| 41 |
frameworks encourages the policy model to improve the structural quality of reasoning. Consequently, this leads to
|
| 42 |
consistent performance improvements over existing sparse reward frameworks.
|
| 43 |
|
| 44 |
+
# Illustration of TACReward
|
| 45 |
|
| 46 |
<div align="center">
|
| 47 |
<img src="https://arxiv.org/html/2510.25065v1/x1.png" width="600"/>
|