Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning Paper • 2601.15160 • Published Jan 21