---
title: README
emoji: 🔥
colorFrom: blue
colorTo: red
sdk: static
pinned: false
---

TL;DR: Vision tool-use RL enhances model performance by reducing tool-induced harm, but does not significantly improve tool-based correction of intrinsic failures.