Submitted by Tianze Yang
TRON: Targeted Rule-Verifiable Online Environments for Visual Reasoning RL
University of Georgia