Spatial-Aware VLA Pretraining through Visual-Physical Alignment from Human Videos Paper โข 2512.13080 โข Published 11 days ago โข 15