--- title: README emoji: 🚀 colorFrom: blue colorTo: purple sdk: static pinned: true license: apache-2.0 --- 🦅 **FALCON**: an effective vision-language-action (VLA) model injects rich 3D spatial tokens into the action head, enabling robust spatial understanding and SOTA performance across diverse manipulation tasks without disrupting vision-language alignment. Accepted at **ICLR 2026**.