---
title: README
emoji: 🚀
colorFrom: blue
colorTo: purple
sdk: static
pinned: true
license: apache-2.0
---

🦅 **FALCON**: an effective vision-language-action (VLA) model injects rich 3D spatial tokens into the action head, enabling robust spatial understanding and SOTA performance across diverse manipulation tasks without disrupting vision-language alignment. Accepted at **ICLR 2026**.