Vision-Language-Action (VLA) models such as OpenVLA, RT-2, and Pi0 achieve state-of-the-art robot manipulation, but at 7B+ parameters they run at only 0.5 fps. Real-time robot control needs 10+ fps. FORGE closes this gap with automated knowledge distillation, which trains a smaller, faster student model to reproduce the behavior of a large VLA teacher.