FLOWER VLA
Collection
Collection of checkpoints for the FLOWER VLA policy. A small and versatile VLA for language-conditioned robot manipulation with less than 1B parameter • 10 items • Updated
• 4
This is a pretrained FlowerVLA model for robotic manipulation trained on the CALVIN ABCD dataset. Flower is an efficient Vision-Language-Action Flow policy for robot learning that only contains 1B parameters.
FlowerVLA is a novel architecture that:
This checkpoint contains weights for the CALVIN ABCD challenge and currently ranks 1 with the following results:
| Train→Test | Method | 1 | 2 | 3 | 4 | 5 | Avg. Len. |
|---|---|---|---|---|---|---|---|
| {dataset_name} | FlowerVLA | 99.1% | 97.8% | 95.2% | 92.4% | 87.8% | 4.72 |
(B, T, 3, H, W) tensor(B, T, 3, H, W) tensor(B, T, 7) tensor representing delta EEF actionsCheck out our full model implementation on Github todo and follow the instructions in the readme to test the model on one of the environments.
obs = {
"rgb_obs": {
"rgb_static": static_image,
"rgb_gripper": gripper_image
}
}
goal = {"lang_text": "pick up the blue cube"}
action = model.step(obs, goal)
@inproceedings{ reuss2025flower, # Add citation when available }
This model is released under the MIT license.
Unable to build the model tree, the base model loops to the model itself. Learn more.