HuggingFaceVLA/libero
Viewer • Updated • 273k • 21.6k • 58
This project fine-tunes lerobot/smolvla_base using the HuggingFaceVLA/libero dataset to improve vision-language-action understanding for multi-task robotic manipulation.
LIBERO provides multi-task robotic manipulation demonstrations with visual observations, language instructions, and action sequences.
The model was fine-tuned on the LIBERO dataset following the original smolVLA instructions, mapping visual observations and language instructions to robot actions.
Base model
lerobot/smolvla_base