
SmolVLA-OMY Model Checkpoints

This repository contains training checkpoints for a SmolVLA (Small Vision-Language-Action) model trained on the ArrangeVegetables task.

Model Details

  • Model Type: SmolVLA (Vision-Language-Action model)
  • Task: ArrangeVegetables manipulation task
  • Training Steps: 20,000
  • Batch Size: 350
  • Chunk Size: 5 action steps
  • Input Features:
    • Visual observations: 256x256 RGB images from the main and wrist cameras
    • State observations: 6-dimensional state vector
  • Output Features: 12-dimensional action space
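The interface above can be sketched as follows. This is an illustrative shape summary only; the dictionary keys and function name are hypothetical, not the actual SmolVLA API.

```python
# Hypothetical sketch of the observation/action interface described above.
# Key names are illustrative placeholders, not the model's real feature names.
OBS_SHAPES = {
    "observation.images.main": (3, 256, 256),   # RGB main camera
    "observation.images.wrist": (3, 256, 256),  # RGB wrist camera
    "observation.state": (6,),                  # 6-dimensional state vector
}
ACTION_DIM = 12   # 12-dimensional action space
CHUNK_SIZE = 5    # action steps predicted per inference call

def action_chunk_shape():
    """Shape of one predicted action chunk: (chunk_size, action_dim)."""
    return (CHUNK_SIZE, ACTION_DIM)
```

With a chunk size of 5, each forward pass yields a (5, 12) block of actions that the controller can execute before querying the model again.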

Checkpoint Structure

The repository contains checkpoints saved at different training steps:

  • 000500/: Checkpoint at 500 steps
  • 001000/: Checkpoint at 1,000 steps
  • 001500/: Checkpoint at 1,500 steps
  • 002000/: Checkpoint at 2,000 steps

Each checkpoint contains:

  • pretrained_model/: Model weights and configuration
  • training_state/: Optimizer state, scheduler state, and training metadata
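Because the checkpoint directories are zero-padded step counts, the most recent one can be picked by numeric sort. A minimal sketch (the helper name is our own, not part of any framework):

```python
from pathlib import Path

def latest_checkpoint(root="."):
    """Return the highest-numbered checkpoint directory (e.g. '002000').

    Relies on the convention above: each checkpoint is a directory whose
    name is the zero-padded training step at which it was saved.
    """
    dirs = [d for d in Path(root).iterdir() if d.is_dir() and d.name.isdigit()]
    if not dirs:
        raise FileNotFoundError(f"no checkpoint directories found under {root}")
    return max(dirs, key=lambda d: int(d.name))
```

The model weights would then live under `latest_checkpoint()/pretrained_model/`.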

Training Configuration

  • Device: CUDA
  • Seed: 42
  • Workers: 24
  • Evaluation Frequency: Every 5 steps
  • Logging Frequency: Every step
  • Image Resize: 512x512 with padding
  • Normalization: Identity for visual, mean-std for state/action
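The normalization scheme above means images pass through unchanged while state and action vectors are standardized with per-dimension statistics. A minimal sketch of the mean-std step (plain Python, not the framework's implementation; the `eps` guard is our own assumption):

```python
def mean_std_normalize(x, mean, std, eps=1e-8):
    """Per-dimension mean-std normalization, as applied to state/action features.

    Visual observations use identity normalization, i.e. they are passed
    through unchanged. `eps` guards against zero standard deviations.
    """
    return [(xi - m) / (s + eps) for xi, m, s in zip(x, mean, std)]
```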

Usage

To load a checkpoint:

# Placeholder: substitute the loader from the framework used for training
# (e.g. the policy-loading API of your LeRobot-style training stack).
from your_training_framework import load_checkpoint

# Load the latest checkpoint (2,000 steps)
model = load_checkpoint("./002000/pretrained_model/")

Dataset

Trained on the ArrangeVegetables dataset available at: lava8888/ArrangeVegetables
